Software Development Engineer, AI/ML, AWS Neuron, Model Inference
Software engineering internship job in Cupertino, CA
The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. This comprehensive toolkit includes an ML compiler, runtime, and application framework that seamlessly integrates with popular ML frameworks like PyTorch and JAX, enabling unparalleled ML inference and training performance.
The Inference Enablement and Acceleration team is at the forefront of running a wide range of models, supporting novel architectures, and maximizing their performance on AWS's custom ML accelerators. Working across the stack from PyTorch down to the hardware-software boundary, our engineers build systematic infrastructure, invent new methods, and create high-performance kernels for ML functions, ensuring every compute unit is fine-tuned for optimal performance on our customers' demanding workloads. We combine deep hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration.
As part of the broader Neuron organization, our team works across multiple technology layers, from frameworks and kernels to runtime and collectives, and collaborates closely with the compiler team. We not only optimize current performance but also contribute to future architecture designs, working closely with customers to enable their models and ensure optimal performance. This role offers a unique opportunity to work at the intersection of machine learning, high-performance computing, and distributed architectures, where you'll help shape the future of AI acceleration technology.
You will architect and implement business-critical features, and mentor a brilliant team of experienced engineers. We operate in spaces that are very large, yet our teams remain small and agile. There is no blueprint. We're inventing. We're experimenting. It is a unique learning culture. The team works closely with customers on their model enablement, providing direct support and optimization expertise to ensure their machine learning workloads achieve optimal performance on AWS ML accelerators. The team collaborates with open source ecosystems to provide seamless integration and bring peak performance at scale for customers and developers.
This role is responsible for development, enablement, and performance tuning of a wide variety of LLM families, including massive-scale large language models like the Llama family, DeepSeek, and beyond. The Inference Enablement and Acceleration team works side by side with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions with Trainium and Inferentia. Experience optimizing inference performance for both latency and throughput on such large models across the stack, from system-level optimizations through to PyTorch or JAX, is a must-have.
Key job responsibilities
This role will help lead the effort to build distributed inference support for PyTorch in the Neuron SDK. You will tune these models to ensure the highest performance and maximize their efficiency when running on customers' AWS Trainium and Inferentia silicon and servers. Strong software development skills in Python, system-level programming, and ML knowledge are all critical to this role. Our engineers collaborate across compiler, runtime, framework, and hardware teams to optimize machine learning workloads for our global customer base. Working at the intersection of software, hardware, and machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will:
* Design, develop, and optimize machine learning models and frameworks for deployment on custom ML hardware accelerators.
* Participate in all stages of the ML system development lifecycle including distributed computing based architecture design, implementation, performance profiling, hardware-specific optimizations, testing and production deployment.
* Build infrastructure to systematically analyze and onboard multiple models with diverse architectures.
* Design and implement high-performance kernels and features for ML operations, leveraging the Neuron architecture and programming models.
* Analyze and optimize system-level performance across multiple generations of Neuron hardware.
* Conduct detailed performance analysis using profiling tools to identify and resolve bottlenecks.
* Implement optimizations such as fusion, sharding, tiling, and scheduling.
* Conduct comprehensive testing, including unit and end-to-end model testing, with continuous deployment and releases through pipelines.
* Work directly with customers to enable and optimize their ML models on AWS accelerators.
* Collaborate across teams to develop innovative optimization techniques.
A day in the life
You will collaborate with a cross-functional team of applied scientists, system engineers, and product managers to deliver state-of-the-art inference capabilities for Generative AI applications. Your work will involve debugging performance issues, optimizing memory usage, and shaping the future of Neuron's inference stack across Amazon and the Open Source Community. As you design and code solutions to help our team drive efficiencies in software architecture, you'll create metrics, implement automation and other improvements, and resolve the root cause of software defects.
You will also build high-impact solutions for our large customer base, participate in design discussions and code reviews, and communicate with internal and external stakeholders. You will work cross-functionally to help drive business decisions with your technical input. You will work in a startup-like development environment, where you're always working on the most important initiative.
About the team
The Inference Enablement and Acceleration team fosters a builder's culture where experimentation is encouraged and impact is measurable. We emphasize collaboration, technical ownership, and continuous learning. Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we're building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help you develop your engineering expertise so you feel empowered to take on more complex tasks in the future. Join us to solve some of the most interesting and impactful infrastructure challenges in AI/ML today.
BASIC QUALIFICATIONS
- Bachelor's degree in computer science or equivalent
- 5+ years of non-internship professional software development experience
- 5+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Knowledge of the fundamentals of machine learning and LLMs, including their architectures and their training and inference lifecycles, along with work experience optimizing model execution.
- Software development experience in C++, Python (experience in at least one language is required).
- Strong understanding of system performance, memory management, and parallel computing principles.
- Proficiency in debugging, profiling, and implementing best software engineering practices in large-scale systems.
PREFERRED QUALIFICATIONS
- Familiarity with PyTorch, JIT compilation, and AOT tracing.
- Familiarity with CUDA kernels or equivalent ML or low-level kernels.
- Experience developing performant kernels with libraries such as CUTLASS or FlashInfer.
- Familiarity with syntax and tile-level semantics similar to Triton.
- Experience with online/offline inference serving with vLLM, SGLang, TensorRT, or similar platforms in production environments.
- Deep understanding of computer architecture and operating-system-level software, and working knowledge of parallel computing.
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company's reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit ********************************************************* for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit ******************************************************** This position will remain posted until filled. Applicants should apply via our internal or external career site.
Staff Software Engineer - High-Growth AI/FinTech
Software engineering internship job in Fremont, CA
Staff Software Engineer (IC) - High-Growth AI/FinTech Startup
Full-time · Hybrid (San Francisco)
$220k-$300k + equity
A well-funded, rapidly scaling startup in the AI-driven fintech space is looking for an experienced Staff Engineer to take ownership of reshaping the foundations of their core platform. After two years of fast iteration and customer growth, the product has evolved into a set of independently built services. They now need a senior IC who can bring coherence, scalability, and long-term architectural stability as the engineering team expands.
This is a high-impact individual contributor role working directly with the CTO. You'll set technical direction, oversee major system redesigns, and help prepare the platform to support significantly larger usage, customer demands, and a future 20-40+ engineer organisation.
What You'll Be Doing
Lead architectural transformation
Redesign major components into a unified, maintainable, scalable system.
Remove legacy code, reduce fragmentation, and introduce sound architectural patterns.
Define technical standards and guide the broader engineering team towards consistent, high-quality design.
Drive high-leverage engineering work
Partner closely with the CTO on long-term technical strategy.
Lead development of workflow systems for real-time identity, income, and document verification.
Strengthen the infrastructure that powers the company's automated decisioning engine (currently >70% auto-approval/denial rate).
Support integrations with internal ML models that perform fraud detection and financial document understanding.
Influence and elevate the engineering culture
Collaborate with senior and junior engineers across backend, full-stack and infra.
Improve developer velocity and support onboarding of larger enterprise customers.
Help the company scale from an early-stage engineering organisation to a mature, high-performance team.
What They're Looking For
7-8+ years' experience as a strong backend or full-stack IC.
Proven ability to re-architect complex systems and scale codebases beyond the “early startup” phase.
Experience in a fast-growing startup (Seed → A → B or similar) where the engineering org expanded meaningfully.
Depth in modern backend or full-stack development (ideal: TypeScript, React, Node.js, Python).
Someone who thrives in ambiguity, makes pragmatic technical decisions, and moves quickly.
A high engineering bar and the ability to raise the standards of those around you.
Tech Environment
Frontend: TypeScript, React
Backend: Node.js, Python
Data: Postgres, BigQuery, Redis
Cloud: GCP
Hybrid working model; candidates must be based in or willing to relocate to the San Francisco Bay Area. (Hybrid flexibility available for senior candidates.)
Why This Role Is Exciting
Join a business with strong revenue, real customers, and top-tier backers.
Have ownership of mission-critical architecture, not just feature work.
Work alongside a highly capable CTO and shape the company's technical trajectory for years to come.
Build systems that support real-world decisions for millions of end-users.
Competitive salary, meaningful equity, and the chance to make a long-term technical mark.
Head of Computer Use / AI Engineer
Software engineering internship job in San Jose, CA
Edward Mann are hiring for an excellent Technology Start-up based in San Francisco.
About the Role
We are seeking a Head of Computer Use / Senior AI Engineer (a hands-on role) to drive the evolution of the next generation of browser agents (testing browser agents).
You will lead the design, training, and advancement of next-generation AI agent systems. This role involves fine-tuning large language models (LLMs) and developing intelligent agents capable of navigating and interacting with real web environments. It's a high-impact position combining hands-on engineering, experimentation, and strategic input. You'll collaborate closely with technical leadership, contribute directly to product direction, and mentor other engineers in developing advanced agent capabilities.
Key Responsibilities
Develop, train, and deploy LLM-powered agents that interact with websites through real browser interfaces.
Fine-tune foundation models using advanced methods (e.g., LoRA, PEFT, DPO, RLHF) and select the best approach for each use case.
Design reinforcement learning systems that improve agent reasoning, adaptability, and task performance.
Own the full agent pipeline, from model architecture and policy design to simulation frameworks and testing environments.
Rapidly prototype, run experiments, and refine solutions to push the limits of agent performance.
Partner with technical leadership to shape product direction and research priorities.
Mentor and support other engineers, helping to grow a strong, mission-driven technical team.
Build and coordinate multi-agent workflows with structured roles, memory systems, and effective trajectories.
What You'll Bring
Strong background in machine learning with a PhD or equivalent industry experience in AI/ML/Computer Science.
Hands-on experience fine-tuning LLMs.
Deep applied knowledge of reinforcement learning techniques.
Experience building agents for real-world applications (bonus: browser-based or robotics experience).
Strong coding and experimentation skills, with a preference for practical problem-solving over theory alone.
A sense of ownership and drive to build impactful systems beyond titles or hierarchy.
Experience mentoring, managing, or growing technical teams.
Preferred Qualifications
Record of impactful research publications or open-source contributions.
Experience in high-growth, fast-paced start-up environments.
Staff Software Engineer
Software engineering internship job in San Francisco, CA
🚀 Staff Software Engineer (TypeScript) - Member of Technical Staff
📍 SF | 💰 $200K-$300K + Equity | 🩺 Full Benefits | 🌴 Unlimited PTO | 🧭 Flexible Work
We're partnered with a next-generation AI automation company building intelligent agents that redefine how complex workflows are executed. With $10M+ in funding and an elite engineering team (MIT alumni, ex-Meta), they're entering a stage of hypergrowth!
This is a hands-on Staff/MTS role for someone who wants to set technical direction, own platform architecture, and ship core systems that power both developer and no-code automation experiences.
🧩 What You'll Own
Own major architecture & platform decisions
End-to-end TypeScript (Node / Nest / Express / React / Next)
APIs, SDKs & no-code automation tools
Event-Driven Architecture & IaC
Streaming Systems, Workflow Orchestration & Durable Systems
Advanced RAG pipelines & data orchestration layers
End-to-end feature ownership across backend, frontend & workflows
Testing excellence & CI/CD with Cypress, Playwright & modern pipelines
Technical leadership, mentorship & engineering standards
🎯 Who Thrives Here
Staff-level engineer with a history of leading complex, cross-system initiatives
Deep TypeScript mastery across platform, product & tooling
Strong experience with Node, Express/Nest, React, Next, Prisma & modern ORMs
Proven success shipping complex APIs, SDKs, dev platforms or distributed systems
Reliability-first mindset with strong automated testing
Bonus: workflow automation, durable objects, RAG, data pipelines, web scraping
Product-driven, pragmatic, and excited to build tools people actually use
💥 Why This Role Is Special
You'll shape the core of an AI agent platform from the inside
Real architectural authority & ownership
Direct access to founders & technical leadership
Work with elite engineers tackling genuinely hard engineering problems
High compensation, meaningful equity, and long-term impact
This is a rare chance to build foundational infrastructure for the future of AI agents alongside a deeply technical, fast-moving team. You'll have real ownership, real influence and the opportunity to shape a platform that is defining how automation is built at scale.
If you're excited by hard problems, massive technical leverage, and building systems that matter - we'd love to hear from you.
Staff Software Engineer
Software engineering internship job in San Jose, CA
The Role
Our client is seeking a Staff Software Engineer to join a small, senior team as a highly skilled individual contributor. In this hybrid role, you'll work across the stack to build new user-facing features and develop integrations with CAD and third-party applications. You'll partner closely with product managers, AI researchers, and other engineers to turn new ideas into production-ready systems at scale.
What You'll Do
Design and build scalable, reliable full-stack systems using React, Node.js, and Python.
Deploy ML models to production: you've done it before, and you'll do it again, building robust products that users love.
Collaborate closely with ML and data teams to integrate models and pipelines into real-world products.
Architect backend systems around AWS services, databases, and modern data infrastructure.
Own performance and scale: build APIs, indexes, and search systems that make high-dimensional data feel instant.
Contribute to product direction: work with design, AI, and leadership to turn technical capabilities into delightful user experiences.
(Optional but exciting): advance 3D visualization, geometry, or rendering engines that make engineering feel magical.
What We're Looking For
You're a strong generalist who can build, ship, and scale complex full-stack systems.
You're fluent in React, Node.js, and Python, and comfortable designing APIs, services, and data flows end-to-end.
You've shipped large production systems, ideally ones that touch ML, data, or search.
You have experience with AWS databases, and you enjoy thinking about indexing, search, and vector data systems.
You're pragmatic, product-minded, and enjoy owning features from concept to deployment.
You collaborate naturally with AI, design, and data teams, and love turning complexity into clarity.
Bonus points if:
You've worked with large-scale data processing pipelines.
You have an interest in math, geometry, topology, rendering, or computational geometry.
You've built software in 3D printing, CAD, or computer graphics domains.
This is a rare opportunity to create the interfaces, infrastructure, and experiences that bring a new kind of intelligence to the physical world, and help define how AI becomes a tool for the imagination.
You love building systems that are elegant, fast, and deeply technical, and want to see them shape the real world.
Let's build the tools the future will be made in.
Compensation
The base salary range for this role is $175,000 - $240,000, plus equity. Flexible PTO and competitive compensation. Final offers will be based on experience, interview performance, and alignment with role requirements.
Staff Software Engineer
Software engineering internship job in San Francisco, CA
About the job
Staff Software Engineer - SF Bay (4 days onsite) | Up to $195K - $255K + Equity
TogetherWeTech is hiring a Staff Software Engineer to take end-to-end ownership of critical product areas, from architecture through launch, and set the technical direction for how we build at scale. You will drive design reviews, ship prototypes that push our product forward, and establish pragmatic engineering standards that balance speed and quality. Beyond your own contributions, you will mentor teammates into future tech leads and help grow a world-class engineering culture.
Their mission is to reshape design by streamlining the path from concept to creation, bringing more impactful ideas into the physical world. The company is well-funded (Series B, $51m to date), growing double digits MoM, and expanding the core engineering team in SF. The surface area is big: realtime collaboration, GPU inference at scale, a modern TypeScript stack, and serving real enterprise
What you'll do:
Set the architecture roadmap to keep our product reliable, scalable, and a joy to develop in as usage, models, and data grow.
Ship hands-on: jump into code for critical paths, rapid prototypes, and hairy debugging (latency, memory, race conditions, WebGL/WASM quirks).
Install quality guardrails that scale: test strategy, code-review norms, perf budgets, observability baselines.
Mentor and multiply: develop senior engineers into tech leads, model clear written design, and practice crisp decision-making.
Partner cross-functionally: represent engineering trade-offs with Product, Design, and Go-To-Market; act as the CTO's delegate when needed.
Your toolbox:
Frontend: TypeScript, React, Vite, WebGL; realtime collaboration.
API/Backend: TypeScript/Node, GraphQL (PostGraphile), Postgres, Redis, background workers.
Infra: Kubernetes, Pulumi, CI/CD with GitHub Actions, Datadog for observability, feature flags.
Security/Enterprise: SSO/SAML (WorkOS), SOC 2-minded practices.
To be considered:
2+ years in a Staff Software Engineer role.
If you are a passionate and experienced Staff Software Engineer looking to make a difference in a fast-paced and innovative environment, we would love to hear from you!
Better Together🟢
Principal Software Engineer
Software engineering internship job in San Francisco, CA
Hi,
Greetings from Solvecube HCM
I hope you are enjoying a great day! I am from Solvecube HCM, an AI-based global consulting firm headquartered in Singapore.
Our client is a Healthcare AI Startup based in San Francisco, USA, and is looking for a great tech leader as their Principal Engineer. It is a permanent role with the client.
Your experience at Ambience Healthcare must have equipped you with unique insights and skills that could be a great fit for this role. This opportunity offers a chance to work with a talented team and a fixed salary plus stock options.
Key Responsibilities
Set technical vision and lead architecture for AI-first platform services.
Build advanced systems in LLMOps, reinforcement learning, and AI pipelines.
Evaluate and integrate cutting-edge frameworks (LangChain, Hugging Face, RAG).
Collaborate with leadership on long-term technology strategy.
Mentor senior engineers across India and US teams.
Qualifications
10+ years of experience in software/AI engineering, including leadership roles.
Deep expertise in PyTorch, TensorFlow, LangChain, Hugging Face.
Proven ability to innovate and deliver in startup/scale-up environments.
Strong communication and collaboration skills.
If this sounds intriguing, I'd love to chat more about it. Feel free to reply to this email or let me know if you'd prefer a quick call. Looking forward to hearing from you soon!
If you are not exploring opportunities at the moment, please let me know if you have a strong referral for this role.
Please note: The incumbent should be a local citizen or a Green card holder to be eligible.
Best regards,
Lijy Ronnie
Mail me : ******************
Lead Software Engineer
Software engineering internship job in Fremont, CA
A top AI-native Command Center startup is looking for a lead software developer to join their growing technology team. The product centralizes companies' internal and external data and matches it with external insights to help them make better decisions and predict the future. With early traction in sports and entertainment, customers that include some of the biggest names such as the PGA and the Warriors, and a fresh venture round, they are scaling quickly.
As Engineering Lead at Cred, you'll shape how we build. You'll manage and mentor a growing team, drive best practices across delivery and QA, and help us scale our infrastructure, pipelines, and platform with AI and automation at the core.
Key Responsibilities:
Lead and scale the engineering function with a strong delivery mindset - shipping high-quality features weekly
Own and evolve internal processes around CI/CD, QA automation, observability, and DevOps
Design and build internal tools to automate development, QA, and data workflows, powered by AI
What We're Looking For
Must-Haves:
6-10+ years of hands-on engineering experience, with at least 2+ years in engineering leadership roles
Proven ability to lead agile/scrum teams, set goals, track velocity, and manage delivery in sprint cycles
Strong background in DevOps and QA best practices - CI/CD pipelines, automated testing, infrastructure as code
Experience building and scaling data-rich products - including APIs, integrations, scraping tools, or cloud platforms
AI-native mindset - you actively look for ways to integrate LLMs and automation into engineering processes
Senior Software Engineer
Software engineering internship job in Santa Clara, CA
Founding Engineer
On-Site
San Francisco, CA
$170,000 - $200,000
About:
We are seeking versatile Sr Software Engineers who specialize across disciplines - Machine Learning, Data Engineering, and Full-Stack Development. The ideal candidate is willing to get their hands dirty, pushes boundaries, and is driven by a need to succeed.
You will be ready to work diligently and build rapidly to win the market. You should be prepared to challenge existing concepts and develop alternative solutions.
Job Summary:
You'll operate at the cutting edge of LLMs, computer vision, and data engineering to automate compliance in precision-focused industries. You'll also collaborate with major global industrial partners. Your work will help build a product that leading organizations will depend on to prevent accidents, protect lives, and transform the way they run their operations.
Who You Are:
Able to make decisions quickly.
Proactive.
Comfortable with TypeScript, Python, Docker, LLMs, YOLO, Tesseract, PostgreSQL, AWS, and React Native.
Have a history of building products that have been used.
Thrive under pressure and within an unstructured environment.
What You'll Do:
Speak with users and gather their needs, experiences, and problems.
Architect systems that will be used daily by others at billion-dollar companies.
Build agent-swarm data pipelines that will autonomously audit.
Maintain and scale infrastructure.
Produce quickly without the fear of perfection.
Work directly with the founding team and customers.
Software Engineer - Intelligent Systems
Software engineering internship job in Berkeley, CA
Compensation: Up to $135K base salary
My client is a Series C renewable-energy automation unicorn, founded in 2019 and backed by more than $200M in funding. They are building intelligent systems that transform how large-scale renewable energy projects are designed and delivered. They're hiring a Software Engineer - Intelligent Systems to develop AI-powered tools using Azure OpenAI, AWS Bedrock, and AgentCore to automate complex engineering workflows. This role is ideal for a recent M.S. or Ph.D. graduate passionate about AI, automation, and multi-cloud technologies.
What You'll Do
Build AI-driven automation workflows and reasoning chains
Develop LLM-based agents with Azure OpenAI and AWS Bedrock
Work on retrieval systems and Document AI integrations
Deploy and optimize agents across Azure, AWS, edge, and on-prem environments
Translate engineering workflows into intelligent systems
Test, validate, and document system behavior
What We're Looking For
Bachelor's or Master's in CS, AI, Computational Linguistics, or related field (M.S./Ph.D. preferred)
Experience with AI/ML, NLP, or intelligent systems
Strong Python programming skills
Familiarity with frameworks like LangChain or LangGraph
Exposure to Azure OpenAI, AWS Bedrock, and AgentCore
Understanding of REST APIs, asynchronous programming, and data integration
Senior Cloud Software Engineer
Software engineering internship job in Santa Clara, CA
Hi,
I want to connect regarding an urgent position. Please review the description below and let me know if you are interested.
Job Title: Senior Cloud Software Engineer (Threat Prevention & AppID)
Duration: 7+ Months
Xoriant reasonably expects the pay rate for this position to be within the following range: $50/hr-$52/hr.
Job Description:
Duties:
Your Career
We're seeking innovators - engineers who want to design state-of-the-art products that do not exist today. These engineers love to code, with a drive to build global products and bring new ideas to security disciplines to solve real-world problems. We are looking for talented engineers who take ownership of their areas of focus and who are driven to pursue problems at every level. Collaboration is at the heart of our culture, and we need engineers who can communicate at a high level and work well with multi-functional teams towards achieving a common goal.
Your Impact:
Participate in the design and implementation of threat prevention & AppID cloud services for public cloud and private cloud features
Participate in all phases of the product development cycle, from definition and design through implementation and test
Provide real-time security services to customers
Work with PLM on new feature requirements
Work with QA and DevOps on new release deployment
Work with support to handle customer issues
Work with security researchers and data scientists on new feature requests
Additional Information:
The Team
We are the Threat Prevention & AppID Infrastructure team. Our engineering team is at the core of our products and delivers the best security services on the cloud to prevent cyberattacks. We are constantly innovating and challenging the way we, and the industry, think about cybersecurity. Our engineers don't shy away from building products to solve problems no one has pursued before.
We define the industry, instead of waiting for directions. We need individuals who feel comfortable in ambiguity, excited by the prospect of a challenge, and empowered by the unknown risks facing our everyday lives that are only enabled by a secure digital environment.
Skills: Qualifications
Your Experience:
BS/MS in Computer Science or Computer Engineering
Solid programming skills in Golang, Python, or Java
Solid knowledge of and skills with Linux
Solid skills with Kubernetes and Docker
Rich Google Cloud Platform experience is a plus
Solid knowledge of web servers/proxies such as NGINX and Envoy
3 years of working experience on data infrastructure platforms
Strong microservice development experience
Rich experience with SQL and NoSQL database technologies such as MySQL and Redis
Hands-on experience with queuing systems such as RabbitMQ and Kafka; experience with Pub/Sub is a plus
Solid skills in multi-threaded and multi-process programming; experience with distributed systems is preferred
DevOps experience is a plus
Teamwork, problem-solving, and a can-do attitude
Education:
Bachelor's Degree in Computer Science or related field (or equivalent)
Regards,
Akangsha Mohite
Team Lead
W: **************
E: ***************************
Xoriant is an equal opportunity employer. No person shall be excluded from consideration for employment because of race, ethnicity, religion, caste, gender, gender identity, sexual orientation, marital status, national origin, age, disability or veteran status.
Staff Software Engineer - High-Growth AI/FinTech
Software engineering internship job in San Francisco, CA
Staff Software Engineer (IC) - High-Growth AI/FinTech Startup
Full-time · Hybrid (San Francisco)
$220k-$300k + equity
A well-funded, rapidly scaling startup in the AI-driven fintech space is looking for an experienced Staff Engineer to take ownership of reshaping the foundations of their core platform. After two years of fast iteration and customer growth, the product has evolved into a set of independently built services. They now need a senior IC who can bring coherence, scalability, and long-term architectural stability as the engineering team expands.
This is a high-impact individual contributor role working directly with the CTO. You'll set technical direction, oversee major system redesigns, and help prepare the platform to support significantly larger usage, customer demands, and a future 20-40+ engineer organisation.
What You'll Be Doing
Lead architectural transformation
Redesign major components into a unified, maintainable, scalable system.
Remove legacy code, reduce fragmentation, and introduce sound architectural patterns.
Define technical standards and guide the broader engineering team towards consistent, high-quality design.
Drive high-leverage engineering work
Partner closely with the CTO on long-term technical strategy.
Lead development of workflow systems for real-time identity, income, and document verification.
Strengthen the infrastructure that powers the company's automated decisioning engine (currently >70% auto-approval/denial rate).
Support integrations with internal ML models that perform fraud detection and financial document understanding.
Influence and elevate the engineering culture
Collaborate with senior and junior engineers across backend, full-stack and infra.
Improve developer velocity and support onboarding of larger enterprise customers.
Help the company scale from an early-stage engineering organisation to a mature, high-performance team.
What They're Looking For
7-8+ years' experience as a strong backend or full-stack IC.
Proven ability to re-architect complex systems and scale codebases beyond the “early startup” phase.
Experience in a fast-growing startup (Seed → A → B or similar) where the engineering org expanded meaningfully.
Depth in modern backend or full-stack development (ideal: TypeScript, React, Node.js, Python).
Someone who thrives in ambiguity, makes pragmatic technical decisions, and moves quickly.
A high engineering bar and the ability to raise the standards of those around you.
Tech Environment
Frontend: TypeScript, React
Backend: Node.js, Python
Data: Postgres, BigQuery, Redis
Cloud: GCP
Hybrid working model; candidates must be based in or willing to relocate to the San Francisco Bay Area. (Hybrid flexibility available for senior candidates.)
Why This Role Is Exciting
Join a business with strong revenue, real customers, and top-tier backers.
Have ownership of mission-critical architecture, not just feature work.
Work alongside a highly capable CTO and shape the company's technical trajectory for years to come.
Build systems that support real-world decisions for millions of end-users.
Competitive salary, meaningful equity, and the chance to make a long-term technical mark.
Staff Software Engineer
Software engineering internship job in Fremont, CA
The Role
Our client is seeking a Staff Software Engineer to join a small, senior team as a highly skilled individual contributor. In this hybrid role, you'll work across the stack to build new user-facing features and develop integrations with CAD and third-party applications. You'll partner closely with product managers, AI researchers, and other engineers to turn new ideas into production-ready systems at scale.
What You'll Do
Design and build scalable, reliable full-stack systems using React, Node.js, and Python.
Deploy ML models to production (you've done it before, and you'll do it again) and build robust products that users love.
Collaborate closely with ML and data teams to integrate models and pipelines into real-world products.
Architect backend systems around AWS services, databases, and modern data infrastructure.
Own performance and scale: build APIs, indexes, and search systems that make high-dimensional data feel instant.
Contribute to product direction: work with design, AI, and leadership to turn technical capabilities into delightful user experiences.
(Optional but exciting): advance 3D visualization, geometry, or rendering engines that make engineering feel magical.
What We're Looking For
You're a strong generalist who can build, ship, and scale complex full-stack systems.
You're fluent in React, Node.js, and Python, and comfortable designing APIs, services, and data flows end-to-end.
You've shipped large production systems, ideally ones that touch ML, data, or search.
You have experience with AWS databases, and you enjoy thinking about indexing, search, and vector data systems.
You're pragmatic, product-minded, and enjoy owning features from concept to deployment.
You collaborate naturally with AI, design, and data teams, and love turning complexity into clarity.
Bonus points if:
You've worked with large-scale data processing pipelines.
You have an interest in math, geometry, topology, rendering, or computational geometry.
You've built software in 3D printing, CAD, or computer graphics domains.
This is a rare opportunity to create the interfaces, infrastructure, and experiences that bring a new kind of intelligence to the physical world, and help define how AI becomes a tool for the imagination.
You love building systems that are elegant, fast, and deeply technical, and want to see them shape the real world.
Let's build the tools the future will be made in.
Compensation
The base salary range for this role is $175,000 - $240,000, plus equity. Flexible PTO and competitive compensation. Final offers will be based on experience, interview performance, and alignment with role requirements.
Head of Computer Use/ AI Engineer
Software engineering internship job in Fremont, CA
Edward Mann are hiring for an excellent Technology Start-up based in San Francisco.
About the Role
We are seeking a Head of Computer Use / Senior AI Engineer (hands-on role) to drive the evolution of the next generation of browser agents (testing browser agents).
You will lead the design, training, and advancement of next-generation AI agent systems. This role involves fine-tuning large language models (LLMs) and developing intelligent agents capable of navigating and interacting with real web environments. It's a high-impact position combining hands-on engineering, experimentation, and strategic input. You'll collaborate closely with technical leadership, contribute directly to product direction, and mentor other engineers in developing advanced agent capabilities.
Key Responsibilities
Develop, train, and deploy LLM-powered agents that interact with websites through real browser interfaces.
Fine-tune foundation models using advanced methods (e.g., LoRA, PEFT, DPO, RLHF) and select the best approach for each use case.
Design reinforcement learning systems that improve agent reasoning, adaptability, and task performance.
Own the full agent pipeline, from model architecture and policy design to simulation frameworks and testing environments.
Rapidly prototype, run experiments, and refine solutions to push the limits of agent performance.
Partner with technical leadership to shape product direction and research priorities.
Mentor and support other engineers, helping to grow a strong, mission-driven technical team.
Build and coordinate multi-agent workflows with structured roles, memory systems, and effective trajectories.
What You'll Bring
Strong background in machine learning with a PhD or equivalent industry experience in AI/ML/Computer Science.
Hands-on experience fine-tuning LLMs.
Deep applied knowledge of reinforcement learning techniques.
Experience building agents for real-world applications (bonus: browser-based or robotics experience).
Strong coding and experimentation skills, with a preference for practical problem-solving over theory alone.
A sense of ownership and drive to build impactful systems beyond titles or hierarchy.
Experience mentoring, managing, or growing technical teams.
Preferred Qualifications
Record of impactful research publications or open-source contributions.
Experience in high-growth, fast-paced start-up environments.
Lead Software Engineer
Software engineering internship job in San Francisco, CA
A top AI-native Command Center startup is looking for a lead software developer to join their growing technology team. Its platform centralizes internal and external data for companies and matches it with external insights to help them make better decisions and predict the future. With early traction in sports and entertainment, customers that include some of the biggest names such as the PGA and the Warriors, and a fresh venture round, they are scaling quickly.
As Engineering Lead at Cred, you'll shape how we build. You'll manage and mentor a growing team, drive best practices across delivery and QA, and help us scale our infrastructure, pipelines, and platform with AI and automation at the core.
Key Responsibilities:
Lead and scale the engineering function with a strong delivery mindset - shipping high-quality features weekly
Own and evolve internal processes around CI/CD, QA automation, observability, and DevOps
Design and build internal tools to automate development, QA, and data workflows, powered by AI
What We're Looking For
Must-Haves:
6-10+ years of hands-on engineering experience, with at least 2 years in engineering leadership roles
Proven ability to lead agile/scrum teams, set goals, track velocity, and manage delivery in sprint cycles
Strong background in DevOps and QA best practices - CI/CD pipelines, automated testing, infrastructure as code
Experience building and scaling data-rich products - including APIs, integrations, scraping tools, or cloud platforms
AI-native mindset - you actively look for ways to integrate LLMs and automation into engineering processes
Senior Software Engineer
Software engineering internship job in Sunnyvale, CA
Founding Engineer
On-Site
San Francisco, CA
$170,000 - $200,000
About:
We are seeking versatile Sr Software Engineers who specialize across disciplines - Machine Learning, Data Engineering, and Full-Stack Development. The ideal candidate is willing to get their hands dirty and push boundaries, and is driven by a need to succeed.
You will be ready to work diligently and build rapidly to win the market. You should be prepared to challenge existing concepts and develop alternative solutions.
Job Summary:
You'll operate at the cutting edge of LLMs, computer vision, and data engineering to automate compliance in precision-focused industries. You'll also collaborate with major global industrial partners. Your work will help build a product that leading organizations will depend on to prevent accidents, protect lives, and transform the way they run their operations.
Who You Are:
Able to make decisions quickly.
Proactive.
Comfortable with TypeScript, Python, Docker, LLMs, YOLO, Tesseract, PostgreSQL, AWS, and React Native.
Have a history of building products that have been used.
Thrive under pressure and within an unstructured environment.
What You'll Do:
Speak with users and gather their needs, experiences, and problems.
Architect systems that will be used daily by others at billion-dollar companies.
Build agent-swarm data pipelines that will autonomously audit.
Maintain and scale infrastructure.
Produce quickly without the fear of perfection.
Work directly with the founding team and customers.
Staff Software Engineer
Software engineering internship job in San Francisco, CA
The Role
Our client is seeking a Staff Software Engineer to join a small, senior team as a highly skilled individual contributor. In this hybrid role, you'll work across the stack to build new user-facing features and develop integrations with CAD and third-party applications. You'll partner closely with product managers, AI researchers, and other engineers to turn new ideas into production-ready systems at scale.
What You'll Do
Design and build scalable, reliable full-stack systems using React, Node.js, and Python.
Deploy ML models to production (you've done it before, and you'll do it again) and build robust products that users love.
Collaborate closely with ML and data teams to integrate models and pipelines into real-world products.
Architect backend systems around AWS services, databases, and modern data infrastructure.
Own performance and scale: build APIs, indexes, and search systems that make high-dimensional data feel instant.
Contribute to product direction: work with design, AI, and leadership to turn technical capabilities into delightful user experiences.
(Optional but exciting): advance 3D visualization, geometry, or rendering engines that make engineering feel magical.
What We're Looking For
You're a strong generalist who can build, ship, and scale complex full-stack systems.
You're fluent in React, Node.js, and Python, and comfortable designing APIs, services, and data flows end-to-end.
You've shipped large production systems, ideally ones that touch ML, data, or search.
You have experience with AWS databases, and you enjoy thinking about indexing, search, and vector data systems.
You're pragmatic, product-minded, and enjoy owning features from concept to deployment.
You collaborate naturally with AI, design, and data teams, and love turning complexity into clarity.
Bonus points if:
You've worked with large-scale data processing pipelines.
You have an interest in math, geometry, topology, rendering, or computational geometry.
You've built software in 3D printing, CAD, or computer graphics domains.
This is a rare opportunity to create the interfaces, infrastructure, and experiences that bring a new kind of intelligence to the physical world, and help define how AI becomes a tool for the imagination.
You love building systems that are elegant, fast, and deeply technical, and want to see them shape the real world.
Let's build the tools the future will be made in.
Compensation
The base salary range for this role is $175,000 - $240,000, plus equity. Flexible PTO and competitive compensation. Final offers will be based on experience, interview performance, and alignment with role requirements.
Head of Computer Use/ AI Engineer
Software engineering internship job in San Francisco, CA
Edward Mann are hiring for an excellent Technology Start-up based in San Francisco.
About the Role
We are seeking a Head of Computer Use / Senior AI Engineer (hands-on role) to drive the evolution of the next generation of browser agents (testing browser agents).
You will lead the design, training, and advancement of next-generation AI agent systems. This role involves fine-tuning large language models (LLMs) and developing intelligent agents capable of navigating and interacting with real web environments. It's a high-impact position combining hands-on engineering, experimentation, and strategic input. You'll collaborate closely with technical leadership, contribute directly to product direction, and mentor other engineers in developing advanced agent capabilities.
Key Responsibilities
Develop, train, and deploy LLM-powered agents that interact with websites through real browser interfaces.
Fine-tune foundation models using advanced methods (e.g., LoRA, PEFT, DPO, RLHF) and select the best approach for each use case.
Design reinforcement learning systems that improve agent reasoning, adaptability, and task performance.
Own the full agent pipeline, from model architecture and policy design to simulation frameworks and testing environments.
Rapidly prototype, run experiments, and refine solutions to push the limits of agent performance.
Partner with technical leadership to shape product direction and research priorities.
Mentor and support other engineers, helping to grow a strong, mission-driven technical team.
Build and coordinate multi-agent workflows with structured roles, memory systems, and effective trajectories.
What You'll Bring
Strong background in machine learning with a PhD or equivalent industry experience in AI/ML/Computer Science.
Hands-on experience fine-tuning LLMs.
Deep applied knowledge of reinforcement learning techniques.
Experience building agents for real-world applications (bonus: browser-based or robotics experience).
Strong coding and experimentation skills, with a preference for practical problem-solving over theory alone.
A sense of ownership and drive to build impactful systems beyond titles or hierarchy.
Experience mentoring, managing, or growing technical teams.
Preferred Qualifications
Record of impactful research publications or open-source contributions.
Experience in high-growth, fast-paced start-up environments.
Lead Software Engineer
Software engineering internship job in Alameda, CA
A top AI-native Command Center startup is looking for a lead software developer to join their growing technology team. Its platform centralizes internal and external data for companies and matches it with external insights to help them make better decisions and predict the future. With early traction in sports and entertainment, customers that include some of the biggest names such as the PGA and the Warriors, and a fresh venture round, they are scaling quickly.
As Engineering Lead at Cred, you'll shape how we build. You'll manage and mentor a growing team, drive best practices across delivery and QA, and help us scale our infrastructure, pipelines, and platform with AI and automation at the core.
Key Responsibilities:
Lead and scale the engineering function with a strong delivery mindset - shipping high-quality features weekly
Own and evolve internal processes around CI/CD, QA automation, observability, and DevOps
Design and build internal tools to automate development, QA, and data workflows, powered by AI
What We're Looking For
Must-Haves:
6-10+ years of hands-on engineering experience, with at least 2 years in engineering leadership roles
Proven ability to lead agile/scrum teams, set goals, track velocity, and manage delivery in sprint cycles
Strong background in DevOps and QA best practices - CI/CD pipelines, automated testing, infrastructure as code
Experience building and scaling data-rich products - including APIs, integrations, scraping tools, or cloud platforms
AI-native mindset - you actively look for ways to integrate LLMs and automation into engineering processes
Senior Software Engineer
Software engineering internship job in Fremont, CA
Founding Engineer
On-Site
San Francisco, CA
$170,000 - $200,000
About:
We are seeking versatile Sr Software Engineers who specialize across disciplines - Machine Learning, Data Engineering, and Full-Stack Development. The ideal candidate is willing to get their hands dirty and push boundaries, and is driven by a need to succeed.
You will be ready to work diligently and build rapidly to win the market. You should be prepared to challenge existing concepts and develop alternative solutions.
Job Summary:
You'll operate at the cutting edge of LLMs, computer vision, and data engineering to automate compliance in precision-focused industries. You'll also collaborate with major global industrial partners. Your work will help build a product that leading organizations will depend on to prevent accidents, protect lives, and transform the way they run their operations.
Who You Are:
Able to make decisions quickly.
Proactive.
Comfortable with TypeScript, Python, Docker, LLMs, YOLO, Tesseract, PostgreSQL, AWS, and React Native.
Have a history of building products that have been used.
Thrive under pressure and within an unstructured environment.
What You'll Do:
Speak with users and gather their needs, experiences, and problems.
Architect systems that will be used daily by others at billion-dollar companies.
Build agent-swarm data pipelines that will autonomously audit.
Maintain and scale infrastructure.
Produce quickly without the fear of perfection.
Work directly with the founding team and customers.