Software Development Engineer, AI/ML, AWS Neuron, Model Inference
Applications developer job in Cupertino, CA
The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. This comprehensive toolkit includes an ML compiler, runtime, and application framework that integrate seamlessly with popular ML frameworks like PyTorch and JAX, enabling unparalleled ML inference and training performance.
The Inference Enablement and Acceleration team is at the forefront of running a wide range of models, supporting novel architectures, and maximizing their performance on AWS's custom ML accelerators. Working across the stack from PyTorch down to the hardware-software boundary, our engineers build systematic infrastructure, invent new methods, and create high-performance kernels for ML functions, ensuring every compute unit is fine-tuned for optimal performance on our customers' demanding workloads. We combine deep hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration.
As part of the broader Neuron organization, our team works across multiple technology layers, from frameworks and kernels to runtime and collectives, in close collaboration with the compiler team. We not only optimize current performance but also contribute to future architecture designs, working closely with customers to enable their models and ensure optimal performance. This role offers a unique opportunity to work at the intersection of machine learning, high-performance computing, and distributed architectures, where you'll help shape the future of AI acceleration technology.
You will architect and implement business-critical features and mentor a brilliant team of experienced engineers. We operate in spaces that are very large, yet our teams remain small and agile. There is no blueprint. We're inventing. We're experimenting. It is a unique learning culture. The team works closely with customers on their model enablement, providing direct support and optimization expertise to ensure their machine learning workloads achieve optimal performance on AWS ML accelerators. The team collaborates with open source ecosystems to provide seamless integration and bring peak performance at scale for customers and developers.
This role is responsible for development, enablement, and performance tuning of a wide variety of LLM families, including massive-scale large language models like the Llama family, DeepSeek, and beyond. The Inference Enablement and Acceleration team works side by side with compiler engineers and runtime engineers to create, build, and tune distributed inference solutions with Trainium and Inferentia. Experience optimizing inference performance for both latency and throughput on such large models across the stack, from system-level optimizations through to PyTorch or JAX, is a must-have.
Key job responsibilities
This role will help lead the efforts in building distributed inference support for PyTorch in the Neuron SDK. This role will tune these models to ensure the highest performance and maximize their efficiency when running on customers' AWS Trainium and Inferentia silicon and servers. Strong software development skills in Python, system-level programming, and ML knowledge are all critical to this role. Our engineers collaborate across compiler, runtime, framework, and hardware teams to optimize machine learning workloads for our global customer base. Working at the intersection of software, hardware, and machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will:
* Design, develop, and optimize machine learning models and frameworks for deployment on custom ML hardware accelerators.
* Participate in all stages of the ML system development lifecycle including distributed computing based architecture design, implementation, performance profiling, hardware-specific optimizations, testing and production deployment.
* Build infrastructure to systematically analyze and onboard multiple models with diverse architectures.
* Design and implement high-performance kernels and features for ML operations, leveraging the Neuron architecture and programming models
* Analyze and optimize system-level performance across multiple generations of Neuron hardware
* Conduct detailed performance analysis using profiling tools to identify and resolve bottlenecks
* Implement optimizations such as fusion, sharding, tiling, and scheduling
* Conduct comprehensive testing, including unit and end-to-end model testing with continuous deployment and releases through pipelines.
* Work directly with customers to enable and optimize their ML models on AWS accelerators
* Collaborate across teams to develop innovative optimization techniques
A day in the life
You will collaborate with a cross-functional team of applied scientists, system engineers, and product managers to deliver state-of-the-art inference capabilities for Generative AI applications. Your work will involve debugging performance issues, optimizing memory usage, and shaping the future of Neuron's inference stack across Amazon and the Open Source Community. As you design and code solutions to help our team drive efficiencies in software architecture, you'll create metrics, implement automation and other improvements, and resolve the root cause of software defects.
You will also build high-impact solutions to deliver to our large customer base and participate in design discussions, code review, and communicate with internal and external stakeholders. You will work cross-functionally to help drive business decisions with your technical input. You will work in a startup-like development environment, where you're always working on the most important initiative.
About the team
The Inference Enablement and Acceleration team fosters a builder's culture where experimentation is encouraged and impact is measurable. We emphasize collaboration, technical ownership, and continuous learning. Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we're building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help you develop your engineering expertise so you feel empowered to take on more complex tasks in the future. Join us to solve some of the most interesting and impactful infrastructure challenges in AI/ML today.
BASIC QUALIFICATIONS
- Bachelor's degree in computer science or equivalent
- 5+ years of non-internship professional software development experience
- 5+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience
- Knowledge of the fundamentals of machine learning and LLMs, including their architecture and their training and inference lifecycles, along with work experience optimizing model execution.
- Software development experience in C++ or Python (experience in at least one language is required).
- Strong understanding of system performance, memory management, and parallel computing principles.
- Proficiency in debugging, profiling, and implementing best software engineering practices in large-scale systems.
PREFERRED QUALIFICATIONS
- Familiarity with PyTorch, JIT compilation, and AOT tracing.
- Familiarity with CUDA kernels or equivalent ML or low-level kernels
- Experience developing performant kernels with libraries such as CUTLASS or FlashInfer.
- Familiarity with syntax and tile-level semantics similar to Triton.
- Experience with online/offline inference serving with vLLM, SGLang, TensorRT or similar platforms in production environments.
- Deep understanding of computer architecture and operating-system-level software, and working knowledge of parallel computing.
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status.
Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company's reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit ********************************************************* for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit ******************************************************** This position will remain posted until filled. Applicants should apply via our internal or external career site.
Full Stack .Net Application Developer
Applications developer job in Los Angeles, CA
Duration: 12 Months
**Only On w2**
Local to Alhambra, CA 91803
Hybrid: 2 days
The Full-Stack Application Developer will lead the design, development, and integration of enterprise-level applications and systems across PHIS and SAPC IT. This role requires at least 10 years of hands-on coding experience and expertise in software engineering using Microsoft technologies (C#.NET, ASP.NET Core, MVC, Razor, Web APIs), jQuery, Bootstrap, and SQL Server, with a focus on cloud-native development and modern design patterns. The Full-Stack Application Developer will be responsible for end-to-end software development, testing, code reviews, and defect resolution, as well as serving as a liaison between IT, quality assurance, and business stakeholders. The role also calls for experience with Agile/Scrum methodologies, API integration, and translating business needs into technical specifications.
- Proficiency in the design, development, testing, and support of large-scale web applications and system integrations.
- Knowledge of C#.Net, MVC, ASP.NET, .Net Core, Web APIs, Razor Pages, jQuery, and Bootstrap.
- Knowledge of data engineering in SQL Server, including how to develop high-performance SQL queries.
- Proficiency in the implementation of RESTful APIs.
- Familiarity with Microsoft Azure DevOps and GitHub.
- Proficiency in modern design patterns and practices.
- Ability to translate business requirements into technology requirements for inclusion in contracts and/or statements of work.
- Experience with Agile/Scrum for software development.
- Knowledge of cloud service models, such as PaaS and SaaS, and familiarity with cloud technologies, such as Azure and AWS.
Experience Preferred
- 10+ years of work experience in the design, development, testing, and support of large-scale web applications and system integrations using the Microsoft stack, with a particular focus on high-volume transactions, secure architecture, low latency, optimal performance, and proper scalability.
- 2+ years of work experience as a development lead.
- 6+ years of work experience in hands-on software development using C#.Net, MVC, ASP.NET, .Net Core, Web APIs, Razor Pages, jQuery, and Bootstrap.
- 4+ years of experience with data engineering with an understanding of database systems (SQL Server) and distributed computing.
- 4+ years of experience in the design and implementation of RESTful APIs.
- 2+ years of hands-on work experience with Azure or AWS cloud and with hybrid architectural designs and infrastructure solutions.
- 1+ year of experience working with Electronic Health Record systems and with FHIR APIs or similar.
Education Preferred
Bachelor's degree in Computer Science, Information Systems, or a closely related field
Additional Information
The work location is 1000 S. Fremont St., Building West A-9, 5th floor, Alhambra, CA 91803. The candidate MUST reside in the Los Angeles area and be able to work onsite 2 days per week.
Python Backend Engineer - 3D / Visualization / API / Software (On-site)
Applications developer job in San Jose, CA
A pioneering and well-funded AI company is seeking a talented Python Backend Engineer to build the core infrastructure for its revolutionary autonomous systems. This is a unique opportunity to join an innovative team at the forefront of engineering and artificial intelligence, creating a new category of software that will redefine how complex products in sectors like aerospace, automotive, and advanced manufacturing are designed and developed.
Why Join?
Build the Future of Engineering: This isn't just another backend role. Your work will directly shape how next-generation rockets, cars, and aircraft are designed, fundamentally changing the engineering landscape.
Solve Unprecedented Technical Puzzles: Tackle unique challenges in building the infrastructure for autonomous AI agents, including simulation orchestration, multi-agent coordination, and scalable model serving.
Shape a Foundational Platform: As a critical member of a pioneering team, you will have a significant impact on the technical direction and core architecture of an entirely new category of software.
Join a High-Impact Team: Work in a collaborative, fast-paced environment where your expertise is valued, and you have end-to-end ownership of critical systems.
Compensation & Location: Base salary of up to $210,000 + equity + benefits, while working on-site with the team in a modern office in downtown San Francisco.
The Role
As a Python Backend Engineer, you will be instrumental in constructing the infrastructure that underpins these autonomous engineering agents. Your responsibilities will span model serving, simulation orchestration, multi-agent coordination, and the development of robust, developer-facing APIs. This position is critical for delivering the fast, reliable, and scalable systems that professional engineers will trust and depend on in high-stakes production environments.
You will:
Own and build the core backend infrastructure for the autonomous AI agents, focusing on scalability, model serving, and multi-agent orchestration.
Design and maintain robust APIs while integrating essential third-party tools like CAD software and simulation backends into the core platform.
Develop backend services to process and serve complex 3D visualizations from simulation and geometric data.
Collaborate across ML, frontend, and simulation teams to shape the product and engage directly with early customers to drive infrastructure needs.
Make foundational architectural decisions that will define the technical future and scalability of the entire platform.
The Essential Requirements
Strong backend software engineering experience, with a primary focus on Python.
Proven experience in designing, building, and maintaining production-level APIs (FastAPI preferred but Flask and Django also considered).
Experience with 3D visualization libraries or tools such as PyVista, ParaView, or VTK.
Excellent systems-thinking skills and the ability to reason about the interactions between compute, data, and models.
Experience working in fast-paced environments where end-to-end ownership and proactivity are essential.
Exceptional communication and collaboration abilities.
What Will Make You Stand Out
Experience integrating with scientific or engineering software (such as CAD, FEA, or CFD tools).
Exposure to agent frameworks, workflow orchestration engines, or distributed systems.
Familiarity with model serving frameworks (e.g., TorchServe, Triton) or simulation backends.
Previous experience building developer-focused tools or working in high-trust, customer-facing technical roles.
If you are interested in this role, please apply with your resume through this site.
SEO Keywords for Search
Python Backend Engineer, Python Software Engineer, Backend Engineer, Software Engineer, Python Developer, AI Engineer, Machine Learning Infrastructure, MLOps Engineer, Backend Software Engineer (Python), Senior Backend Engineer, AI/ML Engineer, Infrastructure Engineer, FastAPI Developer, PyVista, ParaView, VTK, 3D Visualization, Docker, Kubernetes, Cloud Engineer, AI Platform Engineer, Distributed Systems Engineer, Simulation Software Engineer, CAD Integration, CFD, FEA, Scientific Computing, High-Performance Computing (HPC), Agent Frameworks, Workflow Orchestration, Technical Lead, Staff Engineer.
Disclaimer
Attis Global Ltd is an equal opportunities employer. No terminology in this advert is intended to discriminate on any of the grounds protected by law, and all qualified applicants will receive consideration for employment without regard to age, sex, race, national origin, religion or belief, disability, pregnancy and maternity, marital status, political affiliation, socio-economic status, sexual orientation, gender, gender identity and expression, and/or gender reassignment. M/F/D/V. We operate as a staffing agency and employment business. More information can be found at attisglobal.com.
Founding Software Engineer / Protocol Engineer
Applications developer job in San Jose, CA
We are actively searching for a Founding Protocol Engineer to join our team on a permanent basis. If you are impressed with what Hyperliquid has accomplished, then this role is for you. We are on a mission to build next-generation lending and debt protocols. We are open to both Senior-level and Architect-level candidates for this role.
Your Rhythm:
Drive the architecture, technical design, and implementation of our lending protocol.
Collaborate closely with researchers to validate and test designs
Collaborate with auditors and security engineers to ensure safety of the protocol
Participate in code reviews, providing constructive feedback and ensuring adherence to established coding standards and best practices
Your Vibe:
5+ years of professional software engineering experience
3+ years of experience working with Solidity on the EVM in production environments, specifically focused on DeFi products
2+ years of experience working with modern backend languages (Go, Rust, Python, etc.) in distributed architectures
Experience building lending protocols in a smart contract language
Open to collaborating onsite a few days a week at our downtown SF office
Our Vibe:
Relaxed work environment
100% paid, top-of-the-line health care benefits
Full ownership, no micromanagement
Strong equity package
401(k)
Unlimited vacation
An actual work/life balance: we aren't trying to run you into the ground. We have families and enjoy life too!
Software Engineer
Applications developer job in Fremont, CA
🚀 Software Engineer - AI & Full Stack (San Francisco, CA)
💼 Full-Time | 🧠 1-4+ Years Experience | 💰 $150,000-$210,000
We're building self-improving software - AI that continuously creates, tests, and enhances digital experiences. Backed by Y Combinator, Gradient, and leaders from OpenAI, Uber, and Meta, we've raised $5M+ and are scaling fast.
If you love building things that think for themselves, this is your chance to help shape the next wave of intelligent software.
🧩 What You'll Do
Build an AI-powered paywall editor serving millions of users every day.
Work across the stack - Next.js frontend + Python backend - integrating the latest AI models & APIs.
Ship fast: design → code → test → deploy → learn → repeat.
Collaborate directly with founders, engineers, and customers to deliver exceptional user experiences.
⚡ What We're Looking For
Strong problem-solving and full-stack skills (Python, React, TypeScript).
Experience building user-facing products that people love.
Excellent communication and a bias for action.
Ownership mindset - you ship things that matter.
Startup experience = BONUS
BSc in Computer Science preferred
💡 Bonus Points
Experience with AI/LLM integrations.
Startup or founder-level experience.
Mobile skills (Swift, Flutter, React Native).
🧠 Tech Stack
Next.js (React/TypeScript), Zustand, Tailwind, Shadcn, Python, Supabase, Fly.io, Swift, Flutter, Expo
📍 In-person role - San Francisco (Mon-Sat)
U.S. work authorization required (O-1 visa sponsorship possible).
If you're ready to build AI that builds software, we'd love to hear from you.
👉 Apply now and help us invent the future of intelligent systems.
Robotic Software Engineer
Applications developer job in Fremont, CA
Robotics Software Engineer (Generalist/Full-Stack)
Robotic Software Engineer - Humanoid Robotics
Palo Alto, SF Bay Area (Full-time | Onsite)
$180k-$200k + equity (flexible for exceptional candidates)
We are building next-generation humanoid robotic systems that combine advanced AI with cutting-edge hardware. Our team moves fast, prototypes aggressively, and puts real robots into the world. We're now hiring a Robotic Software Engineer to help shape our core software stack and accelerate the development of our embodied AI systems.
What You'll Work On
As part of a small, high-impact engineering team, you will:
Build and optimise robotics software in C++ and ROS2
Integrate perception, control, planning, and learning modules
Work hands-on with robots to bring up new hardware and run real-world experiments
Deploy reinforcement learning / imitation learning policies onto physical robots
Develop middleware, interfaces, and tooling that connect AI → hardware
Prototype behaviours across diverse robot types (arms, humanoids, mobile platforms, drones)
This role directly supports both our AI and hardware teams and has significant ownership from day one.
Must-haves:
Strong C++ development skills (multi-threading, performance, systems-level)
Professional experience with ROS2
Hands-on robotics experience - ideally robot learning on physical hardware
Ability to work on real robots (debugging, integration, testing)
Generalist mindset and comfort in a fast-paced startup environment
Nice-to-haves:
Manipulation or kinematics (humanoids, arms, quadrupeds)
Controls for mobile robots or drones
Sensor/actuator integration, drivers, or middleware experience
VR prototyping (Meta Quest or similar)
Experience across different robot embodiments
Why Join Us
Build software that runs on real humanoid robots immediately
High ownership within a small, world-class engineering team
Competitive compensation + meaningful equity
Opportunity to influence architecture, roadmap, and product direction
Work at one of the most exciting intersections in tech: AI × robotics
Software Engineer
Applications developer job in Santa Rosa, CA
Founding Engineer
$140K - $200K + equity
San Francisco (Onsite Role)
Direct Hire
A fast-growing early-stage startup that recently secured significant seed funding is actively hiring three software engineers to join its founding team. They're looking for people who are scrappy, move fast, challenge assumptions, and are driven to win. They build quickly and expect teammates to push boundaries.
Who You Are
Make quick, reversible (“two-way door”) decisions
Proactively fix problems before being asked
Comfortable working across a modern engineering stack (e.g., TypeScript, Python, containerisation, ML/LLM tooling, databases, cloud environments, mobile frameworks)
Have built real, shipped products
Thrive in ambiguity and fast-moving environments
What You'll Do
Talk directly with users to understand their workflows, pain points, and needs
Architect systems that support large enterprise usage
Build automated pipelines and intelligent agents that process and verify large volumes of data
Maintain scalable, robust infrastructure
Ship quickly - progress over perfection
The Reality
You'll work closely with the founding team and directly with customers
User value beats hype, trends, and “cool tech”
Expect a demanding, high-output culture
If you're a Software Engineer with 2+ years' experience and want to work in a growing startup, please apply now for immediate consideration.
Sr/Lead Golang/Python Developer (W2 Only)
Applications developer job in Pasadena, CA
Title: Sr/Lead Golang/Python Developer
Duration: 6+ Months to support Telephony application requirements.
Must Haves:
4-yr Technical Degree (CS or related) AND
8 yrs of IT/Engineering AND
4+ yrs of backend programming & API (Go & NoSQL/Mongo) AND
3+ yrs of Python/Bash scripting AND
2+ yrs of building or working with scalable apps (Kubernetes / Docker) AND
2+ yrs of designing, installing, configuring, maintaining, and troubleshooting VoIP (SIP) servers, infrastructure, and applications AND
2+ yrs of mentoring Jr. staff (code review, work dissemination, etc)
Java Software Engineer
Applications developer job in Concord, CA
Role: Senior Software Engineer (Java)
Contract: 12 to 24 months
Skills Needed: Backend Java, API development, Microservices, Oracle, Splunk
Client JD-
We are seeking a Senior Software Engineer (SE3) with strong backend Java experience to support the development of APIs and microservices within a large-scale banking/transaction environment. The role involves modernizing monolithic applications, contributing to cloud migration (OCP), and ensuring platform stability, performance, and security.
Key Responsibilities
Design, develop, test, and support backend APIs and microservices.
Work on modernization and cloud migration efforts.
Ensure scalability, resiliency, and secure SDLC practices.
Handle production support, monitoring, and issue resolution.
Collaborate with product managers, architects, and engineering teams.
Guide junior developers when needed.
Required Skills
4+ years Java/Spring development
4+ years API/microservices experience
2+ years Oracle database experience
Experience with Splunk or similar monitoring tools
Agile/Scrum experience
Nice to Have
Experience decomposing monolithic apps
Cloud/OCP migration experience
Kafka or event-driven architecture
API management tools (e.g., Apigee)
Exposure to GenAI/Copilot (bonus)
EEO:
“Mindlance is an Equal Opportunity Employer and does not discriminate in employment based on - Minority/Gender/Disability/Religion/LGBTQI/Age/Veterans.”
Software Engineer - Runtime
Applications developer job in Santa Clara, CA
We're a Series A and we need a systems-savvy engineer who can architect, optimize, and turbocharge our multi-target runtime from day one.
If concurrent programming is your playground, C++14 is your native language, and you think in cache lines, pipelines, and memory hierarchies, this role puts you at the heart of the action.
What You'll Do
Design, build, and continually improve our multi-target runtime
Apply cutting-edge parallelization + partitioning techniques to generate and exploit highly optimized kernels
Rapidly prototype ideas and validate them with real data
What You Bring
Deep expertise in asynchronous + concurrent programming
4+ years of modern C/C++
Strong grasp of hardware architecture (scalar vs vector, memory hierarchies, etc.)
Knowledge of OS kernel or hypervisor development
Bonus Points
CUDA/ROCm library experience
GPU programming background
HPC experience
MS/PhD in CS or equivalent
Familiarity with PyTorch, JAX, Triton
Experience wrangling large compute clusters
Why You'll Love It
You'll own critical, performance-sensitive systems that sit at the core of our stack, shaping how next-gen ML models run across diverse hardware. High impact, deep tech, zero bureaucracy.
If you want to engineer at the limits of performance and help build a runtime that changes the game, let's talk!
Software Engineer, AI Data Platform
Applications developer job in Mountain View, CA
Granica is redefining how enterprises prepare and optimize data at the most fundamental layer of the AI stack-where raw information becomes usable intelligence. Our technology operates deep in the data infrastructure layer, making data efficient, secure, and ready for scale.
We eliminate the hidden inefficiencies in modern data platforms-slashing storage and compute costs, accelerating pipelines, and boosting platform efficiency. The result: 60%+ lower storage costs, up to 60% lower compute spend, 3× faster data processing, and 20% overall efficiency gains.
Why It Matters
Massive data should fuel innovation, not drain budgets. We remove the bottlenecks holding AI and analytics back-making data lighter, faster, and smarter so teams can ship breakthroughs, not babysit storage and compute bills.
Who We Are
World-renowned researchers in compression, information theory, and data systems
Elite engineers from Google, Pure Storage, Cohesity, and top cloud teams
Enterprise sellers who turn ROI into seven‑figure wins.
Powered by World-Class Investors & Customers
$65M+ raised from NEA, Bain Capital, A* Capital, and operators behind Okta, Eventbrite, Tesla, and Databricks. Our platform already processes hundreds of petabytes for industry leaders.
Our Mission: We're building the default data substrate for AI, and a generational company built to endure.
Smarter Infrastructure for the AI Era:
We make data efficient, safe, and ready for scale-think smarter, more foundational infrastructure for the AI era. Our technology integrates directly with modern data stacks like Snowflake, Databricks, and S3-based data lakes, enabling:
60%+ reduction in storage costs and up to 60% lower compute spend
3x faster data processing
20% platform efficiency gains
Trusted by Industry Leaders
Enterprise leaders globally already rely on Granica to cut costs, boost performance, and unlock more value from their existing data platforms.
A Deep Tech Approach to AI
We're unlocking the layers beneath platforms like Snowflake and Databricks, making them faster, cheaper, and more AI-native. We combine advanced research with practical productization, powered by a dual-track strategy:
Research: Led by Chief Scientist Andrea Montanari (Stanford Professor), we publish 1-2 top-tier papers per quarter.
Product: Actively processing 100+ PBs today and targeting Exabyte scale by Q4 2025.
Backed by the Best
We've raised $60M+ from NEA, Bain Capital, A* Capital, and operators behind Okta, Eventbrite, Tesla, and Databricks.
Our Mission
To convert entropy into intelligence, so every builder-human or AI-can make the impossible real.
We're building the default data substrate for AI, and a generational company built to endure beyond any single product cycle.
WHAT YOU'LL DO
This is a deep systems role for someone who lives and breathes distributed infrastructure, understands how data moves at scale, and wants to build the next-generation AI data platform from the ground up.
Own the ACID backbone. Design and harden transactional layers and metadata services so that petabyte-scale tables can time-travel in microseconds and schema evolution becomes a non-event.
Turn metadata into rocket fuel. Build compaction, caching, and pruning services that keep millions of file pointers within 50 ms from lookup to plan.
Squeeze more signal per byte. Optimize data layouts, from column ordering to dictionary encoding and bit-packing, bloom filters, and zone-map indexes, to cut scan I/O by 10× on real-world workloads.
Ship adaptive indexing with research. Co-invent machine-driven indexes that learn access patterns and automatically re-partition nightly: no more manual “analyze table” ever again.
Scale the engine, not the babysitting. Write Spark, Flink, or batch pipelines that autoscale across S3, GCS, and ADLS; expose observability hooks; and survive chaos drills without triggering a pager storm.
Code for longevity. Write clean, test-soaked Java, Scala, Go, or C++. Document key invariants so future teams can extend the system instead of rewriting it.
Measure success in human latency. If analysts see their dashboards refresh in blink-level time, you've won. Publish your breakthrough and mentor the next engineer to raise the bar again.
WHAT WE'RE LOOKING FOR
You've built systems where performance, resilience, and clarity of design all matter. You thrive at the intersection of infrastructure engineering and applied research, and care deeply about both how something works and how well it works at scale.
Core Skills
Distributed Systems and Storage Fundamentals - consistency, replication, sharding, durability, transactions.
Columnar Storage Optimization - deep knowledge of Parquet or similar formats (column ordering, compression, zone maps).
Metadata and Indexing Systems - experience building metadata-driven services, compaction, caching, and adaptive indexing.
Distributed Compute at Scale - production-grade Spark/Flink or equivalent pipeline development across S3, GCS, or ADLS.
Programming for Scale and Longevity - strong coding in Java, Scala, Go, or C++, with clean testing and documentation practices.
Resilient Systems and Observability - you've built systems that survive chaos drills and expose the right metrics.
Desired Skills
Exposure to open table formats such as Apache Iceberg, Delta Lake, or Hudi.
Experience with catalog services, query planning, or compaction frameworks.
OSS contributions or published work in data infrastructure or distributed systems.
WHY JOIN GRANICA
If you've helped build the modern data stack at a large company (Databricks, Snowflake, Confluent, or similar), you already know how critical lakehouse infrastructure is to AI and analytics at scale. At Granica, you'll take that knowledge and apply it where it matters most: at the most fundamental layer in the data ecosystem.
Own the product, not just the feature. At Granica, you won't be optimizing edge cases or maintaining legacy systems. You'll architect and build foundational components that define how enterprises manage and optimize data for AI.
Move faster, go deeper. No multi-month review cycles or layers of abstraction, just high-agency engineering work where great ideas ship weekly. You'll work directly with the founding team, engage closely with design partners, and see your impact hit production fast.
Work on hard, meaningful problems. From transaction layer design in Delta and Iceberg, to petabyte-scale compaction and schema evolution, to adaptive indexing and cost-aware query planning: this is deep systems engineering at scale.
Join a team of expert builders. Our engineers have designed the core internals of cloud-scale data systems, and we maintain a culture of peer-driven learning, hands-on prototyping, and technical storytelling.
Core Differentiation: We're focused on unlocking a deeper layer of AI infrastructure. By optimizing the way data is stored, processed, and retrieved, we make platforms like Snowflake and Databricks faster, more cost-efficient, and more AI-native. Our work sits at the most fundamental layer of the AI stack: where raw data becomes usable intelligence.
Be part of something early, without the chaos. Granica has already secured $65M+ from NEA, Bain Capital Ventures, A* Capital, and legendary operators from Okta, Tesla, and Databricks.
Grow with the company. You'll have the chance to grow into a technical leadership role, mentor future hires, and shape both the engineering culture and product direction as we scale.
COMPENSATION & BENEFITS
Competitive salary and meaningful equity
Unlimited PTO + quarterly recharge days
Premium health, vision, and dental
Team offsites, deep tech talks, and learning stipends
Help build the foundational infrastructure for the AI era
Granica is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
Software Engineer
Applications developer job in San Francisco, CA
Software Engineer, C++ Systems
About the Role
We're seeking a highly skilled Software Engineer (C++ Systems) to join our client's team and help build the core of our GPU virtualization platform. This role is ideal for engineers who thrive on microsecond-level performance optimization, enjoy working deep in complex C++ systems, and are motivated by building foundational infrastructure that directly impacts customers.
You'll play a critical role in scaling our platform as we serve a rapidly growing customer base, owning production systems from day one and tackling technically demanding challenges at the forefront of GPU infrastructure.
What You'll Do
Optimize performance of our C++ GPU virtualization library at the systems level
Research and develop solutions for GPU oversubscription, checkpointing, and distributed GPU clusters
Support new hardware and software architectures with a deep, end-to-end understanding of the stack
Debug low-level systems in production environments
Diagnose and resolve performance issues in machine learning workloads
Collaborate closely with the CTO on advanced systems design and implementation
Required Experience
Proven experience building and operating low-level systems in production environments
Background working with compilers, kernels, or networking protocols
Demonstrated ability to trace and resolve performance issues across complex systems
Technical Skills
Expert-level C++ proficiency (Rust experience is acceptable, though primary development will be in C++)
Experience optimizing C++ and NIC performance
Strong systems-level debugging and performance analysis skills
Education
Degree in Computer Science or a related field from a top-tier program
Strong academic performance (3.7+ GPA)
Soft Skills
Ability to deliver high-quality output quickly in an early-stage startup environment
Comfortable taking full ownership of critical production systems
Thrives in ambiguous, high-impact problem spaces
Company & Opportunity
Building GPU virtualization software that dramatically improves GPU efficiency
Operating a fast-growing GPU cloud, scaling from $0 to $500K in revenue in just six months
Backed by Y Combinator and a recently closed $4.5M Seed round
Join as employee #5 at a pivotal moment: product-market fit validated and scaling rapidly
Work directly with the CTO on systems challenges few startups get to tackle
This is a hardcore C++ systems role focused on GPU virtualization, performance tuning, production debugging, and advanced research
Expect ownership, impact, and problems that demand top 0.1% technical skill
Why This Role
If you love squeezing performance out of low-level systems, enjoy working at the intersection of GPUs, distributed systems, and production infrastructure, and want to help scale a breakthrough platform at an early stage, this role offers a rare and exciting opportunity.
Software Engineer, Frontend
Applications developer job in Santa Clara, CA
We are looking for a Senior Front-End Software Engineer with strong software fundamentals to join a high-performing platform development team. This role combines hands-on development, mentorship, and growth opportunities. You will work on UI implementation and maintenance across multiple functional areas, contributing daily to improving user experiences and building deep expertise in the product.
Key Responsibilities
Partner with Product Managers and Designers to define and deliver new features and solutions.
Collaborate with engineering teams across the stack to build scalable, user-facing features.
Work closely with the Support team to triage bugs and resolve production issues quickly.
Drive planning and execution of mid- to large-scale projects from conception to launch.
Act as a subject matter expert while resolving complex technical challenges.
Oversee the full systems development lifecycle, including architecture definition, design, scoping, planning, implementation, testing, documentation, and maintenance.
Qualifications
6+ years of front-end development experience.
Strong technical background (degree in Computer Science, Engineering, or related field preferred, or equivalent experience).
Advanced knowledge of HTML, CSS, and ES6 JavaScript.
Advanced knowledge of React, Next.js, and TypeScript.
Experience using and consuming REST APIs with a strong understanding of client-server interaction.
Familiarity with Agile/Scrum development methods.
Expert-level problem-solving and communication skills.
Software Engineer (Computer Vision, Robotics)
Applications developer job in Santa Clara, CA
About Us
At Autonomous Healthcare, we are at the forefront of medical innovation, developing the next generation of devices that will revolutionize patient care. Our mission is to commercialize breakthrough medical technologies by leveraging cutting-edge AI and autonomous systems. We believe that the best solutions are built together, and we are looking for a key member to join our collaborative R&D team.
About the Role
We are seeking a highly motivated and skilled engineer to join our team in developing next-generation patient monitoring systems. This role is at the intersection of computer vision, signal processing, and high-performance software engineering. You will be responsible for building the core analytical engine that transforms raw depth-sensor video into actionable health information.
This is not a purely theoretical position. You will be hands-on, designing algorithms that are efficient enough for real-time applications and robust enough for real-world clinical use. You will write the production-level Python code that brings these algorithms to life on cutting-edge edge computing platforms.
If you are a problem-solver who thrives on analyzing complex sensor data and building tangible, high-performance systems, we want to hear from you.
Key Responsibilities
Develop and implement real-time computer vision algorithms in Python to detect, track, and analyze regions of interest from video data (specifically depth sensors).
Design and build signal processing pipelines to extract, filter, and interpret physiological movement data from sensor signals.
Optimize algorithms for performance to meet strict real-time processing requirements.
Deploy and validate analysis software on edge computing platforms with GPU acceleration (e.g., NVIDIA Jetson).
Collaborate in a multidisciplinary team to integrate your solutions into a complete monitoring product.
Rigorously test, debug, and document your code and algorithms.
Required Qualifications
Strong proficiency in Python and experience writing clean, efficient, and maintainable code.
Solid foundation in computer vision principles and hands-on experience with libraries like OpenCV.
Solid foundation in digital signal processing (e.g., filtering, time-series analysis, feature extraction) and experience with libraries like SciPy or NumPy.
B.S. or M.S. in Computer Science, Robotics, Electrical Engineering, Biomedical Engineering, or a related technical field.
Demonstrable experience in analyzing imaging or sensor data to solve complex problems.
Excellent problem-solving skills and the ability to work independently and as part of a team.
Preferred Skills (We'd love to see these)
Experience with high-performance edge computing platforms (e.g., NVIDIA Jetson).
Familiarity with GPU programming (e.g., CUDA, TensorRT) for accelerating algorithms.
A background in robotics, autonomous vehicles, or real-time analysis of sensor data (e.g., LiDAR, RADAR, IMU).
Experience with depth sensors, 3D data processing, or point cloud analysis.
Knowledge of machine learning or deep learning frameworks (e.g., PyTorch, TensorFlow) for vision or time-series tasks.
Familiarity with software development best practices (e.g., Git, unit testing, CI/CD).
Backend Software Engineer - Cloud Services
Applications developer job in Sunnyvale, CA
About the Company
Droisys is an innovation technology company focused on helping companies accelerate their digital initiatives from strategy and planning through execution. We leverage deep technical expertise, Agile methodologies, and data-driven intelligence to modernize systems of engagement and simplify human/tech interaction.
Amazing things happen when we work in environments where everyone feels a true sense of belonging and when candidates have the requisite skills and opportunities to succeed. At Droisys, we invest in our talent and support career growth, and we are always on the lookout for amazing talent who can contribute to our growth by delivering top results for our clients. Join us to challenge yourself and accomplish work that matters.
We're hiring a Backend Software Engineer - Cloud Services in Sunnyvale, CA.
What You'll Do
Take full ownership of your services: drive the design, contribute new features, participate in peer reviews, and deliver production-ready solutions.
Develop software primarily in Java and Python.
Work with Kubernetes or be willing to quickly ramp up on container orchestration.
Own end-to-end responsibility for major features and subsystems, from refining requirements to successful deployment in customer environments.
Manage operational health of your services, including telemetry, metrics, and rapid production issue detection.
Ensure high code quality through early testing, functional verification, and integration testing.
Collaborate closely with Product Management to clarify scope, finalize requirements, and plan delivery.
What You'll Bring
Bachelor's degree in Computer Science or similar field (Master's preferred).
3+ years of experience building scalable, distributed systems.
A strong passion for building software, learning new technologies, and collaborating in a team environment.
Hands-on experience with AWS, Azure, or GCP, particularly at the programming/API level.
Background in networking or security is a plus.
Proficiency in Java and/or Python, with familiarity using REST APIs.
Experience with CloudFormation or Terraform is beneficial.
Knowledge of Spring or similar backend frameworks.
Understanding of Kubernetes, Docker, and containerized environments is helpful.
Familiarity with classic Gang of Four design patterns.
Droisys is an equal opportunity employer. We do not discriminate based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law. Droisys believes in diversity, inclusion, and belonging, and we are committed to fostering a diverse work environment.
Python Backend Engineer - 3D / Visualization / API / Software (On-site)
Applications developer job in San Francisco, CA
A pioneering and well-funded AI company is seeking a talented Python Backend Engineer to build the core infrastructure for its revolutionary autonomous systems. This is a unique opportunity to join an innovative team at the forefront of engineering and artificial intelligence, creating a new category of software that will redefine how complex products in sectors like aerospace, automotive, and advanced manufacturing are designed and developed.
Why Join?
Build the Future of Engineering: This isn't just another backend role. Your work will directly shape how next-generation rockets, cars, and aircraft are designed, fundamentally changing the engineering landscape.
Solve Unprecedented Technical Puzzles: Tackle unique challenges in building the infrastructure for autonomous AI agents, including simulation orchestration, multi-agent coordination, and scalable model serving.
Shape a Foundational Platform: As a critical member of a pioneering team, you will have a significant impact on the technical direction and core architecture of an entirely new category of software.
Join a High-Impact Team: Work in a collaborative, fast-paced environment where your expertise is valued, and you have end-to-end ownership of critical systems.
Compensation & Location: Base salary of up to $210,000 + equity + benefits, while working on-site with the team in a modern office in downtown San Francisco.
The Role
As a Python Backend Engineer, you will be instrumental in constructing the infrastructure that underpins these autonomous engineering agents. Your responsibilities will span model serving, simulation orchestration, multi-agent coordination, and the development of robust, developer-facing APIs. This position is critical for delivering the fast, reliable, and scalable systems that professional engineers will trust and depend on in high-stakes production environments.
You will:
Own and build the core backend infrastructure for the autonomous AI agents, focusing on scalability, model serving, and multi-agent orchestration.
Design and maintain robust APIs while integrating essential third-party tools like CAD software and simulation backends into the core platform.
Develop backend services to process and serve complex 3D visualizations from simulation and geometric data.
Collaborate across ML, frontend, and simulation teams to shape the product and engage directly with early customers to drive infrastructure needs.
Make foundational architectural decisions that will define the technical future and scalability of the entire platform.
The Essential Requirements
Strong backend software engineering experience, with a primary focus on Python.
Proven experience in designing, building, and maintaining production-level APIs (FastAPI preferred but Flask and Django also considered).
Experience with 3D visualization libraries or tools such as PyVista, ParaView, or VTK.
Excellent systems-thinking skills and the ability to reason about the interactions between compute, data, and models.
Experience working in fast-paced environments where end-to-end ownership and proactivity are essential.
Exceptional communication and collaboration abilities.
What Will Make You Stand Out
Experience integrating with scientific or engineering software (such as CAD, FEA, or CFD tools).
Exposure to agent frameworks, workflow orchestration engines, or distributed systems.
Familiarity with model serving frameworks (e.g., TorchServe, Triton) or simulation backends.
Previous experience building developer-focused tools or working in high-trust, customer-facing technical roles.
If you are interested in this role, please apply with your resume through this site.
Disclaimer
Attis Global Ltd is an equal opportunities employer. No terminology in this advert is intended to discriminate on any of the grounds protected by law, and all qualified applicants will receive consideration for employment without regard to age, sex, race, national origin, religion or belief, disability, pregnancy and maternity, marital status, political affiliation, socio-economic status, sexual orientation, gender, gender identity and expression, and/or gender reassignment. M/F/D/V. We operate as a staffing agency and employment business. More information can be found at attisglobal.com.
Founding Software Engineer / Protocol Engineer
Applications developer job in San Francisco, CA
We are actively searching for a Founding Protocol Engineer to join our team on a permanent basis. If you are impressed with what Hyperliquid has accomplished, then this role is for you. We are on a mission to build next-generation lending and debt protocols. We are open to both Senior-level and Architect-level candidates for this role.
Your Rhythm:
Drive the architecture, technical design, and implementation of our lending protocol.
Collaborate closely with researchers to validate and test designs
Collaborate with auditors and security engineers to ensure safety of the protocol
Participate in code reviews, providing constructive feedback and ensuring adherence to established coding standards and best practices
Your Vibe:
5+ years of professional software engineering experience
3+ years of experience working with Solidity on the EVM in production environments, specifically focused on DeFi products
2+ years of experience working with a modern backend language (Go, Rust, Python, etc.) in distributed architectures
Experience building lending protocols in a smart contract language
Open to collaborating onsite a few days a week at our downtown SF office
Our Vibe:
Relaxed work environment
100% paid top of the line health care benefits
Full ownership, no micro management
Strong equity package
401K
Unlimited vacation
An actual work/life balance, we aren't trying to run you into the ground. We have families and enjoy life too!
Software Engineer
Applications developer job in San Jose, CA
Founding Engineer
$140K - $200K + equity
San Francisco (Onsite Role)
Direct Hire
A fast-growing early-stage start-up that recently secured a significant Seed round is actively hiring three software engineers to join their founding team. They're looking for people who are scrappy, move fast, challenge assumptions, and are driven to win. They build quickly and expect teammates to push boundaries.
Who You Are
Make quick, reversible (“two-way door”) decisions
Proactively fix problems before being asked
Comfortable working across a modern engineering stack (e.g., TypeScript, Python, containerisation, ML/LLM tooling, databases, cloud environments, mobile frameworks)
Have built real, shipped products
Thrive in ambiguity and fast-moving environments
What You'll Do
Talk directly with users to understand their workflows, pain points, and needs
Architect systems that support large enterprise usage
Build automated pipelines and intelligent agents that process and verify large volumes of data
Maintain scalable, robust infrastructure
Ship quickly - progress over perfection
The Reality
You'll work closely with the founding team and directly with customers
User value beats hype, trends, and “cool tech”
Expect a demanding, high-output culture
If you're a Software Engineer with 2+ years' experience and want to work in a growing start-up, please apply now for immediate consideration.
Founding Software Engineer / Protocol Engineer
Applications developer job in Fremont, CA
We are actively searching for a Founding Protocol Engineer to join our team on a permanent basis. If you are impressed with what Hyperliquid has accomplished, then this role is for you. We are on a mission to build next-generation lending and debt protocols. We are open to both Senior-level and Architect-level candidates for this role.
Your Rhythm:
Drive the architecture, technical design, and implementation of our lending protocol.
Collaborate closely with researchers to validate and test designs
Collaborate with auditors and security engineers to ensure safety of the protocol
Participate in code reviews, providing constructive feedback and ensuring adherence to established coding standards and best practices
Your Vibe:
5+ years of professional software engineering experience
3+ years of experience working with Solidity on the EVM in production environments, specifically focused on DeFi products
2+ years of experience working with a modern backend language (Go, Rust, Python, etc.) in distributed architectures
Experience building lending protocols in a smart contract language
Open to collaborating onsite a few days a week at our downtown SF office
Our Vibe:
Relaxed work environment
100% paid top of the line health care benefits
Full ownership, no micro management
Strong equity package
401K
Unlimited vacation
An actual work/life balance, we aren't trying to run you into the ground. We have families and enjoy life too!
Software Engineer
Applications developer job in Fremont, CA
Founding Engineer
$140K - $200K + equity
San Francisco (Onsite Role)
Direct Hire
A fast-growing early-stage start-up that recently secured a significant Seed round is actively hiring three software engineers to join their founding team. They're looking for people who are scrappy, move fast, challenge assumptions, and are driven to win. They build quickly and expect teammates to push boundaries.
Who You Are
Make quick, reversible (“two-way door”) decisions
Proactively fix problems before being asked
Comfortable working across a modern engineering stack (e.g., TypeScript, Python, containerisation, ML/LLM tooling, databases, cloud environments, mobile frameworks)
Have built real, shipped products
Thrive in ambiguity and fast-moving environments
What You'll Do
Talk directly with users to understand their workflows, pain points, and needs
Architect systems that support large enterprise usage
Build automated pipelines and intelligent agents that process and verify large volumes of data
Maintain scalable, robust infrastructure
Ship quickly - progress over perfection
The Reality
You'll work closely with the founding team and directly with customers
User value beats hype, trends, and “cool tech”
Expect a demanding, high-output culture
If you're a Software Engineer with 2+ years' experience and want to work in a growing start-up, please apply now for immediate consideration.