Post job

Devops engineer jobs in Fairview, CA

- 9,537 jobs
All
Devops Engineer
Software Engineer
Software Development Engineer
  • Software Development Engineer, AI/ML, AWS Neuron, Model Inference

    Annapurna Labs (U.S.) Inc. 4.6company rating

    Devops engineer job in Cupertino, CA

    The Annapurna Labs team at Amazon Web Services (AWS) builds AWS Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon's custom machine learning accelerators, Inferentia and Trainium. The AWS Neuron SDK, developed by the Annapurna Labs team at AWS, is the backbone for accelerating deep learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that seamlessly integrates with popular ML frameworks like PyTorch and JAX enabling unparalleled ML inference and training performance. The Inference Enablement and Acceleration team is at the forefront of running a wide range of models and supporting novel architecture alongside maximizing their performance for AWS's custom ML accelerators. Working across the stack from PyTorch till the hardware-software boundary, our engineers build systematic infrastructure, innovate new methods and create high-performance kernels for ML functions, ensuring every compute unit is fine tuned for optimal performance for our customers' demanding workloads. We combine deep hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration. As part of the broader Neuron organization, our team works across multiple technology layers - from frameworks and kernels and collaborate with compiler to runtime and collectives. We not only optimize current performance but also contribute to future architecture designs, working closely with customers to enable their models and ensure optimal performance. This role offers a unique opportunity to work at the intersection of machine learning, high-performance computing, and distributed architectures, where you'll help shape the future of AI acceleration technology You will architect and implement business critical features, and mentor a brilliant team of experienced engineers. We operate in spaces that are very large, yet our teams remain small and agile. There is no blueprint. We're inventing. We're experimenting. It is a very unique learning culture. The team works closely with customers on their model enablement, providing direct support and optimization expertise to ensure their machine learning workloads achieve optimal performance on AWS ML accelerators. The team collaborates with open source ecosystems to provide seamless integration and bring peak performance at scale for customers and developers. This role is responsible for development, enablement and performance tuning of a wide variety of LLM model families, including massive scale large language models like the Llama family, DeepSeek and beyond. The Inference Enablement and Acceleration team works side by side with compiler engineers and runtime engineers to create, build and tune distributed inference solutions with Trainium and Inferentia. Experience optimizing inference performance for both latency and throughput on such large models across the stack from system level optimizations through to Pytorch or JAX is a must have. You can learn more about Neuron ***************************************************************************************** *********************************************** ************************************* ********************************************************************************************* Key job responsibilities This role will help lead the efforts in building distributed inference support for Pytorch in the Neuron SDK. This role will tune these models to ensure highest performance and maximize the efficiency of them running on the customer AWS Trainium and Inferentia silicon and servers. Strong software development using Python, System level programming and ML knowledge are both critical to this role. Our engineers collaborate across compiler, runtime, framework, and hardware teams to optimize machine learning workloads for our global customer base. Working at the intersection of software, hardware, and machine learning systems, you'll bring expertise in low-level optimization, system architecture, and ML model acceleration. In this role, you will: * Design, develop, and optimize machine learning models and frameworks for deployment on custom ML hardware accelerators. * Participate in all stages of the ML system development lifecycle including distributed computing based architecture design, implementation, performance profiling, hardware-specific optimizations, testing and production deployment. * Build infrastructure to systematically analyze and onboard multiple models with diverse architecture. * Design and implement high-performance kernels and features for ML operations, leveraging the Neuron architecture and programming models * Analyze and optimize system-level performance across multiple generations of Neuron hardware * Conduct detailed performance analysis using profiling tools to identify and resolve bottlenecks * Implement optimizations such as fusion, sharding, tiling, and scheduling * Conduct comprehensive testing, including unit and end-to-end model testing with continuous deployment and releases through pipelines. * Work directly with customers to enable and optimize their ML models on AWS accelerators * Collaborate across teams to develop innovative optimization techniques A day in the life You will collaborate with a cross-functional team of applied scientists, system engineers, and product managers to deliver state-of-the-art inference capabilities for Generative AI applications. Your work will involve debugging performance issues, optimizing memory usage, and shaping the future of Neuron's inference stack across Amazon and the Open Source Community. As you design and code solutions to help our team drive efficiencies in software architecture, you'll create metrics, implement automation and other improvements, and resolve the root cause of software defects. You will also build high-impact solutions to deliver to our large customer base and participate in design discussions, code review, and communicate with internal and external stakeholders. You will work cross-functionally to help drive business decisions with your technical input. You will work in a startup-like development environment, where you're always working on the most important initiative. About the team The Inference Enablement and Acceleration team fosters a builder's culture where experimentation is encouraged, and impact is measurable. We emphasize collaboration, technical ownership, and continuous learning. Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we're building an environment that celebrates knowledge-sharing and mentorship. Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future. Join us to solve some of the most interesting and impactful infrastructure challenges in AI/ML today. BASIC QUALIFICATIONS- Bachelor's degree in computer science or equivalent - 5+ years of non-internship professional software development experience - 5+ years of non-internship design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Fundamentals of Machine learning and LLMs, their architecture, training and inference lifecycles along with work experience on some optimizations for improving the model execution. - Software development experience in C++, Python (experience in at least one language is required). - Strong understanding of system performance, memory management, and parallel computing principles. - Proficiency in debugging, profiling, and implementing best software engineering practices in large-scale systems. PREFERRED QUALIFICATIONS- Familiarity with PyTorch, JIT compilation, and AOT tracing. - Familiarity with CUDA kernels or equivalent ML or low-level kernels - Candidates with performant kernel development such as CUTLASS, FlashInfer etc., would be well suited. - Familiar with syntax and tile-level semantics similar to Triton. - Experience with online/offline inference serving with vLLM, SGLang, TensorRT or similar platforms in production environments. - Deep understanding of computer architecture, operation systems level software and working knowledge of parallel computing. Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. Los Angeles County applicants: Job duties for this position include: work safely and cooperatively with other employees, supervisors, and staff; adhere to standards of excellence despite stressful conditions; communicate effectively and respectfully with employees, supervisors, and staff to ensure exceptional customer service; and follow all federal, state, and local laws and Company policies. Criminal history may have a direct, adverse, and negative relationship with some of the material job duties of this position. These include the duties and responsibilities listed above, as well as the abilities to adhere to company policies, exercise sound judgment, effectively manage stress and work safely and respectfully with others, exhibit trustworthiness and professionalism, and safeguard business operations and the Company's reputation. Pursuant to the Los Angeles County Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records. Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit ********************************************************* for more information. If the country/region you're applying in isn't listed, please contact your Recruiting Partner. Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $129,300/year in our lowest geographic market up to $223,600/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit ******************************************************** This position will remain posted until filled. Applicants should apply via our internal or external career site.
    $129.3k-223.6k yearly 1d ago
  • Senior DevOps Engineer (Ref: 194285)

    Forsyth Barnes

    Devops engineer job in Fremont, CA

    Contact: ****************************** About Us We are collaborating with our client, an innovative fintech startup headquartered in the Bay Area, which focuses on delivering real-time card data primarily aimed at B2B software firms. Established as a spin-out in early 2024 and launching their product by the summer of that year, this organization is at a pivotal growth juncture, experiencing an expanding customer base and ambitious future initiatives. With the support of notable investors including Nica Partners, QED Investors, RBC, and Visa, this organization is championing advancements in card issuance technology. Their team of 20 passionate professionals is seeking exceptional engineers to aid in scaling their operations. Job Description We are assisting this organization in their search for a Senior DevOps Engineer (L1+), tasked with steering the evolution of their infrastructure and serverless architecture. Operating primarily within an AWS framework and extensively utilizing Lambda, the ideal candidate will spearhead serverless migrations while upholding the stringent security protocols essential for the fintech and payments domain, particularly regarding PCI compliance. Key Responsibilities Design and implement serverless infrastructure using AWS Lambda and associated services. Lead the serverless migration process and help define the infrastructure roadmap. Ensure compliance with PCI standards and establish security best practices throughout the infrastructure. Manage and optimize SQL-based databases hosted on AWS. Collaborate with teams to advance the organization's Kubernetes capabilities as they grow their microservices architecture. Work collaboratively with engineering teams to enhance deployment pipelines and improve the developer experience. Engage in a hybrid work model, requiring in-office presence on Tuesdays, Wednesdays, and Thursdays. Requirements Required: A minimum of senior-level (L1+) experience in DevOps or Infrastructure engineering. In-depth knowledge of AWS, particularly Lambda and serverless architectures. A strong background in security practices, especially regarding PCI compliance for infrastructure. Experience with SQL-based database management. Familiarity with the fintech or payments industry is essential. Knowledge of the card issuance sector and experience with payment networks, ideally including Mastercard. Startup experience with a proven ability to deliver quick and effective solutions. A willingness to relocate to or already residing in the Bay Area. Preferred: Experience with Kubernetes. Expertise in microservices architecture. Prior experience at organizations such as Marqeta, Plaid, Ramp, or Brex. Experience working with payment networks. Benefits A competitive compensation package comprising both salary and equity, ensuring alignment with the best talent available. A 4% matching contribution for the 401(k) plan. Comprehensive medical and dental insurance managed through Rippling. Regular off-site events throughout the year in diverse locations across the US and Europe, including frequent trips to California and New York. Exclusive company merchandise. A hybrid work structure requiring three days of on-site work.
    $112k-153k yearly est. 1d ago
  • Senior DevOps Engineer - AWS & CI/CD

    Sigmaways Inc.

    Devops engineer job in Fremont, CA

    We are seeking a Senior DevOps Build Engineer with experience in designing, building, and maintaining secure, automated CI/CD pipelines for cloud based applications. The ideal candidate is a cloud savvy AWS expert with strong DevOps experience and good understanding of infrastructure-as-code, containerization and security scanning. In this role, you will collaborate across multiple teams including development, infrastructure, security and networking to deliver reliable, secure and scalable cloud solutions. You will also provide leadership in DevOps best practices, CI/CD automation, and secure architecture design. Responsibilities: Design, build, and optimize automated CI/CD pipelines and integrate SAST, DAST, OSS, and container security scans. Provide DevOps architectural leadership, mentorship, and best-practice guidance across teams. Lead standardization of DevOps pipeline frameworks and secure development patterns. Implement automated deployment, infrastructure, and build/release workflows. Own technical implementation for assigned development initiatives and ensure alignment with internal engineering standards. Partner closely with architecture, security, and engineering stakeholders to ensure secure and compliant build practices. Maintain platform security aligned with organizational Information Security policies. Anticipate risks and proactively propose solutions related to enterprise platform scaling and cloud adoption. Qualifications: Bachelor's or Master's degree in Computer Science, Engineering, Information Systems, or equivalent work experience. At least 7 years of AWS experience delivering solutions in a DevOps or architecture role Hands on background designing, implementing, and supporting complex multi environment platforms. Proven experience building and supporting DevOps infrastructure, CI/CD pipelines, and automation. Strong experience configuring GitLab and artifact repositories such as Nexus. Experience preparing vulnerability reports using tools such as Fortify, SonarQube, NexusIQ, and GitLab native scans (Semgrep, Gymnasium). Experience configuring Checkov and KICS for IaC scanning. Containerization experience using Docker and AWS cloud services (CloudWatch, ECS, EKS, Lambda, Fargate, EC2.) Secrets management experience using AWS Secrets Manager, AWS SSM Parameter Store, and/or Vault. Advanced scripting experience using Python, Shell, Groovy, YAML/JSON. Experience with major IDEs using Visual Studio, IntelliJ, Eclipse. Strong experience integrating build tools and technologies across C++, Java, JavaScript, and .NET application stacks. Strong Linux and Windows OS experience. Knowledge working within Agile project environments and using tools such as Jira and VersionOne.
    $112k-153k yearly est. 3d ago
  • Principal DevOps Engineer - AI/ML

    Strativ Group

    Devops engineer job in Menlo Park, CA

    We are partnered with a Series A AI Lab (backed by top-tier investors and advised by pioneering figures in generative and interactive media) that is hiring a Principal DevOps Engineer. They're backed by leading global VCs and AI research leaders (from OpenAI, DeepMind, Meta, and others), and guided by renowned figures in computer graphics and autonomous systems. The founding team brings deep expertise from frontier AI research and large-scale distributed systems, blending academic excellence with proven startup execution. As a Principal DevOps Engineer, you will work directly with the founders to architect, build, and scale the compute substrate that powers this next generation of AI. You'll design and optimize the inference platform, GPU-based training clusters, and data processing pipelines that drive real-time creativity and discovery. You'll play a key role in scaling systems for both research and production - ensuring low-latency performance, high availability, and efficient utilization across petabyte-scale data and model-serving workloads. Key Experience Required 5+ years of experience in Software / ML Infrastructure Engineering. Deep experience with distributed systems and GPU orchestration for high-performance ML workloads. Proficiency in Python, Go, or similar, and strong grasp of software engineering best practices. Hands-on expertise with Kubernetes, Docker, and IaC (Terraform). Experience optimizing model serving and data pipelines for latency and scalability. A builder's mindset - you thrive in ambiguity, pick the right tools for the job, and ship. This is a chance to join a team working at the frontier of real-time AI systems - please apply ASAP for more info.
    $112k-153k yearly est. 4d ago
  • DevOps Engineer

    Odiin

    Devops engineer job in San Francisco, CA

    You'll play a key role in optimizing deployment pipelines, improving system reliability, and ensuring the scalability of our network and infrastructure. Key Responsibilities: Design, implement, and manage CI/CD pipelines for rapid and reliable code deployment. Automate infrastructure setup and maintenance using tools like Terraform, Ansible, or Docker. Monitor system performance, identify issues, and optimize infrastructure for high availability. Collaborate with development teams to improve deployment workflows and ensure smooth integration between environments. Manage cloud infrastructure (e.g., AWS, GCP, Azure) and on-premise nodes or validators if required. Requirements: Proven experience as a DevOps Engineer, Site Reliability Engineer, or similar role. Strong knowledge of CI/CD tools (e.g., GitHub Actions, Jenkins, CircleCI). Proficiency with containerization and orchestration (Docker, Kubernetes). Experience with cloud platforms (AWS, GCP, or Azure). Familiarity with infrastructure-as-code tools (Terraform, Ansible, CloudFormation).
    $112k-154k yearly est. 1d ago
  • DevOps Engineer

    Premier Group 4.5company rating

    Devops engineer job in Fremont, CA

    Senior DevOps Engineer San Francisco, CA $175,000 - $210,000 Hybrid We're looking for an experienced Senior DevOps Engineer to lead infrastructure initiatives, optimize CI/CD systems, and mentor other engineers as we continue to scale our client's platform. Role Overview As a Senior DevOps Engineer, you'll take ownership of our cloud infrastructure and delivery pipelines. You'll collaborate closely with developers, architects, and security teams to build highly available, fault-tolerant systems as well as drive best practices in automation, observability, and reliability. Responsibilities Architect, implement, and maintain CI/CD pipelines to support rapid and reliable software delivery. Lead the design and automation of scalable infrastructure in AWS, Azure, or GCP. Define and enforce best practices for infrastructure-as-code (IaC) using Terraform, Ansible, or similar tools. Implement robust monitoring, alerting, and logging systems (Grafana, Prometheus, ELK, Datadog). Optimize cloud costs, security, and performance across multiple environments. Collaborate with development teams to design cloud-native, containerized solutions using Docker and Kubernetes. Drive incident response processes and improve system reliability (SRE principles). Mentor junior DevOps and software engineers on automation and cloud operations. Requirements 7+ years of hands-on DevOps or Site Reliability Engineering experience. Proven expertise in AWS, Azure, or GCP cloud infrastructure. Strong experience with Kubernetes and container orchestration at scale. Deep understanding of CI/CD tools (GitHub Actions, Jenkins, GitLab CI, or CircleCI). Strong scripting and automation skills (Python, Bash, or Go preferred). Proficiency in Infrastructure as Code (Terraform, CloudFormation, or Ansible). Excellent grasp of cloud security, networking, and system observability. Experience leading projects or mentoring engineering teams. Nice to Have Experience with service mesh, serverless, or multi-cloud deployments. Background in performance optimization, disaster recovery, or SRE practices. Certifications such as AWS Certified DevOps Engineer, CKA/CKAD, or Azure DevOps Expert.
    $175k-210k yearly 3d ago
  • Java Software Engineer

    Mindlance 4.6company rating

    Devops engineer job in Concord, CA

    Role: Senior Software Engineer (Java) Contract: 12 to 24 months Skills Needed: Backend Java, API development, Microservices, Oracle, Splunk Client JD- We are seeking a Senior Software Engineer (SE3) with strong backend Java experience to support the development of APIs and microservices within a large-scale banking/transaction environment. The role involves modernizing monolithic applications, contributing to cloud migration (OCP), and ensuring platform stability, performance, and security. Key Responsibilities Design, develop, test, and support backend APIs and microservices. Work on modernization and cloud migration efforts. Ensure scalability, resiliency, and secure SDLC practices. Handle production support, monitoring, and issue resolution. Collaborate with product managers, architects, and engineering teams. Guide junior developers when needed. Required Skills 4+ years Java/Spring development 4+ years API/microservices experience 2+ years Oracle database experience Experience with Splunk or similar monitoring tools Agile/Scrum experience Nice to Have Experience decomposing monolithic apps Cloud/OCP migration experience Kafka or event-driven architecture API management tools (e.g., Apigee) Exposure to GenAI/Copilot (bonus) EEO: “Mindlance is an Equal Opportunity Employer and does not discriminate in employment based on - Minority/Gender/Disability/Religion/LGBTQI/Age/Veterans.”
    $113k-153k yearly est. 5d ago
  • Software Engineer

    Acceler8 Talent

    Devops engineer job in Fremont, CA

    🚀 Software Engineer - AI & Full Stack (San Francisco, CA) 💼 Full-Time | 🧠 1-4+ Years Experience | 💰 $150,000- 210,000 We're building self-improving software - AI that continuously creates, tests, and enhances digital experiences. Backed by Y Combinator, Gradient, and leaders from OpenAI, Uber, and Meta, we've raised $5M+ and are scaling fast. If you love building things that think for themselves , this is your chance to help shape the next wave of intelligent software. 🧩 What You'll Do Build an AI-powered paywall editor serving millions of users every day. Work across the stack - Next.js frontend + Python backend - integrating the latest AI models & APIs. Ship fast: design → code → test → deploy → learn → repeat. Collaborate directly with founders, engineers, and customers to deliver exceptional user experiences. ⚡ What We're Looking For Strong problem-solving and full-stack skills (Python, React, TypeScript). Experience building user-facing products that people love. Excellent communication and a bias for action. Ownership mindset - you ship things that matter. Startup experience = BONUS BSc CompSc degree preferred 💡 Bonus Points Experience with AI/LLM integrations. Startup or founder-level experience. Mobile skills (Swift, Flutter, React Native). 🧠 Tech Stack Next.js (React/TypeScript), Zustand, Tailwind, Shadcn, Python, Supabase, Fly.io, Swift, Flutter, Expo 📍 In-person role - San Francisco (Mon-Sat) U.S. work authorization required (O-1 visa sponsorship possible). If you're ready to build AI that builds software, we'd love to hear from you. 👉 Apply now and help us invent the future of intelligent systems.
    $150k-210k yearly 5d ago
  • Robotic Software Engineer

    Insight Recruitment

    Devops engineer job in Fremont, CA

    Robotics Software Engineer (Generalist/Full-Stack) Robotic Software Engineer - Humanoid Robotics Palo Alto, SF Bay Area (Full-time | Onsite) $180k-$200k + equity (flexible for exceptional candidates) We are recruiting building next-generation humanoid robotic systems that combine advanced AI with cutting-edge hardware. Our team moves fast, prototypes aggressively, and puts real robots into the world. We're now hiring a Robotic Software Engineer to help shape our core software stack and accelerate the development of our embodied AI systems. What You'll Work On As part of a small, high-impact engineering team, you will: Build and optimise robotics software in C++ and ROS2 Integrate perception, control, planning, and learning modules Work hands-on with robots to bring up new hardware and run real-world experiments Deploy reinforcement learning / imitation learning policies onto physical robots Develop middleware, interfaces, and tooling that connect AI → hardware Prototype behaviours across diverse robot types (arms, humanoids, mobile platforms, drones) This role directly supports both our AI and hardware teams and has significant ownership from day one. Must-haves: Strong C++ development skills (multi-threading, performance, systems-level) Professional experience with ROS2 Hands-on robotics experience - ideally robot learning on physical hardware Ability to work on real robots (debugging, integration, testing) Generalist mindset and comfort in a fast-paced startup environment Nice-to-haves: Manipulation or kinematics (humanoids, arms, quadrupeds) Controls for mobile robots or drones Sensor/actuator integration, drivers, or middleware experience VR prototyping (Meta Quest or similar) Experience across different robot embodiments Why Join Us Build software that runs on real humanoid robots immediately High ownership within a small, world-class engineering team Competitive compensation + meaningful equity Opportunity to influence architecture, roadmap, and product direction Work at one of the most exciting intersections in tech: AI × robotics
    $180k-200k yearly 1d ago
  • Founding Software Engineer / Protocol Engineer

    The Crypto Recruiters 3.3company rating

    Devops engineer job in San Francisco, CA

    We are actively searching for a Founding Protocol Engineer to join our team on a permanent basis. In this position you will If you are someone that is impressed with what Hyperliquid has accomplished then this role is for you. We are on a mission to build next generation lending and debt protocols. We are open to both Senior level and Architect level candidates for this role. Your Rhythm: Drive the architecture, technical design, and implementation of our lending protocol. Collaborate closely with researchers to validate and test designs Collaborate with auditors and security engineers to ensure safety of the protocol Participate in code reviews, providing constructive feedback and ensuring adherence to established coding standards and best practices Your Vibe: 5+ years of professional software Engineering experience 3+ years of experience working in Solidity in EVM in production environments, specifically focused in DeFi products 2+ years of experience working with a modern backend languages (Go, Rust, Python, etc) in distributed architectures Experience building lending protocols in a smart contract language Open to collaborating onsite a few days a week at our downtown SF office Our Vibe: Relaxed work environment 100% paid top of the line health care benefits Full ownership, no micro management Strong equity package 401K Unlimited vacation An actual work/life balance, we aren't trying to run you into the ground. We have families and enjoy life too!
    $123k-170k yearly est. 1d ago
  • Software Engineer, AI Data Platform

    Granica

    Devops engineer job in Mountain View, CA

    Granica is redefining how enterprises prepare and optimize data at the most fundamental layer of the AI stack-where raw information becomes usable intelligence. Our technology operates deep in the data infrastructure layer, making data efficient, secure, and ready for scale. We eliminate the hidden inefficiencies in modern data platforms-slashing storage and compute costs, accelerating pipelines, and boosting platform efficiency. The result: 60%+ lower storage costs, up to 60% lower compute spend, 3× faster data processing, and 20% overall efficiency gains. Why It Matters Massive data should fuel innovation, not drain budgets. We remove the bottlenecks holding AI and analytics back-making data lighter, faster, and smarter so teams can ship breakthroughs, not babysit storage and compute bills. Who We Are World-renowned researchers in compression, information theory, and data systems Elite engineers from Google, Pure Storage, Cohesity, and top cloud teams Enterprise sellers who turn ROI into seven‑figure wins. Powered by World-Class Investors & Customers $65M+ raised from NEA, Bain Capital, A* Capital, and operators behind Okta, Eventbrite, Tesla, and Databricks. Our platform already processes hundreds of petabytes for industry leaders Our Mission: We're building the default data substrate for AI, and a generational company built to endure. Smarter Infrastructure for the AI Era: We make data efficient, safe, and ready for scale-think smarter, more foundational infrastructure for the AI era. Our technology integrates directly with modern data stacks like Snowflake, Databricks, and S3-based data lakes, enabling: 60%+ reduction in storage costs and up to 60% lower compute spend 3x faster data processing 20% platform efficiency gains Trusted by Industry Leaders Enterprise leaders globally already rely on Granica to cut costs, boost performance, and unlock more value from their existing data platforms. A Deep Tech Approach to AI We're unlocking the layers beneath platforms like Snowflake and Databricks, making them faster, cheaper, and more AI-native. We combine advanced research with practical productization, powered by a dual-track strategy: Research: Led by Chief Scientist Andrea Montanari (Stanford Professor), we publish 1-2 top-tier papers per quarter. Product: Actively processing 100+ PBs today and targeting Exabyte scale by Q4 2025. Backed by the Best We've raised $60M+ from NEA, Bain Capital, A Capital, and operators behind Okta, Eventbrite, Tesla, and Databricks. Our Mission To convert entropy into intelligence, so every builder-human or AI-can make the impossible real. We're building the default data substrate for AI, and a generational company built to endure beyond any single product cycle. WHAT YOU'LL DO This is a deep systems role for someone who lives and breathes distributed infrastructure, understands how data moves at scale, and wants to build the next-generation AI data platform from the ground up. Own the ACID backbone. Design and harden transactional layers and metadata services so that petabyte-scale tables can time-travel in microseconds and schema evolution becomes a non-event. Turn metadata into rocket fuel. Build compaction, caching, and pruning services that keep millions of file pointers within 50 ms from lookup to plan. Squeeze more signal per byte. Optimize data layouts-from column ordering to dictionary and bit-packing, bloom filters, and zone-map indexes-to cut scan I/O by 10× on real-world workloads. Ship adaptive indexing with research. Co-invent machine-driven indexes that learn access patterns and automatically re-partition nightly-no more manual “analyze table” ever again. Scale the engine, not the babysitting. Write Spark, Flink, or batch pipelines that autoscale across S3, GCS, and ADLS; expose observability hooks; and survive chaos drills without triggering a pager storm. Code for longevity. Write clean, test-soaked Java, Scala, Go, or C++. Document key invariants so future teams can extend the system-instead of rewriting it. Measure success in human latency. If analysts see their dashboards refresh in blink-level time, you've won. Publish your breakthrough and mentor the next engineer to raise the bar again. WHAT WE'RE LOOKING FOR You've built systems where performance, resilience, and clarity of design all matter. You thrive at the intersection of infrastructure engineering and applied research, and care deeply about both how something works and how well it works at scale. Core Skills Distributed Systems and Storage Fundamentals - consistency, replication, sharding, durability, transactions. Columnar Storage Optimization - deep knowledge of Parquet or similar formats (column ordering, compression, zone maps). Metadata and Indexing Systems - experience building metadata-driven services, compaction, caching, and adaptive indexing. Distributed Compute at Scale - production-grade Spark/Flink or equivalent pipeline development across S3, GCS, or ADLS. Programming for Scale and Longevity - strong coding in Java, Scala, Go, or C++, with clean testing and documentation practices. Resilient Systems and Observability - you've built systems that survive chaos drills and expose the right metrics. Desired Skills Exposure to open table formats such as Apache Iceberg, Delta Lake, or Hudi. Experience with catalog services, query planning, or compaction frameworks. OSS contributions or published work in data infrastructure or distributed systems. WHY JOIN GRANICA If you've helped build the modern data stack at a large company-Databricks, Snowflake, Confluent, or similar-you already know how critical lakehouse infrastructure is to AI and analytics at scale. At Granica, you'll take that knowledge and apply it where it matters most…at the most fundamental layer in the data ecosystem. Own the product, not just the feature. At Granica, you won't be optimizing edge cases or maintaining legacy systems. You'll architect and build foundational components that define how enterprises manage and optimize data for AI. Move faster, go deeper. No multi-month review cycles or layers of abstraction-just high-agency engineering work where great ideas ship weekly. You'll work directly with the founding team, engage closely with design partners, and see your impact hit production fast. Work on hard, meaningful problems. From transaction layer design in Delta and Iceberg, to petabyte-scale compaction and schema evolution, to adaptive indexing and cost-aware query planning-this is deep systems engineering at scale. Join a team of expert builders. Our engineers have designed the core internals of cloud-scale data systems, and we maintain a culture of peer-driven learning, hands-on prototyping, and technical storytelling. Core Differentiation: We're focused on unlocking a deeper layer of AI infrastructure. By optimizing the way data is stored, processed, and retrieved, we make platforms like Snowflake and Databricks faster, more cost-efficient, and more AI-native. Our work sits at the most fundamental layer of the AI stack: where raw data becomes usable intelligence. Be part of something early-without the chaos. Granica has already secured $65M+ from NEA, Bain Capital Ventures, A* Capital, and legendary operators from Okta, Tesla, and Databricks. Grow with the company. You'll have the chance to grow into a technical leadership role, mentor future hires, and shape both the engineering culture and product direction as we scale. COMPENSATION & BENEFITS Competitive salary and meaningful equity Unlimited PTO + quarterly recharge days Premium health, vision, and dental Team offsites, deep tech talks, and learning stipends Help build the foundational infrastructure for the AI era Granica is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
    $106k-150k yearly est. 4d ago
  • Software Engineer - Runtime

    Oho Group Ltd. 3.7company rating

    Devops engineer job in Santa Clara, CA

    We're a Series A and we need a systems-savvy engineer who can architect, optimize, and turbocharge our multi-target runtime from day one. If concurrent programming is your playground, C++14 is your native language, and you think in cache lines, pipelines, and memory hierarchies, this role puts you at the heart of the action. What You'll Do Design, build, and continually improve our multi-target runtime Apply cutting-edge parallelization + partitioning techniques to generate and exploit highly optimized kernels Rapidly prototype ideas and validate them with real data What You Bring Deep expertise in asynchronous + concurrent programming 4+ years of modern C/C++ Strong grasp of hardware architecture (scalar vs vector, memory hierarchies, etc.) Knowledge of OS kernel or hypervisor development Bonus Points CUDA/ROCm library experience GPU programming background HPC experience MS/PhD in CS or equivalent Familiarity with PyTorch, JAX, Triton Experience wrangling large compute clusters Why You'll Love It You'll own critical, performance-sensitive systems that sit at the core of our stack; shaping how next-gen ML models run across diverse hardware. High impact, deep tech, zero bureaucracy. If you want to engineer at the limits of performance and help build a runtime that changes the game let's talk!
    $122k-168k yearly est. 3d ago
  • Software Engineer

    10X Recruiting Partners

    Devops engineer job in San Francisco, CA

    Software Engineer, C++ Systems About the Role We're seeking a highly skilled Software Engineer (C++ Systems) to join our client's team and help build the core of our GPU virtualization platform. This role is ideal for engineers who thrive on microsecond-level performance optimization, enjoy working deep in complex C++ systems, and are motivated by building foundational infrastructure that directly impacts customers. You'll play a critical role in scaling our platform as we serve a rapidly growing customer base, owning production systems from day one and tackling technically demanding challenges at the forefront of GPU infrastructure. What You'll Do Optimize performance of our C++ GPU virtualization library at the systems level Research and develop solutions for GPU oversubscription, checkpointing, and distributed GPU clusters Support new hardware and software architectures with a deep, end-to-end understanding of the stack Debug low-level systems in production environments Diagnose and resolve performance issues in machine learning workloads Collaborate closely with the CTO on advanced systems design and implementation Required Experience Proven experience building and operating low-level systems in production environments Background working with compilers, kernels, or networking protocols Demonstrated ability to trace and resolve performance issues across complex systems Technical Skills Expert-level C++ proficiency (Rust experience is acceptable, though primary development will be in C++) Experience optimizing C++ and NIC performance Strong systems-level debugging and performance analysis skills Education Degree in Computer Science or a related field from a top-tier program Strong academic performance (3.7+ GPA) Soft Skills Ability to deliver high-quality output quickly in an early-stage startup environment Comfortable taking full ownership of critical production systems Thrives in ambiguous, high-impact problem spaces Company & Opportunity Building GPU virtualization software that dramatically improves GPU efficiency Operating a fast-growing GPU cloud, scaling from $0 to $500K in revenue in just six months Backed by Y Combinator and a recently closed $4.5M Seed round Join as employee #5 at a pivotal moment: product-market fit validated and scaling rapidly Work directly with the CTO on systems challenges few startups get to tackle This is a hardcore C++ systems role focused on GPU virtualization, performance tuning, production debugging, and advanced research Expect ownership, impact, and problems that demand top 0.1% technical skill Why This Role If you love squeezing performance out of low-level systems, enjoy working at the intersection of GPUs, distributed systems, and production infrastructure, and want to help scale a breakthrough platform at an early stage, this role offers a rare and exciting opportunity.
    $106k-150k yearly est. 4d ago
  • Software Engineer, Frontend

    Evolution USA

    Devops engineer job in Santa Clara, CA

    We are looking for a Senior Front-End Software Engineer with strong software fundamentals to join a high-performing platform development team. This role combines hands-on development, mentorship, and growth opportunities. You will work on UI implementation and maintenance across multiple functional areas, contributing daily to improving user experiences and building deep expertise in the product. Key Responsibilities Partner with Product Managers and Designers to define and deliver new features and solutions. Collaborate with engineering teams across the stack to build scalable, user-facing features. Work closely with the Support team to triage bugs and resolve production issues quickly. Drive planning and execution of mid- to large-scale projects from conception to launch. Act as a subject matter expert while resolving complex technical challenges. Oversee the full systems development lifecycle, including architecture definition, design, scoping, planning, implementation, testing, documentation, and maintenance. Qualifications 6+ years of front-end development experience. Strong technical background (degree in Computer Science, Engineering, or related field preferred, or equivalent experience). Advanced knowledge of HTML, CSS, and ES6 JavaScript. Advanced knowledge of React, Next.js, and TypeScript. Experience using and consuming REST APIs with a strong understanding of client-server interaction. Familiarity with AGILE/Scrum development methods. Expert-level problem-solving and communication skills.
    $106k-150k yearly est. 5d ago
  • Software Engineer (Computer Vision, Robotics)

    Autonomous Healthcare

    Devops engineer job in Santa Clara, CA

    About Us At Autonomous Healthcare, we are at the forefront of medical innovation, developing the next generation of devices that will revolutionize patient care. Our mission is to commercialize breakthrough medical technologies by leveraging cutting-edge AI and autonomous systems. We believe that the best solutions are built together, and we are looking for a key member to join our collaborative R&D team. About the Role We are seeking a highly motivated and skilled engineer to join our team in developing next-generation patient monitoring systems. This role is at the intersection of computer vision, signal processing, and high-performance software engineering. You will be responsible for building the core analytical engine that transforms raw depth-sensor video into actionable health information. This is not a purely theoretical position. You will be hands-on, designing algorithms that are efficient enough for real-time applications and robust enough for real-world clinical use. You will write the production-level Python code that brings these algorithms to life on cutting-edge edge computing platforms. If you are a problem-solver who thrives on analyzing complex sensor data and building tangible, high-performance systems, we want to hear from you. Key Responsibilities Develop and implement real-time computer vision algorithms in Python to detect, track, and analyze regions of interest from video data (specifically depth sensors). Design and build signal processing pipelines to extract, filter, and interpret physiological movement data from sensor signals. Optimize algorithms for performance to meet strict real-time processing requirements. Deploy and validate analysis software on edge computing platforms with GPU acceleration (e.g., NVIDIA Jetson). Collaborate in a multidisciplinary team to integrate your solutions into a complete monitoring product. Rigorously test, debug, and document your code and algorithms. Required Qualifications Strong proficiency in Python and experience writing clean, efficient, and maintainable code. Solid foundation in computer vision principles and hands-on experience with libraries like OpenCV. Solid foundation in digital signal processing (e.g., filtering, time-series analysis, feature extraction) and experience with libraries like SciPy or NumPy. B.S. or M.S. in Computer Science, Robotics, Electrical Engineering, Biomedical Engineering, or a related technical field. Demonstrable experience in analyzing imaging or sensor data to solve complex problems. Excellent problem-solving skills and the ability to work independently and as part of a team. Preferred Skills (We'd love to see these) Experience with high-performance edge computing platforms (e.g., NVIDIA Jetson). Familiarity with GPU programming (e.g., CUDA, TensorRT) for accelerating algorithms. A background in robotics, autonomous vehicles, or real-time analysis of sensor data (e.g., LiDAR, RADAR, IMU). Experience with depth sensors, 3D data processing, or point cloud analysis. Knowledge of machine learning or deep learning frameworks (e.g., PyTorch, TensorFlow) for vision or time-series tasks. Familiarity with software development best practices (e.g., Git, unit testing, CI/CD).
    $106k-150k yearly est. 4d ago
  • Backend Software Engineer - Cloud Services

    Droisys 4.3company rating

    Devops engineer job in Sunnyvale, CA

    About Company, Droisys is an innovation technology company focused on helping companies accelerate their digital initiatives from strategy and planning through execution. We leverage deep technical expertise, Agile methodologies, and data-driven intelligence to modernize systems of engagement and simplify human/tech interaction. Amazing things happen when we work in environments where everyone feels a true sense of belonging and when candidates have the requisite skills and opportunities to succeed. At Droisys, we invest in our talent and support career growth, and we are always on the lookout for amazing talent who can contribute to our growth by delivering top results for our clients. Join us to challenge yourself and accomplish work that matters. We're hiring Backend Software Engineer - Cloud Services in Sunnyvale, CA . What You'll Do Take full ownership of your services: drive the design, contribute new features, participate in peer reviews, and deliver production-ready solutions. Develop software primarily in Java and Python. Work with Kubernetes or be willing to quickly ramp up on container orchestration. Own end-to-end responsibility for major features and subsystems-from refining requirements to successful deployment in customer environments. Manage operational health of your services, including telemetry, metrics, and rapid production issue detection. Ensure high code quality through early testing, functional verification, and integration testing. Collaborate closely with Product Management to clarify scope, finalize requirements, and plan delivery. What You'll Bring Bachelor's degree in Computer Science or similar field (Master's preferred). 3+ years of experience building scalable, distributed systems. A strong passion for building software, learning new technologies, and collaborating in a team environment. Hands-on experience with AWS, Azure, or GCP, particularly at the programming/API level. Background in networking or security is a plus. Proficiency in Java and/or Python, with familiarity using REST APIs. Experience with CloudFormation or Terraform is beneficial. Knowledge of Spring or similar backend frameworks. Understanding of Kubernetes, Docker, and containerized environments is helpful. Familiarity with classic Gang of Four design patterns. Droisys is an equal opportunity employer. We do not discriminate based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. Droisys believes in diversity, inclusion, and belonging, and we are committed to fostering a diverse work environment
    $103k-139k yearly est. 2d ago
  • Senior DevOps Engineer - AWS & CI/CD

    Sigmaways Inc.

    Devops engineer job in San Francisco, CA

    We are seeking a Senior DevOps Build Engineer with experience in designing, building, and maintaining secure, automated CI/CD pipelines for cloud based applications. The ideal candidate is a cloud savvy AWS expert with strong DevOps experience and good understanding of infrastructure-as-code, containerization and security scanning. In this role, you will collaborate across multiple teams including development, infrastructure, security and networking to deliver reliable, secure and scalable cloud solutions. You will also provide leadership in DevOps best practices, CI/CD automation, and secure architecture design. Responsibilities: Design, build, and optimize automated CI/CD pipelines and integrate SAST, DAST, OSS, and container security scans. Provide DevOps architectural leadership, mentorship, and best-practice guidance across teams. Lead standardization of DevOps pipeline frameworks and secure development patterns. Implement automated deployment, infrastructure, and build/release workflows. Own technical implementation for assigned development initiatives and ensure alignment with internal engineering standards. Partner closely with architecture, security, and engineering stakeholders to ensure secure and compliant build practices. Maintain platform security aligned with organizational Information Security policies. Anticipate risks and proactively propose solutions related to enterprise platform scaling and cloud adoption. Qualifications: Bachelor's or Master's degree in Computer Science, Engineering, Information Systems, or equivalent work experience. At least 7 years of AWS experience delivering solutions in a DevOps or architecture role Hands on background designing, implementing, and supporting complex multi environment platforms. Proven experience building and supporting DevOps infrastructure, CI/CD pipelines, and automation. Strong experience configuring GitLab and artifact repositories such as Nexus. Experience preparing vulnerability reports using tools such as Fortify, SonarQube, NexusIQ, and GitLab native scans (Semgrep, Gymnasium). Experience configuring Checkov and KICS for IaC scanning. Containerization experience using Docker and AWS cloud services (CloudWatch, ECS, EKS, Lambda, Fargate, EC2.) Secrets management experience using AWS Secrets Manager, AWS SSM Parameter Store, and/or Vault. Advanced scripting experience using Python, Shell, Groovy, YAML/JSON. Experience with major IDEs using Visual Studio, IntelliJ, Eclipse. Strong experience integrating build tools and technologies across C++, Java, JavaScript, and .NET application stacks. Strong Linux and Windows OS experience. Knowledge working within Agile project environments and using tools such as Jira and VersionOne.
    $112k-154k yearly est. 3d ago
  • Senior DevOps Engineer (Ref: 194285)

    Forsyth Barnes

    Devops engineer job in San Francisco, CA

    Contact: ****************************** About Us We are collaborating with our client, an innovative fintech startup headquartered in the Bay Area, which focuses on delivering real-time card data primarily aimed at B2B software firms. Established as a spin-out in early 2024 and launching their product by the summer of that year, this organization is at a pivotal growth juncture, experiencing an expanding customer base and ambitious future initiatives. With the support of notable investors including Nica Partners, QED Investors, RBC, and Visa, this organization is championing advancements in card issuance technology. Their team of 20 passionate professionals is seeking exceptional engineers to aid in scaling their operations. Job Description We are assisting this organization in their search for a Senior DevOps Engineer (L1+), tasked with steering the evolution of their infrastructure and serverless architecture. Operating primarily within an AWS framework and extensively utilizing Lambda, the ideal candidate will spearhead serverless migrations while upholding the stringent security protocols essential for the fintech and payments domain, particularly regarding PCI compliance. Key Responsibilities Design and implement serverless infrastructure using AWS Lambda and associated services. Lead the serverless migration process and help define the infrastructure roadmap. Ensure compliance with PCI standards and establish security best practices throughout the infrastructure. Manage and optimize SQL-based databases hosted on AWS. Collaborate with teams to advance the organization's Kubernetes capabilities as they grow their microservices architecture. Work collaboratively with engineering teams to enhance deployment pipelines and improve the developer experience. Engage in a hybrid work model, requiring in-office presence on Tuesdays, Wednesdays, and Thursdays. Requirements Required: A minimum of senior-level (L1+) experience in DevOps or Infrastructure engineering. In-depth knowledge of AWS, particularly Lambda and serverless architectures. A strong background in security practices, especially regarding PCI compliance for infrastructure. Experience with SQL-based database management. Familiarity with the fintech or payments industry is essential. Knowledge of the card issuance sector and experience with payment networks, ideally including Mastercard. Startup experience with a proven ability to deliver quick and effective solutions. A willingness to relocate to or already residing in the Bay Area. Preferred: Experience with Kubernetes. Expertise in microservices architecture. Prior experience at organizations such as Marqeta, Plaid, Ramp, or Brex. Experience working with payment networks. Benefits A competitive compensation package comprising both salary and equity, ensuring alignment with the best talent available. A 4% matching contribution for the 401(k) plan. Comprehensive medical and dental insurance managed through Rippling. Regular off-site events throughout the year in diverse locations across the US and Europe, including frequent trips to California and New York. Exclusive company merchandise. A hybrid work structure requiring three days of on-site work.
    $112k-154k yearly est. 1d ago
  • DevOps Engineer

    Premier Group 4.5company rating

    Devops engineer job in San Mateo, CA

    Senior DevOps Engineer San Francisco, CA $175,000 - $210,000 Hybrid We're looking for an experienced Senior DevOps Engineer to lead infrastructure initiatives, optimize CI/CD systems, and mentor other engineers as we continue to scale our client's platform. Role Overview As a Senior DevOps Engineer, you'll take ownership of our cloud infrastructure and delivery pipelines. You'll collaborate closely with developers, architects, and security teams to build highly available, fault-tolerant systems as well as drive best practices in automation, observability, and reliability. Responsibilities Architect, implement, and maintain CI/CD pipelines to support rapid and reliable software delivery. Lead the design and automation of scalable infrastructure in AWS, Azure, or GCP. Define and enforce best practices for infrastructure-as-code (IaC) using Terraform, Ansible, or similar tools. Implement robust monitoring, alerting, and logging systems (Grafana, Prometheus, ELK, Datadog). Optimize cloud costs, security, and performance across multiple environments. Collaborate with development teams to design cloud-native, containerized solutions using Docker and Kubernetes. Drive incident response processes and improve system reliability (SRE principles). Mentor junior DevOps and software engineers on automation and cloud operations. Requirements 7+ years of hands-on DevOps or Site Reliability Engineering experience. Proven expertise in AWS, Azure, or GCP cloud infrastructure. Strong experience with Kubernetes and container orchestration at scale. Deep understanding of CI/CD tools (GitHub Actions, Jenkins, GitLab CI, or CircleCI). Strong scripting and automation skills (Python, Bash, or Go preferred). Proficiency in Infrastructure as Code (Terraform, CloudFormation, or Ansible). Excellent grasp of cloud security, networking, and system observability. Experience leading projects or mentoring engineering teams. Nice to Have Experience with service mesh, serverless, or multi-cloud deployments. Background in performance optimization, disaster recovery, or SRE practices. Certifications such as AWS Certified DevOps Engineer, CKA/CKAD, or Azure DevOps Expert.
    $175k-210k yearly 3d ago
  • Founding Software Engineer / Protocol Engineer

    The Crypto Recruiters 3.3company rating

    Devops engineer job in Fremont, CA

    We are actively searching for a Founding Protocol Engineer to join our team on a permanent basis. In this position you will If you are someone that is impressed with what Hyperliquid has accomplished then this role is for you. We are on a mission to build next generation lending and debt protocols. We are open to both Senior level and Architect level candidates for this role. Your Rhythm: Drive the architecture, technical design, and implementation of our lending protocol. Collaborate closely with researchers to validate and test designs Collaborate with auditors and security engineers to ensure safety of the protocol Participate in code reviews, providing constructive feedback and ensuring adherence to established coding standards and best practices Your Vibe: 5+ years of professional software Engineering experience 3+ years of experience working in Solidity in EVM in production environments, specifically focused in DeFi products 2+ years of experience working with a modern backend languages (Go, Rust, Python, etc) in distributed architectures Experience building lending protocols in a smart contract language Open to collaborating onsite a few days a week at our downtown SF office Our Vibe: Relaxed work environment 100% paid top of the line health care benefits Full ownership, no micro management Strong equity package 401K Unlimited vacation An actual work/life balance, we aren't trying to run you into the ground. We have families and enjoy life too!
    $122k-169k yearly est. 1d ago

Learn more about devops engineer jobs

How much does a devops engineer earn in Fairview, CA?

The average devops engineer in Fairview, CA earns between $97,000 and $177,000 annually. This compares to the national average devops engineer range of $80,000 to $135,000.

Average devops engineer salary in Fairview, CA

$131,000

What are the biggest employers of Devops Engineers in Fairview, CA?

The biggest employers of Devops Engineers in Fairview, CA are:
  1. EGB Systems & Solutions
  2. Jobsbridge
  3. SafeTraces
  4. Grid Dynamics
  5. Nasscomm
  6. Five9
  7. Oracle
  8. Robert Half
  9. Cxapp Us, Inc.
Job type you want
Full Time
Part Time
Internship
Temporary