Devops engineer jobs in Castro Valley, CA - 10,013 jobs
Speech ML Engineer: Scalable Training & Deployment
Apple Inc. 4.8
Devops engineer job in San Francisco, CA
A leading technology company is seeking a Machine Learning Engineer to join their team in San Francisco. This role focuses on building scalable ML pipelines for speech technologies, integrating new model architectures, and collaborating across teams. The ideal candidate has a strong software engineering background, particularly in Python, and experience in developing ML systems. Excellent communication skills and relevant academic qualifications are required. Attractive compensation and benefits package offered.
$133k-174k yearly est. 1d ago
Sr ML Ops Engineer
The Walt Disney Company (Germany) GmbH 4.6
Devops engineer job in San Francisco, CA
The Skywalker Sound Development Group is seeking a highly skilled Sr ML Ops Engineer to build and maintain the infrastructure powering our machine learning and AI frameworks. This position is crucial in enabling seamless workflows for model training, retraining, and deployment, ensuring that cutting‑edge AI solutions operate reliably at scale.
As a Sr ML Ops Engineer, you will act as the backbone of our AI/ML efforts, bridging the gap between data science, research, and production engineering. Your expertise in DevOps principles, model deployment strategies, and scalable infrastructure will support the development of transformative audio solutions for speech processing, style transfer, and source separation in media production workflows.
This role is considered Hybrid, which means the employee will work 2‑3 days onsite at our Nicasio, CA office and occasionally from home.
What You'll Do:
Develop, deploy, and maintain scalable infrastructure for machine learning model training, retraining, and inference.
Design and optimize CI/CD pipelines specifically tailored for machine learning workflows, ensuring efficient delivery from research to production.
Implement robust monitoring and logging systems to track model performance and identify potential issues in production environments.
Collaborate with AI researchers and data scientists to ensure infrastructure aligns with project requirements and supports iterative experimentation.
Manage compute resources (cloud and on‑premises) to enable large‑scale distributed training and inference tasks.
Containerize machine learning models and applications using Docker and deploy them via Kubernetes or equivalent orchestration systems.
Automate deployment workflows for serving ML models using frameworks such as TorchServe, TensorFlow Serving and FastAPI.
Implement model versioning, rollback strategies, and governance for maintaining production stability.
Optimize cost efficiency and performance of machine learning workflows in cloud environments such as AWS, GCP, or Azure.
Stay updated with emerging ML Ops tools and practices, integrating them into existing workflows to improve performance and reliability.
What We're Looking For:
Bachelor's in Computer Science, Engineering, or a related field. Master's Degree is preferred.
5+ years of experience in DevOps, Site Reliability Engineering, or a related role, with at least 2 years focusing on ML Ops.
Expertise in building and maintaining CI/CD pipelines for machine learning applications.
Strong proficiency with containerization (Docker) and orchestration tools (Kubernetes).
Proficiency in deploying machine learning models using frameworks such as TensorFlow Serving, TorchServe, or custom APIs.
Deep understanding of cloud infrastructure and services (AWS, GCP, or Azure) for ML workloads, including GPU and TPU utilization.
Experience managing large‑scale distributed training workflows and optimizing resource allocation.
Familiarity with tools like MLflow, DVC, Weights & Biases, or similar for data and model tracking and versioning.
Solid understanding of security best practices for machine learning systems and sensitive data handling.
Strong scripting and programming skills in Python, Bash, or Go.
Preferred Qualifications:
Experience with data orchestration tools such as DataChain, Weights & Biases, etc., for managing ML workflows.
Hands‑on experience with automated hyperparameter tuning and optimization frameworks.
Familiarity with model monitoring tools such as Prometheus, Grafana, or custom solutions for model drift and data quality checks.
Experience integrating pre‑trained foundational models and managing their deployment at scale.
Contributions to open‑source ML Ops projects or relevant research publications.
The hiring range for this position in San Francisco, CA is $155,400 to $208,400 per year. The base pay actually offered will take into account internal equity and also may vary depending on the candidate's geographic region, job‑related knowledge, skills, and experience among other factors. A bonus and/or long‑term incentive units may be provided as part of the compensation package, in addition to the full range of medical, financial, and/or other benefits, dependent on the level and position offered.
Disability Accommodation for Employment Applications
The Walt Disney Company and its Affiliated Companies are Equal Employment Opportunity employers and welcome all job seekers including individuals with disabilities and veterans with disabilities. If you have a disability and believe you need a reasonable accommodation in order to search for a job opening or apply for a position, visit the Disney candidate disability accommodations FAQs. We will only respond to those requests that are related to the accessibility of the online application system due to a disability.
$155.4k-208.4k yearly 3d ago
Forward Deployed Engineer Lead
Medplum
Devops engineer job in San Francisco, CA
Medplum is the operating system for modern healthcare. Because our platform is highly flexible and deeply technical, our customers, ranging from seed‑stage startups to massive health systems, require more than just "support." They require high‑level technical partnership to build their core clinical infrastructure.
As the Forward Deployed Engineering (FDE) Lead, you will head the team that sits at the intersection of Engineering, Product, and Sales. You will be the "Technical Closer" for our most strategic accounts and the primary mentor for our FDE team. Your goal is to ensure "Production Success" for our customers by helping them architect, build, and ship their clinical applications on the Medplum stack.
This is a Player‑Coach role. You will spend your time architecting complex solutions for our most strategic customers while simultaneously formalizing the "Medplum Way" of implementation to help our team scale.
What You Will Do
Lead by Example: Act as the lead architect for our most complex enterprise implementations. You will help customer engineering teams navigate build‑vs‑buy decisions and design their clinical data strategy on top of Medplum.
Formalize the FDE Playbook: Take the "chaos" of early‑stage implementations and turn them into a repeatable engine. You will define how we handle technical discovery, implementation sprints, and "go‑live" milestones.
Bridge Product & Engineering: You are the voice of the customer. You will translate real‑world implementation bottlenecks into high‑quality product requirements, helping the core team prioritize the roadmap based on what actually happens in the field.
Mentor & Scale the Team: Provide technical leadership and mentorship to the FDE team. You will help recruit new FDEs and ensure they have the training and tools (SDKs, starter kits, documentation) to be successful.
Code the "Golden Paths": Stay hands‑on by contributing to our public repositories and creating "Golden Path" implementation examples that serve as the blueprint for the entire Medplum community.
About You
The Technical Pedigree: You have 6+ years of experience in high‑stakes technical roles (Senior FDE, Staff Engineer, or Solutions Architect). You are a proficient coder in the TypeScript/Node.js ecosystem and aren't afraid to dive into the Medplum source code to unblock a customer.
Executive Presence: You can command a room of C‑suite executives at a major health system, translating Medplum's technical primitives into their business outcomes.
High Agency: You are obsessed with customer success. If a customer is blocked, you feel it personally. You have the technical chops to unblock them yourself and the leadership skills to ensure the team learns from the experience.
SF‑Based: You are based in the Bay Area and value the high‑bandwidth collaboration of being in‑person with the founding team 2‑3x/week.
About Medplum
Medplum is redefining healthcare with our open source, API‑first electronic health record (EHR) platform, trusted by leading digital health and life sciences companies. Our mission is to catalyze change in the healthcare industry by improving the access, privacy, and utility of health data. At Medplum, we have a unique opportunity to impact the lives of patients, speed medical research, and contribute to the open source ecosystem.
Competitive compensation package with equity
Flexible time off
The chance to shape the future of healthcare tech - leave your mark on this vital industry
The San Francisco, CA base salary range for this role is $160,000‑$240,000 per year. Actual base salary within this range will be determined based on job‑related skills, experience, and qualifications.
Join us in our mission to revolutionize healthcare. If you're excited about driving customer success, building strategic relationships, and shaping the future of our Forward Deployed Engineering team, we'd love to hear from you! Reach out to ******************* .
$160k-240k yearly 1d ago
Forward Deployed Engineer
Truth Systems 4.5
Devops engineer job in San Francisco, CA
At Truth Systems, we're building the only trust and safety software any organization will ever need. A protection layer for every individual.
We're laying that foundation with AI Governance. Our product Charter is an agent that monitors and flags misuse of AI in line with firm policies and client rules in real time. We're building always-on, real-time systems that keep people safe and organizations compliant without slowing down their work.
We are:
Small and well-funded. We've raised $4M from world-class investors like Gradient Ventures, Lightspeed, The Legaltech Fund, Y Combinator, and Pear VC. We are currently a team of 4. We are hiring thoughtfully, with ~3 people in the next 6 months.
Intensely trustful + high ownership. We optimize for impact per person by requiring high ownership from each team member. We hire experts, ambitious problem-solvers, and generally unstoppable people, and we place a lot of trust in them. We prefer short (or no) meetings, full autonomy, and individualized schedules.
Your Mission
Over the next 12-18 months, you will embed with customers to help firms navigate and uncover risk surfaces within their organizations.
This role blends engineering and customer empathy. You'll work directly with clients to understand workflows, translate them into software, and ensure successful adoption.
Outcomes
Customer integrations from 0 → 1. Lead deployments of our agents and backend systems in live customer environments. Deliver reliable, secure, and fast integrations.
Feedback-driven development. Translate client insights into product improvements and new features. Influence our roadmap through real-world usage.
Customer Partnership: Collaborate deeply with clients to identify their pain points, propose solutions, and deliver measurable value.
Travel for Deployment and Collaboration: Travel to meet customers across the U.S. (e.g., NYC, Atlanta, Chicago, Seattle, Ohio, and more) for implementations, feedback sessions, and training.
Competencies
Technical and hands-on: You've spent 1+ years building real products in production - shipping code, wrestling with weird APIs, and making things work in messy, real-world stacks.
Full-stack curious: You're fluent in TypeScript/JavaScript (React or Next.js for lightweight UIs) and comfortable with at least one backend language like Python for integrations and automations.
Deeply empathetic (must be): Both our team and our customers require empathetic and low-ego people. Some of our users' careers depend on our product being 'right', so trusting and listening to them is our most important skill.
Independent and process building (must be): We don't have a lot of structure. You'll build many processes and set standards yourself, and everyone we hire after you will follow them.
Adventurous: You're comfortable with the unknown, love building new experiences, and don't mind travel when it helps you get closer to the problem. Our team's roots span 4+ continents - we like working (and learning) with people from everywhere.
Why Join Truth Systems
Autonomy: You'll have full ownership of what you build. We trust you to make the right calls, set your own pace, and push projects from idea to production without red tape.
Output-focused, not input-focused: We care about impact, not hours. That is not to say there won't be days when the hours get long, but you'll be judged by what you deliver and how it moves the needle - not how long you sit in front of a screen.
Cutting-edge meets meaningful: You'll be working at the edge of AI - designing systems that redefine how humans interact with intelligence. At Truth Systems, we care deeply about helping knowledge workers trust AI and move them toward safer, more responsible, and transparent use.
Logistics
Salary: $170K-$250K
Equity: 0.3-1%
Location: In-person in San Francisco, with regular travel to customer sites across the U.S.
Perks: Meals, housing/relocation, equipment, and benefits included
Work Authorization: We cannot sponsor visas at this time
A pioneering tech startup in San Francisco is seeking a Founding Deployment Engineer. You will manage the entire technical sales lifecycle, ensuring customer success through direct engagement and innovative solutions. The ideal candidate has over 8 years of experience in a similar role and strong coding skills in Python or JavaScript. Join us to build impactful products and enjoy a flexible, remote-first culture with growth opportunities.
$103k-149k yearly est. 2d ago
Field Deployment Engineer: Hardware, RF & Infra
Specter
Devops engineer job in San Francisco, CA
A leading technology company in San Francisco is looking for adaptable field engineers to install and troubleshoot innovative systems. This role combines outdoor work and technical challenges, making it ideal for those with experience in high-stakes environments like military or trades. You'll contribute to cutting-edge technology while ensuring systems run smoothly and meet regulatory requirements. Competitive compensation and benefits are offered as part of this mission-driven opportunity.
$103k-149k yearly est. 4d ago
Forward Deployed Engineer - San Francisco
Hard Yaka
Devops engineer job in San Francisco, CA
Aircall is a unicorn AI‑powered customer communications platform used by 22,000+ companies worldwide to drive revenue, faster resolutions, and scale. We're redefining what a customer communications platform can be by combining voice, SMS, WhatsApp, and AI into one seamless workspace.
Our momentum comes from a simple but powerful idea: help every customer‑facing team work smarter, not harder. Aircall's AI Voice Agent automates routine calls, AI Assist streamlines post‑call tasks, and AI Assist Pro delivers real‑time guidance that helps people do their best work. The result: companies grow revenue, deliver faster resolutions, and scale service.
We've built a product customers love and a business that scales fast. Aircall operates in nine global offices (Paris, New York, San Francisco, Sydney, Madrid, London, Berlin, Seattle, and Mexico City), and is backed by world‑class investors. Our teams are shipping AI innovation faster than ever and expanding across new product lines and markets.
At Aircall, you'll join a company in motion (ambitious, profitable, and product‑driven) where impact is visible, decisions are fast, and growth is real.
How We Work at Aircall
At Aircall, we believe in customer obsession, continuous learning, and delivering extraordinary outcomes. We value open collaboration, taking ownership, and making smart, informed decisions with speed and precision. If you thrive in a fast‑paced, team‑driven environment where curiosity, trust, and impact matter, you'll fit right in.
About the Team
Aircall's Forward Deployed Engineering team connects our innovative AI Agents technology to real‑world SMB workflows. We turn possibility into production, delivering automation that improves customer experience, boosts productivity, and unlocks new business value. Working cross‑functionally with Product, Engineering, Sales, Solutions, and Customer Success, the team ensures every deployment delivers measurable impact. This is a dynamic, hands‑on role at the intersection of technical consulting, project leadership, and AI adoption strategy. You don't need to be a full‑time software engineer, but you should be fluent in APIs, very confident managing technical projects, and eager to continuously learn, build, and scale new models that empower both customers and teammates.
About the Role
As a Forward‑Deployed Engineer, you'll own the end‑to‑end delivery of technically sound, low‑code AI solutions for Aircall customers. You'll design, implement, and operationalize workflows that connect the AI Agent with CRMs, help desks, and communication tools. You'll also play a key role in driving team upskilling, building reusable frameworks, and leading cross‑functional initiatives that improve how Aircall deploys AI at scale. This is a hands‑on role where you'll translate customer challenges into scalable technical solutions, working directly with Product and Engineering to refine Aircall's platform and accelerate adoption.
What you'll do
Lead customer discovery and design sessions to map business processes, identify automation opportunities, and define solution architecture.
Design, build, and deploy integrations using low/no‑code platforms (Zapier, Make, n8n, Workato) and CRM automation tools (HubSpot Workflows, Salesforce Flow) with API connectors.
Collaborate with Engineering to validate technical feasibility, resolve blockers, and share field learnings that inform product improvements.
Configure and optimize the AI Agent - defining intents, prompts, actions, guardrails, and performance metrics.
Manage complex, cross‑functional deployments - defining timelines, aligning stakeholders, ensuring accountability, and delivering on time and within scope.
Create scalable models and reusable frameworks (templates, playbooks, reference architectures) that make future projects faster and more consistent.
Champion continuous learning and enablement - train peers, run internal workshops, and document best practices to raise the technical bar across the team.
Run global, targeted outbound campaigns within the existing customer base to generate pipeline and accelerate adoption, working closely with the customer marketing team.
Collaborate with GTM leadership to embed routines and cadences that drive accountability for new product pipeline, forecast accuracy, and performance tracking.
Own regional top‑line targets for assigned products (e.g., AI Voice Agent MRR generated per area) by collaborating with AEs and AMs who hold add‑on quotas.
Act as an internal product owner within the GTM function, defining product‑specific MRR strategies, coordinating cross‑functional support, and ensuring Aircall delivers the leading AI‑enabled communication platform.
Collaborate with Product and PMM to shape the AI Voice Agent roadmap based on customer needs, integration insights, and field learnings.
Drive internal and external product education, including enablement for System Integrators (SIs) and channel partners.
Maintain deep awareness of AI and CX industry trends, ensuring Aircall's positioning remains competitive and insights continuously feed back into product and GTM strategies.
What you'll bring
5-8 years in technical consulting, solutions engineering, or integration‑focused project management.
Strong command of APIs, webhooks, and data structures (JSON, REST, GraphQL) - able to design and troubleshoot integrations confidently. Even better if you have experience with MCP Servers.
Hands‑on experience with low‑code orchestration tools (Zapier, Make, n8n, Workato) and CRM automation (HubSpot, Salesforce).
Practical understanding of AI workflow configuration, including prompt engineering, evaluation, and monitoring.
Proven ability to lead cross‑functional projects, working with Engineering, Product, and Customer Success to deliver scalable outcomes.
Excellent communication and stakeholder management skills, translating between technical and business audiences.
A growth mindset - constantly learning new tools, frameworks, and ways to improve the customer experience and team capability.
Nice‑to‑haves
Experience in telephony, voice, or contact center systems (SIP/WebRTC, call routing, containment, AHT).
Familiarity with BI and analytics tools (Looker, BigQuery, Snowflake).
Exposure to scripting (JavaScript/Python) for light customization and debugging.
What Success Looks Like
Consistent delivery of high‑quality, scalable AI Agent deployments with clear business outcomes.
Shortened time‑to‑first‑value and increased automation adoption across SMB customers.
Reusable frameworks, models, and templates actively used by peers and partners.
Strong collaboration with Engineering and Product to enhance platform capabilities.
Demonstrated leadership in team enablement and upskilling, with measurable impact on technical excellence.
Sustained improvements in customer metrics (containment, AHT, CSAT) and revenue impact (ARR, NRR) driven by your solutions.
$11,000 - $115,000 a year
This base range is not including a quarterly bonus, equity, and other benefits. The maximum OTE for this role is 190K, including a 75K bonus. The actual salary offered will carefully consider a wide range of factors, including your skills, location, qualifications, and experience.
Why This Role is Unique
This is a role for builders, connectors, and enablers. You'll be at the intersection of AI innovation, customer experience, and scalable delivery - combining technical curiosity with a consulting mindset. Every project you lead not only delivers value for customers but also strengthens Aircall's internal capability to deploy AI faster and smarter.
If you're motivated by making cutting‑edge technology accessible and impactful - while continuously learning and helping others do the same - this is the perfect next step in your career.
Why join us?
🚀 Key moment to join Aircall in terms of growth and opportunities
💆♀️ Our people matter, work‑life balance is important at Aircall
📚 Fast‑learning environment, entrepreneurial and strong team spirit
🌍 45+ Nationalities: cosmopolitan & multi‑cultural mindset
💵 Competitive salary package & equity
🏨 Medical, dental, and vision insurance is 100% covered
📈 401k plan with company matching!
✈️ Unlimited PTO - take the time you need to come to work feeling great!
⭐️ Wellness, commuter, and childcare reimbursements
💚 Generous parental leave policy
DE&I Statement
At Aircall, we believe diversity, equity and inclusion - irrespective of origins, identity, background and orientations - are core to our journey.
We pride ourselves on promoting active inclusion within our business to foster a strong sense of belonging for all. We're working to create a place filled with diverse people who can enrich and learn from one another. We're committed to ensuring that everyone not only has a seat at the table but is valued and respected at it by providing equal opportunities to develop and thrive.
We will constantly challenge ourselves to make sure that we live up to our ambitions around diversity, equity and inclusion, and keep this conversation open. Above all else, we understand and acknowledge that we have work to do and much to learn.
Want to know more about candidate privacy? Find our Candidate Privacy Notice here.
$103k-149k yearly est. 5d ago
Forward Deployment Engineer (Embedded AI / Systems Engineer)
Jeen.Ai
Devops engineer job in San Francisco, CA
R&D
Why Join Us?
Join the founding U.S. deployment team and embed directly with strategic enterprise clients. Bridge technology and business by architecting, coding, and operationalizing AI solutions in real-world production settings while shaping product direction from the field.
Key Responsibilities
Embed with customer teams (on-site or virtually), rapidly understand domain challenges, and design tailored AI/ML-driven solutions
Lead end-to-end implementation: data ingestion, model integration, application logic, UI/UX, APIs, monitoring, and scaling
Collaborate with customer stakeholders (technical and executive) to define roadmap, success metrics, and delivery plans
Iterate rapidly: prototype, test, learn, and refine in production settings
Surface lessons from client deployments back into our core platform: help shape product direction, SDKs, abstractions, and APIs
Assist the sales / pre‑sales process: technical discovery, architecture reviews, proof‑of‑concept scoping, and proposals
Ensure reliability, observability, performance, security, and compliance in deployed systems
Requirements
3-8+ years of professional software engineering experience (full stack, data, infrastructure, or ML systems)
Experience building production systems: APIs, data pipelines, scalable services, frontend/backends
Demonstrated ability to work in ambiguous environments, integrating multiple systems and APIs
Excellent communication skills: able to engage both engineers and non‑technical stakeholders
Highly autonomous, creative problem solver, comfortable working across layers (data ↔ app ↔ infra)
Willingness to travel (20%-40%) to customer sites
Preferred Qualifications
Experience in AI/ML, knowledge graphs, embeddings, LLMs, vector databases
Background in regulated industries (finance, healthcare, government)
$103k-149k yearly est. 5d ago
Forward Deployed Engineer
Tailwind Insurance Systems, Inc.
Devops engineer job in San Francisco, CA
About Tailwind
Tailwind is organizing the world's insurance information.
Insurance is one of the largest markets in the world, and it runs entirely on PDFs and antiquated infrastructure. We like it that way. While the industry takes years to roll out change, we ship in days.
Tailwind is backed by the earliest investors in financial infrastructure and technology titans Ramp ($23B), Robinhood ($100B) and Cognition ($10B), as well as over a dozen founders and leaders of billion dollar+ companies (including Segment, Newfront, Qualtrics, Cursor, etc) and even NBA champions.
About the Role
As a Forward Deployed Engineer, you'll sit at the intersection of engineering, product, and customer success. Owning the technical integration will be the foundation of your role - especially at first - but we expect you to evolve into leading the technical delivery of special projects and initiatives for your customers.
Your primary internal partner will be the Deployment Strategist who owns customer success overall and will partner with you to deliver value quickly and repeatedly.
This role is part solutions engineer, part product specialist, part technical consultant. You'll develop software at the edges of our product, configure our core platform, and become the trusted technical advisor to business leaders inside our customers' organizations.
Technical Requirements
We're looking for someone who can quickly gain context, build lightweight technical solutions, and work across teams to drive customer outcomes. You should:
Be comfortable standing up simple applications - in a modern cloud environment (e.g., AWS, GCP, Azure), using tools like serverless functions, containerized services, or basic web frameworks. You don't need to be a full‑stack wizard, but you should know how to go from zero to working prototype.
Have experience building and debugging data pipelines - whether that's writing SQL to transform and move data, scripting in Python or Java, or orchestrating lightweight ETL workflows. You'll often need to wrangle customer data or bridge gaps between systems.
Be fluent in APIs and integration logic - including authentication (OAuth, keys), RESTful design, and tools like Postman or curl. You'll regularly connect Tailwind's systems with customers' infrastructure and tools.
Be LLM‑native - you're already using tools like Copilot, Claude, or ChatGPT to accelerate dev work, generate glue code, or prompt your way through hairy problems.
Thrive in ambiguity - this isn't a checkbox role. We're solving problems that don't always have clear requirements. You'll need to think critically, get your hands dirty, and ship solutions that move the needle for customers.
What You'll Do
Own technical integrations for new brokerage customers - from first install to full rollout.
Expand our product with additional integrations and analytical capabilities: whatever meets a commercial need that our core product does not.
Work closely with deployment strategists and product ops to ensure smooth rollouts and successful adoption.
Collaborate with engineering on product fixes, light customizations, and new deployments.
Travel occasionally for key deployments or high‑touch customers, if needed.
What Makes You a Great Fit
1-6 years in software engineering, solutions engineering, or technical consulting.
Comfortable working with APIs, SQL databases, and cloud or on‑prem deployments.
Strong communication skills - you'll be customer‑facing and technical.
Excited to work in our SF office.
Insurance or financial services experience a plus but not required.
Why You Should Work Here
Impact: You'll be the bridge between product and customer - ensuring success at the most critical stage of adoption. The best startups are adopting this model because it gives engineers direct exposure to customers, influence over the roadmap, and a faster feedback loop than traditional engineering roles. Skills in this area are in high demand and open doors to product, leadership, and founder paths.
Speed: Work closely with engineering to ship improvements fast.
Career growth: Forward‑deployed engineers often move into product, solutions architecture, or technical leadership as startups scale.
Elite Team: Small, ambitious, and fast‑moving. Our core team has worked together 3 times now, hailing from Carta ($7B) and AppDirect ($2B) and includes veterans from Ramp, Apple, Salesforce and C3 AI - people who know what great product looks like, and what it takes to build it fast.
$103k-149k yearly est. 3d ago
Forward Deployed Engineer
Lancedb Inc.
Devops engineer job in San Francisco, CA
Forward Deployed Engineer - LanceDB
Team: Engineering
Job Type: Full-Time
About LanceDB
LanceDB is an open-source, cloud-native vector database and multimodal AI lakehouse built on a high-performance columnar format. It enables developers and enterprises to build scalable, real-time search and analytics applications across vectors, structured data, and AI workflows. LanceDB delivers both embedded and managed deployment models with rich SDKs in Rust, Python, and other languages, and is purpose-built to power state-of-the-art retrieval, feature engineering, and large-scale AI systems.
Role Overview
As a Forward Deployed Engineer (FDE) at LanceDB, part of the Engineering organization, you will operate at the intersection of deep systems engineering and direct customer engagement. You will work hands‑on with strategic customers in the Bay Area to design, deploy, and scale LanceDB in demanding production environments.
This is a highly technical, product‑facing role. You will not only deliver customer solutions, but also contribute production‑quality code and actionable feedback directly back to LanceDB's core product lines. Your real‑world experience deploying LanceDB alongside modern data infrastructure will directly influence product architecture, APIs, and performance characteristics.
What You'll Do
Lead on‑site and remote technical deployments of LanceDB with enterprise and strategic customers, including architecture design, benchmarking, performance tuning, and operational hardening.
Write and maintain production‑grade code in Rust and Python for customer integrations, SDK enhancements, ingestion pipelines, and internal tooling.
Contribute code upstream to LanceDB's core repositories, including bug fixes, performance improvements, new features, and architectural refinements informed by customer use cases.
Capture, distill, and communicate structured product feedback from customer engagements to product and core engineering teams, influencing roadmap and design decisions.
Integrate LanceDB into existing data and AI infrastructure stacks, including Spark, Ray, and similar distributed processing frameworks.
Diagnose and resolve complex issues involving distributed systems, cloud object storage, concurrency, and large‑scale data movement.
Partner closely with product, core engineering, and GTM teams to ensure customer requirements translate into generalizable, reusable platform capabilities.
Deliver technical deep dives, workshops, and proofs‑of‑concept for engineers and architects at customer organizations.
What We're Looking For
At least 2 years of experience in a technical field or forward deployed engineering role.
Ability and willingness to be on‑site at customers daily. This role is located in the San Francisco Bay Area, and candidates must live there. No exceptions will be made for this requirement.
Proven experience building, deploying, or operating distributed, cloud‑native databases or data platforms in production.
Strong proficiency in Rust and Python, with a demonstrated ability to write performant, maintainable systems code.
Hands‑on familiarity with data infrastructure technologies such as Apache Spark, Ray, or similar distributed compute and data processing frameworks.
Experience integrating databases with batch and streaming data pipelines, ML workflows, or large‑scale analytics systems.
Demonstrated ability to contribute directly to core product codebases, not just customer‑specific glue or scripts.
Deep understanding of distributed systems concepts including sharding, replication, consistency, concurrency, and failure handling.
Strong customer‑facing skills, with the ability to work directly with engineers, architects, and technical leaders to drive solutions from concept to production.
Nice-to-Have
Experience with vector databases, similarity search, or multimodal data systems.
Prior contributions to open‑source databases, storage engines, or distributed systems projects.
Familiarity with cloud platforms (AWS, GCP, Azure), Kubernetes, Terraform, and observability tooling.
Experience with Apache Arrow-based ecosystems, large‑scale ML data pipelines, or AI infrastructure stacks.
Why LanceDB?
As a Forward Deployed Engineer at LanceDB, you'll work directly with cutting‑edge customers while shaping the core product itself. Your field experience will feed back into LanceDB's architecture, APIs, and performance roadmap, giving you a rare opportunity to influence both customer success and the evolution of next‑generation AI data infrastructure.
$103k-149k yearly est. 2d ago
Forward Deployed Engineer
Supergood Systems, Inc.
Devops engineer job in San Francisco, CA
With a strong early technical team in place, we're looking to add a forward deployed engineer to help us win new customers and go deeper with existing ones.
Your role will be to partner with Alex Klarfeld, the CEO/Founder, in managing outreach, building demos, onboarding new customers, iterating with the engineering team on product feedback, and closing deals.
Mission
To attract net new customers through demos, qualify and gather requirements from prospects, implement integrations for new prospects and manage active customer engagements, so the company can:
Grow the top of funnel by demonstrating the product through demos
Gather requirements and help qualify prospects on sales calls
Drive the sales process by auditing new integration requests and using this to qualify customers
Implement new integrations using Supergood tooling, and take learnings from implementation to shape the product roadmap
Manage customer roll-outs, ongoing support, and net new integrations
Technically Adept: You exhibit a high understanding of technical concepts that helps establish trust from prospects and customers, ensuring the product delivered will be high quality.
Crazy Creative: You have a never-ending supply of wild and crazy ideas to showcase the product in an interesting way or to solve customer problems creatively.
High Level of Ownership: You're constantly and neurotically monitoring every single customer deployment, proactively identifying issues before customers even notice them.
Run through walls: You come up with creative solutions to very strange problems. You're comfortable working in a non-intuitive problem space such as reverse engineering, and know how to push towards success regardless of the resistance.
Requirements
1-3 years of experience in consulting, early-stage sales, or growth-focused roles at large companies
Ability to engage and build relationships with customers of all types on a technical level
Interest and ability to build from zero to one, and be a self-starter
Has tinkered with modern LLM and AI tools and understands their capabilities, and more importantly, their limitations
Experience in environments with high velocity of iteration and shipping
Based in San Francisco, CA and willing to work in person five days a week
Competitive salary and generous equity compensation
Medical, dental and vision insurance
Daily lunch and coffee / snacks / beverages
Supergood is an equal opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.
A healthcare technology company in San Francisco is looking for a Forward-Deployed Engineer to enhance patient care navigation. You will collaborate closely with teams to configure workflows and build automated agents to improve care outcomes. Ideal candidates should have a Bachelor's degree in a related field, proven experience with AI content generation, and strong analytical skills. This role offers a competitive salary and benefits including health insurance and paid time off.
$103k-149k yearly est. 5d ago
Software Engineer - Jobs Platform & Certifications
Openai 4.2
Devops engineer job in San Francisco, CA
About the Role
We're looking for exceptional engineers to help build one of the most transformative applications of AI: using ChatGPT to expand economic opportunity at scale. This is a unique chance to work at the intersection of agents, search/matching, personalization, UX, and real-world economic impact, building AI products that directly change people's lives.
Millions already turn to ChatGPT for career help, but the job market is stuck in static résumés, keyword search, and opaque hiring. We're building something different: a career agent that understands your skills, helps you prove them through certifications, and works nonstop to unlock opportunities - leading to a network where worker and employer agents match on real capabilities, not job boards.
In this role, you will:
Own major product surfaces end-to-end to build the next-generation AI-powered jobs platform
Design and ship core systems for matching, candidate verification, skills representation, and workflow automation
Work with research to turn new model and agent capabilities into production features that help users prove skills, certify expertise, and find great opportunities
Build high-trust, scalable systems that support millions of job seekers and employers
Run experiments, talk to users, and iterate quickly to identify what drives successful outcomes, not clicks or resumes
Work closely with product, design, research, and ops to create human-centered experiences powered by AI
Optimize for speed, reliability, and security, ensuring a trusted platform for economic mobility
You may be a good fit if you have:
5+ years of software engineering experience building highly-available user-facing products
Experience working across the stack (frontend + backend) and comfort owning features end-to-end
Passion for marketplaces, matching systems, or search/personalization, even if you haven't built one before
Interest in AI-driven UX, agents, and workflow automation (direct ML experience not required)
Enthusiasm for working in a fast-moving, ambiguous environment where new ideas ship quickly and user impact matters most
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
For additional information, please see OpenAI's affirmative action and equal employment opportunity policy statement.
Background checks for applicants will be administered in accordance with applicable law, and qualified applicants with arrest or conviction records will be considered for employment consistent with those laws, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, for US-based candidates. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.
To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
OpenAI Global Applicant Privacy Policy
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
$122k-166k yearly est. 2d ago
Staff Software Engineer, Site Reliability
Asana 4.6
Devops engineer job in San Francisco, CA
Asana's rapid growth brings new challenges in keeping our systems fast, reliable, and resilient. As our product evolves, we're making a major investment in reliability - and building a brand new SRE team in Warsaw is a key part of that strategy. This is your chance to help shape it from day one.
This isn't a traditional “ops” role - we're looking for strong software engineers who are passionate about building reliable, distributed systems. You'll work closely with a small SRE team in San Francisco, infrastructure engineers in Reykjavik, and an established infrastructure team in Warsaw. Warsaw will be a significant hub for our future infrastructure engineering and operations. As one of the first engineers here, you'll have a real say in how we build reliable infrastructure, manage incidents, and support the rest of the company.
This role is based in our Warsaw office with an office‑centric hybrid schedule - in‑office days are Monday, Tuesday, and Thursday.
We offer a Contract of Employment (UoP) for our employees in Poland.
What you'll do:
Influence the future of Asana's SRE practice, especially as we grow the Warsaw team.
Lead reliability‑focused projects across our stack - from infrastructure to tooling to incident response.
Define and implement Asana's incident management process - we're investing here, and you'll help shape how it works.
Build internal platforms and frameworks that help other teams improve the reliability of their services.
Be part of (and help shape) a sustainable on‑call rotation - shared across teams in Warsaw, San Francisco, and Reykjavik. On average, we handle ~1 page per day, but it's not constant, and we care about keeping things sane.
Work with our stack: AWS, Kubernetes (EKS), Datadog, MySQL (RDS), ElasticSearch (OpenSearch), Redis, DynamoDB, Terraform, TypeScript, Scala, Go, and Python. (Yeah, we know this sounds like buzzword bingo - but we want this post to actually show up in your searches.)
About you:
You're a strong and experienced software engineer who's comfortable writing and reading code - this isn't an ops role.
You care about reliability, scalability, and long‑term maintainability, not just quick fixes.
You might have worked as an SRE before - or maybe you were a product engineer who kept getting pulled into infra work because you cared about how systems actually run.
You've seen systems at scale (or want to), and you're excited about solving infrastructure problems that have broad impact.
You're curious, take initiative, and aren't afraid to work in ambiguous spaces - especially important on a brand new team.
You collaborate well across teams and want to help others build more reliable systems.
You don't need to know our exact stack, but you're eager to learn whatever it takes to make things better.
You demonstrate curiosity about AI tools and emerging technologies, with a willingness to learn and leverage them to enhance productivity, collaboration, or decision‑making.
Why this role?
Founding team: You'll be one of the first SREs in Warsaw - and a key player in a growing team.
Drive real change: This isn't a role where you'll just patch up legacy systems. We are expecting (and supporting) real architectural changes to improve reliability, scalability, and long‑term operability.
Big impact: Our product is scaling fast, and reliability is a top company priority.
Room to grow: As this team grows, so will your influence - whether you want to lead projects, mentor others, or help shape how we scale.
Global collaboration: Work closely with experienced engineers in San Francisco, Reykjavik, and Warsaw, while helping build the future of SRE at Asana.
What we offer:
Generous, transparent and fair compensation system (base salary and generous Restricted Stock Units in Asana Inc.).
Contract of Employment (with 50% tax deductible costs for author's rights usage for Engineers).
Health insurance with dental and travel coverage (Lux Med).
Lunch catering on the days that you work from the office.
Career growth budget.
Home office setup budget.
Gym/Fitness reimbursement.
Fertility healthcare and family‑forming support with Carrot.
Mental health support with Modern Health.
Group life insurance.
MacBooks with all necessary accessories.
For this role, the estimated base salary range is between 23 000 and 33 000 PLN gross monthly on the contract of employment (UoP). The actual base salary will vary based on various factors and individual qualifications objectively assessed during the interview process. The listed range above is a guideline, and the base compensation range for this role may be modified.
Our total compensation consists of base salary and equity (RSUs).
About us
Asana is a leading platform for human + AI collaboration. Millions of teams around the world rely on Asana to achieve their most important goals, faster. Asana has been named to Fortune's Best Workplaces for 7+ years and recognized by Fast Company, Forbes, and Gartner for excellence in workplace culture and innovation. We offer an exceptional office‑centric culture while adopting the best elements of hybrid models to ensure that every one of our global team members can work together effortlessly. With 13+ offices all over the world, we are always looking for individuals who care about building technology that drives positive change in the world and a culture where everyone feels that they belong.
Join Asana's Talent Network to stay up to date on job opportunities and life at Asana.
$181k-242k yearly est. 3d ago
Software Engineer - Reliability
Pantera Capital
Devops engineer job in Palo Alto, CA
About xAI
xAI's mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company's mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All engineers are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.
About the Role
We are seeking a talented Site Reliability Engineer (SRE) to join our SuperComputing team. In this role, you'll ensure the reliability, scalability, and performance of our high-performance computing (HPC) infrastructure, powering cutting-edge AI research. You'll collaborate with cross-functional teams to build and maintain systems that support massive-scale data processing and model training. You'll ensure Grok stays reliable for millions while inventing new approaches at the intersection of SRE and cutting-edge AI to help define the future of AI reliability engineering.
What You'll Do
Design, implement, and maintain robust, scalable infrastructure for supercomputing environments.
Monitor and optimize system performance, ensuring high availability and minimal downtime.
Develop automation tools and scripts to streamline operations and improve system reliability.
Troubleshoot complex issues across distributed systems, networks, and storage solutions.
Collaborate with AI researchers and engineers to support compute-intensive workloads.
Implement security best practices to protect sensitive data and infrastructure.
Contribute to capacity planning and disaster recovery strategies.
Participate in an on-call rotation to ensure 24/7 system reliability.
Ideal Experiences
Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
3+ years of experience in site reliability engineering, DevOps, or systems engineering.
Proficiency in Linux system administration and scripting (e.g., Python, Bash).
Experience with containerization (e.g., Docker, Kubernetes) and cloud platforms (e.g., AWS, GCP, Azure).
Strong understanding of networking, distributed systems, and storage technologies.
Familiarity with HPC environments, GPU clusters, or large-scale data processing.
Excellent problem-solving skills and ability to work in a fast-paced, dynamic environment.
Strong communication skills and a collaborative mindset.
Bonus: Experience with Infrastructure as Code (e.g., Terraform, Ansible) or monitoring tools (e.g., Prometheus, Grafana).
Location
This role is based in the Bay Area (San Francisco and Palo Alto). Candidates are expected to be located near the Bay Area or open to relocation.
Tech Stack
Languages: Rust, Python, C++, Golang
Interview Process
Application Review: Submit your CV and a statement of exceptional work. Our team will review your application to assess fit.
Phone Interview (45 minutes): A brief conversation with a team member to discuss your background, key accomplishments, and motivation.
Main Interview Process
1 Coding assessment: Solve problems in Rust, Python, C++, or Golang
1 Skill Specific Technical Interview: Demonstrate practical skills in a live problem-solving session.
1 SRE/System Case Study: Analyze and solve a complex, real-world system design or operational problem, demonstrating your technical expertise, problem-solving skills, and ability to optimize system reliability and performance.
Project Deep-Dive: Present your past exceptional work to a small audience.
Annual Salary Range
$180,000 - $440,000 USD
Benefits
Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.
xAI is an equal opportunity employer.
California Consumer Privacy Act (CCPA) Notice
$106k-150k yearly est. 1d ago
Software Engineer - Simulation
Doppel
Devops engineer job in San Francisco, CA
Why Join Doppel
Doppel is built to outsmart one of the great threats AI presents: mass‑manufactured social engineering. Countless scams, deepfakes, and other social engineering attacks are surging across every digital channel: websites, social media, ads, encrypted messaging apps, mobile, and more. Our mission is simple but bold: make the internet a safer place by outsmarting the world's fastest‑evolving digital threats.
Backed by top‑tier investors and trusted by some of the world's most recognized brands, Doppel is growing fast. If you're driven to solve real‑world problems with bold technology, we'd love to meet you.
What We're Building
We're building the AI‑native social engineering defense platform.
This means we're designing scalable systems that monitor billions of domains, social media accounts, apps, dark web forums, etc., and leverage AI agents to identify and neutralize digital threats.
What We're Looking For
We're looking for an experienced full‑stack engineer to help build out our new Simulation product. We launched it only recently, but market traction is growing extremely fast, and we need people excited to jump in and do whatever it takes to build what our customers desperately want. For example:
Build out a “Cursor for phishing simulation” flow that lets users create simulations using natural language.
Build out voice deepfakes that use AI to call employees pretending to be their co‑worker.
Here are some additional resources on what we built and why.
What We Offer
🚀 A mission‑driven culture with low ego, high ownership, deep customer obsession, and exceptional talent density
🍽️ Free lunch and dinner in the office
🌴 Flexible PTO
✈️ Quarterly team offsites
Join Doppel
Doppel is the first platform built to dismantle digital deception at scale. We scan over 150 million entities daily and deploy continuously adaptive AI SOC agents, paired with expert human analysts, to uncover and disrupt the infrastructure behind phishing, impersonation, and online fraud before attacks can spread. Our Threat Grid turns every customer signal into shared intelligence, making each disruption smarter, faster, and more effective.
We're not just another cybersecurity company. We're defining the future of social engineering defense, where trust is protected, and deception becomes unprofitable.
$106k-150k yearly est. 2d ago
Senior Software Engineer - Site Reliability
Ironclad Inc.
Devops engineer job in San Francisco, CA
Business runs on contracts. Every dollar earned, relationship formed, and advantage gained comes down to the contract that makes it real. But getting a contract done is more complicated than it should be. And when contract data is buried, leaders can't see risks and obligations or act in time.
Ironclad is the leading AI contracting platform that transforms agreements into assets. Contracts move faster, insights surface instantly, and agents push work forward, all with you in control. Whether you're buying or selling, Ironclad unifies the entire process on one intelligent platform, providing leaders with the visibility they need to stay one step ahead. That's why the world's most transformative organizations, from OpenAI to the World Health Organization and the Associated Press, trust Ironclad to accelerate their business.
We're consistently recognized as a leader in the industry: a Leader in the Forrester Wave and Gartner Magic Quadrant for Contract Lifecycle Management, a Fortune Great Place to Work six years running, and one of Fast Company's Most Innovative Workplaces. Ironclad has also been named to Forbes' AI 50 and Business Insider's list of Companies to Bet Your Career On.
This is a hybrid role. Office attendance is required at least twice a week on Tuesdays and Thursdays for collaboration and connection. There may be additional in-office days for team or company events.
Software Engineer, Platform Infrastructure sits under the umbrella of Product and Engineering and plays a pivotal role in ensuring our developers have the tools, infrastructure, and systems to provide our customers with reliable, secure, and scalable software.
Roles & Responsibilities:
Be part of the Cloud Platform SRE Team, focused on building our Cloud Platform using modern tools and best practices.
Champion SRE best practices within the team and throughout the organization
Solve the whole problem. Architect for resiliency, identify risks, and make it happen.
Ability to use a wide variety of open source technologies and tools
Understanding of Continuous Integration & Continuous Delivery in the software engineering process, and the ability to clearly articulate how a Software Engineer, Platform Infrastructure facilitates these practices in collaboration with the Development, Quality Assurance and Technical Operations teams to drive business goals
Ability to demonstrate a clear, energetic and excited interest in automating everything (build, test, release/deploy, monitoring, reporting), which includes Infrastructure as Code
Preference for collaboration, open communication and reaching across functional borders
Thorough understanding of backup/recovery systems, development automation routines, storage area networks and virtualization
Be on an on-call rotation to respond to incidents that impact Ironclad's availability, and provide support with internal or customer-facing incidents
Develop an understanding of the near, mid, and long-term needs of the business - and understand how the Platform SRE contributes to its success
Influence architectural decisions with a focus on security, scalability, and high performance
Be a mentor, multiply our team's output with leadership and guidance
Key Skills:
8+ years of professional DevOps / SRE experience.
5+ years of coding (Python, Golang, TypeScript, Shell Scripting, Rust, etc.), scaling, and architecture work. Experience with TypeScript is a plus.
Expert knowledge of Docker and Kubernetes
Experience with Google Cloud Platform (or similar provider)
Experience with Build and Deployment tools such as Terraform, CircleCI, ArgoCD
Experience with multi-region support is a plus
Experience with ELK Stack, MongoDB, Postgres
Strong technical aptitude and exceptional communication skills (written and verbal)
Ability to appropriately prioritize and respond to different escalations
Troubleshooting and analytical skills, drive to help customers, and the ability to dive deep and learn a new product.
Experience and desire to work cross-functionally
Team and goal-oriented.
High output; low ego
Benefits:
Health, dental, and vision insurance
401k
Wellness reimbursement
Take what you need vacation policy
Generous parental leave for both primary and secondary caregivers
Base Salary Range: $180,000 - $200,000
The base salary range represents the minimum and maximum of the salary range for this position based at our San Francisco headquarters. The actual base salary offered for this position will depend on numerous factors, including individual proficiency, anticipated performance, and the location of the selected candidate. Our base salary is just one component of Ironclad's competitive total rewards package, which also includes equity awards (a new hire grant, along with opportunities for additional awards throughout your tenure), competitive health and wellness benefits, and a commitment to career growth and development.
Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.
$180k-200k yearly 4d ago
Forward Deployed Engineer
Rely Health
Devops engineer job in San Francisco, CA
At Rely Health, we leverage a comprehensive suite of technology tools to ensure every patient receives personalized support throughout their healthcare journey. Our patient care navigators utilize advanced AI‑driven solutions, multi‑channel communication platforms, and real‑time data analytics to provide high‑quality, cost‑effective, and accessible care to diverse communities across the United States.
By combining human empathy with cutting‑edge technology, Rely Health ensures comprehensive, efficient, and accessible care navigation for all patients, regardless of their location or circumstances. Our solution not only reduces worry and frustration for patients and their families but also improves overall health outcomes and reduces the total cost of care.
About the role
The Forward‑Deployed Engineer (FDE) is a customer‑proximate, outcomes‑driven individual contributor who makes Rely's programs work end‑to‑end in the real world. This role sits at the intersection of engineering, product, and operations and is “navigator‑based”: you work closely with navigators and ops, and collaborate directly with customers and Sales/AM to ensure we deliver outcomes.
This role is not static. Tools and platforms evolve month‑to‑month as we adopt vendors, build internal systems, and improve the stack. Success requires comfort learning new tools quickly while maintaining a high bar for quality, reliability, and delivery.
Important note: what “engineering” means here
This is not a backend production engineering role.
You will build internal tools, scripts, workflow configurations, dashboards, and prototypes (often using AI‑assisted / low‑code tooling).
You will not be asked to own production backend services.
What You'll Do
1) Build & Operate Operational Workflows (System of Work / Patient Manager)
Configure and evolve workflows that power navigator execution: routing, task creation, campaigns, automation, and program logic.
Partner with Product/Ops to ensure workflow configuration reflects real operations and reduces failure modes.
Build internal helpers/tools that make workflows easier to operate and safer to change.
2) Build & Operate Agent Workflows (Agent System)
Build and maintain agent workflows that handle operational tasks end‑to‑end: tool use, guardrails, escalation paths, and failure handling.
Shadow real workflows, identify friction, and ship improvements that reduce manual work and increase closure.
Maintain reusable components/templates so we don't rebuild one‑offs for every customer.
3) Build & Improve Automated Communication Agents (Voice and beyond)
Build and iterate automated agents that interact with patients/customers (voice, messaging, or other channels as needed).
Ensure behavior is operationally correct (right intent, right routing, right follow‑ups) and measurable.
Follow quality gates/playbooks for customer‑facing automation (testing/evals, rollout/rollback, post‑deploy verification).
4) Customer Outcomes, Expansion, and Getting Customers to the Next Level
Identify where outcomes break by learning the customer's real operations and constraints.
Partner with Sales/AM and customer stakeholders to move customers to the next level (new workflows, higher volume, broader scope, deeper adoption) by shipping what's needed to make it real.
Turn early signals into concrete pilots with tight scope, measurable success criteria, and clear timelines.
Convert repeated customer needs into scalable patterns (templates, components, playbooks) so we scale beyond one account.
5) Rapid Viability → Standardization (Process + Quality Gates)
Push viable solutions into production quickly: small increments, clear ownership, and verifiable results.
Create/upgrade the right proof for changes: simulations, structured reviews, eval sets, regression checks, audit queries, dashboards, or other validation mechanisms.
Do the necessary research (artifacts, logs, customer workflows, tool capabilities) to choose the right approach and avoid rework.
6) Enablement, Documentation, and Team Lift
Write short runbooks, checklists, and “how this works” docs so we don't relearn the same lessons.
Pair with teammates, share patterns/pitfalls, and contribute to a culture of high‑quality execution.
As you ramp, help newer hires ramp faster through clear documentation and lightweight support.
Operating Standards
Changes ship through a tracked change process with traceability (PRs, reviews, versioning, release notes).
Meaningful changes include appropriate validation, rollout/rollback thinking, and post‑deploy verification.
When a workflow has a specific playbook (e.g., calling), follow it.
HIPAA / PHI handling: you'll work in a healthcare environment. We expect careful handling of sensitive data, least‑privilege access, auditability, and secure operational practices.
Qualifications
Minimum Required Qualifications:
Bachelor's degree in a related field (UX/UI design, Interaction Design, HCI, Computer Science) or equivalent experience.
Proven experience with GPT (or similar AI content generation technologies) and Kibana.
Comfort with SQL and basic scripting (Python/TypeScript) for internal tooling.
Strong knowledge of Elasticsearch and its integration with Kibana.
Ability to communicate complex technical topics clearly and concisely.
Ability to debug across systems (configs, APIs, logs, data) and ship reliable changes.
Experience interfacing with customers in a high‑capacity role and significantly driving product adoption.
Experience with no‑code or low‑code platforms.
Experience with project management/documentation tools like Notion.
Preferred Qualifications:
Experience with other OpenAI products.
Experience building automation/agents, eval sets, or quality gates.
Experience in forward‑deployed engineering, solutions engineering, implementation, or technical operations.
Competencies (Knowledge/Skills/Abilities):
Strong ownership and problem‑solving mindset; you drive issues to verified resolution.
Ability to build relationships across the company to effectively design products.
Strong analytical skills to evaluate model performance and improve prompt effectiveness.
Knowledge of data visualization principles and best practices.
Excellent problem‑solving skills and a detail‑oriented approach to design.
Comfort operating in ambiguous, fast‑moving environments with evolving tools.
Strong communication and teamwork skills, with the ability to collaborate effectively with cross‑functional teams.
Ability to scope project timelines and meet appropriate deadlines.
Ability to work evenings and weekends as needed to execute on deadlines.
Creativity and an open mindset to drive experimentation and internal R&D.
Proven ability to translate complex technical concepts into clear, actionable insights for non‑technical stakeholders, ensuring that customers can effectively leverage advanced product features.
The above statements are intended to describe the general nature and level of the work being performed by people assigned to this job. They are not exhaustive lists of all duties, responsibilities, knowledge, skills, abilities, and working conditions associated with it.
Rely Health does not discriminate against any person on the basis of race, creed, color, religion, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information (including family medical history), veteran status, marital status, pregnancy or related condition, or any other basis protected by law. Rely Health is committed to complying with all applicable national, state and local laws pertaining to nondiscrimination and equal opportunity.
Working Conditions
Requires frequent use of the telephone and computer. Prolonged periods of sitting at the desk, computer work and reading can be anticipated.
401(k)
Health insurance
Vision insurance
LT/ST Disability and Life Insurance
Technology reimbursement
Paid time off (Vacation, Sick, Holiday)
Paid Parental leave
The pay range for this role is:
90,000 - 120,000 USD per year (Headquarters)
$103k-149k yearly est. 5d ago
Software Engineer, Reliability
OpenAI 4.2
Devops engineer job in San Francisco, CA
Join the engineering teams that bring OpenAI's ideas safely to the world!
The Applied Engineering team works across research, engineering, product, and design to bring OpenAI's technology to consumers and businesses. We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely. Safety is more important to us than unfettered growth.
About the Role
As OpenAI continues to grow, we are looking for experienced, problem-solving engineers to ensure our systems scale. Our success depends on our ability to quickly iterate on products while also ensuring that they are performant and reliable. You will work in a deeply iterative, collaborative, fast-paced environment to bring our technology to millions of users around the world, and ensure it's delivered with safety and reliability in mind. Successful candidates will play a crucial role in ensuring the reliability, scalability, and performance of our systems as we continue to expand. As a reliability expert, you will be at the forefront of maintaining and enhancing the stability, scalability, and performance of our rapidly evolving infrastructure. You will work closely with cross-functional teams, including software engineers, product managers, and data scientists, to build and maintain resilient systems that can handle our growing user base and workload.
In this role, you will:
Design and implement solutions to ensure the scalability of our infrastructure to meet rapidly increasing demands.
Collaborate with development teams to make the systems they design and operate more reliable.
Implement and manage monitoring systems to proactively identify issues and anomalies in our production environment.
Develop and maintain service level objectives (SLOs) and service level indicators (SLIs) to measure and ensure system reliability.
Implement fault-tolerant and resilient design patterns to minimize service disruptions.
Build and maintain automation tools to streamline repetitive tasks and improve system reliability.
Partner with researchers, engineers, product managers, and designers to bring new features and research capabilities to the world.
Participate in an on-call rotation to respond to critical incidents and ensure 24/7 system availability.
You might thrive in this role if you:
Enjoy seeking out and addressing bottlenecks and areas for performance improvement in our systems.
Utilize Infrastructure as Code (IaC) principles to automate infrastructure provisioning and configuration management.
Are experienced in collaborating with cross-functional teams to ensure that reliability and scalability are considered in the design and development of new features and services.
Have a track record of accelerating engineering reliability by empowering your fellow engineers with excellent tooling and systems.
Have a humble attitude, an eagerness to help your colleagues, and a desire to do whatever it takes to make the team succeed.
Own problems end-to-end, and are willing to pick up whatever knowledge you're missing to get the job done.
Qualifications:
Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent work experience).
Proven experience as a reliability engineer or in a similar role at a fast-paced, rapidly scaling company.
Strong proficiency in cloud infrastructure.
Proficiency in programming/scripting languages.
Experience with containerization technologies and container orchestration platforms like Kubernetes.
Knowledge of IaC tools such as Terraform or CloudFormation.
Excellent problem-solving and troubleshooting skills.
Strong communication and collaboration skills.
Experience with observability tools such as Datadog, Prometheus, Grafana, Splunk, and the ELK stack.
Experience with microservices architecture and service mesh technologies.
Knowledge of security best practices in cloud environments.
This role is exclusively based in our San Francisco HQ. We offer relocation assistance to new employees.
About OpenAI
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products. AI is an extremely powerful tool that must be created with safety and human needs at its core, and to achieve our mission, we must encompass and value the many different perspectives, voices, and experiences that form the full spectrum of humanity.
We are an equal opportunity employer, and we do not discriminate on the basis of race, religion, color, national origin, sex, sexual orientation, age, veteran status, disability, genetic information, or other applicable legally protected characteristic.
For additional information, please see OpenAI's Affirmative Action and Equal Employment Opportunity Policy Statement.
Qualified applicants with arrest or conviction records will be considered for employment in accordance with applicable law, including the San Francisco Fair Chance Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act. For unincorporated Los Angeles County workers: we reasonably believe that criminal history may have a direct, adverse and negative relationship with the following job duties, potentially resulting in the withdrawal of a conditional offer of employment: protect computer hardware entrusted to you from theft, loss or damage; return all computer hardware in your possession (including the data contained therein) upon termination of employment or end of assignment; and maintain the confidentiality of proprietary, confidential, and non-public information. In addition, job duties require access to secure and protected information technology systems and related data security obligations.
To notify OpenAI that you believe this job posting is non-compliant, please submit a report through this form. No response will be provided to inquiries unrelated to job posting compliance.
We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made via this link.
OpenAI Global Applicant Privacy Policy
At OpenAI, we believe artificial intelligence has the potential to help people solve immense global challenges, and we want the upside of AI to be widely shared. Join us in shaping the future of technology.
$122k-166k yearly est. 1d ago
Customer-Centric AI Deployment Engineer
Truth Systems 4.5
Devops engineer job in San Francisco, CA
A technology firm focused on AI safety is seeking a candidate for a mission-driven role that melds engineering with customer empathy. You will be responsible for leading client integrations of trust and safety software, ensuring successful product adoption while also translating client feedback into actionable improvements. This in-person role is based in San Francisco and includes regular travel across the U.S. Competitive salary and equity are offered.
How much does a devops engineer earn in Castro Valley, CA?
The average devops engineer in Castro Valley, CA earns between $97,000 and $177,000 annually. This compares to the national average devops engineer range of $80,000 to $135,000.
Average devops engineer salary in Castro Valley, CA
$131,000
What are the biggest employers of Devops Engineers in Castro Valley, CA?
The biggest employers of Devops Engineers in Castro Valley, CA are: