Senior deployment engineer jobs near me - 4,429 jobs
Let us run your job search
Sit back and relax while we apply to 100s of jobs for you - $25
Presentation Engineer
Northern Canal Medical Center 4.2
Remote senior deployment engineer job
Title*: Presentation Engineer Our Mission Prezent is on a mission to transform how enterprises communicate. Founded in 2021, we have rapidly grown into a 200+ person, fully remote team that's backed by $40+ million in venture funding. Our AI-powered productivity platform, ASTRID, is the first solution purpose-built for enterprise communication needs-delivering up to 90% time savings and 60% cost reduction in presentation development.
Our Vision
We believe that effective communication accelerates business impact. By automating design best practices and tailoring content to audience dynamics, Prezent empowers teams to craft clear, engaging, and on-brand presentations at scale. Our focus is on enabling Fortune 2000 companies-particularly in industries like healthcare, biopharma, high-tech, banking, and insurance-to achieve better alignment, faster decision-making, and stronger business outcomes.
The Role
As a *Presentation Engineer*, you'll join a dynamic team of technologists, designers, and strategists who bring business communication to life. Your mission is to bridge the gap between data, story, and design-transforming complex ideas into compelling presentations that drive real-world impact.
You'll be the go-to partner and sounding board for our clients, helping them sharpen their storytelling, amplify impact, and build presentation excellence across their organizations. You'll help teams plan and execute presentation calendars, bring the best of Prezent.AI to life, and guide users in effectively leveraging ASTRID, our AI-powered communication engine.
No two days will be the same-you'll flex between understanding audience needs, engineering presentation workflows, and enabling leaders at every level to communicate with clarity, confidence, and impact.
What You'll Do
* Partner with enterprise clients to understand their most critical communication challenges, presentation workflows, and opportunities for improvement.
* Become an embedded team member for the client, providing integral insights.
* Help teams craft and structure powerful narratives that drive influence and decision-making, from executive ready communication to messaging to the masses
* Design and build scalable, reusable presentation templates and storytelling frameworks within *Prezent*
* Be a trusted advisor-helping users learn and adopt AI-driven storytelling tools to elevate their work
* Deliver customized presentation solutions and lead pilots, trainings, and office hours to drive adoption, enable power users, and establish best practices
* Provide structured feedback loops from client experiences to our *product and design teams*, shaping the future of the platform by improving the ‘presentation brain' for each account.
* Identify and nurture *warm leads* within existing accounts for software adoption and overnight presentation services
* Collaborate cross-functionally with *product*, *design*, and *engineering* teams to continuously refine user experience and product-market fit
What We're Looking For
* A *storyteller* with strong business communication skills and a passion for helping others make their ideas land with impact
* Experience in *consulting, customer success, or business operations/strategy*
* A *scientific* or *technology focused foundation*-degree in life sciences, computer science, engineering or related field
* *1-3 years* of experience as a consultant in a client-facing, fast-paced environment.
* Strong project management skills, and able to execute on multiple projects at a time
* Strong analytical and problem-solving skills with a *structured approach* to ambiguity
* Agile, adaptable, and energized by working across disciplines
* A self-starter who thrives in dynamic settings and is passionate about creating an *AI-first business communications platform*
* A blend of *creativity and technical fluency*-comfortable both discussing technical aspects in either biopharma or the tech industry and about scaling workflows
Benefits
* *ESOPs*: You'll be eligible for Employee Stock options.
* *Comprehensive Benefits*: Flexible, top-tier benefits package in line with US market standards.
* *Professional Growth*: Thrive in a fast-paced environment that encourages innovation, continuous learning, and career progression.
Job Type: Full-time
Pay: $55.00 - $65.00 per hour
Expected hours: 40 per week
Benefits:
* 401(k)
* Dental insurance
* Flexible schedule
* Health insurance
* Paid time off
* Vision insurance
Experience:
* strategic storytelling: 4 years (Required)
Work Location: In person
$55-65 hourly 60d+ ago
Looking for a job?
Let Zippia find it for you.
Forward Deployed Engineer
Workos
Remote senior deployment engineer job
🚀
WorkOS builds tools and services for developers to help them implement authentication, identity, authorization, and overall enterprise readiness. We're a fully distributed team with employees across North American time zones. We're well-funded, having raised $100m in funding from top investors including Greenoaks Capital, Lachy Groom, and Lightspeed Ventures. Our fast-growing base includes rapidly growing SaaS companies like OpenAI, Cursor, Perplexity, Vercel, Plaid, and hundreds of others.
About the role
As a Forward DeployedEngineer at WorkOS, you'll sit at the intersection of engineering, product, and GTM and work directly with customers to solve real problems and accelerate time-to-value.
This role is for engineers with strong fundamentals, excellent communication, and unusually high agency. You'll be customer-facing, but you'll also build: prototypes, integrations, demos, and pragmatic solutions that unblock adoption and expansion.
Responsibilities ✔️
Work directly with customers to design and implement solutions using WorkOS products
Own technical outcomes end-to-end: discovery, architecture, implementation, iteration, and handoff
Build lightweight tooling, scripts, demos, or proof-of-concepts to speed up customer success
Identify blockers early, propose solutions, and drive momentum without waiting for instructions
Partner with Account Executives and Solutions Engineers to support evaluations and expansions
Share structured feedback with Engineering and Product to improve the platform based on real-world usage
Qualifications
Strong engineering fundamentals and comfort building across APIs and web stacks
Proven ability to communicate clearly with customers and internal teams
High agency: you proactively find problems and solve them (show us what you've built or led from 0→1)
Excited by fast-paced, startup environments and ambiguous problem spaces
Strong curiosity and adoption of modern tooling, including AI-assisted development workflows
Bonus: Experience in developer tools, integrations, customer engineering, or technical consulting
Benefits (US Only) 💖
At WorkOS, we offer resources that emphasize personal and familial well-being. We offer healthcare coverage for you and your family, including medical, dental, and vision. We offer parental leave, paid-time off and fully remote working arrangements.
Benefits include:
- Competitive pay
- Substantial equity grants
- Healthcare insurance (Medical, Dental and Vision) for you and your family
- 401k matching
- Wellness and fitness monthly allowances
- PTO + paid holidays + unlimited sick leave
- Autonomy and flexibility with remote work
Please inquire directly with our recruiting team for benefits available to those working outside the US.
Equal Opportunity Employer
WorkOS is an equal opportunity employer, committed to diversity and inclusiveness. We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability or age.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
#J-18808-Ljbffr
$103k-149k yearly est. 4d ago
Data Engineer
Newcastle Associates, Inc. 4.1
Remote senior deployment engineer job
We're looking for a Data Engineer who loves building and managing data solutions. This is a fully remote, career opportunity with a forward looking marketing company with a national footprint. You'll be working in the Azure cloud, creating reliable pipelines, making sure data is clean and accessible, and helping the business make smarter decisions. If you enjoy solving problems, working with all kinds of data, and collaborating with both technical and non-technical teammates, this role is for you. What You'll Do
Build and manage data pipelines in Azure (Data Factory, Databricks, Synapse, etc.).
Pull in data from different sources-APIs, databases, cloud apps, even streaming data.
Organize, clean, and transform data so it's ready for reporting, dashboards, or advanced analytics.
Keep everything secure and aligned with data governance and compliance rules.
Work with analysts, data scientists, and business teams to make sure they have the data they need.
Troubleshoot issues and keep systems running smoothly.
Automate and improve processes wherever possible.
Stay up to date on new Azure tools and data engineering best practices.
What We're Looking For
3-5+ years working with data engineering and data architecture
Hands-on experience with Azure tools like Data Factory, Data Lakes, Azure SQL, and storage solutions.
Strong ETL background and experience with building data pipeline from scratch.
Good understanding of data modeling, ETL/ELT, and performance tuning.
Experience with data architecture
Bonus points if you have:
Knowledge of data governance and data quality tools.
Some background in machine learning workflows.
Familiarity with other clouds like AWS or GCP.
Why You'll Love It Here You'll play a big role in shaping how we use data across the company. You'll get to work with the latest Azure tools and modern data platforms. Friendly, collaborative team where your ideas actually get heard. Solid pay, benefits, and opportunities to learn and grow.
$102k-143k yearly est. 60d+ ago
SDET - Playwright
Kellymitchell Group 4.5
Senior deployment engineer job in Columbus, OH
Our client is seeking a SDET - Playwright to join their team! This position is located in Columbus, Ohio.
Develop, maintain, and execute automated tests using Playwright (TypeScript/JavaScript)
Build reusable test libraries and utilities, including authentication, pagination, idempotency, rate limiting, and error handling
Define and execute test strategies across unit, integration, contract, and end-to-end test layers
Create robust negative, edge-case, and resilience tests
Apply mocking strategies where appropriate
Manage test data and environments, including fixtures, seeding, and synthetic data, to ensure deterministic and reliable test runs
Integrate automated test suites into CI/CD pipelines (GitHub Actions, Azure DevOps), ensuring fast, stable, and gated deployments
Participate in design and code reviews, advocating for testability, automation best practices, and overall quality
Document test frameworks, patterns, and runbooks; clearly communicate testing outcomes and recommendations to engineering teams
Collaborate cross-functionally with QA, engineering, and product teams to support successful delivery
Desired Skills/Experience:
3+ years of experience as an SDET or QA Automation Engineer with a strong focus on Playwright
Hands-on experience with Playwright using TypeScript/JavaScript, or similar automation frameworks
Experience testing POS systems or complex transactional platforms is preferred
Proven experience configuring CI/CD pipelines, test reporting, and gating on failures or coverage thresholds
Familiarity with mocking frameworks and test data management strategies
Strong debugging skills across logs, traces, and network traffic; comfort using CLI tools such as curl
Excellent written and verbal communication skills with a collaborative, team-first mindset
Benefits:
Medical, Dental, & Vision Insurance Plans
Employee-Owned Profit Sharing (ESOP)
401K offered
The approximate pay range for this position starting at $150,000. Please note that the pay range provided is a good faith estimate. Final compensation may vary based on factors including but not limited to background, knowledge, skills, and location. We comply with local wage minimums.
$150k yearly 22h ago
ML Engineer - LLM Storytelling & Personalization
Spotify
Remote senior deployment engineer job
A leading audio streaming service seeks a Machine Learning Engineer to design innovative LLM-based solutions to enhance storytelling for users. You will collaborate with cross-functional teams to develop features that personalize user experiences. Ideal candidates have a strong background in machine learning, natural language processing, and experience in production ML systems. This remote position allows for flexibility within the North America region, with competitive compensation and significant benefits offered.
#J-18808-Ljbffr
$74k-100k yearly est. 1d ago
Software Engineer
Heitmeyer Consulting
Senior deployment engineer job in Columbus, OH
This hybrid role will serve as the Software Engineer to development of microservices and integrations into the new deposit product platform. You will be part of a team of engineers to ensure scalable, secure and performing solutions in a x-matrix environment while confirming all regulatory requirements are met.
Top Required Skills:
5+ years in Java-based development ability deliver on technical requirements and produce scalable solutions.
Technical expertise with Java, Spring Boot, building microservices, API development (Apigee), CI/CD pipelines (Jenkins, Git Actions), Containerization (Open Shift), Streaming data (Kafka), Gen AI (CoPilot, Python, Prompt Engineering), developing ETL processes.
Proven experience in development work to build integration solution with microservices and APIs within agile environment.
Familiarity with large-scale transformation efforts or similar modular banking platforms.
Support CI/CD pipelines along with automation to support productivity.
Nice-to-have:
Domain experience with consumer deposit products and pricing beneficial.
Background with additional tech tools that include Flink and Redpanda.
Banking experience preferred but not required.
Should have experience working in highly regulated industry with large focus on risk/compliance requirements within SDLC.
Top Responsibilities:
Develop integration and microservice solutions using tech stack that includes Java, Spring Boot, Kafka, Apigee (API), Git Actions, Splunk and Open Shift.
Promote automation and leveraging of Gen AI tools for productivity - CoPilot, Python, Prompt Engineering.
Write integration and unit tests using TDD/BDD while enforcing code quality, and DevOps practices.
$64k-85k yearly est. 4d ago
Software Defined Vehicle Engineer
Global Connect Technologies 4.4
Senior deployment engineer job in Raymond, OH
Job Title: Software Defined Vehicle (SDV) Consultant
Employment Type: Full-Time
We are looking for an experienced Software Defined Vehicle (SDV) Consultant to support the development and maintenance of a secure, scalable vehicle software toolchain. This role involves close collaboration with IT and engineering teams, focusing on DevOps, CI/CD, cybersecurity, and automotive software systems.
Key Responsibilities
Design, implement, and maintain vehicle software development toolchains
Support on-premise server infrastructure and CI/CD pipelines
Integrate DevSecOps and cybersecurity best practices
Support OTA infrastructure and embedded operating systems
Create architecture diagrams and support Agile development
Ensure compliance with ASPICE and ISO 26262 (ASIL-B) standards
Required Skills
Strong experience in software development, DevOps, and CI/CD
Knowledge of Linux (Ubuntu), Windows, and RTOS
Hands-on experience with Docker and containerized platforms
Understanding of vehicle architecture, integrated controls, and functional safety
Familiarity with Agile methodologies
Preferred Skills
Cloud platforms (AWS, Azure, GCP)
Automotive protocols (CAN, LIN, Ethernet)
Experience in hybrid or cloud-based automotive environments
$66k-90k yearly est. 2d ago
CAE Engineer
Pentangle Tech Services | P5 Group
Senior deployment engineer job in Raymond, OH
BS Degree in ME. Minimum of 6 plus years of industry experience as CAE analyst and specific solver software usage depending on department specialty area. Minimum of 5 years of experience with LS-DYNA using pre-processor / post-processor tools (ANSA and Meta-Post) for complex vehicle system level CAE model construction, visualization, and analysis. Related advanced degree may be substituted for 2 years of experience.
Job Description Details:
Prepare and perform crash safety-related impact simulations using LS-DYNA. Evaluate the results of the simulations through careful analysis of crash simulations.
Measure these simulations against the target automobile performance criteria based on advanced safety standards such as the New Car Assessment Program and Insurance Institute for Highway Safety criteria in addition to Honda's internal safety standards.
Determine the appropriate countermeasures to meet the criteria if simulation outcomes are below target performance.
Communicate any recommendations to the appropriate Engineering teams with Honda.
Create detailed engineering documentation of simulation analysis by generating written reports and working with design and test engineers
$62k-83k yearly est. 4d ago
Senior Forward Deployed Engineer
Triedge Investments
Remote senior deployment engineer job
At TriEdge Investments, we build technology that helps real businesses operate better. Working across a portfolio of 30+ companies and select partners, we embed with operators to design and ship automation that moves the numbers, including throughput, cost, and reliability. Over time, we turn those hard-won solutions into platform tools that scale across companies, building an intelligent system that brings modern automation to businesses typically overlooked by traditional software.
As a Senior Forward Deployed Software Engineer, you'll be embedded with operators and clients, working in the field to uncover challenges and deliver mission-critical products. Unlike traditional engineering roles, you won't just be handed a spec-you'll work directly with users to shape requirements, design systems, and ship production code that solves real problems. You'll combine deep technical expertise with client-facing problem solving, ensuring our products are pragmatic, scalable, and deliver meaningful business impact.
What You'll Do
Embed with operators and clients to diagnose workflows, pain points, and opportunities for automation
Design, build, and deliver end-to-end systems across on-premises server, web, cloud, and data environments
Develop robust APIs, agentic workflows, and AI/ML components that drive decision-making and automation
Rapidly prototype, develop and iterate on solutions in client environments, ensuring adoption and impact
Optimize systems for performance, scale, and resilience in production
Collaborate cross-functionally with product managers, data engineers, and operators to align delivery with business outcomes
Troubleshoot and resolve real-time production issues in client deployments
Act as a trusted advisor to executives and operators while staying hands-on in code
About You
5+ years of full-stack software engineering experience in production environments
Proven ability to deliver directly to clients in ambiguous, high-stakes settings
Comfortable across the AI stack (e.g., Cursor, Claude Code, Vercel, Supabase, etc) and have proven experience in leveraging AI context engineering practice in small scale projects.
Strong API development skills and experience with modern frameworks
Practical experience integrating AI/ML systems into production products
Clear communicator able to advise both engineers and business stakeholders directly
Pragmatic problem solver with a bias toward ownership and forward deployed execution
Resourceful when working across diverse stacks, legacy systems, or compliance-driven environments
Appropriately fluent in AI-assisted development
Why Join TriEdge Investments
Play a key role in reshaping private equity with AI-powered platforms that scale
Help digitize and transform real businesses, not just build another SaaS product
Be at the forefront of AI and automation across financial services and healthcare
Work closely with operators and executives to deliver solutions that matter
Join a culture that values clarity, speed, reuse, humility, and ownership
Location
TriEdge Investments is headquartered in New York's Hudson Yards. We've designed our workplace to foster the collaboration and spontaneous interactions that drive innovation. Our team works in-office, with flexibility to work remotely when needed.
What We Offer
Pay Transparency
The annual base salary range for this position is $200,000-$250,000. Actual compensation offered may vary from the posted hiring range based on experience and skill level, among other factors. This role is also eligible for a discretionary fund performance bonus.
Benefits
$0 deductible and 100% employee-covered health, vision, and dental insurance
401(k) matching program of 50% up to 6% of annual salary
Unlimited PTO
Beautiful custom-built office in NYC with daily lunch
Please note: We are proud to be an equal opportunity employer, and we are committed to diversity and inclusion. All qualified applicants will receive consideration for employment without regard to race, color, religion, ethnicity, sex, age, national origin, citizenship status, disability, marital status, partnership status, sexual orientation, gender identity and expression, military or veteran status, or any other characteristic protected by federal, state or local law.
$200k-250k yearly Auto-Apply 60d+ ago
Senior Forward Deployed Engineer
Recruiting From Scratch
Remote senior deployment engineer job
Who is Recruiting from Scratch: Recruiting from Scratch is a talent firm that focuses on placing the best candidate for our clients. Our team is 100% remote and we work with teams across North America, South America, and Europe to help them hire. https://www.recruitingfromscratch.com/
Title of Role: Senior Forward DeployedEngineer
Location: New York City, NY (On-site)
Company Stage of Funding: Seed Stage, Well-Funded with Rapid Traction
Office Type: On-site, Full-Time
Salary: $200K base + Equity Opportunities
Company Description
Our client is a fast-growing, venture-backed startup building AI-powered workflow systems for industries that are critical yet often overlooked-such as manufacturing, logistics, and construction. Their platform connects messy enterprise data with advanced machine learning models to deliver measurable operational improvements at scale. Already working with Fortune 500 companies and large enterprises, they're on a strong growth trajectory with proven product-market fit and rapid customer adoption.
This is a unique opportunity to join an early team (≈10 employees) and work directly at the intersection of AI, enterprise systems, and real-world operations.
What You Will Do
Work directly with customers (including ~20% travel) to translate business challenges into AI-driven solutions.
Design, prototype, and ship end-to-end workflows using Go, TypeScript, and React.
Build and maintain integrations with major enterprise platforms (SAP, Oracle, Salesforce, and others).
Run rapid feedback loops to drive user adoption and maximize ROI.
Partner closely with executives, operators, and internal teams to scope, deliver, and scale solutions that make an impact.
Ideal Candidate Background
5-7 years of professional software engineering experience in high-paced, demanding environments.
Strong experience with backend and full-stack development (Go, TypeScript, React or similar).
Comfort working directly with customers and owning solutions end-to-end.
Excellent problem-solving skills and ability to navigate ambiguous, complex challenges.
Preferred
Experience building or deploying enterprise-scale systems in industries such as manufacturing, logistics, or supply chain.
Familiarity with large-scale ERP or enterprise system integrations (SAP, Oracle, Salesforce, etc.).
Startup experience or background working in environments where speed, adaptability, and ownership are key.
Compensation and Benefits
Salary: $200K base
Equity: Significant ownership opportunities in an early-stage startup
Comprehensive benefits package (health, vision, dental, and more)
Visa and green card sponsorship available
Collaborative on-site work environment in New York City's SoHo neighborhood
Opportunity to shape the future of an AI-first company while working on impactful real-world deployments
$200k yearly 60d+ ago
Senior Operations Technology Engineer
Antora Energy
Remote senior deployment engineer job
Antora builds and deploys thermal batteries to power always-on industrial operations with low-cost energy. Factory-built in the United States, Antora's modular thermal batteries deliver reliable heat and power, enabling industrial facilities of any size to decarbonize predictably and profitably. Antora is electrifying global industry while supporting U.S. manufacturing jobs, lowering costs for energy consumers, and enhancing the competitiveness of American industry.
We are growing our company with people who put team and mission first, value connection through laughter and joy, and build with humility and openness. We are committed to continue building a diverse, passionate, and creative team dedicated to a future where every industrial facility, everywhere on earth, is powered by abundant, clean, low-cost energy.
Position Summary
The Operation Technology (OT) Engineer will be responsible for Antora OT Networks including identifying and developing company standards, designing and deploying networks at sites, and maintaining sites.
Roles & Responsibilities
Assess Cyber Security needs for Antora's OT Networks, and develop standards to cover those.
Work with Controls Engineers on projects to design and deploy Firewalls/Switches/Servers/Workstations to meet project requirement availability, maintainability and security requirements
Maintain deployed OT Networks, identify required upgrades including those needed to keep maintainability.
Determine and manage key vendors for security, routing, switching devices, and computing for devices used in OT Networks
Hire and manage consultants/service providers as needed for Antora OT Network needs.
Key Qualifications
Bachelors or equivalent in an engineering field (e.g. mechanical, industrial, process, electrical, software, automation etc.) and/or 7+ years' experience in industrial applications (oil & gas, chemicals, power plants, etc.). Strong IT experience with an interest in OT Technologies may be considered as well.
Experience with industrial control OT networks, including designing with Cyber Security in mind.
Experience designing high availability networks.
Experience deploying and maintaining networks with security zones, managed switches and remote access.
Critical thinking and problem solving skills, ability to think about problems from a first-principles perspective.
Experience working independently, as well as working in a team-orientated and fast-paced startup-like environment.
Additional Qualifications Desired
Experience with PLCs and SCADA
Experience with Hypervisors/Virtual Machines
Scripting/Automation Tooling experience
Work Location: Remote
Salary Range: $183,000 USD - $230,000 USD
Salary Basis: Annual
Please note that the salary range listed above reflects Antora Energy's estimated pay for this position. The actual salary offered will be within the posted range and determined based on several factors including but not limited to a candidate's experiences, credentials and expertise, as they pertain to the position's requirements.
In addition to a competitive base salary, Antora Energy's Total Rewards program includes equity compensation in the form of stock options, a premium health benefits package with life and disability insurance, a 401K plan with employer contributions, flexible spending accounts, and an industry leading paid-time-off policy that features flexible and inclusive holiday observance, as well as paid volunteer time off.
When it comes to stopping climate change, we need everyone. We believe that having a diversity of backgrounds and experiences strengthens all of us, and we strive to create an environment where every one of us is empowered to create meaningful change.
$183k-230k yearly Auto-Apply 60d+ ago
Senior Large Language Model (LLM) Operations Engineer
N-Power Medicine
Remote senior deployment engineer job
About N-Power MedicineN-Power Medicine aims to establish a new paradigm in drug development by reinventing the ‘how' and transforming clinical trials through better integration with clinical practice, ensuring broader participation by physicians and patients. We are building an exceptional multi-disciplinary team with diverse expertise spanning healthcare, engineering, technology and regulatory, and with people who share our core value of Empowering Community through generosity, curiosity and humility. We are working with urgency to bring better therapies to patients faster.
Position OverviewN-Power Medicine seeks a Senior LLM Operations Engineer to execute our technical strategy for scaling AI innovation in clinical variable abstraction and note generation. You will be responsible for architecting, building, and owning the production systems, infrastructure, and development paradigms that enable our AI-powered products. You will own the technical direction for our MLOps and LLM Ops roadmap, ensuring robust, scalable, and automated deployment of our machine learning solutions.You'll act as a key technical leader, enabling the AI & Data Science team to rapidly iterate and deploy high-impact solutions while upholding rigorous ethical and quality standards. This role offers the chance to solve complex infrastructure and automation challenges, shape the company's long-term AI operational strategy, and deliver significant healthcare impact by building the factory for our AI models.The ideal candidate is a recognized technical expert who excels at building scalable systems, driving automation, and tackling complex system design through ambiguity. Exceptional communication and strategic thinking are critical for success.This position is remote within the United States.
Role Objectives and Responsibilities-Architect and spearhead the development of cutting-edge, scalable AI infrastructure, including novel human-in-the-loop (HITL) paradigms, ensuring our systems learn effectively from feedback.-Lead the technical design and implementation of core MLOps components and systems for our LLMs-including CI/CD, monitoring, and automated feedback loops-ensuring robustness, scalability, and adherence to software engineering best practices.-Define and shape solutions for complex automation and deployment challenges, enabling the strategic application of our cutting-edge AI.-Drive technical alignment and integration with AI Data Science and Software Engineering teams, ensuring the seamless transition of AI solutions from research into production environments and influencing architectural standards.-Define and establish standards for the rigorous validation, monitoring, and lifecycle management of AI products, ensuring continuous accuracy improvement and reliability in production.-Define, champion, and drive adoption of best practices for MLOps, including model/data versioning, experiment tracking, and reproducibility within the AI/ML domain; actively mentor others.-Identify, champion, and integrate state-of-the-art MLOps technologies and frameworks, driving innovation and maintaining our technical edge in AI deployment.-Provide expert guidance on applying safeguards and protections (HIPAA, privacy laws) to our model deployment and data handling pipelines; champion and uphold the highest compliance, quality, and security standards.
Education, Experience, Behavioral Competencies, & Skills-3+ years of professional experience in an MLOps, DevOps, or Software Engineering role with a focus on machine learning systems.-MSc/BSc graduate in engineering, computer science, or a relevant field, with extensive equivalent experience. A PhD is a plus.-Deep, hands-on expertise in Python and proficiency in modern software development practices.-Hands-on experience with a major cloud platform (AWS, GCP, or Azure).-Strong experience with containerization and orchestration technologies (Docker, Kubernetes).-Proven experience building and maintaining CI/CD pipelines for complex applications (e.g., GitHub Actions, Jenkins), particularly those that include data + model versioning.-A proven track record of technical leadership and high-impact contributions in building and scaling production machine learning systems.-Proven ability to independently define, architect, and lead solutions for complex, ambiguous infrastructure problems, clearly articulating business value.-Demonstrated ability to lead the decomposition of large-scale systems and guide teams in delivering incremental solutions.-Track record of designing sustainable, reusable, and high-quality code and influencing team/organizational standards.-Exceptional written, verbal, and presentation skills; ability to influence stakeholders at all levels.-Recognized technical leader, proactive, strategic thinker, and takes end-to-end ownership.-Generous, Curious, and Humble.
Preferred Qualifications-Direct experience productionizing Large Language Models (LLMs), including knowledge of prompting strategies, RAG, and fine-tuning.-Deep expertise with the Databricks platform, including MLflow, Delta Tables, and Unity Catalog.-Experience building data annotation and Human-in-the-Loop (HITL) systems from the ground up.-Familiarity with vector databases (e.g., Pinecone, Chroma) and model serving frameworks (e.g., Ray Serve, Triton, and -Databricks/Mosaic).-Experience working in a regulated environment, particularly with healthcare data (HIPAA).
Travel Requirements Ability to travel, up to 10%, may be required
Pay InformationThe expected salary range for this position is $165,000 and $205,000. Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. N-Power Medicine (NPM) offers equity at hire as well as a discretionary annual bonus which may be available based on Company performance. This position is eligible for company benefits.
More About Us:We are a mission-driven, well-funded, rapidly growing company, eager to attract passionate professionals offering a highly attractive compensation package with a balanced and flexible work environment, competitive industry benefits as well as a 401K plan and other great company “perks.”
We are an Equal Opportunity Employer and value diversity at our company. We do not discriminate based upon race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics.
Covid-19 Policy - The Company is committed to providing and maintaining a safe workplace, and to safeguard the health and well-being of our employees, families, visitors, and the community. While vaccination remains one of the most important tools in advancing the health and safety of employees and promoting the efficiency of workplaces, we are now in a different phase of our response when these measures are no longer necessary. We currently do not have mandatory COVID-19 vaccination requirements for our employees and contractors, as the COVID-19 public health emergency has ended. However, there are certain N-Power Medicine employees and contractors who, based on their role, will be required to continue to follow our 2021 COVID-19 vaccination and other requirements as mandated by N-Power Medicine's partners they serve. We reserve the right to modify or amend our corporate policy at any time.
Applicants must be currently authorized to work in the U.S. on a full-time basis. The Company will not sponsor applicants for work visas.
Notice on fraudulent job offers: Only positions posted on ****************************************** site are legitimate. Please be mindful of recruitment fraud and job scams.
$165k-205k yearly Auto-Apply 60d+ ago
Senior Technical Operations Engineer
Inspiren
Remote senior deployment engineer job
About the company
Inspiren offers the most complete and connected ecosystem in senior living. Founded by Michael Wang, a former Green Beret turned cardiothoracic nurse, Inspiren proves that compassionate care and technology can coexist - bringing peace of mind to residents, families, and staff.
Our integrated solutions seamlessly fit into existing workflows, capturing everything happening within a community. Backed by nurse specialists and powerful analytics, we provide the data operators need to make informed clinical and operational decisions - driving efficiency, profitability, and better care outcomes.
About the role
We are seeking a highly skilled Senior Technical Operations Engineer to join our dynamic team. In this role, you will work directly with customers and their IT and Network providers, internal teams, and fulfillment operations in diverse environments, leveraging your technical expertise to provision, deploy, customize, and maintain our hardware and software solutions, including those not owned by Inspiren, but necessary to have uninterrupted uptime. You will play a crucial role in ensuring customer satisfaction and success, while also acting as a bridge between the customer, our implementation team, our customer success team, our product support, and our internal development teams.
Your responsibilities will span from supporting new implementations, remote device provisioning at fulfillment locations, troubleshooting and diagnosing technical issues, and strategizing for optimal performance. Strong leadership skills, proficiency in system management, and dedication to excellence in customer satisfaction are a must in this role.
What you'll do
Remote Device Provisioning: Coordinate and execute remote provisioning processes for hardware devices at warehouse locations, collaborating closely with the warehouse fulfillment team. Utilize command-line tools on Linux or MacOS to run Python and Bash scripts for bulk device updates and configuration. Issue AWS CLI commands to manage and provision IoT and edge devices, ensuring successful deployment and accurate configuration.
Customer Engagement: Collaborate with customers to understand their technical requirements related to the deployment, customization, and maintenance of Inspiren's hardware and software solutions. Provide on-site technical support, where necessary, to ensure successful implementation and usage of our products. Ensure continuous adherence to SLAs and provide timely written and verbal communications, along with the ability to effectively communicate with various stakeholders.
Deployment & Configuration: Partner with the implementation managers on the installation, configuration, and optimization of software solutions in customer environments. Ensure systems are set up and running smoothly and efficiently.
Troubleshooting & Support: Quickly diagnose and resolve technical issues that arise during deployment and usage, ensuring their needs are met and issues are promptly addressed. Act as the first-level technical support via phone, email, or onsite visits and escalate to the engineering teams.
Customization: Work with customers to tailor our solutions to their specific requirements. Develop and implement custom features and integrations as needed.
Network Monitoring & Maintenance: Monitor network performance and troubleshoot issues to ensure optimal operation. Conduct regular network assessments and recommend improvements/optimizations to maintain performance standards.
Documentation: Create and maintain comprehensive documentation related to deployment processes, customer configurations, and troubleshooting procedures.
Training & Education: Conduct training sessions for customer teams to ensure they are proficient in using our solutions and to maximize the value they derive from our products.
Leadership: Demonstrates strong leadership by collaborating effectively with cross-functional team members to develop scalable processes to enhance customer support and service stability. Exhibits proactive approach in evolving processes and experience guiding and supporting junior team members, fostering team growth, while encouraging culture of action and accountability.
About you
Bachelor's degree in Computer Science, Information Technology, or a related field.
Proven experience in a tech/IT support role, preferably in a customer facing capacity.
Related work experience provisioning and deploying hardware within customer's network with remote management.
Proficiency with command-line interfaces, Linux/MacOS environments, AWS CLI, and scripting languages (Python, Bash)
Familiarity with typical operating systems (Windows, MacOS, Android, etc.), common software applications, and hardware troubleshooting.
Proven experience with network infrastructure, including routers, switches, firewalls, and VPNs and network monitoring tools (SolarWinds, Wireshark, etc.)
Passionate about service availability and quality, with strong problem-solving abilities and attention to detail.
Strong communication and interpersonal skills, with the ability to build relationships with customers, and the ability to thrive in a fast-paced startup environment.
Willingness to travel as required to meet customer needs and work on-call shifts for issue resolution.
Details
The annual salary for this role is $165,000-$190,000+ equity + benefits (including medical, dental, and vision)
This role will require up to 50% travel.
Flexible PTO
Location: Remote, US
Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status.
$165k-190k yearly Auto-Apply 60d+ ago
Senior HPC Operations Engineer
Lambda 4.2
Remote senior deployment engineer job
Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mission is to make compute as ubiquitous as electricity and give everyone the power of superintelligence. One person, one GPU.
If you'd like to build the world's best AI cloud, join us.
*Note: This position requires presence in our San Francisco/San Jose or Bellevue office location 4 days per week; Lambda's designated work from home day is currently Tuesday.
Engineering at Lambda is responsible for building and scaling our cloud offering. Our scope includes the Lambda website, cloud APIs and systems as well as internal tooling for system deployment, management and maintenance.
What You'll Do
Remotely deploy and configure large-scale HPC clusters for AI workloads (up to many thousands of nodes)
Remotely install and configure operating systems, firmware, software, and networking on HPC clusters both manually and using automation tools
Troubleshoot and resolve HPC cluster issues working closely with physical deployment teams on-site
Provide clear and detailed requirements back to other engineering teams on gaps and improvement areas, specifically in the areas of simplification, stability, and operational efficiency
Contribute to the creation of and maintenance of Standard Operating Procedures
Provide regular and well-communicated updates to project leads throughout each deployment
Mentor and assist less experienced team members
Stay up-to-date on the latest HPC/AI technologies and best practices
You
Are a deeply experienced HPC engineer comfortable with logical provisioning of a cluster
Have a strong understanding of HPC/AI architecture, operating systems, firmware, software, and networking
10+ years of experience in deploying and configuring HPC clusters for AI workloads
Have an innate attention to detail
Have experience with Bright Cluster Manager or similar cluster management tools
Are in expert in configuring and troubleshooting:
SFP+ fiber, Infiniband (IB), and 100 GbE network fabrics
Ethernet, switching, power infrastructure, GPU direct, RDMA, NCCL, Horovod environments
Linux based compute nodes, firmware updates, driver installation
SLURM, Kubernetes, or other job scheduling systems
Work well under deadlines and structured project plans also knowing when and how to ask for changes to project timelines
Have excellent problem solving and troubleshooting skills
Have flexibility to travel to our North American data centers as on-site needs arise or as part of training exercises
Are able to work independently and as part of a team
Are comfortable mentoring and supporting junior HPC engineers on cluster deployments
Nice to Have
Experience with machine learning and deep learning frameworks (PyTorch, Tensorflow) and benchmarking tools (DeepSpeed, MLPerf)
Experience with containerization technologies ( Docker, Kubernetes)
Experience working with the technologies that underpin our cloud business ( GPU acceleration, virtualization, and cloud computing)
Keen situational awareness in customer situations, employing diplomacy and tact
Bachelors degree in EE, CS, Physics, Mathematics, or equivalent work experience
Salary Range Information
The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.
About Lambda
Founded in 2012, with 500+ employees, and growing fast
Our investors notably include TWG Global, US Innovative Technology Fund (USIT), Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, Gradient Ventures, Mercato Partners, SVB, 1517, and Crescent Cove
We have research papers accepted at top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG
Our values are publicly available: *************************
We offer generous cash & equity compensation
Health, dental, and vision coverage for you and your dependents
Wellness and commuter stipends for select roles
401k Plan with 2% company match (USA employees)
Flexible paid time off plan that we all actually use
A Final Note:
You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.
Equal Opportunity Employer
Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.
$99k-145k yearly est. Auto-Apply 60d+ ago
Senior HPC Operations Engineer
Lambda Labs
Remote senior deployment engineer job
Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mission is to make compute as ubiquitous as electricity and give everyone the power of superintelligence. One person, one GPU.
If you'd like to build the world's best AI cloud, join us.
* Note: This position requires presence in our San Francisco/San Jose or Bellevue office location 4 days per week; Lambda's designated work from home day is currently Tuesday.
Engineering at Lambda is responsible for building and scaling our cloud offering. Our scope includes the Lambda website, cloud APIs and systems as well as internal tooling for system deployment, management and maintenance.
What You'll Do
* Remotely deploy and configure large-scale HPC clusters for AI workloads (up to many thousands of nodes)
* Remotely install and configure operating systems, firmware, software, and networking on HPC clusters both manually and using automation tools
* Troubleshoot and resolve HPC cluster issues working closely with physical deployment teams on-site
* Provide clear and detailed requirements back to other engineering teams on gaps and improvement areas, specifically in the areas of simplification, stability, and operational efficiency
* Contribute to the creation of and maintenance of Standard Operating Procedures
* Provide regular and well-communicated updates to project leads throughout each deployment
* Mentor and assist less experienced team members
* Stay up-to-date on the latest HPC/AI technologies and best practices
You
* Are a deeply experienced HPC engineer comfortable with logical provisioning of a cluster
* Have a strong understanding of HPC/AI architecture, operating systems, firmware, software, and networking
* 10+ years of experience in deploying and configuring HPC clusters for AI workloads
* Have an innate attention to detail
* Have experience with Bright Cluster Manager or similar cluster management tools
* Are in expert in configuring and troubleshooting:
* SFP+ fiber, Infiniband (IB), and 100 GbE network fabrics
* Ethernet, switching, power infrastructure, GPU direct, RDMA, NCCL, Horovod environments
* Linux based compute nodes, firmware updates, driver installation
* SLURM, Kubernetes, or other job scheduling systems
* Work well under deadlines and structured project plans also knowing when and how to ask for changes to project timelines
* Have excellent problem solving and troubleshooting skills
* Have flexibility to travel to our North American data centers as on-site needs arise or as part of training exercises
* Are able to work independently and as part of a team
* Are comfortable mentoring and supporting junior HPC engineers on cluster deployments
Nice to Have
* Experience with machine learning and deep learning frameworks (PyTorch, Tensorflow) and benchmarking tools (DeepSpeed, MLPerf)
* Experience with containerization technologies ( Docker, Kubernetes)
* Experience working with the technologies that underpin our cloud business ( GPU acceleration, virtualization, and cloud computing)
* Keen situational awareness in customer situations, employing diplomacy and tact
* Bachelors degree in EE, CS, Physics, Mathematics, or equivalent work experience
Salary Range Information
The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.
About Lambda
* Founded in 2012, with 500+ employees, and growing fast
* Our investors notably include TWG Global, US Innovative Technology Fund (USIT), Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, Gradient Ventures, Mercato Partners, SVB, 1517, and Crescent Cove
* We have research papers accepted at top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG
* Our values are publicly available: *************************
* We offer generous cash & equity compensation
* Health, dental, and vision coverage for you and your dependents
* Wellness and commuter stipends for select roles
* 401k Plan with 2% company match (USA employees)
* Flexible paid time off plan that we all actually use
A Final Note:
You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.
Equal Opportunity Employer
Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.
$95k-132k yearly est. 60d+ ago
Operations Engineer III - Senior TechOps Engineer
Rldatix
Remote senior deployment engineer job
Operations Engineer III - Operations (Senior TechOps Engineer)
Every single day around the world, thousands of patients are harmed from care delivery errors, many of which are preventable. We want to change that. RLDatix is on a mission to improve healthcare by enabling a world where patients receive the best and safest care possible. Trusted by thousands of clients around the world, our connected healthcare operations platform combines software and trusted services to empower organisations with critical data insights across risk, safety, compliance, provider lifecycle and workforce management. Our user-centric approach provides a holistic, real-time view of healthcare operations, connecting disparate information across the enterprise - thus giving organisational leadership the contextualised data they need to make better informed decisions.
RLDatix is truly global, with over 2,000 employees across the UK, Europe, Middle East, Australia, Canada, and the United States. Our strategy is fuelled by organic and inorganic growth that brings together the brightest minds and the latest technology - including AI - to deliver marketing leading solutions for our clients. We are looking for people to join our team who are passionate about making a positive change in healthcare. Join us as we work towards our vision of safer, better healthcare for all.
TEAM - It all starts with Team... we win 'together'... with our customers. In this role, you'll collaborate across multiple departments and teams, building unified approaches that strengthen our collective impact on healthcare safety.
RESPECT - Everyone is accepted and expected to be a 'pro' (fessional). You'll demonstrate professional excellence whilst instructing and enabling teams to deliver world-class outcomes that reflect our commitment to healthcare transformation.
LEAD - We innovate to solve problems and believe that everyone can lead by making people and situations better. You'll lead through influence and expertise, innovating solutions that make complex landscapes more navigable for our teams and clients
DELIVER - We set big goals (Rocks) and focus on making progress every day. You'll drive measurable improvements across the organisation whilst maintaining daily focus on systematic progress that enhances our data maturity.
What You Will Do:
As an Operations Engineer III, you will optimize data migration and archival implementation workflows by analyzing processes, identifying inefficiencies, and implementing improvements that elevate team productivity and quality by looking at our implementation process with a software engineering lens. Leveraging your expertise, you will collaborate with engineering, product, and leadership teams to deliver help automate and modernize ETL/ELT processes. Your work includes mapping workflows, producing analysis documentation, and supporting process and system improvements. Your contributions will directly influence immediate project outcomes and operational excellence within the engineering function. This position will serve as the equivalent of a Forward DeployedEngineer (FDE), except the "customer" is our own implementation team.
Key Responsibilities: Process Analysis & Workflow Optimization
· Analyze implementation workflows and operational data to identify process inefficiencies and propose targeted improvements.
· Document process bottlenecks and articulate clear optimization requirements for engineering operations teams.
· Implement advanced analysis techniques to support continuous workflow enhancement and uphold process quality standards.
· Provide first-line triage and resolution for migration and archival pipelines.
· Diagnose and remediate ETL, extraction, and scripting issues including SQL tuning, query optimization, indexing, partitioning, and job orchestration.
· Escalate to development/tooling teams only with clear reproduction steps, impact analysis, and proposed resolution paths.
Technical Implementation & Automation
· Deploy and maintain engineering tools, automation systems, and integrations to support seamless operational processes.
· Identify and address technical gaps in implementation processes.
· Work extensively with relational databases such as Oracle, MS SQL Server, Azure Synapse Analytics and others.
· Perform advanced diagnostics, performance tuning, and query optimization.
Stakeholder Management & Engagement
· Facilitate effective communication between engineering, product, and leadership teams to capture diverse requirements.
· Build collaborative relationships to coordinate internal stakeholders and ensure operational solutions align with business objectives.
· Produce stakeholder engagement plans that foster productive working environments and support collaborative project delivery.
Experience You Will Need:
· 7+ years of experience with SQL optimization in multiple DBMS such as Oracle, SQL Server.
· 5+ years of experience with OLAP-specific SQL query optimization and Azure Synapse Analytics (or non-cloud equivalents).
· 3+ years of experience with Azure Data Factory or similar data movement platforms.
· Experience with scripting languages like PowerShell as well C#/.NET is a big plus.
Essential Requirements:
· Strong stakeholder management skills, including the ability to articulate requirements, coordinate relationship-building, and facilitate collaborative project environments across engineering and product teams.
· Advanced technical implementation skills to deploy, automate, and optimize data engineering processes while maintaining high quality standards across diverse platforms.
· Demonstrated business insight with the ability to connect operational improvements to measurable business outcomes and support strategic initiatives through data-driven analysis.
What You Will Gain:
In this impactful Operations Engineer III role, you'll accelerate your technical growth by optimizing data engineering workflows and driving efficiency in an environment that processes petabytes of data. You'll have the opportunity to solve complex challenges, collaborate with cross-functional teams, and directly contribute to continuous improvement initiatives that enhance team productivity. This position empowers you to build expertise in automation and process optimisation, while influencing engineering operations within a supportive, innovation-driven environment. You'll develop valuable analytical and problem-solving skills, making a measurable difference in the quality and effectiveness of engineering delivery.
Team Structure & Size: You will be the first and leading member of a new team that will bring an engineering mindset to our implementation processes. This role will report to our VP of Engineering on our data platform.
Geographic Scope: This role is for our North American team. We have offices in Chicago, Boston, Burlington (VT). You can work remotely or from any of our offices.
Educational & Professional Requirements: 5+ years in an engineering role
RLDatix is an equal opportunity employer, and our employment decisions are made without regard to race, colour, religion, age, gender, national origin, disability, handicap, marital status or any other status or condition.
$95k-132k yearly est. 55d ago
Senior AI Engineer, GenAI & ML Evaluation Frameworks - Grafana Ops, AI/ML | USA | Remote
Grafana Labs 3.6
Remote senior deployment engineer job
Grafana Labs is a remote-first, open-source powerhouse. There are more than 20M users of Grafana, the open source visualization tool, around the globe, monitoring everything from beehives to climate change in the Alps. The instantly recognizable dashboards have been spotted everywhere from a NASA launch and Minecraft HQ to Wimbledon and the Tour de France. Grafana Labs also helps more than 3,000 companies -- including Bloomberg, JPMorgan Chase, and eBay -- manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or self-managed with the Grafana Enterprise Stack, both featuring scalable metrics (Grafana Mimir), logs (Grafana Loki), and traces (Grafana Tempo).
We're scaling fast and staying true to what makes us different: an open-source legacy, a global collaborative culture, and a passion for meaningful work. Our team thrives in an innovation-driven environment where transparency, autonomy, and trust fuel everything we do.
You may not meet every requirement, and that's okay. If this role excites you, we'd love you to raise your hand for what could be a truly career-defining opportunity.
This is a remote opportunity and we would be interested in applicants from USA time zones only at this time.
SeniorEngineer - GenAI & ML Evaluation Frameworks
The Opportunity:
At Grafana, we build observability tools that help users understand, respond to, and improve their systems - regardless of scale, complexity, or tech stack. The Grafana AI teams play a key role in this mission by helping users make sense of complex observability data through AI-driven features. These capabilities reduce toil, lower the barrier of domain expertise, and surface meaningful signals from noisy environments.
We are looking for an experienced engineer with expertise in evaluating Generative AI systems, particularly Large Language Models (LLMs), to help us build and evolve our internal evaluation frameworks, and/or integrate existing best-of-breed tools. This role involves designing and scaling automated evaluation pipelines, integrating them into CI/CD workflows, and defining metrics that reflect both product goals and model behavior. As the team matures, there's a broad opportunity to expand or redefine this role based on impact and initiative.
What You'll Be Doing:
Design and implement robust evaluation frameworks for GenAI and LLM-based systems, including golden test sets, regression tracking, LLM-as-judge methods, and structured output verification.
Develop tooling to enable automated, low-friction evaluation of model outputs, prompts, and agent behaviors.
Define and refine metrics for both structure and semantics, ensuring alignment with realistic use cases and operational constraints.
Lead the development of dataset management processes and guide teams across Grafana in best practices for GenAI evaluation.
What Makes You a Great Fit:
Experience designing and implementing evaluation frameworks for AI/ML systems.
Familiarity with prompt engineering, structured output evaluation, and context-window management in LLM systems.
High autonomy to collaborate and translate team goals into clear, testable criteria supported by effective tooling.
Bonus Points For:
Experience working in environments with rapid iteration and experimental development.
A pragmatic mindset that values reproducibility, developer experience, and thoughtful trade-offs when scaling GenAI systems.
A passion for minimizing human toil and building AI systems that actively support engineers.
Compensation & Rewards:
In the United States, the Base compensation range for this role is USD 154,445 - USD 185,334. Actual compensation may vary based on level, experience, and skillset as assessed in the interview process. Benefits include equity, bonus (if applicable) and other benefits listed here.
All of our roles include Restricted Stock Units (RSUs), giving every team member ownership in Grafana Labs' success. We believe in shared outcomes-RSUs help us stay aligned and invested as we scale globally.
*Compensation ranges are country specific. If you are applying for this role from a different location than listed above, your recruiter will discuss your specific market's defined pay range & benefits at the beginning of the process.
Why You'll Thrive at Grafana Labs:
100% Remote, Global Culture - As a remote-only company, we bring together talent from around the world, united by a culture of collaboration and shared purpose.
Scaling Organization - Tackle meaningful work in a high-growth, ever-evolving environment.
Transparent Communication - Expect open decision-making and regular company-wide updates.
Innovation-Driven - Autonomy and support to ship great work and try new things.
Open Source Roots - Built on community-driven values that shape how we work.
Empowered Teams - High trust, low ego culture that values outcomes over optics.
Career Growth Pathways - Defined opportunities to grow and develop your career.
Approachable Leadership - Transparent execs who are involved, visible, and human.
Passionate People - Join a team of smart, supportive folks who care deeply about what they do.
In-Person onboarding - We want you to thrive from day 1 with your fellow new ‘Grafanistas' to learn all about what we do and how we do it.
Balance is Key - We operate a global annual leave policy of 30 days per annum. 3 days of your annual leave entitlement are reserved for Grafana Shutdown Days to allow the team to really disconnect.
*We will comply with local legislation where applicable.
Equal Opportunity Employer: We will recruit, train, compensate and promote regardless of race, religion, color, national origin, gender, disability, age, veteran status, and all the other fascinating characteristics that make us different and unique. We believe that equality and diversity builds a strong organization and we're working hard to make sure that's the foundation of our organization as we grow.
Grafana Labs may utilize AI tools in its recruitment process to assist in matching information provided in CVs to job postings. The recruitment team will continue to review inbound CVs manually to identify alignment with current openings.
#LI-Remote
For information about how your personal data is used once you've applied to a job, check out our privacy policy.
$111k-156k yearly est. Auto-Apply 5d ago
SR Technical Operations Engineer (Solana)
Pearster
Remote senior deployment engineer job
Job Description
Spoiler: Your next challenge in serious decentralized tech starts right here.
What this Journey Looks Like Pearster is looking for a Senior Technical Operations Engineer (Solana) to help power the core systems of a major US blockchain player.
This company is a cloud-based infrastructure provider powering the global blockchain ecosystem. Its mission is to be the indispensable utility that enables companies and innovators worldwide to build next-generation, Web3-enabled businesses and applications using blockchain technology.
In this role, you'll help keep Solana fast and reliable in production, operating validators, RPC, and indexing at scale. You'll tune hardware, optimize Agave/Jito, develop tools in Go or Python, and lead incident responses-treating keys, latency, and SLOs with production-level precision. Experience patching clients or contributing upstream fixes is a strong plus.
What You'll Lead and Build
Run validators: Deploy, upgrade, and tune Agave/Jito; minimize missed slots; maintain healthy voting and high leader performance.
Operate high-throughput RPC: Set smart connection and queue limits, optimize PubSub fan-out and backpressure, and ensure indexers are efficiently served without starving nodes
Extract performance from hardware: Select optimal servers, tune BIOS, kernel, NIC, and NVMe configurations, and validate performance gains through profiling and metrics.
Automate everything: Implement reproducible images, manage fleet changes with Terraform and Ansible, create snapshot pipelines, verify state-sync and replay processes, and build automated release systems.
Lead incidents (SEV0-2): Quickly isolate issues, execute safe roll-forwards or roll-backs, publish clear root cause analyses, and implement preventive measures to avoid recurrences.
Collaborate with the ecosystem: Reproduce complex bugs, share performance traces, and contribute targeted patches upstream when beneficial.
Code where it counts: Develop and extend tools for snapshots, replay/load, and state-sync verification; patch client bugs impacting production and upstream relevant fixes when valuable.
What You Bring to the Table
Location: Europe (remote-based).
Linux systems + kernel tuning: NUMA, IRQ affinity, hugepages, cpusets, I/O schedulers, sysctl; filesystem/NVMe layout; BIOS/firmware setup (C-states, power governors); NIC queues/offloads (RSS/RPS/XPS, GRO/LRO/TSO).
Hardware performance engineering: Choose and tune CPU/RAM/NVMe/NIC; measure replay throughput, p95/p99 RPC latency, IOPS/egress-and push them lower/faster.
Agave/Jito operations: Build from source; manage feature gates and config flags; snapshots (create/consume), ledger compaction/repair/replay health; accounts-DB tuning; version management.
Read protocols & surfaces: Operate and tune JSON-RPC (HTTP/WS), gRPC, and PubSub; design connection pools, concurrency limits, caching, timeouts, and backpressure that hold under peak.
Transaction sending logic: Understand direct-to-TPU (QUIC) vs RPC send Transaction; preflight/simulation trade-offs; priority fees and compute budget tuning; leader-schedule awareness.
Go or Python (plus Bash): Build small, sharp tools/CLIs (snapshot/restore pipelines, state-sync verification, health checks, replay/load harnesses).
Observability that matters: SLOs/error budgets; Prometheus/Grafana; alerts that page only when users hurt (RPC latency, PubSub backlog, missed leader slots, replay stalls).
Key management & safety: KMS/HSM/Vault; authority rotations; secure backups; tested DR paths; controlled, auditable change windows.
Benefits that Move you Forward.
We're here to amplify your brilliance, not contain it.
Work from anywhere with true flexibility and freedom.
Earn in USD with compensation that matches your expertise.
Recharge confidently with dedicated paid time off.
Advance your career with fully covered international certifications.
Access coworking spaces worldwide whenever you want a professional setup.
Strengthen your English and expand your global reach.
Connect and have fun with activities that unite our international team.
Feel appreciated with personalized gifts and a thoughtful welcome kit.
Grow our community and earn through our referral program.
At Pearster, your journey matters, and we're here to help you go further than you imagined.
$83k-120k yearly est. 25d ago
(REMOTE) Senior Cloud Operations Engineer
Geosite
Remote senior deployment engineer job
Job DescriptionDescription
is open to US residents and citizens only
Who We're Seeking We are looking for Senior Cloud Operations Engineer to help (1) build and maintain our cloud infrastructure using modern orchestration tools; and (2) implement cybersecurity best practices in pursuit of our compliance objectives. You will be joining an established engineering operations group, working with our production stack and product development teams.
Your Roles and Responsibilities
Orchestrate, manage, and further secure our cloud infrastructure using services on the cloud as well as modern tools
Make sure our stack follows best security practices for compliance as required by government and enterprise clients
Design all the cloud elements surrounding our Kubernetes cluster
Navigate complex compliance landscapes
Work with all teams to design monitoring dashboards to provide metrics to the respective teams
Lead and mentor other engineers in regards to elements in the cloud
Experience & Good to Haves
5+ years managing a stack running on AWS, GCP, etc.
Expertise in designing secure networks optimized for multi-cloud infrastructure
Experience building useful dashboards around logs, metric, and tracing statistics using services like Grafana, Cloudwatch, Datadog, etc.
Familiarity with provisioning tools like Terraform, CloudFormation, etc.
Strong communication skills
Bonus qualifications
Working knowledge of Kubernetes and Docker
Coding experience to help manage third party software (viz. Python, Go)
Experience building CI/CD pipelines (Jenkins, CircleCI, Gitlab CI, etc.)
Knowledge around automated testing frameworks that leverage tools like Selenium
Compensation & Benefits
160k - $200k
0.18% - 0.27% equity
Health, vision and dental insurance
Life insurance included
401(k) plan
Family-friendly, flexible work hours
Unlimited PTO with additional holidays off
Winter holiday time off in December
Occasional in-person meetings for development and team-building
Monthly social activities
$200k yearly 19d ago
Sr. Integration Engineer
Vertiv 4.5
Senior deployment engineer job in Westerville, OH
The
Sr. Integration Engineer
plays a major role in the Vertiv digital transformation journey. You will mentor and lead integration engineers for projects. You will be part of a learning culture, where teamwork and collaboration are encouraged, excellence is rewarded, and diversity is respected and valued.
Responsibilities:
Build integrations with interfaces at different level of complexity supporting different business domains.
Provide middleware expertise; play a key role in building scalable web services, APIs, and Microservices.
Anchor proof of concept developments and support multiple streams.
Collaborate with some of the best talent in the industry to create and implement innovative high-quality solutions, lead and participate in business discussions and pursuits focused on our business needs.
Requirements:
Bachelor's Degree in Computer Science, MIS, Electrical Engineering, or a related field of study; will also consider three years of progressive experience in the specialty in lieu of every year of education.
Five plus years of IT development experience.
One year of integration or API development experience
Preferred Requirements:
Strong experience as an integration engineer in large digital transformation projects involving complex landscapes.
Experience in on-premise and cloud middleware systems like Tibco, Oracle SOA, BizTalk, Oracle Integration cloud or MuleSoft.
Strong experience with building complex interfaces, APIs, and microservices using known middleware platform
Experience with API gateway systems like Apigee or Azure API. Working knowledge of API standard languages like Swagger and Open API
Strong conceptual and working knowledge of latest trends in PaaS technology and industry best practices, patterns, and standards
Strong analytical and communication skills and experience collaborating with multiple teams and vendors
Ability to understand and analyze business and functional requirements, and build scalable solutions
Ability to report risks, issues and test statuses to the stakeholders
The successful candidate will embrace Vertiv's Core Principals & Behaviors to help execute our Strategic Priorities.
OUR CORE PRINCIPALS:
Safety. Integrity. Respect. Teamwork. Diversity & Inclusion.
OUR STRATEGIC PRIORITIES
• Customer Focus
• Operational Excellence
• High-Performance Culture
• Innovation
• Financial Strength
OUR BEHAVIORS
• Own It
• Act With Urgency
• Foster a Customer-First Mindset
• Think Big and Execute
• Lead by Example
• Drive Continuous Improvement
• Learn and Seek Out Development
About Vertiv
Vertiv is a $8.0 billion global critical infrastructure and data center technology company. We ensure customers' vital applications run continuously by bringing together hardware, software, analytics and ongoing services. Our portfolio includes power, cooling and IT infrastructure solutions and services that extends from the cloud to the edge of the network. Headquartered in Columbus, Ohio, USA, Vertiv employs around 20,000 people and does business in more than 130 countries. Visit Vertiv.com to learn more.
Work Authorization
No calls or agencies please. Vertiv will only employ those who are legally authorized to work in the United States. This is not a position for which sponsorship will be provided. Individuals with temporary visas such as E, F-1, H-1, H-2, L, B, J, or TN or who need sponsorship for work authorization now or in the future, are not eligible for hire.
Equal Opportunity Employer
Vertiv is an Equal Opportunity/Affirmative Action employer. We promote equal opportunities for all with respect to hiring, terms of employment, mobility, training, compensation, and occupational health, without discrimination as to age, race, color, religion, creed, sex, pregnancy status (including childbirth, breastfeeding, or related medical conditions), marital status, sexual orientation, gender identity / expression (including transgender status or sexual stereotypes), genetic information, citizenship status, national origin, protected veteran status, political affiliation, or disability. If you have a disability and are having difficulty accessing or using this website to apply for a position, you can request help by sending an email to ********************.