Reliability engineer work from home jobs - 393 jobs

  • Remote Site Reliability Engineer - Build Resilient Systems

    Booz Allen Hamilton 4.9 company rating

    Remote job

    A leading consulting firm in the U.S. is seeking a Site Reliability Engineer skilled in building resilient infrastructure and automating processes. You will lead teams, optimize systems, and implement monitoring tools. The ideal candidate has extensive experience in cloud technologies, Unix/Linux, and application troubleshooting, along with a master's degree or equivalent experience. This role offers a competitive salary range between $99,000 and $225,000 annually, with a flexible work model.
    $99k-225k yearly 1d ago
  • Site Reliability Engineer

    WorkOS

    Remote job

    WorkOS builds tools and services for developers to help them implement authentication, identity, authorization, and overall enterprise readiness. We're a fully distributed team with employees across North American time zones. We're well-funded, having raised $100m in funding from top investors including Greenoaks Capital, Lachy Groom, and Lightspeed Ventures.

    About the Site Reliability Engineering Team

    The Site Reliability Engineering (SRE) team ensures the WorkOS platform remains fast, reliable, and resilient at scale. We build the systems and practices that keep everything running smoothly: handling hundreds of millions of requests, minimizing downtime, and continuously improving service performance. Our team works across the stack and collaborates closely with infrastructure and product engineering teams. We embed reliability into everything we do, whether it's designing scalable systems, improving observability, or leading incident response. If you're motivated by complex systems, passionate about uptime and performance, and excited to make reliability a first-class concern, this role offers the opportunity to make a lasting impact.

    Who we're looking for

    We're looking for engineers who are excited to improve the reliability of complex systems and enjoy digging into how things work. As an early member of the SRE team, you'll help shape our approach to reliability at scale and collaborate closely across the company.

    You might be a great fit if you:

      • Bring a generalist mindset and are comfortable working across infrastructure layers, from compute and networking to storage, databases, and app runtime environments
      • Are curious and proactive, with a strong desire to understand systems end-to-end and uncover hidden failure modes
      • Care deeply about uptime, observability, and performance, and see reliability as a product feature
      • Think through architectural trade-offs with reliability, simplicity, and maintainability in mind
      • Take initiative, work independently, and follow through, from identifying reliability risks to driving improvements
      • Collaborate well with engineers across disciplines and enjoy supporting teams through production readiness, incident response, and postmortem reviews

    Responsibilities

      • Design and evolve the systems, tooling, and processes that improve the reliability and performance of WorkOS
      • Collaborate with product and infrastructure teams to ensure services are production-ready, observable, and resilient to failure
      • Define and measure SLIs/SLOs to guide reliability improvements
      • Write and optimize backend systems (in TypeScript) with a focus on performance, maintainability, and graceful degradation
      • Improve our incident response process, lead postmortems, and drive follow-through on reliability risks
      • Develop internal tools and automations that make it easier to operate and scale our systems
      • Participate in our on-call rotation: responding to, resolving, and learning from production incidents
      • Contribute to design and architecture discussions with a focus on operability and long-term sustainability
      • Document systems, share learnings, and help grow a reliability-minded engineering culture

    Qualifications

      • Experience operating and scaling production systems in cloud environments (we use AWS)
      • Familiarity with service reliability concepts: monitoring, alerting, incident response, and root cause analysis
      • Comfort working across infrastructure layers (e.g. compute, networking, storage, observability tooling)
      • Strong debugging and systems thinking skills; you can follow problems across services and layers
      • Ability to work independently, take ownership, and drive projects from problem discovery through resolution

    Nice to have

      • Familiarity with Kubernetes or similar orchestration systems
      • Exposure to observability stacks (e.g. Prometheus, Grafana, Datadog, OpenTelemetry)
      • Exposure to TypeScript or interest in working in a TypeScript-based codebase

    Benefits (US Only)

    At WorkOS, we offer resources that emphasize personal and familial well-being. We offer healthcare coverage for you and your family, including medical, dental, and vision. We offer parental leave, paid time off, and fully remote working arrangements.

      • Competitive pay
      • Substantial equity grants
      • Healthcare insurance (Medical, Dental and Vision) for you and your family
      • 401k matching
      • Wellness and fitness monthly allowances
      • PTO + paid holidays + unlimited sick leave
      • Autonomy and flexibility with remote work

    Please inquire directly with our recruiting team for benefits available to those working outside the US.

    Equal Opportunity Employer

    WorkOS is an equal opportunity employer, committed to diversity and inclusiveness. We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability or age.

    We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
    $113k-160k yearly est. 5d ago
  • Site Reliability Engineer

    Gamma.App

    Remote job

    We're building the creative layer for modern communication. Every month, over a billion people make presentations - but the tools they use to make them haven't evolved in decades. We're changing that, using AI to disrupt a massive market.

    📈 Millions of people rely on Gamma to create, teach, and persuade, creating more than 1 million gammas every day.
    💻 We see Gamma as the next great workplace tool, combining viral B2C love with a massive B2B opportunity. We believe AI can be a true creative partner: one that understands context, clarity, and taste.
    💸 We've reached a $2.1B valuation, crossed $100M in annual recurring revenue, and have been profitable since 2023.
    💙 We're an imaginative, passionate team who takes our work seriously, but not ourselves. Our culture is warm, a little quirky, and fueled by curiosity.

    About the role

    Gamma's infrastructure needs to be rock-solid for millions of daily users while enabling our engineering teams to ship fast. You'll own the operational health of our full backend platform, building automation and tooling that improves reliability and partnering with engineering to design systems that are observable, resilient, and easy to operate. Your work directly impacts every Gamma user's experience.

    This is a high-impact role where you'll balance reliability with velocity, knowing when to move fast and when to prioritize stability. You'll lead incident response, drive systemic improvements, and help shape how Gamma scales to serve its next 100 million users.

    Our team has a strong in-office culture and works in person 4-5 days per week in San Francisco. We love working together to stay creative and connected, with flexibility to work from home when focus matters most.

    What you'll do

      • Own reliability, availability, and performance of Gamma's production systems across primarily AWS infrastructure
      • Build observability infrastructure with metrics, logging, tracing, and alerting that provide deep visibility into system health
      • Design automation to reduce toil, improve deployment safety, and accelerate incident resolution
      • Lead incident response, conduct blameless post-mortems, and drive systemic improvements to prevent recurring issues
      • Partner with engineering teams on architecture reviews, SLOs/SLIs, and reliability best practices
      • Manage and optimize our infrastructure including compute, networking, databases, and managed services

    What you'll bring

      • 5+ years in Site Reliability Engineering, DevOps, or systems engineering roles with deep AWS expertise
      • Strong programming skills (Python, Go, or TypeScript/Node.js) for building tools and automation
      • Experience with infrastructure-as-code (Terraform, CloudFormation) and comprehensive observability solutions
      • Track record improving system reliability through automation, monitoring, and architectural improvements
      • Solid understanding of networking, distributed systems, containerization (Docker, Kubernetes), and database performance
      • Strong incident management and debugging skills for complex production issues
      • (Nice to have) Experience scaling SaaS applications to millions of users
      • (Nice to have) Background with real-time collaborative systems, Kafka, chaos engineering, or service mesh technologies
      • (Nice to have) AWS certifications or experience with security/compliance requirements (SOC 2, ISO 27001)

    Compensation range

    Final offer amounts are determined by multiple factors, including but not limited to experience and expertise in the requirements listed above.

    If you're interested in this role but you don't meet every requirement, we encourage you to apply anyway! We're always excited about meeting great people.

    We're building on a full TypeScript stack centered around some of the most modern and popular technologies. We use our own custom, open-source AI prompting framework, AIJSX. We have a lot of custom tools built in-house, but also new ones like Vercel AI SDK.

    Our tiny team operates at massive scale:

      • 1M+ 70M users around the world
      • 6M+ AI images generated daily
      • 1 trillion LLM tokens processed per month

    Life at Gamma

      • You get energy from small teams doing big things.
      • You love when design, code, and storytelling overlap.
      • You default to action, even when the answer isn't clear yet.
      • You value details, but know when to ship and move on.
      • You bring both the spreadsheets and the sparkle, equal parts workhorse and unicorn.
      • You believe AI should amplify creativity, not replace it.
      • You know kindness and intensity are not opposites.
      • You like working with people who care deeply: about their craft, their teammates, and the users on the other side of the screen.

    Who we are

    Gamma is full of imaginative, passionate people who take their work seriously but not themselves. The culture is warm, a little quirky, and fueled by curiosity. It's the kind of place where you'll debate a pixel on Monday, laugh over someone's keyboard setup on Tuesday, and ship something remarkable by Friday. We care about craft, move with intention, and don't mind getting a little scrappy. It's fast, creative, and occasionally chaotic - but that's what makes it interesting.

    Here's a bit about what it's like to work here, from people on the inside:

      • “quirky, inspiring, fun, a little wild in the best way”
      • “You can have an idea and just run with it.”
      • “Everyone's talented and humble - the mix keeps you sharp.”
      • “We ship cool stuff, learn a ton, and laugh a lot doing it.”

    Meet the team

    We're a team of dreamers and doers building in beautiful San Francisco 🌉 We're kabaddi enthusiasts, pickleballers, dog herders, woodworkers, keyboard nerds, potters, and more - and we can't wait to meet you!
    $113k-160k yearly est. 4d ago
  • Remote Site Reliability Engineer - Windows, AD & ITIL

    Iron Mountain 4.3 company rating

    Remote job

    A global leader in information management is looking for a talented Systems Engineer in Boston. This role requires U.S. Citizenship and the ability to obtain government clearance. Responsibilities include troubleshooting, providing support, and performing system documentation. Ideal candidates will have a Bachelor's degree, strong technical skills, and experience with Windows Server and Linux. The expected salary range is $93,400 to $124,500, with opportunities for professional growth in a fast-paced environment.
    $93.4k-124.5k yearly 3d ago
  • Senior SRE - Remote-First, Observability & Reliability

    CaptivateIQ, Inc. 4.3 company rating

    Remote job

    A tech company focused on sales performance is seeking a Site Reliability Engineer in San Francisco. This role involves collaborating with development teams, automating infrastructure, and ensuring service reliability. Ideal candidates will have extensive experience in SRE or DevOps, with skills in infrastructure as code and strong communication abilities. The company offers generous benefits including health coverage and a 401k plan, fostering a diverse and inclusive work environment.
    $142k-189k yearly est. 2d ago
  • NLP Engineer - Production ML for PII Redaction (Remote)

    Tonicai

    Remote job

    A leading data privacy firm in San Francisco is seeking a hands-on Machine Learning Engineer to develop production-grade NLP systems. The ideal candidate will have over 3 years of experience in applied machine learning, particularly in NLP, and proficiency in Python and PyTorch. This role offers high autonomy and the opportunity to work with impactful data in various domains, including healthcare and finance. Competitive salary and comprehensive benefits are provided.
    $109k-160k yearly est. 4d ago
  • Senior Quality Systems Engineer (Remote) - Drive QMS Excellence

    Getinge 4.5 company rating

    Remote job

    A global medical device company is seeking a Senior Quality Systems Engineer for a remote position. The role involves leading the enhancement of Quality Management System documents, driving best practices, and ensuring regulatory compliance. Candidates should have at least 5 years of experience in Quality Assurance with a strong understanding of medical device regulations. A comprehensive benefits package including health insurance and a registered pension plan is offered.
    $78k-103k yearly est. 2d ago
  • Staff Site Reliability Engineer

    Motive 4.3 company rating

    Remote job

    Who we are:

    Motive empowers the people who run physical operations with tools to make their work safer, more productive, and more profitable. For the first time ever, safety, operations and finance teams can manage their drivers, vehicles, equipment, and fleet related spend in a single system. Combined with industry leading AI, the Motive platform gives you complete visibility and control, and significantly reduces manual workloads by automating and simplifying tasks.

    Motive serves nearly 100,000 customers - from Fortune 500 enterprises to small businesses - across a wide range of industries, including transportation and logistics, construction, energy, field service, manufacturing, agriculture, food and beverage, retail, and the public sector. Visit gomotive.com to learn more.

    About the Role:

    As a Staff Site Reliability Engineer on the Platform team, your role will be crucial in helping us design, scale, and manage our growing AWS-backed services for millions of connected IoT devices, mobile, and SaaS users. Your expertise in cloud-native and highly elastic service design and scaling practices is going to ensure our growing services, as well as new products and features operate smoothly and without manual intervention to achieve Motive's strong 99.99% availability SLOs. Leveraging and advancing our robust and fully-codified infrastructure and Kubernetes environment, paired with AWS components that require thoughtful implementations, and of course advanced troubleshooting with teams, you can be a large part of Motive's growth to the next million devices and beyond.

    What You'll Do:

      • Collaborate with other engineering and product teams to design and build the infrastructure and services required to deliver new features to customers in a cloud-native and event-driven fashion.
      • Leverage and progress our IaC (Terraform) and CM (Helm) code and strategies for advanced scaling and self-service usage by engineering teams.
      • Identify and remove bottlenecks from systems in production throughout AWS services and with our Kubernetes platform.
      • Ensure 99.99% customer-facing uptime.
      • Continuously improve the monitoring and alerting capabilities of our platform, enabling us to be proactive instead of reactive.

    What We're Looking For:

      • 7+ years of professional SRE/DevOps experience, and a demonstrated ability working on high volume production systems
      • Demonstrable systems architect expertise, solving complex technical problems and implementing company wide solutions.
      • Advanced knowledge of AWS services and technologies (ALB/ELB, IAM permissions, DynamoDB, SNS, EKS/Fargate, etc.)
      • Experience with infrastructure as code and configuration management (Terraform and Helm charts especially) to design and provision new services
      • Knowledge of Python, Bash or other scripting languages. Knowledge of Ruby or Golang is a plus.
      • High-level of ownership and drive to work with others and see improvements through to production.

    Pay Transparency

    Your compensation may be based on several factors, including education, work experience, and certifications. For certain roles, total compensation may include restricted stock units. Motive offers benefits including health, pharmacy, optical and dental care benefits, paid time off, sick time off, short term and long term disability coverage, life insurance as well as 401k contribution (all benefits are subject to eligibility requirements). Learn more about our benefits by visiting Motive Perks & Benefits.

    The compensation range for this position will depend on where you reside. For this role, the compensation range is:

    United States: $164,000-$226,000 USD

    Creating a diverse and inclusive workplace is one of Motive's core values. We are an equal opportunity employer and welcome people of different backgrounds, experiences, abilities and perspectives.

    Please review our Candidate Privacy Notice and our UK Candidate Privacy Notice.

    The applicant must be authorized to receive and access those commodities and technologies controlled under U.S. Export Administration Regulations. It is Motive's policy to require that employees be authorized to receive access to Motive products and technology.
    $164k-226k yearly Auto-Apply 8d ago
  • Staff Systems Reliability Engineer

    iRhythm Technologies 4.8 company rating

    Remote job

    Career-defining. Life-changing.

    At iRhythm, you'll have the opportunity to grow your skills and your career while impacting the lives of people around the world. iRhythm is shaping a future where everyone, everywhere can access the best possible cardiac health solutions. Every day, we collaborate, create, and constantly reimagine what's possible. We think big and move fast, driven by our commitment to put patients first and improve lives. We need builders like you. Curious and innovative problem solvers looking for the chance to meaningfully shape the future of cardiac health, our company, and your career.

    About This Role:

    We are seeking a highly experienced and strategic Staff System Reliability Engineer V to lead the design, scalability, and resilience of our cloud infrastructure. This role is ideal for someone with deep expertise in AWS, infrastructure automation, and observability who thrives in complex, high-availability environments. As a senior technical leader, you'll work closely with engineering and security teams to optimize performance, improve deployment pipelines, and uphold service reliability across mission-critical systems.

    What You Will Be Doing

      • Design and implement scalable, fault-tolerant AWS-based infrastructure using Terraform and/or CloudFormation for regulated workloads (e.g., HIPAA, FDA CFR Part 11, EU MDR).
      • Develop and maintain CI/CD pipelines using tools like GitLab CI, ArgoCD, or similar.
      • Write automation tools and scripts in Python and/or Go to support operations, monitoring, and self-healing systems.
      • Lead incident response efforts, root cause analysis, and postmortem documentation for system failures.
      • GitLab pipeline authoring.
      • Kubernetes (EKS) cluster management support.
      • Ability to migrate applications from ELB/ALB EC2 instances to k8s using Helm for configuration management.
      • Define and monitor SLOs, SLAs, and error budgets across key services.
      • Implement and manage observability tools (e.g., Prometheus, Grafana, CloudWatch, OpenTelemetry).
      • Collaborate with software engineers to ensure systems are designed for reliability and security from the ground up.
      • Harden system security by implementing least privilege IAM, automated patching, and vulnerability management.
      • Evaluate and onboard new technologies to improve infrastructure efficiency and resilience.
      • Mentor junior SREs and promote best practices in reliability engineering across the organization.

    What We Need to See

      • Requires a minimum of 12 years of related experience with a Bachelor's degree; or 8 years and a Master's degree; or a PhD with 5 years' experience; or equivalent experience.

    Ways To Stand Out

      • Expert-level knowledge of AWS services (EC2, Lambda, VPC, IAM, RDS, ECS/EKS, etc.).
      • Helm, Argo CD.
      • GitLab: ability to abstract complexity to templated pipeline archetypes for similar development projects.
      • Strong proficiency in Python and/or Go for automation and tooling.
      • Deep understanding of infrastructure-as-code and GitOps workflows.
      • Experience managing observability and alerting systems at scale.
      • Strong grasp of Linux systems, networking, and distributed architecture principles.
      • Familiarity with regulatory requirements such as FDA 21 CFR Part 11, HIPAA, ISO 13485, and EU MDR as they relate to infrastructure and DevOps.
      • Strong written and verbal communication skills, including documentation and incident reporting.

    Work Environment / Other Requirements:

      • Occasional travel to office if in Bay area.

    What's In It for You:

      • Competitive compensation including base salary, annual performance bonus, and stock/equity opportunities.
      • Outstanding benefits package with comprehensive medical, dental, vision, and wellness programs.
      • Generous paid time off including vacation, holidays, and sick leave - because work/life balance matters.
      • Flexible work options including hybrid and remote arrangements, depending on your location.
      • 401(k) with company match and financial wellness resources to support your long-term goals.
      • Mission-driven work - contribute to life-saving technology that improves patient outcomes around the world.
      • Professional development with access to training, certifications, and leadership opportunities.
      • Supportive and inclusive culture that values transparency, autonomy, and accountability.

    Location: Remote - US

    Actual compensation may vary depending on job-related factors including knowledge, skills, experience, and work location.

    Estimated Pay Range: $146,000.00 - $190,000.00

    As a part of our core values, we ensure an inclusive workforce. We welcome and celebrate people of all backgrounds, experiences, skills, and perspectives. iRhythm Technologies, Inc. is an Equal Opportunity Employer. We will consider for employment all qualified applicants with arrest and conviction records in accordance with all applicable laws.

    iRhythm provides reasonable accommodations for qualified individuals with disabilities in job application procedures, including those who may have any difficulty using our online system. If you need such an accommodation, you may contact us at *********************

    About iRhythm Technologies

    iRhythm is a leading digital healthcare company that creates trusted solutions that detect, predict, and prevent disease. Combining wearable biosensors and cloud-based data analytics with powerful proprietary algorithms, iRhythm distills data from millions of heartbeats into clinically actionable information. Through a relentless focus on patient care, iRhythm's vision is to deliver better data, better insights, and better health for all.

    Make iRhythm your path forward. Zio, the heart monitor that changed the game.

    There have been instances where individuals not associated with iRhythm have impersonated iRhythm employees pretending to be involved in the iRhythm recruiting process, or created postings for positions that do not exist. Please note that all open positions will always be shown here on the iRhythm Careers page, and all communications regarding the application, interview and hiring process will come from ****************** email address. Please check any communications to be sure they come directly ********************* email address. If you believe you have been the victim of an imposter or want to confirm that the person you are communicating with is legitimate, please contact *********************. Written offers of employment will be extended in a formal offer letter from ******************* email address ONLY.

    For more information, see *********************************************************************************** and *****************************************
    $146k-190k yearly Auto-Apply 60d+ ago
  • Manager, Site Reliability Engineer

    Wildlife Studios 3.6 company rating

    Remote job

    We're looking for a talented and passionate Site Reliability Engineer Manager to join Wildlife's Cloud Platform team. As an SRE Manager, your goal will be to provide easy-to-use, highly available systems to all the engineers in the company.

    As an SRE Manager, your main goal is to enable your team to improve the infrastructure services, using and refining our existing automations, while contributing to technical and business decisions for new services that will support the scalability and usability of the infrastructure services in the company, and improving the team's career growth, engagement and retention.

    We know that the work we do has a high impact on our company's success and culture. The right person for this position is curious by nature, proactive, loves solving problems, and can thrive in a fast and growing business.

    What you'll do

      • Be the manager of a cross-functional team, contributing to the team roadmap and growth of its individual contributors;
      • Develop, maintain, and optimize infrastructure clusters (e.g., Kubernetes, NATS, ETCD, Postgres, MongoDB, Redis, Elasticsearch), infrastructure services (e.g., Gitlab, Jenkins, Vault, Artifactory, Datadog, Jaeger, etc.), and our APIs and automations to manage them (e.g., Kubernetes Operators, Infrastructure as code, Pipelines, CLIs);
      • Analyze costs of infrastructure services and help define and optimize the budget of our infrastructure and game teams;
      • Contribute to improvements on monitoring and observability patterns for infrastructure services;
      • Troubleshoot, manage and lead incidents in production;
      • Manage and improve the tools and processes related to infrastructure management across the company (Infrastructure-as-code standards, CI/CD design, build of our Internal Developer Platform, etc.);
      • Help partner teams to architect and scale their applications and infrastructure with cloud-native best practices.

    What you'll need

    We expect our Managers to be technical, dedicating around 50% of their time to working together with the ICs in their day-to-day work and being an active, participating voice on the team's technical roadmap.

      • Experience managing small teams with infrastructure background;
      • Some level of leadership skills, including the areas of people management, communications, project management, talent development, performance management, team effectiveness, agility, hiring, decision making, planning, budgeting, and collaboration;
      • Coding experience in at least one programming language. We work mostly with Go and Python;
      • University degree in courses related to computing such as Computer Engineering, Computer Science, Information Systems, and Systems Analysis and Development or equivalent Market Experience;
      • Solid understanding of computer concepts (operating systems, networking, concurrency, memory management, and algorithm analysis);
      • Experience with cloud computing services such as Amazon AWS, Google Cloud, or Microsoft Azure;
      • Experience with Infrastructure as Code automations, such as Terraform, Packer, Ansible, Crossplane, etc;
      • Experience managing Kubernetes clusters and developing Kubernetes operators;
      • Experience automating routine tasks, such as deployments and monitoring setup;
      • Experience with incident management and being on call for production systems and workloads;
      • Strong written and spoken communication skills in English;
      • Experience with complex, large-scale, and highly available systems;
      • Experience with monitoring and telemetry in applications and infrastructure;
      • History of technical leadership and ownership of critical projects, including the mentoring of junior team members.

    More about you

      • Player focused. We are player-oriented, and infrastructure has a great impact on their experience. You have empathy with our players and focus on ensuring they have an amazing experience. You aim for a top-level infrastructure, guaranteeing the highest availability possible.
      • Automation is key to scaling. We look for engineers who have a history of planning and executing automation projects in order to get rid of any manual and repetitive tasks.
      • Calm and pragmatism. When everything seems to be falling apart around you, you have a plan and keep calm.
      • Bleeding edge. You are curious and like to study new technologies, test new solutions, and measure the impact brought by changes. We want to ensure we are using the best stack possible.
      • Metrics-oriented. We make decisions based on data and metrics. We measure the results of our tasks against the expected outcome. And we ensure our work has delivered the correct impact on our customers. We believe in ownership and in shipping features end to end.
      • Bar raiser. You want to elevate your team's skills and raise the bar by mentoring your peers, spreading knowledge, and being a proactive tech lead.

    About Wildlife

    Wildlife is one of the leading mobile game developers and publishers in the world. We have released more than 60 titles, reaching billions of people around the globe. Here, we create games that will excite, intrigue, and engage our players for years to come!

    Equal Opportunity

    Wildlife is proud to be an Equal Opportunity and Affirmative Action employer. We do not discriminate based upon race, color, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state, and local law. We're committed to providing accommodations for candidates with disabilities in our recruiting process.
    $94k-136k yearly est. Auto-Apply 60d+ ago
  • Site Reliability Engineer

    Strike 4.8 company rating

    Remote job

    Better Money Strike is the Bitcoin company. With Strike, you can buy and sell bitcoin, pay bills, and borrow against your holdings. From individuals to businesses, Strike is purpose-built for every step of the Bitcoin journey. Available in more than 100 countries - including the U.S., Europe, Latin America, and Africa - Strike is building a better financial system powered by Bitcoin. Bitcoin is better money. Strike is how you use it. Role: We are seeking a highly experienced Site Reliability Engineer located in Europe, with a strong track record of tackling complex reliability and scalability challenges, and a history of providing technical guidance to teams. If you're a seasoned problem-solver with a passion for automation and operational excellence, and enjoy elevating the skills of those around you, we want to hear from you. What You'll Do: Lead Technical Initiatives: Drive key technical initiatives focused on improving the reliability, performance, and scalability of our critical systems, often leading technical aspects within projects. Architect and Implement Advanced Solutions: Design and implement sophisticated resilient and scalable solutions, leveraging your deep understanding of distributed systems. Master Troubleshooting and Optimization: Lead complex troubleshooting efforts, identify deep-seated root causes, and implement advanced optimizations. Build and Evangelize Automation: Develop and champion the adoption of robust automation frameworks and tools, potentially guiding more junior engineers in their development. Elevate Observability Practices: Design and implement comprehensive and insightful monitoring and logging solutions, ensuring actionable insights are available across teams. Provide Leadership in Incident Management: Take a leadership role in incident response, providing critical technical direction and mentorship during high-pressure situations. 
Champion Post-Mortem Excellence: Lead and contribute to in-depth blameless post-mortem analyses, driving significant improvements based on learnings. Mentor and Guide Team Members: Share your extensive knowledge and experience to mentor and guide other SREs and engineers, fostering their technical growth. What We're Looking For: Extensive experience (minimum 5 years) in SRE, platform engineering, or software development with a strong operational focus. Demonstrated experience in providing technical leadership, guidance, or mentorship to engineering teams. Expert-level practical knowledge of cloud platforms, especially GCP. Deep hands-on experience with container orchestration (Kubernetes) and infrastructure-as-code (Terraform, Helm, ArgoCD). Strong command of multiple scripting and programming languages (Python, Go, Bash). Proven expertise in building and leveraging advanced monitoring and observability tools (Prometheus, Grafana, ELK stack). Exceptional analytical, problem-solving, and debugging skills at a senior level. Excellent communication, collaboration, and influencing skills. Organizational and leadership skills are a big plus. Compensation and Benefits: Location dependent. We do not make hiring decisions based on educational history whatsoever. Our Founder is a college dropout. We work with high school dropouts, PhD candidates, and everyone in between. We do not hire credentials. We simply partner with talented, passionate individuals who are excited to be a part of our team. By clicking submit application below, you consent to our use and processing of your data as described in our Candidate Privacy Notice.
    $88k-128k yearly est. Auto-Apply 39d ago
  • Systems Engineer - Safety Methodologies

    Waabi

    Remote job

    Waabi, founded by AI pioneer and visionary Raquel Urtasun, is an AI company building the next generation of self-driving technology. With a world class team and an innovative approach that unleashes the power of AI to “drive” safely in the real world, Waabi is bringing the promise of self-driving closer to commercialization than ever before. Waabi is backed by best-in-class investors across the technology, logistics and the Canadian innovation ecosystem. With offices in Toronto, San Francisco, Dallas, and Pittsburgh, Waabi is growing quickly and looking for diverse, innovative and collaborative candidates who want to impact the world in a positive way. To learn more visit: ************ At the heart of our mission is an unwavering commitment to safety. We are seeking a passionate and experienced safety or systems engineer to spearhead the development and implementation of critical safety framework methods that underpin our driverless autonomy readiness decisions. This is a unique opportunity to shape how Waabi quantitatively ensures and validates the safety of our autonomous trucking solution, working with our highly realistic simulator, real-world data, and cutting-edge generative AI techniques. You will play a pivotal role in creating the evidence for safe operation and leading efforts in a rapidly evolving and groundbreaking field. 
You will…- Lead the development of conflict avoidance method focused on proactively preventing dangerous situations, reducing surprises to other drivers, and ensuring our vehicle does not initiate conflict- Innovate and implement tools and processes to streamline safety validation and continuously evolve the safety method- Define verification coverage using Waabi's simulator to verify driving behaviors, and develop quantitative metrics to evaluate system readiness against established safety targets- Establish human performance benchmarks for robust comparative safety assessments by collaborating with engineering teams and external partners- Identify safety gaps, recommend improvements, and ensure traceability between requirements, validation artifacts, and safety case claims- Ensure clear, structured documentation of safety artifacts and readiness decisions to ensure transparency and traceability- Mentor peers by fostering a culture of technical excellence and driving clear, constructive collaboration between teams Qualifications:- Undergrad required; Masters or PhD within an engineering discipline preferred- 3+ years of automotive, robotics or related industry experience- Experience contributing to the development of a safety case for autonomous vehicles- Experience with using simulation and real-world testing to make readiness decisions- Knowledge of relevant safety methods and standards such as STPA, SOTIF (ISO 21448) and/or UL 4600- Strong fundamentals in mathematics, engineering and physics- Excellent data analysis skills and Python scripting skills- Ability to communicate complex concepts or data in a simple-yet-accurate manner- Collaborative team player who works effectively across functional boundaries- Passionate about self-driving technologies, solving hard problems, and creating innovative solutions Bonus/nice to have:- Experience in launching a driverless product- Experience implementing software systems components- Advanced skills in data mining, 
mathematics, and statistical analysis The US yearly salary range for this role is: $140,000 - $190,000 USD in addition to competitive perks & benefits. Waabi (US) Inc.'s yearly salary ranges are determined based on several factors in accordance with the Company's compensation practices. The salary base range is reflective of the minimum and maximum target for new hire salaries for the position across all US locations. Note: The Company provides additional compensation for employees in this role, including equity incentive awards and an annual performance bonus. Perks/Benefits:- Competitive compensation and equity awards.- Health and Wellness benefits encompassing Medical, Dental and Vision coverage (for full-time employees only).- Unlimited Vacation.- Flexible hours and Work from Home support.- Daily drinks, snacks and catered meals (when in office).- Regularly scheduled team building activities and social events both on-site, off-site & virtually.- As we grow, this list continues to evolve! Waabi is a technology start-up building technologies to transform the way the world moves. Join our talented team to be a part of the future and to make an impact! Waabi is an equal opportunity employer. We celebrate diversity and are committed to creating a supportive, inclusive, and accessible workplace for all our employees. We seek applicants of all backgrounds and identities, across race, color, ethnicity, national origin or ancestry, age, citizenship, religion, sex, sexual orientation, gender identity or expression, military or veteran status, marital status, pregnancy or parental status, caregiver status, disability, or any other characteristic protected by law. We make workplace accommodations for qualified individuals with disabilities as required by applicable law. If reasonable accommodation is needed to participate in the job application or interview process please let our recruiting team know.
    $140k-190k yearly Auto-Apply 60d+ ago
  • Systems Safety Engineer (Remote)

    Precision Personnel

    Remote job

    The Aerospace Engineer, Systems Safety position is responsible for supporting the focal systems engineer by designing, developing, testing, and certifying systems for a project. This role requires the significant exercise of independent discretion and judgment in matters of significance. Responsibilities: Generate System and Parts safety requirements, typically: Create system safety requirements. Create validation and verification plans for safety requirements. Assess compliance of systems, components, installation and operation for systems against the defined requirements. Communicate the system needs to internal and external consumers of the design. Compare design solutions for safety. Determine key parameters for designs, including AFHA, SFHA, DFMEA and FMEA/FMES, Fault trees at unit, system and aircraft design level. Knowledge of the System design process. Knowledge to generate Safety analysis, including AFHA, SFHA, ALRs, SLRs, FMEA, DFMEA and other documentation per SAE ARP 4754A and SAE ARP 4761. Review and support Certification Activities, including but not limited to: Create and review Certification Plans. Create and review Component Qualification Test Plans, Qualification Test Procedures and Qualification Test Reports. Review system safety documents (AFHA, PSSA, FMEA, SSA and CCAs). Create, review and approve ground test procedures, flight test procedures and their corresponding test reports. Participate in Systems Design Reviews. Coordinate with suppliers to review technical data, safety process and design compliance. Coordinate with certification authorities to seek clarification on requirements. Review certification guidance material (advisory circulars, etc.) and incorporate the necessary guidance into the certification plans. 
Determine action for special certification requirements and suitable data capture methods. Analyze mechanical failures and determine root cause and corrective actions. Review requirements for completeness and suitable verification and validation methods. Write test reports. Complete special projects and tasks assigned by Group Lead. Knowledge, Skills, and Abilities: Knowledge of the system engineering development process, including the origination of requirements and their management, revision, validation, and verification. Knowledge of flight test operations and flight certification. Good leadership presence as well as people management skills: Future-oriented in thinking and operation. Able to lead by example and live/work by company values. Ability to successfully contribute to a positive and productive work environment. Able to instill a sense of urgency in team members. Able to be patient and objective in difficult situations with different types of people. Strong customer service tool box: Professional mannerisms, appearance and actions (self-confident and committed to high ethics). Strong follow-through, quick thinking and resourceful. Ability to remain calm, cool and collected in stressful situations. Strong sense of urgency to resolve customer needs. Strong organizational, time management and prioritization skills: Able to multi-task, maintain focus on several different projects at one time and hit deadlines. Able to be flexible with attention and priority. Ability to work in a progressive, fast-paced environment (work well under pressure). Strong analytical skills, with an ability to troubleshoot, problem-solve and effectively and efficiently make decisions. Strong communication skills (oral, written, presentation) with both external and internal customers. Act as an active listener, seeking to understand and then to be understood, articulating clearly and confidently. 
Uses people relationship and business management skills to make decisions on what and when to communicate with employees and customers. Excels at communicating clearly and effectively verbally. Strong proficiency in writing of documents, reports, and presentations. Strong interpersonal skills, with the ability to build strong relationships at all levels. Ability to influence others as well as relate to individuals at all levels of the organization. Good project management skills, including the ability to take ownership for accomplishing assigned tasks. Results-oriented planner and delegator who ensures that goals are met. Able to set priorities and keep to projected schedules. Computer Skills: Comfortable and effective working in Microsoft Office, RELEX, CAFTA. Intermediate Excel proficiency required. Able to quickly learn new software and systems. Proven track record of improving the efficiency of assigned processes or procedures. Education: Four (4) year degree in Engineering (Aerospace, Avionics, Electrical, or Instrumentation). Required Experience: Four (4) years Aerospace engineering experience in Mechanical Systems, including experience in at least one of the following: Mechanical Systems, Flight Control Systems, Landing Gear Controls, Hydraulic System and/or Brake Control Systems utilized in aircraft.
    $67k-104k yearly est. 4d ago
  • System Safety Engineer

    Open Roles

    Remote job

    Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver-The World's Most Experienced Driver™-to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo's fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states. The Waymo Safety team works to promote and help to continuously improve the safety of Waymo's fully autonomous driving technology. Our experts develop safety goals and strategies, and conduct safety engineering analyses to ensure safety is being considered throughout the design and development of our vehicles. The team develops and promotes safety strategies and policies for autonomous vehicles for its work with regulatory authorities, lawmakers, law enforcement and public and non-profit organizations. Our Safety Team also helps advise on compliance with applicable environmental, health, and safety regulations. In this remote role, you will report into one of our System Safety Leads. 
You will: Lead cross-functional efforts to perform safety analyses and identify potential hazards and their causes, assess safety risks (qualitatively and quantitatively), support the definition of safety requirements, inform design decisions, propose mitigations, and support verification strategies. Participate in hardware, software, and operations/product design and readiness reviews and discussions to help ensure that safety mitigations and requirements are addressed and ensure the safety mitigations are working effectively in the field. Provide guidance on the adoption and adaptation of relevant safety standards and industry best practices, and help ensure Waymo and its partners comply with relevant safety regulations. Evaluate the continued effectiveness of implemented risk control strategies; support the identification of new hazards. Provide leadership in defining, implementing, and improving relevant System Safety processes while promoting our safety culture inside Waymo. Provide leadership in integrating the System Safety Methodology with other methodologies of the Waymo safety framework. You have: Bachelor of Science in Electrical/Mechatronics/Aerospace Engineering, Computer Engineering, Computer Science, or related technical fields. Strong technical project management skills with the ability to manage multiple parallel projects. 10+ years of relevant engineering work experience that includes minimum 5 years of System Safety engineering. Proficient in selecting and performing appropriate inductive/deductive safety analyses in order to efficiently identify hazards and their level of risk. Proficient in applying Systems Engineering principles including defining SMART safety mitigations/requirements and collaborating on effective verification and validation methods. Extensive background in risk management and aligning risk based decisions with stakeholders. We prefer: A Master's degree or PhD in Electrical/Mechatronics/Aerospace Engineering, Computer Engineering, 
Computer Science, or related technical fields. Experience in complex, safety critical fleet operations and the impact to hardware, software, and systems design. Experience in conducting safety analysis and risk assessments on Perception and Behavior/Planner software stack. Experience in writing code and performing unit tests as well as analyzing algorithmic structures in C, C++, and Python. Knowledge of relevant industry safety standards like ISO 26262, ISO 21448, ISO 5083, MIL-STD-882E, ARP4761. Travel: Must be able to travel once a quarter for team meetings as well as travel to support safety audits and assessment activities as needed. The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remotely, the specific salary range for your preferred location, during the hiring process. Waymo employees are also eligible to participate in Waymo's discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements. Salary Range: $196,000-$248,000 USD
    $61k-108k yearly est. Auto-Apply 6d ago
  • System Safety Engineer (Remote from US)

    Jobgether

    Remote job

    This position is posted by Jobgether on behalf of a partner company. We are currently looking for a System Safety Engineer - Onboard in United States. We are seeking an experienced System Safety Engineer to ensure the safe design, development, and deployment of advanced autonomous systems. This role leads cross-functional safety analyses, identifies hazards, and develops mitigations to protect people and assets while supporting design decisions. You will collaborate with software, product, and engineering teams to integrate safety requirements, assess risks, and implement effective verification strategies. The position involves contributing to safety processes, promoting a strong safety culture, and aligning with regulatory and industry standards. The ideal candidate combines deep technical expertise, risk management experience, and hands-on system engineering skills to influence safe and reliable operations. This role also provides mentorship and guidance on adopting best practices across safety methodologies. Accountabilities: Lead cross-functional efforts to perform safety analyses, identify hazards, assess safety risks, and support the definition of safety requirements. Inform design decisions, propose mitigations, and support verification strategies to ensure operational safety. Participate in software and product design reviews to confirm safety mitigations are properly implemented and effective in the field. Guide adoption of safety standards and best practices, ensuring compliance with regulatory requirements. Evaluate the continued effectiveness of implemented risk control strategies and identify new hazards. Define, implement, and improve system safety processes while promoting a strong safety culture. Integrate system safety methodology with other safety frameworks across engineering functions. Requirements: Bachelor of Science in Computer Engineering, Computer Science, or related technical field. 
10+ years of relevant safety engineering experience, including a minimum of 5 years in System Safety engineering. Strong technical project management skills with the ability to manage multiple concurrent projects. Proficiency in performing inductive/deductive safety analyses to efficiently identify hazards and assess risk levels. Experience applying Systems Engineering principles, including defining SMART safety mitigations/requirements and collaborating on verification and validation methods. Extensive background in risk management and aligning risk-based decisions with stakeholders. Experience coding, performing unit tests, and analyzing algorithmic structures in C, C++, and Python. Preferred Qualifications: Master's degree or PhD in Electrical, Mechatronics, Aerospace Engineering, Computer Engineering, Computer Science, or related fields. Experience conducting safety analysis and risk assessments on SAE L3+ Perception and Behavior/Planner software. Benefits: Competitive base salary range: $196,000 - $248,000 USD, with potential performance-based bonus eligibility. Participation in equity incentive plans and comprehensive company benefits programs. Remote work flexibility with periodic travel to HQ (~10%) for audits and assessments. Professional development and growth opportunities within a mission-driven, safety-focused team. Why Apply Through Jobgether? We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team. We appreciate your interest and wish you the best! 
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time. #LI-CL1
    $61k-108k yearly est. Auto-Apply 4d ago
  • Site Reliability Engineer (SRE) with strong Middleware expertise

    Hexaware Technologies, Inc. 4.2company rating

    Remote job

    What Working at Hexaware offers: Hexaware is a dynamic and innovative IT organization committed to delivering cutting-edge solutions to our clients worldwide. We pride ourselves on fostering a collaborative and inclusive work environment where every team member is valued and empowered to succeed. Hexaware provides access to a vast array of tools that enhance, revolutionize, and advance professional profile. We complete the circle with excellent growth opportunities, chances to collaborate with highly visible customers, chances to work alongside bright brains, and the perfect work-life balance. With an ever-expanding portfolio of capabilities, we delve deep into and identify the source of our motivation. Although technology is at the core of our solutions, it is still the people and their passion that fuel Hexaware's commitment towards creating smiles. “At Hexaware we encourage to challenge oneself to achieve full potential and propel growth. We trust and empower to disrupt the status quo and innovate for a better future. We encourage an open and inspiring culture that fosters learning and brings talented, passionate, and caring people together.” We are always interested in, and want to support, the professional and personal you. We offer a wide array of programs to help expand skills and supercharge careers. We help discover passion-the driving force that makes one smile and innovate, create, and make a difference every day. The Hexaware Advantage: Your Workplace Benefits · Excellent Health benefits with low-cost employee premium. · Wide range of voluntary benefits such as Legal, Identity theft and Critical Care Coverage · Unlimited training and upskilling opportunities through Udemy and Hexavarsity Who we are? At Hexaware Technologies, we are a leading global IT Services company, dedicated to driving digital transformation and innovation for businesses around the world. 
Founded in 1990, Hexaware has grown into a global trusted partner for enterprises, offering comprehensive AI empowered services including IT Consulting, Application Development, Infrastructure and Cloud Management and Business Process services. At Hexaware we are a community of creative, diverse, and open-minded Hexawarians creating smiles through the power of great people and technology. We pride ourselves on our people-centric culture and commitment to sustainability. Our diverse team of over 30,000 professionals across 30 countries is driven by a shared passion for innovation and excellence. We foster a collaborative environment where creativity and continuous learning are encouraged, enabling our employees to thrive and grow. Position: Site Reliability Engineer (SRE) with strong Middleware expertise Location: Plano TX- (5 Days onsite & 24x7 Rotational) Shift: Rotational (Shift 1 (8 AM - 5 PM), Shift 2 (4 PM - 1 AM), Shift 3 (12 AM - 9 AM)), including weekends based on the roster. Duties and Responsibilities: Job Summary: We are seeking a Site Reliability Engineer (SRE) with strong Middleware expertise to design, operate, and continuously improve highly available, secure, and scalable enterprise platforms. This role blends deep middleware operations (WebLogic, API gateways, Java platforms) with SRE principles such as automation, observability, SLIs/SLOs, error budgets, and incident reduction. The ideal candidate will partner with application, infrastructure, security, and DevOps teams to ensure platform reliability while driving automation, standardization, and operational excellence. 
Key Responsibilities: Reliability & SRE Practices: Define, implement, and track SLIs, SLOs, and error budgets for middleware and platform services Drive MTTR reduction, availability improvements, and operational resilience Lead incident response, root cause analysis (RCA), and post-incident reviews Implement proactive monitoring and alerting to reduce noise and prevent outages Middleware Platform Engineering: Administer and support enterprise middleware platforms including: Oracle WebLogic, Apache, NGINX API Gateways (Apigee Edge / X) Java application servers and JVM-based services Perform patching, upgrades, configuration tuning, and capacity planning Manage certificates, keystores, truststores, and TLS configurations Ensure platform security, compliance, and performance standards Observability & Monitoring: Design and maintain end-to-end observability using tools such as: Dynatrace, ELK/Kibana, Splunk (or equivalent) Build executive and operational dashboards for real-time health visibility Reduce alert fatigue through smart alerting, thresholds, and suppression Monitor JVM metrics, GC behavior, thread utilization, and API performance Automation & Infrastructure Efficiency: Develop automation and self-healing solutions using: Shell scripting, Python, Ansible, Terraform, or similar tools Automate routine operational tasks (restarts, validations, health checks) Enable CI/CD-friendly middleware deployments and configuration management Standardize environments across DEV / QA / UAT / PROD Cloud, Containers & Modern Platforms: Support middleware workloads on: Kubernetes / OpenShift Public or hybrid cloud environments (AWS, Azure, GCP) Integrate platform reliability into containerized and microservices architectures Collaborate with DevOps teams on deployment pipelines and release strategies Collaboration & Leadership: Act as a reliability advisor to application and development teams Partner with Unix/Linux, Database, Network, and Security teams Provide mentoring, 
documentation, and best-practice guidance Participate in on-call rotations and production support leadership Required Skills & Experience: Technical Skills: 6+ years of experience in Middleware / Platform Operations / SRE Strong expertise in WebLogic, Java middleware, Apache/NGINX Hands-on experience with observability platforms (Dynatrace, ELK, Splunk) Solid understanding of Linux/Unix systems and networking fundamentals Experience with API platforms (Apigee preferred) Automation and scripting skills (Shell, Python, Ansible, Terraform) Experience with Kubernetes/OpenShift and containerized workloads SRE & Operational Excellence: Practical experience implementing SRE principles in production Strong troubleshooting skills (thread dumps, heap analysis, GC logs) Experience with incident management, RCA, and change management Ability to balance reliability vs delivery velocity Nice-to-Have: Experience with cloud-native architectures and service meshes Knowledge of IAM / Security integrations (OAuth, SAML, mTLS) Exposure to CI/CD tools (Jenkins, GitHub Actions, GitLab CI) Experience supporting 24x7 enterprise environments ITIL or SRE certifications What you'll get from us: Insert US/employee benefits here e.g.: • Competitive Salary • Company Pension Scheme • Comprehensive Health Insurance • Flexible Work Hours and Hybrid Work Options • XX days paid annual holidays + public holidays. • Professional Development and Training Opportunities • Employee Assistance Program (EAP) • Diversity, Equity, and Inclusion Initiatives • Company Events and Team-Building Activities Equal Opportunities Employer: Hexaware Technologies is an equal opportunity employer. We are dedicated to providing a work environment free from discrimination and harassment. All employment decisions at Hexaware are based on business needs, job requirements, and individual qualifications. 
We do not discriminate based on race including colour, nationality, ethnic or national origin, religion or belief, sex, age, disability, marital status, sexual orientation, parental status, gender reassignment, or any other status protected by law. We encourage candidates of all backgrounds to apply.
    $76k-103k yearly est. Auto-Apply 8d ago
  • Process Engineer / Specialist (Instrumentation & Controls)

    Graymont 4.0company rating

    Remote job

    Process Engineer / Specialist (Instrumentation and Controls) Full-Time, Permanent Any state in which Graymont has operations The Process Engineer / Specialist (Instrumentation & Controls) provides technical expertise and ongoing support for process automation, control systems and remote operations systems across Graymont's network of lime manufacturing facilities. This role ensures safe and effective operation of Supervisory Control and Data Acquisition (SCADA) systems, Model Predictive Controls (MPC), High Performance Human Machine Interface (HPHMI), alarm management systems and portable devices. This position also supports continuous improvement initiatives in process performance and reliability. The role involves a high degree of collaboration with a diverse group of cross-functional stakeholders such as: Remote Operations, Facility Operations, Information Systems, HSE, Project Engineering and Maintenance. A high degree of understanding of manufacturing processes & equipment, product quality, control philosophies, and associated Graymont standards and industry best practices are essential. Responsibilities: * Ensure safe work practices and implement safeguards to mitigate hazards and prevent incidents. * Collaborate with relevant teams to ensure compliance with health, safety, environmental regulations, and quality standards. * Serve as a primary interface with Remote Operations and plant facilities, ensuring automation and controls function as designed and support continuous improvement initiatives. * Work closely and in collaboration with controls engineers, process engineers, integrators/contractors, Information Services, and Remote Operations as needed. * Maintain and enhance SCADA systems at Graymont facilities, including graphical HPHMI, alarm management, reporting, and effective communication with Remote Operations and plant facilities. 
* Test and support MPC systems and HPHMI, ensuring documentation is current and change management processes are followed. * Develop a thorough understanding of plant operations and existing systems (control, network, data management, etc.). * Analyze instrumentation, tune control loops, and optimize advanced and expert control systems (MPC). * Ability to work irregular hours or provide on-call support during project implementation or to address issues at plants or ROC operations. * Travel throughout North America (US & Canada) to provide hands-on support to sites. Qualifications: * Education: Bachelor's degree in a related engineering field: Controls and Instrumentation, Electrical, Mechatronics or a related field with a combination of controls and automation experience. * Professional Experience: Minimum 2 years of experience in controls and/or process engineering within the lime or a comparable industry. * Technology Requirements: Proficiency in control disciplines, including Model Predictive Control (MPC), Distributed Control Systems (DCS), Programmable Logic Controllers (PLC), Supervisory Control and Data Acquisition (SCADA) * Additional Assets: Experience with Ignition by Inductive Automation and Rockwell products (PLCs, drives, sensors, networks, etc.). * Travel: This position can require frequent travel based on business and operational needs throughout the U.S. and Canada and can be up to 50%. Who You Are: * Effective Communicator: You are an active listener who can communicate effectively with different audiences in diverse situations. * Collaborative: You thrive in a multi-disciplinary team environment and believe that we can get further, faster by working together. * Strong Work Ethic: You demonstrate reliability, responsibility, and a strong commitment to delivering high-quality results. 
Who We Are: Founded in 1948, Graymont is a trusted global leader in essential calcium-based solutions. Professionally managed and family-owned, we proudly serve a wide range of markets, customers, and communities in North America and Asia Pacific. Graymont is also the strategic partner of Grupo Calidra, the largest lime producer in Latin America. Graymont's strategy is anchored in its strong commitment to its core values of integrity, respect, teamwork, innovation, excellence, accountability, and long-term perspective. Central to our philosophy is a long-term approach to our business, built on a solid commitment to sustainable growth and focus on decarbonization, all of which is embodied in our mission statement: Contributing to a decarbonized world by providing essential lime and limestone solutions. If you're interested in exploring our current job opportunities, please visit us at ****************************
    $71k-94k yearly est. 36d ago
  • Quality Operations Process Engineer

    Brightspring Health Services

    Remote job

    Job Description PharMerica is seeking a seasoned Process Engineer with a strong background in pharmacy operations to drive continuous improvement and operational excellence across our pharmacy services. The ideal candidate will have 3-5 years of hands-on experience in process engineering. Experience working in a Long-Term Care (LTC) pharmacy environment is a plus. This role requires a strategic thinker with a passion for optimizing workflows, leveraging automation, and integrating emerging technologies such as Generative AI (GenAI), Large Language Models (LLMs), and Agentic AI. Remote opportunity. Applicants can live anywhere within the Continental USA. Travel: 25-50% Schedule: Monday - Friday, 8:00am - 5:00pm We offer: DailyPay Flexible schedules Competitive pay Shift differential Health, dental, vision and life insurance benefits Company paid STD and LTD Tuition Assistance Employee Discount Program 401k Paid-time off Tuition reimbursement Non-retail/Closed-door environment This position will be posted a minimum of 5 days Responsibilities Analyze existing pharmacy workflows and identify opportunities for standardization, process improvement, automation, and cost reduction Develop and maintain process maps, SOPs, and documentation to support operational consistency and compliance Lead Lean and Six Sigma initiatives to enhance efficiency, reduce waste, and improve service quality Collaborate with cross-functional teams including IT, operations, and clinical staff to implement innovative solutions Evaluate and integrate AI technologies (GenAI, LLMs, Agentic AI) to streamline decision-making, documentation, and customer service processes Monitor performance metrics and KPIs to assess the impact of process changes and drive data-informed decisions Support change management efforts and training programs to ensure successful adoption of new processes and technologies Qualifications Required Qualifications: Bachelor's degree in Engineering, Industrial Engineering, 
Pharmacy, or related field. 3-5 years of experience in process engineering, preferably in a pharmacy or healthcare setting. Proven expertise in Lean, Six Sigma, or other continuous improvement methodologies (Green Belt or higher preferred). Proficiency in process mapping tools (e.g., Visio, Lucidchart) and data analysis platforms (e.g., Excel, Power BI). Familiarity with automation technologies, GenAI, LLMs, and Agentic AI applications in operational settings. Strong analytical, problem-solving, and project management skills. Excellent communication and stakeholder engagement abilities. Preferred Qualifications: Experience in Long-Term Care (LTC) pharmacy operations. Exposure to regulatory compliance in pharmacy or healthcare environments. Experience with digital transformation initiatives or AI implementation in operational workflows. Key Competencies Strategic Thinking Innovation & Technology Adoption Process Optimization Cross-functional Collaboration Data-Driven Decision Making Change Management Travel Requirements: 25-50% travel
    $64k-84k yearly est. 6d ago
  • System Reliability Technician I

    Capital Metropolitan Transportation Authority 4.2company rating

    Remote job

    WHO WE'RE LOOKING FOR Ready to power up Austin, one bike at a time? The System Reliability Technician I is essential to keeping the Bikeshare program running smoothly, primarily by ensuring the system's bicycles are properly balanced throughout the network and e-bikes are kept charged to maximize system usability. Provide simple repairs in the field to bicycles and stations when possible and remove defective equipment to be repaired in the workshop. Evaluate the condition of assets and update their status, submit service requests for repairs or maintenance when needed, and engage with customers to offer friendly assistance. Provide support for special events throughout the community, including nights, weekends, and holidays. This position is responsible for maintaining a safe, inclusive, and fun environment that ensures Bikeshare is an enjoyable place to work for everyone. WHAT YOU BRING High school diploma or GED. Two (2) years of experience with bicycle repair and use preferred. Must have a valid driver's license for the past three years, Class B CDL preferred, and be able to complete a driver safety course provided by CapMetro within the first 90 days. Class A or B Misdemeanor - Disqualified if 7 years or less from date of conviction or deferred adjudication. Submit to CapMetro for review if between 7-10 years since conviction or deferred adjudication or more than 2 convictions in a lifetime. Class C Misdemeanor - Disqualified if more than 2 moving violations in the past 5 years (Any more than one driving safety course taken for a moving violation that appears on a five (5) year record will be treated as a moving violation and will count against the applicant). Ability to pass an agility test and physical exam. Knowledge, Skills, and Physical Abilities Ability to work well independently and in a team environment. Ability to communicate effectively with peers, management, and patrons. 
WORK ENVIRONMENT AND PHYSICAL DEMANDS Work is generally performed at the noted locations in which there is minimal exposure to unpleasant and/or hazardous working conditions using company vehicles to go to and from these locations. This position involves multi-tasking based on priorities and timelines that can change periodically based on the Company needs. To work in this position, you will need to be able to walk, bend, stoop, balance, crawl and reach for at least 8 hours a day; operate heavy equipment; work in varying weather conditions; lift a maximum of 80 pounds without assistance; be comfortable and able to climb up and down an 8-foot ladder; and to work independently without direct supervision from CapMetro Bikeshare facilities. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions as previously described. Incumbent may be subject to drug and alcohol testing in accordance with CapMetro policies and applicable law in the event of reasonable suspicion or an accident. Mobility Status: As a Circulating position, the incumbent has an assigned gathering space. This is usually an operations position. The employee is expected to come in daily for check-in and check-out processes but does not work in an individual workspace. WHAT YOU'LL BE DOING Note: The duties and primary responsibilities below are intended to describe the general content of and requirements of this job and are not intended to be an exhaustive statement of duties. Operate a non-revenue van, or other non-revenue vehicle, to move bicycles around the community based on the demand, charging and swapping batteries on e-bikes, and cleaning stations/bicycles (power washing, removing stickers, cleaning graffiti, etc.). Maintain and clean the battery charging locations, non-revenue vehicles, warehouse spaces, and other storage areas at the main facility to ensure the safety of the team and proper care of the program assets. 
Provide support for special events during a wide variety of hours and days that may involve shifting regular work hours. Support System Reliability Technician IIs and IIIs during complex operations or projects where support is needed such as station installations, branding of assets, system testing, or other support duties as assigned. Maintain Bikeshare facilities to a high standard of cleanliness, requiring disposal of debris and power washing areas around the stations and docks. Must operate in conditions using PPE (Personal Protective Equipment) in a Bio-Hazard environment. Always maintain a safe working environment. Takes initiative to resolve problems and/or present solutions with minimal instructions from management. Support Capital Metropolitan Transportation Authority's Safety Management Systems (SMS) process by ensuring staff follows safety and security policies, considers safety in every action, and ensures safety and security concerns are reported. Perform other position related duties as required and/or assigned.
    $42k-58k yearly est. Auto-Apply 60d+ ago
  • Plant Process Engineer

    Captiveaire 4.4company rating

    Remote job

    This role will be responsible for maintaining and improving production across various products and equipment at our West Union manufacturing facility. Why Work for CaptiveAire? Nation's leading manufacturer of commercial kitchen ventilation systems, and now offering a complete solution of fans, heaters, ductwork and HVAC equipment. Our primary purpose is to provide fully integrated, sustainable HVAC Systems. Leader in the industry for over 40 years with innovative technologies, unmatched service, competitive pricing, and rapid lead times. Mission: to provide the highest quality products and service to our users at the lowest possible price Strong commitment to the development of our employees, including continuous education opportunities like sponsorship for Professional Engineering license and continuous education through weekly webinars and company developed technical videos What our employees have to say: I love the mindset of continuous learning and pushing the bounds of your capabilities and knowledge. I love the people I work with and the environment, particularly in a world where remote work is common. I love how Captiveaire is all about connections, with customers, coworkers, end users, and everyone in between. What I truly admire about CaptiveAire is the company's unwavering commitment to innovation and continuous improvement… Equally impressive is the culture of open communication that exists throughout the organization. From my own experience, upper management never takes the stance that a task is "not their job." Instead, there's a shared understanding that every role is essential to the company's success. This mindset, combined with the transparent communication across departments and facilities, creates a collaborative environment that truly sets CaptiveAire apart. We want to stay on the cutting edge and so are constantly sourcing and utilizing the best equipment available. 
Any position can provide feedback that is listened to and incorporated into processes. Collaboration is key at CaptiveAire and so there is no being "Silo-ed" into one area. CaptiveAire is fast-moving and no-nonsense. We operate differently than any other company that I've worked for with our decentralized structure. Quick action is taken when a good idea is presented. We are focused on end users where the rest of the industry is very short-sighted. We are on the front lines, actively changing the landscape of the HVAC industry. A Day in the Life: No two days are ever the same in this role. Tasks can include: Training operators in proper process, tool usage, and tool maintenance. Collaborating with other engineering teams on process efficiency, quality, and safety improvement projects. Developing new designs using 3D modeling software like Solidworks for carts or production tools. Then work with the fab shops to complete the fabrication of them in a timely manner. Work closely with contractors to coordinate utility needs such as air and electrical drops, and oversee the installation of new equipment. Coordinate daily, weekly, and monthly preventative maintenance tasks on machinery. Troubleshoot machines whenever the team needs extra support. Provide hands-on support to production lines as needed. Complete hands-on testing for new process changes. From a Plant Process Engineer: I enjoy the constant problem solving and the drive toward innovation that keeps things exciting. I have had jobs in the past where the day could not end soon enough because I was ready to be done. This is not one of them. I find the job rewarding and find myself here past scheduled hours often not because I have to but because I want to. 
Primary Job Responsibilities: 95% of the workday will be on the manufacturing floor working on projects, operating and repair of equipment Involvement with equipment operators to ensure parts are programmed, training and any process related issues Responsible for working with Safety, Quality, and Industrial engineering teams on parts, assembly, and process improvements Implementation of Engineer Change Notices (ECN) and Process Change Instructions (PCI) Support production as needed, capable of building all products to the expected quality standards in the designated area of focus on an as needed basis Assist production staff with product related questions Work with Plant manager on various facility improvement projects Job Requirements: 0-10 yrs experience 4-year technical degree, in an electrical, electronics or mechanically oriented curriculum Must be strong in mechanical and electrical engineering. Internship/co-op experience is preferred Must enjoy hands on product exposure Electrical skills and experience needed AutoCAD or similar software a must Multi-tasking, problem solving and strong communication skills a must Strong emphasis on perfect product quality and to maintain a safe work environment Physical Requirements: Ability to work standing for 8 - 10 hours at a time Required to use ladder, forklift or other means to acquire parts for product assembly Able to use power & hand tools, as well as electrical testing and measuring equipment Ability to lift 35 to 50 pounds independently Benefits: Medical, dental and vision insurance Disability & life insurance based upon election of medical insurance 401k with employer match Paid holidays Paid time off (PTO) based upon tenure Flexible spending account (FSA) Tuition reimbursement, including for Professional Engineering (PE) License Relocation assistance Salary: $65k-$85k base, negotiable dependent on experience, with additional monthly bonus based on productivity and profits. Captive-Aire Systems, Inc. 
is proud to be an equal opportunity workplace. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, uniformed services, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law. To qualify, applicants must be legally authorized to work in the United States. At this time we are not able to consider applicants that require sponsorship, now or in the future, for employment visa status. This position is classified as a safety-sensitive position. Employees in this position are subject to drug and alcohol testing in accordance with CaptiveAire's Drug-Free Workplace policy.
    $65k-85k yearly Auto-Apply 27d ago

Learn more about reliability engineer jobs