Post job

Senior Reliability Engineer jobs at Oracle

- 3920 jobs
  • Senior Site Reliability Engineer - Cloud Automation (Oracle Health Cloud, Remote US)

    Oracle 4.6company rating

    Senior reliability engineer job at Oracle

    Senior Site Reliability Engineer - Cloud Automation (Oracle Health | Remote US) Make real-world impact at scale. Join Oracle Health to build a modern, automated healthcare platform that millions rely on. You'll design, automate, and operate secure, highly available cloud services-driving reliability, speed, and efficiency across our platform. What you'll do Own service reliability end-to-end: architecture, production operations, and on-call excellence Build automation and self-healing systems using IaC (e.g., Terraform) and CI/CD Design, implement, and evolve observability (metrics, tracing, logging) and SLO/error budgets Lead capacity planning, performance tuning, and cost/sustainability initiatives Develop tooling and services to improve scalability, availability, and developer productivity Partner with cross-functional teams to deliver features safely (canary/blue‑green, progressive delivery) Drive incident response, root-cause analysis, and prevention through automation Prototype and standardize platform services and best practices across teams Disclaimer: Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates. Range and benefit information provided in this posting are specific to the stated locations only US: Hiring Range in USD from: $86,400 to $199,500 per annum. May be eligible for bonus and equity. Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business. Candidates are typically placed into the range based on the preceding factors as well as internal peer equity. Oracle US offers a comprehensive benefits package which includes the following: 1. Medical, dental, and vision insurance, including expert medical opinion 2. Short term disability and long term disability 3. Life insurance and AD&D 4. Supplemental life insurance (Employee/Spouse/Child) 5. Health care and dependent care Flexible Spending Accounts 6. Pre-tax commuter and parking benefits 7. 401(k) Savings and Investment Plan with company match 8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation. 9. 11 paid holidays 10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours. 11. Paid parental leave 12. Adoption assistance 13. Employee Stock Purchase Plan 14. Financial planning and group legal 15. Voluntary benefits including auto, homeowner and pet insurance The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted. Career Level - IC4 What you'll bring US citizenship and the ability to obtain/maintain a federal security clearance Experience operating large-scale, distributed, fault-tolerant systems in production Strong scripting/programming (Python, Bash; Java/C++ a plus) Infrastructure as Code and automation (Terraform; Ansible/Chef/Puppet/Packer a plus) CI/CD pipelines and tooling (Git, GitLab/Jenkins/Rundeck) Cloud experience (OCI, AWS, Azure or similar) Deep knowledge of monitoring, alerting, incident management, and postmortems Solid grasp of networking, security fundamentals, and performance engineering Nice to have Experience in regulated or high-compliance environments Data/analytics and platform sustainability optimization Containers and orchestration (Kubernetes, Docker) Why Oracle Health Net-new business with startup energy and enterprise backing High ownership, high impact: shape platform reliability and automation from the ground up Mission-driven work improving healthcare through secure, scalable technology Remote role within the US Eligibility: Remote (US). US citizenship required; ability to obtain and maintain a federal security clearance. #LI-ND1
    $86.4k-199.5k yearly Auto-Apply 12d ago
  • Sr. Site Reliability Engineer (SRE)

    Avenue Code 3.5company rating

    Mountain View, CA jobs

    About the Opportunity: We're seeking an experienced, highly collaborative SRE to partner with product teams and tackle our most critical infrastructure challenges. You'll be hands-on in designing, building, and operating our cloud platform-and driving the reliability, performance, and security that empower our engineering organization. Responsibilities: Infrastructure as Code & CI/CD: Automate provisioning and deployments with Terraform and integrate best-practice pipelines (GitHub Actions, ArgoCD, etc.). Reliability Engineering: Define SLIs/SLOs, manage error budgets, and build dashboards & alerts to proactively measure and improve system health. Security & Compliance: Enforce least-privilege IAM policies, automate vulnerability scans, and maintain audit logging for compliance. Monitoring & Observability: Instrument services with metrics, logs, and distributed tracing to enable rapid troubleshooting, aid teams in alerting, custom metrics, and dashboarding Incident Management: Own on-call rotations, lead real-time incident response, conduct post-mortems, and drive continuous improvements. Cost Optimization: Implement tagging strategies, right-size resources, and leverage concrete data to decide on optimal methods to control cloud spend at scale. Documentation & Mentorship: Author runbooks, standards, and best-practice guides-and coach dev teams on implementing modern DevOps, reliability, and security patterns. Required Qualifications: Have 5+ years of experience running production critical systems. Deep proficiency with the AWS Cloud and Cloud-Native best practices. Experience with Kubernetes (EKS, GKE) and Container Orchestration at scale. Skilled in Terraform to declaratively provision and maintain infrastructure services. Working knowledge of managing and debugging databases like Redis and Postgres. Strong familiarity with VPC, VPN, Load Balancing, and cloud networking components. Proficiency with Git workflows, branching strategies, and CI/CD systemintegrations. Solid understanding of web and network protocols and standards (HTTP, REST, TLS, DNS, etc...) Professional proficiency in English (both written and spoken) is required for this role. Nice to Have Skills: Bachelor's degree, or equivalent in Computer Science, Engineering, or a related field. Experience with ArgoCD, GitHub Actions, Jenkins, or other CI/CD pipeline solutions. Working knowledge of Python, Golang, and Helm templating languages. Node.js experience a plus, including running scalable, resilient Node microservices. Grasp of foundational security best practices for cloud infrastructure. Awareness of Terragrunt, managing Terraform state, and optimal project structure. Seasoned in production readiness fundamentals amidst a fast-moving team. Avenue Code reinforces its commitment to privacy and to all the principles guaranteed by the most accurate global data protection laws, such as GDPR, LGPD, CCPA and CPRA. The Candidate data shared with Avenue Code will be kept confidential and will not be transmitted to disinterested third parties, nor will it be used for purposes other than the application for open positions. As a Consultancy company, Avenue Code may share your information with its clients and other Companies from the CompassUol Group to which Avenue Code's consultants are allocated to perform its services.
    $144k-188k yearly est. 1d ago
  • Senior Site Reliability Engineer (Dynatrace)

    Intraedge 3.9company rating

    Phoenix, AZ jobs

    Enterprise Observability Engineer Details Hands-on experience with design and implementation of observability frameworks. Dynatrace Managed and/or SaaS experience including hands on expertise with designing, instrumenting, and administering application performance monitoring. Hands-on experience with enterprise observability technologies from open source and/or leading vendors ie. Grafana, Open Telemetry, Splunk, Datadog, OpenSearch etc. Hands-on experience with implementing standard methodologies for monitoring, logging, and alerting across widely distributed infrastructure stacks. Hands-on experience with at least 1 programming language (Python, JavaScript, or Java) strongly preferred. Experience with Infrastructure provisioning, configuration and decommission automation - ideally using IaC patterns. Experience with Public Cloud (AWS, GCP or Azure). Experience with collaboration tools (GitHub, JIRA, Confluence, etc.). Experience with Dynatrace Query Language, automated build and test frameworks and container and container orchestration technologies a plus.
    $110k-146k yearly est. 5d ago
  • Site Reliability Engineer

    Matlen Silver 3.7company rating

    Alpharetta, GA jobs

    Title: Senior Cloud Security Engineer/Architect Environment: Onsite Duration: 6 month contract to hire Contract pay: $68-$90/hour W2 Conversion salary: $150k-$188k NO C2C ** Due to client requirements, US Citizen or GC Holder ONLY ** Requirements Minimum 13+ years of professional experience in Cloud Infrastructure, DevOps, or Site Reliability Engineering. Strong Infrastructure as Code (IaC) expertise with Terraform-hands-on experience creating and managing EKS clusters, repositories, and Terraform modules. Architect, implement, and manage Azure IaaS infrastructure encompassing VNets, subnets, network security groups, VPN gateways, CDNs, Traffic Manager, peering, custom routes, DNS, DHCP, and virtual appliances. Proven proficiency across Azure and/or AWS (multi-cloud experience preferred). Strong security mindset with practical experience in IAM, vulnerability remediation, encryption, and patching. Solid understanding of DNS, Docker, Kubernetes, and containerization best practices. Experience with Windows and Linux/Unix system and network administration (8+ years). Proficiency in one or more programming/scripting languages: Python, Go, Bash, or Ruby. Expertise in Terraform, Ansible, or Chef for automation and configuration management. Hands-on experience with cloud services (AWS, Azure, GCP) - including EC2, S3, Kubernetes, and serverless environments. Knowledge of networking fundamentals: DNS, firewalls, load balancing, and VPNs. Experience with container orchestration using Docker, Kubernetes, or OpenShift. Experience with monitoring and observability tools such as Prometheus, Grafana, Datadog, or New Relic. CI/CD pipeline development using Jenkins, GitLab CI, GitHub Actions, or CircleCI. Bonus: Experience with HashiCorp Vault and advanced Terraform module design. Deep understanding of access control, encryption standards, secure coding practices, and regulatory frameworks Skilled in incident management, root cause analysis, automation, and performance tuning. Understanding of SLOs/SLAs, system scalability, redundancy, and resilience best practices.
    $150k-188k yearly 1d ago
  • Site Reliability Engineer

    Bcforward 4.7company rating

    Jersey City, NJ jobs

    *Presently we are unable to sponsor and request applicants to apply who are authorized to work without sponsorship* (Can work only on W2) Below are the few details of the opportunity. Job Title: Software Engineering (SRE/DevOps/Windows Eng) Location: Jersey City, NJ 07310 - Onsite Duration: Contract to Hire Job Description: About Candidate: End to end - development, deployment, automation & monitor - using Automation CI/CD pipelines Working with SQL servers, oracle Most apps deployed on windows servers - (windows stack - deployment front end web servers, application servers and database servers) Manage vendor applications Experience with reporting Observability - is key - Graphana, dashboards, Dynatrace, SQL monitoring Agile Skills (required) - Windows PowerShell - scripting / APIs (post man, swagger) Automation - (jewls PL), this is an CI/CD process
    $86k-115k yearly est. 4d ago
  • Site Reliability Engineer

    Ascendum Solutions 4.5company rating

    Cincinnati, OH jobs

    On-site role, 5 days/week. Candidates should be eligible to work for any employer in the United States without needing Visa sponsorship. As a Site Reliability Engineer/DevOps Engineer, you will be responsible for ensuring the availability, performance, and reliability of Fulfillment Technology solutions to support an omni-channel strategy. You will work closely with the development, testing, and operations teams to design, implement, and maintain scalable, reliable, and efficient solutions for the production environment. You will also troubleshoot and resolve any issues that may arise in the production systems, using various tools and techniques such as monitoring, logging, alerting, automation, and incident management. You will also contribute to the continuous improvement of the DevOps practices and processes, such as CI/CD, configuration management, infrastructure as code, and cloud computing. You will have a strong background in software engineering, system administration, networking, and cloud technologies. You will also have excellent communication and collaboration skills, as well as a passion for learning new technologies and solving complex problems. Minimum Position Qualifications 4+ years of experience in the cloud SRE/DevOps/Infrastructure, or any related fields 4+ years experience working with databases, web applications and micro-services, event-driven applications, messaging systems, REST APIs and integrations, cloud, support tools, observability and containerization technologies. Knowledge of Java, Spring boot, Microservices, Kafka, Cassandra & SQL Server Proficiency in scripting languages such as Python / Shell scripting 1 year of experience managing System Observability tools (DynaTrace, ELK, PagerDuty, Datadog, Azure Monitor, Grafana, etc) Hands-on experience with GitActions for CI/CD automations Knowledge of Linux architecture, security, administration, performance monitoring/tuning, troubleshooting, and production operations Demonstrated skill in working in an Agile environment Demonstrated skill in working with multi-location global teams Proven ability to think and contribute at the strategic level Demonstrated knowledge of eCommerce, Fulfillment, or Retail Technology solutions Demonstrated written, oral and presentation/public speaking communication skills Desired Previous Experience/Education 4+ years of experience in designing/working in high volume eCommerce applications 2+ years of experience configuring and managing cloud infrastructure (Azure, AWS, GCP) 1 year of experience with technologies such as Apache Kafka, Azure Cosmos DB, Apache Cassandra, Ansible, Terraform, Docker and Kubernetes Experience with Nginx, HAProxy, Squid Experience with CI/CD pipelines using tools such as Jenkins, Spinnaker, Azure DevOps, TeamCity, etc. Proficient in implementing and managing RoyalTS or similar cross-platform remote management solutions, ensuring secure and efficient remote access and system administration across diverse environments. Responsibilities Partner and collaborate with application engineering, observability, and other support teams as well as our business operation partners and third parties (as appropriate) to prioritize, address and drive the resolution of issues and incidents that impact customer pickup or delivery domains Drive root-cause analysis of critical business and production issues to prevent future occurrences and review/approve potential solutions Lead Major Incident calls impacting the Pickup Fulfillment domain and provide clear, timely updates on status of service restoration to key stakeholders Work with the engineering teams to continuously implement and improve reliable and speedy build environments Increase automation to improve efficiency and quality Ensure traceability, observability, and retrievability of system behavior Build logging, monitoring, and alerting systems to identify bottlenecks and assist with debugging, analysis, and optimization in cloud, on-prem and store environments Craft solid and clearly explained designs, playbooks, and documentation Participate in an off-hours on-call rotation, and perform periodic off-hours work during maintenance windows
    $75k-99k yearly est. 3d ago
  • Site Reliability Engineer (Azure App Services)

    The Judge Group 4.7company rating

    Irving, TX jobs

    The Judge Group, a Technology, Talent & Learning Solutions company based in Wayne, PA, that helps professionals find top jobs with the nation's leading brands. We're looking to hire a Site Reliability Engineer (Azure App Services) for a Full-Time, permanent position based in Irving, TX. About the Role We are looking for a Site Reliability Engineer (SRE) with strong expertise in monitoring, debugging, and optimizing Azure App Services. This role ensures platform reliability, performance, and scalability. You will combine hands-on Azure experience with code-level debugging, observability best practices, and automation to prevent issues, reduce MTTD/MTTR, and deliver an exceptional experience for end-users. Qualifications Bachelor's degree in Computer Science, IT, or related field. Microsoft Azure Fundamentals (AZ-900) certification required. Proven SRE experience focusing on monitoring, debugging, and incident response. Hands-on experience with Azure App Services, Application Insights, and Azure Monitor. Skilled in Diagnose and Troubleshoot Tools, Kudu, and PowerShell scripting. Strong programming fundamentals with ability to read and troubleshoot .NET/C# and Angular code. Experience in on-call operations, incident response, and RCA writing. Bonus: Experience with Grafana/Prometheus, DataDog/Dynatrace, Azure Front Door, CDN, Function Apps, WebJobs, Service Bus, or Event Hub. Excellent communication, collaboration, and problem-solving skills. Additional Azure certifications are a strong plus.
    $84k-112k yearly est. 2d ago
  • Principal Reliability Engineer

    Raytheon 4.6company rating

    Lowell, MA jobs

    Country: United States of America Onsite U.S. Citizen, U.S. Person, or Immigration Status Requirements: Active and transferable U.S. government issued security clearance is required prior to start date. U.S. citizenship is required, as only U.S. citizens are eligible for a security clearance Security Clearance: Secret - Current At Raytheon, the foundation of everything we do is rooted in our values and a higher calling - to help our nation and allies defend freedoms and deter aggression. We bring the strength of more than 100 years of experience and renowned engineering expertise to meet the needs of today's mission and stay ahead of tomorrow's threat. Our team solves tough, meaningful problems that create a safer, more secure world. Life Cycle Engineering (LCE) is responsible for ensuring our products are safe, reliable, maintainable, and delivered on time. LCE comprises multiple disciplines that support engineering, our program offices, and our customers. These disciplines are involved throughout the entire life cycle of our products-from conception to deactivation. Our primary focus is product support, which includes the following key disciplines: Reliability, System Safety, and Supportability. To help drive this mission forward, Raytheon currently has an exciting opportunity for a Principal Reliability Engineer, within our Land and Strategic Missile Defense business area. Your work will play a key role in supporting Raytheon's mission of making the world a safer place. What You Will Do Provide reliability predictions for the Patriot missile Mentor team members and provide internal training Support design reviews Lead failure analysis and root cause and corrective action studies to assure customer satisfaction Provide technical direction across various stages of missile operations, including the disposition of field return assets, complex assembly and disassembly issues, and ensuring mission readiness for flight tests. Work effectively with various engineering teams (design, manufacturing, quality, test) to conduct test data analysis and trending Communicate complex technical information to cross-functional audiences, including customers and management. Qualifications You Must Have Typically requires a degree in Science, Technology, Engineering or Mathematics (STEM) and a minimum of 8 years of prior relevant experience. Experience with reliability predictions and/or missile scoring analysis. Qualifications We Prefer Ability to manage multiple technical projects and customer requests in a fast-paced operations environment Experience in gathering, analyzing and / or presenting data utilizing Database SQL Experience using scripts to organize data for trending analysis Experience developing and providing technical presentations to audiences including the U.S. and international customers Experience with National instruments LabView or LabWindows software packages, PXI, my DAQ, or Compact RIO hardware. Experience with system assembly, disassembly and test Experience with MS Office tools Experience with FRACAS (Failure Reporting, Analysis, and Corrective Action Systems) Proficiency in reliability modeling (e.g. Weibull analysis) Ability to conduct and oversee reliability testing (accelerated life testing, degradation testing, HALT/HASS) Expertise in various reliability methodologies (e.g. FMEA, FMECA, FTA, RCA) What We Offer Our values drive our actions, behaviors, and performance with a vision for a safer, more connected world. At RTX we value: Safety, Trust, Respect, Accountability, Collaboration, and Innovation. This position is eligible for relocation assistance! Learn More & Apply Now! Please consider the following role type definition as you apply for this role: This is a full-time, onsite position based in Tewksbury, MA. Onsite: Employees who are working in Onsite roles will work primarily onsite. This includes all production and maintenance employees, as they are essential to the development of our products. As part of our commitment to maintaining a secure hiring process, candidates may be asked to attend select steps of the interview process in-person at one of our office locations, regardless of whether the role is designated as on-site, hybrid or remote. The salary range for this role is 101,000 USD - 203,000 USD. The salary range provided is a good faith estimate representative of all experience levels. RTX considers several factors when extending an offer, including but not limited to, the role, function and associated responsibilities, a candidate's work experience, location, education/training, and key skills.Hired applicants may be eligible for benefits, including but not limited to, medical, dental, vision, life insurance, short-term disability, long-term disability, 401(k) match, flexible spending accounts, flexible work schedules, employee assistance program, Employee Scholar Program, parental leave, paid time off, and holidays. Specific benefits are dependent upon the specific business unit as well as whether or not the position is covered by a collective-bargaining agreement.Hired applicants may be eligible for annual short-term and/or long-term incentive compensation programs depending on the level of the position and whether or not it is covered by a collective-bargaining agreement. Payments under these annual programs are not guaranteed and are dependent upon a variety of factors including, but not limited to, individual performance, business unit performance, and/or the company's performance.This role is a U.S.-based role. If the successful candidate resides in a U.S. territory, the appropriate pay structure and benefits will apply.RTX anticipates the application window closing approximately 40 days from the date the notice was posted. However, factors such as candidate flow and business necessity may require RTX to shorten or extend the application window. RTX is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability or veteran status, or any other applicable state or federal protected class. RTX provides affirmative action in employment for qualified Individuals with a Disability and Protected Veterans in compliance with Section 503 of the Rehabilitation Act and the Vietnam Era Veterans' Readjustment Assistance Act. Privacy Policy and Terms: Click on this link to read the Policy and Terms
    $80k-103k yearly est. 1d ago
  • Site Reliability Engineer (no c2c)

    Theoris 3.8company rating

    Indianapolis, IN jobs

    Job Title: DevOps Support Engineer (Service Reliability Engineer) Industry: Pharmaceutical JOB DESCRIPTION: Theoris is seeking a Service Reliability Engineer (SRE) Consultant to provide day-to-day support, monitoring, troubleshooting, and issue resolution for the MD3 infrastructure. The consultant will work alongside and under the technical direction of client staff, focusing on ensuring the reliability and performance of enterprise vendor applications hosted in AWS and a proprietary Kubernetes platform. The ideal candidate is proactive, self-sufficient, and comfortable working in an ambiguous environment. This role requires collaborating with vendors, driving deployments, and contributing to ongoing support as applications move through deployment cycles and into production. RESPONSIBILTIES: Continuously monitor the health and performance of the MD3 infrastructure, including data observations, HPC, and LiveDesign tasks. Utilize monitoring tools such as ServiceNow, Splunk, and Grafana to detect and respond to incidents in real-time. Perform regular job queue checks and maintenance activities. Monitor the MD3 dashboard and community chats/channels for issues or alerts. Diagnose, troubleshoot, and resolve technical issues related to the MD3 infrastructure. Collaborate with DevOps engineers and technical teams to resolve incidents. Document and communicate root cause analyses and resolution steps. Develop and implement automation scripts to streamline monitoring and troubleshooting. Identify areas for improvement in the infrastructure and propose solutions to enhance reliability. Participate in post-incident reviews to address monitoring and support gaps. Work closely with the DevOps team to align with business goals and research needs. Provide updates on incidents and resolutions to stakeholders. Participate in regular standups and scrums to discuss ongoing issues. Build and share bi-weekly reports on MD3 infrastructure status and performance. Develop and maintain knowledge articles for the help desk (ServiceNow) and user FAQs. Ensure documentation is up-to-date and easily accessible for the support team and end-users. Establish SLAs based on current ITSM practices for incident and problem resolution. Ensure all incidents and problems are addressed within defined SLAs. Performance Optimization: Fine-tune applications and infrastructure to meet performance benchmarks. Capacity Planning: Anticipate growth needs to scale infrastructure and optimize resource utilization. Create and maintain runbooks for critical alerts. REQUIREMENTS: Experience working with AWS-hosted vendor applications and Kubernetes platforms. Strong background in monitoring, troubleshooting, and supporting cloud-based infrastructures. Proficiency with monitoring tools such as ServiceNow, Splunk, and Grafana. Experience collaborating with DevOps engineers and technical teams. Ability to develop automation scripts to improve support workflows. Excellent communication and collaboration skills. Self-sufficient, proactive, and comfortable in ambiguous environments. Experience working in agile environments and participating in standups and scrums. Experience working with enterprise vendor applications and customized deployments. Background in performance tuning and capacity planning. Familiarity with help desk knowledge management and incident response SLAs. About Theoris: Our goal is to Fuel Your Career! As a Theoris team member, you join a culture based on people-centered values and an environment that fosters both personal and professional growth. We build long-term relationships with our clients and our consultants. With over 30 years of building strong relationships in the industry, we're uniquely positioned to make the right connections. This knowledge is used to find the right job placement. Our recruiting teams are experts dedicated to the information technology and engineering staffing space and are highly respected by our client base. Best-In-Class-Benefits We are in the people business; treating people right is our ONLY priority. Theoris Services consultants are full-time employees with full benefits, including: Robust Health Insurance 401(k) plan PTO accrual Paid holidays Excellent cash-based referral program
    $63k-85k yearly est. 1d ago
  • Principal Reliability Engineer

    Raytheon 4.6company rating

    Salem, NH jobs

    Country: United States of America Onsite U.S. Citizen, U.S. Person, or Immigration Status Requirements: Active and transferable U.S. government issued security clearance is required prior to start date. U.S. citizenship is required, as only U.S. citizens are eligible for a security clearance Security Clearance: Secret - Current At Raytheon, the foundation of everything we do is rooted in our values and a higher calling - to help our nation and allies defend freedoms and deter aggression. We bring the strength of more than 100 years of experience and renowned engineering expertise to meet the needs of today's mission and stay ahead of tomorrow's threat. Our team solves tough, meaningful problems that create a safer, more secure world. Life Cycle Engineering (LCE) is responsible for ensuring our products are safe, reliable, maintainable, and delivered on time. LCE comprises multiple disciplines that support engineering, our program offices, and our customers. These disciplines are involved throughout the entire life cycle of our products-from conception to deactivation. Our primary focus is product support, which includes the following key disciplines: Reliability, System Safety, and Supportability. To help drive this mission forward, Raytheon currently has an exciting opportunity for a Principal Reliability Engineer, within our Land and Strategic Missile Defense business area. Your work will play a key role in supporting Raytheon's mission of making the world a safer place. What You Will Do Provide reliability predictions for the Patriot missile Mentor team members and provide internal training Support design reviews Lead failure analysis and root cause and corrective action studies to assure customer satisfaction Provide technical direction across various stages of missile operations, including the disposition of field return assets, complex assembly and disassembly issues, and ensuring mission readiness for flight tests. Work effectively with various engineering teams (design, manufacturing, quality, test) to conduct test data analysis and trending Communicate complex technical information to cross-functional audiences, including customers and management. Qualifications You Must Have Typically requires a degree in Science, Technology, Engineering or Mathematics (STEM) and a minimum of 8 years of prior relevant experience. Experience with reliability predictions and/or missile scoring analysis. Qualifications We Prefer Ability to manage multiple technical projects and customer requests in a fast-paced operations environment Experience in gathering, analyzing and / or presenting data utilizing Database SQL Experience using scripts to organize data for trending analysis Experience developing and providing technical presentations to audiences including the U.S. and international customers Experience with National instruments LabView or LabWindows software packages, PXI, my DAQ, or Compact RIO hardware. Experience with system assembly, disassembly and test Experience with MS Office tools Experience with FRACAS (Failure Reporting, Analysis, and Corrective Action Systems) Proficiency in reliability modeling (e.g. Weibull analysis) Ability to conduct and oversee reliability testing (accelerated life testing, degradation testing, HALT/HASS) Expertise in various reliability methodologies (e.g. FMEA, FMECA, FTA, RCA) What We Offer Our values drive our actions, behaviors, and performance with a vision for a safer, more connected world. At RTX we value: Safety, Trust, Respect, Accountability, Collaboration, and Innovation. This position is eligible for relocation assistance! Learn More & Apply Now! Please consider the following role type definition as you apply for this role: This is a full-time, onsite position based in Tewksbury, MA. Onsite: Employees who are working in Onsite roles will work primarily onsite. This includes all production and maintenance employees, as they are essential to the development of our products. As part of our commitment to maintaining a secure hiring process, candidates may be asked to attend select steps of the interview process in-person at one of our office locations, regardless of whether the role is designated as on-site, hybrid or remote. The salary range for this role is 101,000 USD - 203,000 USD. The salary range provided is a good faith estimate representative of all experience levels. RTX considers several factors when extending an offer, including but not limited to, the role, function and associated responsibilities, a candidate's work experience, location, education/training, and key skills.Hired applicants may be eligible for benefits, including but not limited to, medical, dental, vision, life insurance, short-term disability, long-term disability, 401(k) match, flexible spending accounts, flexible work schedules, employee assistance program, Employee Scholar Program, parental leave, paid time off, and holidays. Specific benefits are dependent upon the specific business unit as well as whether or not the position is covered by a collective-bargaining agreement.Hired applicants may be eligible for annual short-term and/or long-term incentive compensation programs depending on the level of the position and whether or not it is covered by a collective-bargaining agreement. Payments under these annual programs are not guaranteed and are dependent upon a variety of factors including, but not limited to, individual performance, business unit performance, and/or the company's performance.This role is a U.S.-based role. If the successful candidate resides in a U.S. territory, the appropriate pay structure and benefits will apply.RTX anticipates the application window closing approximately 40 days from the date the notice was posted. However, factors such as candidate flow and business necessity may require RTX to shorten or extend the application window. RTX is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability or veteran status, or any other applicable state or federal protected class. RTX provides affirmative action in employment for qualified Individuals with a Disability and Protected Veterans in compliance with Section 503 of the Rehabilitation Act and the Vietnam Era Veterans' Readjustment Assistance Act. Privacy Policy and Terms: Click on this link to read the Policy and Terms
    $69k-88k yearly est. 1d ago
  • Site Reliability Engineer-- SHUDC5697371

    Compunnel Inc. 4.4company rating

    Dallas, TX jobs

    Job Title: Site Reliability Engineer - W2 only-We can not provdie sponsorship for this role. Duration: Long Term Shift: M-F 8-5pm On-Call 8AM to 8PM Monday - Sunday rotation Skills: - Production Support - Python or Java and/or JavaScript - Cloud Computing, DevOps concepts including CI / CD Pipelines - Hands-on Kubernetes - Observability tools (Datadog, Splunk etc). The Expertise You Have Bachelor's Degree in Computer Science, Information Science, (or equivalent) Minimum 8 years of software engineering experience Hands-on Linux experience preferred (user and permissions management, file systems, performance tuning) Hands-on Experience in AWS compute and storage services (AWS Lambda, S3, Glue, Route 53 & IAM), Kubernetes Observability skills such as Datadog, Splunk, SLI/SLO and other tools. Some databases and using SQL (like Oracle, MySQL, Postgres, or Dynamo DB) required Proficiency with Data Processing and ETL (Control-M & Informatica) Experience with tools like: GIT, Maven, Jenkins, uDeploy, JIRA, Artifactory, Sonar. Proficiency with scripting languages like Python, Java, Bash or Power Shell preferred Proven experience with CI/CD pipelines using technology such as Groovy, Jenkins, JenkinsCore and Urbancode Deploy preferred Experience with ITSM (Incident, Change & Problem Management) Automation skills Excellent at communicating and building relationships across teams and technology partners Confidence to work independently and with minimum supervision The Value You Deliver Work on the dev team to complete agile workflows Share knowledge while also learning from other expert engineers Relationship builder, participate in complex projects, sometimes across multiple squads, business partners and external vendors Effectively represent projects and team efforts at business venues, user groups, and to organizational leaders Actively participate in development and migration to cloud-centric solutions and participate in research efforts Work effectively in unstructured environments, anticipating impact of changes and providing quick resolution for high impact incidents Think creatively to identify and develop innovative, secure solutions which may not conform to traditional practices
    $82k-107k yearly est. 2d ago
  • Site Reliability Engineer II

    Optomi 4.5company rating

    Dallas, TX jobs

    *6-month Contract to Hire* Hybrid: 2x a week onsite in Dallas, TX Optomi, in partnership with a leading financial services company, is seeking a highly skilled, hands-on Site Reliability Engineer to join our client's engineering team for a hybrid role! This role is focused on building, optimizing, and supporting cloud-native systems. The ideal candidate brings strong engineering fundamentals, experience in Azure, and the ability to improve reliability through automation, infrastructure as code, and modern DevOps practices. Key Responsibilities: Hands-On Engineering Design, build, and maintain scalable, reliable, cloud-native systems with a strong emphasis on engineering over operations. Drive automation, performance tuning, and resilience improvements across services and infrastructure. Cloud & Infrastructure Work extensively with Azure Cloud Services, including: Service Bus EventHub SQL Server AKS (Azure Kubernetes Service) Function Apps App Services Implement and manage infrastructure using Terraform. Containerization & Development Build and deploy containerized applications using Docker (2-3 years of experience required). Leverage a strong .NET development background to improve system reliability and automate engineering workflows. DevOps & CI/CD Use Azure DevOps (ADO) to build, maintain, and optimize CI/CD pipelines and release processes. Monitoring & Observability Implement and support observability solutions; experience with Splunk Observability Cloud is strongly preferred. Experience & Qualifications: 3-5+ years of direct Site Reliability Engineering experience. Bachelor's degree in a related field or equivalent practical experience required. Master's degree in a related field is preferred.
    $87k-123k yearly est. 4d ago
  • Site Reliability Engineer

    Optomi 4.5company rating

    Irving, TX jobs

    Optomi, in partnership with our client, are seeking an experienced SRE II to join their team for a 6 month contract to hire opportunity that is 2 days hybrid onsite in Irving, TX. W2 only - no C2C/sponsorship at this time. We are seeking a highly skilled Site Reliability Engineer II to join our engineering organization. This role focuses on building resilient, scalable, and automated systems-not traditional production support. The ideal candidate has hands-on engineering experience across cloud infrastructure, observability, automation, and reliability-focused development. You will work closely with development, cloud engineering, and platform teams to ensure high availability, optimal performance, and operational excellence of critical customer-facing applications. Key Responsibilities Contribute directly to the reliability, scalability, performance, and security of critical applications. Build reusable services, automation, and frameworks that improve platform stability and developer velocity. Cloud & Platform Engineering Design and enhance cloud infrastructure using Azure services including: Azure Service Bus Event Hub Azure SQL AKS (Azure Kubernetes Service) Function Apps App Services Implement and manage Infrastructure as Code (IaC) using Terraform. Containerization & Orchestration Build and deploy containerized applications using Docker (2-3+ years). Support Kubernetes workloads via AKS, including scaling, upgrades, and cluster reliability improvements. Development & DevOps Collaborate with development teams using a working knowledge of .NET. Improve CI/CD workflows using Azure DevOps (ADO). Monitoring, Observability & Incident Response Implement and optimize monitoring and alerting strategies. Use Splunk Observability Cloud (preferred) or equivalent observability platforms to enhance visibility and reduce MTTR. Drive proactive incident identification, root-cause analysis, and long-term fixes. Performance, Reliability & Scalability Enhancements Design and implement SLOs, SLIs, and error budgets. Develop auto-scaling policies, failover strategies, and disaster recovery procedures. Optimize application and database performance to ensure reliability across high-traffic, mission-critical systems. Required Qualifications 3-5+ years of hands-on SRE experience Bachelor's degree in Computer Science, Engineering, or a related technical field (or equivalent experience) Master's degree preferred Hands-on experience with: Azure Cloud (AKS, Service Bus, Event Hub, SQL, Function Apps, App Services) Terraform Docker Azure DevOps Monitoring tools (Splunk Observability Cloud preferred) .NET ecosystem (understanding of development fundamentals) Preferred Skills Experience designing resilient, distributed systems Strong troubleshooting and analytical skills Performance tuning across applications, databases, and cloud services Experience improving uptime, latency, throughput, or cost efficiency of production applications Familiarity with SRE principles and modern operational practices
    $87k-122k yearly est. 4d ago
  • Quality Engineer with Automation (Only W2 resources)

    Tek Leaders Inc. 3.9company rating

    Cincinnati, OH jobs

    Expert-level automation experience in: Selenium for UI validation Playwright for modern and responsive web interfaces. Karate Framework for REST and SOAP API validation testing (using tools such as Postman, SoapUI, or Rest Assured). Experience testing in Retail Point-of-Sale (POS) environments or similar retail systems. Git, Jenkins, CI/CD pipelines, and integration with test management tools (e.g., QMetry, JIRA, Zephyr). BDD/TDD, data-driven testing, and mocking service layers. coding/scripting skills in Java, JavaScript, or Python.
    $61k-80k yearly est. 1d ago
  • Quality Validation Engineer (USC/GC/EAD)

    Raas Infotek 4.1company rating

    San Diego, CA jobs

    Hii, Hope you are doing well. I have an immediate requirement, please let me know if you are interested in this role . Job Title : Quality Validation Engineer (USC/GC/EAD) Mode : Contract Type : C2C/W2 Job Description : Product software as a medical device - verification and validation activities for new products (Quality Engineer supporting the R&D team). Medical device - R&D product software . resource shall have a background in the medical/pharma domain. resource shall have product software validation experience and a minimum of 2 to 3 years of experience in Quality. shall have experience as a Software Quality Engineer or Validation Engineer and Quality Engineer. Job Summary: Provides technical and quality system guidance related to establishing product software as a medical device requirements. Provide quality oversight for product software as a medical device verification and validation activities for new products in accordance with design planning procedures. This includes, but is not limited to, reviewing and approving software test case protocols and reports, review of software development plans, and review of other system and software documentation. Leads meetings to prioritize, review and/or approve of action plans for addressing issues captured in problem resolution systems during development. Leads risk evaluation and associated management activities related to product software development including Risk assessments (e.g. FMEA), product risk analysis, and mitigation of software issues. Participates in technical and management reviews to ensure design plans, product design and deliverables related to product software are met. Represent the quality engineering function for the review and approval of designated design controls. May provide quality oversight for non-product software validation by assessing the need for validation and preparing and/or supporting protocols, reports and other documentation as required. May be involved with supporting product cybersecurity assessments in conjunction with a cross-functional team Complies with US FDA regulations, other country regulatory requirements, company policies, and procedures. Maintains a strong, collaborative partnership with cross functional team members especially with software supplier. Works as an individual contributor and may provide guidance of other QE team members. -- -- Thank you, Deepak Singh Email: ****************************
    $78k-103k yearly est. 5d ago
  • Computer Serialization Validation Engineer

    Sharp Services 4.5company rating

    Allentown, PA jobs

    This role is responsible for the development and implementation of computer validation activities related to computerized GMP systems and technology within commercial and clinical Sharp locations. Utilization of a system development lifecycle approach, applying industry guidance (USP, EMEA, HC, ICH) with knowledge of 21CFR-part 11 requirements to ensure compliance of all systems. Working with Engineering, IT, Technical Services, Project Management, Operations, Sales and Quality Assurance to develop and implement serialization technology solutions and computer systems validation across multiple platforms to meet client needs and industry standards. Scope of the position includes Allentown, Bethlehem, Conshohocken and Macungie. Primary location will be dependent on location of individual at time of hiring with the expectation that commuting between locations can/will be required. ESSENTIAL DUTIES AND RESPONSIBILITIES: The following is a list of minimum responsibilities related to this position. Other duties may be assigned. Support the computer validation program by contributing to development of validation approach, design and execution. Development and implementation of CSV master plan. Develop and write IQ/OQ/PQ protocols and complete validation activities that may include: Requirements analysis Traceability Matrix Summary report CSV Assessments for appropriate equipment and systems within the guidelines of cGMP. Execution of protocols at designated sites as needed. Responsible for communicating computer validation approach and requirements with customers and internal staff. Responsible for supervising the execution of validation activities at designated Sharp facility which involves but not limited to serialized packaging, and computer systems validation including environmental monitoring systems, quality systems, networking, and baseline equipment qualifications. Participate in customer/regulatory audits specific to computer validation activity at all Sharp facilities at the direction of the CSV Supervisor. Review executed protocols and write final reports as required. Provide support for customer audits and external regulatory audits specific to computer validation activity at all Sharp facilities at the direction of the CSV Supervisor. EDUCATION and/or EXPERIENCE: Bachelor's degree in technical discipline (BS/BA) from a four-year college or university preferred with five to seven years related experience and or training; or equivalent combination of education and experience. Knowledge of FDA regulations including cGMPs, current industry practice and computer validation guidance documents including 21CFR-part11. Knowledge and understanding of quality engineering, operations and validation principles and practices. Ability to structure validation protocols in conformance with a planned validation approach is required. Familiarity with ISO 9000 (beneficial)
    $68k-86k yearly est. 5d ago
  • Manufacturing Diagnostics Engineer

    Comrise 4.3company rating

    Tuscaloosa, AL jobs

    Company is helping our client find a Manufacturing Diagnostics Engineer to support the bring-up, validation, testing, and troubleshooting of the company's products, as well as the development and implementation of diagnostic systems and infrastructure. In this role, you'll develop, refine, and scale test processes for the electrical and firmware systems of the company's primary drive module subassembly, which is ultimately assembled onto the vehicle. The ideal candidate has strong debugging and diagnostic skills, hands-on experience with automotive networking and firmware testing, and enjoys working cross-functionally in a fast-paced manufacturing environment. As a Manufacturing Diagnostics Engineer, you will: Serve as the first line of defense for electrical, firmware, and functional issues on the drive module manufacturing assembly, including software, firmware, harnessing, networking (CAN, LIN, Automotive & Standard Ethernet, GMSL), hardware, and infrastructure. Troubleshoot and resolve electrical and firmware issues at the manufacturing supplier (e.g., flashing firmware, updating test sequences, repairing harness connectors, terminals, and wires). Work closely with multidisciplinary engineering teams to interpret component- and vehicle-level requirements and translate them into scalable, system-level validation test scripts and cases.7 Develop work instructions, troubleshooting guides, and workaround documentation for operators on the manufacturing line. Design and implement next-generation diagnostic architectures to support higher-volume production in future manufacturing lines. Daily tasks Responsibilities Serve as the first line of defense for all electrical and functional issues on the drive module manufacturing assembly including but not limited to software / firmware, harness, networking (CAN, LIN, Automotive and Standard Ethernet, GMSL, etc.), hardware, and infrastructure issues. Resolve electrical and firmware issues at manufacturing supplier by flashing firmware, updating test sequences, repairing harness connectors / terminals / wires, etc. Work closely with multidisciplinary engineers to collect and interpret the component and vehicle level requirements and translate them into scalable system-level validation test scripts and test cases Develop work instructions and troubleshooting / workaround guides for operators on the Manufacturing line Design and implement future manufacturing lines involving the next generation diagnostic architecture to support higher volume production Required skills Qualifications Bachelor's degree in a relevant area such as Electrical Engineering or Computer Science. Very strong troubleshooting and debugging skills and experience, including reading, troubleshooting, and injecting packets over various network protocols such as CAN, LIN, and Ethernet 2-4 years of experience with automotive controller integration testing or test script development / programming in high-level languages such as Python Experience with Github or similar tools for software management Strong background in Linux and shell / bash / terminal scripting Bonus Qualifications Master's degree in a relevant area such as Computer Science or Engineering Experience with integration and automation of manufacturing and test equipment Background in creating, reading, and writing to tables in SQL database Experience with computer network engineering, TCP / IP protocols, communication over API Business driver of role As a Manufacturing Diagnostics Engineer, you will be a part of the team that executes processes to bring up, validate, test, and troubleshoot the company products as well as implements diagnostic systems / infrastructure. You will develop and improve our processes for testing the electrical and firmware systems of the primary drive module subassembly that is ultimately assembled onto the company vehicle.
    $67k-89k yearly est. 2d ago
  • Manufacturing Engineer, OFP

    The Judge Group 4.7company rating

    Waterloo, IA jobs

    Duration: 24 months with possible extension About the Role As a Manufacturing Engineer, you will plan, coordinate, and execute manufacturing engineering activities for projects or processes within the enterprise product delivery or order fulfillment process. You will work in a team environment to improve manufacturing systems, ensure quality standards, and support continuous improvement initiatives. This role involves hands-on work in a factory setting and requires strong problem-solving and decision-making skills. Required Qualifications Bachelor's degree in Industrial Technology or Engineering (or equivalent). Ability to work onsite in a factory environment and travel between facilities as needed. Comfortable spending ~70% of time on feet and operating factory vehicles Preferred Qualifications Recent graduates with manufacturing internships are welcome. Software programming Hands-on manufacturing environments Strong decision-making and problem-solving skills.
    $57k-75k yearly est. 4d ago
  • Process Engineer

    Talent Software Services 3.6company rating

    Maple Grove, MN jobs

    Are you an experienced Process Engineer with a desire to excel? If so, then Talent Software Services may have the job for you! Our client is seeking an experienced Process Engineer to work at their company in Maple Grove, MN. This position offers the opportunity to support process development and product commercialisation throughout our global plant network. You will work cross-functionally to lead process development of complex technologies on new products, drive/develop new business activities and rapid prototyping execution and ensure efficient and effective transfer of products into production. We are looking for a highly motivated and driven individual who can solve complex, technical problems in both a hands-on manner and a team setting. This opportunity will require you to work in a fast-paced environment across multiple functions in a global company. You will leverage technical and collaboration skills alongside a passion for innovation and continuous improvement to drive growth through efficient and effective commercialisation of new products and targeted improvements. Primary Responsibilities/Accountabilities: Partners with R&D to develop design specifications, draft test methods, and drive material selection. Provides Design for Manufacturability (DFM) input to the engineering print package. Applies technical knowledge to innovate, design, and develop processes, procedures, tooling and automation. Contributes creative, innovative ideas and solutions to solve complex technical problems. Proposes or investigates new technologies to pave a path for future business. Oversees development builds as well as initial production builds associated with the project using special work requests (SWRs). Trains and/or provides work direction to technicians and engineers, and may train manufacturing personnel when required as part of a validation. Assesses process capabilities, prioritises process improvement opportunities, and innovates and implements process improvements on platform or derivative projects. Prepares and presents technical data and recommendations to project stakeholders at technical reviews to influence business decisions. Writes and reviews validation protocols and reports applicable to new processes. Executes the functional deliverables associated with the PDP/TDP and Quality System. Ensures proper documentation is completed to meet quality systems requirements (e.g. print package, drawing trees, BOMs, routers, process risk documentation, SWRs, process flowcharts, process characterisation documentation, validation plans, process verification and validation documentation, including TMV and OQ/PQ, etc.). Collaborates with operations counterparts in manufacturing engineering and quality engineering to prepare for product transfer from development to production for commercialisation. Qualifications BS in Engineering and 2-4 years of experience. Strong engineering fundamentals, conflict resolution experience, and problem-solving skills. Demonstrated ability to develop and drive creative, innovative solutions for both processes and products. Demonstrated cross-functional teamwork and collaboration in a technically demanding environment. Strong communication and time management skills. Open to travel up to 10% of the time, which may include vendor visits, supplier visits, or conferences. Preferred: Experience developing and characterising various types of extruded processes, such as free extrusion, over the core, coextrusion, over braid, reel to reel, multi-lumen, and bump extrusion. Experience working with polymers - particularly Pebax, Nylon, PEEK, and Urethanes. Familiarity with how extrusion inputs can impact outputs. Demonstrated ability to develop equipment/fixtures from concept to prototype to production. Demonstrated use of DFSS tools (DOE, problem solving). Experience with mechanical design, automation and controls, and programming. Strong understanding of statistics and experience using statistical analysis software tools. Medical device or other regulated industry experience, including an understanding of the quality system. Experience as a technical project lead or responsibilities such as coordinating teams, encouraging cr
    $67k-85k yearly est. 5d ago
  • Site Reliability Engineer

    Oracle 4.6company rating

    Senior reliability engineer job at Oracle

    This role aligns to work done for the US Federal Government and requires US citizenship among other qualification outlined below. Including a Federal Investigation into your background to gain Public Trust. RTHS DevOps is responsible for the CareAware Cloud Saas across all our cloud regions internal and client facing. The team is responsible for keeping the lights on as well as other needed deployments, projects, and new implementations. As a member of the RTHS DevOps team you will be responsible for daily operational tasks required to run it for all our cloud clients. You will monitor and maintain server performance, availability, and ensure compliance to Service Level Agreements. You will address operational systems issues as needed. You will deploy new code, onboard new clients or new solutions and complete technology upgrades. As we move into the future projects, we have critical involvement in our OCI cloud build out and client migrations giving an opportunity to get involved from the ground of these new regions and apply dev ops thinking from the beginning. Disclaimer: Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates. Range and benefit information provided in this posting are specific to the stated locations only US: Hiring Range in USD from: $79,800 to $178,100 per annum. May be eligible for bonus and equity. Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business. Candidates are typically placed into the range based on the preceding factors as well as internal peer equity. Oracle US offers a comprehensive benefits package which includes the following: 1. Medical, dental, and vision insurance, including expert medical opinion 2. Short term disability and long term disability 3. Life insurance and AD&D 4. Supplemental life insurance (Employee/Spouse/Child) 5. Health care and dependent care Flexible Spending Accounts 6. Pre-tax commuter and parking benefits 7. 401(k) Savings and Investment Plan with company match 8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation. 9. 11 paid holidays 10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours. 11. Paid parental leave 12. Adoption assistance 13. Employee Stock Purchase Plan 14. Financial planning and group legal 15. Voluntary benefits including auto, homeowner and pet insurance The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted. Career Level - IC3 As a member of the RTHS DevOps team you will be responsible for daily operational tasks required to run it for all our cloud clients. You will monitor and maintain server performance, availability, and ensure compliance to Service Level Agreements. You will address operational systems issues as needed. You will deploy new code, onboard new clients or new solutions and complete technology upgrades. As we move into the future projects, we have critical involvement in our OCI cloud build out and client migrations giving an opportunity to get involved from the ground of these new regions and apply dev ops thinking from the beginning. Qualifications: Deep Linux Knowledge Strong knowledge of Kubernetes System Monitoring and troubleshooting Networking Monitoring and troubleshooting Cloud experience OCI or AWS preferred
    $79.8k-178.1k yearly Auto-Apply 60d+ ago

Learn more about Oracle jobs

View all jobs