Reliability Engineer jobs at Booz Allen Hamilton - 27 jobs
Remote Site Reliability Engineer - Build Resilient Systems
Booz Allen Hamilton 4.9
Reliability engineer job at Booz Allen Hamilton
A leading consulting firm in the U.S. is seeking a Site Reliability Engineer skilled in building resilient infrastructure and automating processes. You will lead teams, optimize systems, and implement monitoring tools. The ideal candidate has extensive experience in cloud technologies, Unix/Linux, and application troubleshooting, along with a master's degree or equivalent experience. This role offers a competitive salary range between $99,000 and $225,000 annually, with a flexible work model.
#J-18808-Ljbffr
$99k-225k yearly 2d ago
Looking for a job?
Let Zippia find it for you.
Site Reliability Engineer DevOps | REMOTE (US Citizenship required)
Oracle 4.6
Columbus, OH jobs
The Oracle Analytics Service Excellence (OASE) team within Oracle Analytics Cloud (OAC) is responsible for developing tools, technologies, processes, and driving change/ improvement through data driven decisions. You will work alongside software development teams to ensure service and feature parity for government customers. OASE is focused on building cloud based technologies managing, improving, maintaining, and evolving the cloud services offerings; as well as automating processes, developing code and services, operating, and reporting on internal applications that support the growth of the business, education of partners, and ensuring compliance across the breadth of Oracle Analytics.
**Responsibilities**
**Roles and responsibilities:**
- Perform DevOps activities to support customers, engineers, and processes through our release cycles as well as production
- Respond to incidents, troubleshoot issues and drive to completion, driving and participate in root cause analysis
- Perform change management activities ensuring environment operates on latest versions and configurations
- Provide full stack support to Oracle Analytics Cloud including
- Become expert in Analytic services, to prevent, resolve customer issues effectively and prevent regressions and repeats
- Document various processes & run books; update existing processes
- Execute, with excellence, delivery of interim patches and hot-fixes as required
- Work with various teams to take ownership of and resolve service failure/outages
- Monitor metrics and develop ways to improve the CI and CD tools utilized by the team
- Follow all best practices and procedures as established by the company
- Mentor and train other engineers and seek to continually improve processes
- Participate in 24 x 7 DevOps model with on call rotations for daytime, nighttime, and weekends
- Other duties as assigned
**The candidate requires these attributes:**
- A BS or MS in Computer Science, or equivalent
- Providing cloud networking, infrastructure, and service support, configuration, operations, tools, and processes
- Understand networking, and TCP/IP fundamentals and services such as DNS, HTTP, etc.
- Linux/Unix system administration including system level knowledge of Linux on OCI Gen 2, creating and executing scripts
- Experience developing & operating cloud services or large distributed applications in production
- Methodical approaches to troubleshooting and solving complex technical problems, reverse engineering existing applications
- Producing documentation in support of developed work (KBs, run books, help guides)
- Utilizing agile methodologies
- Communicating effectively in a team environment
- Working with remote and global teams
- Working independently and in a self-directed manner
- Able to work extended week day and week-end shifts as required for on-call, after hours upgrades, and other duties as assigned
**The ideal candidate for this position will have the following attributes:**
- Experience supporting multiple cloud services, troubleshooting of customer requirements and product capability, and reporting root cause for product fixes
- Knowledge of Oracle Analytics server, BI publisher, Oracle Analytics Cloud; Oracle database, Oracle Autonomous DB, MySQL (experience with MS SQL and/or NoSQL is a plus)
- 2+ years of experience of running large scale customer facing Web Applications
- Oracle Cloud Infrastructure (OCI) or AWS, Azure, GCP compute, storage, and network operational experience
- Programming and scripting languages (Python, Ansible, bash, Java Script - additional experience with PHP, Groovy, Java, and/or Go is a plus)
- Using CI/CD scripting tools such as Ansible, Puppet, or Chef
- Experience in Cloud Native application development using Containers and orchestration ( Kubernetes) independently scalable micro-services
- Issue tracking and collaboration (Jira and Confluence)
Disclaimer:
**Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.**
**Range and benefit information provided in this posting are specific to the stated locations only**
US: Hiring Range in USD from: $79,800 to $178,100 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC3
**About Us**
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_************* or by calling *************** in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
$79.8k-178.1k yearly 60d+ ago
Site Reliability Engineer 2 DevOps | REMOTE (US Citizenship required)
Oracle 4.6
Columbus, OH jobs
Are you a creative person who loves a challenge? Solve the complex puzzles you've been dreaming of as our Engineer. If you have a passion for innovation in tech, we want you on our team! Thrive in this crucial automation role. Oracle is a technology leader that's changing how the world does business. We're looking for an experienced and self-motivated person. We appreciate you taking the time to review the list of qualifications and to apply for the position.
Come and join us! Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health. This team will focus on product deployment, sustainability, troubleshooting and product strategy for Oracle Health, while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial spirit that promotes an energetic and creative environment. We are unencumbered and will need your contribution to make it a world class engineering center with the focus on excellence.
As a Site Reliability DevOps Engineer, you will be responsible for defining and deploying key services with deep focus on architecture, production operations, capacity planning, performance management, deployment, and release engineering. You will work with multiple cross-functional teams helping deliver new and outstanding experiences to our collaborators while ensuring reliability and performance.
**Responsibilities includes:**
Take ownership of the architecture, analysis, design, implementation and production operations of a wide array of Core System Framework solutions
React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring to production systems
Be a strong contributor to supporting and development of platform services including architecture, provisioning, configuration, deployment, and support
Partner with the distributed team in prototyping new platform services
Stay informed of new technologies
Innovate
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence
Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services
Develop designs, architectures, standards, and methods for large-scale distributed systems
Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning and performance.
**Responsibilities**
**Key Requirements/Experience include** :
- 3-5 years of experience as a Site Reliability or DevOps Engineer
- The ability to acquire & maintain a federal security clearance vital for this role, which requires you to be a US citizen
- Developing/operating large scale distributed services / applications
- Container administration and development applying Kubernetes, Docker, Mesos, or similar
- Infrastructure automation through Terraform, Chef, Ansible, Puppet, Packer or similar
- Experience with Cloud Orchestration frameworks, development and SRE support of these systems
- Experience with CI/CD pipelines including VCS (git, svn, etc), Gitlab Runners, Jenkins, Rundeck
- Working with or supporting production, test, and development environments for medium to large user environments
- Experience in developing scripts to automate software deployments and installations using PowerShell or Bash
- Knowledge of cloud compute technologies, network monitoring, data processing and analytics
- Experience with a modern programming language such as Java, Python, or C++ or equivalent
- Experience working with fault tolerant, highly available, high throughput, distributed, scalable systems
- Experience operating services in one of the major Clouds such as AWS, OCI, Azure, etc
Disclaimer:
**Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.**
**Range and benefit information provided in this posting are specific to the stated locations only**
US: Hiring Range in USD from: $63,000 to $126,100 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC2
**About Us**
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_************* or by calling *************** in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Are you a creative person who loves a challenge? Solve the complex puzzles you've been dreaming of as our Engineer. If you have a passion for innovation in tech, we want you on our team! Thrive in this crucial automation role. Oracle is a technology leader that's changing how the world does business. We're looking for an experienced and self-motivated person. We appreciate you taking the time to review the list of qualifications and to apply for the position.
Come and join us! Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health. This team will focus on product deployment, sustainability, troubleshooting and product strategy for Oracle Health, while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial spirit that promotes an energetic and creative environment. We are unencumbered and will need your contribution to make it a world class engineering center with the focus on excellence.
As a Site Reliability DevOps Engineer, you will be responsible for defining and deploying key services with deep focus on architecture, production operations, capacity planning, performance management, deployment, and release engineering. You will work with multiple cross-functional teams helping deliver new and outstanding experiences to our collaborators while ensuring reliability and performance.
Responsibilities includes:
Take ownership of the architecture, analysis, design, implementation and production operations of a wide array of Core System Framework solutions
React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring to production systems
Be a strong contributor to supporting and development of platform services including architecture, provisioning, configuration, deployment, and support
Partner with the distributed team in prototyping new platform services
Stay informed of new technologies
Innovate
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence
Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services
Develop designs, architectures, standards, and methods for large-scale distributed systems
Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning and performance.
**Responsibilities**
Key Requirements/Experience include:
- The ability to acquire & maintain a federal security clearance vital for this role, which requires you to be a US citizen
- Developing/operating large scale distributed services / applications
- Container administration and development applying Kubernetes, Docker, Mesos, or similar
- Infrastructure automation through Terraform, Chef, Ansible, Puppet, Packer or similar
- Experience with Cloud Orchestration frameworks, development and SRE support of these systems
- Experience with CI/CD pipelines including VCS (git, svn, etc), Gitlab Runners, Jenkins, Rundeck
- Working with or supporting production, test, and development environments for medium to large user environments
- Experience in developing scripts to automate software deployments and installations using PowerShell or Bash
- Knowledge of cloud compute technologies, network monitoring, data processing and analytics
- Experience with a modern programming language such as Java, Python, or C++ or equivalent
- Experience working with fault tolerant, highly available, high throughput, distributed, scalable systems
- Experience operating services in one of the major Clouds such as AWS, OCI, Azure, etc
Disclaimer:
**Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.**
**Range and benefit information provided in this posting are specific to the stated locations only**
US: Hiring Range in USD from: $79,100 to $158,200 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC3
**About Us**
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_************* or by calling *************** in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Are you a creative person who loves a challenge? Solve the complex puzzles you've been dreaming of as our Engineer. If you have a passion for innovation in tech, we want you on our team! Thrive in this crucial automation role. Oracle is a technology leader that's changing how the world does business. We're looking for an experienced and self-motivated person. We appreciate you taking the time to review the list of qualifications and to apply for the position.
Come and join us! Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health. This team will focus on product deployment, sustainability, troubleshooting and product strategy for Oracle Health, while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial spirit that promotes an energetic and creative environment. We are unencumbered and will need your contribution to make it a world class engineering center with the focus on excellence.
As a Site Reliability DevOps Engineer, you will be responsible for defining and deploying key services with deep focus on architecture, production operations, capacity planning, performance management, deployment, and release engineering. You will work with multiple cross-functional teams helping deliver new and outstanding experiences to our collaborators while ensuring reliability and performance.
Responsibilities includes:
Take ownership of the architecture, analysis, design, implementation and production operations of a wide array of Core System Framework solutions
React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring to production systems
Be a strong contributor to supporting and development of platform services including architecture, provisioning, configuration, deployment, and support
Partner with the distributed team in prototyping new platform services
Stay informed of new technologies
Innovate
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence
Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services
Develop designs, architectures, standards, and methods for large-scale distributed systems
Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning and performance.
**Responsibilities**
**Key Requirements/Experience include:**
- The ability to acquire & maintain a federal security clearance vital for this role, which requires you to be a US citizen
- Developing/operating large scale distributed services / applications
- Container administration and development applying Kubernetes, Docker, Mesos, or similar
- Infrastructure automation through Terraform, Chef, Ansible, Puppet, Packer or similar
- Experience with Cloud Orchestration frameworks, development and SRE support of these systems
- Experience with CI/CD pipelines including VCS (git, svn, etc), Gitlab Runners, Jenkins, Rundeck
- Working with or supporting production, test, and development environments for medium to large user environments
- Experience in developing scripts to automate software deployments and installations using PowerShell or Bash
- Knowledge of cloud compute technologies, network monitoring, data processing and analytics
- Experience with a modern programming language such as Java, Python, or C++ or equivalent
- Experience working with fault tolerant, highly available, high throughput, distributed, scalable systems
- Experience operating services in one of the major Clouds such as AWS, OCI, Azure, etc
Disclaimer:
**Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.**
**Range and benefit information provided in this posting are specific to the stated locations only**
US: Hiring Range in USD from: $74,900 to $158,200 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC3
**About Us**
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_************* or by calling *************** in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
$74.9k-158.2k yearly 60d+ ago
Principal Site Reliability Engineer
Oracle 4.6
Remote
Our Team
Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health Data, Analytics Platform. This team will focus on product development and product strategy for Oracle Health, while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial spirit that promotes an energetic and creative environment. We are unencumbered and will need your contribution to make it a world class engineering center with the focus on excellence.
Oracle Health Data, Analytics Platform has a rare opportunity to play a critical role in how Oracle Health products impact and disrupt the healthcare industry by transforming how healthcare and technology intersect.
You will have the opportunity to:
Reach billions of people with our products & services
Create technology in which truly impacts the world
Ability to have immediate impact on developing technology
Unlimited growth potential with inspiring work
Work with the best minds in the industry
Enjoy working in an open, diverse, and productive environment
About The Job
This role provides technical leadership for the core data platforms behind Oracle Health's Data & Analytics Platform. As a Principal Site Reliability Engineer (SRE), you will own shared, mission-critical systems used by multiple products and teams.
You will lead the design and operation of large-scale, stateful distributed platforms, including Hadoop ecosystem components (HDFS, YARN, HBase) deployed on Oracle Big Data Service (BDS), Kafka, and Storm. These multi-tenant platforms are deployed and operated through Ansible- and Terraform-based automation and require strong architectural ownership to manage scale, change, and broad blast radius.
What You'll Do
Platform Ownership & Technical Leadership
Own the end-to-end reliability, scalability, and operability of shared data platforms
Define platform standards, architectural direction, and operational guardrails
Influence cross-team technical decisions and long-term platform strategy
Drive long-term platform evolution and influence reliability strategy across the data ecosystem
Architecture & Design
Lead platform architecture and design reviews
Clearly articulate system behavior, dependencies, and failure modes
Make principled trade-offs between reliability, performance, cost, and complexity
Provide guidance and guardrails that enable downstream teams to use platforms safely and effectively
Operations Engineering
Establish capacity models, scaling strategies, and operational best practices
Design platforms that behave predictably under load, failure, and change
Own platform lifecycle events: upgrades, expansions, decommissioning, and recovery
Distributed Systems Expertise
Operate and evolve stateful distributed systems where data placement, replication, and recovery are critical
Reason about failure modes such as backpressure, rebalancing, region movement, replication lag, and rolling upgrades
Security
Operate and maintain Kerberized platforms, including authentication, authorization, and secure service-to-service communication
Treat security as a first-class architectural concern
Automation
Design and evolve an Ansible- and Terraform-driven automation framework
Treat automation as production software: versioned, reviewed, tested, and improved
Eliminate operational toil by encoding reliability and safety into the platform
Incident Leadership & Prevention
Serve as the ultimate escalation point for complex or ambiguous incidents
Focus on eliminating entire classes of failure, not just resolving individual issues
Representation
Represent SRE and platform engineering in high-visibility and sensitive forums
Communicate clearly with engineering leadership and partner teams
Responsibilities
The team operates within the Oracle Health Data & Analytics Platform, supporting one of Oracle Health's core products, HealtheIntent. We operate the big data and streaming infrastructure that enables downstream teams to deliver reliable customer-facing solutions at scale, while continuously improving operability and efficiency.
Required Experience
8+ years operating large-scale, customer-facing distributed platforms
Deep experience with HDFS, YARN, HBase, Kafka, Storm, or similar systems
Strong background in Linux, networking, and distributed system troubleshooting
Infrastructure-as-Code using Ansible and Terraform
Scripting and automation using Python, Ruby, and Bash
Hands-on experience operating Kerberized environments
Proven ability to define and document technical architecture for complex systems
Demonstrated ownership of shared platforms with broad blast radius and multiple downstream consumers
Experience designing observability and capacity models for distributed platforms
Required Qualifications:
U.S. Citizenship and eligibility for a Federal Security Clearance
10+ years of technical experience relevant to this position
Ability to communicate effectively and build rapport with team members
BS or MS in Computer Science, or equivalent
#LI-HR1
Disclaimer:
Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
Range and benefit information provided in this posting are specific to the stated locations only
US: Hiring Range in USD from: $86,400 to $199,500 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC4
Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.
$86.4k-199.5k yearly Auto-Apply 6d ago
Principal Site Reliability Engineer - Automation / Containers
Oracle 4.6
Remote
Our Team
Building off our Cloud momentum, Oracle has formed a new organization - Health Data Intelligence Platform. This team will focus on product development and product strategy for Oracle Health, while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial spirit that promotes an upbeat and creative environment. We are unencumbered and will need your contribution to make it a special engineering center with the focus on excellence.
Health Data Intelligence Platform has a rare opportunity to play a critical role in how Oracle Health products impact and redefine the healthcare industry byredefiningg how healthcare and technology intersect.
You will have the opportunity to:
Reach billions of people with our products & services
Create technology in which truly impacts the world
Ability to have immediate impact on developing technology
Unlimited growth potential with inspiring work
Work with the best minds in the industry
Enjoy working in an open, diverse, and productive environment
About The Job
A unique opportunity to join a rapidly growing extraordinary team to engineer groundbreaking Oracle Cloud technologies and infrastructure that make up the Oracle Cloud solutions. As part of the SRE team, you will be continually challenged and have an opportunity to chip in to the Oracle Cloud success every day, working closely with the development partners.
As a Site Reliability Engineer, you will solve interesting technical challenges by defining, designing, deploying, and solving key Oracle Cloud services, platforms, and infrastructure, always thinking about reliability, scalability, resilience, security, and performance.
The ideal candidate for this engaging and visible technical leadership role would have the experience of a developer, the wits of a systems and infrastructure whiz, and the courage of a spirited "closer". All these qualities bundled up in an affable communicator in order to make our Oracle Cloud customers successful.
Disclaimer:
Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
Range and benefit information provided in this posting are specific to the stated locations only
US: Hiring Range in USD from: $86,400 to $199,500 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC4
What You'll Do
Service Ownership -You will be part of the SRE team, whose mission is the shared full stack ownership of a collection of services and/or technology areas, with our Development partners.
Ownership Scope - As an SRE, you will understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of the production services you own. In partnership with your Development partners, you will have the responsibility to ensure that services are designed and delivered to be critical with focus on security, resiliency, scale, and performance. SREs are the ultimate authority and are accountable for the end-to-end performance and operability of the services they own.
Service Design - As the Oracle Cloud evolves; you will partner with development teams in defining and implementing improvements in service architecture, both current and future. As an SRE, you will be a guide at articulating technical characteristics of your services and the dependencies between services, and guide Development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. As an SRE, you will support federal project submission process and security compliance for new platforms and system resources.
Operations Engineering - You will understand and be able to communicate the scale, capacity, security, performance attributes and requirements of the services you own. You are a domain guide, able to understand and communicate every characteristic of your service stack, such as:
degradation and behavior under load of the services and their dependencies
end-to-end tuning needs, optimizing resource utilization, as load patterns fluctuate
Instrumentation and metrics that clearly describe the service behaviors
scaling requirements and patterns
resiliency and recoverability, ensuring that backup / restore and disaster recovery capabilities are implemented, tested and maintained
Security operations and vulnerability remediation, verifying vulnerabilities are patched or remediated while conforming to corporate and federal security standards and processes.
Automation - You will have a clear understanding of automation and orchestration principles, and will be eager to automate, wherever and whenever the possibility arises, while simultaneously eliminating technical debt. Automation must be part of your DNA.
Prevention - Once you have authoritatively resolved an issue, you will immediately work on how to more quickly resolve the problem next time, with the goal to eventually prevent the problem happening ever again
Technical Experts - As service owner, you are the ultimate partner concern point for complex or critical issues that have not yet been documented as SOPs for Level1 staff. You will usually get called in during major incidents as an SME, when the source of a problem is unclear. You will have the deep understanding of service topology and their dependencies required to solve issues and define mitigations.
Broad Interests - SREs are a rare mix of sysadmins and development Engineers, and as such have the ability to understand and explain the affect of product architecture decisions on the ability to run as distributed systems. They are driven by professional curiosity and a desire to a develop deep understanding of the their services and the technologies they depend upon.
Represent SRE - Proactive, self-motivated, customer-focused, organized, and a good communicator. SRE can be expected to represent Cloud products and engineering in critical forums.
#LI-HR1
$86.4k-199.5k yearly Auto-Apply 8d ago
Site Reliability Engineer, Safety (Austin, TX or Nashville, Relo available)
Oracle 4.6
Remote
Prototype, design, and implement security solutions for new and challenging problems
Drive and champion security tool development (e.g. scanning tools)
Consult software development teams in design and architecture of safe and secure systems through Threat Modeling and modeling exercises
Champion and consult on secure development lifecycle practices
Design and integrate verification and posture reporting mechanisms
Define security configuration and implementation best practices
Disclaimer:
Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
Range and benefit information provided in this posting are specific to the stated locations only
US: Hiring Range in USD from: $96,400 to $199,500 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC4
Qualifications:
Bachelor's or Master's degree in Computer Science or related field
5+ years of experience in security engineering or related field or equivalent experience
Experience building automated security solutions
Strong security experience, particularly with focus in one of the following areas:
Defensive Security
Offensive Security
Service architecture and Design Patterns
Strong collaboration and communication skills
Preferred Skills
Experience scaling operational activities via Python, Bash, and other tools
DevOps or SRE experience operating large, distributed, continuously deployed services
Experience operating large, distributed, continuously deployed services
Expertise in designing databases schemas in (NoSQL / SQL).
Knowledge on bridging security engineering requirements into the software development life cycle.
Security training and mentoring experience
Experience with statistical/mathematical predictive modeling
Experience with machine learning / artificial intelligence
Experience designing resilient systems that support quick recovery
Experience with container orchestration and management
History of collaborating and integrating processes with software development teams, data scientists, business and other technical roles
Experience with Java or Python development
$96.4k-199.5k yearly Auto-Apply 60d+ ago
Site Reliability Engineer DevOps | REMOTE (US Citizenship required)
Oracle 4.6
Remote
The Oracle Analytics Service Excellence (OASE) team within Oracle Analytics Cloud (OAC) is responsible for developing tools, technologies, processes, and driving change/ improvement through data driven decisions. You will work alongside software development teams to ensure service and feature parity for government customers. OASE is focused on building cloud based technologies managing, improving, maintaining, and evolving the cloud services offerings; as well as automating processes, developing code and services, operating, and reporting on internal applications that support the growth of the business, education of partners, and ensuring compliance across the breadth of Oracle Analytics.
**Responsibilities**
**Roles and responsibilities:**
- Perform DevOps activities to support customers, engineers, and processes through our release cycles as well as production
- Respond to incidents, troubleshoot issues and drive to completion, driving and participate in root cause analysis
- Perform change management activities ensuring environment operates on latest versions and configurations
- Provide full stack support to Oracle Analytics Cloud including
- Become expert in Analytic services, to prevent, resolve customer issues effectively and prevent regressions and repeats
- Document various processes & run books; update existing processes
- Execute, with excellence, delivery of interim patches and hot-fixes as required
- Work with various teams to take ownership of and resolve service failure/outages
- Monitor metrics and develop ways to improve the CI and CD tools utilized by the team
- Follow all best practices and procedures as established by the company
- Mentor and train other engineers and seek to continually improve processes
- Participate in 24 x 7 DevOps model with on call rotations for daytime, nighttime, and weekends
- Other duties as assigned
**The candidate requires these attributes:**
- A BS or MS in Computer Science, or equivalent
- Providing cloud networking, infrastructure, and service support, configuration, operations, tools, and processes
- Understand networking, and TCP/IP fundamentals and services such as DNS, HTTP, etc.
- Linux/Unix system administration including system level knowledge of Linux on OCI Gen 2, creating and executing scripts
- Experience developing & operating cloud services or large distributed applications in production
- Methodical approaches to troubleshooting and solving complex technical problems, reverse engineering existing applications
- Producing documentation in support of developed work (KBs, run books, help guides)
- Utilizing agile methodologies
- Communicating effectively in a team environment
- Working with remote and global teams
- Working independently and in a self-directed manner
- Able to work extended week day and week-end shifts as required for on-call, after hours upgrades, and other duties as assigned
**The ideal candidate for this position will have the following attributes:**
- Experience supporting multiple cloud services, troubleshooting of customer requirements and product capability, and reporting root cause for product fixes
- Knowledge of Oracle Analytics server, BI publisher, Oracle Analytics Cloud; Oracle database, Oracle Autonomous DB, MySQL (experience with MS SQL and/or NoSQL is a plus)
- 2+ years of experience of running large scale customer facing Web Applications
- Oracle Cloud Infrastructure (OCI) or AWS, Azure, GCP compute, storage, and network operational experience
- Programming and scripting languages (Python, Ansible, bash, Java Script - additional experience with PHP, Groovy, Java, and/or Go is a plus)
- Using CI/CD scripting tools such as Ansible, Puppet, or Chef
- Experience in Cloud Native application development using Containers and orchestration ( Kubernetes) independently scalable micro-services
- Issue tracking and collaboration (Jira and Confluence)
Disclaimer:
**Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.**
**Range and benefit information provided in this posting are specific to the stated locations only**
US: Hiring Range in USD from: $79,800 to $178,100 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC3
**About Us**
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_************* or by calling *************** in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
$79.8k-178.1k yearly 60d+ ago
Site Reliability Engineer DevOps | REMOTE (US Citizenship required)
Oracle 4.6
Remote
The Oracle Analytics Service Excellence (OASE) team within Oracle Analytics Cloud (OAC) is responsible for developing tools, technologies, processes, and driving change/ improvement through data driven decisions. You will work alongside software development teams to ensure service and feature parity for government customers. OASE is focused on building cloud based technologies managing, improving, maintaining, and evolving the cloud services offerings; as well as automating processes, developing code and services, operating, and reporting on internal applications that support the growth of the business, education of partners, and ensuring compliance across the breadth of Oracle Analytics.
Disclaimer:
Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
Range and benefit information provided in this posting are specific to the stated locations only
US: Hiring Range in USD from: $79,800 to $178,100 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC3
Roles and responsibilities:
- Perform DevOps activities to support customers, engineers, and processes through our release cycles as well as production
- Respond to incidents, troubleshoot issues and drive to completion, driving and participate in root cause analysis
- Perform change management activities ensuring environment operates on latest versions and configurations
- Provide full stack support to Oracle Analytics Cloud including
- Become expert in Analytic services, to prevent, resolve customer issues effectively and prevent regressions and repeats
- Document various processes & run books; update existing processes
- Execute, with excellence, delivery of interim patches and hot-fixes as required
- Work with various teams to take ownership of and resolve service failure/outages
- Monitor metrics and develop ways to improve the CI and CD tools utilized by the team
- Follow all best practices and procedures as established by the company
- Mentor and train other engineers and seek to continually improve processes
- Participate in 24 x 7 DevOps model with on call rotations for daytime, nighttime, and weekends
- Other duties as assigned
The candidate requires these attributes:
- A BS or MS in Computer Science, or equivalent
- Providing cloud networking, infrastructure, and service support, configuration, operations, tools, and processes
- Understand networking, and TCP/IP fundamentals and services such as DNS, HTTP, etc.
- Linux/Unix system administration including system level knowledge of Linux on OCI Gen 2, creating and executing scripts
- Experience developing & operating cloud services or large distributed applications in production
- Methodical approaches to troubleshooting and solving complex technical problems, reverse engineering existing applications
- Producing documentation in support of developed work (KBs, run books, help guides)
- Utilizing agile methodologies
- Communicating effectively in a team environment
- Working with remote and global teams
- Working independently and in a self-directed manner
- Able to work extended week day and week-end shifts as required for on-call, after hours upgrades, and other duties as assigned
The ideal candidate for this position will have the following attributes:
- Experience supporting multiple cloud services, troubleshooting of customer requirements and product capability, and reporting root cause for product fixes
- Knowledge of Oracle Analytics server, BI publisher, Oracle Analytics Cloud; Oracle database, Oracle Autonomous DB, MySQL (experience with MS SQL and/or NoSQL is a plus)
- 2+ years of experience of running large scale customer facing Web Applications
- Oracle Cloud Infrastructure (OCI) or AWS, Azure, GCP compute, storage, and network operational experience
- Programming and scripting languages (Python, Ansible, bash, Java Script - additional experience with PHP, Groovy, Java, and/or Go is a plus)
- Using CI/CD scripting tools such as Ansible, Puppet, or Chef
- Experience in Cloud Native application development using Containers and orchestration ( Kubernetes) independently scalable micro-services
- Issue tracking and collaboration (Jira and Confluence)
$79.8k-178.1k yearly Auto-Apply 16d ago
Site Reliability Engineer DevOps | REMOTE (US Citizenship required)
Oracle 4.6
Remote
Are you a creative person who loves a challenge? Solve the complex puzzles you've been dreaming of as our Engineer. If you have a passion for innovation in tech, we want you on our team! Thrive in this crucial automation role. Oracle is a technology leader that's changing how the world does business. We're looking for an experienced and self-motivated person. We appreciate you taking the time to review the list of qualifications and to apply for the position.
Come and join us! Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health. This team will focus on product deployment, sustainability, troubleshooting and product strategy for Oracle Health, while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial spirit that promotes an energetic and creative environment. We are unencumbered and will need your contribution to make it a world class engineering center with the focus on excellence.
As a Site Reliability DevOps Engineer, you will be responsible for defining and deploying key services with deep focus on architecture, production operations, capacity planning, performance management, deployment, and release engineering. You will work with multiple cross-functional teams helping deliver new and outstanding experiences to our collaborators while ensuring reliability and performance.
Responsibilities includes:
Take ownership of the architecture, analysis, design, implementation and production operations of a wide array of Core System Framework solutions
React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring to production systems
Be a strong contributor to supporting and development of platform services including architecture, provisioning, configuration, deployment, and support
Partner with the distributed team in prototyping new platform services
Stay informed of new technologies
Innovate
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence
Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services
Develop designs, architectures, standards, and methods for large-scale distributed systems
Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning and performance.
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_************* or by calling *************** in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
$87k-119k yearly est. Auto-Apply 60d+ ago
Site Reliability Engineer 2 DevOps | REMOTE (US Citizenship required)
Oracle 4.6
Remote
Are you a creative person who loves a challenge? Solve the complex puzzles you've been dreaming of as our Engineer. If you have a passion for innovation in tech, we want you on our team! Thrive in this crucial automation role. Oracle is a technology leader that's changing how the world does business. We're looking for an experienced and self-motivated person. We appreciate you taking the time to review the list of qualifications and to apply for the position.
Come and join us! Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health. This team will focus on product deployment, sustainability, troubleshooting and product strategy for Oracle Health, while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial spirit that promotes an energetic and creative environment. We are unencumbered and will need your contribution to make it a world class engineering center with the focus on excellence.
As a Site Reliability DevOps Engineer, you will be responsible for defining and deploying key services with deep focus on architecture, production operations, capacity planning, performance management, deployment, and release engineering. You will work with multiple cross-functional teams helping deliver new and outstanding experiences to our collaborators while ensuring reliability and performance.
Responsibilities includes:
Take ownership of the architecture, analysis, design, implementation and production operations of a wide array of Core System Framework solutions
React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring to production systems
Be a strong contributor to supporting and development of platform services including architecture, provisioning, configuration, deployment, and support
Partner with the distributed team in prototyping new platform services
Stay informed of new technologies
Innovate
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence
Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services
Develop designs, architectures, standards, and methods for large-scale distributed systems
Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning and performance.
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_************* or by calling *************** in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
$87k-119k yearly est. Auto-Apply 50d ago
Principal Site Reliability Engineer - Automation / Containers
Oracle 4.6
Columbus, OH jobs
**Our Team** Building off our Cloud momentum, Oracle has formed a new organization - Health Data Intelligence Platform. This team will focus on product development and product strategy for Oracle Health, while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial spirit that promotes an upbeat and creative environment. We are unencumbered and will need your contribution to make it a special engineering center with the focus on excellence.
Health Data Intelligence Platform has a rare opportunity to play a critical role in how Oracle Health products impact and redefine the healthcare industry byredefiningg how healthcare and technology intersect.
You will have the opportunity to:
+ Reach billions of people with our products & services
+ Create technology in which truly impacts the world
+ Ability to have immediate impact on developing technology
+ Unlimited growth potential with inspiring work
+ Work with the best minds in the industry
+ Enjoy working in an open, diverse, and productive environment
**About The Job**
A unique opportunity to join a rapidly growing extraordinary team to engineer groundbreaking Oracle Cloud technologies and infrastructure that make up the Oracle Cloud solutions. As part of the SRE team, you will be continually challenged and have an opportunity to chip in to the Oracle Cloud success every day, working closely with the development partners.
As a Site Reliability Engineer, you will solve interesting technical challenges by defining, designing, deploying, and solving key Oracle Cloud services, platforms, and infrastructure, always thinking about reliability, scalability, resilience, security, and performance.
The ideal candidate for this engaging and visible technical leadership role would have the experience of a developer, the wits of a systems and infrastructure whiz, and the courage of a spirited "closer". All these qualities bundled up in an affable communicator in order to make our Oracle Cloud customers successful.
**Responsibilities**
**What You'll Do**
+ **Service Ownership** -You will be part of the SRE team, whose mission is the shared full stack ownership of a collection of services and/or technology areas, with our Development partners.
+ **Ownership Scope** - As an SRE, you will understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of the production services you own. In partnership with your Development partners, you will have the responsibility to ensure that services are designed and delivered to be critical with focus on security, resiliency, scale, and performance. SREs are the ultimate authority and are accountable for the end-to-end performance and operability of the services they own.
+ **Service Design** - As the Oracle Cloud evolves; you will partner with development teams in defining and implementing improvements in service architecture, both current and future. As an SRE, you will be a guide at articulating technical characteristics of your services and the dependencies between services, and guide Development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. As an SRE, you will support federal project submission process and security compliance for new platforms and system resources.
+ **Operations Engineering** - You will understand and be able to communicate the scale, capacity, security, performance attributes and requirements of the services you own. You are a domain guide, able to understand and communicate every characteristic of your service stack, such as:
+ degradation and behavior under load of the services and their dependencies
+ end-to-end tuning needs, optimizing resource utilization, as load patterns fluctuate
+ Instrumentation and metrics that clearly describe the service behaviors
+ scaling requirements and patterns
+ resiliency and recoverability, ensuring that backup / restore and disaster recovery capabilities are implemented, tested and maintained
+ Security operations and vulnerability remediation, verifying vulnerabilities are patched or remediated while conforming to corporate and federal security standards and processes.
+ **Automation** - You will have a clear understanding of automation and orchestration principles, and will be eager to automate, wherever and whenever the possibility arises, while simultaneously eliminating technical debt. Automation must be part of your DNA.
+ **Prevention** - Once you have authoritatively resolved an issue, you will immediately work on how to more quickly resolve the problem next time, with the goal to eventually prevent the problem happening ever again
+ **Technical Experts** - As service owner, you are the ultimate partner concern point for complex or critical issues that have not yet been documented as SOPs for Level1 staff. You will usually get called in during major incidents as an SME, when the source of a problem is unclear. You will have the deep understanding of service topology and their dependencies required to solve issues and define mitigations.
+ **Broad Interests** - SREs are a rare mix of sysadmins and development Engineers, and as such have the ability to understand and explain the affect of product architecture decisions on the ability to run as distributed systems. They are driven by professional curiosity and a desire to a develop deep understanding of the their services and the technologies they depend upon.
+ **Represent SRE** - Proactive, self-motivated, customer-focused, organized, and a good communicator. SRE can be expected to represent Cloud products and engineering in critical forums.
\#LI-HR1
Disclaimer:
**Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.**
**Range and benefit information provided in this posting are specific to the stated locations only**
US: Hiring Range in USD from: $86,400 to $199,500 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC4
**About Us**
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_************* or by calling *************** in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
$86.4k-199.5k yearly 8d ago
Site Reliability Engineer, Safety (Austin, TX or Nashville, Relo available)
Oracle 4.6
Columbus, OH jobs
+ Prototype, design, and implement security solutions for new and challenging problems + Drive and champion security tool development (e.g. scanning tools) + Consult software development teams in design and architecture of safe and secure systems through Threat Modeling and modeling exercises
+ Champion and consult on secure development lifecycle practices
+ Design and integrate verification and posture reporting mechanisms
+ Define security configuration and implementation best practices
**Responsibilities**
Qualifications:
+ Bachelor's or Master's degree in Computer Science or related field
+ 5+ years of experience in security engineering or related field or equivalent experience
+ Experience building automated security solutions
+ Strong security experience, particularly with focus in one of the following areas:
+ Defensive Security
+ Offensive Security
+ Service architecture and Design Patterns
+ Strong collaboration and communication skills
Preferred Skills
+ Experience scaling operational activities via Python, Bash, and other tools
+ DevOps or SRE experience operating large, distributed, continuously deployed services
+ Experience operating large, distributed, continuously deployed services
+ Expertise in designing databases schemas in (NoSQL / SQL).
+ Knowledge on bridging security engineering requirements into the software development life cycle.
+ Security training and mentoring experience
+ Experience with statistical/mathematical predictive modeling
+ Experience with machine learning / artificial intelligence
+ Experience designing resilient systems that support quick recovery
+ Experience with container orchestration and management
+ History of collaborating and integrating processes with software development teams, data scientists, business and other technical roles
+ Experience with Java or Python development
Disclaimer:
**Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.**
**Range and benefit information provided in this posting are specific to the stated locations only**
US: Hiring Range in USD from: $96,400 to $199,500 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC4
**About Us**
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_************* or by calling *************** in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
$96.4k-199.5k yearly 60d+ ago
Principal Site Reliability Engineer
Oracle 4.6
Columbus, OH jobs
**Our Team** Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health Data, Analytics Platform. This team will focus on product development and product strategy for Oracle Health, while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial spirit that promotes an energetic and creative environment. We are unencumbered and will need your contribution to make it a world class engineering center with the focus on excellence.
Oracle Health Data, Analytics Platform has a rare opportunity to play a critical role in how Oracle Health products impact and disrupt the healthcare industry by transforming how healthcare and technology intersect.
You will have the opportunity to:
+ Reach billions of people with our products & services
+ Create technology in which truly impacts the world
+ Ability to have immediate impact on developing technology
+ Unlimited growth potential with inspiring work
+ Work with the best minds in the industry
+ Enjoy working in an open, diverse, and productive environment
**About The Job**
This role provides technical leadership for the core data platforms behind Oracle Health's Data & Analytics Platform. As a Principal Site Reliability Engineer (SRE), you will own shared, mission-critical systems used by multiple products and teams.
You will lead the design and operation of large-scale, stateful distributed platforms, including Hadoop ecosystem components (HDFS, YARN, HBase) deployed on Oracle Big Data Service (BDS), Kafka, and Storm. These multi-tenant platforms are deployed and operated through Ansible- and Terraform-based automation and require strong architectural ownership to manage scale, change, and broad blast radius.
**What You'll Do**
**Platform Ownership & Technical Leadership**
+ Own the end-to-end reliability, scalability, and operability of shared data platforms
+ Define platform standards, architectural direction, and operational guardrails
+ Influence cross-team technical decisions and long-term platform strategy
+ Drive long-term platform evolution and influence reliability strategy across the data ecosystem
**Architecture & Design**
+ Lead platform architecture and design reviews
+ Clearly articulate system behavior, dependencies, and failure modes
+ Make principled trade-offs between reliability, performance, cost, and complexity
+ Provide guidance and guardrails that enable downstream teams to use platforms safely and effectively
**Operations Engineering**
+ Establish capacity models, scaling strategies, and operational best practices
+ Design platforms that behave predictably under load, failure, and change
+ Own platform lifecycle events: upgrades, expansions, decommissioning, and recovery
**Distributed Systems Expertise**
+ Operate and evolve stateful distributed systems where data placement, replication, and recovery are critical
+ Reason about failure modes such as backpressure, rebalancing, region movement, replication lag, and rolling upgrades
**Security**
+ Operate and maintain Kerberized platforms, including authentication, authorization, and secure service-to-service communication
+ Treat security as a first-class architectural concern
**Automation**
+ Design and evolve an Ansible- and Terraform-driven automation framework
+ Treat automation as production software: versioned, reviewed, tested, and improved
+ Eliminate operational toil by encoding reliability and safety into the platform
**Incident Leadership & Prevention**
+ Serve as the ultimate escalation point for complex or ambiguous incidents
+ Focus on eliminating entire classes of failure, not just resolving individual issues
**Representation**
+ Represent SRE and platform engineering in high-visibility and sensitive forums
+ Communicate clearly with engineering leadership and partner teams
**Responsibilities**
The team operates within the Oracle Health Data & Analytics Platform, supporting one of Oracle Health's core products, HealtheIntent. We operate the big data and streaming infrastructure that enables downstream teams to deliver reliable customer-facing solutions at scale, while continuously improving operability and efficiency.
**Required Experience**
+ 8+ years operating large-scale, customer-facing distributed platforms
+ Deep experience with HDFS, YARN, HBase, Kafka, Storm, or similar systems
+ Strong background in Linux, networking, and distributed system troubleshooting
+ Infrastructure-as-Code using Ansible and Terraform
+ Scripting and automation using Python, Ruby, and Bash
+ Hands-on experience operating Kerberized environments
+ Proven ability to define and document technical architecture for complex systems
+ Demonstrated ownership of shared platforms with broad blast radius and multiple downstream consumers
+ Experience designing observability and capacity models for distributed platforms
**Required Qualifications:**
+ U.S. Citizenship and eligibility for a Federal Security Clearance
+ 10+ years of technical experience relevant to this position
+ Ability to communicate effectively and build rapport with team members
+ BS or MS in Computer Science, or equivalent
\#LI-HR1
**Responsibilities**
Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.
Disclaimer:
**Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.**
**Range and benefit information provided in this posting are specific to the stated locations only**
US: Hiring Range in USD from: $86,400 to $199,500 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC4
**About Us**
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_************* or by calling *************** in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
$86.4k-199.5k yearly 6d ago
Senior Site Reliability Engineer - Cloud Automation (Oracle Health Cloud, Remote US)
Oracle 4.6
Remote
Senior Site Reliability Engineer - Cloud Automation (Oracle Health | Remote US)
Make real-world impact at scale. Join Oracle Health to build a modern, automated healthcare platform that millions rely on. You'll design, automate, and operate secure, highly available cloud services-driving reliability, speed, and efficiency across our platform.
What you'll do
Own service reliability end-to-end: architecture, production operations, and on-call excellence
Build automation and self-healing systems using IaC (e.g., Terraform) and CI/CD
Design, implement, and evolve observability (metrics, tracing, logging) and SLO/error budgets
Lead capacity planning, performance tuning, and cost/sustainability initiatives
Develop tooling and services to improve scalability, availability, and developer productivity
Partner with cross-functional teams to deliver features safely (canary/blue‑green, progressive delivery)
Drive incident response, root-cause analysis, and prevention through automation
Prototype and standardize platform services and best practices across teams
Disclaimer:
Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.
Range and benefit information provided in this posting are specific to the stated locations only
US: Hiring Range in USD from: $86,400 to $199,500 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC4
What you'll bring
US citizenship and the ability to obtain/maintain a federal security clearance
Experience operating large-scale, distributed, fault-tolerant systems in production
Strong scripting/programming (Python, Bash; Java/C++ a plus)
Infrastructure as Code and automation (Terraform; Ansible/Chef/Puppet/Packer a plus)
CI/CD pipelines and tooling (Git, GitLab/Jenkins/Rundeck)
Cloud experience (OCI, AWS, Azure or similar)
Deep knowledge of monitoring, alerting, incident management, and postmortems
Solid grasp of networking, security fundamentals, and performance engineering
Nice to have
Experience in regulated or high-compliance environments
Data/analytics and platform sustainability optimization
Containers and orchestration (Kubernetes, Docker)
Why Oracle Health
Net-new business with startup energy and enterprise backing
High ownership, high impact: shape platform reliability and automation from the ground up
Mission-driven work improving healthcare through secure, scalable technology
Remote role within the US
Eligibility: Remote (US). US citizenship required; ability to obtain and maintain a federal security clearance.
#LI-ND1
Are you a creative person who loves a challenge? Solve the complex puzzles you've been dreaming of as our Engineer. If you have a passion for innovation in tech, we want you on our team! Thrive in this crucial automation role. Oracle is a technology leader that's changing how the world does business. We're looking for an experienced and self-motivated person. We appreciate you taking the time to review the list of qualifications and to apply for the position.
Come and join us! Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health. This team will focus on product deployment, sustainability, troubleshooting and product strategy for Oracle Health, while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial spirit that promotes an energetic and creative environment. We are unencumbered and will need your contribution to make it a world class engineering center with the focus on excellence.
As a Site Reliability DevOps Engineer, you will be responsible for defining and deploying key services with deep focus on architecture, production operations, capacity planning, performance management, deployment, and release engineering. You will work with multiple cross-functional teams helping deliver new and outstanding experiences to our collaborators while ensuring reliability and performance.
Responsibilities includes:
Take ownership of the architecture, analysis, design, implementation and production operations of a wide array of Core System Framework solutions
React to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring to production systems
Be a strong contributor to supporting and development of platform services including architecture, provisioning, configuration, deployment, and support
Partner with the distributed team in prototyping new platform services
Stay informed of new technologies
Innovate
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence
Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services
Develop designs, architectures, standards, and methods for large-scale distributed systems
Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning and performance.
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_************* or by calling *************** in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
$98k-127k yearly est. Auto-Apply 60d+ ago
Senior Site Reliability Engineer - Automation / Containers
Oracle 4.6
Columbus, OH jobs
**Our Team** Building off our Cloud momentum, Oracle has formed a new organization - Health Data Intelligence Platform. This team will focus on product development and product strategy for Oracle Health, while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial spirit that promotes an upbeat and creative environment. We are unencumbered and will need your contribution to make it a special engineering center with the focus on excellence.
Health Data Intelligence Platform has a rare opportunity to play a critical role in how Oracle Health products impact and redefine the healthcare industry byredefiningg how healthcare and technology intersect.
You will have the opportunity to:
+ Reach billions of people with our products & services
+ Create technology in which truly impacts the world
+ Ability to have immediate impact on developing technology
+ Unlimited growth potential with inspiring work
+ Work with the best minds in the industry
+ Enjoy working in an open, diverse, and productive environment
**About The Job**
A unique opportunity to join a rapidly growing extraordinary team to engineer groundbreaking Oracle Cloud technologies and infrastructure that make up the Oracle Cloud solutions. As part of the SRE team, you will be continually challenged and have an opportunity to chip in to the Oracle Cloud success every day, working closely with the development partners.
As a Site Reliability Engineer, you will solve interesting technical challenges by defining, designing, deploying, and solving key Oracle Cloud services, platforms, and infrastructure, always thinking about reliability, scalability, resilience, security, and performance.
The ideal candidate for this engaging and visible technical leadership role would have the experience of a developer, the wits of a systems and infrastructure whiz, and the courage of a spirited "closer". All these qualities bundled up in an affable communicator in order to make our Oracle Cloud customers successful.
**Responsibilities**
**What You'll Do**
+ **Service Ownership** -You will be part of the SRE team, whose mission is the shared full stack ownership of a collection of services and/or technology areas, with our Development partners.
+ **Ownership Scope** - As an SRE, you will understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of the production services you own. In partnership with your Development partners, you will have the responsibility to ensure that services are designed and delivered to be critical with focus on security, resiliency, scale, and performance. SREs are the ultimate authority and are accountable for the end-to-end performance and operability of the services they own.
+ **Service Design** - As the Oracle Cloud evolves; you will partner with development teams in defining and implementing improvements in service architecture, both current and future. As an SRE, you will be a guide at articulating technical characteristics of your services and the dependencies between services, and guide Development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. As an SRE, you will support federal project submission process and security compliance for new platforms and system resources.
+ **Operations Engineering** - You will understand and be able to communicate the scale, capacity, security, performance attributes and requirements of the services you own. You are a domain guide, able to understand and communicate every characteristic of your service stack, such as:
+ degradation and behavior under load of the services and their dependencies
+ end-to-end tuning needs, optimizing resource utilization, as load patterns fluctuate
+ Instrumentation and metrics that clearly describe the service behaviors
+ scaling requirements and patterns
+ resiliency and recoverability, ensuring that backup / restore and disaster recovery capabilities are implemented, tested and maintained
+ Security operations and vulnerability remediation, verifying vulnerabilities are patched or remediated while conforming to corporate and federal security standards and processes.
+ **Automation** - You will have a clear understanding of automation and orchestration principles, and will be eager to automate, wherever and whenever the possibility arises, while simultaneously eliminating technical debt. Automation must be part of your DNA.
+ **Prevention** - Once you have authoritatively resolved an issue, you will immediately work on how to more quickly resolve the problem next time, with the goal to eventually prevent the problem happening ever again
+ **Technical Experts** - As service owner, you are the ultimate partner concern point for complex or critical issues that have not yet been documented as SOPs for Level1 staff. You will usually get called in during major incidents as an SME, when the source of a problem is unclear. You will have the deep understanding of service topology and their dependencies required to solve issues and define mitigations.
+ **Broad Interests** - SREs are a rare mix of sysadmins and development Engineers, and as such have the ability to understand and explain the affect of product architecture decisions on the ability to run as distributed systems. They are driven by professional curiosity and a desire to a develop deep understanding of the their services and the technologies they depend upon.
+ **Represent SRE** - Proactive, self-motivated, customer-focused, organized, and a good communicator. SRE can be expected to represent Cloud products and engineering in critical forums.
\#LI-HR1
Disclaimer:
**Certain US customer or client-facing roles may be required to comply with applicable requirements, such as immunization and occupational health mandates.**
**Range and benefit information provided in this posting are specific to the stated locations only**
US: Hiring Range in USD from: $79,100 to $158,200 per annum. May be eligible for bonus and equity.
Oracle maintains broad salary ranges for its roles in order to account for variations in knowledge, skills, experience, market conditions and locations, as well as reflect Oracle's differing products, industries and lines of business.
Candidates are typically placed into the range based on the preceding factors as well as internal peer equity.
Oracle US offers a comprehensive benefits package which includes the following:
1. Medical, dental, and vision insurance, including expert medical opinion
2. Short term disability and long term disability
3. Life insurance and AD&D
4. Supplemental life insurance (Employee/Spouse/Child)
5. Health care and dependent care Flexible Spending Accounts
6. Pre-tax commuter and parking benefits
7. 401(k) Savings and Investment Plan with company match
8. Paid time off: Flexible Vacation is provided to all eligible employees assigned to a salaried (non-overtime eligible) position. Accrued Vacation is provided to all other employees eligible for vacation benefits. For employees working at least 35 hours per week, the vacation accrual rate is 13 days annually for the first three years of employment and 18 days annually for subsequent years of employment. Vacation accrual is prorated for employees working between 20 and 34 hours per week. Employees working fewer than 20 hours per week are not eligible for vacation.
9. 11 paid holidays
10. Paid sick leave: 72 hours of paid sick leave upon date of hire. Refreshes each calendar year. Unused balance will carry over each year up to a maximum cap of 112 hours.
11. Paid parental leave
12. Adoption assistance
13. Employee Stock Purchase Plan
14. Financial planning and group legal
15. Voluntary benefits including auto, homeowner and pet insurance
The role will generally accept applications for at least three calendar days from the posting date or as long as the job remains posted.
Career Level - IC3
**About Us**
As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_************* or by calling *************** in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
$79.1k-158.2k yearly 8d ago
FOS Reliability Engineer
L3Harris 4.4
Cincinnati, OH jobs
Conduct reliability analysis and safety assessment of electronic and electro-mechanical devices containing analog, digital and RF circuitry. Analyses include Stress Analysis, Reliability Prediction, Failure Modes Effects and Criticiality Analysis (FMECA), Testability Analysis, Fault Tree Analysis and System Hazard Analysis. Write detailed reliability analysis and safety assessment reports.
Essential Functions:
Generate reliability and safety plans and procedures.
Interpret and allocate reliability and safety requirements.
Perform reliability, maintainability, availability, safety, FMEA, FMECA and Fault Tree Analysis.
Support FRACAS activities: Monitor and assist with failure recording. Participate in failure analysis to determine root cause and corrective action.
Actively engage with system design engineers in the development of new designs/configurations.
Provide recommendations to assure compliance with reliability and safety requirements.
Participate in system design reviews, presentations, and meetings with customers.
Liaise with clients and other technical disciplines within the company on reliability and safety issues.
Ability to handle multiple projects simultaneously during development and production stages.
Qualifications and Experience
Bachelor's degree in electrical engineering and minimum 4 years of prior relevant experience. Graduate degree and a minimum of 2 years of prior related experience. In lieu of a degree, minimum of 8 years of prior related experience.
Two years experience performing electrical stress/derating analysis, FMECA (MIL-STD-1629A), reliability predictions and analysis (MIL-STD-785/MIL-HDBK-217/ MIL-HDBK-1547A), Fault Tree Analysis (IEC 61025/ NUREG-0492), and system hazard analysis (MIL-STD-882) on electronic and electro-mechanical devices, Aerospace, defense, or government program experience, reading and analyzing electrical schematics.
Preferred Additional Skills:
Eligible for a security clearance.
Experience with Microsoft Office tools.
Experience with prediction software tools such as PTC Windchill Quality Solutions, Relyence, or similar tools.
Experience with Environmental Stress Screening (MIL-HDBK-344A).
$71k-93k yearly est. 60d ago
FOS Reliability Engineer
L3Harris 4.4
Cincinnati, OH jobs
L3Harris is dedicated to recruiting and developing high-performing talent who are passionate about what they do. Our employees are unified in a shared dedication to our customers' mission and quest for professional growth. L3Harris provides an inclusive, engaging environment designed to empower employees and promote work-life success. Fundamental to our culture is an unwavering focus on values, dedication to our communities, and commitment to excellence in everything we do.
L3Harris Technologies is the Trusted Disruptor in the defense industry. With customers' mission-critical needs always in mind, our employees deliver end-to-end technology solutions connecting the space, air, land, sea and cyber domains in the interest of national security.
Job Title: Reliability Engineer
Job Code: 31515
Job Location: L3Harris Fuzing and Ordnance Systems - Cincinnati, OH
Schedule: M-Th 10hr/day
Job Description:
Conduct reliability analysis and safety assessment of electronic and electro-mechanical devices containing analog, digital and RF circuitry. Analyses include Stress Analysis, Reliability Prediction, Failure Modes Effects and Criticiality Analysis (FMECA), Testability Analysis, Fault Tree Analysis and System Hazard Analysis. Write detailed reliability analysis and safety assessment reports.
Essential Functions:
+ Generate reliability and safety plans and procedures.
+ Interpret and allocate reliability and safety requirements.
+ Perform reliability, maintainability, availability, safety, FMEA, FMECA and Fault Tree Analysis.
+ Support FRACAS activities: Monitor and assist with failure recording. Participate in failure analysis to determine root cause and corrective action.
+ Actively engage with system design engineers in the development of new designs/configurations.
+ Provide recommendations to assure compliance with reliability and safety requirements.
+ Participate in system design reviews, presentations, and meetings with customers.
+ Liaise with clients and other technical disciplines within the company on reliability and safety issues.
+ Ability to handle multiple projects simultaneously during development and production stages.
Qualifications and Experience
+ Bachelor's degree in electrical engineering and minimum 4 years of prior relevant experience. Graduate degree and a minimum of 2 years of prior related experience. In lieu of a degree, minimum of 8 years of prior related experience.
+ Two years experience performing electrical stress/derating analysis, FMECA (MIL-STD-1629A), reliability predictions and analysis (MIL-STD-785/MIL-HDBK-217/ MIL-HDBK-1547A), Fault Tree Analysis (IEC 61025/ NUREG-0492), and system hazard analysis (MIL-STD-882) on electronic and electro-mechanical devices, Aerospace, defense, or government program experience, reading and analyzing electrical schematics.
Preferred Additional Skills:
+ Eligible for a security clearance.
+ Experience with Microsoft Office tools.
+ Experience with prediction software tools such as PTC Windchill Quality Solutions, Relyence, or similar tools.
+ Experience with Environmental Stress Screening (MIL-HDBK-344A).
L3Harris Technologies is proud to be an Equal Opportunity Employer. L3Harris is committed to treating all employees and applicants for employment with respect and dignity and maintaining a workplace that is free from unlawful discrimination. All applicants will be considered for employment without regard to race, color, religion, age, national origin, ancestry, ethnicity, gender (including pregnancy, childbirth, breastfeeding or other related medical conditions), gender identity, gender expression, sexual orientation, marital status, veteran status, disability, genetic information, citizenship status, characteristic or membership in any other group protected by federal, state or local laws. L3Harris maintains a drug-free workplace and performs pre-employment substance abuse testing and background checks, where permitted by law.
Please be aware many of our positions require the ability to obtain a security clearance. Security clearances may only be granted to U.S. citizens. In addition, applicants who accept a conditional offer of employment may be subject to government security investigation(s) and must meet eligibility requirements for access to classified information.
By submitting your resume for this position, you understand and agree that L3Harris Technologies may share your resume, as well as any other related personal information or documentation you provide, with its subsidiaries and affiliated companies for the purpose of considering you for other available positions.
L3Harris Technologies is an E-Verify Employer. Please click here for the E-Verify Poster in English (******************************************************************************************** or Spanish (******************************************************************************************** . For information regarding your Right To Work, please click here for English (****************************************************************************************** or Spanish (******************************************************************************************** .