Reliability Engineer jobs at Regions Bank - 525 jobs

Site Reliability Engineer
The Voleon Group 4.1
Berkeley, CA jobs
Voleon is a technology company that applies state‑of‑the‑art AI and machine learning techniques to real‑world problems in finance. For nearly two decades, we have led our industry and worked at the frontier of applying AI/ML to investment management. We have become a multibillion‑dollar asset manager, and we have ambitious goals for the future. Your colleagues will include internationally recognized experts in artificial intelligence and machine learning research as well as highly experienced finance and technology professionals. The people who shape our company come from other backgrounds, including concert music performances, humanitarian aid, opera singing, sports writing, and BMX racing. You will be part of a team that loves to succeed together. In addition to our enriching and collegial working environment, we offer highly competitive compensation and benefits packages, technology talks by our experts, a beautiful modern office, daily catered lunches, and more. As a Site Reliability Engineer (SRE), you will work at the intersection of production operations and software development as you improve, manage, and monitor production‑critical infrastructure and data pipelines. At Voleon, many SREs serve together on a Production Operations team tasked with improving shared production infrastructure. Others are embedded with teams of software engineers to improve specific production systems owned by those teams. Voleon SREs work on important real‑world problems and collaborate with passionate and talented colleagues in an empowering, results‑driven environment. This role is a way to make a real difference: your contributions will make our critical systems more reliable, lower operational risk, and increase the efficiency of our engineering effort. 
Responsibilities Improve fault‑tolerance and maintainability of code in proprietary data pipelines and trading systems Diagnose and fix bugs in code Lead complex deployments Automate manual workflows Track and prioritize outstanding production‑related issues Share an on‑call rotation responding to incidents to ensure the continuous operation of production‑critical systems Requirements Experience with coding and debugging Python Experience with Linux Familiarity with Relational Databases & SQL Sharp analytical and problem‑solving skills and a persistent drive to make things work (better) Strong growth mindset and a passion for learning Strong technical communication skills Attention to detail 2 years of relevant industry experience An undergraduate degree or comparable training in a quantitative field or equivalent, relevant industry experience Preferred Qualifications Familiarity with best practices concerning code maintainability, documentation, quality assurance, continuous integration and deployment Experience supporting production systems Experience with any of the following: gRPC microservices, Postgres, Pandas, Golang, R, Git, Jenkins, Bazel, Prometheus, Grafana, Airflow, Kubernetes The base salary for this position is $120,000 to $160,000 in the location(s) of this posting. Individual salaries are determined through a variety of factors, including, but not limited to, education, experience, knowledge, skills, and geography. Base salary does not include other forms of total compensation such as bonus compensation and other benefits. Our benefits package includes medical, dental and vision coverage, life and AD&D insurance, 20 days of paid time off, 9 sick days, and a 401(k) plan with a company match. 
Friends of Voleon Candidate Referral Program If you have a great candidate in mind for this role and would like to have the potential to earn $7,500 - $15,000 if your referred candidate is successfully hired and employed by The Voleon Group, please use this form to submit your referral. For more details regarding eligibility, terms and conditions, please make sure to review the Voleon Referral Bonus Program. Equal Opportunity Employer The Voleon Group is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.
$120k-160k yearly 5d ago


Senior AI SRE: Scale GenAI Reliability & Impact
Charles Schwab Corporation 4.8
San Francisco, CA jobs
A leading financial services firm is seeking a Senior AI Site Reliability Engineer responsible for designing and managing the reliability of AI-driven applications. In this role, you'll work on innovative projects and mentor junior engineers while collaborating with cross-functional teams. Candidates should have extensive experience in software development and reliability engineering, with a particular focus on AI systems. This on-site position is located in San Francisco and offers opportunities for professional growth and development.
$118k-152k yearly est. 1d ago
Staff Site Reliability Engineer
Visa 4.5
Ashburn, VA jobs
Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive while driven by a common purpose - to uplift everyone, everywhere by being the best way to pay and be paid. Make an impact with a purpose-driven industry leader. Join us today and experience Life at Visa. Job Description What a Staff Site Reliability Engineer Does at Visa As a Staff Site Reliability Engineer (SRE), you will be part of a cross-functional Operations & Infrastructure group responsible for the reliability, availability, performance, and optimization of Visa Spend Clarity for Enterprises (VSCE). You will support teams in running robust applications, lead incident resolution efforts, and drive operational excellence through automation, observability, and platform modernization. This role is critical to Visa's transformation as we scale our product to a broader range of issuers through cloud infrastructure and automation. You will work closely with engineering, operations, and product teams to ensure our systems are resilient, secure, and continuously improving. Why This Role Matters You will be part of a critical global function within the VSCE product at a time when we are modernizing our platform through cloud infrastructure and automation. This transformation enables us to scale our product to a broader range of issuers and is a key focus area within Visa Commercial Solutions with ambitious growth goals. Our Culture At Visa, your individuality fits right in. Working here gives you an opportunity to impact the world, invest in your career growth, and be part of an inclusive and diverse workplace. 
We are a global team of disruptors, trailblazers, innovators, and risk-takers who are helping drive economic growth in even the most remote parts of the world. We're creatively moving the industry forward and doing meaningful work that brings financial literacy and digital commerce to millions of unbanked and underserved consumers. You're an individual. We're the team for you. Together, let's transform the way the world pays. Essential Functions Operate and improve distributed systems and SaaS applications in production environments. Lead and coordinate incident response efforts, ensuring timely resolution and root cause analysis. Collaborate with engineering teams to enhance system reliability, uptime, and performance. Automate operational tasks using scripting and orchestration tools (e.g., PowerShell). Support and configure middleware, load balancers, and Web Application Firewalls. Drive strategic initiatives such as cloud migration and platform modernization. Apply AWS cloud expertise to solve infrastructure problems and scalability challenges. Monitor and manage enterprise systems using observability and alerting tools. Participate in a 24/7/365 On Call rotation, including shift and weekend support as needed. Contribute to internal platform development with a product-led mindset. Ensure secure and compliant software delivery in regulated environments. Support geographically dispersed systems across multiple time zones. Provide support and documentation for task handoffs and transitions. This is a hybrid position. Expectation of days in office will be confirmed by your hiring manager. Qualifications Basic Qualifications * 5 or more years of relevant work experience with a Bachelors Degree or at least 2 years of work experience with an Advanced degree (e.g. 
Masters, MBA, JD, MD) or 0 years of work experience with a PhD Preferred Qualifications * 6 or more years of work experience with a Bachelors Degree or 4 or more years of relevant experience with an Advanced Degree (e.g. Masters, MBA, JD, MD) or up to 3 years of relevant experience with a PhD * Experience with transactional systems (e.g., banking, finance, telecommunications). * Proficiency in Microsoft stack (Windows Server, IIS, MS SQL Server). * Familiarity with middleware technologies (e.g., MQ, Active Directory, Session State). * Advanced experience with AWS cloud services, including designing and troubleshooting scalable, resilient infrastructure. * Knowledge of certificate management and secure system design (basic to intermediate level). * Strong troubleshooting, performance tuning, and capacity planning skills. * Exposure to PCI and other audit/control frameworks. * Experience with enterprise monitoring and orchestration tools. * Ability to work across time zones and with geographically dispersed teams. * Excellent communication, collaboration, and stakeholder management skills. * Self-motivated, adaptable, and committed to continuous learning and growth. * Experience leading initiatives and influencing across teams. * Customer-oriented mindset for both internal and external clients. * Committed to continuous learning and growth, with the ability to adapt quickly to evolving challenges and technologies. Additional Information Work Hours: Varies upon the needs of the department. Travel Requirements: This position requires travel 5-10% of the time. Mental/Physical Requirements: This position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, frequently operate standard office equipment, such as telephones and computers. Visa is an EEO Employer. 
Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law. Visa will consider for employment qualified applicants with criminal histories in a manner consistent with applicable local law, including the requirements of Article 49 of the San Francisco Police Code. U.S. APPLICANTS ONLY: The estimated salary range for this position is $131,600 to $210,300 USD per year, which may include potential sales incentive payments (if applicable). Salary may vary depending on job-related factors which may include knowledge, skills, experience, and location. In addition, this position may be eligible for bonus and equity. Visa has a comprehensive benefits package for which this position may be eligible that includes Medical, Dental, Vision, 401(k), FSA/HSA, Life Insurance, Paid Time Off, and Wellness Program.
$131.6k-210.3k yearly 5d ago
Staff Site Reliability Engineer, ServiceNow
Visa 4.5
Highlands Ranch, CO jobs
Visa is a world leader in payments technology, facilitating transactions between consumers, merchants, financial institutions and government entities across more than 200 countries and territories, dedicated to uplifting everyone, everywhere by being the best way to pay and be paid. At Visa, you'll have the opportunity to create impact at scale - tackling meaningful challenges, growing your skills and seeing your contributions impact lives around the world. Join Visa and do work that matters - to you, to your community, and to the world. Progress starts with you. Job Description The CMDB Site Reliability Engineer will be responsible for developing on the ServiceNow platform CMDB, ITOM Discovery, and other CMDB-related components as part of the Scrum-based Agile framework to advance and maintain Visa's CMDB functionality. This will include, but is not limited to, story implementation, data management (CI and Platform), and operational support (Incidents and Requests). Essential Functions: Be part of a global team that has operational support responsibilities where you will be required to work with our internal customers to resolve issues using the ServiceNow Incident management process and ticketing system. Expand and enhance your knowledge of the ServiceNow Platform and its advanced features and capabilities including developing table / attribute level security controls, implementing Business Rules, Flow Designer, Workflows, Client Scripts, UI Policy, and UI Actions as part of development activities for Catalogs, Scoped Apps or other platform activities. Design, develop, and maintain ETL solutions using Microsoft SSIS to support data warehousing and business intelligence initiatives. Monitor, troubleshoot, and optimize existing SSIS packages and ETL jobs for performance and reliability. Ingest, transform, and load data from various sources including relational databases, flat files (CSV, TXT), REST APIs, and ODBC connections. 
Write and optimize complex SQL queries, stored procedures, triggers, and scripts for data extraction and transformation. Develop and maintain comprehensive documentation for data flows, processes, and systems. Ensure data quality, integrity, and security throughout the ETL process. Participate in code reviews and contribute to best practices for SSIS and ETL development. Collaborate with stakeholders, analysts, and business users to understand requirements and deliver robust data pipelines. Apply data governance, security, and compliance standards to all processes. Work with ServiceNow ITOM Discovery and its Cloud Discovery capabilities in a multi-public Cloud environment. Work with end users and educate them on how to be more effective and efficient working on the platform either through informal meetings, brown bags or scheduled calls. Develop technical requirements, documentation and technical diagrams. Perform data profiling and root cause analysis for data issues. Implement data validation and reconciliation processes. Work with stakeholders to review stories and requirements to develop new capabilities as part of our monthly Sprints and Releases. This is a hybrid position. Expectation of days in office will be confirmed by your hiring manager. Visa will accept applications for this role until at least January 31, 2026. Qualifications Basic Qualifications: 5+ years of relevant work experience with a Bachelor's Degree or at least 2 years of work experience with an Advanced degree (e.g. Masters, MBA, JD, MD) or 0 years of work experience with a PhD, OR 8+ years of relevant work experience. Preferred Qualifications: Candidate must have direct work experience troubleshooting issues on the ServiceNow platform with Catalog requests, data imports, or other business-related logic. Effective writing skills with the ability to document application configurations, system design and architecture, simple run books. 
Good communication skills to work effectively with team members, support personnel, management and customers in geographically dispersed locations and ability to work as part of a team as well as independently with minimum guidance. Ideal candidate would have completed their ServiceNow Certified System Administrator. Familiarity with source control and deployment processes for SSIS projects. Experience with data profiling, data quality assessment, and cleansing techniques. 8+ years of hands-on ServiceNow development and CMDB experience, ETL solutions using Microsoft SSIS. Strong proficiency in SQL (T-SQL) for data manipulation, querying, and optimization. Experience in ingesting data from ODBC data sources and processing data from flat files (CSV, TXT, etc.). Experience integrating data from REST APIs (JSON, XML) using SSIS or related tools. Proficient in data modeling (relational and dimensional) and data warehousing best practices. Knowledge of data governance, privacy, and security practices. Familiarity with job scheduling, automation tools, and workflow orchestration. Experience with metadata management and data lineage tracking. Additional Information Work Hours: Varies upon the needs of the department. Travel Requirements: This position requires travel 5-10% of the time. Mental/Physical Requirements: This position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, frequently operate standard office equipment, such as telephones and computers. Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law. 
Visa will consider for employment qualified applicants with criminal histories in a manner consistent with applicable local law, including the requirements of Article 49 of the San Francisco Police Code. U.S. APPLICANTS ONLY: The estimated salary range for this position is $124,300 to $198,600 USD per year, which may include potential sales incentive payments (if applicable). Salary may vary depending on job-related factors which may include knowledge, skills, experience, and location. In addition, this position may be eligible for bonus and equity. Visa has a comprehensive benefits package for which this position may be eligible that includes Medical, Dental, Vision, 401(k), FSA/HSA, Life Insurance, Paid Time Off, and Wellness Program.
$124.3k-198.6k yearly 5d ago
Sr. Site Reliability Engineer
Visa 4.5
Austin, TX jobs
Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive while driven by a common purpose - to uplift everyone, everywhere by being the best way to pay and be paid. Make an impact with a purpose-driven industry leader. Join us today and experience Life at Visa. Job Description Visa Technology & Operations LLC, a Visa Inc. company, needs a Sr. Site Reliability Engineer (multiple openings) in Austin, TX to: Provide technical support to Tier 0 Applications, ensuring they meet all service level agreements and team objectives. Participate in root cause analysis (RCA) for issues encountered for associated services. Assist in disaster recovery planning without impacting any related services. Implement extensive application monitoring objectives. Implement self-healing services to minimize or eliminate downtime. Implement application and system changes according to best practice. Work with the development team to resolve issues, enhance applications, and provide advice. Resolve incidents and problems in accordance with defined guidelines and meet operational level agreements. Ability to work after hours, including weekends, nights, and early mornings, on rotational shifts. Position reports to the Austin, Texas office and may allow for partial telecommuting. This position requires travel 5-10% of the time. Qualifications Basic Qualifications: Bachelor's degree in Computer Science, Engineering, Business Analytics or related field, followed by 2 years of experience in the job offered or in a related systems engineer or data engineer occupation. 
Alternatively, a Master's degree in Computer Science, Engineering, Business Analytics, or related field. Position requires experience in the following: Linux Operating Systems. Virtual Machines. Containers. Databases. MQ or KAFKA. Middleware JVMs. Storage. Supporting Web Services (API) or Web UI or Batch applications based on Linux, Java, BASH, Python, Perl, Oracle, DB2, Hazelcast or Hadoop. Firewalls, Load Balancer, DNS, HTTP, TCP/IP, PKI, SSL, TLS, Digital Certificates, Encryption, Security Scanning or equivalent. Additional Information Worksite: Austin, TX This is a hybrid position. Hybrid employees can alternate time between both remote and office. Employees in hybrid roles are expected to work from the office 2-3 set days a week (determined by leadership/site), with a general guidepost of being in the office 50% or more of the time based on business needs. Travel Requirements: This position requires travel 5-10% of the time. Mental/Physical Requirements: This position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, frequently operate standard office equipment, such as telephones and computers. Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law. U.S. APPLICANTS ONLY: The estimated salary range for a new hire into this position is $111,238.00 USD to $171,800.00 USD per year, which may include potential sales incentive payments (if applicable). Salary may vary depending on job-related factors which may include knowledge, skills, experience, and location. In addition, this position may be eligible for bonus and equity. 
Visa has a comprehensive benefits package for which this position may be eligible that includes Medical, Dental, Vision, 401(k), FSA/HSA, Life Insurance, Paid Time Off, and Wellness Program.
$111.2k-171.8k yearly 5d ago
Sr. Site Reliability Engineer - Talent Day
Visa 4.5
Austin, TX jobs
Visa is a world leader in payments and technology, with over 259 billion payments transactions flowing safely between consumers, merchants, financial institutions, and government entities in more than 200 countries and territories each year. Our mission is to connect the world through the most innovative, convenient, reliable, and secure payments network, enabling individuals, businesses, and economies to thrive while driven by a common purpose - to uplift everyone, everywhere by being the best way to pay and be paid. Make an impact with a purpose-driven industry leader. Join us today and experience Life at Visa. Job Description Essential Functions We are seeking a Site Reliability Engineer to work in the Product Reliability Engineering function within Operations & Infrastructure. This individual will: Perform day-to-day site reliability engineering functions including maintenance and incident resolution for Visa's applications, products, and services. Perform ongoing/proactive analysis of applications to detect potential problems and actively engage & facilitate the discussion to find the best possible solution. Work under direct supervision to ensure on-time delivery of projects, and production support plans for upgrades, enhancements, and deployments. Work closely with service partners such as product development, engineering teams to seamlessly implement the innovative solutions to improve the reliability, scalability, and efficiency. Assist in automating the routine tasks and processes to improve overall efficiency and reduce human errors. Actively participate in troubleshooting activities and SWAT calls and drive investigation towards swift resolution. Build comprehensive and robust documentation repositories that can facilitate knowledge transfer among PRE and Global Operations peers. Assist the team with implementing GenAI and machine learning trends to continuously optimize the application reliability and efficiency. 
Participate in on-call roster to support business including off-hours. Self-motivated, and have excellent interpersonal and communication skills. This is a hybrid position. Expectation of days in office will be confirmed by your hiring manager. Qualifications Basic Qualifications 2+ years of relevant work experience and a Bachelor's degree, OR 5+ years of relevant work experience. Preferred Qualifications 3 or more years of work experience with a Bachelor's Degree or more than 2 years of work experience with an Advanced Degree (e.g. Masters, MBA, JD, MD) Working knowledge of one or more programming languages such as Python, Java, .NET, C#, PowerShell, Bash scripting. Understanding of Linux/Unix systems Understanding of networking concepts, protocols, and architecture. Proven track record of automating complex tasks and processes to improve efficiency and reliability Basic understanding of AI frameworks and libraries. Additional knowledge in one of the following: Cloud technologies such as AWS, Azure, etc. Database management systems such as MSSQL, MongoDB, etc. Middleware technologies such as Tomcat, Apache, etc. Containerization technologies such as Docker, Kubernetes, etc. Infrastructure-as-code tools such as Terraform, Ansible, etc. Monitoring tools such as Splunk or other Additional Information Work Hours: Varies upon the needs of the department. Travel Requirements: This position requires travel 5-10% of the time. Mental/Physical Requirements: This position will be performed in an office setting. The position will require the incumbent to sit and stand at a desk, communicate in person and by telephone, frequently operate standard office equipment, such as telephones and computers. Visa is an EEO Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability or protected veteran status. 
Visa will also consider for employment qualified applicants with criminal histories in a manner consistent with EEOC guidelines and applicable local law. U.S. APPLICANTS ONLY: The estimated salary range for this position is $110,700 to $171,800 USD per year, which may include potential sales incentive payments (if applicable). Salary may vary depending on job-related factors which may include knowledge, skills, experience, and location. In addition, this position may be eligible for bonus and equity. Visa has a comprehensive benefits package for which this position may be eligible that includes Medical, Dental, Vision, 401(k), FSA/HSA, Life Insurance, Paid Time Off, and Wellness Program.
$110.7k-171.8k yearly 5d ago
Process Improvement Specialist
DZ Corporation 4.3
The Villages, FL jobs
Reports To: Operations Manager The Process Improvement Specialist is responsible for optimizing production processes within the precast concrete facility. This role focuses on identifying inefficiencies, implementing process enhancements, and supporting quality and safety improvements across manufacturing operations. Working closely with production teams, engineers, and supervisors, the specialist helps streamline workflows, reduce waste, and ensure consistent product quality. Key Responsibilities: Process Analysis & Optimization: Observe and analyze daily production activities (casting, curing, reinforcement, finishing, etc.) to identify bottlenecks and improvement opportunities. Data Collection & Reporting: Gather and track production data such as cycle times, material usage, downtime, and defect rates to support improvement projects. Continuous Improvement Projects: Assist in implementing Lean, 5S, or Six Sigma initiatives to improve plant efficiency, reduce waste, and enhance workplace organization. Standard Work & Documentation: Help develop and update standard operating procedures (SOPs), work instructions, and visual management tools. Quality & Safety Support: Collaborate with Quality Control and Safety teams to ensure process changes meet safety standards and product specifications. Technical Support: Support the introduction of new molds, equipment, or materials by conducting process trials and documenting results. Collaboration: Partner with maintenance, engineering, and production supervisors to troubleshoot recurring process issues. Qualifications: Education: Associate's degree or technical diploma in Manufacturing Technology, Industrial Engineering, or related field. Equivalent experience in precast concrete production or process improvement will be considered. Experience: 2+ years in a manufacturing or precast concrete environment. Familiarity with Lean Manufacturing, 6S, or Continuous Improvement principles. 
Skills: Strong mechanical aptitude and understanding of production equipment. Ability to collect and interpret process data (cycle times, scrap, yield, etc.). Proficiency in Microsoft Office and basic data entry tools. Good communication and problem-solving skills. Team-oriented and hands-on approach. Preferred Qualifications: Experience with precast or concrete manufacturing processes (casting, curing, form setup, reinforcement, finishing). Knowledge of quality systems such as NPCA or PCI standards. Basic CAD or technical drawing reading ability. Certification in Lean or Six Sigma or willingness to acquire. Performance Indicators: Reduction in process waste or rework rates. Increased production throughput and efficiency. Improved safety compliance and incident reduction. Consistency in meeting product quality standards. Implementation and sustainability of improvement projects.
$68k-100k yearly est. 5d ago
Senior ML Engineer: Production Pipelines & HPC Expert
Capital One 4.7
McLean, VA jobs
A leading financial services company in Virginia seeks an experienced professional to design and build data-intensive solutions. The role requires expertise in C, C++, Python, Scala, and machine learning, along with the ability to lead teams and communicate complex concepts effectively. Candidates should possess a Bachelor's and preferably a Master's degree, with a proven track record in production-ready data pipelines and ML lifecycle. Competitive compensation and comprehensive benefits are offered.
$90k-111k yearly est. 2d ago
Site Reliability Engineer
The Voleon Group 4.1
Remote
Voleon is a technology company that applies state-of-the-art AI and machine learning techniques to real-world problems in finance. For nearly two decades, we have led our industry and worked at the frontier of applying AI/ML to investment management. We have become a multibillion-dollar asset manager, and we have ambitious goals for the future. Your colleagues will include internationally recognized experts in artificial intelligence and machine learning research as well as highly experienced finance and technology professionals. The people who shape our company come from other backgrounds, including concert music performances, humanitarian aid, opera singing, sports writing, and BMX racing. You will be part of a team that loves to succeed together. In addition to our enriching and collegial working environment, we offer highly competitive compensation and benefits packages, technology talks by our experts, a beautiful modern office, daily catered lunches, and more. As a Site Reliability Engineer (SRE), you will work at the intersection of production operations and software development as you improve, manage, and monitor production-critical infrastructure and data pipelines. At Voleon, many SREs serve together on a Production Operations team tasked with improving shared production infrastructure. Others are embedded with teams of software engineers to improve specific production systems owned by those teams. Voleon SREs work on important real-world problems and collaborate with passionate and talented colleagues in an empowering, results-driven environment. 
This role is a way to make a real difference: your contributions will make our critical systems more reliable, lower operational risk, and increase the efficiency of our engineering effort.
Responsibilities
* Improve fault-tolerance and maintainability of code in proprietary data pipelines and trading systems
* Diagnose and fix bugs in code
* Lead complex deployments
* Automate manual workflows
* Track and prioritize outstanding production-related issues
* Share an on-call rotation responding to incidents to ensure the continuous operation of production-critical systems
Requirements
* Experience with coding and debugging Python
* Experience with Linux
* Familiarity with Relational Databases & SQL
* Sharp analytical and problem-solving skills and a persistent drive to make things work (better)
* Strong growth mindset and a passion for learning
* Strong technical communication skills
* Attention to detail
* 2 years of relevant industry experience
* An undergraduate degree or comparable training in a quantitative field or equivalent, relevant industry experience
Preferred Qualifications
* Familiarity with best practices concerning code maintainability, documentation, quality assurance, continuous integration and deployment
* Experience supporting production systems
* Experience with any of the following: gRPC microservices, Postgres, Pandas, Golang, R, Git, Jenkins, Bazel, Prometheus, Grafana, Airflow, Kubernetes
The base salary for this position is $120,000 to $160,000 in the location(s) of this posting. Individual salaries are determined through a variety of factors, including, but not limited to, education, experience, knowledge, skills, and geography. Base salary does not include other forms of total compensation such as bonus compensation and other benefits. Our benefits package includes medical, dental and vision coverage, life and AD&D insurance, 20 days of paid time off, 9 sick days, and a 401(k) plan with a company match.
“Friends of Voleon” Candidate Referral Program
If you have a great candidate in mind for this role and would like to have the potential to earn $7,500 - $15,000 if your referred candidate is successfully hired and employed by The Voleon Group, please use this form to submit your referral. For more details regarding eligibility, terms and conditions please make sure to review the Voleon Referral Bonus Program.
Equal Opportunity Employer
The Voleon Group is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.
$120k-160k yearly Auto-Apply 41d ago
Site Reliability Engineer 2
Drivewealth 4.0
Remote
DriveWealth is a global B2B financial technology organization dedicated to democratizing access to financial independence around the world. Our mission is realized through an API-based platform, empowering our partners to offer seamless investing and trading experiences to clients worldwide, all from their mobile devices. Our technology provides partners with a modern, extensible toolkit, enabling traditional investment workflows and innovative techniques like fractional share ownership. DriveWealth has evolved into a global platform offering trading of US equities, mutual funds, ETFs, fixed income, and options. We seek enthusiastic professionals to contribute diverse perspectives and experiences to our Brokerage-as-a-Service platform. Our culture blends the pace and opportunity of a tech start-up with the impact, stability, and significance of Wall Street. We encourage creativity and experimentation while ensuring institutional-grade execution and regulatory compliance in everything we do. We value diversity and inclusion, celebrating the unique differences of our employees as we scale and grow together. We're guided by operating principles grounded in accountability, teamwork, integrity, and solutions built to scale. Join us! About The Role As a Site Reliability Engineer 2, you will enhance the reliability and performance of our Brokerage-as-a-Service platform during critical 24/7 operations. This role demands a proactive approach to managing technical challenges and system optimizations that align with our global operational strategies. What You'll Do Support the SRE team in developing and implementing enhancements to support workflows, focusing on automation and efficiency improvements. Handle technical escalations, troubleshoot complex issues, and actively participate in on-call rotations to ensure rapid response and resolution during non-traditional hours. Adhere to and administer incident and change management policies.
Coordinate incident resolution efforts and implement change management protocols to maintain and enhance system reliability, especially during critical system operations at night. Work closely with the New York office to ensure smooth operation and alignment of SRE practices across time zones. What You'll Need 3+ years in an SRE role or a similar position, demonstrating deep knowledge and expertise in site reliability engineering and operations. Working knowledge of REST APIs and understanding of API integration. Python proficiency in scripting for automation and system management, with a track record of developing and implementing automation solutions. SQL and Database expertise in transactional databases, including querying and troubleshooting. Analytical and troubleshooting skills with a demonstrated ability to perform troubleshooting and root cause analysis of technical issues. Availability for flexible work hours and willingness to cover US markets trading sessions, including L2 on-call coverage. Knowledge of Change Management Process and Risk Management.
Nice to Have, But Not Required Experience in the brokerage or financial industry Proficient with cloud services, particularly AWS, and knowledgeable about cloud architecture best practices, including IAM, EC2, S3, and DynamoDB Experience maintaining and supporting containerized systems, with familiarity in orchestration tools Knowledge of Infrastructure as Code (IaC) practices and tools such as Terraform or CloudFormation Ability to manage and troubleshoot job scheduling tools like Rundeck or Apache Airflow Advanced skills in managing containerized environments using Kubernetes and OpenShift Practical experience with Confluent Cloud for event streaming architectures Experience with Java applications and a basic understanding of using the browser developer console for front-end debugging Additional Notes: This role is critical for our continuous operations and requires a commitment to nighttime hours, aligning with the global nature of our financial services. Candidates must be prepared for intense collaboration periods and proactive communication across global teams. Applicants must be authorized to work for any employer in the U.S. DriveWealth is unable to sponsor or take over sponsorship of an employment Visa at this time. Compensation Compensation package offerings are based on candidate experience and technical qualifications, as it relates to the role. These are identified and determined throughout your interviewing experience. Please note: at this time, we are not able to hire in all states.
Remote (Most US States) Pay Range $130,000-$150,000 USD Benefits Competitive medical, dental, and vision insurance options Mental health resources Generous paid time off with observed holidays (varies per country) Paid parental leave for biological and adoptive parents Up to $2,500 or local equivalent each year to invest in continued education and personal development Up to $900 each year or local equivalent for fitness and wellness reimbursement Company-provided phone (varies by country) For HQ in-office employees, a daily lunch stipend, unlimited snacks, and engaging office space in the Financial District Pre-tax commuter benefits (US only) Employer 401K match (US only) Benefit offerings vary based on country and are subject to change. Equal Employment Opportunity To build technology and products that are used and loved by people and solve real-world problems, we need to build a team with many different perspectives and experiences. We are an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We encourage candidates from all backgrounds to apply. Applicants in need of special assistance or accommodation during the interview process or in accessing our website may contact us at **************************. Agency Disclaimer DriveWealth does not accept agency resumes. Please do not forward resumes to our jobs alias, employees, or any other organization location. DriveWealth is not responsible for any fees related to unsolicited resumes.
$130k-150k yearly Auto-Apply 32d ago
Staff Site Reliability Engineer
Figure 4.5
San Jose, CA jobs
Figure is an AI robotics company developing autonomous general-purpose humanoid robots. The goal of the company is to ship humanoid robots with human-level intelligence. Its robots are engineered to perform a variety of tasks in the home and commercial markets. Figure is headquartered in San Jose, CA. We are looking for a Site Reliability Engineer to own our internal systems infrastructure. This role is responsible for setting up and managing cloud and on-prem infrastructure to deliver highly available, reliable, and automated systems. Responsibilities: Be the go-to person for mission-critical infrastructure enabling critical operations such as Source Configuration Management, CI/CD systems, software distribution, supplier portals, manufacturing and more. Migrate SaaS to self-hosted solutions to enhance security and reliability. Implement monitoring and alerting systems, and define incident response plans and runbooks. Reduce human workload by automating deployment and scaling. Establish strong relationships with stakeholders to identify infrastructure needs and establish Service Level Objectives. Use a data-driven approach to demonstrate service robustness and track optimization work. Partner with the security team to ensure that security remediations and updates are applied in a timely manner. Requirements: Strong experience with Linux/Unix systems administration Proficiency in programming/scripting Extensive experience with cloud platforms (Azure, AWS, GCP) and on-prem hardware architectures Experience designing, deploying, and operating high-availability, fault-tolerant, and distributed systems.
Mastery of infrastructure as code (Terraform, CloudFormation, Ansible…) Familiarity with monitoring, logging, and alerting tools (Prometheus, Grafana, Datadog…) Solid understanding of networking fundamentals (TCP/IP, DNS, HTTP, load balancers, firewalls) Experience defining Service Level Objectives (SLO), developing runbooks/incident response plans, facilitating post-mortems and managing systems assets. Ability to work in cross-functional teams with developers, infra, and product teams Excellent verbal and written communication skills The US base salary range for this full-time position is between $175,000 - $250,000 annually. The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.
$175k-250k yearly Auto-Apply 35d ago
Staff Site Reliability Engineer
CME Group 4.4
Chicago, IL jobs
We're looking for a Staff Site Reliability Engineer to join our team, focusing on the core systems that power global financial markets. This isn't just about keeping the lights on; it's about pioneering the future of financial technology. As a member of our Clearing department, you'll be on the front lines, ensuring the integrity and performance of mission-critical systems that facilitate billions of dollars in daily transactions. If you're a builder at heart, driven by a passion for creating ultra-reliable and resilient systems, you'll thrive here. This is a hybrid role. You must be in our office 2+ days a week. What You'll Get * A supportive environment fostering career progression, continuous learning, and an inclusive culture. * Broad exposure to CME's diverse products, asset classes, and cross-functional teams. * A competitive salary and comprehensive benefits package. Learn more about our career opportunities here. What You'll Do As a Staff Site Reliability Engineer, you'll be a visionary builder of our resilient infrastructure. You'll move beyond conventional operations to apply software engineering principles to every facet of our clearing systems. * Pioneer solutions to guarantee the reliability, performance, and availability of our CME clearing and risk systems, where every millisecond and every transaction counts. * Architect and implement cutting-edge solutions for application resiliency and fault tolerance. * Drive automation and continuous improvement across the entire system lifecycle, eliminating manual toil and enhancing operational excellence. * Integrate SRE principles directly into the software development lifecycle, embedding reliability from day one. * Collaborate with cross-functional development and platform teams, providing expert-level guidance to deploy and maintain critical applications. * Innovate and lead efforts to prevent incidents, enhance operational processes, and automate solutions at a global scale.
* Spearhead the adoption of observability and performance testing, guiding teams to a "build with SRE mindset" culture. * Own the end-to-end operational integrity of products, understanding and contributing to the bigger picture of the organization. What You'll Bring * A strong academic background: Bachelor's degree in Engineering, Computer Science, Information Technology, or a related field is strongly preferred. * Cloud expertise: Hands-on experience deploying and operating applications using IaaS and PaaS on major cloud providers, preferably Google Cloud Services. * Coding fluency: Proficiency in one or more of the following languages: Java, Python, Bash, or Go. Typescript and/or Rust are a significant plus. * Infrastructure as Code (IaC) mastery: Experience with tools such as GKE, Terraform, CloudFormation, and Chef. * Proven reliability engineering skills: Deep knowledge of SRE and security best practices, with a track record of implementing them into workflows. A solid understanding of performance testing tools is essential, along with the ability to help teams resolve complex performance issues. * Automation prowess: Demonstrated experience with automation, CI/CD, orchestration, and configuration management. * Observability knowledge: Familiarity with logging and observability platforms such as OpenTelemetry and Prometheus. * A security-first mindset: Strong understanding of security and compliance frameworks. * Problem-solving abilities: Excellent written and verbal communication skills, with the ability to convey complex technical concepts clearly to both technical and non-technical audiences. * Strong collaboration skills: An agile team player who is self-motivated and can work with minimal supervision while juggling multiple concurrent projects. * A passion for innovation: A continuous desire to learn and stay up-to-date with the latest technologies and industry trends. 
#LI-JK1 #LI-Hybrid CME Group is committed to offering a competitive total rewards package for our employees that recognizes their contributions to the business and reflects our long-term investment in their future. The pay range for this role is $128,500-$214,100. Actual salary offered will be dependent on a wide array of factors including but not limited to: relevant experience, skills, education and comparison to internal employees (where relevant). Our compensation program also includes an annual target bonus opportunity for all employees, as well as the opportunity to become an owner in the company through our broad-based equity program. Through our benefits program, we strive to offer flexibility, value and choice. From comprehensive health coverage, to a retirement package that includes both a 401(k) and an active pension plan, to highly competitive education reimbursement provisions, paid time off and a mental health benefit, CME Group offers a holistic benefits package for our team and their dependents. CME Group: Where Futures are Made CME Group is the world's leading derivatives marketplace. But who we are goes deeper than that. Here, you can impact markets worldwide. Transform industries. And build a career by shaping tomorrow. We invest in your success and you own it - all while working alongside a team of leading experts who inspire you in ways big and small. Problem solvers, difference makers, trailblazers. Those are our people. And we're looking for more. At CME Group, we embrace our employees' unique experiences and skills to ensure that everyone's perspectives are acknowledged and valued. As an equal-opportunity employer, we consider all potential employees without regard to any protected characteristic. Important Notice: Recruitment fraud is on the rise, with scammers using misleading promises of job offers and interviews to solicit money and personal information from job seekers. 
CME Group adheres to established procedures designed to maintain trust, confidence and security throughout our recruitment process. Learn more here.
$128.5k-214.1k yearly 60d+ ago
Site Reliability Engineer
Tata Consulting Services 4.3
Miami, FL jobs
Must-Have
* Strong development experience in .NET and Java frameworks.
* Proven leadership managing SRE and DevOps teams.
* Incident and problem management using ServiceNow.
* Expertise in Observability: AppDynamics, PagerDuty, Grafana, Splunk.
* Deep understanding of CI/CD with Azure ADO, GitHub, Maven, Gradle.
* Automated regression and performance testing experience with Selenium, JMeter.
* Experience building self-healing systems.
* Strong skills in root cause analysis (RCA) and problem identification.
* Ability to define and enforce SLAs and response metrics.
* Document and maintain version-controlled knowledge repositories.
* Exposure to self-healing systems in SRE or DevOps context.
Good-to-Have
* Certifications in AWS/GCP/Azure
Salary Range: $100,000-$120,000 a year
TCS Employee Benefits Summary: Discretionary Annual Incentive. Comprehensive Medical Coverage: Medical & Health, Dental & Vision, Disability Planning & Insurance, Pet Insurance Plans. Family Support: Maternal & Parental Leaves. Insurance Options: Auto & Home Insurance, Identity Theft Protection. Convenience & Professional Growth: Commuter Benefits & Certification & Training Reimbursement. Time Off: Vacation, Time Off, Sick Leave & Holidays. Legal & Financial Assistance: Legal Assistance, 401K Plan, Performance Bonus, College Fund, Student Loan Refinancing.
Experience working in a Travel/Tourism industry
$100k-120k yearly 10d ago
Site Reliability Engineer - Capital Markets
Jefferies Financial Group Inc. 4.8
New York, NY jobs
Jefferies is seeking a Site Reliability Engineer to play an instrumental role in supporting Equity front-office trading applications and the risk and middle-office real-time products developed and used for Equity Cash and ETS applications. As part of the wider platform engineering team, you will be working closely with the Business users interactively throughout the day, along with technical, analysis and testing colleagues. Investigation and resolution of the work items at hand will require competent technical skills and a keen intellect. The business is a growth area, with current investments taking place in all the technology, business and middle office areas. Responsibilities: * Front-line Site Reliability Engineering and Support functions for Equity trading systems used by Jefferies clients as well as internal users. * Build monitoring tools for application and infrastructure components. * Implement and manage scalable infrastructure using cloud-native technologies and tools. * Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding. * Partner with business, development and infrastructure teams to improve services through rigorous testing and release procedures. * Develop and maintain CI/CD pipelines to streamline deployment processes. * Expedient deployment of new systems. Capacity planning, Platform Management, and support for increasing volumes and business growth. * Create sustainable systems and services through automation. * Collaborate with Application team to establish and enforce production and development standards. * Document procedures, best practices and troubleshooting FAQs. * Resolve complex application and technical problems. * Debug the system and fix production-related issues. * Escalate / follow up on permanent fixes for development-related issues. * Lead incident response efforts and post-mortem analysis to prevent future occurrences.
* Handle complex operational tasks and recommend process and technology changes. * Provide global support, including weekend availability, to troubleshoot production-related issues and perform checkouts. * Ability to work both independently and in groups in an energetic, diverse environment. * Participate in on-call rotations to ensure 24/7 system availability and support. * Support compliance and legal queries. Qualifications: * Strong experience in Windows and Linux/Unix services. * Strong experience in scripting languages such as PowerShell, Python and SQL. * Strong Knowledge of monitoring tools - Nagios, Splunk, OTEL, Datadog * Strong Knowledge of FIX protocol * Strong Domain skills - Must have working experience in Capital Markets across modules and instruments especially - CASH, ETS, Bonds, Options, Futures, Swaps products * Experience in BFSI (Banking and Financial Industry) Domain applications with a proper understanding of the Trade Lifecycle. * Excellent communication, time management and project management skills. Primary Location Full Time Salary Range of $175,000 - $200,000
$175k-200k yearly Auto-Apply 41d ago
Network Reliability Engineer III
CME Group 4.4
Chicago, IL jobs
As we embark on a journey to transform the Network Services Group in CME, we are seeking a Network Reliability Engineer III to join our dynamic team. In this role, you will design, develop and maintain self-service tools and applications that enhance productivity and reduce operational costs. You will work across the full stack (both front-end and back-end) to architect microservices (GKE) in Google Cloud Platform (GCP), driving our infrastructure towards greater automation and reliability. We are a global team across US, UK, India and Singapore made up of a diverse range of people from varied backgrounds who each bring unique network experiences and skill sets. The relatively new Network Reliability/Automation team are responsible for building a suite of custom automation tools and developing our self-healing capabilities while working closely with other members of the Network Services team in project delivery to ensure one of the largest Exchange network infrastructures in the world is highly available, resilient, secure and reliable. Responsibilities * Design, develop and maintain self-service and automation tools to streamline IT operations and reduce manual effort. * Engage in full-stack development, delivering responsive front-end interfaces as well as robust scalable back-end services. * With support, architect, deploy and scale microservices on GCP, with particular emphasis on containers and Google Kubernetes Engine (GKE). * Manage cloud infrastructure via Infrastructure-as-Code (IaC), primarily using Terraform to provision and maintain resources. * Operate and troubleshoot solutions on Linux-based platforms, leveraging Visual Studio Code (VSCode) as the primary development environment. * Adhere to software engineering best practices, including PEP8 coding standards, SOLID design principles, and established SDLC processes. * Implement and manage CI/CD pipelines with a DevOps mindset, ensuring rapid, reliable delivery of code.
* Develop and consume Flask-based RESTful APIs to support network and security automation. * Collaborate within an Agile Scrum framework, utilizing tools such as Bitbucket and Jira to track progress and manage sprints. * Apply strong analytical and problem-solving skills to balance multiple project variables and deliver high-quality solutions on schedule. What we are looking for * Approximately 2-3 years' hands-on Python programming experience, with a demonstrable track record of automation or tooling projects. * Knowledge and experience working with both Python Django and Flask in a corporate environment. * Any experience in network and security automation, coupled with understanding of network fundamentals (routing, switching, firewalls, VPNs) would be beneficial. * Experience developing REST APIs using Flask (or a comparable Python framework). * Applicants with front-end experience using JavaScript/jQuery/HTML5/CSS would be ideal. * Familiarity with Infrastructure-as-Code using Terraform (or similar) to manage cloud resources. * Comfortable working in Linux environments and proficient in using Visual Studio Code (VSCode). * Strong software engineering mindset: adherence to PEP8, SOLID principles, and best practices for SDLC, CI/CD and DevOps. * Excellent communication skills, both verbal and written, with the ability to convey technical concepts to diverse stakeholders. * Highly analytical, with the ability to troubleshoot complex issues and manage multiple tasks concurrently. * Experience working in Agile Scrum teams, utilizing Bitbucket and Jira (or equivalent tools) for version control and project tracking. Personal Attributes * Proactive and positive attitude, taking initiative to identify and resolve issues ahead of time. * Collaborative team player, eager to contribute knowledge and assist colleagues. * Innovative thinker who brings fresh ideas and constructive suggestions for continuous improvement.
Education Bachelor's Degree in Computer Science, Engineering or a related field is preferred. Equivalent practical experience will also be considered. #LI - Hybrid #LI - JK1 CME Group is committed to offering a competitive total rewards package for our employees that recognizes their contributions to the business and reflects our long-term investment in their future. The pay range for this role is $100,700-$167,800. Actual salary offered will be dependent on a wide array of factors including but not limited to: relevant experience, skills, education and comparison to internal employees (where relevant). Our compensation program also includes an annual target bonus opportunity for all employees, as well as the opportunity to become an owner in the company through our broad-based equity program. Through our benefits program, we strive to offer flexibility, value and choice. From comprehensive health coverage, to a retirement package that includes both a 401(k) and an active pension plan, to highly competitive education reimbursement provisions, paid time off and a mental health benefit, CME Group offers a holistic benefits package for our team and their dependents. CME Group: Where Futures are Made CME Group is the world's leading derivatives marketplace. But who we are goes deeper than that. Here, you can impact markets worldwide. Transform industries. And build a career by shaping tomorrow. We invest in your success and you own it - all while working alongside a team of leading experts who inspire you in ways big and small. Problem solvers, difference makers, trailblazers. Those are our people. And we're looking for more. At CME Group, we embrace our employees' unique experiences and skills to ensure that everyone's perspectives are acknowledged and valued. As an equal-opportunity employer, we consider all potential employees without regard to any protected characteristic. 
Important Notice: Recruitment fraud is on the rise, with scammers using misleading promises of job offers and interviews to solicit money and personal information from job seekers. CME Group adheres to established procedures designed to maintain trust, confidence and security throughout our recruitment process. Learn more here.
$100.7k-167.8k yearly 60d+ ago
Reliability Engineer*
3M 4.6
Clarkston, GA jobs
Job Title Reliability Engineer Collaborate with Innovative 3Mers Around the World Choosing where to start and grow your career has a major impact on your professional and personal life, so it's equally important you know that the company that you choose to work at, and its leaders, will support and guide you. With a wide variety of people, global locations, technologies and products, 3M is a place where you can collaborate with other curious, creative 3Mers. This position provides an opportunity to transition from other private, public, government or military experience to a 3M career. The Impact You'll Make in this Role As a(n) HANDS ON Reliability Engineer, you will have the opportunity to tap into your curiosity and collaborate with some of the most innovative people around the world. Here, you will make an impact by: Perform failure mode and effect analysis to assure the proper Preventive & Predictive Maintenance programs are implemented, audited and improved on all existing and future assets. Application of Reliability Based Maintenance programs such as Reliability Centered Maintenance (RCM) and Total Productive Maintenance (TPM). Assess & develop capability of mechanics on their role in reliability improvement and to advance their technical capabilities. Analyze data (failure, cost, uptime, etc.) and apply appropriate reliability analysis tools to develop & implement improvement plans. Perform & document equipment criticality analysis in support of an effective critical spares strategy. Submit recommendations and justification for capital expenditures that support and improve the Reliability Program. Provide an external awareness of methods and technologies that advance our own internal body of knowledge for the improvement of our operations reliability. 
Your Skills and Expertise To set you up for success in this role from day one, 3M requires (at a minimum) the following qualifications: Technical degree or higher (completed and verified prior to start) and Two (2) years of manufacturing experience in a private, public, government or military environment. OR Associates Degree or higher (completed and verified prior to start) and Two (2) years of manufacturing experience in a private, public, government or military environment. AND One (1) year of experience with mechanical and electrical drawings. Additional qualifications that could help you succeed even further in this role include: Bachelor's degree in Electrical, Mechanical, or Mechatronics Engineering from an accredited institution Five (5) years of manufacturing in automotive or aerospace private, public, government or military environment Experience with reliability analysis, predictive (PdM), and preventative maintenance (PM). Skills include… Strong communication, independent, strategic, problem solving. PLC, Automation, variable frequency drives Work location: On-site Clarkston, GA Travel: May include up to 5% domestic/international Relocation Assistance: Not Authorized Must be legally authorized to work in country of employment without sponsorship for employment visa status (e.g., H1B status). Responsibilities of this position may include direct and/or indirect physical or logical access to information, systems, technologies subjected to the regulations/compliance with U.S. Export Control Laws. U.S. Export Control laws and U.S. Government Department of Defense contracts and sub-contracts impose certain restrictions on companies and their ability to share export-controlled and other technology and services with certain "non-U.S. persons" (persons who are not U.S. 
citizens or nationals, lawful permanent residents of the U.S., refugees, "Temporary Residents" (granted Amnesty or Special Agricultural Worker provisions), or persons granted asylum (but excluding persons in nonimmigrant status such as H-1B, L-1, F-1, etc.) or non-U.S. citizens. To comply with these laws, and in conjunction with the review of candidates for those positions within 3M that may present access to export controlled technical data, 3M must assess employees' U.S. person status, as well as citizenship(s). The questions asked in this application are intended to assess this and will be used for evaluation purposes only. Failure to provide the necessary information in this regard will result in our inability to consider you further for this particular position. The decision whether or not to file or pursue an export license application is at 3M Company's sole election. Supporting Your Well-being 3M offers many programs to help you live your best life - both physically and financially. To ensure competitive pay and benefits, 3M regularly benchmarks with other companies that are comparable in size and scope. Chat with Max For assistance with searching through our current job openings or for more information about all things 3M, visit Max, our virtual recruiting Applicable to US Applicants Only: The expected compensation range for this position is $81,983 - $100,202, which includes base pay plus variable incentive pay, if eligible. This range represents a good faith estimate for this position. The specific compensation offered to a candidate may vary based on factors including, but not limited to, the candidate's relevant knowledge, training, skills, work location, and/or experience. In addition, this position may be eligible for a range of benefits (e.g., Medical, Dental & Vision, Health Savings Accounts, Health Care & Dependent Care Flexible Spending Accounts, Disability Benefits, Life Insurance, Voluntary Benefits, Paid Absences and Retirement Benefits, etc.). 
Additional information is available at: ******************************************************************* Faith Posting Date Range 08/11/2025 To 09/10/2025 Or until filled All US-based 3M full time employees will need to sign an employee agreement as a condition of employment with 3M. This agreement lays out key terms on using 3M Confidential Information and Trade Secrets. It also has provisions discussing conflicts of interest and how inventions are assigned. Employees that are Job Grade 7 or equivalent and above may also have obligations to not compete against 3M or solicit its employees or customers, both during their employment, and for a period after they leave 3M. Learn more about 3M's creative solutions to the world's problems at ********** or on Instagram, Facebook, and LinkedIn @3M. Responsibilities of this position include that corporate policies, procedures and security standards are complied with while performing assigned duties. Safety is a core value at 3M. All employees are expected to contribute to a strong Environmental Health and Safety (EHS) culture by following safety policies, identifying hazards, and engaging in continuous improvement. Pay & Benefits Overview: https://**********/3M/en_US/careers-us/working-at-3m/benefits/ 3M does not discriminate in hiring or employment on the basis of race, color, sex, national origin, religion, age, disability, veteran status, or any other characteristic protected by applicable law. Please note: your application may not be considered if you do not provide your education and work history, either by: 1) uploading a resume, or 2) entering the information into the application fields directly. 3M Global Terms of Use and Privacy Statement Carefully read these Terms of Use before using this website. Your access to and use of this website and application for a job at 3M are conditioned on your acceptance and compliance with these terms. 
Please access the linked document by clicking here, select the country where you are applying for employment, and review. Before submitting your application, you will be asked to confirm your agreement with the terms.
$82k-100.2k yearly Auto-Apply 60d+ ago
Site Reliability Engineer III
Jpmorgan Chase & Co 4.8
Wilmington, NC jobs
There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III at JPMorgan Chase within the enterprise technology, Corporate Data & analytical service team, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform. Job responsibilities Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability, scalability, and solutions in their applications Implements infrastructure, configuration, and network as code for the applications and platforms in your remit Collaborates with technical experts, key stakeholders, and team members to resolve complex problems Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers Supports the adoption of site reliability engineering best practices within your team Required qualifications, capabilities, and skills Formal training or certification on site reliability culture and principles concepts and 3+ years applied experience Proficient in site reliability culture and principles and familiarity with 
how to implement site reliability within an application or platform Proficient in at least one programming language such as Python, Java/Spring Boot, and .Net Proficient knowledge of software applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.) Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform Familiarity with container and container orchestration such as ECS, Kubernetes, and Docker Experience in cloud computing (preferably AWS). Familiarity with troubleshooting common networking technologies and issues Preferred qualifications, capabilities, and skills Ability to contribute to large and collaborative teams by presenting information in a logical and timely manner with compelling language and limited supervision Ability to proactively recognize road blocks and demonstrates interest in learning technology that facilitates innovation Ability to identify new technologies and relevant solutions to ensure design constraints are met by the software team Ability to initiate and implement ideas to solve business problems Certifications in AWS, Splunk, Dynatrace, Terraform would be preferred Experience in cloud data lakes (databricks/snowflake)
$97k-118k yearly est. Auto-Apply 60d+ ago
Java Site Reliability Engineer, Messaging Platforms
Pimco 4.9
Austin, TX jobs
We are a leading global asset management firm with over 3,000 employees across 20 offices in 15 countries; we help millions of investors around the world pursue their financial goals. We hire critical thinkers. People who thrive in a collaborative culture like ours where we solve real problems while building the future of finance. You * Are excited to be part of a vibrant engineering community that values diversity, hard work, and continuous learning. * Love solving complex real-world business problems. * Recognize that cross-functional collaboration is a core component of success for the team. * Believe there are multiple ways to solve most technical problems and are willing to debate the trade-offs. * Have become a stronger engineer by making mistakes and learning from them. * Are a doer, someone who wants to grow their career and gain experience across technologies and business functions. We * Continuously invest in a high-performance and inclusive culture, in which a diversity of backgrounds, experiences and viewpoints are celebrated and valued. * Encourage career mobility, so you can benefit from learning different functions and technologies, and we gain the benefits of your experience across teams. * Run technology pro bono programs that help the non-profit community and give our engineering community opportunities to volunteer and participate. * Offer education reimbursements and ongoing training in technology, communication, and diversity & inclusion. * Embrace knowledge sharing through lunch-and-learns, demos, and technical forums. * Consider our people to be our greatest asset-we will help you learn what PIMCO Technology has to offer so you can participate in activities that benefit your career while delivering impactful technology solutions. As a Java SRE in Trading Technology, you will: * As our immediate need * Help support the messaging platforms in use (MQ, AMPS, Kafka, etc.). 
* Drive the firm's best use of these platforms, making sure all choices make sense, the correct tools are used for solving each job, and that we build a sustainable messaging strategy. * Improve the operational efficiency and reduce the operational risk of our messaging platforms through better tools, better design, and better monitoring. * In the future * there will be new architectural or coding problems that we will need an experienced engineer to help solve. * Work closely with the business and other teams to design and implement solutions that have immediate impact to the business and help us build towards our strategic vision across all our trade floor applications. We need someone proficient in Java, passionate about SRE practices, and able to collaborate effectively with an infrastructure team. We expect you to have a strong passion for messaging systems, including their proper setup, monitoring, and maintenance. At the same time, this role involves software development for target platforms once the immediate needs related to messaging platforms are resolved. You will work with a team consisting of 1 SRE and 1 Unix SA, with full support from the infrastructure and DevOps teams. Position Requirements * Bachelor's degree in computer science or equivalent * Strong Linux skills (including chef, puppet, ansible configuration tools) * Strong experience with different messaging systems (Kafka, AMPS, MQ, FIX, etc.). * Strong engineering culture (unit tests, CI/CD) * Ability to work independently and in teams * Good communication skills * Working from the office in Austin 4 days a week. PIMCO follows a total compensation approach when rewarding employees which includes a base salary and a discretionary bonus. Base salary is the fixed component of compensation that is determined by core job responsibilities, relevant experience, internal level, and market factors. 
The discretionary bonus is used to award performance and therefore is determined by company, business, team, and individual performance. Salary Range: $ 175,000.00 - $ 240,000.00 Equal Employment Opportunity and Affirmative Action Statement PIMCO recruits and hires qualified candidates without regard to race, national origin, ancestry, religion (including religious dress and grooming practices), sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), sexual orientation, gender (including gender identity and expression), age, military or veteran status, disability (physical or mental), any factor prohibited by law, and as such affirms in policy and practice to support and promote the concept of equal employment opportunity and affirmative action, in accordance with all applicable federal, state, provincial and municipal laws. The company also prohibits discrimination on other basis such as medical condition, or marital status under applicable laws. Applicants with Disabilities PIMCO is an Equal Employment Opportunity/Affirmative Action employer. We provide reasonable accommodation for qualified individuals with disabilities, including veterans, in job application procedures. If you have any difficulty using our online system due to a disability and you would like to request an accommodation, you may contact us at ************ and leave a message. This is a dedicated line designed exclusively to assist job seekers with disabilities to apply online. Only messages left for this purpose will be considered. A response to your request may take up to two business days.
$175k-240k yearly Auto-Apply 60d+ ago
Java Site Reliability Engineer, Messaging Platforms
Pacific Investment Management Co 4.9
Austin, TX jobs
We are a leading global asset management firm with over 3,000 employees across 20 offices in 15 countries; we help millions of investors around the world pursue their financial goals. We hire critical thinkers. People who thrive in a collaborative culture like ours where we solve real problems while building the future of finance. You Are excited to be part of a vibrant engineering community that values diversity, hard work, and continuous learning. Love solving complex real-world business problems. Recognize that cross-functional collaboration is a core component of success for the team. Believe there are multiple ways to solve most technical problems and are willing to debate the trade-offs. Have become a stronger engineer by making mistakes and learning from them. Are a doer, someone who wants to grow their career and gain experience across technologies and business functions. We Continuously invest in a high-performance and inclusive culture, in which a diversity of backgrounds, experiences and viewpoints are celebrated and valued. Encourage career mobility, so you can benefit from learning different functions and technologies, and we gain the benefits of your experience across teams. Run technology pro bono programs that help the non-profit community and give our engineering community opportunities to volunteer and participate. Offer education reimbursements and ongoing training in technology, communication, and diversity & inclusion. Embrace knowledge sharing through lunch-and-learns, demos, and technical forums. Consider our people to be our greatest asset-we will help you learn what PIMCO Technology has to offer so you can participate in activities that benefit your career while delivering impactful technology solutions. As a Java SRE in Trading Technology, you will: As our immediate need Help support the messaging platforms in use (MQ, AMPS, Kafka, etc.). 
Drive the firm's best use of these platforms, making sure all choices make sense, the correct tools are used for solving each job, and that we build a sustainable messaging strategy. Improve the operational efficiency and reduce the operational risk of our messaging platforms through better tools, better design, and better monitoring. In the future there will be new architectural or coding problems that we will need an experienced engineer to help solve. Work closely with the business and other teams to design and implement solutions that have immediate impact to the business and help us build towards our strategic vision across all our trade floor applications. We need someone proficient in Java, passionate about SRE practices, and able to collaborate effectively with an infrastructure team. We expect you to have a strong passion for messaging systems, including their proper setup, monitoring, and maintenance. At the same time, this role involves software development for target platforms once the immediate needs related to messaging platforms are resolved. You will work with a team consisting of 1 SRE and 1 Unix SA, with full support from the infrastructure and DevOps teams. Position Requirements Bachelor's degree in computer science or equivalent Strong Linux skills (including chef, puppet, ansible configuration tools) Strong experience with different messaging systems (Kafka, AMPS, MQ, FIX, etc.). Strong engineering culture (unit tests, CI/CD) Ability to work independently and in teams Good communication skills Working from the office in Austin 4 days a week. PIMCO follows a total compensation approach when rewarding employees which includes a base salary and a discretionary bonus. Base salary is the fixed component of compensation that is determined by core job responsibilities, relevant experience, internal level, and market factors. 
The discretionary bonus is used to award performance and therefore is determined by company, business, team, and individual performance. Salary Range: $ 175,000.00 - $ 240,000.00 Equal Employment Opportunity and Affirmative Action Statement PIMCO recruits and hires qualified candidates without regard to race, national origin, ancestry, religion (including religious dress and grooming practices), sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), sexual orientation, gender (including gender identity and expression), age, military or veteran status, disability (physical or mental), any factor prohibited by law, and as such affirms in policy and practice to support and promote the concept of equal employment opportunity and affirmative action, in accordance with all applicable federal, state, provincial and municipal laws. The company also prohibits discrimination on other basis such as medical condition, or marital status under applicable laws. Applicants with Disabilities PIMCO is an Equal Employment Opportunity/Affirmative Action employer. We provide reasonable accommodation for qualified individuals with disabilities, including veterans, in job application procedures. If you have any difficulty using our online system due to a disability and you would like to request an accommodation, you may contact us at ************ and leave a message. This is a dedicated line designed exclusively to assist job seekers with disabilities to apply online. Only messages left for this purpose will be considered. A response to your request may take up to two business days.
$175k-240k yearly Auto-Apply 43d ago