Post job

Reliability Engineer jobs at The Hartford

- 668 jobs
  • Sr. Cloud Engineer - Intelligent Document Processing

    The Hartford 4.5company rating

    Reliability engineer job at The Hartford

    Sr Cloud Engineer - IE07NE We're determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals - and to help others accomplish theirs, too. Join our team as we help shape the future. We are seeking a highly capable and technically proficient Senior Cloud Engineer to join our Intelligent Document Processing (IDP) team. This role demands hands-on expertise in cloud-native engineering, automation of document workflows, and integration of advanced data extraction technologies. You will be instrumental in scaling and optimizing our IDP platform, working alongside engineering leads and business stakeholders to deliver robust, production-grade solutions. This position is ideal for someone who thrives in a fast-paced, agile environment and is ready to take ownership of complex systems, drive technical excellence, and contribute to strategic platform evolution. This role will have a Hybrid work schedule, with the expectation of working in an office (Columbus, OH, Chicago, IL, Hartford, CT or Charlotte, NC) 3 days a week (Tuesday through Thursday). Key Responsibilities: + Engineer and maintain cloud-native IDP solutions using AWS services and third-party vendor platforms. + Develop and optimize automated workflows for document ingestion, classification, extraction, and validation. + Integrate IDP components with enterprise systems, ensuring seamless data flow and operational reliability. + Lead performance tuning and cost optimization efforts across cloud infrastructure. + Implement monitoring, alerting, and fault-tolerant mechanisms to ensure platform resilience. + Collaborate with software engineers, data scientists, and architects to align IDP capabilities with business goals. + Author and maintain technical documentation, including architecture diagrams, deployment guides, and operational runbooks. + Evaluate emerging technologies and contribute to platform roadmap discussions. Qualifications: + Bachelor's degree in Computer Science, Information Technology, or a related field. + 5+ years of experience in cloud engineering, with direct involvement in Intelligent Document Processing or similar automation platforms. + Deep expertise in AWS services (e.g., Lambda, S3, Step Functions, ECS, IAM). + Strong programming skills in .NET and Python, with experience building scalable backend services. + Hands-on experience with OCR, NLP, and machine learning technologies in production environments. + Proficiency with containerization (Docker) and orchestration (Kubernetes). + Proven ability to troubleshoot complex systems and implement resilient cloud architectures. + Excellent communication skills and a collaborative mindset. + AWS certifications (e.g., Solutions Architect, Developer, SysOps). + Experience with GenAI-powered IDP platforms (e.g., MEA, Azure Form Recognizer). + Familiarity with insurance-specific document formats (e.g., SOVs, financial statements). + Exposure to CI/CD pipelines and infrastructure-as-code tools (e.g., Terraform, CloudFormation). Candidate must be authorized to work in the US without company sponsorship. The company will not support the STEM OPT I-983 Training Plan endorsement for this position. Compensation The listed annualized base pay range is primarily based on analysis of similar positions in the external market. Actual base pay could vary and may be above or below the listed range based on factors including but not limited to performance, proficiency and demonstration of competencies required for the role. The base pay is just one component of The Hartford's total compensation package for employees. Other rewards may include short-term or annual bonuses, long-term incentives, and on-the-spot recognition. The annualized base pay range for this role is: $136,000 - $204,000 Equal Opportunity Employer/Sex/Race/Color/Veterans/Disability/Sexual Orientation/Gender Identity or Expression/Religion/Age About Us (************************************* | Our Culture (******************************************************* | What It's Like to Work Here (************************************************** | Perks & Benefits (********************************************* Every day, a day to do right. Showing up for people isn't just what we do. It's who we are - and have been for more than 200 years. We're devoted to finding innovative ways to serve our customers, communities and employees-continually asking ourselves what more we can do. Is our policy language as simple and inclusive as it can be? Can we better help businesses navigate our ever-changing world? What else can we do to destigmatize mental health in the workplace? Can we make our communities more equitable? That we can rise to the challenge of these questions is due in no small part to our company values that our employees have shaped and defined. And while how we contribute looks different for each of us, it's these values that drive all of us to do more and to do better every day. About Us (************************************* Our Culture What It's Like to Work Here (************************************************** Perks & Benefits Legal Notice (***************************************** Accessibility Statement Producer Compensation (************************************************** EEO Privacy Policy (************************************************** California Privacy Policy Your California Privacy Choices (****************************************************** International Privacy Policy Canadian Privacy Policy (**************************************************** Unincorporated Areas of LA County, CA (Applicant Information) MA Applicant Notice (******************************************** Hartford India Prospective Personnel Privacy Notice
    $136k-204k yearly 29d ago
  • NPD Quality Engineer

    Tata Consultancy Services 4.3company rating

    Plymouth, MA jobs

    Must Have Technical/Functional Skills • Knowledge on Quality Management and its tools & techniques • Knowledge about GMP (Good Manufacturing Practices), FDA, ISO 13485 and compliance regulations • Knowledge on Medical Device Regulatory Standards, MDD and MDR • Knowledge on NC, CAPA, Root Cause Analysis and Audit processes • Knowledge on Validation process, writing protocols/ reports • Very good understanding/ experience in writing procedures, product specs and work instructions • Knowledge in Statistics, Risk Management and Design control • Must possess good communication skills (verbal and written), familiar with project management methodology, problem solving, and presentation skills • Experience in creating FMEAs & Writing reports • Experience in PMS (Post Market Surveillance) • Experience in PLM Tool (Windchill) • Good understanding of Design, Drawing and GD&T • Excellent Interpersonal / communication skills, Organizational / planning and Project management skills preferred • Personal computer skills, Windows: word processing, presentation, e-mail, web browsers & spreadsheet software • Ability to work efficiently, meet timelines, and communicate status (generate trackers, send emails, etc.) Roles & Responsibilities • Under limited supervision and in accordance with all applicable federal, state and local laws/regulations and Corporate Johnson & Johnson, procedures and guidelines, the duties and responsibilities for this position are: • Development and review of PDP (Product development Process) deliverables • Review and approve R&D/ Engineering protocol/ reports • Development of Risk management records (i.e. DFMEA/ PFMEA) in collaboration with SMEs • Support and provide guidance on Validations and if required write Validation Protocols/ Reports • Support/ Remediation of Validation/ Quality Documentation • Support Root Cause Investigation and closure of NC and CAPA • Review and approve the Change Orders (CR/ CN) • Review and update the design/ process control documents like procedures/ work instructions/ product specs etc. • Work with cross functional teams and internal teams to create deliverables • Performs other duties assigned as needed Salary Range: $90,000 $95,000 Year TCS Employee Benefits Summary: Discretionary Annual Incentive. Comprehensive Medical Coverage: Medical & Health, Dental & Vision, Disability Planning & Insurance, Pet Insurance Plans. Family Support: Maternal & Parental Leaves. Insurance Options: Auto & Home Insurance, Identity Theft Protection. Convenience & Professional Growth: Commuter Benefits & Certification & amp; Training Reimbursement. Time Off: Vacation, Time Off, Sick Leave & Holidays. Legal & Financial Assistance: Legal Assistance, 401K Plan, Performance Bonus, College Fund, Student Loan Refinancing.
    $90k-95k yearly 2d ago
  • Validation Engineer

    Tata Consultancy Services 4.3company rating

    Warren, MI jobs

    Must Have Technical/Functional Skills • 2-3 years of experience in QA, validation, or testing of AI/robotics systems • Familiarity with ablation testing methodologies and performance evaluation metrics • Experience with simulation platforms (Isaac Sim, Omniverse, ROS/Gazebo) • Proficiency in Python, C++, and scripting for test automation • Understanding of computer vision models and robotic perception systems • Strong analytical and documentation skills • Exposure to CI/CD tools and version control systems (Git, Jenkins) is a plus Roles & Responsibilities • Design and execute ablation tests to evaluate model robustness under varying lighting, pose, and occlusion conditions • Track and analyze success metrics such as accuracy, precision, recall, and latency • Validate AI models and robotic behaviors in simulation (Isaac Sim, Omniverse) and real-world setups • Document validation procedures, test results, and performance benchmarks • Collaborate with ML/CV and robotics teams to identify failure modes and improvement areas • Maintain test environments and ensure reproducibility of validation workflows • Support continuous integration and testing pipelines for AI model deployment• Generic Managerial Skills, If any The engineer will work closely with AI, simulation, and robotics teams to ensure robustness, reliability, and safety of deployed systems. Base Salary Range: $130,000 - $170,000 per annum TCS Employee Benefits Summary: Discretionary Annual Incentive. Comprehensive Medical Coverage: Medical & Health, Dental & Vision, Disability Planning & Insurance, Pet Insurance Plans. Family Support: Maternal & Parental Leaves. Insurance Options: Auto & Home Insurance, Identity Theft Protection. Convenience & Professional Growth: Commuter Benefits & Certification & Training Reimbursement. Time Off: Vacation, Time Off, Sick Leave & Holidays. Legal & Financial Assistance: Legal Assistance, 401K Plan, Performance Bonus, College Fund, Student Loan Refinancing.
    $130k-170k yearly 1d ago
  • Packaging Engineer

    Tata Consultancy Services 4.3company rating

    Fort Washington, PA jobs

    Must Have Technical/Functional Skills A minimum of 8-10 years of industry experience is required with at least 3 years of Package Development experience. Specific experience within the Consumer, OTC, or Pharmaceutical industry is must. GMP experience is must. Experience in a highly regulated environment is preferred. Demonstrated technical knowledge related to package materials, equipment, testing and package development is required. Roles & Responsibilities o Plan and execute package engineering assignments concerned with large life cycle management initiatives. o Engage in the development of the material and structural aspects of packages, including Primary, Secondary & Tertiary materials to ultimately deliver a robust packaging system to the market. o Design, Create & approve component specifications. Work closely with R&D for primary components design and product related changes. o Lead the package design development and assessment, Develop, write, gain cross-functional alignment, and route for approval package development documentation that captures the end-to-end project specific information. (Examples of documentation: Package Component Specifications, Package Development Assessment and Plan documents, Packaging Line Trial Protocols and Reports, Package Development Reports, etc.) o Lead packaging development projects. o Determine and coordinate physical testing to ensure product and package integrity for manufacturing through end users. o Lead troubleshooting in the resolution of packaging related issues in manufacturing and the field. o Execute packaging projects in compliance with government and corporate guidelines. o Execute package line trials and package testing. o cGMP (Current Good Manufacturing Practice) working experience. cGMP documentation proficiency. o Creating: Copy and graphics specification, Pallet patterns and Finished Put-up specifications. o SAP related tasks including Data Entry, Raw Material Code requests, Bill Of Material Creation and revisions. Generic Managerial Skills, If any Co-ordination with Stakeholders to trace, monitor & package development process Salary Range: $70,000 $90,000 Year TCS Employee Benefits Summary: Discretionary Annual Incentive. Comprehensive Medical Coverage: Medical & Health, Dental & Vision, Disability Planning & Insurance, Pet Insurance Plans. Family Support: Maternal & Parental Leaves. Insurance Options: Auto & Home Insurance, Identity Theft Protection. Convenience & Professional Growth: Commuter Benefits & Certification & amp; Training Reimbursement. Time Off: Vacation, Time Off, Sick Leave & Holidays. Legal & Financial Assistance: Legal Assistance, 401K Plan, Performance Bonus, College Fund, Student Loan Refinancing.
    $70k-90k yearly 2d ago
  • Production Engineer

    American Honda Motor Co 4.6company rating

    Summerfield, NC jobs

    What Makes a Honda, is Who makes a Honda Honda has a clear vision for the future, and it's a joyful one. We are looking for individuals with the skills, courage, persistence, and dreams that will help us reach our future-focused goals. At our core is innovation. Honda is constantly innovating and developing solutions to drive our business with record success. We strive to be a company that serves as a source of “power” that supports people around the world who are trying to do things based on their own initiative and that helps people expand their own potential. To this end, Honda strives to realize “the joy and freedom of mobility” by developing new technologies and an innovative approach to achieve a “zero environmental footprint.” We are looking for qualified individuals with diverse backgrounds, experiences, continuous improvement values, and a strong work ethic to join our team. If your goals and values align with Honda's, we want you to join our team to Bring the Future! Job Purpose Lead, create, and implement innovative technical activities and solutions in the areas of Mass Production, Business Plan and New Model to efficiently meet or exceed Safety, Environment, Quality, Delivery, Cost, and Morale characteristic targets. Key Accountabilities Effectively communicate upstream and downstream to all levels of the organization to assure common understanding and direction. Review and analyze daily report(s) to identify safety, quality, delivery gaps and develop potential countermeasures and /or root cause analysis opportunities striving for continuous improvement. Utilize data analysis and PDCA to lead, support, develop and justify solutions with related groups/departments for your area of responsibility to solve complex problems. Monitor and manage equipment and processes to ensure optimal manufacturing performance and function while minimizing operating expense. Develop capability of self, colleagues, and team through training, mentoring, and sharing of experiences in area of technical expertise and understanding. Establish priorities and make decisions based on data analytics to most effectively accomplish business objectives. Manage project implementation, schedule, budget and resource allocations to ensure successful completion and target achievement. Test, evaluate, and implement new and innovative technologies to improve overall equipment and process efficiency. Develop and manage investment and expense budgets to achieve overall cost targets. Qualifications, Experience, and Skills Minimum Educational Qualifications Bachelors or Associates degree in engineering or engineering technology with relevant experience (mechanical, manufacturing, industrial or electrical, etc) with interest in manufacturing, if no degree 6 years of experience required Minimum Experience Mfg. co-op experience preferred but not required Decisions ExpectedWorking Conditions Work in production environment requiring PPE and lockout in manufacturing operations Manufacturing environment with the potential of working near hydraulic oils, cutting lubricants, ferrous and aluminum materials Work in production environment requiring PPE and lockout in manufacturing operations Working near oils, cutting lubricants Hands-on investigation and troubleshooting within equipment to countermeasure issues and to determine improvement activity Working with hand/power tools, quality gauging and instrumentation 50% office environment/ 50% manufacturing lineside activity Possible weekend or off-shift support as necessary 10-15 hours overtime per week Possible weekend or off-shift support as necessary Travel 5% (domestic & international) What differentiates Honda and make us an employer of choice? Total Rewards: • Competitive Base Salary (pay will be based on several variables that include, but not limited to geographic location, work experience, etc.) • Paid Overtime • Regional Bonus (when applicable) • Industry-leading Benefit Plans (Medical, Dental, Vision, Rx) • Paid time off, including vacation, holidays, shutdown • Company Paid Short-Term and Long-Term Disability • 401K Plan with company match + additional contribution • Relocation assistance (if eligible) Career Growth: • Advancement Opportunities • Career Mobility • Education Reimbursement for Continued Learning • Training and Development programs Additional Offerings: • Tuition Assistance & Student Loan Repayment • Lifestyle Account • Childcare Reimbursement Account • Elder Care Support • Wellbeing Program • Community Service and Engagement Programs • Product Programs • Free Drinks Onsite Honda is an equal opportunity employer and considers qualified applicants for employment without regard to race, color, creed, religion, national origin, sex, sexual orientation, gender identity and expression, age, disability, veteran status, or any other protected factor.
    $53k-71k yearly est. 1d ago
  • Site Reliability Engineer

    The Voleon Group 4.1company rating

    Remote

    Voleon is a technology company that applies state-of-the-art AI and machine learning techniques to real-world problems in finance. For nearly two decades, we have led our industry and worked at the frontier of applying AI/ML to investment management. We have become a multibillion-dollar asset manager, and we have ambitious goals for the future. Your colleagues will include internationally recognized experts in artificial intelligence and machine learning research as well as highly experienced finance and technology professionals. The people who shape our company come from other backgrounds, including concert music performances, humanitarian aid, opera singing, sports writing, and BMX racing. You will be part of a team that loves to succeed together. In addition to our enriching and collegial working environment, we offer highly competitive compensation and benefits packages, technology talks by our experts, a beautiful modern office, daily catered lunches, and more. As a Site Reliability Engineer (SRE), you will work at the intersection of production operations and software development as you improve, manage, and monitor production-critical infrastructure and data pipelines. At Voleon, many SREs serve together on a Production Operations team tasked with improving shared production infrastructure. Others are embedded with teams of software engineers to improve specific production systems owned by those teams. Voleon SREs work on important real-world problems and collaborate with passionate and talented colleagues in an empowering, results-driven environment. This role is a way to make a real difference: your contributions will make our critical systems more reliable, lower operational risk, and increase the efficiency of our engineering effort.Responsibilities Improve fault-tolerance and maintainability of code in proprietary data pipelines and trading systems Diagnose and fix bugs in code Lead complex deployments Automate manual workflows Track and prioritize outstanding production-related issues Share an on-call rotation responding to incidents to ensure the continuous operation of production-critical systems Requirements Experience with coding and debugging Python Experience with Linux Familiarity with Relational Databases & SQL Sharp analytical and problem-solving skills and a persistent drive to make things work (better) Strong growth mindset and a passion for learning Strong technical communication skills Attention to detail 2 years of relevant industry experience An undergraduate degree or comparable training in a quantitative field or equivalent, relevant industry experience Preferred Qualifications Familiarity with best practices concerning code maintainability, documentation, quality assurance, continuous integration and deployment Experience supporting production systems Experience with any of the following: gRPC microservices, Postgres, Pandas, Golang, R, Git, Jenkins, Bazel, Prometheus, Grafana, Airflow, Kubernetes The base salary for this position is $115,000 to $135,000 in the location(s) of this posting. Individual salaries are determined through a variety of factors, including, but not limited to, education, experience, knowledge, skills, and geography. Base salary does not include other forms of total compensation such as bonus compensation and other benefits. Our benefits package includes medical, dental and vision coverage, life and AD&D insurance, 20 days of paid time off, 9 sick days, and a 401(k) plan with a company match. “Friends of Voleon” Candidate Referral ProgramIf you have a great candidate in mind for this role and would like to have the potential to earn $7,500 - $15,000 if your referred candidate is successfully hired and employed by The Voleon Group, please use this form to submit your referral. For more details regarding eligibility, terms and conditions please make sure to review the Voleon Referral Bonus Program. Equal Opportunity EmployerThe Voleon Group is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.
    $115k-135k yearly Auto-Apply 3d ago
  • Site Reliability Engineer 2

    Drivewealth 4.0company rating

    Remote

    DriveWealth is a global B2B financial technology organization dedicated to democratizing access to financial independence around the world. Our mission is realized through an API-based platform, empowering our partners to offer seamless investing and trading experiences to clients worldwide, all from their mobile devices. Our technology provides partners with a modern, extensible toolkit, enabling traditional investment workflows and innovative techniques like fractional share ownership. DriveWealth has evolved into a global platform offering trading of US equities, mutual funds, ETFs, fixed income, and options. We seek enthusiastic professionals to contribute diverse perspectives and experiences to our Brokerage-as-a-Service platform. Our culture blends the pace and opportunity of a tech start-up with the impact, stability, and significance of Wall Street. We encourage creativity and experimentation while ensuring institutional-grade execution and regulatory compliance in everything we do. We value diversity and inclusion, celebrating the unique differences of our employees as we scale and grow together. We're guided by operating principles grounded in accountability, teamwork, integrity, and solutions built to scale. Join us! About The Role As a Site Reliability Engineer 2, you will enhance the reliability and performance of our Brokerage-as-a-Service platform during critical 7/24 operations. This role demands a proactive approach to managing technical challenges and system optimizations that align with our global operational strategies. What You'll Do Support the SRE team in developing and implementing enhancements to support workflows, focusing on automation and efficiency improvements. Handle technical escalations, troubleshoot complex issues, and actively participate in on-call rotations to ensure rapid response and resolution during non-traditional hours. Adhere and administer incident and change management policies. Coordinate incident resolution efforts and implement change management protocols to maintain and enhance system reliability, especially during critical system operations at night. Work closely with the New York office to ensure smooth operation and alignment of SRE practices across time zones. What You'll Need 3+ years in a SRE role or a similar position, demonstrating deep knowledge and expertise in site reliability engineering and operations. Working knowledge in REST APIs and understanding of API integration. Python proficiency in scripting for automation and system management, with a track record of developing and implementing automation solutions. SQL and Database expertise in transactional databases, including querying and troubleshooting. Analytical and troubleshooting skills with a demonstrated ability to perform troubleshooting and root cause analysis of technical issues. Availability for flexible work hours and willingness to cover US markets trading sessions, including L2 on-call coverage. Knowledge of Change Management Process and Risk Management. Nice to Have, But No Required Experience in the brokerage or financial industry Proficient with cloud services, particularly AWS, and knowledgeable about cloud architecture best practices, including IAM, EC2, S3, and DynamoDB Experience maintaining and supporting containerized systems, with familiarity in orchestration tools Knowledge of Infrastructure as Code (IaC) practices and tools such as Terraform or CloudFormation Ability to manage and troubleshoot job scheduling tools like Rundeck or Apache Airflow Advanced skills in managing containerized environments using Kubernetes and OpenShift Practical experience with Confluent Cloud for event streaming architectures Experience with Java applications and a basic understanding of using the browser developer console for front-end debugging Additional Notes: This role is critical for our continuous operations and requires a commitment to nighttime hours, aligning with the global nature of our financial services. Candidates must be prepared for intense collaboration periods and proactive communication across global teams. Applicants must be authorized to work for any employer in the U.S. DriveWealth is unable to sponsor or take over sponsorship of an employment Visa at this time. Compensation Compensation package offerings are based on candidate experience and technical qualifications, as it relates to the role. These are identified and determined throughout your interviewing experience. Please note: at this time, we are not able to hire in all states. Remote (Most US States) Pay Range$130,000-$150,000 USD Benefits Competitive medical, dental, and vision insurance options Mental health resources Generous paid time off with observed holidays (varies per country) Paid parental leave for biological and adoptive parents Up to $2,500 or local equivalent each year to invest in continued education and personal development Up to $900 each year or local equivalent for fitness and wellness reimbursement Company-provided phone (varies by country) For HQ in-office employees, a daily lunch stipend, unlimited snacks, and engaging office space in the Financial District Pre-tax commuter benefits (US only) Employer 401K match (US only) Benefit offerings vary based on country and are subject to change. Equal Employment Opportunity To build technology and products that are used and loved by people and solve real-world problems, we need to build a team with many different perspectives and experiences. We are an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status. We encourage candidates from all backgrounds to apply. Applicants in need of special assistance or accommodation during the interview process or in accessing our website may contact us at **************************. Agency Disclaimer DriveWealth does not accept agency resumes. Please do not forward resumes to our jobs alias, employees, or any other organization location. DriveWealth is not responsible for any fees related to unsolicited resumes.
    $130k-150k yearly Auto-Apply 18d ago
  • Staff Site Reliability Engineer

    Figure 4.5company rating

    Sunnyvale, CA jobs

    Figure is an AI robotics company developing autonomous general-purpose humanoid robots. The goal of the company is to ship humanoid robots with human level intelligence. Its robots are engineered to perform a variety of tasks in the home and commercial markets. Figure is headquartered in San Jose, CA. We are looking for a Site Reliability Engineer to own our internal systems infrastructure. This role is responsible for setting up and managing cloud and on-prem infrastructure to deliver highly available, reliable, and automated systems. Responsibilities: Be the go to person for mission critical infrastructure enabling critical operations such as Source Configuration Management, CI/CD systems, software distribution, supplier portals, manufacturing and more. Migrate SaaS to self-hosted solutions to enhance security and reliability. Implement monitoring and alerting systems, and define incident response plans and runbooks. Reduce human workload through automation to automate deployment and scaling. Establish strong relationships with stakeholders to identify infrastructure needs and establish Service Level Objectives. Use a data driven approach to demonstrate service robustness and track optimization work. Partner with the security team to ensure that security remediations and updates are applied in a timely manner. Requirements: Strong experience with Linux/Unix systems administration Proficiency in programming/scripting Extensive experience with cloud platforms (Azure, AWS, GCP) and on-prem hardware architectures Experience designing, deploying, and operating high-availability, fault-tolerant, and distributed systems. Mastery of infrastructure as code (Terraform, CloudFormation, Ansible…) Familiarity with monitoring, logging, and alerting tools (Prometheus, Grafana, Datadog…) Solid understanding of networking fundamentals (TCP/IP, DNS, HTTP, load balancers, firewalls) Experience defining Service Level Objectives (SLO), developing runbooks/incident response plans, facilitating post-mortems and managing systems assets. Ability to work in cross-functional teams with developers, infra, and product teams Excellent verbal and written communication skills The US base salary range for this full-time position is between $175,000 - $250,000 annually. The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.
    $175k-250k yearly Auto-Apply 31d ago
  • Principal Site Reliability Engineer

    Jpmorgan Chase 4.8company rating

    Palo Alto, CA jobs

    Join a globally recognized financial organization and advance your profession to new heights by contributing to revolutionary projects. You've discovered the perfect environment to have a major impact. As a **Principal Site Reliability Engineer** at JPMorgan Chase within the **Enterprise Technology, AI/ML & Data Platforms division** , you will utilize your expertise to create innovative solutions that improve critical incident management and streamline the software development lifecycle throughout the organization. Your role will involve overseeing, designing, and deploying infrastructure components to enhance reliability and ensure operational efficiency. **Job responsibilities** + Architect and implement observability platforms and tools for proactive detection and continuous improvement. + Lead the design and development of core observability services, including metrics pipelines and log aggregation. + Leverage modern technologies such as Open Telemetry and AI/ML for anomaly detection and automated insights. + Collaborate with engineering and SRE teams to define service-level objectives (SLOs) and error budgets. + Provide technical leadership and mentorship to engineering teams, ensuring best practices in system design. + Champion observability as a first-class concern in the software development lifecycle. + Influence platform strategy and roadmap through deep technical insight and alignment with business priorities. + Write advanced documentation and create executive presentations that translate technical issues into business impact. + Participate in industry professional forums and monitor relevant industry technologies and standards. + Lead medium to large projects by bringing together the proper perspective and integrating feedback from team members. + Participate in support responsibilities for coverage of critical applications. **Required qualifications, capabilities, and skills** + Formal training or certification on site reliability engineering concepts and 10+ years applied experience. + Ability to determine how each system relates to each other and build automation to improve reliability. + Experience with translating research, analysis, and tests into business recommendations. + Ability to balance and be accountable for the work of multiple architects and designers. + Understands and leads partnerships across job functions to develop efficient systems. + Engages team members and expresses complex ideas with appropriate level of detail, while providing constructive feedback. + Self-motivated and able to work well under pressure with minimal supervision. + Ability to tackle a problem by using a logical, systematic, sequential approach. **Preferred qualifications, capabilities, and skills** + Experience with cloud-native instrumentation and streaming data platforms. + Influence technology and policy decisions while fostering commitment and confidence in team members. + Develop effective solutions and analyze competitive positions by considering market trends. + Support the introduction of innovative methods and communicate clearly to persuade audiences. + Demonstrate concern and meet the needs of both internal and external customers. \#LI-RB3 JPMorganChase, one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world's most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management. We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process. We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation. JPMorgan Chase & Co. is an Equal Opportunity Employer, including Disability/Veterans **Base Pay/Salary** Palo Alto,CA $204,250.00 - $285,000.00 / year; Jersey City,NJ $204,250.00 - $285,000.00 / year
    $204.3k-285k yearly 60d+ ago
  • Site Reliability Engineer - Capital Markets

    Jefferies 4.8company rating

    Jersey City, NJ jobs

    Jefferies is seeking for Site Reliability Engineer to play an instrumental role in supporting Equity Front office trading application, risk and middle office real time products, developed and used for Equity Cash and ETS application. As part of the wider platform engineering team, you will be working closely with the Business users interactively throughout the day, along with technical, analysis and testing colleagues. Investigation and resolution of the work items at hand will require competent technical skills and a keen intellect. The business is a growth area, with current investments taking place in all the technology, business and middle office areas. Responsibilities: Front Line Site Reliable Engineering and Support functions for Equity trading systems used by Jefferies clients as well as internal users. Build monitoring tools for application and infrastructure components. Implement and manage scalable infrastructure using cloud-native technologies and tools. Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding. Partner with business, development and infrastructure teams to improve services through rigorous testing and release procedures. Develop and maintain CI/CD pipelines to streamline deployment processes. Expedient deployment of new systems. Capacity planning, Platform Management, and support for increasing volumes and business growth. Create sustainable systems and services through automation. Collaborate with Application team to establish and enforce production and development standards. Document procedures, best practices and troubleshooting FAQs. Resolve complex application and technical problems. Debugging the system and fixing the production related issues. Escalate / follow-up on permanent fix for development related issues. Lead incident response efforts and post-mortem analysis to prevent future occurrences. Handles complex operational tasks and recommends process and technology changes. Global support and includes weekend availability to troubleshoot production related issues and perform checkouts. Ability to work both independently and in groups in an energetic, diverse environment. Participate in on-call rotations to ensure 24/7 system availability and support. Support compliance and legal queries. Qualifications: Strong experience in Windows and Linux/Unix services. Strong experience in scripting language like Power shell, Python and SQL. Strong Knowledge of monitoring tools - Nagios, Splunk, OTEL, Datadog Strong Knowledge of FIX protocol Strong Domain skills - Must have working experience in Capital Markets across modules and instruments especially - CASH, ETS, Bonds, Options, Futures, Swaps products Experience in BFSI (Banking and Financial Industry) Domain applications with a proper understanding of the Trade Lifecycle. Excellent communication, time management and project management skills. Primary Location Full Time Salary Range of $175,000 - $200,000
    $175k-200k yearly Auto-Apply 21d ago
  • Site Reliability Engineer

    Tata Consulting Services 4.3company rating

    Atlanta, GA jobs

    Must Have Technical/Functional Skills * Monitoring solutions - CloudWatch, Dynatrace, PagerDuty * DevOps - GitLab, GitLab CI/CD, AWS Cloud Development Kit (CDK), CloudFormation (CFT) and CodePipeline * Languages, IDEs, Tools & Architectures - Node.js, TypeScript, YAML, VSCode, IntelliJ, Eclipse, REST API, Postman, Docker, * AWS Technologies - API Gateway, Route 53, Lambda, Kafka, ElastiCache, PostgeSQL, SNS, Quarkus, EventBridge, Secret Manager Roles & Responsibilities * Building and supporting a reliable application suite for the environment to meet the development and maintenance * requirements of systems/platforms * Implement Service Reliability Engineering by working as part of the development team to evaluate the health, stability, and reliability of applications * Lead the team in best practices in incident, problem, and change management * Utilizing monitoring, alerts, dashboards, and management tools to ensure the availability, reliability, cost, and performance of applications and services * Constantly working to improve and implement automation of applications tasks * Providing technical support for systems/platforms according to application SLA's * Responsible for designing and developing resiliency in the application code, troubleshooting incidents, engaging with squads to address failure patterns, and participating in incident management * Develop delivery pipelines and automated deployment scripts * Configure services, such as databases and monitoring Salary Range-$100,000-$125,000 a year #LI-KR3 TCS Employee Benefits Summary: Discretionary Annual Incentive. Comprehensive Medical Coverage: Medical & Health, Dental & Vision, Disability Planning & Insurance, Pet Insurance Plans. Family Support: Maternal & Parental Leaves. Insurance Options: Auto & Home Insurance, Identity Theft Protection. Convenience & Professional Growth: Commuter Benefits & Certification & Training Reimbursement. Time Off: Vacation, Time Off, Sick Leave & Holidays. Legal & Financial Assistance: Legal Assistance, 401K Plan, Performance Bonus, College Fund, Student Loan Refinancing.
    $100k-125k yearly 30d ago
  • Tencent Cloud PaaS Associate Site Reliability Engineer

    Tencent 4.5company rating

    Palo Alto, CA jobs

    Business UnitWhat the Role EntailsJob Description: Research industry solutions, combine the customer's business technology solutions and the characteristics of Tencent's audio and video products, sort out valuable solutions, and organize them into sales support materials. Work closely with the business team to analyze the technical structure of the customer's media business and explore the customer's needs and value in audio and video scenarios. Provide industry solutions and cases serving the international market, such as OTT, social networking, games, education, business, etc. Conduct industry analysis and research, find a list of customers that meet the goals, and conduct business development work;Who We Look ForBachelor degree or above, computer, MBA related majors are preferred. Fluent English can be used as a working language, good communication skills and customer service awareness, and good desk research and writing skills; Good at thinking, high business sensitivity, excellent learning ability, logical thinking ability and problem-solving ability; Self-motivated and responsible, with passion for work, good stress resistance and team spirit. Location State(s) US-California-Palo AltoThe expected base pay range for this position in the location(s) listed above is $76,400.00 to $143,900.00 per year. Actual pay may vary depending on job-related knowledge, skills, and experience. Employees hired for this position may be eligible for a sign on payment, relocation package, and restricted stock units, which will be evaluated on a case-by-case basis. Subject to the terms and conditions of the plans in effect, hired applicants are also eligible for medical, dental, vision, life and disability benefits, and participation in the Company's 401(k) plan. The Employee is also eligible for up to 15 to 25 days of vacation per year (depending on the employee's tenure), up to 13 days of holidays throughout the calendar year, and up to 10 days of paid sick leave per year. Your benefits may be adjusted to reflect your location, employment status, duration of employment with the company, and position level. Benefits may also be pro-rated for those who start working during the calendar year.Equal Employment Opportunity at Tencent As an equal opportunity employer, we firmly believe that diverse voices fuel our innovation and allow us to better serve our users and the community. We foster an environment where every employee of Tencent feels supported and inspired to achieve individual and common goals.
    $76.4k-143.9k yearly Auto-Apply 60d+ ago
  • Site Reliability Engineer - Capital Markets

    Jefferies Financial Group Inc. 4.8company rating

    New York, NY jobs

    Jefferies is seeking for Site Reliability Engineer to play an instrumental role in supporting Equity Front office trading application, risk and middle office real time products, developed and used for Equity Cash and ETS application. As part of the wider platform engineering team, you will be working closely with the Business users interactively throughout the day, along with technical, analysis and testing colleagues. Investigation and resolution of the work items at hand will require competent technical skills and a keen intellect. The business is a growth area, with current investments taking place in all the technology, business and middle office areas. Responsibilities: * Front Line Site Reliable Engineering and Support functions for Equity trading systems used by Jefferies clients as well as internal users. * Build monitoring tools for application and infrastructure components. * Implement and manage scalable infrastructure using cloud-native technologies and tools. * Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding. * Partner with business, development and infrastructure teams to improve services through rigorous testing and release procedures. * Develop and maintain CI/CD pipelines to streamline deployment processes. * Expedient deployment of new systems. Capacity planning, Platform Management, and support for increasing volumes and business growth. * Create sustainable systems and services through automation. * Collaborate with Application team to establish and enforce production and development standards. * Document procedures, best practices and troubleshooting FAQs. * Resolve complex application and technical problems. * Debugging the system and fixing the production related issues. * Escalate / follow-up on permanent fix for development related issues. * Lead incident response efforts and post-mortem analysis to prevent future occurrences. * Handles complex operational tasks and recommends process and technology changes. * Global support and includes weekend availability to troubleshoot production related issues and perform checkouts. * Ability to work both independently and in groups in an energetic, diverse environment. * Participate in on-call rotations to ensure 24/7 system availability and support. * Support compliance and legal queries. Qualifications: * Strong experience in Windows and Linux/Unix services. * Strong experience in scripting language like Power shell, Python and SQL. * Strong Knowledge of monitoring tools - Nagios, Splunk, OTEL, Datadog * Strong Knowledge of FIX protocol * Strong Domain skills - Must have working experience in Capital Markets across modules and instruments especially - CASH, ETS, Bonds, Options, Futures, Swaps products * Experience in BFSI (Banking and Financial Industry) Domain applications with a proper understanding of the Trade Lifecycle. * Excellent communication, time management and project management skills. Primary Location Full Time Salary Range of $175,000 - $200,000
    $175k-200k yearly Auto-Apply 3d ago
  • Site Reliability Engineer

    The Voleon Group 4.1company rating

    Berkeley, CA jobs

    Voleon is a technology company that applies state-of-the-art AI and machine learning techniques to real-world problems in finance. For nearly two decades, we have led our industry and worked at the frontier of applying AI/ML to investment management. We have become a multibillion-dollar asset manager, and we have ambitious goals for the future. Your colleagues will include internationally recognized experts in artificial intelligence and machine learning research as well as highly experienced finance and technology professionals. The people who shape our company come from other backgrounds, including concert music performances, humanitarian aid, opera singing, sports writing, and BMX racing. You will be part of a team that loves to succeed together. In addition to our enriching and collegial working environment, we offer highly competitive compensation and benefits packages, technology talks by our experts, a beautiful modern office, daily catered lunches, and more. As a Site Reliability Engineer (SRE), you will work at the intersection of production operations and software development as you improve, manage, and monitor production-critical infrastructure and data pipelines. At Voleon, many SREs serve together on a Production Operations team tasked with improving shared production infrastructure. Others are embedded with teams of software engineers to improve specific production systems owned by those teams. Voleon SREs work on important real-world problems and collaborate with passionate and talented colleagues in an empowering, results-driven environment. This role is a way to make a real difference: your contributions will make our critical systems more reliable, lower operational risk, and increase the efficiency of our engineering effort.Responsibilities Improve fault-tolerance and maintainability of code in proprietary data pipelines and trading systems Diagnose and fix bugs in code Lead complex deployments Automate manual workflows Track and prioritize outstanding production-related issues Share an on-call rotation responding to incidents to ensure the continuous operation of production-critical systems Requirements Experience with coding and debugging Python Experience with Linux Familiarity with Relational Databases & SQL Sharp analytical and problem-solving skills and a persistent drive to make things work (better) Strong growth mindset and a passion for learning Strong technical communication skills Attention to detail 2 years of relevant industry experience An undergraduate degree or comparable training in a quantitative field or equivalent, relevant industry experience Preferred Qualifications Familiarity with best practices concerning code maintainability, documentation, quality assurance, continuous integration and deployment Experience supporting production systems Experience with any of the following: gRPC microservices, Postgres, Pandas, Golang, R, Git, Jenkins, Bazel, Prometheus, Grafana, Airflow, Kubernetes The base salary for this position is $115,000 to $135,000 in the location(s) of this posting. Individual salaries are determined through a variety of factors, including, but not limited to, education, experience, knowledge, skills, and geography. Base salary does not include other forms of total compensation such as bonus compensation and other benefits. Our benefits package includes medical, dental and vision coverage, life and AD&D insurance, 20 days of paid time off, 9 sick days, and a 401(k) plan with a company match. “Friends of Voleon” Candidate Referral ProgramIf you have a great candidate in mind for this role and would like to have the potential to earn $7,500 - $15,000 if your referred candidate is successfully hired and employed by The Voleon Group, please use this form to submit your referral. For more details regarding eligibility, terms and conditions please make sure to review the Voleon Referral Bonus Program. Equal Opportunity EmployerThe Voleon Group is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.
    $115k-135k yearly Auto-Apply 3d ago
  • Network Reliability Engineer III

    CME Group 4.4company rating

    Chicago, IL jobs

    As we embark on a journey to transform the Network Services Group in CME, we are seeking a Network Reliability Engineer III to join our dynamic team. In this role, you will design, develop and maintain self-service tools and applications that enhance productivity and reduce operational costs. You will work across the full stack-both front-end and back-end-to architect microservices (GKE) in Google Cloud Platform (GCP), driving our infrastructure towards greater automation and reliability. We are a global team across US, UK, India and Singapore made up of a diverse range of people from varied backgrounds who each bring unique network experiences and skill sets. The relatively new Network Reliability/Automation team are responsible for building a suite of custom automation tools and developing our self-healing capabilities while working closely with other members of the Network Services team in project delivery to ensure one of the largest Exchange network infrastructures in the world is highly available, resilient, secure and reliable. Responsibilities * Design, develop and maintain self-service and automation tools to streamline IT operations and reduce manual effort. * Engage in full-stack development, delivering responsive front-end interfaces as well as robust scalable back-end services. * With support Architect, deploy and scale microservices on GCP, with particular emphasis on containers and Google Kubernetes Engine (GKE). * Manage cloud infrastructure via Infrastructure-as-Code (IaC), primarily using Terraform to provision and maintain resources. * Operate and troubleshoot solutions on Linux-based platforms, leveraging Visual Studio Code (VSCode) as the primary development environment. * Adhere to software engineering best practices, including PEP8 coding standards, SOLID design principles, and established SDLC processes. * Implement and manage CI/CD pipelines with a DevOps mindset, ensuring rapid, reliable delivery of code. * Develop and consume Flask-based RESTful APIs to support network and security automation. * Collaborate within an Agile Scrum framework, utilizing tools such as Bitbucket and Jira to track progress and manage sprints. * Apply strong analytical and problem-solving skills to balance multiple project variables and deliver high-quality solutions on schedule. What we are looking for * Approximately 2-3 years' hands-on Python programming experience, with a demonstrable track record of automation or tooling projects. * Knowledge and experience working with both Python Django and Flask in a corporate environment. * Any experience in network and security automation, coupled with understanding of network fundamentals (routing, switching, firewalls, VPNs) would be beneficial. * Experience developing REST APIs using Flask (or a comparable Python framework). * Applicants with front-end experience using Javascript/JQuery/HTML5/CSS would be ideal. * Familiarity with Infrastructure-as-Code using Terraform (or similar) to manage cloud resources. * Comfortable working in Linux environments and proficient in using Visual Studio Code (VSCode). * Strong software engineering mindset: adherence to PEP8, SOLID principles, and best practices for SDLC, CI/CD and DevOps. * Excellent communication skills, both verbal and written, with the ability to convey technical concepts to diverse stakeholders. * Highly analytical, with the ability to troubleshoot complex issues and manage multiple tasks concurrently. * Experience working in Agile Scrum teams, utilizing Bitbucket and Jira (or equivalent tools) for version control and project tracking. Personal Attributes * Proactive and positive attitude, taking initiative to identify and resolve issues ahead of time. * Collaborative team player, eager to contribute knowledge and assist colleagues. * Innovative thinker who brings fresh ideas and constructive suggestions for continuous improvement. Education Bachelor's Degree in Computer Science, Engineering or a related field is preferred. Equivalent practical experience will also be considered. #LI - Hybrid #LI - JK1 CME Group is committed to offering a competitive total rewards package for our employees that recognizes their contributions to the business and reflects our long-term investment in their future. The pay range for this role is $100,700-$167,800. Actual salary offered will be dependent on a wide array of factors including but not limited to: relevant experience, skills, education and comparison to internal employees (where relevant). Our compensation program also includes an annual target bonus opportunity for all employees, as well as the opportunity to become an owner in the company through our broad-based equity program. Through our benefits program, we strive to offer flexibility, value and choice. From comprehensive health coverage, to a retirement package that includes both a 401(k) and an active pension plan, to highly competitive education reimbursement provisions, paid time off and a mental health benefit, CME Group offers a holistic benefits package for our team and their dependents. CME Group: Where Futures are Made CME Group is the world's leading derivatives marketplace. But who we are goes deeper than that. Here, you can impact markets worldwide. Transform industries. And build a career by shaping tomorrow. We invest in your success and you own it - all while working alongside a team of leading experts who inspire you in ways big and small. Problem solvers, difference makers, trailblazers. Those are our people. And we're looking for more. At CME Group, we embrace our employees' unique experiences and skills to ensure that everyone's perspectives are acknowledged and valued. As an equal-opportunity employer, we consider all potential employees without regard to any protected characteristic. Important Notice: Recruitment fraud is on the rise, with scammers using misleading promises of job offers and interviews to solicit money and personal information from job seekers. CME Group adheres to established procedures designed to maintain trust, confidence and security throughout our recruitment process. Learn more here.
    $100.7k-167.8k yearly 37d ago
  • Principal Site Reliability Engineer

    Jpmorganchase 4.8company rating

    Palo Alto, CA jobs

    Join a globally recognized financial organization and advance your profession to new heights by contributing to revolutionary projects. You've discovered the perfect environment to have a major impact. As a Principal Site Reliability Engineer at JPMorgan Chase within the Enterprise Technology, AI/ML & Data Platforms division, you will utilize your expertise to create innovative solutions that improve critical incident management and streamline the software development lifecycle throughout the organization. Your role will involve overseeing, designing, and deploying infrastructure components to enhance reliability and ensure operational efficiency. Job responsibilities Architect and implement observability platforms and tools for proactive detection and continuous improvement. Lead the design and development of core observability services, including metrics pipelines and log aggregation. Leverage modern technologies such as Open Telemetry and AI/ML for anomaly detection and automated insights. Collaborate with engineering and SRE teams to define service-level objectives (SLOs) and error budgets. Provide technical leadership and mentorship to engineering teams, ensuring best practices in system design. Champion observability as a first-class concern in the software development lifecycle. Influence platform strategy and roadmap through deep technical insight and alignment with business priorities. Write advanced documentation and create executive presentations that translate technical issues into business impact. Participate in industry professional forums and monitor relevant industry technologies and standards. Lead medium to large projects by bringing together the proper perspective and integrating feedback from team members. Participate in support responsibilities for coverage of critical applications. Required qualifications, capabilities, and skills Formal training or certification on site reliability engineering concepts and 10+ years applied experience. Ability to determine how each system relates to each other and build automation to improve reliability. Experience with translating research, analysis, and tests into business recommendations. Ability to balance and be accountable for the work of multiple architects and designers. Understands and leads partnerships across job functions to develop efficient systems. Engages team members and expresses complex ideas with appropriate level of detail, while providing constructive feedback. Self-motivated and able to work well under pressure with minimal supervision. Ability to tackle a problem by using a logical, systematic, sequential approach. Preferred qualifications, capabilities, and skills Experience with cloud-native instrumentation and streaming data platforms. Influence technology and policy decisions while fostering commitment and confidence in team members. Develop effective solutions and analyze competitive positions by considering market trends. Support the introduction of innovative methods and communicate clearly to persuade audiences. Demonstrate concern and meet the needs of both internal and external customers. #LI-RB3
    $140k-177k yearly est. Auto-Apply 44d ago
  • Reliability Engineer II

    Mastercard 4.7company rating

    OFallon, MO jobs

    Our Purpose Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential. Title and Summary Reliability Engineer IIThe BizOps team is looking for a Site Reliability Engineer who can help us solve problems, build our CI/CD pipeline and lead Mastercard in DevOps automation and best practices. • Are you a born problem solver who loves to figure out how something works? • Are you a CI/CD geek who loves all things automation? • Do you have a low tolerance for manual work and look to automate everything you can? Business Operations is leading the DevOps transformation at Mastercard through our tooling and by being an advocate for change & standards throughout the development, quality, release, and product organizations. We need team members with an appetite for change and pushing the boundaries of what can be done with automation. Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must. Role The role of business operations is to be the production readiness steward for the platform. This is accomplished by closely partnering with developers to design, build, implement, and support technology services. A business operations engineer will ensure operational criteria like system availability, capacity, performance, monitoring, self-healing, and deployment automation are implemented throughout the delivery process. Business Operations plays a key role in leading the DevOps transformation at Mastercard through our tooling and by being an advocate for change and standards throughout the development, quality, release, and product organizations. We accomplish this transformation through supporting daily operations with a hyper focus on triage and then root cause by understanding the business impact of our products. The goal of every biz ops team is to shift left to be more proactive and upfront in the development process, and to proactively manage production and change activities to maximize customer experience, and increase the overall value of supported applications. Biz Ops teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments. A biz ops focus is also on streamlining and standardizing traditional application specific support activities and centralizing points of interaction for both internal and external partners by communicating effectively with all key stakeholders. Ultimately, the role of biz ops is to align Product and Customer Focused priorities with Operational needs. We regularly review our run state not only from an internal perspective, but also understanding and providing the feedback loop to our development partners on how we can improve the customer experience of our applications. All About You • Engage in and improve the whole lifecycle of services-from inception and design, through deployment, operation and refinement. • Analyze ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns • Support services before they go live through activities such as system design consulting, capacity planning and launch reviews. • Maintain services once they are live by measuring and monitoring availability, latency and overall system health. • Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity. • Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead Mastercard in DevOps automation and best practices. • Practice sustainable incident response and blameless postmortems. • Take a holistic approach to problem solving, by connecting the dots during a production event thru the various technology stack that makes up the platform, to optimize mean time to recover • Work with a global team spread across tech hubs in multiple geographies and time zones • Share knowledge and mentor junior resources Qualifications • BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience. • Experience with algorithms, data structures, scripting, pipeline management, and software design. • Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive. • Ability to help debug and optimize code and automate routine tasks. • We support many different stakeholders. Experience in dealing with difficult situations and making decisions with a sense of urgency is needed. • Experience in one or more of the following is preferred: C, C++, Java, Python, Go, Perl or Ruby. • Interest in designing, analyzing and troubleshooting large-scale distributed systems. • We need team members with an appetite for change and pushing the boundaries of what can be done with automation. Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must. • Experience in industry standard CI/CD tools like Git/BitBucket, Jenkins, Maven, Artifactory, and Chef. Experience designing and implementing an effective and efficient CI/CD flow that gets code from dev to prod with high quality and minimal manual effort is desired.Mastercard is a merit-based, inclusive, equal opportunity employer that considers applicants without regard to gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law. We hire the most qualified candidate for the role. In the US or Canada, if you require accommodations or assistance to complete the online application process or during the recruitment process, please contact reasonable_accommodation@mastercard.com and identify the type of accommodation or assistance you are requesting. Do not include any medical or health information in this email. The Reasonable Accommodations team will respond to your email promptly. Corporate Security Responsibility All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must: Abide by Mastercard's security policies and practices; Ensure the confidentiality and integrity of the information being accessed; Report any suspected information security violation or breach, and Complete all periodic mandatory security trainings in accordance with Mastercard's guidelines. In line with Mastercard's total compensation philosophy and assuming that the job will be performed in the US, the successful candidate will be offered a competitive base salary based on location, experience and other qualifications for the role and may be eligible for an annual bonus or commissions depending on the role. Mastercard benefits for full time (and certain part time) employees generally include: insurance (including medical, prescription drug, dental, vision, disability, life insurance), flexible spending account and health savings account, paid leaves (including 16 weeks new parent leave, up to 20 paid days bereavement leave), 10 annual paid sick days, 10 or more annual paid vacation days based on level, 5 personal days, 10 annual paid U.S. observed holidays, 401k with a best-in-class company match, deferred compensation for eligible roles, fitness reimbursement or on-site fitness facilities, eligibility for tuition reimbursement, gender-inclusive benefits and many more. Pay Ranges O'Fallon, Missouri: $75,000 - $125,000 USD
    $75k-125k yearly Auto-Apply 60d+ ago
  • Java Site Reliability Engineer, Messaging Platforms

    Pacific Investment Management Co 4.9company rating

    Austin, TX jobs

    We are a leading global asset management firm with over 3,000 employees across 20 offices in 15 countries; we help millions of investors around the world pursue their financial goals. We hire critical thinkers. People who thrive in a collaborative culture like ours where we solve real problems while building the future of finance. You Are excited to be part of a vibrant engineering community that values diversity, hard work, and continuous learning. Love solving complex real-world business problems. Recognize that cross-functional collaboration is a core component of success for the team. Believe there are multiple ways to solve most technical problems and are willing to debate the trade-offs. Have become a stronger engineer by making mistakes and learning from them. Are a doer, someone who wants to grow their career and gain experience across technologies and business functions. We Continuously invest in a high-performance and inclusive culture, in which a diversity of backgrounds, experiences and viewpoints are celebrated and valued. Encourage career mobility, so you can benefit from learning different functions and technologies, and we gain the benefits of your experience across teams. Run technology pro bono programs that help the non-profit community and give our engineering community opportunities to volunteer and participate. Offer education reimbursements and ongoing training in technology, communication, and diversity & inclusion. Embrace knowledge sharing through lunch-and-learns, demos, and technical forums. Consider our people to be our greatest asset-we will help you learn what PIMCO Technology has to offer so you can participate in activities that benefit your career while delivering impactful technology solutions. As a Java SRE in Trading Technology, you will: As our immediate need Help support the messaging platforms in use (MQ, AMPS, Kafka, etc.). driving the firm's best use of these platforms, making sure all choice make sense, the correct tools issued for the solving each job, and that we build a sustainable messaging strategy. Improve the operational efficiency and reduce the operational risk of our messaging platforms through better tools, better design, and better monitoring. In the future there will be new architectural or coding problems that we will need an experienced engineer to help solve. Work closely with the business and other teams to design and implement solutions that have immediate impact to the business and help us build towards our strategic vision across all our trade floor applications. We need someone proficient in Java, passionate about SRE practices, and able to collaborate effectively with an infrastructure team. We expect you to have a strong passion for messaging systems, including their proper setup, monitoring, and maintenance. At the same time, this role involves software development for target platforms once the immediate needs related to messaging platforms are resolved. You will work with a team consisting of 1 SRE and 1 Unix SA, with full support from the infrastructure and DevOps teams. Position Requirements Bachelor's degree in computer science or equivalent Strong Linux skills (including chef, puppet, ansible configuration tools) Strong experience with different messaging systems (Kafka, AMPS, MQ, FIX, etc.). Strong engineering culture (unit tests, CI/CD) Ability to work independently and in teams Good communication skills Working from the office in Austin 4 days a week. PIMCO follows a total compensation approach when rewarding employees which includes a base salary and a discretionary bonus. Base salary is the fixed component of compensation that is determined by core job responsibilities, relevant experience, internal level, and market factors. The discretionary bonus is used to award performance and therefore is determined by company, business, team, and individual performance. Salary Range: $ 175,000.00 - $ 240,000.00 Equal Employment Opportunity and Affirmative Action Statement PIMCO recruits and hires qualified candidates without regard to race, national origin, ancestry, religion (including religious dress and grooming practices), sex (including pregnancy, childbirth, breastfeeding, or related medical conditions), sexual orientation, gender (including gender identity and expression), age, military or veteran status, disability (physical or mental), any factor prohibited by law, and as such affirms in policy and practice to support and promote the concept of equal employment opportunity and affirmative action, in accordance with all applicable federal, state, provincial and municipal laws. The company also prohibits discrimination on other basis such as medical condition, or marital status under applicable laws. Applicants with Disabilities PIMCO is an Equal Employment Opportunity/Affirmative Action employer. We provide reasonable accommodation for qualified individuals with disabilities, including veterans, in job application procedures. If you have any difficulty using our online system due to a disability and you would like to request an accommodation, you may contact us at ************ and leave a message. This is a dedicated line designed exclusively to assist job seekers with disabilities to apply online. Only messages left for this purpose will be considered. A response to your request may take up to two business days.
    $175k-240k yearly Auto-Apply 60d+ ago
  • Hardware Reliability Technician

    Figure 4.5company rating

    San Jose, CA jobs

    Figure is an AI robotics company developing autonomous general-purpose humanoid robots. The goal of the company is to ship humanoid robots with human level intelligence. Its robots are engineered to perform a variety of tasks in the home and commercial markets. Figure is headquartered in San Jose, CA. We are looking for a Hardware Reliability Technician to build test setups and execute reliability tests in humanoid robot development from prototype through production. Responsibilities: Set up and perform reliability tests at component, module and final product levels. Build and assemble fixtures and functional check stations for reliability tests. Analyze test data to evaluate product performance against technical requirements/specifications. Maintain and debug test equipment to ensure equipment uptime. Assist failure analysis actions such as visual inspection, function check, teardown analysis, electrical debug, and material analysis. Coordinate with external testing and FA partners for outsourced tasks. Manage multiple work streams and test equipment in parallel, assist test planning and scheduling, ensuring timely completion of tests and delivery of results. Document reliability test procedure, test observation, analyze test data, report to reliability engineers and crossfunctional teams. Requirements: Minimum 5 year hands-on experience in hardware reliability testing. Strong experience with operation and maintenance of reliability test equipment, such as ED shakers, temperature humidity chambers, shock/drop towers, IP (ingress protection) testers, salt mist/spray testers, Instron machines. Experience with measurement and data aquisition equipment, such as accelerometers, strain gauges, thermocouples, digital multimeters, source meters, oscilloscopes. Ability to build and dis-assemble test setups, mechanical assemblies with little to no documentation or engineering support, familiar with torque tools. Ability to use basic hand tools and perform electrical hardware tasks including wiring, soldering, and assembling custom harnesses. Ability to interpret engineering drawings, CAD models, PCBA and harness schematics to set up tests and diagnose issues. Ability to read test code, interpret and diagnose test results. Familiarity with computer based tools such as but not limited to CMD prompt, Google Suite, Confluence, Jira, and in house development GUI's. Ability to manage multiple test flows and equipment in parallel with attention to details. Ability to learn and execute quickly in a dynamic and fast-paced environment, results-driven. Bonus Qualifications: Hardware reliability failure modes and mechanisms knowledge. CATIA 3D design tool experience. Failure analysis experience, such as use of microscopes, X-ray/CT. PCB Rework experience. Programming experience, e.g. Python, Java, C++. Experience with 3D printing, machining, or other in-house fabrication Experience of delivering hardware products to the market. Robotics experience. The US base salary range for this full-time position is between $45-$65 an hour. The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.
    $45-65 hourly Auto-Apply 60d+ ago
  • Site Reliability Engineer II-1

    Mastercard 4.7company rating

    Bogota, NJ jobs

    Our Purpose Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential. Title and Summary Site Reliability Engineer II-1 Overview The GBSC EPMS team is looking for a Site Reliability Engineer who can help us solve problems, implement automation, and leverage best practices. * Are you a born problem solver who loves to figure out how something works? * Are you a detail -oriented individual who enjoys complex problem solving? * Do you love determining the correct actions required to fix a problem? * Do you have a low tolerance for manual work and look to automate everything you can? Business Operations is leading the Site Reliability Engineering (SRE) transformation at Mastercard through our tooling and by being an advocate for change & standards throughout the development, quality, release, and product organizations. We need team members with an appetite for change and pushing the boundaries of what can be done with automation. Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must. Responsibilities * Engage in and improve the whole lifecycle of services-from inception and design, through deployment, operation and refinement. * Analyze ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns * Support services before they go live through activities such as system design consulting, capacity planning and launch reviews. * Maintain services once they are live by measuring and monitoring availability, latency and overall system health. * Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity. * Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead Mastercard in DevOps automation and best practices. * Practice sustainable incident response and blameless postmortems. * Take a holistic approach to problem solving, by connecting the dots during a production event thru the various technology stack that makes up the platform, to optimize mean time to recover * Work with a global team spread across tech hubs in multiple geographies and time zones * Share knowledge and mentor junior resources All About You * BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience. * Experience with algorithms, data structures, scripting, pipeline management, software design and OLAP systems. * Hands on experience with understanding custom objects using JavaScript, HTML5, CSS and API integrations. * Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive. * Ability to help debug and optimize code and automate routine tasks. * We support many different stakeholders. Experience in dealing with difficult situations and making decisions with a sense of urgency is needed. * Experience in one or more of the following is preferred: C, C++, Java, Python, Go, Perl, Ruby, MDX. * Interest in designing, analyzing and troubleshooting large-scale distributed systems. * We need team members with an appetite for change and pushing the boundaries of what can be done with automation. Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must. Corporate Security Responsibility All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must: * Abide by Mastercard's security policies and practices; * Ensure the confidentiality and integrity of the information being accessed; * Report any suspected information security violation or breach, and * Complete all periodic mandatory security trainings in accordance with Mastercard's guidelines.
    $88k-119k yearly est. Auto-Apply 9d ago

Learn more about The Hartford jobs

View all jobs