Data Reliability Engineer II
Ridgefield, NJ jobs
Zeta is a Next-Gen Banking Tech company that empowers banks and fintechs to launch banking products for the future. It was founded by Bhavin Turakhia and Ramki Gaddipati in 2015.Our flagship processing platform - Zeta Tachyon - is the industry's first modern, cloud-native, and fully API-enabled stack that brings together issuance, processing, lending, core banking, fraud & risk, and many more capabilities as a single-vendor stack. 20M+ cards have been issued on our platform globally.Zeta is actively working with the largest Banks and Fintechs in multiple global markets transforming customer experience for multi-million card portfolios.
Zeta has over 1700+ employees - with over 70% roles in R&D - across locations in the US, EMEA, and Asia. We raised $340 million at a $2 billion valuation from Softbank, Mastercard, and other investors in 2021.Learn more @ ************** careers.zeta.tech, Linkedin, TwitterResponsibilities
Proactively monitor PostgreSQL RDS instances for performance, availability, and resource utilization (CPU, memory, storage, connections) using established monitoring tools (e.g., CloudWatch, Prometheus).
Assist in identifying performance bottlenecks in PostgreSQL RDS. Apply basic performance tuning techniques like reviewing query execution plans, adding missing indexes, and recommending parameter adjustments.
Monitor the health and performance of Debezium and Kafka Connect connectors, identifying and troubleshooting basic issues related to data capture and delivery.
Monitor Apache Nifi data flows for errors, backpressure, and performance issues. Assist in troubleshooting and resolving common Nifi flow failures.
Provide support for data related issues and participate in root cause analysis.
Monitor the execution of Apache Airflow DAGs, identify failed tasks, and troubleshooting and re-runs.
Develop and maintain automation scripts and infrastructure as code (IAC) templates (e.g., using Crossplane, Terraform) to automate routine database tasks, deployments, and updates.
Participate in on-call rotations to respond to database-related incidents and perform troubleshooting and root cause analysis.
Assist in implementing and maintaining security best practices for cloud databases, including access controls, encryption, and compliance with regulatory requirements.
Regularly audit and assess database security configurations.
Configure and manage database backup and recovery strategies to ensure data integrity and availability in case of failures or data loss.
Analyse database query performance and collaborate with developers to optimize SQL queries and schemas.
Participate in continuous improvement initiatives to enhance the reliability, scalability, and performance of cloud databases.
Assist in the design and optimization of database schemas for cloud environments.
Skills
Familiarity with data pipeline concepts and technologies like Debezium, Kafka Connect, Apache Nifi.
Basic understanding of Amazon Redshift and S3.
Exposure to Apache Spark for data processing.
Basic understanding of Apache Airflow for workflow orchestration.
Strong SQL scripting skills for querying and basic data manipulation.
Familiarity with scripting languages (e.g., Python, Bash) is a plus.
Knowledge of database security best practices, including access controls, encryption, and compliance with regulatory requirements (e.g., GDPR, HIPAA).
Having ‘AWS Certified Database - Specialty' certification is a plus
Experience and Qualifications
Bachelor's degree in Computer Science, Information Technology, or a related field.
3-5 years of experience in database administration, with a focus on PostgreSQL.
1-2 years of hands-on experience with PostgreSQL RDS.
Equal Opportunity
Zeta is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We encourage applicants from all backgrounds, cultures, and communities to apply and believe that a diverse workforce is key to our success
Auto-ApplyData Reliability Engineer II
Ridgefield, NJ jobs
Zeta is a Next-Gen Banking Tech company that empowers banks and fintechs to launch banking products for the future. It was founded by Bhavin Turakhia and Ramki Gaddipati in 2015.Our flagship processing platform - Zeta Tachyon - is the industry's first modern, cloud-native, and fully API-enabled stack that brings together issuance, processing, lending, core banking, fraud & risk, and many more capabilities as a single-vendor stack. 20M+ cards have been issued on our platform globally.Zeta is actively working with the largest Banks and Fintechs in multiple global markets transforming customer experience for multi-million card portfolios.
Zeta has over 1700+ employees - with over 70% roles in R&D - across locations in the US, EMEA, and Asia. We raised $340 million at a $2 billion valuation from Softbank, Mastercard, and other investors in 2021.Learn more @ ************** careers.zeta.tech, Linkedin, TwitterResponsibilities
Proactively monitor PostgreSQL RDS instances for performance, availability, and resource utilization (CPU, memory, storage, connections) using established monitoring tools (e.g., CloudWatch, Prometheus).
Assist in identifying performance bottlenecks in PostgreSQL RDS. Apply basic performance tuning techniques like reviewing query execution plans, adding missing indexes, and recommending parameter adjustments.
Monitor the health and performance of Debezium and Kafka Connect connectors, identifying and troubleshooting basic issues related to data capture and delivery.
Monitor Apache Nifi data flows for errors, backpressure, and performance issues. Assist in troubleshooting and resolving common Nifi flow failures.
Provide support for data related issues and participate in root cause analysis.
Monitor the execution of Apache Airflow DAGs, identify failed tasks, and troubleshooting and re-runs.
Develop and maintain automation scripts and infrastructure as code (IAC) templates (e.g., using Crossplane, Terraform) to automate routine database tasks, deployments, and updates.
Participate in on-call rotations to respond to database-related incidents and perform troubleshooting and root cause analysis.
Assist in implementing and maintaining security best practices for cloud databases, including access controls, encryption, and compliance with regulatory requirements.
Regularly audit and assess database security configurations.
Configure and manage database backup and recovery strategies to ensure data integrity and availability in case of failures or data loss.
Analyse database query performance and collaborate with developers to optimize SQL queries and schemas.
Participate in continuous improvement initiatives to enhance the reliability, scalability, and performance of cloud databases.
Assist in the design and optimization of database schemas for cloud environments.
Skills
Familiarity with data pipeline concepts and technologies like Debezium, Kafka Connect, Apache Nifi.
Basic understanding of Amazon Redshift and S3.
Exposure to Apache Spark for data processing.
Basic understanding of Apache Airflow for workflow orchestration.
Strong SQL scripting skills for querying and basic data manipulation.
Familiarity with scripting languages (e.g., Python, Bash) is a plus.
Knowledge of database security best practices, including access controls, encryption, and compliance with regulatory requirements (e.g., GDPR, HIPAA).
Having ‘AWS Certified Database - Specialty' certification is a plus
Experience and Qualifications
Bachelor's degree in Computer Science, Information Technology, or a related field.
3-5 years of experience in database administration, with a focus on PostgreSQL.
1-2 years of hands-on experience with PostgreSQL RDS.
Equal Opportunity
Zeta is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. We encourage applicants from all backgrounds, cultures, and communities to apply and believe that a diverse workforce is key to our success
Site Reliability Engineer - Core Platform Services
Edison, NJ jobs
Company Profile:
At Morgan Stanley, we advise, originate, trade, manage and distribute capital for governments, institutions and individuals, and always do so with a standard of excellence. We are a leading global financial services firm that conducts its business through three principal business segments-Institutional Securities, Wealth Management (WM), and Investment Management. The Firm's employees serve clients worldwide from more than 1,200 offices in 43 countries.
Our WM business is one of the largest in the world with more than $2 trillion in client assets, $73 billion in lending balances, and nearly 16,000 Financial Advisors in 600+ offices across the U.S. Our Financial Advisors focus on delivering timely, customized solutions and services that help clients meet their financial and life goals. Our offering includes brokerage and investment advisory services, financial and wealth planning, access to credit and lending, cash management, annuities and insurance, and retirement services.
As a market leader, the talent and passion of our people is critical to our success. Together, we share a common set of values rooted in integrity, excellence and strong team ethic. Morgan Stanley can provide a superior foundation for building a professional career - a place for people to learn, to achieve and grow. A philosophy that balances personal lifestyles, perspectives and needs is an important part of our culture.
Department Profile:
Reliability Operations is responsible for risk mitigation, stability, driving performance, and efficiency across Wealth Management Technology. Through Production Operations, Observability Engineering, Resiliency Assessment & Validation and Reliability Engineering, we will improve and increase Wealth Management stability, reliability, resiliency, efficiency, and performance. If you are an exceptional individual who is interested in solving complex problems and building sophisticated solutions in a dynamic team environment, Reliability Operations is the place for you. The ‘Site Reliability Engineer' role is within the Core Platform Services Super Department in Wealth Management Technology.
Job Summary:
We are looking for a Site Reliability Engineer at the Associate, Director and Vice President levels. The position in the Reliability Operations team is focused on delivering exceptional services to both BU and Dev partners to minimize/avoid any production outages. The role will focus on production support, automating deployments and working with the agile teams to build and support stable and reliable production systems. The ideal candidate will be passionate about automation and skilled in one of the programming languages: Python/PERL/ SHELL, Ruby, JAVA, C# or the like. Candidate should possess a strong understanding of database concepts, job scheduler, MQ, Web services, UNIX/LINUX/Windows OS as well as experience with debugging applications. We are looking for a strong leader with excellent communications skills who is committed to continuously improving and delivering results. Candidate should be organized, disciplined, detail-oriented, self-motivated, and delivery-focused.
Responsibilities:
Maintain applications once they are live by measuring and monitoring availability, latency and overall system health with a focus on business activities and continuously evaluate cost and TOIL.
Engage in and improve the whole lifecycle of services from inception and design, through deployment, operation, capacity planning and launch reviews.
Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity; includes automation for other various operational needs.
Troubleshoot infrastructure issues, reviewing log files, updating documentation, and having knowledge base with resolutions
Work closely with the application Development team to understand the platform and create tools/utilities to help with production management
Work with upstream data providers and upstream consumers, and reducing the amount of escalation to development teams
Develop scripts and assist with code changes along with operational tasks/activities.
Work closely with Application Development to ensure that the support team has excellent knowledge of the application set, own and maintain support knowledgebase and documents.
Use analytical skills to find trends in the environment and drive out problems.
Lead effort to determine improvement areas to stabilize the plant.
Identify risks and work with a sense of urgency, working within a team or independently.
Test and tune network, hardware, and software configurations to maximize performance
Interface with different teams like IT Dev managers, Infrastructure teams and lead as a Subject Matter Expert (SME) for the application(s) supported.
Understand the overall business flow of supported application systems and its interface with clients
Take ownership and managing production requests, questions, issues and perform Root Cause Analysis for outages/incidents
Understand the overall business flow of supported application systems and its interface with clients
Be flexible to provide weekend on call rotation and available for offshore time lead
Be accountable for the Production Environments as well as the non-Production Environments for the existing GBOT team and be part of 24/7 production support coverage.
Skills Required:
10+ years of experience in a production environment with a solid software development background and understanding of performance tuning, end-to-end troubleshooting, networking fundamentals and appropriate attention to detail
Ability to focus, provide resolutions for production issues in a high demanding and pressured environment
10+ years hands-on experience in designing, developing, and implementing technical solutions, or significant experience in deep technical support
Strong experience in scripting language (Shell scripting, Python, Perl, etc.) and cloud driven development
Strong database skills with DB2, Sybase or Oracle
Hands-on experience with Autosys or other batch scheduling software
Strong experience in Continuous Integration and Continuous Deployment
Strong experience in environment on demand for both Virtual Machines and containers
Knowledge and hands-on experience on with monitoring tools like Splunk, IP Soft, Sockeye
Practical experience on Agile Methodology (e.g. Scrum)
Knowledge or experience with automating deployments using Jenkins, Train or Windeploy
Ability to diagnose technical problems, debug, optimize code, and automate routine tasks
Hands-on experience in application and database troubleshooting/issue resolution in a fast-paced environment
Excellent communication and ability to think out of the box for process improvements.
Knowledge of Cloud based deployment, security, networking concepts in Azure and AWS
Bachelor's/Master's Degree in Computer Science, Information Systems or related field
Skills Desired:
Knowledge or experience with algorithms, data structures, complexity analysis and software design
Interest in designing, analyzing and troubleshooting large-scale distributed systems
Educational Qualification:
Minimum BS degree in Computer Science, Engineering or a related field
WHAT YOU CAN EXPECT FROM MORGAN STANLEY:
We are committed to maintaining the first-class service and high standard of excellence that have defined Morgan Stanley for over 89 years. Our values - putting clients first, doing the right thing, leading with exceptional ideas, committing to diversity and inclusion, and giving back - aren't just beliefs, they guide the decisions we make every day to do what's best for our clients, communities and more than 80,000 employees in 1,200 offices across 42 countries. At Morgan Stanley, you'll find an opportunity to work alongside the best and the brightest, in an environment where you are supported and empowered. Our teams are relentless collaborators and creative thinkers, fueled by their diverse backgrounds and experiences. We are proud to support our employees and their families at every point along their work-life journey, offering some of the most attractive and comprehensive employee benefits and perks in the industry. There's also ample opportunity to move about the business for those who show passion and grit in their work.
To learn more about our offices across the globe, please copy and paste ***************************************************** into your browser.
Expected base pay rates for the role will be between $70,000 and $120,000 per year for Associate and between $95,000 and $135,000 per year for Director and between $120,000 and $170,000 for Vice President at the commencement of employment. However, base pay if hired will be determined on an individualized basis and is only part of the total compensation package, which, depending on the position, may also include commission earnings, incentive compensation, discretionary bonuses, other short and long-term incentive packages, and other Morgan Stanley sponsored benefit programs.
Morgan Stanley's goal is to build and maintain a workforce that is diverse in experience and background but uniform in reflecting our standards of integrity and excellence. Consequently, our recruiting efforts reflect our desire to attract and retain the best and brightest from all talent pools. We want to be the first choice for prospective employees.
It is the policy of the Firm to ensure equal employment opportunity without discrimination or harassment on the basis of race, color, religion, creed, age, sex, sex stereotype, gender, gender identity or expression, transgender, sexual orientation, national origin, citizenship, disability, marital and civil partnership/union status, pregnancy, veteran or military service status, genetic information, or any other characteristic protected by law.
Morgan Stanley is an equal opportunity employer committed to diversifying its workforce (M/F/Disability/Vet).
Auto-ApplySite Reliability Engineer - Capital Markets
Jersey City, NJ jobs
Jefferies is seeking for Site Reliability Engineer to play an instrumental role in supporting Equity Front office trading application, risk and middle office real time products, developed and used for Equity Cash and ETS application.
As part of the wider platform engineering team, you will be working closely with the Business users interactively throughout the day, along with technical, analysis and testing colleagues. Investigation and resolution of the work items at hand will require competent technical skills and a keen intellect. The business is a growth area, with current investments taking place in all the technology, business and middle office areas.
Responsibilities:
Front Line Site Reliable Engineering and Support functions for Equity trading systems used by Jefferies clients as well as internal users.
Build monitoring tools for application and infrastructure components.
Implement and manage scalable infrastructure using cloud-native technologies and tools.
Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding.
Partner with business, development and infrastructure teams to improve services through rigorous testing and release procedures.
Develop and maintain CI/CD pipelines to streamline deployment processes.
Expedient deployment of new systems. Capacity planning, Platform Management, and support for increasing volumes and business growth.
Create sustainable systems and services through automation.
Collaborate with Application team to establish and enforce production and development standards.
Document procedures, best practices and troubleshooting FAQs.
Resolve complex application and technical problems.
Debugging the system and fixing the production related issues.
Escalate / follow-up on permanent fix for development related issues.
Lead incident response efforts and post-mortem analysis to prevent future occurrences.
Handles complex operational tasks and recommends process and technology changes.
Global support and includes weekend availability to troubleshoot production related issues and perform checkouts.
Ability to work both independently and in groups in an energetic, diverse environment.
Participate in on-call rotations to ensure 24/7 system availability and support.
Support compliance and legal queries.
Qualifications:
Strong experience in Windows and Linux/Unix services.
Strong experience in scripting language like Power shell, Python and SQL.
Strong Knowledge of monitoring tools - Nagios, Splunk, OTEL, Datadog
Strong Knowledge of FIX protocol
Strong Domain skills - Must have working experience in Capital Markets across modules and instruments especially - CASH, ETS, Bonds, Options, Futures, Swaps products
Experience in BFSI (Banking and Financial Industry) Domain applications with a proper understanding of the Trade Lifecycle.
Excellent communication, time management and project management skills.
Primary Location Full Time Salary Range of $175,000 - $200,000
Auto-ApplyPrincipal Reliability Engineer - Information Security
Hartford, CT jobs
Principal Security Engineer - IS06BE
We're determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals - and to help others accomplish theirs, too. Join our team as we help shape the future.
The Hartford's Information Security team (THIP) is seeking an experienced and highly motivated Technology expert who will be responsible for leading the Reliability Engineering team for run processes. This expert will be accountable for building, adopting, and maturing the Reliability Engineering (RE) tools, practices, and automation of the THIP technology environment. The expert will be responsible for building, optimizing, and maintaining automation capabilities to enable infrastructure provisioning, application availability, testing, quality, application deployment, resiliency, recovery, and efficiency of IT applications. Additionally they will be the point person for all critical incidents within the THIP portfolio.
The Principal Reliability Engineer will have end-to-end accountability for the reliability of IT services for our application portfolio. They will ensure the implementation of IT Security and service hardening requirements and contribute to the long-term strategic evolution of the portfolio. They will drive the sustained advancement of the RE practice within THIP.
Key measures of success will include service reliability (such as availability, latency, quality), feature velocity and deployment quality, as well as technical debt reduction and cost efficiency.
RESPONSIBILITIES:
Responsible for managing and directing all critical incidents, inclusive of defining root cause, developing and implementing remediation plans
Responsible for building reliability engineering, automation, and quality capabilities across 20+ applications and systems
Accountable for Operations, RE, DevSecOps, Quality, and Middleware technologies.
Build tools and capabilities needed by our software engineering teams to optimize development by providing a level of advancement in technology and achievement of efficiency for application teams.
Building an engineering culture with automation across our technology stack and application footprints across traditional and modern architectures resulting in overall IT Productivity improvements.
Support enterprise needs with improvements in Performance, Scalability, Resiliency, Reliability, Stability, Observability, Security, etc.. continuously evolving and modernizing available services to improve productivity, automation, quality, and optimize operational cost.
Market research on emerging trends for technology enablers in the field. Information protection and secure development practices.
Lead transformational change management by championing the adoption of automation capabilities built and foster a culture of continuous learning and improvement mindset for the organization.
This role will have a
Hybrid work schedule,
with the expectation of working in an office (Columbus, OH, Chicago, IL, Hartford, CT or Charlotte, NC) 3 days a week (Tuesday through Thursday).
QUALIFICATIONS:
8 + years IT professional experience in financial services or insurance in a large corporation.
8+ years of having assumed leadership, engineering, application management and operations roles with a demonstrated track record of technical innovation and experience influencing technically diverse teams.
Strong track record of production support, incident management and problem solving.
Strong cloud engineering mindset with cloud experience across public cloud providers and the technologies most frequently used in engineering and managing highly reliable and automated technology environments.
Demonstrated ability to own, transform, mature, and deliver reliability engineering tools and capabilities.
Strong knowledge and experience with cloud product management, cloud engineering, and Agile principles.
Experience with Performance and Observability tools such as Dynatrace, Splunk, CloudWatch, Cloud Trail, and related tools.
Experience with continuous integration and DevOps methodologies, preferred tools such as GitHub, Rally, SonarQube etc..
Strong solution engineering orientation to enable expedient troubleshooting, issue-resolution and root-cause removal.
Proven execution/delivery running and maintaining cloud-based and on prem automation tools and services across various service delivery models.
Quality Engineer leadership experience along with Test Data Management, Test Automation including unit, functional, regression, and integration testing, and Defect Management.
Demonstrated ability to act as a strategic thought leader and be seen as a credible business partner by peers.
Highly collaborative and team oriented
Exceptional critical thinking and problem-solving skills.
Able to influence diverse teams and build strong business relationships.
Bachelor of Science in Computer Science or equivalent preferred.
Candidate must be authorized to work in the US without company sponsorship. The company will not support the STEM OPT I-983 Training Plan endorsement for this position.
Compensation
The listed annualized base pay range is primarily based on analysis of similar positions in the external market. Actual base pay could vary and may be above or below the listed range based on factors including but not limited to performance, proficiency and demonstration of competencies required for the role. The base pay is just one component of The Hartford's total compensation package for employees. Other rewards may include short-term or annual bonuses, long-term incentives, and on-the-spot recognition. The annualized base pay range for this role is:
$149,360 - $224,040
Equal Opportunity Employer/Sex/Race/Color/Veterans/Disability/Sexual Orientation/Gender Identity or Expression/Religion/Age
About Us | Our Culture | What It's Like to Work Here | Perks & Benefits
Auto-ApplySite Reliability Engineer III
Jersey City, NJ jobs
JobID: 210681440 JobSchedule: Full time JobShift: Day Base Pay/Salary: Jersey City,NJ $133,000.00-$185,000.00 There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.
As a Site Reliability Engineer III at JPMorgan Chase within the Consumer and Community banking team, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.
Job responsibilities
* Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
* Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
* Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
* Collaborates with technical experts, key stakeholders, and team members to resolve complex problems
* Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
* Supports the adoption of site reliability engineering best practices within your team
Required qualifications, capabilities, and skills
* Formal training or certification on software engineering concepts and 3+ years applied experience
* Proficient in site reliability culture and principles and familiarity with how to implement site reliability within an application or platform
* Proficient in at least one programming language such as Python, Java/Spring Boot, and .Net
* Proficient knowledge of software applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.)
* Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others
* Familiarity with container and container orchestration such as ECS, Kubernetes, and Docker
* Familiarity with troubleshooting common networking technologies and issues
* Ability to contribute to large and collaborative teams by presenting information in a logical and timely manner with compelling language and limited supervision
* Ability to proactively recognize road blocks and demonstrates interest in learning technology that facilitates innovation
* Ability to identify new technologies and relevant solutions to ensure design constraints are met by the software team
Preferred qualifications, capabilities, and skills
* Participate on call support rota for high severity issues to help with diagnosis and collect facts for RCA
* Ability to facilitate post mortem meetings for Root Cause Analysis and implement effective steps for stability improvements
* Ability to write technical documentation for lessons learnt from issues and help improve runbook steps for Mission Control teams.
* Completed AWS Solution Architect certification
Auto-ApplySite Reliability Engineer III
Jersey City, NJ jobs
JobID: 210689758 JobSchedule: Full time JobShift: Day Base Pay/Salary: Jersey City,NJ $133,000.00-$185,000.00 There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.
As a Site Reliability Engineer III at JPMorgan Chase within the [insert LOB or sub LOB], you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.
Job responsibilities
* Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
* Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
* Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability, scalability, and solutions in their applications
* Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
* Collaborates with technical experts, key stakeholders, and team members to resolve complex problems
* Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
* Supports the adoption of site reliability engineering best practices within your team
Required qualifications, capabilities, and skills
* Formal training or certification on software engineering concepts and 3+ years applied experience
* Proficient in site reliability culture and principles and familiarity with how to implement site reliability within an application or platform
* Proficient in at least one programming language such as Python, Java/Spring Boot, and .Net
* Proficient knowledge of software applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.)
* Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others
* Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform
* Familiarity with container and container orchestration such as ECS, Kubernetes, and Docker
* Familiarity with troubleshooting common networking technologies and issues
* Ability to contribute to large and collaborative teams by presenting information in a logical and timely manner with compelling language and limited supervision
Preferred qualifications, capabilities, and skills
* Ability to proactively recognize road blocks and demonstrates interest in learning technology that facilitates innovation
* Ability to identify new technologies and relevant solutions to ensure design constraints are met by the software team
* Ability to initiate and implement ideas to solve business problems
Auto-ApplySite Reliability Engineer III
Jersey City, NJ jobs
There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III at JPMorgan Chase within the Consumer and Community banking team, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.
**Job responsibilities**
+ Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
+ Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
+ Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
+ Collaborates with technical experts, key stakeholders, and team members to resolve complex problems
+ Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
+ Supports the adoption of site reliability engineering best practices within your team
**Required qualifications, capabilities, and skills**
+ Formal training or certification on software engineering concepts and 3+ years applied experience
+ Proficient in site reliability culture and principles and familiarity with how to implement site reliability within an application or platform
+ Proficient in at least one programming language such as Python, Java/Spring Boot, and .Net
+ Proficient knowledge of software applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.)
+ Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others
+ Familiarity with container and container orchestration such as ECS, Kubernetes, and Docker
+ Familiarity with troubleshooting common networking technologies and issues
+ Ability to contribute to large and collaborative teams by presenting information in a logical and timely manner with compelling language and limited supervision
+ Ability to proactively recognize road blocks and demonstrates interest in learning technology that facilitates innovation
+ Ability to identify new technologies and relevant solutions to ensure design constraints are met by the software team
**Preferred qualifications, capabilities, and skills**
+ Participate on call support rota for high severity issues to help with diagnosis and collect facts for RCA
+ Ability to facilitate post mortem meetings for Root Cause Analysis and implement effective steps for stability improvements
+ Ability to write technical documentation for lessons learnt from issues and help improve runbook steps for Mission Control teams.
+ Completed AWS Solution Architect certification
Chase is a leading financial services firm, helping nearly half of America's households and small businesses achieve their financial goals through a broad range of financial products. Our mission is to create engaged, lifelong relationships and put our customers at the heart of everything we do. We also help small businesses, nonprofits and cities grow, delivering solutions to solve all their financial needs.
We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation.
Equal Opportunity Employer/Disability/Veterans
**Base Pay/Salary**
Jersey City,NJ $133,000.00 - $185,000.00 / year
Site Reliability Engineer III- Kafka Platform Engineering
Jersey City, NJ jobs
There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III at JPMorgan Chase within the Infrastructure Platforms, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.
**Job responsibilities**
+ Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate.
+ Demonstrate deep knowledge of Kafka technology, Kafka connect framework, and distributed systems technologies, with the ability to operate in and migrate across public and private clouds.
+ Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
+ Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability, scalability, and solutions in their applications
+ Implements infrastructure, configuration, and network as code for the applications and platforms in your remit.
+ Collaborates with technical experts, key stakeholders, and team members to resolve complex problems.
+ Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers.
+ Contribute to the development of technical documentation, including service APIs using Swagger, ensuring robust logging, auditability, security, and monitoring features.
+ Supports the adoption of site reliability engineering best practices within your team.
+ Engage in periodic on-call rotation shifts, providing client support and ensuring thorough monitoring of the platform.
**Required qualifications, capabilities, and skills**
+ Formal training or certification on computer science and reliability concepts and 3+ years applied experience.
+ Proficient in site reliability culture and principles and familiarity with how to implement site reliability within an application or platform
+ Proficient in at least one programming language such as Java/Spring Boot, python.
+ Proficient knowledge of software applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.)
+ Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc.
+ Experience with public cloud platforms like AWS, GCP or Azure.
+ Experience with Kafka ecosystem products: Kafka, Kafka Connect, Kafka Streams.
+ Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform.
+ Familiarity with container and container orchestration such as ECS, Kubernetes, and Docker.
+ Familiarity with troubleshooting common networking technologies and issues.
+ Ability to contribute to large and collaborative teams by presenting information in a logical and timely manner with compelling language and limited supervision
+ Ability to proactively recognize road blocks and demonstrates interest in learning technology that facilitates innovation
+ Ability to identify new technologies and relevant solutions to ensure design constraints are met by the software team
+ Ability to initiate and implement ideas to solve business problems.
**Preferred qualifications, capabilities, and skills**
+ Familiarity with running Apache Flink.
+ Understanding of authentication and authorization technologies (e.g., OAUTH, Kerberos).
+ Experience with AWS cloud services and Kubernetes platform orchestration.
JPMorganChase, one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world's most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management.
We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation.
JPMorgan Chase & Co. is an Equal Opportunity Employer, including Disability/Veterans
**Base Pay/Salary**
Jersey City,NJ $133,000.00 - $185,000.00 / year
Site Reliability Engineer III
Jersey City, NJ jobs
There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III at JPMorgan Chase within the (insert LOB or sub LOB), you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.
**Job responsibilities**
+ Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
+ Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
+ Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability, scalability, and solutions in their applications
+ Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
+ Collaborates with technical experts, key stakeholders, and team members to resolve complex problems
+ Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
+ Supports the adoption of site reliability engineering best practices within your team
**Required qualifications, capabilities, and skills**
+ Formal training or certification on software engineering concepts and 3+ years applied experience
+ Proficient in site reliability culture and principles and familiarity with how to implement site reliability within an application or platform
+ Proficient in at least one programming language such as Python, Java/Spring Boot, and .Net
+ Proficient knowledge of software applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.)
+ Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others
+ Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform
+ Familiarity with container and container orchestration such as ECS, Kubernetes, and Docker
+ Familiarity with troubleshooting common networking technologies and issues
+ Ability to contribute to large and collaborative teams by presenting information in a logical and timely manner with compelling language and limited supervision
**Preferred qualifications, capabilities, and skills**
+ Ability to proactively recognize road blocks and demonstrates interest in learning technology that facilitates innovation
+ Ability to identify new technologies and relevant solutions to ensure design constraints are met by the software team
+ Ability to initiate and implement ideas to solve business problems
JPMorganChase, one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world's most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management.
We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation.
JPMorgan Chase & Co. is an Equal Opportunity Employer, including Disability/Veterans
**Base Pay/Salary**
Jersey City,NJ $133,000.00 - $185,000.00 / year
Network Reliability Engineer III
Chicago, IL jobs
As we embark on a journey to transform the Network Services Group in CME, we are seeking a Network Reliability Engineer III to join our dynamic team. In this role, you will design, develop and maintain self-service tools and applications that enhance productivity and reduce operational costs. You will work across the full stack-both front-end and back-end-to architect microservices (GKE) in Google Cloud Platform (GCP), driving our infrastructure towards greater automation and reliability.
We are a global team across US, UK, India and Singapore made up of a diverse range of people from varied backgrounds who each bring unique network experiences and skill sets. The relatively new Network Reliability/Automation team are responsible for building a suite of custom automation tools and developing our self-healing capabilities while working closely with other members of the Network Services team in project delivery to ensure one of the largest Exchange network infrastructures in the world is highly available, resilient, secure and reliable.
Responsibilities
* Design, develop and maintain self-service and automation tools to streamline IT operations and reduce manual effort.
* Engage in full-stack development, delivering responsive front-end interfaces as well as robust scalable back-end services.
* With support Architect, deploy and scale microservices on GCP, with particular emphasis on containers and Google Kubernetes Engine (GKE).
* Manage cloud infrastructure via Infrastructure-as-Code (IaC), primarily using Terraform to provision and maintain resources.
* Operate and troubleshoot solutions on Linux-based platforms, leveraging Visual Studio Code (VSCode) as the primary development environment.
* Adhere to software engineering best practices, including PEP8 coding standards, SOLID design principles, and established SDLC processes.
* Implement and manage CI/CD pipelines with a DevOps mindset, ensuring rapid, reliable delivery of code.
* Develop and consume Flask-based RESTful APIs to support network and security automation.
* Collaborate within an Agile Scrum framework, utilizing tools such as Bitbucket and Jira to track progress and manage sprints.
* Apply strong analytical and problem-solving skills to balance multiple project variables and deliver high-quality solutions on schedule.
What we are looking for
* Approximately 2-3 years' hands-on Python programming experience, with a demonstrable track record of automation or tooling projects.
* Knowledge and experience working with both Python Django and Flask in a corporate environment.
* Any experience in network and security automation, coupled with understanding of network fundamentals (routing, switching, firewalls, VPNs) would be beneficial.
* Experience developing REST APIs using Flask (or a comparable Python framework).
* Applicants with front-end experience using Javascript/JQuery/HTML5/CSS would be ideal.
* Familiarity with Infrastructure-as-Code using Terraform (or similar) to manage cloud resources.
* Comfortable working in Linux environments and proficient in using Visual Studio Code (VSCode).
* Strong software engineering mindset: adherence to PEP8, SOLID principles, and best practices for SDLC, CI/CD and DevOps.
* Excellent communication skills, both verbal and written, with the ability to convey technical concepts to diverse stakeholders.
* Highly analytical, with the ability to troubleshoot complex issues and manage multiple tasks concurrently.
* Experience working in Agile Scrum teams, utilizing Bitbucket and Jira (or equivalent tools) for version control and project tracking.
Personal Attributes
* Proactive and positive attitude, taking initiative to identify and resolve issues ahead of time.
* Collaborative team player, eager to contribute knowledge and assist colleagues.
* Innovative thinker who brings fresh ideas and constructive suggestions for continuous improvement.
Education
Bachelor's Degree in Computer Science, Engineering or a related field is preferred. Equivalent practical experience will also be considered.
#LI - Hybrid
#LI - JK1
CME Group is committed to offering a competitive total rewards package for our employees that recognizes their contributions to the business and reflects our long-term investment in their future. The pay range for this role is $100,700-$167,800. Actual salary offered will be dependent on a wide array of factors including but not limited to: relevant experience, skills, education and comparison to internal employees (where relevant). Our compensation program also includes an annual target bonus opportunity for all employees, as well as the opportunity to become an owner in the company through our broad-based equity program. Through our benefits program, we strive to offer flexibility, value and choice. From comprehensive health coverage, to a retirement package that includes both a 401(k) and an active pension plan, to highly competitive education reimbursement provisions, paid time off and a mental health benefit, CME Group offers a holistic benefits package for our team and their dependents.
CME Group: Where Futures are Made
CME Group is the world's leading derivatives marketplace. But who we are goes deeper than that. Here, you can impact markets worldwide. Transform industries. And build a career by shaping tomorrow. We invest in your success and you own it - all while working alongside a team of leading experts who inspire you in ways big and small. Problem solvers, difference makers, trailblazers. Those are our people. And we're looking for more.
At CME Group, we embrace our employees' unique experiences and skills to ensure that everyone's perspectives are acknowledged and valued. As an equal-opportunity employer, we consider all potential employees without regard to any protected characteristic.
Important Notice: Recruitment fraud is on the rise, with scammers using misleading promises of job offers and interviews to solicit money and personal information from job seekers. CME Group adheres to established procedures designed to maintain trust, confidence and security throughout our recruitment process. Learn more here.
Equity Site Reliability Engineer
Jersey City, NJ jobs
Jefferies is seeking for Site Reliability Engineer to play an instrumental role in supporting Equity Front office trading application, risk and middle office real time products, developed and used for Equity Cash and ETS application.
As part of the wider platform engineering team, you will be working closely with the Business users interactively throughout the day, along with technical, analysis and testing colleagues. Investigation and resolution of the work items at hand will require competent technical skills and a keen intellect. The business is a growth area, with current investments taking place in all the technology, business and middle office areas.
Job Duties:
Front Line Site Reliable Engineering and Support functions for Equity trading systems used by Jefferies clients as well as internal users.
Build monitoring tools for application and infrastructure components.
Implement and manage scalable infrastructure using cloud-native technologies and tools.
Gather and analyze metrics from operating systems as well as applications to assist in performance tuning and fault finding.
Partner with business, development and infrastructure teams to improve services through rigorous testing and release procedures.
Develop and maintain CI/CD pipelines to streamline deployment processes.
Expedient deployment of new systems. Capacity planning, Platform Management, and support for increasing volumes and business growth.
Create sustainable systems and services through automation.
Collaborate with Application team to establish and enforce production and development standards.
Document procedures, best practices and troubleshooting FAQs.
Resolve complex application and technical problems.
Debugging the system and fixing the production related issues.
Escalate / follow-up on permanent fix for development related issues.
Lead incident response efforts and post-mortem analysis to prevent future occurrences.
Handles complex operational tasks and recommends process and technology changes.
Global support and includes weekend availability to troubleshoot production related issues and perform checkouts.
Ability to work both independently and in groups in an energetic, diverse environment.
Participate in on-call rotations to ensure 24/7 system availability and support.
Support compliance and legal queries.
Experience/skills Required:
Strong experience in Windows and Linux/Unix services.
Strong experience in scripting language like Power shell, Python and SQL.
Strong Knowledge of monitoring tools - Nagios, Splunk, OTEL, Datadog
Strong Knowledge of FIX protocol
Strong Domain skills - Must have working experience in Capital Markets across modules and instruments especially - CASH, ETS, Bonds, Options, Futures, Swaps products
Experience in BFSI (Banking and Financial Industry) Domain applications with a proper understanding of the Trade Lifecycle.
Excellent communication, time management and project management skills.
The Salary Range for this role is $150,000-$225,000
#LI-JR1
Auto-ApplySite Reliability Engineer II- Physical Security Technology
Jersey City, NJ jobs
JobID: 210659688 JobSchedule: Full time JobShift: Base Pay/Salary: Jersey City,NJ $118,750.00-$150,000.00 Play a key role in ensuring system reliability at one of the world's most iconic and largest financial institutions.
As a Site Reliability Engineer II at JPMorgan Chase within the enterprise technology, finance technology team, you will use technology to solve business problems and leverage software engineering best practices as we strive towards excellence. This role often works independently to execute small to medium projects, but you'll also have the opportunity to collaborate with cross functional teams to continually improve your level of knowledge about JPMorgan Chase's business and relevant technologies.
Job responsibilities
* Assist in the deployment and configuration of Genetec Security Center on Windows servers, ensuring successful implementation and integration of security systems.
* Provide first-level technical support to end-users, troubleshoot issues related to Genetec Security Center, and offer recommendations on best practices.
* Recognize and eliminate toil through systems engineering or automation, and implement observability patterns to improve service level indicators, objectives monitoring, and alerting solutions.
* Collaborate with senior IT staff and participate in training sessions to improve knowledge and skills related to Genetec Security Center and other IT systems.
* Monitor system performance and availability of core Genetec Security Center services, ensuring optimal transparency and analysis.
* Document and maintain accurate records of system configurations, changes, and support requests, ensuring clear communication and organization.
* Package Genetec Security Center software for client and server installs, and provide on-call support as needed to address urgent issues.
Required qualifications, capabilities, and skills
* Formal training or certification on software engineering concepts and 2+ years applied experience.
* 2+ years' experience working with Genetec Security Center, including configuring federations, and experience with installing and upgrading Genetec Security Center software while managing Windows patching.
* Familiarity with observability practices such as white and black box monitoring, service level objective alerting, and telemetry collection using tools like Grafana, Dynatrace, Prometheus, Datadog, and Splunk.
* Possession of Genetec Security Center Omnicast certification and a good understanding of network protocols and security principles.
* Experience working with third-party applications deployed on Windows Server environments and the ability to work with SQL Server, including running queries.
* Strong problem-solving skills and attention to detail, ensuring effective troubleshooting and resolution of issues.
* Excellent communication and interpersonal skills, facilitating collaboration and effective interaction with team members and stakeholders.
* Ability to work independently and as part of a team, demonstrating flexibility and adaptability in various work environments.
* Willingness to learn and adapt to new technologies, staying current with industry trends and advancements.
Preferred qualifications, capabilities, and skills
* General knowledge of financial services industry
* Experience working with third-party applications.
* Experience working with any other video management solutions
* Experience working with Intrusion Detection systems
* Genetec Mission Control certification is a plus
#LI-ID1
Auto-ApplySite Reliability Engineer II- Physical Security Technology
Jersey City, NJ jobs
Play a key role in ensuring system reliability at one of the world's most iconic and largest financial institutions. As a Site Reliability Engineer II at JPMorgan Chase within the enterprise technology, finance technology team, you will use technology to solve business problems and leverage software engineering best practices as we strive towards excellence. This role often works independently to execute small to medium projects, but you'll also have the opportunity to collaborate with cross functional teams to continually improve your level of knowledge about JPMorgan Chase's business and relevant technologies.
**Job responsibilities**
+ Assist in the deployment and configuration of Genetec Security Center on Windows servers, ensuring successful implementation and integration of security systems.
+ Provide first-level technical support to end-users, troubleshoot issues related to Genetec Security Center, and offer recommendations on best practices.
+ Recognize and eliminate toil through systems engineering or automation, and implement observability patterns to improve service level indicators, objectives monitoring, and alerting solutions.
+ Collaborate with senior IT staff and participate in training sessions to improve knowledge and skills related to Genetec Security Center and other IT systems.
+ Monitor system performance and availability of core Genetec Security Center services, ensuring optimal transparency and analysis.
+ Document and maintain accurate records of system configurations, changes, and support requests, ensuring clear communication and organization.
+ Package Genetec Security Center software for client and server installs, and provide on-call support as needed to address urgent issues.
**Required qualifications, capabilities, and skills**
+ Formal training or certification on software engineering concepts and 2+ years applied experience.
+ 2+ years' experience working with Genetec Security Center, including configuring federations, and experience with installing and upgrading Genetec Security Center software while managing Windows patching.
+ Familiarity with observability practices such as white and black box monitoring, service level objective alerting, and telemetry collection using tools like Grafana, Dynatrace, Prometheus, Datadog, and Splunk.
+ Possession of Genetec Security Center Omnicast certification and a good understanding of network protocols and security principles.
+ Experience working with third-party applications deployed on Windows Server environments and the ability to work with SQL Server, including running queries.
+ Strong problem-solving skills and attention to detail, ensuring effective troubleshooting and resolution of issues.
+ Excellent communication and interpersonal skills, facilitating collaboration and effective interaction with team members and stakeholders.
+ Ability to work independently and as part of a team, demonstrating flexibility and adaptability in various work environments.
+ Willingness to learn and adapt to new technologies, staying current with industry trends and advancements.
**Preferred qualifications, capabilities, and skills**
+ General knowledge of financial services industry
+ Experience working with third-party applications.
+ Experience working with any other video management solutions
+ Experience working with Intrusion Detection systems
+ Genetec Mission Control certification is a plus
\#LI-ID1
JPMorganChase, one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world's most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management.
We offer a competitive total rewards package including base salary determined based on the role, experience, skill set and location. Those in eligible roles may receive commission-based pay and/or discretionary incentive compensation, paid in the form of cash and/or forfeitable equity, awarded in recognition of individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation.
JPMorgan Chase & Co. is an Equal Opportunity Employer, including Disability/Veterans
**Base Pay/Salary**
Jersey City,NJ $118,750.00 - $150,000.00 / year
Site Reliability Engineer III
Jersey City, NJ jobs
There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.
As a Site Reliability Engineer III at JPMorgan Chase within the Consumer and Community banking team, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.
Job responsibilities
Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
Collaborates with technical experts, key stakeholders, and team members to resolve complex problems
Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
Supports the adoption of site reliability engineering best practices within your team
Required qualifications, capabilities, and skills
Formal training or certification on software engineering concepts and 3+ years applied experience
Proficient in site reliability culture and principles and familiarity with how to implement site reliability within an application or platform
Proficient in at least one programming language such as Python, Java/Spring Boot, and .Net
Proficient knowledge of software applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.)
Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others
Familiarity with container and container orchestration such as ECS, Kubernetes, and Docker
Familiarity with troubleshooting common networking technologies and issues
Ability to contribute to large and collaborative teams by presenting information in a logical and timely manner with compelling language and limited supervision
Ability to proactively recognize road blocks and demonstrates interest in learning technology that facilitates innovation
Ability to identify new technologies and relevant solutions to ensure design constraints are met by the software team
Preferred qualifications, capabilities, and skills
Participate on call support rota for high severity issues to help with diagnosis and collect facts for RCA
Ability to facilitate post mortem meetings for Root Cause Analysis and implement effective steps for stability improvements
Ability to write technical documentation for lessons learnt from issues and help improve runbook steps for Mission Control teams.
Completed AWS Solution Architect certification
Auto-ApplySite Reliability Engineer III- Kafka Platform Engineering
Jersey City, NJ jobs
There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.
As a Site Reliability Engineer III at JPMorgan Chase within the Infrastructure Platforms, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.
Job responsibilities
Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate.
Demonstrate deep knowledge of Kafka technology, Kafka connect framework, and distributed systems technologies, with the ability to operate in and migrate across public and private clouds.
Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability, scalability, and solutions in their applications
Implements infrastructure, configuration, and network as code for the applications and platforms in your remit.
Collaborates with technical experts, key stakeholders, and team members to resolve complex problems.
Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers.
Contribute to the development of technical documentation, including service APIs using Swagger, ensuring robust logging, auditability, security, and monitoring features.
Supports the adoption of site reliability engineering best practices within your team.
Engage in periodic on-call rotation shifts, providing client support and ensuring thorough monitoring of the platform.
Required qualifications, capabilities, and skills
Formal training or certification on computer science and reliability concepts and 3+ years applied experience.
Proficient in site reliability culture and principles and familiarity with how to implement site reliability within an application or platform
Proficient in at least one programming language such as Java/Spring Boot, python.
Proficient knowledge of software applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.)
Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc.
Experience with public cloud platforms like AWS, GCP or Azure.
Experience with Kafka ecosystem products: Kafka, Kafka Connect, Kafka Streams.
Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform.
Familiarity with container and container orchestration such as ECS, Kubernetes, and Docker.
Familiarity with troubleshooting common networking technologies and issues.
Ability to contribute to large and collaborative teams by presenting information in a logical and timely manner with compelling language and limited supervision
Ability to proactively recognize road blocks and demonstrates interest in learning technology that facilitates innovation
Ability to identify new technologies and relevant solutions to ensure design constraints are met by the software team
Ability to initiate and implement ideas to solve business problems.
Preferred qualifications, capabilities, and skills
Familiarity with running Apache Flink.
Understanding of authentication and authorization technologies (e.g., OAUTH, Kerberos).
Experience with AWS cloud services and Kubernetes platform orchestration.
Auto-ApplySite Reliability Engineer III
Jersey City, NJ jobs
There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.
As a Site Reliability Engineer III at JPMorgan Chase within the [insert LOB or sub LOB], you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.
Job responsibilities
Guides and assists others in the areas of building appropriate level designs and gaining consensus from peers where appropriate
Collaborates with other software engineers and teams to design and implement deployment approaches using automated continuous integration and continuous delivery pipelines
Collaborates with other software engineers and teams to design, develop, test, and implement availability, reliability, scalability, and solutions in their applications
Implements infrastructure, configuration, and network as code for the applications and platforms in your remit
Collaborates with technical experts, key stakeholders, and team members to resolve complex problems
Understands service level indicators and utilizes service level objectives to proactively resolve issues before they impact customers
Supports the adoption of site reliability engineering best practices within your team
Required qualifications, capabilities, and skills
Formal training or certification on software engineering concepts and 3+ years applied experience
Proficient in site reliability culture and principles and familiarity with how to implement site reliability within an application or platform
Proficient in at least one programming language such as Python, Java/Spring Boot, and .Net
Proficient knowledge of software applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.)
Experience in observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others
Experience with continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform
Familiarity with container and container orchestration such as ECS, Kubernetes, and Docker
Familiarity with troubleshooting common networking technologies and issues
Ability to contribute to large and collaborative teams by presenting information in a logical and timely manner with compelling language and limited supervision
Preferred qualifications, capabilities, and skills
Ability to proactively recognize road blocks and demonstrates interest in learning technology that facilitates innovation
Ability to identify new technologies and relevant solutions to ensure design constraints are met by the software team
Ability to initiate and implement ideas to solve business problems
Auto-ApplySite Reliability Engineer II- Physical Security Technology
Jersey City, NJ jobs
Play a key role in ensuring system reliability at one of the world's most iconic and largest financial institutions.
As a Site Reliability Engineer II at JPMorgan Chase within the enterprise technology, finance technology team, you will use technology to solve business problems and leverage software engineering best practices as we strive towards excellence. This role often works independently to execute small to medium projects, but you'll also have the opportunity to collaborate with cross functional teams to continually improve your level of knowledge about JPMorgan Chase's business and relevant technologies.
Job responsibilities
Assist in the deployment and configuration of Genetec Security Center on Windows servers, ensuring successful implementation and integration of security systems.
Provide first-level technical support to end-users, troubleshoot issues related to Genetec Security Center, and offer recommendations on best practices.
Recognize and eliminate toil through systems engineering or automation, and implement observability patterns to improve service level indicators, objectives monitoring, and alerting solutions.
Collaborate with senior IT staff and participate in training sessions to improve knowledge and skills related to Genetec Security Center and other IT systems.
Monitor system performance and availability of core Genetec Security Center services, ensuring optimal transparency and analysis.
Document and maintain accurate records of system configurations, changes, and support requests, ensuring clear communication and organization.
Package Genetec Security Center software for client and server installs, and provide on-call support as needed to address urgent issues.
Required qualifications, capabilities, and skills
Formal training or certification on software engineering concepts and 2+ years applied experience.
2+ years' experience working with Genetec Security Center, including configuring federations, and experience with installing and upgrading Genetec Security Center software while managing Windows patching.
Familiarity with observability practices such as white and black box monitoring, service level objective alerting, and telemetry collection using tools like Grafana, Dynatrace, Prometheus, Datadog, and Splunk.
Possession of Genetec Security Center Omnicast certification and a good understanding of network protocols and security principles.
Experience working with third-party applications deployed on Windows Server environments and the ability to work with SQL Server, including running queries.
Strong problem-solving skills and attention to detail, ensuring effective troubleshooting and resolution of issues.
Excellent communication and interpersonal skills, facilitating collaboration and effective interaction with team members and stakeholders.
Ability to work independently and as part of a team, demonstrating flexibility and adaptability in various work environments.
Willingness to learn and adapt to new technologies, staying current with industry trends and advancements.
Preferred qualifications, capabilities, and skills
General knowledge of financial services industry
Experience working with third-party applications.
Experience working with any other video management solutions
Experience working with Intrusion Detection systems
Genetec Mission Control certification is a plus
#LI-ID1
Auto-ApplySite Reliability Engineer III
Tampa, FL jobs
JobID: 210688698 JobSchedule: Full time JobShift: : Play a key role in ensuring system reliability at one of the world's most iconic and largest financial institutions. As a Site Reliability Engineer II at JPMorgan Chase within the Commercial and Investment bank, Digital and platform devices team , you will use technology to solve business problems and leverage software engineering best practices as we strive towards excellence. This role often works independently to execute small to medium projects, but you'll also have the opportunity to collaborate with cross functional teams to continually improve your level of knowledge about JPMorgan Chase's business and relevant technologies.
Job responsibilities
* Executes small to medium projects independently with initial direction and eventually graduates to designing and delivering projects by yourself
* Leverages technology to solve business problems by writing high quality, maintainable, and robust code following best practices in software engineering
* Participates in triaging, examining, diagnosing, and resolving incidents and work with others to solve problems at their root
* Recognizes the toil within your role and proactively works towards eliminating it through either systems engineering or updating application code
* Understands observability patterns and strives to implement and improve service level indicators, objectives monitoring, and alerting solutions for optimal transparency and analysis
Required qualifications, capabilities, and skills
* Formal training or certification on software engineering concepts and 3+ years of applied experience
* Ability to code in at least one programming language
* Experience maintaining a Cloud-base infrastructure
* Familiar with site reliability concepts, principles, and practices
* Familiar with observability such as white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others
* Familiarity with containers or a common Server OS such as Linux and Windows
* Emerging knowledge of software, applications and technical processes within a given technical discipline (e.g., Cloud, artificial intelligence, Android, etc.)
* Emerging knowledge of continuous integration and continuous delivery tools like Jenkins, GitLab, or Terraform
* Emerging knowledge of common networking technologies
Preferred qualifications, capabilities, and skills
* General knowledge of financial services industry
* Ability to work in a large, collaborative team and demonstrates the willingness to vocalize ideas with peers and managers
* Understanding of how to prioritize and adjust work plans to adapt to changes in assigned responsibilities and projects
* Eagerness to participate in learning opportunities to enhance one's effectiveness in executing day-to-day project activities
* Ability to demonstrate and apply existing and new system processes, methodologies, and skills to contribute to the development of systems
Auto-ApplySite Reliability Engineer II-1
Bogota, NJ jobs
Our Purpose Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we're helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.
Title and Summary
Site Reliability Engineer II-1
Overview
The GBSC EPMS team is looking for a Site Reliability Engineer who can help us solve problems, implement automation, and leverage best practices.
* Are you a born problem solver who loves to figure out how something works?
* Are you a detail -oriented individual who enjoys complex problem solving?
* Do you love determining the correct actions required to fix a problem?
* Do you have a low tolerance for manual work and look to automate everything you can?
Business Operations is leading the Site Reliability Engineering (SRE) transformation at Mastercard through our tooling and by being an advocate for change & standards throughout the development, quality, release, and product organizations. We need team members with an appetite for change and pushing the boundaries of what can be done with automation. Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must.
Responsibilities
* Engage in and improve the whole lifecycle of services-from inception and design, through deployment, operation and refinement.
* Analyze ITSM activities of the platform and provide feedback loop to development teams on operational gaps or resiliency concerns
* Support services before they go live through activities such as system design consulting, capacity planning and launch reviews.
* Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
* Scale systems sustainably through mechanisms like automation, and evolve systems by pushing for changes that improve reliability and velocity.
* Support the application CI/CD pipeline for promoting software into higher environments through validation and operational gating, and lead Mastercard in DevOps automation and best practices.
* Practice sustainable incident response and blameless postmortems.
* Take a holistic approach to problem solving, by connecting the dots during a production event thru the various technology stack that makes up the platform, to optimize mean time to recover
* Work with a global team spread across tech hubs in multiple geographies and time zones
* Share knowledge and mentor junior resources
All About You
* BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience.
* Experience with algorithms, data structures, scripting, pipeline management, software design and OLAP systems.
* Hands on experience with understanding custom objects using JavaScript, HTML5, CSS and API integrations.
* Systematic problem-solving approach, coupled with strong communication skills and a sense of ownership and drive.
* Ability to help debug and optimize code and automate routine tasks.
* We support many different stakeholders. Experience in dealing with difficult situations and making decisions with a sense of urgency is needed.
* Experience in one or more of the following is preferred: C, C++, Java, Python, Go, Perl, Ruby, MDX.
* Interest in designing, analyzing and troubleshooting large-scale distributed systems.
* We need team members with an appetite for change and pushing the boundaries of what can be done with automation. Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must.
Corporate Security Responsibility
All activities involving access to Mastercard assets, information, and networks comes with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
* Abide by Mastercard's security policies and practices;
* Ensure the confidentiality and integrity of the information being accessed;
* Report any suspected information security violation or breach, and
* Complete all periodic mandatory security trainings in accordance with Mastercard's guidelines.
Auto-Apply