SRE Lead (Machine Learning Engineering Team)

JPMorgan Chase

Job Description

As a Site Reliability Engineer (SRE), you'll help build a meaningful engineering discipline, combining software and systems to develop creative engineering solutions to operations problems. Much of our support and software development focuses on optimizing existing systems, building infrastructure, and reducing work through automation. You’ll join a team of curious problem solvers with a diverse set of perspectives who are thinking big and taking risks. In this environment, you’ll take the lead on relevant projects, supported by an organization that provides the support and mentorship you need to learn and grow. As an SRE, you’ll be focused on running better production applications and systems.

This role is open on the Machine Learning team within credit/risk. The team is looking for someone to monitor all data sources, make sure the data arrives on time, and ask why a feed failed when it does not. You will work with large volumes of data and run models. Part of the role is handling all releases and maintaining the release process for the products. You will be responsible for disaster recovery and producing runbooks, as well as documenting, managing, and monitoring. The business users are in the US, and the business processes are run in the UK.

Qualifications:

  • Bachelor’s degree or equivalent experience in a software engineering discipline
  • Expertise in at least one technology stack designing, coding, testing, and delivering software
  • Proficiency in one or more technology domains; may be a cross-domain expert able to solve complex, mission-critical problems within a business or across the firm
  • Working knowledge of infrastructure components (e.g. routers, load balancers, cloud products, container systems, compute, storage, and networks)
  • Excellent debugging and troubleshooting skills

Responsibilities:

  • Design, code, test, and deliver software to automate manual operational work
  • Troubleshoot priority incidents, facilitate blameless post-mortems, and ensure permanent closure of incidents
  • Engage with the development team throughout the life cycle to help build software for reliability and scale, minimizing later refactoring or changes
  • Identify application patterns and analytics in support of better service level objectives
  • Design self-healing and resiliency patterns
  • Design automated software and product upgrades, change management, and release management solutions
  • Coach or manage teams as applicable
  • Participate in the 24x7 support coverage as needed

Company Info.

JPMorgan Chase

For over 200 years, JPMorgan Chase & Co. has provided innovative financial solutions for consumers, small businesses, corporations, governments, and institutions around the world. Today, we're a leading global financial services firm with operations servicing clients in more than 100 countries. JPMorgan Chase & Co. is an American multinational investment bank and financial services holding company headquartered in New York City.
