Current jobs related to Aumni - Site Reliability Engineer III - MLOPS - Jersey City, New Jersey - JPMorganChase


  • Jersey City, New Jersey, United States Open Systems Technologies Full time

    Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Open Systems Technologies. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our distributed systems.Key Responsibilities:Design, implement, and maintain distributed systems to ensure...


  • Jersey City, New Jersey, United States Aloden, Inc. Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Aloden, Inc. in Jersey City, New Jersey. As a Site Reliability Engineer, you will be responsible for ensuring the stability and reliability of our Onyx blockchain platform in production.Key Responsibilities:Ensure the stability and reliability of...


  • Jersey City, New Jersey, United States Goldman Sachs Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our production systems, handling issues, managing incidents, and providing support to our users.ResponsibilitiesOwn production processes, handling...


  • Jersey City, New Jersey, United States Syntricate Technologies Full time

    Job Title: Site Reliability Engineer - AWSWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure, particularly on AWS.Key Responsibilities:Design, implement, and maintain scalable and...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    Job Overview:About the RoleFidelity Investments is seeking a highly skilled Principal Site Reliability Engineer to join our Technical Operations team. As a key member of our team, you will be responsible for designing, implementing, and maintaining our cloud infrastructure to ensure high availability, scalability, and security.Key ResponsibilitiesDesign and...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    The RoleWe are seeking a highly skilled Principal Site Reliability Engineer to join our Technical Operations team at Fidelity Investments. As a member of this team, you will play a critical role in ensuring the reliability, scalability, and security of our cloud-based infrastructure. You will work closely with our engineering partners to design, implement,...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    Job SummaryWe are seeking a highly skilled Principal Site Reliability Engineer to join our TechOps team at Fidelity Investments. As a key member of our team, you will play a critical role in ensuring the reliability, scalability, and security of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement highly available, secure, and scalable...


  • Jersey City, New Jersey, United States Syntricate Technologies Full time

    We are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a key member of our infrastructure team, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure using AWS servicesDevelop and...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    About the RoleWe are seeking a highly skilled Principal Site Reliability Engineer to join our TechOps SRE team at Fidelity Investments. As a member of this team, you will work closely with our engineering partners to help enable and drive initiatives from design to implementation.Key ResponsibilitiesDesign and implement highly available, secure, scalable...


  • Jersey City, New Jersey, United States RBC Capital Markets, LLC Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at RBC Capital Markets, LLC. As a key member of our Application Support team, you will be responsible for ensuring the reliability and performance of our applications and infrastructure.Key ResponsibilitiesPerform application production support, including off-hours...


  • Jersey City, New Jersey, United States Goldman Sachs Full time

    About the RoleAt Goldman Sachs, we're seeking a talented Site Reliability Engineer to join our Platforms team. As a key member of our global engineering team, you'll be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services.Your ImpactYou'll work closely...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    About the RoleWe are seeking a highly skilled Principal Site Reliability Engineer to join our TechOps SRE team at Fidelity Investments. As a member of this team, you will work closely with our engineering partners to help enable and drive initiatives from design to implementation.Key ResponsibilitiesDesign and implement highly available, secure, scalable...


  • Jersey City, New Jersey, United States RBC Capital Markets, LLC Full time

    Job SummaryWe are seeking a highly skilled Lead Site Reliability Engineer to join our team at RBC Capital Markets, LLC. As a key member of our Technology and Operations team, you will be responsible for designing, implementing, and maintaining scalable and reliable cloud infrastructure solutions.Key ResponsibilitiesDesign and implement monitoring and...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    The RoleWe are seeking a highly skilled Site Reliability Engineer to join our TechOps team. As a member of this team, you will work closely with our engineering partners to design, implement, and maintain our enterprise-grade infrastructure strategy. Our team is responsible for ensuring the high availability and scalability of our multi-region Kubernetes...


  • Jersey City, New Jersey, United States The Goldman Sachs Group Full time

    About the RoleAt The Goldman Sachs Group, we're seeking a highly skilled Site Reliability Engineering Specialist to join our Platforms team. As a key member of our global engineering team, you'll be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform...


  • Jersey City, New Jersey, United States RBC Capital Markets, LLC Full time

    Job SummaryWe are seeking a highly skilled Lead Site Reliability Engineer to join our team at RBC Capital Markets, LLC. As a key member of our Technology and Operations team, you will be responsible for designing, implementing, and maintaining scalable and reliable cloud infrastructure solutions.Key ResponsibilitiesDesign and implement monitoring and...


  • Jersey City, New Jersey, United States Hispanic Technology Executive Council Full time

    Job Description:We are seeking a highly skilled Cloud Senior Site Reliability Engineer to join our team. As a key member of our engineering organization, you will be responsible for designing, building, and maintaining our next-gen AWS platform.You will work closely with our development and infrastructure teams to ensure the reliability, scalability, and...


  • Jersey City, New Jersey, United States The Goldman Sachs Group Full time

    About the RoleWe are seeking a talented Site Reliability Engineer to join our Platforms team at Goldman Sachs. As a key member of our team, you will be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services.Your ImpactYou will work with customers, product...


  • Jersey City, New Jersey, United States RBC Capital Markets, LLC Full time

    Job SummaryThe Lead Site Reliability Engineer will be responsible for designing, implementing, and maintaining Site Reliability Engineering solutions for all applications within RBC Capital Markets, LLC.Key ResponsibilitiesPerform application production support role including off-hours support.Spearhead the development of SRE solutions (monitoring and...


  • Jersey City, New Jersey, United States Fidelity Investments Full time

    Job Description:The RoleAs a member of the TechOps SRE team at Fidelity Investments, you will work closely with our engineering partners to enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes (AWS EKS) environments are best-in-class and central to our enterprise-grade infrastructure strategy. These growing...

Aumni - Site Reliability Engineer III - MLOPS

3 months ago


Jersey City, New Jersey, United States JPMorganChase Full time

Job Description
There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.
As a Site Reliability Engineer III at JPMorgan Chase within the Digital Private Markets /Aumni (A JP Morgan Chase Company), you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform. As MLops Engineer, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize the models produced by our data science teams and their associated. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability in the AI/ML space.
Job responsibilities

  • Guides and assists others in the areas of designing and deploying new AI/ML models in the cloud, gaining consensus from peers where appropriate
  • Designs and implements automated continuous integration and continuous delivery pipelines for the Data Science teams to develop and train AI/ML models
  • Writes and deploys infrastructure as code for the models and pipelines you support
  • Collaborates with technical experts, key stakeholders, and team members to resolve complex technical problems
  • Understands the importance of monitoring and observability in the AI/ML space - i.e. service level indicators and utilizes service level objectives
  • Proactively resolve issues before they impact internal and external stakeholders of deployed models
  • Supports the adoption of MLops best practices within your team

Required qualifications, capabilities, and skills

  • Formal training or certification on site reliability engineering concepts and 3+ years applied experience
  • Understanding of MLops culture and principles and familiarity with how to implement associated concepts at scale
  • Domain knowledge of machine learning applications and technical processes within the AWS ecosystem
  • Experience with infrastructure as code tooling such as Terraform, Cloudformation
  • Experience with container and container orchestration such as ECS, Kubernetes, and Docker
  • Knowledge of continuous integration and continuous delivery tools like Jenkins, GitLab, or Github Actions
  • Proficiency in the following programming languages: Python, Bash
  • Hands-on knowledge of Linux and networking internals
  • Understanding of the different roles served by data engineers, data scientists, machine learning engineers, and system architects, and how MLops contributes to each of these workstreams
  • Ability to identify new technologies and relevant solutions to ensure design constraints are met by the Data Science and Machine Learning teams

Preferred qualifications, capabilities, and skills

  • Experience with Model training and deployment pipelines, managing scoring endpoints
  • Familiarity with observability concepts and telemetry collection using tools such as Datadog, Grafana, Prometheus, Splunk, and others
  • Understanding of data engineering platforms such as Databricks or Snowflake, and machine learning platforms such as AWS Sagemaker
  • Comfortable troubleshooting common containerization technologies and issues
  • Ability to proactively recognize road blocks and demonstrates interest in learning technology that facilitates innovation

About Us
JPMorgan Chase & Co., one of the oldest financial institutions, offers innovative financial solutions to millions of consumers, small businesses and many of the world's most prominent corporate, institutional and government clients under the J.P. Morgan and Chase brands. Our history spans over 200 years and today we are a leader in investment banking, consumer and small business banking, commercial banking, financial transaction processing and asset management.
We offer a competitive total rewards package including base salary determined based on the role, experience, skill set, and location. For those in eligible roles, we offer discretionary incentive compensation which may be awarded in recognition of firm performance and individual achievements and contributions. We also offer a range of benefits and programs to meet employee needs, based on eligibility. These benefits include comprehensive health care coverage, on-site health and wellness centers, a retirement savings plan, backup childcare, tuition reimbursement, mental health support, financial coaching and more. Additional details about total compensation and benefits will be provided during the hiring process.
We recognize that our people are our strength and the diverse talents they bring to our global workforce are directly linked to our success. We are an equal opportunity employer and place a high value on diversity and inclusion at our company. We do not discriminate on the basis of any protected attribute, including race, religion, color, national origin, gender, sexual orientation, gender identity, gender expression, age, marital or veteran status, pregnancy or disability, or any other basis protected under applicable law. We also make reasonable accommodations for applicants' and employees' religious practices and beliefs, as well as mental health or physical disability needs. Visit our FAQs for more information about requesting an accommodation.
JPMorgan Chase is an Equal Opportunity Employer, including Disability/Veterans
About the Team
The new Digital Private Markets business is leveraging the power of J.P. Morgan to create an integrated suite of transactional and analytics solutions, focused on the private markets.
This innovative multi-disciplinary team of engineers, data scientists, product managers, designers, and securities professionals is building bespoke digital solutions for private market investors, and high-growth companies.
We aim to transform private markets by offering a combination of portfolio management, data, and analytics tools, as well as a suite of primary capital raising tools for companies and funds, and secondary liquidity services.