Aumni - Site Reliability Engineer III - MLOPS
6 months ago
There’s nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.
As a Site Reliability Engineer III at JPMorgan Chase within the Digital Private Markets /Aumni (A JP Morgan Chase Company), you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform. As MLops Engineer, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize the models produced by our data science teams and their associated. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability in the AI/ML space.
Job responsibilities
Guides and assists others in the areas of designing and deploying new AI/ML models in the cloud, gaining consensus from peers where appropriate Designs and implements automated continuous integration and continuous delivery pipelines for the Data Science teams to develop and train AI/ML models Writes and deploys infrastructure as code for the models and pipelines you support Collaborates with technical experts, key stakeholders, and team members to resolve complex technical problems Understands the importance of monitoring and observability in the AI/ML space – . service level indicators and utilizes service level objectives Proactively resolve issues before they impact internal and external stakeholders of deployed models Supports the adoption of MLops best practices within your teamRequired qualifications, capabilities, and skills
Formal training or certification on site reliability engineering concepts and 3+ years applied experience Understanding of MLops culture and principles and familiarity with how to implement associated concepts at scale Domain knowledge of machine learning applications and technical processes within the AWS ecosystem Experience with infrastructure as code tooling such as Terraform, Cloudformation Experience with container and container orchestration such as ECS, Kubernetes, and Docker Knowledge of continuous integration and continuous delivery tools like Jenkins, GitLab, or Github Actions Proficiency in the following programming languages: Python, Bash Hands-on knowledge of Linux and networking internals Understanding of the different roles served by data engineers, data scientists, machine learning engineers, and system architects, and how MLops contributes to each of these workstreams Ability to identify new technologies and relevant solutions to ensure design constraints are met by the Data Science and Machine Learning teams Preferred qualifications, capabilities, and skills Experience with Model training and deployment pipelines, managing scoring endpoints Familiarity with observability concepts and telemetry collection using tools such as Datadog, Grafana, Prometheus, Splunk, and others Understanding of data engineering platforms such as Databricks or Snowflake, and machine learning platforms such as AWS Sagemaker Comfortable troubleshooting common containerization technologies and issues Ability to proactively recognize road blocks and demonstrates interest in learning technology that facilitates innovation-
Aumni - MLOps SRE III
6 months ago
Jersey City, United States JPMorgan Chase & Co. Full timeThere’s nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems. As a Site Reliability Engineer III at JPMorgan Chase within the Technology Infrastructure of Aumni, you will solve complex and broad business...
-
Senior MLOps Engineer
4 weeks ago
new york city, United States Harnham Full timeSenior MLOps EngineerFully Remote6-Month Contract to Hire$80-$100/hrAre you looking to advance your MLOps career in an innovative and AI driven healthcare company? Apply below!THE COMPANYWe are currently partnered with one of the largest healthcare systems in the U.S. - they are disrupting the healthcare field by innovating and advancing ML/AI through their...
-
Senior MLOps Engineer
4 weeks ago
new york city, United States Harnham Full timeSenior MLOps EngineerFully Remote6-Month Contract to Hire$80-$100/hrAre you looking to advance your MLOps career in an innovative and AI driven healthcare company? Apply below!THE COMPANYWe are currently partnered with one of the largest healthcare systems in the U.S. - they are disrupting the healthcare field by innovating and advancing ML/AI through their...
-
Aumni - Manager of Software Engineering
21 hours ago
Jersey City, United States JPMorganChase Full timeJob DescriptionJOB DESCRIPTIONAs a Lead of Software Engineering at JPMorgan Chase within the Aumni team, you serve in a leadership role by providing technical coaching and advisory for multiple technical teams, as well as anticipate the needs and potential dependencies of other functions within the firm. As an expert in your field, your insights influence...
-
Aumni - Lead Software Engineer, Data
21 hours ago
Jersey City, United States JPMorganChase Full timeJob DescriptionJOB DESCRIPTIONBe one of the driving forces behind designing the data platform that is powering the future of private markets. As a Lead Software Engineer within Aumni’s Data Platform team, you will be integral in implementing the data infrastructure and delivering the data backing key to stakeholders across the organization in a secure,...
-
Site Reliability Engineer
4 weeks ago
Foster City, California, United States Omega Solutions Inc Full timeJob Title: Site Reliability EngineerAt Omega Solutions Inc, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our critical platforms and applications.Key Responsibilities:* 8+ years of experience in Site Reliability...
-
Site Reliability Engineer
1 month ago
Oklahoma City, Oklahoma, United States Paycom Online Full timeJob DescriptionPaycom Online is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the integrity, functionality, and reliability of our applications and sites.Key ResponsibilitiesDevelop software to detect unusual error activity and implement workflows and processes to identify...
-
Site Reliability Engineer
4 weeks ago
Jersey City, New Jersey, United States The Goldman Sachs Group, Inc Full timeJob Title: Site Reliability EngineerAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services.Your...
-
Site Reliability Engineer
4 weeks ago
Foster City, California, United States Bayone Full timeJob SummaryAt Bayone, we are seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and performance of our large production service.Key ResponsibilitiesHost OS upgradesDocker image upgradesSSL certificate upgradesRequirementsBachelor's degree in...
-
Site Reliability Engineer
1 month ago
Oklahoma City, United States Paycom Payroll Llc Full timeSite reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites.Do not wait to apply after reading...
-
Site Reliability Engineer
4 months ago
Oklahoma City, United States Paycom Payroll Llc Full timeSite reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites.RESPONSIBILITIESDevelop software to...
-
Site Reliability Engineer
3 weeks ago
Oklahoma City, United States Paycom Payroll Llc Full timeSite reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites.Do not wait to apply after reading...
-
Site Reliability Engineer
3 weeks ago
Oklahoma City, United States Paycom Payroll Llc Full timeSite reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites.RESPONSIBILITIESDevelop software to...
-
Site Reliability Engineer
4 weeks ago
Jersey City, New Jersey, United States The Goldman Sachs Group, Inc Full timeAbout the RoleWe are seeking a talented Site Reliability Engineer to join our SRE Platforms team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services.Our team is responsible for designing and...
-
Site Reliability Engineer
1 month ago
Oklahoma City, United States Paycom Online Full timeSite reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites. RESPONSIBILITIES Develop...
-
AWS Site Reliability Engineer
4 weeks ago
Jersey City, New Jersey, United States Syntricate Technologies Full timeWe are seeking a highly skilled AWS Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure, particularly our AWS environment.The ideal candidate will have strong experience with AWS, with a focus on SRE principles...
-
JP Morgan Chase | Aumni
20 hours ago
jersey city, United States JP Morgan Chase Full timeBe one of the driving forces behind designing the data platform that is powering the future of private markets. As a Lead Software Engineer within Aumni’s Data Platform team, you will be integral in implementing the data infrastructure and delivering the data backing key to stakeholders across the organization in a secure, stable and scalable way. Job...
-
Structural Engineer III
4 weeks ago
Salt Lake, Utah, United States Dennis Group Full timeAbout the JobWe are seeking a talented Structural Engineer III to support our building structural practice (industrial & Commercial buildings). You will play a key role in the design, permitting, and construction of food facilities throughout the United States. As a member of our structural department, your responsibilities will include:Working independently...
-
Site Reliability Engineer
4 weeks ago
Jersey City, New Jersey, United States Syntricate Technologies Full timeWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure, particularly on AWS. Your strong AWS experience and 2-3 years of recent experience will be invaluable in this role.The ideal...
-
Site Reliability Engineer
4 weeks ago
Kansas City, Missouri, United States Datum Technologies Group Full timeJob SummaryAt Datum Technologies Group, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining efficient technology platforms that meet both internal and external customer needs while effectively managing associated risks.Key...