Machine Learning Engineer/ SRE Site Reliability Engineer

3 weeks ago


Chicago, United States Georgia IT Inc Full time
Role: Machine Learning Engineer/SRELocation: Chicago, IL or RemoteDuration: 12 MonthsRate: DOE

US Citizens and Green cards & GC-EAD Only. No Third-party C2C available for this job

We are seeking a highly skilled and motivated Machine Learning Engineer who possesses expertise in developing, deploying, and managing machine learning models. In this role, you will be an integral part of our AI Engineering and Site Reliability Engineering (SRE) teams, responsible for managing Azure infrastructure for AI model development and deployment, monitoring and reporting model performance, and responding to outages/incidents related to model operations.

Key Responsibilities:

Manage Azure Infrastructure: Configure, maintain, and optimize Azure infrastructure for AI model development and deployment, ensuring scalability and performance.
Model Performance Monitoring: Implement and maintain monitoring systems to track model performance, proactively identifying and addressing issues as they arise.
Incident Response: Collaborate with the SRE team to respond promptly to outages and incidents related to model operations, ensuring minimal downtime and rapid issue resolution.

Skills and Qualifications:

Azure Infrastructure Experience: Proficiency in managing Azure infrastructure components, including virtual machines, storage, and networking, to support AI model development and deployment.
CI/CD Pipeline Experience: Experience with Continuous Integration/Continuous Deployment (CI/CD) pipelines, including the automation of model deployment processes.
Containerization in the Cloud: Strong knowledge of containerization technologies in the cloud, such as Docker and Kubernetes, for efficient deployment and scaling of machine learning models.
Machine Learning Expertise: Proficient in building and optimizing machine learning models, with a deep understanding of various Client algorithms and frameworks.
Programming Skills: Proficiency in programming languages commonly used in machine learning, such as Python and libraries like TensorFlow and PyTorch.
Data Management: Experience in data preprocessing, feature engineering, and data pipeline development for machine learning.
Collaborative Team Player: Excellent communication skills and the ability to work collaboratively with cross-functional teams, including AI engineers and SREs.
Documentation: Effective documentation skills to maintain clear and organized records of models, infrastructure configurations, and incident responses.
Preferred Qualifications :

Experience with cloud-based machine learning platforms (e.g., Azure Machine Learning).
Experience with CI/ CD tools to deploying Client services and applications specific to Azure cloud platform
Familiarity with DevOps practices and tools for automating infrastructure and deployments.
Knowledge of model versioning and model management tools.
Understanding of security best practices in AI model deployment.
Certifications in relevant areas, such as Azure certifications or machine learning certifications.

Job titles of folks with these skills may vary - e.g. MLOps Lead, MLOps Solution/Delivery Architect or Senior Client Engineer


  • Chicago, Illinois, United States Georgia IT Inc Full time

    Role: Machine Learning Engineer/SRELocation: Chicago, IL or RemoteDuration: 12 MonthsRate: DOEUS Citizens and Green cards & GC-EAD Only. No Third-party C2C available for this jobWe are seeking a highly skilled and motivated Machine Learning Engineer who possesses expertise in developing, deploying, and managing machine learning models. In this role, you will...


  • Chicago, United States Balyasny Asset Management Full time

    We are looking for a Senior Site Reliability Engineer who can cultivate our SRE philosophy, processes, and technologies from the ground up. As a Senior Site Reliability Engineer within the Platform group, you will lay the groundwork for our SRE infrastructure. Your role will entail driving standards and fostering adoption across our technology teams, whilst...


  • Chicago, Illinois, United States Balyasny Asset Management L. P Full time

    We are looking for a Senior Site Reliability Engineer who can cultivate our SRE philosophy, processes, and technologies from the ground up.As a Senior Site Reliability Engineer within the Platform group, you will lay the groundwork for our SRE infrastructure.Develop and promote our SRE philosophy, establishing best practices and processes that will be...


  • Chicago, United States Saxon Global Full time

    Site Reliability Engineer (SRE) - (Azure, Systems background) Client: Lexis Nexis Location: REMOTE Rate: $62 C2C Duration: 1 Year Notes: Azure, Systems background experience •BSc Engineering/Computer Science or relevant experience. •Proven background working in a technical, IT related position. •Desirable -Azure Certifications ...


  • Chicago, United States Apolis Full time

    Cloud Site Reliability Engineer (SRE) 6-12 months Hybrid, need be onsite 3 days/week in any of these locations: Chicago, IL Kennesaw, GA Jacksonville, FL Must haves: Strong scripting/programming skills: Python Ansible Powershell/PowerCLI REST API / Swagger Understanding of ITIL processes is a plus Hands on...


  • Chicago, United States JobRialto Full time

    Top 3 requirements: Ecommerce experience (think Nordstrom, Target, where you purchase a product) Java Spring boot Kubernetes Plusses: Azure Kubernetes preferred Description: Client is looking for a forward-thinking, energetic Site Reliability Engineering Manager to join our team. Client serves the ecommerce needs of leading and growing grocery retailers...


  • Chicago, United States Cleo Full time

    Site Reliability Engineer At Cleo, we make doing business easy! Cleo is an established software company with a start-up feel. We have awesome products, which go hand in hand with our awesome culture! We are devoted to our people and pride ourselves on creating a fun, laid-back, but fast-paced work environment. Not only do we work hard, we play hard. We have...


  • Chicago, United States McDonald's Corporation Full time

    Job Description This opportunity is part of the DevOps COE in CPP Delivery office, where our mission is to help our product engineering teams deliver faster with improved quality and reliability. We work multi-functional with our global product teams and market teams in defining and executing on our automation test strategy, improving our build and deploy...


  • Chicago, United States CME Group Full time

    Senior Data Reliability Engineer (Data SRE) CME Group is the world's leading and most diverse derivatives marketplace. But who we are goes deeper than that, here you can impact markets worldwide, transform industries and build a career shaping tomorrow. We invest in your success and you own it, all the while working alongside a team of leading experts who...


  • Chicago, United States R2 Global Full time

    Our client, a financial services giant, is looking for a Principal SRE professional to join the team and lead observability efforts throughout a major cloud project and beyond. This role will work 3x's a week in the Downtown Chicago area onsite. Key Responsibilities: Lead and mentor a team of site reliability engineers, fostering a culture of collaboration,...


  • Chicago, United States R2 Global Full time

    Our client, a financial services giant, is looking for a Principal SRE professional to join the team and lead observability efforts throughout a major cloud project and beyond.This role will work 3x's a week in the Downtown Chicago area onsite.Key Responsibilities:Lead and mentor a team of site reliability engineers, fostering a culture of collaboration,...


  • Chicago, United States R2 Global Full time

    Our client, a financial services giant, is looking for a Principal SRE professional to join the team and lead observability efforts throughout a major cloud project and beyond.This role will work 3x's a week in the Downtown Chicago area onsite.Key Responsibilities:Lead and mentor a team of site reliability engineers, fostering a culture of collaboration,...


  • Chicago, United States R2 Global Full time

    Our client, a financial services giant, is looking for a Principal SRE professional to join the team and lead observability efforts throughout a major cloud project and beyond.This role will work 3x's a week in the Downtown Chicago area onsite.Key Responsibilities:Lead and mentor a team of site reliability engineers, fostering a culture of collaboration,...


  • Chicago, United States Manufacturing Engineer Full time

    Seeking to hire a Process Engineer for a direct hire opportunity in Chicago, IL.The Process Engineer will be responsible for maintaining, developing, and operating manufacturing systems to keep production costs down, while maintaining the quality of products. Will make recommendations to improve productivity and efficiency of operations. Essential Duties and...


  • Chicago, United States Chicago Mercantile Exchange Inc. Full time

    Description This role is hybrid requires to be 2 days on site in our Chicago office. This role does not allow to work outside of Illinois state. Position Overview: Data System Reliability Engineer (dSRE) CME Group: Where Futures Are Made CME Group is the world's leading and most diverse derivatives marketplace. But who we are goes deeper than that, here...


  • Chicago, United States CME Group Full time

    Description This role is hybrid requires to be 2 days on site in our Chicago office. This role does not allow to work outside of Illinois state. Position Overview: Data System Reliability Engineer (dSRE) CME Group: Where Futures Are Made CME Group is the world's leading and most diverse derivatives marketplace. But who we are goes deeper than...


  • Chicago, United States Chicago Mercantile Exchange Inc. Full time

    Description Position Overview: Data System Reliability Engineer (dSRE) CME Group: Where Futures Are Made CME Group is the world's leading and most diverse derivatives marketplace. But who we are goes deeper than that, here you can impact markets worldwide, transform industries and build a career shaping tomorrow. We invest in your success and you own it, all...

  • SRE Lead

    1 day ago


    Chicago, United States Diverse Lynx Llc Full time

    SRE Lead Location : Chicago (local to Chicago area who can travel to work in Chicago downtown 3 days a week.)Start Date: ASAP ( after clearing the client interview)  SRE Role OverviewClient is looking for a Lead Software Engineer to join our Public Sector Core Framework platform team. This individual will play a SRE (Site Reliability Engineer) role in an...


  • Chicago, United States Info Way Solutions Full time

    Site Reliability Engineer in Wealth Management Chicago (IL) / Tempe (AZ) Onsite Job ROLE: This role will be Responsible for application observability, maintenance, and support, identifying and implementing preventive measures proactively, evaluates and makes recommendation on techniques, practices, or technologies that would enhance business needs. As a SRE...


  • Chicago, United States Oak Street Health Full time

    Role DescriptionAs an Engineer I - Site Reliability Engineer (SRE), you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications. You will work closely with cross-functional teams to implement automation, optimize processes, and enhance observability to maintain high availability and performance of our...