Principal Machine Learning Engineer

2 days ago


Houston TX United States MD Anderson Cancer Center Full time

MD Anderson is expanding our Enterprise MLOps and Analytics Platform capabilities to enhance support for MLOps and ModelOps, facilitating the operationalization, monitoring, and management of in-house and third-party AI solutions. This expansion is integral to strengthening our overall AI Governance framework.

We are actively seeking a Principal MLOps Engineer to lead and establish a team of MLOps engineers tasked with designing and enhancing our Enterprise MLOps and Analytics Platform. The Principal MLOps Engineer will be pivotal in managing the operations and overseeing the entire MLOps and ModelOps tech stack, ensuring a robust end-to-end lifecycle management and governance of machine learning models throughout the organization. As the principal figure in MLOps, this role involves defining the team dynamics, shaping the culture, and implementing processes and technologies essential for supporting ML solutions within our hybrid data and compute framework. Collaborating closely with IT, cybersecurity, and compliance teams, the Principal MLOps Engineer will be instrumental in creating a secure and compliant infrastructure for the scalable management of AI/ML models.

Key responsibilities include:
  • Lead and mentor a team of MLOps Engineers to create a scalable MLOps and Analytics platform within a hybrid compute environment, including Kubernetes and Azure.
  • Design, implement, and oversee CI/CD pipelines, ensuring the infrastructure is conducive to ML model training, deployment, and monitoring while upholding security, scalability, reliability, and performance.
  • Develop, refine, and standardize Model Governance integrations, performance tracking for bias and impact, and a model catalog with standardized scorecards and deployments.
  • Innovate automated validation, deployment, observability, and management tools for scalable and reproducible AI solutions.
  • Design fallback and decommissioning strategies for AI solutions to ensure operational continuity.
  • Deliver training on AI solutions to enhance understanding and application across the organization.
  • Engage with technology trends, contribute to tech communities, and foster a culture of continuous learning and innovation.
Technical Expertise
  • Demonstrate deep understanding of the AI/ML Platform infrastructure and cloud architecture.
  • Experience developing and deploying AI/ML algorithms into production.
  • Strong proficiency in Python and C++ or C#, complemented by experience with machine learning libraries such as TensorFlow, PyTorch, and Scikit-learn.
  • Knowledge of DevOps practices, CI/CD pipelines, including tools like Azure DevOps or Git Actions.
  • Proficiency in working with containers such as Docker and container orchestration systems like Kubernetes. Familiarity with process orchestration/DAGs tools.
  • Experience with data, code, and model artifact management processes and MLOps tools.
  • Experience with on-premises, cloud-based, and hybrid computing environments, as well as cloud-native tools and services.
  • Knowledge of ISO standards for software and/or AI development lifecycle management.
Analytical Skills
  • Experience with project management methodologies (e.g. SAFe agile, PRINCE2, Lean methodology).
  • Deep understanding of the AI/ML Model Lifecycle Management.
  • Proficient in decision-making, problem-solving, and the successful execution of AI/ML solutions in a healthcare environment.
  • Manage AI/ML projects throughout their lifecycle, ensuring timely delivery, budget adherence, and quality standards compliance.
  • Experience leading an ML engineering and/or data scientist team focused on developing, deploying, and maintaining production-ready models.
  • Experience working closely with third-party vendors and partners to integrate new AI solutions into existing infrastructure and workflows.
  • Experience implementing risk identification and mitigation strategies, including contingency planning, to prevent project delays and complications.
  • Preference for working knowledge of hospital workflows.
  • Preference for experience with healthcare data privacy and security protocols, such as HIPAA.
  • Preference for experience with flowcharts, business process models, mapping tools, data flow diagrams, and process flow diagrams.
Professionalism: Oral and Written
  • Work closely with data scientists, ML engineers, software engineers, and other stakeholders to understand requirements and integrate machine learning models into the overall system.
  • Create and maintain comprehensive documentation for CI/CD pipelines, deployment processes, and infrastructure configurations.
  • Experience reporting on project progress, impact, and risks to leadership and stakeholders, providing strategic advice to help prioritize AI/ML solutions use-cases.
  • Experience stakeholder management to drive adoption, address concerns, and prioritize solution support.
Other duties as assigned Education Required:

Bachelor's degree in Computer Science, Software Engineering, Data Science, Physics, Math & Statistics, or another related engineering discipline.

Preferred Education:

Master's Level Degree

Experience Required:

Seven years of experience in machine learning engineering, data science, data engineering, and/or software engineering. With Master's degree, five years' experience required. With PhD, three years of experience required.

Preferred Experience:

Experience working on production quality healthcare focused machine learning solutions.

Proficiency in architecting, implementing, and using MLOps solutions.

It is the policy of The University of Texas MD Anderson Cancer Center to provide equal employment opportunity without regard to race, color, religion, age, national origin, sex, gender, sexual orientation, gender identity/expression, disability, protected veteran status, genetic information, or any other basis protected by institutional policy or by federal, state or local laws unless such distinction is required by law.

Additional Information
  • Requisition ID: 166432
  • Employment Status: Full-Time
  • Employee Status: Regular
  • Work Week: Days
  • Minimum Salary: US Dollar (USD) 160,500
  • Midpoint Salary: US Dollar (USD) 203,000
  • Maximum Salary : US Dollar (USD) 245,500
  • FLSA: exempt and not eligible for overtime pay
  • Fund Type: Hard
  • Work Location: Remote (within Texas only)
  • Pivotal Position: Yes
  • Referral Bonus Available?: Yes
  • Relocation Assistance Available?: Yes
  • Science Jobs: No
#J-18808-Ljbffr

  • Boston, MA, United States NLP PEOPLE Full time

    Job Title: Principal / Director Machine Learning Engineer Location: United States About Us: Join a leading organization at the forefront of innovation in AI, machine learning, and signal processing technologies. We are dedicated to pushing the boundaries of technology in radar and RF signals, video and image processing, and real-time embedded systems. Our...


  • Chicago, IL, United States Gateway Recruiting Full time

    JOB DESCRIPTION: The Principal Machine Learning Engineer reports to the Head of Technology, Digital Solutions. The Principal Machine Learning Engineer will closely work with cross-functional product teams and is responsible for designing, developing, and deploying state of the art data engineering techniques and streamlined data ingestion processes to...


  • Houston, TX, United States Trading Firm Full time

    The ML Platform is a critical component in increasing revenue at our firm. The role willrequire everything from high-level architecture design to performant implementation.Responsibilities include:• Build distributed systems supporting complex modeling over time series data• Lead design, development, and deployment of machine learning systems•...


  • Palo Alto, CA, United States AiDash AiDash Inc. Full time

    About AiDash AiDash is making critical infrastructure industries climate-resilient and sustainable with satellites and AI. Using our full-stack SaaS solutions, customers in electric, gas, and water utilities, transportation, and construction are transforming asset inspection and maintenance – and complying with biodiversity net gain mandates and carbon...


  • Austin, TX, United States AI Technologies LLC. Full time

    Job Overview Job ID: J36993 Specialized Area: Machine learning Job Title: Machine Learning Engineer Location: Austin, TX Duration: 11 Months Domain Exposure: Work Authorization: Client: To Be Discussed Later Employment Type: W-2 (Consultant must be on our company payroll. C2C is not allowed) The Machine Learning Engineer is responsible for designing and...


  • Dallas, TX, United States Cloud Analytics Technologies, LLC Full time

    Job Overview Job ID: J36993 Specialized Area: Machine Learning Engineer Location: To Be Discussed Later Duration: 6 Months Domain Exposure: Banking & Finance, Insurance, Education Work Authorization: Client Employment Type: W-2 (Consultant must be on our company payroll. C2C is not allowed) Demonstrate up-to-date knowledge in software engineering practices...


  • Houston, TX, United States Attis Full time

    AI Engineer - Graph Neural Networks (GNN)Hybrid in Houston, TX$100k-$170k annually, plus options packageIndustry regulations mean you must be a US Citizen, or Permanent US Resident to be eligibleAI Engineer required for a rapidly growing tech start-up, with a strong focus on Graph Neural Networks (GNN). They have recently secured significant government...


  • Dallas, TX, United States Ethereum Technologies LLC Full time

    Job Overview Job ID: J36993 Specialized Area: Machine Learning Engineer Location: To Be Discussed Later Duration: 6 Months Domain Exposure: Banking & Finance, Insurance, Education Work Authorization: W-2 (Consultant must be on our company payroll. C2C is not allowed) Demonstrate up-to-date knowledge in software engineering practices and provides...


  • Dallas, TX, United States Robotics Prcocess Automation, LLC Full time

    Job Overview Job ID: J36993 Specialized Area: Machine learning Job Title: Machine Learning Engineer Location: To Be Discussed Later Duration: 6 Months Domain Exposure: Not specified Work Authorization: Not specified Client: To Be Discussed Later Employment Type: W-2 (Consultant must be on our company payroll. C2C is not allowed) Builds and supports machine...


  • Dallas, TX, United States Automation Technologies LLC Full time

    Job Overview Job ID: J36993 Specialized Area: Machine learning Job Title: Machine Learning Engineer Location: To Be Discussed Later Duration: 6 Months Domain Exposure: To Be Discussed Later Work Authorization: To Be Discussed Later Client: To Be Discussed Later Employment Type: W-2 (Consultant must be on our company payroll. C2C is not allowed) Builds and...


  • Austin, TX, United States Statt Full time

    Role Description:We are seeking a Machine Learning Engineer with approximately 5 years of experience in the field. The ideal candidate will have a strong foundation in low-level machine learning skills, data science, and advanced AI techniques. You will be working with Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), and other...


  • Austin, TX, United States Robotics Prcocess Automation, LLC Full time

    W-2 Open Positions Need to be Filled Immediately. Consultant must be on our company payroll, Corp-to-Corp (C2C) is not allowed. Candidates encouraged to apply directly using this portal. We do not accept resumes from other company/ third-party recruiters. Job Overview Job ID: J36993 Specialized Area: Machine learning Job Title: Machine Learning...


  • Austin, TX, United States Kungfu Full time

    KUNGFU.AI is a management consulting and engineering firm focused exclusively on artificial intelligence. We empower CEOs and senior executives to leverage the full potential of AI so they remain competitive in a rapidly evolving world. Our expert team delivers AI strategy and bespoke production-grade solutions that allow clients to rapidly realize value. We...


  • Dallas, TX, United States Robotics Prcocess Automation, LLC Full time

    W-2 Open Positions Need to be Filled Immediately. Consultant must be on our company payroll, Corp-to-Corp (C2C) is not allowed. Candidates encouraged to apply directly using this portal. We do not accept resumes from other company/ third-party recruiters. Job Overview Job ID: J36993 Job Title: Machine Learning Engineer Location: To Be Discussed Later...


  • Dallas, TX, United States Robotics Technologies LLC Full time

    W-2 Open Positions Need to be Filled Immediately. Consultant must be on our company payroll, Corp-to-Corp (C2C) is not allowed. Candidates encouraged to apply directly using this portal. We do not accept resumes from other company/third-party recruiters. Job Overview Job ID: J36993 Specialized Area: Machine learning Job Title: Machine Learning Engineer...


  • Austin, TX, United States Ethereum Technologies LLC Full time

    The Machine Learning Engineer is responsible for designing and supporting machine learning systems within Revionics’ analytical pricing software. You will be responsible for developing an expert-level understanding of the core scientific capabilities of Revionics’ solutions and building production grade machine learning systems to augment these...


  • Dallas, TX, United States Cloud Analytics Technologies, LLC Full time

    W-2 Open Positions Need to be Filled Immediately. Consultant must be on our company payroll, Corp-to-Corp (C2C) is not allowed. Candidates encouraged to apply directly using this portal. We do not accept resumes from other company/third-party recruiters. Job Overview Job ID: J36993 Specialized Area: Machine learning Job Title: Machine Learning Engineer...


  • Austin, TX, United States Apple Full time

    Machine Learning Engineer Austin, Texas, United States Software and Services Imagine what you could do here! The people here at Apple don’t just create products — they build the kind of wonder that’s revolutionized entire industries. It’s the diversity of those people and their ideas that inspires the innovation that runs through everything we do,...


  • Dallas, TX, United States Quantum Technologies. LLC Full time

    W-2 Open Positions Need to be Filled Immediately. Consultant must be on our company payroll, Corp-to-Corp (C2C) is not allowed. Candidates encouraged to apply directly using this portal. We do not accept resumes from other company/third-party recruiters. Job Overview Specialized Area: Machine learning Job Title: Machine Learning Engineer Location: To Be...


  • Dallas, TX, United States Ethereum Technologies LLC Full time

    W-2 Open Positions Need to be Filled Immediately. Consultant must be on our company payroll, Corp-to-Corp (C2C) is not allowed. Candidates encouraged to apply directly using this portal. We do not accept resumes from other company/ third-party recruiters. Job Overview Job ID: J36993 Specialized Area: Machine learning Job Title: Machine Learning Engineer...