Lead ML Operations Engineer

2 weeks ago


San Francisco, United States Unreal Gigs Full time
Job DescriptionJob Description

Company Overview: Welcome to the forefront of machine learning operations (MLOps) Our company is dedicated to leveraging the power of machine learning to drive innovation and transform industries. We're committed to developing cutting-edge ML solutions that deliver real-world impact and value to our customers. Join us and lead our team in shaping the future of MLOps.

Position Overview: As the Lead ML Operations Engineer, you'll be responsible for leading our MLOps efforts and driving the design, implementation, and optimization of infrastructure and processes for deploying, monitoring, and managing machine learning models at scale. You'll lead a team of talented engineers and collaborate closely with data scientists, software engineers, and DevOps teams to streamline the machine learning lifecycle and ensure reliable and efficient model operations. If you're a seasoned engineer with a passion for machine learning and a track record of designing and implementing MLOps solutions, we want you on our team.

Requirements

Key Responsibilities:

  1. Technical Leadership: Lead and mentor a team of ML Operations Engineers, providing guidance, direction, and support in driving MLOps innovation and execution.
  2. Infrastructure Design: Design and implement scalable and reliable infrastructure for deploying and serving machine learning models, leveraging cloud platforms and containerization technologies.
  3. Model Deployment: Develop automated pipelines for deploying machine learning models into production environments, ensuring consistency, reliability, and reproducibility.
  4. Monitoring and Alerting: Implement monitoring and alerting systems to track model performance, data drift, and other metrics, enabling proactive detection and mitigation of issues.
  5. Model Versioning and Management: Establish version control and management processes for machine learning models, enabling easy tracking, rollback, and experimentation.
  6. Continuous Integration/Continuous Deployment (CI/CD): Implement CI/CD pipelines for automating model training, testing, and deployment, reducing time to market and improving agility.
  7. Scalability and Efficiency: Optimize the performance and scalability of machine learning infrastructure, leveraging techniques such as distributed computing, parallelization, and resource management.
  8. Security and Compliance: Ensure machine learning systems comply with security and privacy standards, implementing access controls, encryption, and other security measures as needed.
  9. Documentation and Best Practices: Document MLOps processes, best practices, and standards, providing guidance and training to data scientists and engineers.
  10. Collaboration: Collaborate with cross-functional teams, including data scientists, software engineers, and DevOps teams, to streamline the machine learning lifecycle and drive continuous improvement.
  11. Research and Innovation: Stay informed about the latest advancements in MLOps tools and technologies, exploring innovative approaches and techniques to enhance machine learning operations.


Qualifications:

  • Bachelor's degree or higher in Computer Science, Engineering, Mathematics, or related field.
  • 7+ years of experience in software engineering, DevOps, or related roles, with a focus on building and maintaining infrastructure for machine learning operations.
  • Leadership experience, with a demonstrated ability to lead and mentor a team of engineers.
  • Strong understanding of machine learning concepts and techniques, with experience working with data science teams and machine learning models.
  • Proficiency in programming languages such as Python, Java, or Scala, and experience with cloud platforms such as AWS, Azure, or Google Cloud Platform.
  • Experience with containerization technologies such as Docker and orchestration tools such as Kubernetes.
  • Familiarity with machine learning frameworks and libraries such as TensorFlow, PyTorch, scikit-learn, or MLflow.
  • Experience with CI/CD pipelines, version control systems, and automation tools such as Jenkins, GitLab, or CircleCI.
  • Strong problem-solving skills and analytical thinking, with the ability to troubleshoot complex issues and optimize system performance.
  • Excellent communication and collaboration skills, with the ability to work effectively in cross-functional teams and communicate technical concepts to non-technical stakeholders.

Benefits

  • Competitive salary: The industry standard salary for Lead ML Operations Engineers typically ranges from $150,000 to $250,000 per year, depending on experience and qualifications.
  • Comprehensive health, dental, and vision insurance plans.
  • Flexible work hours and remote work options.
  • Generous vacation and paid time off.
  • Professional development opportunities, including access to training programs, conferences, and workshops.
  • State-of-the-art technology environment with access to cutting-edge tools and resources.
  • Vibrant and inclusive company culture with opportunities for growth and advancement.
  • Exciting projects with real-world impact at the forefront of MLOps innovation.


Join Us:
Ready to lead the charge in MLOps innovation? Apply now to join our team and shape the future of machine learning operations



  • San Francisco, CA, United States Unreal Gigs Full time

    Company Overview: Welcome to the forefront of machine learning operations (MLOps)! Our company is dedicated to leveraging the power of machine learning to drive innovation and transform industries. We're committed to developing cutting-edge ML solutions that deliver real-world impact and value to our customers. Join us and lead our team in shaping the...


  • San Francisco, United States Unreal Gigs Full time

    Job DescriptionJob DescriptionCompany Overview: Welcome to the forefront of machine learning operations! At our company, we're driving the next wave of AI revolution through cutting-edge ML operations technologies. Our mission is to develop scalable and reliable ML systems that empower businesses and revolutionize industries. Join us and be part of a...


  • San Francisco, United States Unreal Gigs Full time

    Job DescriptionJob DescriptionCompany Overview: Welcome to the forefront of machine learning operations (MLOps)! Our company is dedicated to leveraging the power of machine learning to drive innovation and transform industries. We're committed to developing cutting-edge ML solutions that deliver real-world impact and value to our customers. Join us and...


  • San Francisco, CA, United States Unreal Gigs Full time

    Company Overview: Welcome to the forefront of machine learning operations! At our company, we're driving the next wave of AI revolution through cutting-edge ML operations technologies. Our mission is to develop scalable and reliable ML systems that empower businesses and revolutionize industries. Join us and be part of a dynamic team committed to...


  • San Francisco, CA, United States Twelve Labs Full time

    Who we are We’re a fast-moving, diverse team pushing the frontiers of artificial intelligence. At Twelve Labs, our mission is to help developers build programs that can see, listen, and understand the world as we do by bringing the world’s most powerful video understanding infrastructure to market. As a part of achieving this mission, we are building...


  • San Francisco, United States Unreal Gigs Full time

    Job DescriptionJob DescriptionCompany Overview: Welcome to the forefront of artificial intelligence and machine learning innovation! Our company is dedicated to leveraging the power of data science to drive transformative change and solve complex problems across industries. We're committed to developing cutting-edge AI and ML solutions that push the...


  • San Francisco, CA, United States Robust Intelligence Inc. Full time

    Robust Intelligence 's mission is to eliminate AI Risk. As the world increasingly adopts AI into automated decision processes, we inherit great risk. Our flagship product is built to be integrated with existing AI systems to enumerate and eliminate risks caused by unintentional and intentional (adversarial) failure modes. Our Generative AI Firewall...


  • San Francisco, CA, United States Unreal Gigs Full time

    Job Description Job Description Company Overview: Welcome to the forefront of artificial intelligence and machine learning innovation! Our company is dedicated to leveraging the power of data science to drive transformative change and solve complex problems across industries. We're committed to developing cutting-edge AI and ML solutions that push the...


  • San Francisco, CA, United States Unreal Staffing, Inc Full time

    Company Overview: Welcome to the forefront of artificial intelligence and machine learning innovation! Our company is dedicated to leveraging the power of data science to drive transformative change and solve complex problems across industries. We're committed to developing cutting-edge AI and ML solutions that push the boundaries of what's...


  • San Francisco, CA, United States Twelve Labs Full time

    Who we are We’re a fast-moving, diverse team pushing the frontiers of artificial intelligence. At Twelve Labs, our mission is to help developers build programs that can see, listen, and understand the world as we do by bringing the world’s most powerful video understanding infrastructure to market. As a part of achieving this mission, we are building...


  • San Francisco, United States Unreal Gigs Full time

    Job DescriptionJob DescriptionCompany Overview: Welcome to the forefront of machine learning operations (MLOps)! Our company is dedicated to harnessing the power of machine learning to drive innovation and transform industries. We're committed to developing cutting-edge ML solutions that deliver real-world impact and value to our customers. Join us and...

  • Senior ML Engineer

    2 months ago


    San Francisco, United States Cleanlab Full time

    At Cleanlab you willPioneer novel software systems for the rapidly growing field of data-centric AI. Our tools enable data scientists/engineers (across all industries) to effectively diagnose/fix issues in their datasets thus improving the quality of their business’s core asset.Determine how to best leverage new Generative AI advances/infrastructure for...


  • San Francisco, California, United States Weights & Biases Full time

    At Weights & Biases, our mission is to build the best developer tools for machine learning. Weights & Biases is a series C company with $250 million in funding and a rapidly growing user base. Our platform is an essential piece of the daily work for machine learning engineers, from academic research institutions like FAIR and UC Berkeley to massive...


  • San Francisco, United States BayOne Solutions Full time

    This is an opportunity for a Machine Learning Engineering Manager to come in and drive Data Science and ML initiatives for the enterprise. Our Client continues to inspire our loyal customers in beauty space and AI/ML is redefining the way we inspire our customers.Some exciting initiatives in action:Generative AI use cases to help our customers discover...


  • San Francisco, United States BayOne Solutions Full time

    This is an opportunity for a Machine Learning Engineering Manager to come in and drive Data Science and ML initiatives for the enterprise. Our Client continues to inspire our loyal customers in beauty space and AI/ML is redefining the way we inspire our customers.Some exciting initiatives in action:Generative AI use cases to help our customers discover...


  • San Francisco, United States BayOne Solutions Full time

    This is an opportunity for a Machine Learning Engineering Manager to come in and drive Data Science and ML initiatives for the enterprise. Our Client continues to inspire our loyal customers in beauty space and AI/ML is redefining the way we inspire our customers.Some exciting initiatives in action:Generative AI use cases to help our customers discover...

  • AI/ML Ops Engineer

    3 days ago


    San Francisco, CA, United States Advocate Full time

    Advocate is a mission-driven technology company revolutionizing the way Americans access critical federal benefits. Our cutting-edge AI platform streamlines the application process, ensuring that every submission is complete, optimized, and tailored to the specific requirements of each federal program. Our innovative technology not only simplifies the...


  • San Francisco, CA, United States Understanding Recruitment Inc Full time

    ML Performance Engineer Join Our Team as an ML Performance Engineer Are you ready to pioneer the future of AI, making it private, convenient, and profitable for all? At our company, we're on a mission to empower developers and enterprises worldwide by migrating inference to user devices and supercharging existing on-device inference. We believe this...


  • San Francisco, United States Stripe Full time

    Who we are About Stripe Stripe is a financial infrastructure platform for businesses. Millions of companies-from the world's largest enterprises to the most ambitious startups-use Stripe to accept payments, grow their revenue, and accelerate new business opportunities. Our mission is to increase the GDP of the internet, and we have a staggering amount of...


  • San Francisco, CA, United States X Corp. Full time

    Are you prepared to join the X team and help build the ultimate real-time information-sharing app, revolutionizing how people connect? At X, we're on a mission to become a trusted global digital public square, committed to minimal censorship within legal boundaries. Our goal is to empower every user to freely create and share ideas, fostering open public...