Machine Learning Infrastructure Architect

3 weeks ago


San Francisco, California, United States ZipRecruiter Full time

Job Title: Cloud Engineering Manager - AI

">
We are seeking a seasoned Cloud Engineering Manager with expertise in Artificial Intelligence and Machine Learning to lead our cloud infrastructure initiatives. As a Cloud Engineering Manager, you will oversee the design, development, and optimization of our cloud-based infrastructure solutions to support machine learning workflows.

">
About the Role:

">
  1. Technical Leadership: Provide strategic guidance, mentorship, and technical leadership to a team of cloud engineers, fostering a culture of excellence, innovation, and collaboration.
  2. Infrastructure Design: Lead the design and architecture of scalable and reliable cloud infrastructure solutions to support machine learning workflows, including data ingestion, model training, evaluation, and deployment.
  3. Data Pipeline Development: Lead the development and optimization of data pipelines to ingest, preprocess, and transform data for training machine learning models, ensuring data quality, integrity, and scalability.
  4. Model Training Infrastructure: Design and optimize infrastructure for training machine learning models at scale, leveraging distributed computing frameworks and accelerators for performance and efficiency.
  5. Model Deployment: Lead the design and implementation of systems for deploying and managing machine learning models in production environments, ensuring reliability, scalability, and real-time inference capabilities.
  6. Monitoring and Logging: Implement robust monitoring and logging solutions to track the performance and health of machine learning infrastructure and models, proactively identifying and resolving issues.
  7. Automation and Orchestration: Develop automation and orchestration tools to streamline machine learning workflows, reducing manual intervention and improving operational efficiency.
  8. Security and Compliance: Implement security controls across machine learning infrastructure and workflows to protect sensitive data and comply with data privacy regulations.
  9. Documentation and Best Practices: Define and promote best practices for cloud engineering and machine learning infrastructure, ensuring clear and comprehensive documentation to facilitate understanding and collaboration among team members.
  10. Collaboration: Collaborate closely with data scientists, machine learning engineers, and software developers to understand requirements and deliver infrastructure solutions that meet business needs.
  11. Mentorship and Development: Mentor and coach junior engineers, providing guidance, support, and opportunities for skill development and career growth, and foster a culture of continuous learning and improvement within the team.

What We Offer:

">
  • Estimated salary: $225,000 per year
  • A comprehensive benefits package, including health insurance, retirement plans, and wellness programs
  • Flexible work arrangements, including remote work options and flexible hours
  • Generous vacation and paid time off
  • Professional development opportunities, including access to training programs, conferences, and workshops
  • A state-of-the-art technology environment with access to cutting-edge tools and resources
  • A vibrant and inclusive company culture with opportunities for growth and advancement


  • San Francisco, California, United States Unreal Gigs Full time

    Company Overview: Welcome to Unreal Gigs, a cutting-edge company leveraging AI-driven innovation. We're pioneers in machine learning, committed to building robust infrastructure that powers our models at scale. We're seeking an experienced Senior Machine Learning Infrastructure Engineer to lead the design, development, and optimization of our machine learning...


  • San Francisco, California, United States Unreal Gigs Full time

    Company Overview: At Unreal Gigs, we're driving the future of AI innovation by building cutting-edge machine learning infrastructure. Our team is dedicated to developing robust and scalable systems that power our models at scale. Position Overview: As a Senior Machine Learning Infrastructure Engineer, you'll lead the design and development of our machine...


  • San Francisco, California, United States Unreal Gigs Full time

    At Unreal Gigs, we're on the cutting edge of AI-driven innovation. As a Senior Machine Learning Infrastructure Engineer, you'll lead the design, development, and optimization of our machine learning infrastructure. About the Role: You'll work on challenging projects, from building scalable data pipelines to deploying and managing machine learning models in...


  • San Francisco, California, United States Unreal Gigs Full time

    Company Overview: Welcome to Unreal Gigs, a pioneer in AI-driven innovation. We're dedicated to building robust infrastructure that powers machine learning models at scale. As a Senior Machine Learning Infrastructure Engineer, you'll lead the design, development, and optimization of our machine learning infrastructure. You'll work on challenging projects,...


  • San Francisco, California, United States Unreal Gigs Full time

    Unreal Gigs: We are looking for a highly skilled Machine Learning Infrastructure Architect to lead our MLOps strategy and build the backbone of our AI operations. About the Role: As a Machine Learning Infrastructure Architect, you will be responsible for designing and implementing scalable, secure, and efficient MLOps infrastructure that...


  • San Francisco, California, United States Unreal Gigs Full time

    Unreal Gigs Overview: Welcome to Unreal Gigs, a pioneering force in AI-driven innovation. We're committed to building robust infrastructure that powers our machine learning models at scale. Salary: $195,000 - $255,000 per year. Position Summary: We're seeking a seasoned Senior Machine Learning Infrastructure Engineer to lead the design, development, and...


  • San Francisco, California, United States OpenAI Full time

    We are seeking a visionary Machine Learning Infrastructure Architect to join our team at OpenAI in San Francisco, CA. This role involves designing and maintaining robust and secure systems that power the training and advanced use cases of next-gen AI models. You will work closely with researchers to enhance system capabilities and support experimental and...


  • San Francisco, California, United States CentML Full time

    We are seeking a highly skilled and motivated Machine Learning Infrastructure Architect to join our team at CentML. In this role, you will play a crucial part in designing and building the CentML platform, a cost-effective infrastructure for serving and training large-scale machine learning models. Responsibilities: Taking part in the design and development of...


  • San Francisco, California, United States ZipRecruiter Full time

    Company Overview: Welcome to ZipRecruiter, a leading online job search platform. At our company, we're passionate about using artificial intelligence and machine learning to connect job seekers with their dream jobs. Our mission is to develop innovative solutions that empower our users to find the right job opportunities and advance their careers. Position...


  • San Francisco, California, United States ZipRecruiter Full time

    About the Role: As an experienced Machine Learning Systems Engineer at Abridge, you will play a pivotal role in scaling and deploying machine learning models to handle increasing traffic demands. Your primary responsibility will be to architect, design, and implement scalable infrastructure that not only supports current deployments but also lays the...


  • San Francisco, California, United States ZipRecruiter Full time

    Job Overview: We're seeking a highly skilled Machine Learning Infrastructure Solutions Specialist to join our team at ZipRecruiter. As a key member of our infrastructure engineering team, you will play a crucial role in designing, building, and optimizing our machine learning infrastructure to support the needs of our organization. Key Responsibilities: Design...


  • San Francisco, California, United States BaseTen Labs, Inc. Full time

    About Baseten: We're a cutting-edge technology company that's pushing the boundaries of machine learning infrastructure. Our mission is to empower organizations to harness the power of AI and ML by providing robust, scalable, and secure solutions. The Role: We're seeking an exceptional Infrastructure Architect Lead to spearhead our efforts in designing,...


  • South San Francisco, California, United States Genentech Full time

    About the Role: We're looking for a Machine Learning Infrastructure Lead to join our team at Genentech Computational Sciences. As a key member of our Prescient Design group, you'll play a leading role in developing and maintaining large-scale machine learning models and infrastructure. About the Responsibilities: This role involves: Contributing to cutting-edge...


  • San Francisco, California, United States Unreal Gigs Full time

    Company Overview: At Unreal Gigs, we're at the forefront of AI-driven innovation. We're committed to building robust infrastructure that powers our machine learning models at scale.


  • San Francisco, California, United States Sentry Full time

    About the Role: As a Senior Machine Learning Systems Engineer at Sentry, you will play a pivotal role in shaping the company's AI/ML landscape. Your primary responsibility will be to design and build the core infrastructure required for developing, evaluating, deploying, and iterating on models and pipelines at scale. This position is crucial as it involves...


  • San Francisco, California, United States Atlassian Full time

    Job Overview: We are seeking a Senior Principal Machine Learning Engineer to join our Central AI organization. The Central AI team develops the foundational infrastructure, data pipelines, frameworks, models, and other capabilities to expedite AI feature development throughout Atlassian. Your Future Team: Central AI's mission is to accelerate AI innovation across...


  • San Francisco, California, United States Perplexity AI Full time

    We're on a mission to revolutionize search with our AI-powered answer engine. As a Machine Learning Solution Architect, you'll play a crucial role in designing and implementing machine learning solutions to enhance user experience. About the Role: As a key member of our team, you'll focus on delivering high-quality machine learning solutions to address complex...


  • San Francisco, California, United States Recruiting from Scratch Full time

    Machine Learning Infrastructure Specialist: We are scaling our inference systems to handle millions of LLM requests daily, requiring exceptional talent to drive growth. This role involves designing and implementing large-scale, fault-tolerant systems for AI infrastructure. Key responsibilities include: Architecting distributed systems for our inference...


  • San Jose, California, United States TikTok Full time

    About TikTok: TikTok is the leading destination for short-form mobile video, inspiring creativity and bringing joy to over 1 billion users. Our global offices span across Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul, and Tokyo. Why Work with Us: We're a diverse and inclusive community that celebrates unique perspectives. Our...


  • San Diego, California, United States Diverse Lynx Full time

    Job Description: We are seeking a skilled Machine Learning Architect to lead the development of innovative AI solutions. The successful candidate will have a minimum of 10 years of industry experience and a strong background in AI/ML modeling, Python, and AI/ML infrastructure. Our ideal candidate will be based in San Diego, CA, and will work closely with...