Cloud-Scale AI Infrastructure Lead

1 week ago


San Francisco, California, United States Unreal Gigs Full time
About the Role

Unreal Gigs is seeking a seasoned expert in cloud-scale AI infrastructure to lead our team of engineers. As a Cloud-Scale AI Infrastructure Lead, you will be responsible for designing and architecting scalable and reliable infrastructure solutions to support machine learning workflows.

Our ideal candidate has 5+ years of experience in infrastructure engineering, with a focus on machine learning infrastructure. They possess strong programming skills in languages such as Python, Java, or Scala, and have experience with distributed computing frameworks like Apache Spark or TensorFlow.

The successful candidate will also have expertise in cloud platforms such as AWS, Azure, or Google Cloud Platform, and services like AWS SageMaker, Azure Machine Learning, or Google AI Platform. They will be able to collaborate effectively with cross-functional teams, communicate technical concepts to non-technical stakeholders, and mentor junior engineers.

Responsibilities
  • Design and Architecture: Design scalable and reliable infrastructure solutions to support machine learning workflows, including data ingestion, model training, evaluation, and deployment.
  • Data Pipeline Development: Develop and maintain data pipelines to ingest, preprocess, and transform data for training machine learning models.
  • Model Training Infrastructure: Build and optimize infrastructure for training machine learning models at scale, leveraging distributed computing frameworks and accelerators.
  • Model Deployment: Design and implement systems for deploying and managing machine learning models in production environments.
  • Monitoring and Logging: Implement monitoring and logging solutions to track the performance and health of machine learning infrastructure and models.

We offer a competitive salary range of $170,000 - $230,000 per year, depending on experience and qualifications. Our benefits package includes comprehensive health, dental, and vision insurance plans, flexible work hours, remote work options, generous vacation and paid time off, professional development opportunities, and a state-of-the-art technology environment.


  • Infrastructure Lead

    7 days ago


    San Francisco, California, United States Naptha AI Full time

    Naptha AI is looking for a talented Cloud-Scale Distributed Systems Engineer to lead the development of our AI infrastructure. You will be responsible for designing and implementing scalable infrastructure for massive agent networks, architecting systems for efficient agent communication and coordination, and building robust, distributed systems for agent...

  • AI Engineering Lead

    2 weeks ago


    San Francisco, California, United States Scale AI Full time

    Company OverviewAbout Scale AI:We are a leading AI data foundry, accelerating the development of AI applications. Our mission is to make AI accessible to every organization across industries. We empower businesses to build and deploy AI at scale.At Scale AI, we believe in the transformative power of AI. Our team is dedicated to making AI more accessible,...


  • San Francisco, California, United States Scale AI Full time

    About UsScale AI is transforming how organizations build and deploy AI. We power the world's most advanced LLMs, generative models, and computer vision models. Our products are trusted by top companies like OpenAI, Meta, and Microsoft.Job Description:We are seeking a highly skilled Cloud Native Machine Learning Expert to join our team. As a key member, you...


  • San Francisco, California, United States Scale AI Full time

    About ScaleScale AI is a pioneering company that's revolutionizing the way organizations build and deploy AI. With a strong mission to accelerate AI development, we provide innovative data solutions that fuel the most exciting advancements in AI.Our team is committed to making AI more accessible, powering the world's most advanced LLMs, generative models,...

  • AI Development Lead

    3 weeks ago


    San Francisco, California, United States Scale AI Full time

    About ScaleAt Scale AI, we are dedicated to accelerating the development of AI applications.We believe that the transition from traditional software to AI is one of the most significant shifts of our time.Our mission is to make this happen faster across every industry, and our team is revolutionizing how organizations build and deploy AI.About Data EngineOur...


  • San Francisco, California, United States Scale AI Full time

    Job OverviewWe are seeking a highly skilled and experienced AI Model Development Manager to lead our Generative AI team at Scale AI. As the primary point of contact for this role, you will be responsible for managing a team of research engineers and ML engineers focused on delivering scalable, production-ready solutions to support our GenAI Data...

  • AI Research Scientist

    1 month ago


    San Francisco, California, United States Scale AI, Inc. Full time

    About the RoleScale AI, Inc. is seeking a highly skilled AI Research Scientist to drive the development of our generative AI products. As a key member of our data science team, you will lead the charge in building and refining our AI infrastructure, leveraging your expertise to advance the state-of-the-art in machine learning and artificial...


  • San Francisco, California, United States Scale AI Full time

    Unlock the Future of AI with Scale AIWe're pushing the boundaries of artificial intelligence at Scale AI, and we need your expertise to make it happen. As a key member of our Safety and Evaluation Lab (SEAL), you'll play a vital role in developing cutting-edge evaluation products and tackling complex research problems.About the RoleAs an AI Safety...


  • San Francisco, California, United States WEX, Inc. Full time

    About WEX, Inc.WEX is an innovative global commerce platform and payments technology company that aims to simplify the business of doing business for customers. We are on a mission to create a consistent world-class user experience across our products and services, leveraging customer-focused innovations in big data, AI, and Risk.We are looking for a highly...


  • San Francisco, California, United States Crusoe Full time

    Transformative Cloud Infrastructure Solutions at CrusoeCrusoe is redefining AI cloud infrastructure with a mission to align the future of computing with the future of the climate. Our data centers are optimized for AI workloads and powered by clean, renewable energy. As a Senior Software Engineer, you'll partner with the broader engineering organization to...


  • San Francisco, California, United States Together AI Full time

    About the Role">We are seeking a highly skilled DevOps Engineer to join our team at Together AI. As an MLOps engineer, you will develop systems and APIs that enable our customers to perform inference and fine-tune LLMs.">Key Responsibilities">Implement runtime systems that perform inference at scale using AI/ML models from simple models up to the largest...


  • San Francisco, California, United States Scale AI Full time

    About ScaleAt Scale AI, our mission is to accelerate the development of AI applications. For years, we've been leading the way in AI data foundry, helping fuel exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent Series F round, we're accelerating the abundance of frontier data to pave the road...

  • AI Data Design Lead

    2 weeks ago


    San Francisco, California, United States Scale AI Full time

    About UsAt Scale AI, our mission is to accelerate the development of AI applications. With our recent Series F round, we're accelerating the abundance of frontier data to pave the road to Artificial General Intelligence (AGI). Our products power the world's most advanced LLMs, generative models, and computer vision models.Job OverviewWe are seeking a...


  • San Francisco, California, United States Scale AI Full time

    About the PositionScale AI is seeking an experienced Applied Machine Learning Director to lead our Generative AI Applied ML team. The ideal candidate will have a strong background in deep learning and natural language processing, with experience managing teams and developing production-ready solutions. This role is critical for designing and executing a...

  • AI Researcher

    2 weeks ago


    San Francisco, California, United States Scale AI Full time

    About Scale AI">We are a leading company in the field of artificial intelligence, dedicated to making AI accessible and transparent. Our mission is to accelerate the development of AI applications across various industries.At Scale AI, we believe that everyone should be able to bring their whole selves to work. We are an affirmative action employer and an...


  • San Francisco, California, United States WEX Full time

    Overview:Achieve technical excellence in AI infrastructure development with WEX, a leading global commerce platform and payments technology company. We're seeking an experienced Staff Cloud Engineer to spearhead our AI infrastructure initiatives, leveraging cloud-based solutions and cutting-edge technologies.About the Role:This is an exceptional opportunity...


  • San Francisco, California, United States Recruiting from Scratch Full time

    Cloud AI Engineer LeadRecruiting from Scratch is seeking a Cloud AI Engineer Lead to scale our inference systems, handling millions of LLM requests daily.Key responsibilities include:Designing and implementing large-scale, fault-tolerant systems for AI infrastructure.Architecting and implementing distributed systems for our inference network.Developing...


  • San Francisco, California, United States Scale AI Full time

    About ScaleMission and VisionAt Scale AI, our mission is to accelerate the development of AI applications that transform industries. We are committed to making this happen faster across every sector, and our team is pioneering how organizations build and deploy AI.Job OverviewWe're looking for a highly skilled Senior Product Designer to lead the end-to-end...


  • San Francisco, California, United States Scale AI Full time

    About the RoleAs the Generative AI Technology Lead, you will be responsible for leading a team of research engineers and ML engineers in developing and implementing cutting-edge Generative AI technologies. Your primary focus will be on delivering scalable, production-ready solutions that power Scale's GenAI Data Engine, including rater-assistant models, LLM...


  • San Francisco, California, United States Naptha AI Full time

    Company OverviewNaptha AI is a pre-seed company that aims to revolutionize AI agent infrastructure. Our team has deep expertise in AI and distributed systems, and we are looking for experienced technical leaders to help shape our technical strategy.SalaryWe offer a highly competitive salary, with the amount based on your experience and qualifications. The...