AI Infrastructure Architect

3 weeks ago


San Francisco, California, United States Recruiting from Scratch Full time

About Recruiting from Scratch

We are a talent firm focused on placing the best candidates for our clients. Our team is 100% remote and works with teams across North America, South America, and Europe to help them hire.

Our company is looking for a Senior ML Infrastructure Engineer to join our team in San Francisco, CA. As a Senior ML Infrastructure Engineer, you will be responsible for architecting and implementing large-scale, fault-tolerant systems.

The Role: We are seeking an experienced engineer to design and implement distributed systems for our inference network. You will work with our founders and engineering teams to drive architectural decisions and best practices.

What You'll Do:

  • Design and implement distributed systems for our inference network
  • Develop resource allocation models across heterogeneous hardware
  • Optimize network performance metrics (latency, throughput, availability)
  • Build robust monitoring and observability systems
  • Drive architectural decisions and best practices
  • Collaborate directly with founders and engineering teams

What You Bring:

  • 5+ years building high-performance, scalable distributed systems
  • Strong programming skills in TypeScript, Python, and either Go, Rust, or C++
  • Experience with Kubernetes/Nomad orchestration
  • Hands-on experience with AI tooling (ChatGPT, Claude, Cursor)
  • GPU programming and optimization skills (CUDA experience is a plus)
  • Startup experience (pre-seed to series A)

Location & Details:

  • San Francisco, CA (In-person)
  • Full-time W-2 position
  • $180K - $300K + Equity (0.1-3%) | Visa Sponsorship Available


  • San Francisco, California, United States Cambio AI Inc. Full time

    About Cambio AI Inc.We are a cutting-edge platform that enables the creation and deployment of AI workers to automate communication. Our innovative solution connects to any system or data source, handling phone calls, email, and messages with ease.Our primary focus is on the logistics industry, which relies heavily on communication for tasks such as booking,...


  • San Francisco, California, United States Naptha AI Full time

    We are seeking a highly skilled Distributed Infrastructure Architect to design and build the foundational systems that will power the next generation of AI agent networks at Naptha AI. This is a rare opportunity to shape the future of AI infrastructure at a massively ambitious scale.Naptha AI offers a unique chance to work on building the infrastructure for...


  • San Francisco, California, United States Naptha AI Full time

    About this RoleNaptha AI is seeking an exceptional AI Expert Architect to build and nurture relationships with frontier AI developers, shaping the future of AI agent development. This rare opportunity influences the future of AI agent infrastructure at a massively ambitious scale, backed by industry veterans and technical leaders, NVIDIA Inception, Google...


  • San Francisco, California, United States ZipRecruiter Full time

    Job Title:AI Infrastructure Systems ArchitectAbout the Role:We are seeking an experienced AI Infrastructure Systems Architect to design and build scalable infrastructure that supports AI workloads. The ideal candidate will have a deep understanding of cloud and on-premise infrastructure solutions and be able to optimize them for AI.Key...


  • San Francisco, California, United States Unreal Gigs Full time

    Unreal Gigs: AI Infrastructure Solutions ArchitectWe are seeking an experienced AI Infrastructure Solutions Architect to join our team at Unreal Gigs.About the Role:The successful candidate will design, deploy, and maintain the infrastructure that powers AI innovation. This role involves collaborating with data scientists, software engineers, and DevOps...


  • San Francisco, California, United States OpenAI Full time

    About the RoleWe are seeking an experienced AI Infrastructure Architect to join our team at OpenAI. As a key member of our Agent Infrastructure team, you will play a crucial role in designing and maintaining robust and secure systems that facilitate the training of next-gen AI models at a massive scale.The ideal candidate will have deep experience building...

  • Infrastructure Lead

    3 weeks ago


    San Francisco, California, United States Naptha AI Full time

    Naptha AI is looking for a talented Cloud-Scale Distributed Systems Engineer to lead the development of our AI infrastructure. You will be responsible for designing and implementing scalable infrastructure for massive agent networks, architecting systems for efficient agent communication and coordination, and building robust, distributed systems for agent...


  • San Francisco, California, United States Spellbrush Full time

    We are seeking a highly skilled AI Infrastructure Architect to join our team at Spellbrush, the world's leading generative AI studio.OverviewSpellbrush is passionate about creating high-quality anime games and pushing the boundaries of generative AI. Our goal is to enable millions of users to participate in an evolving creative movement.SalaryThe estimated...


  • San Francisco, California, United States Naptha AI Full time

    Company OverviewNaptha AI is a pre-seed company that aims to revolutionize AI agent infrastructure. Our team has deep expertise in AI and distributed systems, and we are looking for experienced technical leaders to help shape our technical strategy.SalaryWe offer a highly competitive salary, with the amount based on your experience and qualifications. The...


  • San Francisco, California, United States Unreal Gigs Full time

    Transformative AI Infrastructure LeaderWe are seeking an exceptional AI Infrastructure Architect to spearhead the development and implementation of cutting-edge AI infrastructure solutions. This visionary role demands a strategic leader who can harmonize technology advancements with business objectives, ensuring seamless integration of AI capabilities across...


  • San Francisco, California, United States ZipRecruiter Full time

    Job SummaryWe are seeking an experienced Cloud AI Infrastructure Architect to join our team at Storj. This is a remote opportunity based in the Bay Area, CA, and requires flexibility with working hours.About the RoleThe ideal candidate will have 4+ years of experience as a solution architect and a working knowledge of cloud and multi-cloud data strategies....

  • Technical Leader

    1 day ago


    San Francisco, California, United States Naptha AI Full time

    About this roleWe are seeking multiple former startup CTOs and technical co-founders to join Naptha AI in roles tailored to each individual's unique strengths and experiences. This is an opportunity to shape the future of AI agent infrastructure at a massive scale, backed by industry veterans and technical leaders.We're building foundational infrastructure...


  • San Francisco, California, United States Together AI Full time

    About the Role">We are seeking a highly skilled DevOps Engineer to join our team at Together AI. As an MLOps engineer, you will develop systems and APIs that enable our customers to perform inference and fine-tune LLMs.">Key Responsibilities">Implement runtime systems that perform inference at scale using AI/ML models from simple models up to the largest...


  • San Francisco, California, United States Cambio AI Inc. Full time

    About Cambio AI Inc.We are a cutting-edge platform to develop and deploy AI workers that automate communication, catering to the logistics industry. Our mission is to streamline communication processes, enhancing efficiency and productivity for freight brokers, 3PLs, freight forwarders, shippers, warehouses, and supply chain enterprises.Our innovative...


  • San Francisco, California, United States Hamming AI Full time

    Backend and Infrastructure EngineerWe are a fast-growing voice AI testing company that has seen significant revenue growth.As a Backend and Infrastructure Engineer, you will play a crucial role in scaling our current products and infrastructure to support 100x growth.Optimize and productize processes that humans currently handle, ensuring seamless...


  • San Mateo, California, United States Lumino Ai Full time

    About Lumino AiWe are a technology company that builds infrastructure enabling anyone to create AI models. Our mission is to unlock the power of AI for every human.


  • San Francisco, California, United States Revery AI Full time

    Revery AI is seeking a highly experienced Backend Infrastructure Architect to join our team. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining the company's backend infrastructure.Key Responsibilities:Design and implement scalable and secure backend infrastructureMigrate existing applications to the...


  • San Francisco, California, United States WEX, Inc. Full time

    About WEX, Inc.WEX is an innovative global commerce platform and payments technology company that aims to simplify the business of doing business for customers. We are on a mission to create a consistent world-class user experience across our products and services, leveraging customer-focused innovations in big data, AI, and Risk.We are looking for a highly...


  • San Francisco, California, United States Unreal Gigs Full time

    Overview:At Unreal Gigs, we're pioneers in leveraging machine learning to revolutionize industries. We're committed to building robust infrastructure that powers our machine learning models at scale. As a Senior AI Infrastructure Architect, you'll lead the design, development, and optimization of our machine learning infrastructure.Key...


  • San Francisco, California, United States Acceler8 Talent Full time

    About the Role: We are seeking an exceptional Senior AI Infrastructure Architect to join our innovative team at Acceler8 Talent, where human-computer collaboration is a reality. Our multidisciplinary team is dedicated to tackling complex, real-world AI challenges.Key Responsibilities: Design and implement scalable ML systems using state-of-the-art...