Cloud AI Architect

2 weeks ago


San Francisco, California, United States | Crusoe | Full time
Crusoe - Building the Future of Cloud Infrastructure

We are pioneering vertically integrated, purpose-built AI infrastructure solutions trusted by Fortune 500 companies to power their most advanced AI applications.

Crusoe is redefining AI cloud infrastructure with a mission to align the future of computing with the future of the climate. Our AI platform is recognized as the "gold standard" for reliability and performance.

We are seeking an experienced Senior/Staff Software Engineer to lead the design and implementation of core systems for our AI services, including resilient fault-tolerant queues, model catalogs, and scheduling mechanisms optimized for cost and performance.

This role gives you the opportunity to build and scale infrastructure capable of handling millions of API requests per second across thousands of customers.

About the Role

You will have a pivotal role in shaping the architecture and scalability of our next-generation AI inference platform. As part of a dynamic, fast-growing team, you will collaborate cross-functionally, influence the long-term vision of the platform, and contribute to cutting-edge AI technologies.

This is a unique opportunity to build a high-performance AI product that will be central to Crusoe's business growth.

A Day in the Life
  • You will play a crucial role in building the infrastructure to serve artificial neural networks and large language models (LLMs) at scale.
  • You will own the design and implementation of key subsystems for resiliency and quality of service.
  • You will work closely with cross-functional teams, including product management and business strategy, to develop a customer-facing API that serves real-world AI models.
Your Skills and Qualifications
  • Advanced degree in Computer Science, Engineering, or a related field.
  • Demonstrable experience in distributed systems design and implementation.
  • Proven track record of delivering early-stage projects under tight deadlines.
  • Expertise in using cloud-based services such as elastic compute, object storage, virtual private networks, and managed databases.
  • Experience with Generative AI (Large Language Models, Multimodal).
  • Experience with container orchestration (e.g., Kubernetes) and microservices architectures.
  • Experience using REST APIs and common communication protocols, such as gRPC.
  • Demonstrated experience across the software development life cycle and familiarity with CI/CD tools.
Growth Opportunities
  • Shape the foundation of a cutting-edge, customer-facing AI inference platform.
  • Become a technical leader in performance optimization and AI infrastructure.
  • Collaborate with partners like Intel and NVIDIA on pushing the limits of AI performance.
  • Contribute to open-source AI frameworks and gain visibility in the AI community.
  • Take on leadership roles as the team scales, with opportunities to mentor junior engineers and influence the product roadmap.
Benefits
  • Hybrid work schedule.
  • Industry-competitive pay ($183,000 - $250,000).
  • Restricted Stock Units in a fast-growing, well-funded technology company.
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents.
  • Employer contributions to HSAs.
  • Paid Parental Leave.
  • Paid life insurance, short-term and long-term disability.
  • Teladoc.
  • 401(k) with a 100% match up to 4% of salary.
  • Generous paid time off and holiday schedule.
  • Cell phone reimbursement.
  • Tuition reimbursement.
  • Subscription to the Calm app.
  • MetLife Legal.
  • Company-paid commuter benefit: $50 per pay period.
