AI Infrastructure Specialist

22 hours ago


San Francisco, California, United States Unreal Gigs Full time

Job Summary

">

We are seeking an experienced AI Infrastructure Specialist to join our team at Unreal Gigs. As an AI Infrastructure Specialist, you will play a critical role in designing, building, and maintaining the infrastructure that supports machine learning and AI workloads.

The ideal candidate will have deep expertise in cloud platforms, containerization, and automation tools. You will work closely with data scientists and engineers to ensure that AI systems run efficiently, securely, and at scale.

Key Responsibilities:

  • Design and Build AI Infrastructure:
    • Architect and implement scalable infrastructure that supports AI workloads, including machine learning model training, large-scale data processing, and real-time inference.
  • Support AI Model Development and Deployment:
    • Collaborate with data scientists and engineers to build pipelines that automate the end-to-end machine learning lifecycle, from data ingestion to model training, deployment, and monitoring.
  • Optimize AI Workloads for Performance:
    • Implement strategies to optimize compute resources for AI workloads, including GPU/TPU provisioning, memory management, and parallel processing.
  • Cloud and On-Premise Infrastructure Management:
    • Manage cloud-based AI platforms (AWS, GCP, Azure) as well as on-premise infrastructure for AI development.
  • Automation and Continuous Integration/Deployment (CI/CD):
    • Implement and maintain CI/CD pipelines for machine learning models to enable rapid experimentation, testing, and deployment.
  • Security and Compliance:
    • Ensure that the AI infrastructure complies with security best practices and regulatory requirements.
  • Monitor and Troubleshoot AI Infrastructure:
    • Continuously monitor the health and performance of AI infrastructure, identifying bottlenecks, reducing latency, and troubleshooting issues.

Requirements:

  • AI Infrastructure Expertise: Deep experience in designing and building infrastructure that supports AI and machine learning workloads.
  • Cloud Platforms and Tools: Strong experience with cloud platforms like AWS, GCP, or Azure, particularly with AI services and infrastructure management.
  • Automation and DevOps: Expertise in automating infrastructure provisioning and model deployment using tools such as Terraform, Ansible, Jenkins, or GitLab CI.
  • GPU/TPU Optimization: Hands-on experience with GPU/TPU optimization for machine learning and deep learning tasks.
  • Security and Compliance: Strong understanding of security best practices, including data encryption, access management, and compliance with regulations like GDPR and HIPAA.

Salary Estimate: $140,000 - $170,000 per year, depending on experience.

Benefits:

  • Comprehensive Medical, Dental, and Vision Insurance Plans
  • Competitive Vacation, Sick Leave, and 20 Paid Holidays per Year
  • Flexible Work Schedules and Telecommuting Options
  • Opportunities for Training, Certification Reimbursement, and Career Advancement Programs
  • Access to Wellness Programs, Including Gym Memberships, Health Screenings, and Mental Health Resources
  • Life and Disability Insurance
  • Employee Assistance Program (EAP)
  • Tuition Reimbursement
  • Community Engagement Opportunities
  • Employee Recognition Programs


  • San Francisco, California, United States Naptha AI Full time

    About Naptha AIWe are seeking exceptional Software Engineering interns to join Naptha AI and contribute to building the future of AI agent infrastructure.This internship offers hands-on experience working with frontier AI technology, backed by industry veterans and technical leaders through NVIDIA Inception, Google for Startups, and Microsoft for Startups.As...


  • San Francisco, California, United States Together AI Full time

    Company Overview:At Together AI, we believe open and transparent AI systems will drive innovation and create the best outcomes for society. Our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama.Job Description:We are seeking an experienced MLOps engineer to develop systems and APIs that enable our customers...


  • San Francisco, California, United States ZipRecruiter Full time

    Job DescriptionWe're looking for a highly skilled Ai Infrastructure Specialist to join our team of engineers and data scientists. As an AI Infrastructure Specialist, you'll play a key role in designing, building, and optimizing our AI infrastructure to support the needs of our organization.About the RoleDesign and Build Infrastructure: Design and build...


  • San Francisco, California, United States Scale AI Full time

    Cloud AI Engineer Position at ScaleWe are seeking an experienced Cloud AI Engineer to join our team at Scale, a leading provider of AI solutions. As a Cloud AI Engineer, you will play a key role in designing and developing our cloud infrastructure platforms and systems.The ideal candidate will have extensive experience in software development and a deep...


  • San Mateo, California, United States Lumino Ai Full time

    About UsLumino is a leading provider of AI infrastructure solutions. We're passionate about empowering humans to unlock the potential of AI. Our mission is to create a world where AI is accessible to everyone.We're looking for a talented Machine Learning Engineer to join our team. As a key member of our engineering team, you will be responsible for designing...


  • San Francisco, California, United States Magic AI Full time

    Company OverviewMagic AI is a cutting-edge technology company dedicated to building safe Artificial General Intelligence (AGI) that accelerates humanity's progress on the world's most important problems.We believe the most promising path to safe AGI lies in automating research and code generation to improve models and solve alignment more reliably than...


  • San Francisco, California, United States Together AI Full time

    Are you a skilled DevOps engineer looking to take your career to the next level? Do you have a passion for designing and building automated infrastructure pipelines? We are seeking a talented Senior DevOps Engineer to join our cloud engineering team at Together AI. About the RoleWe are hiring a highly experienced Senior DevOps Engineer to lead the...


  • San Francisco, California, United States Unreal Gigs Full time

    Job OverviewWe are seeking a highly skilled AI Infrastructure Specialist to join our team at Unreal Gigs. As an AI Infrastructure Specialist, you will be responsible for designing, building, and managing scalable infrastructure for machine learning workloads.The ideal candidate will have strong experience with cloud platforms such as AWS, GCP, or Azure, and...


  • San Mateo, California, United States Lumino Ai Full time

    About Lumino AiWe are a technology company that builds infrastructure enabling anyone to create AI models. Our mission is to unlock the power of AI for every human.


  • San Francisco, California, United States Abridge AI Inc. Full time

    Abridge AI Inc. is a pioneering force in healthcare technology, utilizing artificial intelligence to empower deeper understanding and improve clinical documentation efficiency.Role OverviewWe are seeking an exceptional ML Systems Engineer to join our team, responsible for scaling and deploying machine learning models to handle increasing traffic demands and...


  • San Mateo, California, United States Lumino Ai Full time

    An exciting opportunity awaits at Lumino, where you'll have the chance to shape the future of AI infrastructure. As a software engineer, you'll work on designing, building, and maintaining systems that enable AI model creation. With a focus on scalability and reliability, you'll drive innovation and growth. Our team is collaborative and cross-functional,...


  • San Francisco, California, United States Abridge AI Inc. Full time

    Abridge AI Inc. is a trailblazing organization that empowers deeper understanding in healthcare through innovative AI solutions. Our mission-driven approach has led to the development of industry-leading natural language understanding products.Job OverviewWe are seeking a highly skilled Software Engineering Infrastructure Specialist to join our growing team...


  • San Francisco, California, United States WEX, Inc. Full time

    About WEX, Inc.We're a global commerce platform and payments technology company forging the way in a rapidly changing environment. Our mission is to simplify the business of doing business for customers, freeing them to focus on what matters most. We're committed to building a consistent world-class user experience across our products and services,...


  • San Francisco, California, United States Unreal Gigs Full time

    Design and Build AI InfrastructureArchitect and implement scalable infrastructure that supports AI workloads, including machine learning model training, large-scale data processing, and real-time inference.As an AI Infrastructure Engineer, you'll design solutions that ensure high availability, fault tolerance, and performance optimization.


  • San Francisco, California, United States WEX, Inc. Full time

    About the RoleWe are seeking a highly skilled AI Cloud Infrastructure Specialist to join our team at WEX, Inc. This is an exciting opportunity to work on building and maintaining robust, scalable, and secure cloud infrastructure that powers our AI and machine learning initiatives.Your ResponsibilitiesYou will collaborate with partners and stakeholders to...


  • San Francisco, California, United States Perplexity AI Full time

    Perplexity AI is a leading innovator in conversational AI technology.SalaryThe estimated annual salary for this role is $240,000, reflecting the company's commitment to attracting top talent and rewarding expertise.Job DescriptionWe're seeking an experienced Data Science Specialist to join our team and play a key role in optimizing our conversational AI...

  • AI Engineer

    3 weeks ago


    San Francisco, California, United States Abridge AI Inc. Full time

    Company OverviewAbridge AI Inc. is a pioneering organization in the field of healthcare technology, leveraging AI to empower deeper understanding and improve clinical documentation efficiencies.


  • San Mateo, California, United States Lumino Ai Full time

    Lumino Ai is a leading developer of innovative AI solutions. We're currently seeking a highly skilled Machine Learning Engineer to join our team. This is an excellent opportunity to contribute to the development of cutting-edge AI technologies and work with a talented group of professionals who share your passion for innovation.About the Role:We're looking...


  • San Francisco, California, United States Unum AI Full time

    At Unum AI, we're revolutionizing data infrastructure with our cutting-edge technology. We're seeking a highly skilled Ai Infrastructure Engineer to join our team in designing and implementing next-generation database management systems.About the RoleThis is an exciting opportunity for a passionate engineer to orchestrate software development and hardware...


  • San Francisco, California, United States Unreal Gigs Full time

    About the RoleWe are seeking a seasoned High-Performance AI Infrastructure Specialist to join our team at Unreal Gigs. In this role, you will be responsible for designing, developing, and optimizing scalable infrastructure solutions to support machine learning workflows.