Petabyte Scale Reliability Expert

6 days ago


San Diego, California, United States Apple Full time
Job Description

We're seeking a Site Reliability Engineer to join our team. If working on large scale problems excites you, we'd love to talk to you. Our team helps Apple engineers answer mission critical questions about their hardware, firmware, and software. We work with engineers across Apple to ensure the reliability and availability of our analytics applications.



  • San Diego, California, United States Apple Full time

    **About the Role:**We are seeking an experienced Site Reliability Engineer to join our Data Analytics team at Apple. As a key member of our team, you will be responsible for building, monitoring, and troubleshooting complex data infrastructure at the petabyte scale.**Responsibilities:**Builddesign, deploy, and manage complex data infrastructure at the...


  • San Francisco, California, United States Genmo Full time

    About the RoleWe are seeking an experienced Senior/Staff AI Infra Engineer to join our team at Genmo.Job Summary:As a Senior/Staff AI Infra Engineer, you'll be responsible for designing, building, and scaling our petabyte-scale data infrastructure.Key Responsibilities:Design highly scalable data infrastructure and systems to process petabyte-scale data...


  • San Francisco, California, United States Genmo Full time

    About UsGenmo is a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI.Job OpportunityWe're seeking a skilled Data Infrastructure Engineer to join our team and contribute to the development of our petabyte-scale data infrastructure.Responsibilities:Design and implement scalable data...


  • San Diego, California, United States Apple Full time

    **Job Description:**As a Site Reliability Engineer on our Data Analytics team, you will be responsible for designing, deploying, and managing complex data infrastructure at the petabyte scale. You will work closely with engineers across Apple to help keep our suite of analytics applications available and ensure the integrity of their data.**Key...


  • San Diego, California, United States Qualcomm Full time

    About UsQualcomm is a world-leading technology company that innovates and creates cutting-edge products. We are committed to delivering high-quality solutions to our customers and continuously strive to improve our services.Job DescriptionWe are seeking an expert in reliability engineering to join our team. As a Reliability Engineering Expert, you will be...


  • San Francisco, California, United States Cruise Full time

    Join Cruise's Data Science Team: We're looking for a talented Staff Software Engineer to join our team as a key member of our ML Data Platform. As a member of our team, you will be responsible for designing, developing, and deploying large-scale data systems in the cloud. Your expertise in Beam and Spark will be instrumental in building a next-generation...


  • San Francisco, California, United States Genmo Full time

    Genmo is a research lab dedicated to building open, state-of-the-art models for video generation. We're seeking an experienced Senior/Staff AI Infra Engineer to join our team and help us shape the future of AI.Job DescriptionYou will design, build, and scale our petabyte-scale data infrastructure, creating robust, scalable systems that manage and process...


  • San Diego, California, United States Apple Full time

    Job DescriptionWe are seeking a Site Reliability Engineer to be a member of our team. If you enjoy working on large scale problems, then we're excited to talk to you. The successful candidate will write code to automate our processes to ensure reliability and manage thousands of compute and storage instances across large heterogeneous infrastructure. You'll...


  • San Francisco, California, United States Genmo Full time

    The Ideal Candidate:We're looking for a senior professional with 5+ years of experience working with large-scale systems. You should have a strong understanding of computer science fundamentals, excellent problem-solving skills, and the ability to communicate complex ideas clearly.Additionally, we're interested in candidates who have:Familiarity with...


  • San Francisco, California, United States Scale AI Full time

    About the Role:">We are seeking a highly skilled Advanced LLM Development Expert to join our SEAL team at Scale AI. In this role, you will design and develop innovative solutions to tackle complex challenges in AI safety and evaluation.">Your Key Responsibilities:">">Design and implement novel machine learning models to improve the reliability and...


  • San Diego, California, United States Apple Full time

    About the Role:This is an exciting opportunity to join Apple's Data Analytics team as a Site Reliability Engineer, Data Analytics. As a key member of our team, you will be responsible for ensuring the reliability and scalability of our data infrastructure.Responsibilities:Build, monitor, troubleshoot complex data infrastructure at the petabyte scale.Support...


  • San Jose, California, United States Tik Tok Full time

    Job Summary">As a Senior Data Engineer on our Ads Data Team, you'll play a critical role in building and maintaining the data infrastructure that supports TikTok's global Ads business. You'll work closely with cross-functional teams to design, implement, and optimize data pipelines, ensuring the accuracy, consistency, and scalability of our data...


  • San Francisco, California, United States Fintool Full time

    Document Analysis Expert WantedFintool is a leading-edge technology company that seeks a highly skilled Document Analysis Expert to develop and optimize high-performance RAG systems on large-scale document datasets. As a key member of our team, you will be responsible for designing and implementing custom embeddings, rankers, and hybrid search algorithms to...


  • San Francisco, California, United States Genmo Full time

    We are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI. Our team is extremely technical with leaders in distributed systems, GPU programming and large-scale training.Job OverviewWe're seeking an experienced Senior/Staff AI Infra Engineer to design, build, and scale our...


  • San Diego, California, United States PRDB Enterpise LLC Full time

    About PRDB Enterpise LLCWe are a dynamic and forward-thinking organization seeking an experienced Culinary Expert to join our team. As we prepare for various large-scale events throughout the year, we require a skilled Line Cook to deliver exceptional culinary experiences to our guests.SalaryThe successful candidate will be offered a competitive salary of...


  • San Jose, California, United States Tik Tok Full time

    About the RoleStreaming Data Engineer, Large-Scale Systems is a critical position in our ad data platform team. You will work closely with product managers and data analysts to build state-of-the-art streaming and batch data processing solutions. The entire data pipeline supports both the TikTok ads platform and our internal business intelligence...


  • San Francisco, California, United States Genmo Full time

    Job TitleData Infrastructure EngineerCompany OverviewWe are Genmo, a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of AGI.Salary$250,000 - $350,000 per year, depending on experience.Job DescriptionWe're seeking an experienced Senior/Staff AI Infra Engineer to design, build, and scale...


  • San Francisco, California, United States Genmo Full time

    About GenmoWe are a research lab dedicated to building open, state-of-the-art models for video generation towards unlocking the right brain of Artificial General Intelligence (AGI). Our team consists of leaders in distributed systems, GPU programming, and large-scale training.

  • Observability Expert

    4 weeks ago


    San Francisco, California, United States Openai Full time

    **About the Role**At OpenAI, we're scaling our systems to bring AI safely to the world. We seek experienced engineers to ensure our technology's reliability and performance.As an Observability Expert, you'll play a crucial role in maintaining and enhancing the stability, scalability, and performance of our rapidly evolving infrastructure. You will work...


  • San Francisco, California, United States Genmo Full time

    At Genmo, we're pushing the boundaries of video generation and unlocking the right brain of AGI.Job DescriptionWe're seeking an experienced Senior/Staff AI Infra Engineer to design, build, and scale our petabyte-scale data infrastructure.Key Responsibilities:Design highly scalable data infrastructure and systems to process petabyte-scale data stores.Manage...