Current jobs related to Senior Site Reliability Engineer - Mountain View, California - Groq


  • Mountain View, California, United States Tik Tok Full time

    About the Role: TikTok is seeking an experienced Senior Engineering Manager to lead our site reliability engineering teams and algorithm teams across Trust and Safety Platform, E-Commerce Platform, and several other platforms. As a Senior Engineering Manager, you will be responsible for leading complex projects, managing day-to-day operations, and influencing...


  • Mountain View, California, United States Moveworks Full time

    About the Role: Moveworks is the universal AI copilot for search and automation across all your business applications. We give employees one place to go to find information and get support while reducing costs for your business. The Moveworks Copilot is powered by an industry-leading Reasoning Engine that uses a combination of public and proprietary language...


  • Mountain View, California, United States Insight Global Full time

    Site Reliability Engineer Opportunity in the Bay Area: We are seeking a highly motivated Site Reliability Engineer to join our team in the Bay Area. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure. Key Responsibilities: Strong Linux System Admin fundamentals (bash/shell...


  • Mountain View, California, United States Groq Full time

    Site Reliability Engineer: At Groq, we're pushing the boundaries of AI accessibility. Our Language Processing Unit (LPU) technology outpaces GPUs in speed, power, efficiency, and cost-effectiveness. As a Site Reliability Engineer, you'll play a crucial role in ensuring the reliability, scalability, and performance of our tools and services. Responsibilities:...


  • Mountain View, California, United States Groq Full time

    Job Description: We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Groq. As a Principal Site Reliability Engineer, you will be responsible for ensuring the reliability of our APIs as customers route their AI workloads through our insanely fast, purpose-built hardware and software systems. Key Responsibilities: Enhance system...


  • Mountain View, California, United States Samsung Electronics America North America Full time

    Job Title: Embedded Site Reliability Engineer. At Samsung Ads, we're seeking a highly skilled Embedded Site Reliability Engineer to join our Global Ads Product & Engineering team. As a key member of our team, you will play a crucial role in ensuring the reliability, scalability, and performance of our advertising technology platform. Key Responsibilities: Design...


  • Mountain View, California, United States Tik Tok Full time

    Job Description: TikTok is seeking a highly skilled Site Reliability Engineer to join our Edge Services team. As a key member of our team, you will be responsible for designing, implementing, and operating large-scale, massively distributed infrastructure to ensure the reliability and performance of our content delivery network. Our Edge SREs work closely with...


  • Mountain View, California, United States Tik Tok Full time

    About the Role: As a Site Reliability Engineer on the AML team at TikTok, you will play a critical role in designing, building, and maintaining highly available, scalable, and fault-tolerant systems. Your expertise in analyzing and troubleshooting Linux-based distributed systems will be essential in ensuring the smooth operation of our massively distributed...


  • Mountain View, California, United States Tik Tok Full time

    About the Role: TikTok is the leading destination for short-form mobile video, and our mission is to inspire creativity and bring joy. As a Site Reliability Engineer in the USDS division, you will play a critical role in ensuring the reliability, fault-tolerance, and scalability of our Ads data platform. Responsibilities: Perform SRE duties and operations across...


  • Mountain View, California, United States Tik Tok Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our Global E-commerce team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our E-commerce platform. Key Responsibilities: Support the service level of a critical, revenue-generating E-commerce platform and related...


  • Mountain View, California, United States Tik Tok Full time

    About U.S. Data Security: TikTok is the leading destination for short-form mobile video. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U.S. users safe. Our focus is on providing oversight and protection of the...

  • Senior Cloud Engineer

    1 month ago


    Mountain View, California, United States Microsoft Corporation Full time

    Job Description: Microsoft Corporation is seeking a highly skilled Senior Cloud Engineer to join our Edge Infrastructure Engineering team. As a key member of our team, you will be responsible for designing, developing, and deploying cloud-based solutions that enable our customers to manage and operate their edge infrastructure with ease. Key...


  • Mountain View, California, United States Tik Tok Full time

    Job Summary: TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. The U.S. Data Security (USDS) division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep U.S. users safe. This new division focuses on providing oversight and...


  • Mountain View, California, United States Aeva, Inc Full time

    About Aeva, Inc.: Aeva is a pioneering company in the field of sensing and perception for autonomous vehicles and beyond. Our innovative 4D LiDAR technology is built from the ground up at silicon photonics scale for mass-market applications. We are seeking a highly skilled Reliability Test Engineer to join our dynamic Quality and Reliability Team. Key...


  • Mountain View, California, United States Tik Tok Full time

    About the Role: We are seeking an experienced Senior Software Engineer to join our Risk and Response Detection and Optimization team within Trust and Safety. As a key member of our team, you will be responsible for architecting and shipping compelling and usable tools with React, Python, and Golang. You will work directly with users to inform, refine, and...


  • Mountain View, California, United States Aurora Innovation Full time

    We are seeking a highly skilled Staff Hardware Reliability Engineer - Computer to join our team at Aurora Innovation. The Hardware Reliability team is dedicated to ensuring the robustness and dependability of hardware systems in the Aurora hardware stack. As a Staff Hardware Reliability Engineer - Computer, you will lead and oversee hardware reliability...


  • Mountain View, California, United States Photon Full time

    Job Title: Senior Java Software Engineer. Job Summary: We are seeking an experienced Senior Java Engineer to play a key role in designing, building, and maintaining our real-time analytics infrastructure. As a Senior Engineer, you will work closely with our data scientists, product managers, and other engineers to develop and deploy scalable, efficient, and...


  • Mountain View, California, United States Photon Full time

    Job Summary: We are seeking an experienced Senior Java Engineer, Analytics with a strong focus on Streaming to join our team. As a Senior Engineer, you will play a key role in designing, building, and maintaining our real-time analytics infrastructure. You will work closely with our data scientists, product managers, and other engineers to develop and deploy...


  • Mountain View, California, United States Databricks Full time

    About the Role: We are seeking a skilled Senior Performance Engineer to join our team at Databricks. As a key member of our performance engineering team, you will be responsible for evaluating the performance of our products and features, identifying performance bottlenecks, and partnering with engineers to solve performance and scalability issues. Key...

Senior Site Reliability Engineer

1 month ago


Mountain View, California, United States Groq Full time

Unlock the Power of AI with Groq

At Groq, we're revolutionizing the AI economy by making processing power more accessible, faster, and more affordable. Our Language Processing Unit (LPU) outpaces the GPU in speed, power, efficiency, and cost-effectiveness, empowering a world where AI is universally accessible.

Join Our Mission

We're seeking a Senior Site Reliability Engineer to ensure the reliability, scalability, and performance of our tools and services. As a key member of our team, you'll design and implement scalable and reliable architectures, establish comprehensive monitoring systems, and lead the investigation and resolution of production incidents.

Responsibilities
  • Design and implement reliable architectures for platform infrastructure
  • Establish comprehensive monitoring systems to track key performance indicators
  • Lead the investigation and resolution of production incidents
  • Develop and implement automated testing frameworks to ensure software quality and reliability

Requirements
  • 6+ years of experience in site reliability engineering or a related field
  • Deep understanding of cloud-native technologies and infrastructure as a service (IaaS)
  • Expertise in monitoring and alerting systems, incident management processes, and disaster recovery planning
  • Strong analytical and problem-solving skills with a focus on root cause analysis and mitigation

What We Offer
  • Competitive base salary range: $132,000 to $211,500
  • Equity and benefits package
  • Remote work opportunities with asynchronous partnerships and collaboration methods

Groq is an equal opportunity employer committed to diversity, inclusion, and belonging in all aspects of our organization. We value and celebrate diversity in thought, beliefs, talent, expression, and backgrounds.