We have other current jobs related to this field that you can find below


  • Los Angeles, California, United States First Resonance Full time

    Job Title: Senior Site Reliability EngineerFirst Resonance is a forward-thinking company at the forefront of hardware development for cutting-edge products like electric airplanes, autonomous vehicles, and robotics. As a Senior Site Reliability Engineer at First Resonance, you will be instrumental in enhancing the efficiency, scalability, and reliability of...


  • Los Angeles, United States Dice Full time

    Dice is the leading career destination for tech experts at every stage of their careers. Our client, Motion Recruitment Partners, LLC, is seeking the following. Apply via Dice today! Job Description A Fortune 500 consulting company is looking for SREs with Subject Matter Expertise with Dynatrace. You'll design, install, and configure Dynatrace onto...


  • Los Angeles, United States Motion Recruitment Full time

    Our Client, A Global Entertainment and Technology Company is looking for an Site Reliability Engineer to join their team in either San Diego, Los Angeles, or San Francisco!REMOTE POSITION: however candidates will need to be local to one of the three worksites to go in for occasional meetings and team events. ***This is a 6 month Contract Position With a...


  • Los Angeles, United States Motion Recruitment Full time

    Our Client, A Global Entertainment and Technology Company is looking for an Site Reliability Engineer to join their team in either San Diego, Los Angeles, or San Francisco!REMOTE POSITION: however candidates will need to be local to one of the three worksites to go in for occasional meetings and team events. ***This is a 6 month Contract Position With a...


  • Los Angeles, United States Adastra replica Full time

    Job DescriptionJob DescriptionOur client is looking for an experienced Site Reliability Engineer to design, operate, maintain, and scale mission-critical infrastructure and products. Products include (but are not limited to) automated Hardware-In-The-Loop (HITL) data analysis systems, vehicle configuration sign-off tools, continuous integration systems for...


  • Los Angeles, United States TikTok Full time

    This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep.The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more.Site Reliability...


  • Los Angeles, United States eTek IT Services, Inc. Full time

    Job DescriptionJob DescriptionOverviewThe Site Reliability Engineer will play a crucial role in ensuring the reliability, scalability, and performance of our infrastructure and applications, ultimately contributing to the seamless operations of our systems. This role is vital in maintaining a high level of uptime and system efficiency, enhancing the overall...


  • Los Angeles, California, United States Motion Recruitment Full time

    Job OverviewA prominent consulting firm is seeking experienced Site Reliability Engineers (SREs) with specialized knowledge in Dynatrace. In this role, you will be responsible for the design, installation, and configuration of Dynatrace on Kubernetes clusters for a variety of enterprise clients. This position is remote, with occasional travel to one of the...


  • Los Angeles, California, United States City National Bank Full time

    PRINCIPAL SITE RELIABILITY ENGINEERWHAT IS THE OPPORTUNITY?As a Principal Site Reliability Engineer, you will leverage your expertise in software development, systems engineering, and operational management to design and maintain robust, scalable systems. Your primary focus will be to guarantee the reliability, scalability, and optimal uptime of City...


  • Los Angeles, California, United States City National Bank Full time

    POSITION: SITE RELIABILITY PRINCIPAL ENGINEEROVERVIEW:As a Site Reliability Engineer (SRE), you will leverage your expertise in software development, systems engineering, and operational practices to construct and maintain large-scale, resilient systems. Your primary responsibility will be to guarantee the reliability, scalability, and optimal uptime of City...


  • Los Angeles, California, United States Westlake Chemical Corporation Full time

    Senior Reliability Engineer - Reliability LeadThis role is responsible for overseeing and coordinating the activities of engineers within the designated area, providing essential guidance and direction.Key ResponsibilitiesResponsibilities may include, but are not limited to, the following:- Supervise and coordinate the activities of engineers in the assigned...

  • Reliability Engineer

    3 months ago


    Los Angeles, United States Kindeva Drug Delivery Company Full time

    The Reliability Engineer will lead the sites Asset Reliability agenda, effectively promoting analytical problem-solving techniques and structured reliability improvement processes. We have an immediate opening for a Reliability Engineers at Kindeva’s Northridge, CA manufacturing facility. The Reliability Engineer will lead the sites Asset Reliability...


  • Los Angeles, United States Journal Technologies Full time

    Job DescriptionJob DescriptionSalary: $85,000.00 to $105,000.00 USD Who We Are: At Journal Technologies, we believe our technology can be a force for good in the world ensuring the proper and efficient functioning of some of the most foundational aspects of society - the courts and justice system. We create and implement enterprise software that supports...


  • Los Angeles, California, United States Riot Games Full time

    Software Reliability Engineering at Riot is challenged with diving into our most ambiguous technology spaces between games, central services and infrastructure to solve our reliability and visibility challenges as Riot continues to scale into a multi-game ecosystem. In order to succeed as a Staff Engineer on this team you will need to be able to partner with...


  • Los Angeles, United States Luytens Technology Solutions Pvt. Ltd. Full time

    Job DescriptionJob DescriptionEx Google Candidate required:Overview:We are seeking a talented GCP Site Reliability Engineer with prior experience at Google to join our team. The role is of great importance as it involves ensuring the reliability, scalability, and performance of our infrastructure on Google Cloud Platform (GCP). The GCP Site Reliability...


  • Los Angeles, United States eTek IT Services, Inc. Full time

    Job DescriptionJob DescriptionJob DescriptionPosition: Site reliability EngineerLocation: RemoteDuration: 1 yearRequired Qualification:6+ years of demonstrated influence across one or more teams for large scale projects that drive impact and improvement across the organization& 6+ years of developing tools for automation of processes or augmenting off the...


  • Los Angeles, California, United States eTek IT Services, Inc. Full time

    Site Reliability EngineereTek IT Services, Inc. is seeking a skilled Site Reliability Engineer to enhance our operational capabilities. This position plays a crucial role in ensuring the dependability, scalability, and efficiency of our systems and applications, thereby improving overall user satisfaction.Core Responsibilities:Architect and deploy monitoring...


  • Los Gatos, California, United States Netflix Full time

    "At Netflix, we strive to bring joy to people across the world through amazing stories. As we grow internationally, we are continually enhancing our cloud-based infrastructure to improve our performance, scalability, and reliability.The SRE team's goal is to ensure customer joy by successfully managing risk and minimizing impact across Netflix. We do this...


  • Los Angeles, California, United States Kindeva Drug Delivery Company Full time

    Position Overview: The Reliability Engineer is responsible for spearheading the Asset Reliability initiatives at our manufacturing facility, utilizing analytical problem-solving methodologies and structured processes for reliability enhancement.Key Responsibilities:Maximize equipment uptime across all essential machinery.Oversee and enhance the Root Cause...


  • Los Angeles, California, United States Motion Recruitment Full time

    Our client, Motion Recruitment, is seeking a Site Reliability Engineer to enhance their team.REMOTE POSITION: Candidates must be local to designated worksites for occasional meetings and team events.***This is a 6-month Contract Position With Potential for Conversion or Extension***As a Site Reliability Engineer, you will be part of the CICD and Cloud Site...

Senior Site Reliability Engineer

3 months ago


Los Angeles, United States Avesta Computer Services Full time

Job Title: Senior Site Reliability Engineer (Devops) - (Live Streaming, Video, Media processing)

Location: Tempe, Arizona / Los Angeles, California, United States

Type: Fulltime


Job Description:

Our clients stands as a beacon of innovation, crafting world-class, large scale digital products that redefine the entertainment experience. We're on the lookout for visionary individuals to join our pioneering team, tasked with shaping the future of streaming products. Now is your chance to be part of creating and delivering extraordinary digital experiences spanning Sports and Entertainment. As a key member of our team, you'll drive innovation and significantly contribute to our mission of pioneering the next generation of streaming products. Your opportunity to create unparalleled fan experiences for these iconic sports events is here. Our current advanced digital solutions, accessed by millions across web, mobile, and living room devices, signify just the start of our ambitious journey.


About The Role:

Our client is hiring a Principal SRE to build and operate infrastructure and platforms to support APIs around our live direct to consumer APIs for major live events such as the Super Bowl, World Cup, and World Series. The principal engineer will be the technical lead for solving thundering herd problems including partnering with the application team to load test, scale up and scale back down again and help design the platform and infrastructure to meet their needs.

A collaborative, peacemaker mindset is a must while fostering a culture of learning and continuous improvement for the entire team. The principal engineer will additionally work with the Director, Platform Engineering to visualize workflows, and refine processes and policies to keep the team throughput high.


A Snapshot of Your Responsibilities:

  • Serve as technical lead for the implementation and operation of cloud-based infrastructure and platform including EKS and other AWS services supporting direct to consumer APIs and solving associated thundering herd problems including load testing, scaling up and scaling back down again.
  • Work closely with Video & Player Engineering and 3rd party teams to help design and implement scalability, cost visibility and observability in the platform.
  • Help to mentor and train less senior members of the team
  • Assist with product/technology selection including evaluating maturity, support and design and implementation of POCs.
  • Work with the Director, Site Reliability Engineering to foster a culture of learning and continuous improvement, help to conceptualize and visualize workflows and processes.
  • Perform post-incident analysis to identify root causes and potential workarounds/solutions.
  • Be fluid and open to change and evolving processes and tools.
  • Other duties as assigned.


What You Will Need:

  • Expert with EKS, Kubernetes and AWS including IAM, auto scaling, networking and load balancing/request routing.
  • Proven experience with solving scalability problems both up and down including thundering herd scenarios.
  • Expert with troubleshooting and root cause analysis
  • Expert with at least 2 programming languages
  • Strong analytical skills
  • Strong communication skills, both verbal and written
  • Proven experience with building deployment pipelines and enabling self-service.
  • Strong teamwork and willingness to collaborate with others.
  • Proven experience with training and mentoring engineers


Nice To Have, But Not a Deal breaker:

  • BS or equivalent
  • AWS Solutions Architect Professional certification