We have other current jobs related to this field that you can find below


  • Los Angeles, California, United States Riot Games Full time

    Software Reliability Engineering at Riot is challenged with diving into our most ambiguous technology spaces between games, central services and infrastructure to solve our reliability and visibility challenges as Riot continues to scale into a multi-game ecosystem. In order to succeed as a Staff Engineer on this team you will need to be able to partner with...


  • Los Angeles, United States Motion Recruitment Full time

    Our client, a global entertainment and technology company, is looking for a Site Reliability Engineer to join their team in either San Diego, Los Angeles, or San Francisco! REMOTE POSITION: however, candidates will need to be local to one of the three worksites to go in for occasional meetings and team events. ***This is a 6 month Contract Position With a...


  • Los Angeles, United States Adastra replica Full time

    Our client is looking for an experienced Site Reliability Engineer to design, operate, maintain, and scale mission-critical infrastructure and products. Products include (but are not limited to) automated Hardware-In-The-Loop (HITL) data analysis systems, vehicle configuration sign-off tools, continuous integration systems for...


  • Los Angeles, United States TikTok Full time

    This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols to keep. The teams within USDS that deliver on this commitment daily span across Trust & Safety, Security & Privacy, Engineering, User & Product Ops, Corporate Functions and more. Site Reliability...


  • Los Angeles, United States eTek IT Services, Inc. Full time

    Overview: The Site Reliability Engineer will play a crucial role in ensuring the reliability, scalability, and performance of our infrastructure and applications, ultimately contributing to the seamless operations of our systems. This role is vital in maintaining a high level of uptime and system efficiency, enhancing the overall...


  • Los Angeles, California, United States First Resonance Full time

    Job Title: Senior Site Reliability Engineer. First Resonance is a forward-thinking company at the forefront of hardware development for cutting-edge products like electric airplanes, autonomous vehicles, and robotics. As a Senior Site Reliability Engineer at First Resonance, you will be instrumental in enhancing the efficiency, scalability, and reliability of...


  • Los Angeles, United States Journal Technologies Full time

    Salary: $85,000.00 to $105,000.00 USD. Who We Are: At Journal Technologies, we believe our technology can be a force for good in the world, ensuring the proper and efficient functioning of some of the most foundational aspects of society: the courts and justice system. We create and implement enterprise software that supports...


  • Los Angeles, California, United States Motion Recruitment Full time

    Job Overview: A prominent consulting firm is seeking experienced Site Reliability Engineers (SREs) with specialized knowledge in Dynatrace. In this role, you will be responsible for the design, installation, and configuration of Dynatrace on Kubernetes clusters for a variety of enterprise clients. This position is remote, with occasional travel to one of the...


  • Los Angeles, California, United States City National Bank Full time

    PRINCIPAL SITE RELIABILITY ENGINEER. WHAT IS THE OPPORTUNITY? As a Principal Site Reliability Engineer, you will leverage your expertise in software development, systems engineering, and operational management to design and maintain robust, scalable systems. Your primary focus will be to guarantee the reliability, scalability, and optimal uptime of City...


  • Los Angeles, California, United States City National Bank Full time

    POSITION: SITE RELIABILITY PRINCIPAL ENGINEER. OVERVIEW: As a Site Reliability Engineer (SRE), you will leverage your expertise in software development, systems engineering, and operational practices to construct and maintain large-scale, resilient systems. Your primary responsibility will be to guarantee the reliability, scalability, and optimal uptime of City...

  • Reliability Engineer

    3 months ago


    Los Angeles, United States Kindeva Drug Delivery Company Full time

    The Reliability Engineer will lead the site’s Asset Reliability agenda, effectively promoting analytical problem-solving techniques and structured reliability improvement processes. We have an immediate opening for a Reliability Engineer at Kindeva’s Northridge, CA manufacturing facility. The Reliability Engineer will lead the site’s Asset Reliability...


  • Los Angeles, United States Luytens Technology Solutions Pvt. Ltd. Full time

    Ex-Google candidate required. Overview: We are seeking a talented GCP Site Reliability Engineer with prior experience at Google to join our team. The role is of great importance as it involves ensuring the reliability, scalability, and performance of our infrastructure on Google Cloud Platform (GCP). The GCP Site Reliability...


  • Los Angeles, United States Riot Games Full time

    Software Reliability Engineering at Riot is challenged with diving into our most ambiguous technology spaces between games, central services and infrastructure to solve our reliability and visibility challenges as Riot continues to scale into a multi-game ecosystem. In order to succeed as a Staff Engineer on this team you will need to be able to partner with...


  • Los Angeles, United States Dice Full time

    Dice is the leading career destination for tech experts at every stage of their careers. Our client, Motion Recruitment Partners, LLC, is seeking the following. Apply via Dice today! Job Description: A Fortune 500 consulting company is looking for SREs with subject matter expertise in Dynatrace. You'll design, install, and configure Dynatrace onto...


  • Los Angeles, United States eTek IT Services, Inc. Full time

    Position: Site Reliability Engineer. Location: Remote. Duration: 1 year. Required Qualification: 6+ years of demonstrated influence across one or more teams for large-scale projects that drive impact and improvement across the organization and 6+ years of developing tools for automation of processes or augmenting off the...


  • Los Angeles, California, United States eTek IT Services, Inc. Full time

    Site Reliability Engineer. eTek IT Services, Inc. is seeking a skilled Site Reliability Engineer to enhance our operational capabilities. This position plays a crucial role in ensuring the dependability, scalability, and efficiency of our systems and applications, thereby improving overall user satisfaction. Core Responsibilities: Architect and deploy monitoring...


  • Los Angeles, California, United States Kindeva Drug Delivery Company Full time

    Position Overview: The Reliability Engineer is responsible for spearheading the Asset Reliability initiatives at our manufacturing facility, utilizing analytical problem-solving methodologies and structured processes for reliability enhancement. Key Responsibilities: Maximize equipment uptime across all essential machinery. Oversee and enhance the Root Cause...


  • Los Angeles, California, United States Motion Recruitment Full time

    Our client, Motion Recruitment, is seeking a Site Reliability Engineer to enhance their team. REMOTE POSITION: Candidates must be local to designated worksites for occasional meetings and team events. ***This is a 6-month Contract Position With Potential for Conversion or Extension*** As a Site Reliability Engineer, you will be part of the CICD and Cloud Site...


  • Los Angeles, California, United States LOOP LLC Full time

    Company Overview: LOOP LLC is a significant player in the energy sector, serving as a crucial conduit for waterborne crude oil entering the United States. As a joint venture among leading companies, we have established ourselves as the only Deepwater Port in the U.S., facilitating the loading and unloading of various vessel sizes. Position Summary: We are seeking...

Staff Site Reliability Engineer

2 months ago


Los Angeles, United States Okta, Inc. Full time

Get to know Okta

Okta is The World’s Identity Company. We free everyone to safely use any technology, anywhere, on any device or app. Our Workforce and Customer Identity Clouds enable secure yet flexible access, authentication, and automation that transforms how people move through the digital world, putting Identity at the heart of business security and growth.

At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we’re looking for lifelong learners and people who can make us better with their unique experiences.

Join our team

We’re building a world where Identity belongs to you. As a Staff Site Reliability Engineer, you will champion all things pertaining to reliability at Okta on our Customer Identity Cloud (CIC) product. Working closely with the product engineers, quality engineers, platform engineers, and architecture teams, your primary focus will be on ensuring production systems remain operational at all times, while continually setting and achieving long-term performance, reliability, and scalability goals in a platform with an exponential growth plan for the coming years.

With CIC’s increased dedication to ensuring customer availability expectations are exceeded in every way, you will play a key role as we evolve our system architecture to meet the demands of enormous growth and support the hundreds of millions of users who rely on us to provide uninterrupted access to business-critical enterprise and consumer applications.

You will:

  • Be a core contributor to Okta’s FedRAMP initiative
  • Collaborate with engineering teams to improve availability, reliability, and observability of their services
  • Participate in regular on-call rotations to ensure 24/7 coverage of all critical systems
  • Use existing monitoring tools to identify problems and resolve them or escalate to service teams
  • Implement changes to enable or improve infrastructure resilience, monitoring, and alerting
  • Lead the development and continuous refinement of SRE tools and processes to improve software delivery, observability, reliability, and operational efficiency
  • Code, script, and develop daily (Go, Terraform, Helm, etc.)
  • Optimize existing systems and eliminate toil through simplification and automation

You might be a good fit if you:

  • Have U.S. Person status (e.g., a U.S. Citizen, National, Lawful Permanent Resident, Refugee, or Asylee)*
  • Have experience working in a FedRAMP environment
  • Have 6+ years of industry experience as a Site Reliability Engineer or in adjacent disciplines (DevOps, Platform, etc.)
  • Are proficient in Golang
  • Have experience managing infrastructure with Terraform at scale
  • Are comfortable working with a fully distributed team
  • Have 4+ years as a software developer in a SaaS environment
  • Have experience in a production environment supporting large-scale, mission-critical applications
  • Have demonstrable expertise working with Microsoft Azure and/or Amazon Web Services
  • Have production on-call experience in a 24/7 cloud-based environment
  • Have a good understanding of microservices, cloud infrastructure (AWS, Azure, GCP), databases (SQL, NoSQL, key/value), containers (Docker, Kubernetes), web technologies (WebSockets, HTTP), and networking (SSL, routing, VPN)
  • Have exceptional communication skills, including technical writing in English
  • Have a systematic problem-solving approach, coupled with a strong sense of ownership and drive
  • Are comfortable with the Agile software development methodology
  • Love to work as a team but can work effectively in a remote environment where tasks may be self-driven
