Lead Site Reliability Engineer for Security

1 week ago


San Francisco, California, United States | Okta, Inc. | Full time
Senior Site Reliability Engineer, Security

About Okta

Okta stands as a leader in identity management, empowering users to securely access technology across various platforms and devices. Our solutions in Workforce and Customer Identity Clouds provide seamless access, authentication, and automation, ensuring that identity is central to business security and growth.

At Okta, we value diverse perspectives and experiences. We seek individuals who are committed to continuous learning and can enhance our team with their unique insights.

Okta's Workforce Identity Cloud Security Engineering division is in search of a skilled and enthusiastic Senior Site Reliability Engineer to join our team dedicated to crafting and implementing security solutions that strengthen our cloud infrastructure. We prioritize innovation and aim to transform promising ideas into robust security measures that support large-scale, critical infrastructure. We advocate for comprehensive security practices, adherence to industry standards, and the enforcement of the principle of least privilege to elevate our security posture. Our Infrastructure Security team possesses a specialized skill set that merges security expertise with the capability to design and deploy infrastructure across multiple cloud environments while maintaining product functionality and performance.

This role is pivotal within a security-focused, dynamic organization poised for significant growth. You will serve as a bridge between the Security and Engineering teams, leveraging technical expertise to influence the security strategy. Your focus will be on engineering security components of the systems utilized across our services. Be part of a transformative journey in the cloud computing arena.

Your Responsibilities Include:

  • Constructing, operating, and overseeing Okta's production infrastructure.
  • Championing security best practices and leading initiatives to enhance our security posture for essential infrastructure.
  • Addressing production incidents and strategizing on preventive measures.
  • Diagnosing and resolving intricate production challenges to ensure reliability and performance.
  • Automating manual processes to improve efficiency.
  • Advancing our monitoring tools and platforms continuously.
  • Promoting best practices for developing scalable and dependable services across engineering.
  • Creating and maintaining technical documentation, runbooks, and procedures.
  • Participating in an on-call rotation that supports a 24/7 online environment.

Ideal Candidate Profile:

  • A proactive approach to problem-solving: identifying issues and implementing solutions.
  • Experience in automating, securing, and managing large-scale production IAM and containerized services in AWS, GCP, or other cloud platforms.
  • Familiarity with CI/CD methodologies, Linux fundamentals, OS hardening, networking principles, and IP protocols.
  • Knowledge of configuration management and infrastructure-as-code tools such as Chef and Terraform.
  • Proficiency in operational scripting languages like Ruby, Python, Go, and shell, along with source control usage.
  • Experience with industry-standard security tools like Nessus, Qualys, osquery, and Splunk.
  • Understanding of Public Key Infrastructure (PKI) and secrets management.

Additional Qualifications:

  • Experience in conducting threat assessments and evaluating vulnerabilities in high-availability environments.
  • Knowledge of MySQL, including replication and clustering strategies, and familiarity with data stores like DynamoDB, Redis, and Elasticsearch.

Minimum Required Skills and Experience:

  • 3+ years of experience in architecting and managing complex AWS or other cloud networking infrastructure.
  • 3+ years of experience with Chef and Terraform.
  • Strong understanding of Linux systems.
  • Background in security principles and practices.
  • Bachelor's degree in computer science or equivalent experience.

Okta is an Equal Opportunity Employer, committed to diversity and inclusion in the workplace.


