Senior Site Reliability Engineer

2 weeks ago


San Francisco, California, United States AutoRABIT Holding Inc. Full time
Job OverviewAbout AutoRABIT:
AutoRABIT is a rapidly expanding SaaS company recognized as the premier provider of Salesforce DevSecOps solutions tailored for regulated sectors such as finance, insurance, and healthcare. Our offerings empower developers to streamline their daily operations, enhancing productivity and accelerating release cycles while adhering to strict security, compliance, and privacy standards.

Position Summary:

We are seeking a Senior Site Reliability/DevSecOps Engineer to contribute to the development, scaling, and management of our cloud services.

In this position, you will leverage your extensive experience to implement and optimize operational best practices across teams, providing insights and recommendations for enhanced reliability and automation. You will be accountable for the security, availability, performance, efficiency, change management, monitoring, emergency response, capacity planning, backup, and disaster recovery of our technical ecosystem, while also driving automation initiatives and establishing a robust DevSecOps framework.

Success in this role requires accountability, agility, and strong analytical capabilities, coupled with a commitment to continuous learning, data collection, and execution based on insights.

Key Responsibilities:

  • Serve as a Site Reliability or DevSecOps engineer with a strong focus on automation, reliability, scalability, monitoring, and capacity planning, possessing the knowledge to support a diverse range of software and systems.
  • Contribute to the creation and upkeep of frameworks for monitoring, automation, and coding to enhance service scalability and reliability.
  • Support both internal and client-facing teams in deploying new software releases, VPNs, and related security infrastructure.
  • Assist in resolving AutoRABIT service or customer-related issues as necessary.
  • Engage in sustainable incident response practices and conduct blameless postmortems.
  • Facilitate the automation of manual tasks, including user provisioning in production and testing environments.
  • Collaborate within a small agile team to develop and refine SRE software, support colleagues, and pursue self-improvement.
  • Participate in a regular on-call or rotational schedule to support AutoRABIT servers, including weekends and holidays.
Required Qualifications:
  • Proven experience in deploying and maintaining scalable, resilient, and secure infrastructure using AWS, GCP, and/or Azure cloud services and automation.
  • Familiarity with essential DevSecOps tools for monitoring (e.g., ELK, AWS, Azure CloudWatch) and infrastructure management platforms (e.g., Kubernetes, Docker, Ansible, Jenkins, Terraform).
  • Proficiency in Shell Scripting (Bash), Python, or similar languages is essential.
  • Knowledge of programming languages such as Python, Go, or Java.
  • Experience with configuration management tools like Ansible or Chef.
  • Strong understanding of CI/CD pipelines and tools such as Jenkins, GitLab CI, or CircleCI.
  • Excellent troubleshooting skills in SaaS or customer environments.
  • A collaborative team player who values feedback and knowledge sharing.
  • A proactive mindset: challenging the status quo, leading, and contributing to significant improvements and innovations while maintaining accountability.
  • Exceptional written and verbal communication skills in US English for effective collaboration in a global team environment.

Education and Experience:

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience.
  • Minimum of 2 years of experience in Infrastructure Management, DevOps, or Site Reliability, preferably in a SaaS or cloud context.
  • AWS, GCP, and/or Azure certification is preferred.
  • At least 2 years of experience with Kubernetes.
  • Minimum of 2 years managing Linux-based systems in a public cloud environment such as AWS, GCP, or Azure.
  • At least 2 years of experience with systems monitoring and logging; familiarity with ELK is advantageous.
  • Solid understanding of standard TCP/IP networking and common protocols such as DNS, load balancers, and HTTP.

Note: Candidates must be US citizens or permanent residents and capable of obtaining a Government Security clearance if required.

Salary range for this position is $70,000 to $100,000 per year, based on experience.

This is a fully remote position.

Powered by JazzHR

zcHta7dtkS



  • San Francisco, California, United States AutoRABIT Holding Inc. Full time

    Job OverviewAbout AutoRABIT:AutoRABIT is a rapidly expanding SaaS provider and a prominent leader in the Salesforce DevSecOps platform tailored for regulated sectors such as finance, insurance, and healthcare. Our solutions empower developers to streamline their daily operations, enhancing productivity and accelerating release cycles while adhering to...


  • San Francisco, California, United States RevenueCat Full time

    About RevenueCatWe are a leading provider of mobile subscription infrastructure, handling over $3 billion in in-app purchases annually across thousands of apps. Our mission is to build a standard for mobile subscription infrastructure, and we're looking for a Senior Site Reliability Engineer to help us achieve this goal.About the RoleWe're seeking a highly...


  • San Francisco, California, United States Outdefine Full time

    About the JobWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Outdefine. As a key member of our Infrastructure team, you will be responsible for ensuring the reliability and scalability of our blockchain-based systems.Key ResponsibilitiesRun internal Chainlink and Blockchain nodes to ensure seamless connectivity and data...


  • San Francisco, California, United States Centene Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Centene. As a key member of our technology organization, you will play a critical role in ensuring the reliability, performance, and security of our platform infrastructure.Key ResponsibilitiesLead Projects and Initiatives: Help lead projects focused on...


  • San Francisco, California, United States Operant AI Full time

    Job OverviewSenior Site Reliability EngineerAs the inaugural SRE within our organization, we are looking for an individual to establish Operant's SRE strategy and operations aimed at ensuring the resilience and security of our platforms and services. If you are enthusiastic about the prospect of being an early engineer at a startup ready to revolutionize...


  • San Francisco, California, United States Circle Full time

    About CircleCircle is a leading financial technology company that is revolutionizing the way value is transferred globally. Our innovative infrastructure enables businesses, institutions, and developers to harness the power of blockchain technology and capitalize on the emerging internet of money.Job SummaryWe are seeking a highly skilled Senior Site...


  • San Francisco, California, United States Autodesk, Inc. Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to lead our cloud infrastructure efforts and ensure the reliability and performance of our software solutions. As a key member of our team, you will be responsible for designing, implementing, and maintaining scalable and secure cloud infrastructure to support our growing user...


  • San Francisco, California, United States AutoRABIT Holding Inc. Full time

    Job OverviewAbout AutoRABIT:AutoRABIT is a rapidly expanding SaaS company recognized as the premier provider of Salesforce DevSecOps solutions tailored for regulated sectors such as finance, insurance, and healthcare. Our platform empowers developers to streamline their workflows, enhancing productivity and accelerating release cycles while adhering to...


  • San Francisco, California, United States Cognizant Full time

    Senior Site Reliability Engineer and R2 Solutions Architect (Remote) Cognizant is seeking an experienced Senior Site Reliability Engineer and R2 Solutions Architect with expertise in Python Performance Validation and Dynatrace to oversee critical projects. Your contributions will significantly enhance the efficiency and effectiveness of our solutions,...


  • San Francisco, California, United States Chelsoft Solutions Co Full time

    Job OverviewWe are seeking a Senior Site Reliability Engineer to join our dynamic team at Chelsoft Solutions Co. This position is designed for a skilled SRE professional who thrives in a hybrid work environment.Key ResponsibilitiesImplement and maintain reliable systems and infrastructure.Collaborate with cross-functional teams to enhance system...


  • San Francisco, California, United States Crusoe Full time

    About This Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Crusoe Energy Systems. As a Site Reliability Engineer, you will play a pivotal role in ensuring the reliability and performance of our infrastructure.Key Responsibilities:Collaborate with the SRE team to detect, analyze, and prevent issues to maintain high Service...


  • San Diego, California, United States Dexcom Full time

    About Dexcom:Founded in 1999, Dexcom, Inc. (NASDAQ: DXCM) is a pioneer in the development and marketing of Continuous Glucose Monitoring (CGM) systems designed for use by individuals with diabetes and healthcare professionals. As a leader in the transformation of diabetes management, Dexcom is committed to providing innovative CGM technology that empowers...


  • San Francisco, California, United States Cisco Full time

    Position Overview We are seeking experienced engineers to become part of our Federal region's Site Reliability Engineering (SRE) team at Cisco, a leader in Internet and cloud intelligence solutions. In this role, you will be instrumental in designing and sustaining the infrastructure and systems vital for the operations within the Federal sector. Your...


  • San Francisco, California, United States Okta, Inc. Full time

    Senior Site Reliability Engineer, Security About Okta Okta is recognized as a leader in identity management, empowering users to securely access technology across various platforms and devices. Our Workforce and Customer Identity Clouds facilitate secure access, authentication, and automation, fundamentally transforming the digital experience by placing...


  • San Francisco, California, United States Okta, Inc. Full time

    Senior Site Reliability Engineer, Security About Okta Okta stands as a leader in identity management, empowering users to securely access technology across various platforms and devices. Our solutions in Workforce and Customer Identity Clouds provide seamless access, authentication, and automation, ensuring that identity is central to business security and...


  • San Francisco, California, United States Okta, Inc. Full time

    Senior Site Reliability Engineer, Security About Okta Okta is recognized as The World's Identity Company, dedicated to empowering individuals to securely access any technology across various devices and applications. Our Workforce and Customer Identity Clouds facilitate secure yet adaptable access, authentication, and automation, fundamentally transforming...


  • San Francisco, California, United States Okta, Inc. Full time

    Senior Site Reliability Engineer, Security About Okta Okta is recognized as a leader in identity management. Our mission is to empower individuals to securely access any technology—anywhere, on any device or application. Our Workforce and Customer Identity Clouds provide secure yet adaptable access, authentication, and automation that revolutionizes the...


  • San Francisco, California, United States Okta, Inc. Full time

    Senior Site Reliability Engineer, Security About Okta Okta is recognized as the premier Identity Company globally. Our mission is to empower individuals to securely utilize any technology—anywhere, on any device or application. Our Workforce and Customer Identity Clouds facilitate secure yet adaptable access, authentication, and automation,...


  • San Francisco, California, United States Pager Full time

    PagerDuty empowers teams of all kinds to drive business forward through our Operations Cloud.We're seeking a Senior Site Reliability Engineer to join our SRE-Platform team. As a key contributor, you'll build, maintain, and scale our Kubernetes platform, accelerating developer productivity, improving reliability, and helping PagerDuty scale for the...


  • San Francisco, California, United States Okta, Inc. Full time

    Senior Site Reliability Engineer, Security About Okta Okta is recognized as a leader in identity management. Our mission is to empower individuals to securely access any technology, anywhere, on any device or application. Our Workforce and Customer Identity Clouds provide a secure yet adaptable approach to access, authentication, and automation, placing...