Senior Site Reliability/DevOps Engineer

2 weeks ago


San Francisco, California, United States AutoRABIT Holding Inc. Full time

About AutoRABIT:

AutoRABIT is a leading provider of Salesforce DevSecOps platform for regulated industries. Our solutions enable developers to automate daily tasks, increasing productivity and release velocity while meeting stringent security, compliance, and privacy regulations.

About the role:

We are seeking a Senior Site Reliability/DevOps Engineer to help develop, scale, and operate our cloud services. As an experienced business professional, you will implement and execute best practice operations and improvements across teams, providing visibility and recommendations for improved reliability and automation.

Responsibilities:

  • Contribute to the development and maintenance of frameworks for monitoring, automation, and code to increase scalability and reliability of the service.
  • Assist internal and customer-facing teams with deployment of new software releases, VPN, and other related security infrastructure.
  • Participate in and practice sustainable incident response and blameless postmortems.
  • Contribute to the automation of manual tasks, such as provisioning of users in production and test environments.
  • Help and develop peers' capabilities through knowledge sharing, mentoring, and collaboration.

Required Skills and Experience:

  • Design, implement, and maintain scalable, resilient, and secure infrastructure using AWS.
  • Develop and manage infrastructure as code using Terraform.
  • Implement and manage CI/CD pipelines to automate deployments and ensure smooth delivery of applications.
  • Monitor system performance, identify bottlenecks, and implement solutions to improve reliability and performance.
  • Troubleshoot, resolve, and perform RCAs for incidents, ensuring minimal disruption to services.

Education and Background:

  • Bachelor's in Computer Science, Engineering, or equivalent degree or experience.
  • 5+ years of experience in site reliability engineering, DevOps, or a related field.
  • AWS, GCP, and/or Azure Certified.
  • 3+ years of Kubernetes experience.
  • 3+ years' experience managing Linux-based systems in a public cloud such as AWS, GCP, or Azure.
  • 3+ years of experience with systems monitoring and logging; knowledge of ELK is preferred.


  • San Francisco, California, United States Autodesk Full time

    {"Responsibilities": "As a Senior Site Reliability Engineer at Autodesk, you will be responsible for leading the development and maintenance of robust cloud infrastructure to support millions of daily users. You will automate processes to improve system reliability and introduce best practices in continuous integration and deployment. You will also lead...


  • San Francisco, California, United States Astranis Full time

    Astranis MissionAstranis is revolutionizing global connectivity by building smaller, more cost-effective spacecraft to bridge the digital divide.Job SummaryWe're seeking a highly skilled Senior Site Reliability Engineer to join our team and lead our DevOps efforts as we expand to a fleet of satellites and their supporting services.Key ResponsibilitiesOwn and...


  • San Francisco, California, United States GRNET Full time

    About GRNETGRNET is a leading provider of advanced network and cloud computing services to academic and research institutions, educational entities, and public sector agencies in Greece.Our ApproachWe adopt a Site Reliability Engineering approach to ensure the reliability, scalability, and efficiency of our infrastructure. Our team is divided into three...


  • San Francisco, California, United States GRNET Full time

    About GRNETGRNET is a leading provider of advanced network and cloud computing services to academic and research institutions, educational entities, and public sector agencies in Greece.Our ApproachWe adopt a Site Reliability Engineering approach to ensure the reliability, scalability, and efficiency of our infrastructure. Our team is divided into three...


  • San Francisco, California, United States Pager Full time

    About the RolePagerDuty is seeking a highly skilled Senior Site Reliability Engineer to join our SRE-Platform team. As a key contributor, you will play a crucial role in building, maintaining, and scaling our Kubernetes platform.Key ResponsibilitiesMaintain the overall health of the platform, including triaging and troubleshooting production issues,...


  • San Francisco, California, United States GRNET Full time

    About GRNETGRNET is a leading entity in the Greek Government, providing advanced network and cloud computing services to academic and research institutions, educational entities, and public sector agencies.The company offers a wide range of services, including:Unified Portal for Government Digital ServicesCloud Services for Research and EducationNetworking...


  • San Jose, California, United States Western Digital Full time

    Job DescriptionWestern Digital is seeking a highly skilled Site Reliability Engineer - DevOps to join our team. As a key member of our engineering process, you will be responsible for delivering software development tools and infrastructure that empower our engineering teams to develop and deliver high-quality products quickly.You will play a pivotal role in...


  • San Francisco, California, United States GRNET Full time

    About GRNETGRNET is a leading provider of advanced network and cloud computing services to academic and research institutions, educational entities, and public sector agencies in Greece.Our ApproachWe adopt a Site Reliability Engineering approach to ensure the reliability, scalability, and efficiency of our infrastructure and services.Our TeamOur SRE...


  • San Francisco, California, United States Apollo Solutions Full time

    Site Reliability EngineerApollo Solutions has partnered with a pioneering artificial intelligence business that is revolutionizing the use of AI/ML in gaming and security.The company is working closely with government contracts and gaming console companies and is seeking a Site Reliability Engineer to join their growing team.The Site Reliability Engineer...


  • San Francisco, California, United States GRNET S.A. Full time

    About GRNETGRNET S.A. is a leading provider of advanced network and cloud computing services to academic and research institutions, educational entities, and public sector agencies in Greece.Our ServicesWe offer a wide range of services, including:Unified Portal for all Government-related Digital ServicesCloud Services for Research and EducationNetworking...


  • San Francisco, California, United States Circle Full time

    About CircleCircle is a financial technology company at the forefront of the emerging internet of money, where value can flow freely and securely across borders.Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will design, build, and maintain Circle's cloud...


  • San Francisco, California, United States PicnicHealth Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at PicnicHealth. As a key member of our engineering team, you will be responsible for ensuring the reliability, efficiency, and architecture of our cloud, developer, and security operations.As a Senior SRE, you will take the lead in identifying and resolving...


  • San Francisco, California, United States Astranis Full time

    About the RoleAstranis is a pioneering company in the field of satellite technology, aiming to bridge the digital divide by connecting the four billion people worldwide who lack internet access. As a Senior Site Reliability Engineer for Ground Software Systems, you will play a crucial role in ensuring the reliability and availability of our mission-critical...


  • San Francisco, California, United States Infused Solutions Full time

    Senior Site Reliability EngineerInfused Solutions is seeking a highly skilled Senior Site Reliability Engineer to join their IT infrastructure team. Our client is a market leader in the San Francisco area, and we are looking for a talented individual with expertise in Microsoft Azure and a strong background in software engineering.Key Responsibilities:Design...


  • San Francisco, California, United States GoForward, Inc. Full time

    About ForwardForward is a pioneering healthcare company on a mission to make high-quality healthcare accessible to a billion people worldwide.We're building a cutting-edge healthcare platform from the ground up, combining innovative hardware, software, and medical expertise under one roof.Job SummaryWe're seeking a world-class Senior Software Engineer to...


  • San Francisco, California, United States Apollo Solutions Full time

    Principal Site Reliability EngineerApollo Solutions is seeking a highly skilled Principal Site Reliability Engineer to join our team in San Francisco or remotely. As a key member of our engineering team, you will be responsible for designing, building, and scaling high availability, low latency software environments.Responsibilities:Deliver complex projects...


  • San Francisco, California, United States Wasmer Full time

    About the RoleWe are seeking an exceptional Site Reliability Engineer to join our team at Wasmer. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining scalable and reliable infrastructure solutions for our Edge computing platform.Key ResponsibilitiesDesign and implement scalable and reliable infrastructure...


  • San Francisco, California, United States Apollo Solutions Full time

    Principal Site Reliability EngineerApollo Solutions is seeking a highly skilled Principal Site Reliability Engineer to join our team in San Francisco or remotely. As a key member of our organization, you will play a crucial role in driving the DevOps culture and advocating for automation, blameless postmortems, and high-performing production...


  • San Francisco, California, United States Diverse Lynx Full time

    About the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a key member of our organization, you will play a critical role in ensuring the reliability and efficiency of our digital infrastructure.Key Responsibilities:Design and implement reliable digital infrastructure solutionsCollaborate with...


  • San Francisco, California, United States Astranis Full time

    About AstranisAstranis is a pioneering company on a mission to bridge the digital divide by connecting the four billion people worldwide who currently lack internet access. We're doing this by building the next generation of smaller, more cost-effective spacecraft to bring the world online.Our TeamWe've launched two satellites into orbit, signed ten...