Reliability Engineering Specialist

2 weeks ago


California, United States Charter Global Full time

Position: Site Reliability Engineer

Company: Charter Global

Contract Type: Temporary

Key Responsibilities:

  • Provide guidance to architecture and development teams to enhance application availability, reliability, and performance on a global scale.
  • Collaborate with architecture teams to ensure that operability, measurability, and manageability are integrated into business features and enablers.
  • Work alongside product owners and managers to establish and track essential metrics that align with Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
  • Engage with development team members to diagnose and resolve technical issues.
  • Lead Root Cause Analysis for production incidents and other failures within the software, pipeline, or related DevOps processes and technologies.
  • Additional tasks may be assigned based on specific role requirements.

Required Qualifications:

  • Minimum of 7 years of experience in Automation Programming using one or more scripting languages such as Python, Go, Java, Ruby, Rust, or JavaScript, with a preference for Python and Go. Note that Bash is not considered a programming language.
  • At least 7 years of experience utilizing Linux terminal tools and crafting shell scripts in a Linux environment.
  • Comprehensive understanding of public cloud service principles.
  • In-depth knowledge of Unix/Linux operating systems, including administration (experience with Debian is preferred but not mandatory).
  • Strong grasp of networking concepts (e.g., TCP/IP, routing, network architectures, and hardware), storage solutions, and database management systems.
  • Extensive experience in debugging, optimizing code, and automating repetitive tasks.

Additional skills and experience may be necessary depending on the specific roles.



  • California, Missouri, United States Insight Global Full time

    Position OverviewWe are seeking an experienced Infrastructure Reliability Specialist to join our dynamic team. This role is crucial for maintaining the reliability and performance of our systems in a fast-paced environment.Key ResponsibilitiesManage and optimize cloud infrastructure using AWS services, including EKS and IAM.Implement and maintain Kubernetes...


  • California, Missouri, United States Amazon Full time

    Position Overview:The Reliability and Maintainability Engineer plays a crucial role in ensuring the performance and longevity of systems within Amazon's Kuiper Government Solutions (KGS). This position is focused on enhancing the reliability, availability, and maintainability of both space-based and terrestrial systems.Key Responsibilities:Lead the RAM...


  • California, United States Bayside Solutions Full time

    Kubernetes Site Reliability EngineerW2 ContractSalary Range: $124,800 - $145,600 per yearLocation: Cupertino, CA - Hybrid RolePosition Overview:As a Kubernetes Site Reliability Engineer, you will play a crucial role in ensuring the reliability and performance of our cloud-based systems. Your primary responsibility will be to maintain high availability,...


  • California, United States Bayside Solutions Full time

    Kubernetes Site Reliability EngineerW2 ContractSalary Range: $124,800 - $145,600 per yearLocation: Cupertino, CA - Hybrid RoleJob Overview:As a Kubernetes Site Reliability Engineer, you will play a crucial role in managing essential cloud infrastructure to ensure uninterrupted service, facilitate seamless scaling, and enable the deployment of innovative...


  • Sacramento, California, United States Two95 International Inc. Full time

    Position: Reliability Engineering Manager Location: Remote Type: Fulltime Salary: Competitive PRIMARY RESPONSIBILITIES: The Reliability Engineering Manager will ensure that reliability strategies are integrated into overarching IT objectives and that performance expectations are clearly articulated. The manager will collaborate with both business and IT...


  • California, United States Bayside Solutions Full time

    Kubernetes Site Reliability EngineerW2 ContractSalary Range: $124,800 - $145,600 per yearLocation: Cupertino, CA - Hybrid RolePosition Overview:The role involves overseeing essential cloud infrastructures to ensure continuous availability, facilitate seamless scaling, and support the development of new applications and services. We are seeking a driven...


  • California, United States Bayside Solutions Full time

    Kubernetes Site Reliability EngineerW2 ContractSalary Range: $124,800 - $145,600 per yearLocation: Cupertino, CA - Hybrid RolePosition Overview:The primary responsibility of this role is to oversee critical cloud infrastructure, ensuring consistent uptime, facilitating seamless scaling, and enabling the development of new applications and services. We are...


  • California, United States Bayside Solutions Full time

    Kubernetes Site Reliability EngineerW2 ContractSalary Range: $124,800 - $145,600 per yearLocation: Cupertino, CA - Hybrid RolePosition Overview:The role involves overseeing essential cloud infrastructures to ensure uninterrupted service, facilitate seamless scaling, and enable the development of new applications and services. We seek a driven engineer who is...


  • California, United States Bayside Solutions Full time

    Kubernetes Site Reliability EngineerW2 ContractSalary Range: $124,800 - $145,600 per yearLocation: Cupertino, CA - Hybrid RolePosition Overview:The role involves overseeing essential cloud infrastructure to ensure uninterrupted service, facilitate seamless scaling, and enable the development of new applications and services. We seek a driven engineer who is...


  • California, Missouri, United States Insight Global Full time

    Position Title: Site Reliability Engineer (AWS/Kubernetes/Python/Terraform)Job Overview:A leading media organization is in search of skilled Site Reliability Engineers to enhance their streaming operations. This role demands extensive expertise in AWS, Kubernetes, Terraform, and Python, contributing to a permanent role within the company.Key...


  • Sacramento, California, United States Two95 International Inc. Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineering Manager to join our team at Two95 International Inc. as a key member of our IT department. The successful candidate will be responsible for ensuring the reliability and efficiency of our IT systems and infrastructure.Key ResponsibilitiesReliability Program Development: Work with the...


  • California, Missouri, United States Bitwarden Inc. Full time

    About BitwardenBitwarden empowers organizations, developers, and individuals to securely manage and share sensitive information. With a transparent, open-source approach to password management, secrets management, and innovations in passwordless and passkey technologies, Bitwarden simplifies the implementation of robust security practices across all online...


  • California, United States Zscaler Full time

    Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your...


  • California, Missouri, United States Insight Global Full time

    Position Overview: A leading media organization is in search of a dedicated team of Site Reliability Engineers to enhance their streaming services. This role demands extensive expertise in cloud technologies, particularly AWS, alongside proficiency in Kubernetes, Terraform, and Python.Key Responsibilities: - Demonstrate robust experience as a Site...


  • Baldwin Park, California, United States Caelux Corporation Full time

    About Caelux CorporationCaelux Corporation is a pioneering leader in the field of perovskite solar cell technology, committed to revolutionizing the renewable energy sector with cutting-edge innovations.We are at the forefront of developing full-scale (1x2m) high-efficiency, cost-effective solar solutions and are looking for a visionary Vice President of...


  • California, United States Charter Global Full time

    Position: Site Reliability EngineerCompany: Charter GlobalContract Type: Temporary EngagementKey Responsibilities:Provide guidance to architecture and development teams to enhance application availability, reliability, and performance on a global scale.Collaborate with architecture teams to ensure that operability, measurability, and manageability are...


  • California, United States Zilliz Full time

    What you will do: Work at the intersection of development and site reliability. Creating SRE tools and systems, as well as supporting existing infrastructure and platforms. Ensure the reliability, availability, and performance of Zilliz’s distributed database systems. Develop and implement strategies for monitoring, incident management, and disaster...


  • California, United States Zilliz Full time

    What you will do: Work at the intersection of development and site reliability. Creating SRE tools and systems, as well as supporting existing infrastructure and platforms. Ensure the reliability, availability, and performance of Zillizs distributed database systems. Develop and implement strategies for monitoring, incident management, and disaster...


  • Milpitas, California, United States Micross Components Full time

    Micross Components is a prominent global supplier of specialized electronic components tailored for military, aerospace, medical, and rigorous industrial applications. As a comprehensive source for high-reliability and cutting-edge electronics, Micross offers a diverse range of solutions, including bare die and wafer processing, advanced custom packaging,...


  • Baldwin Park, California, United States Caelux Corporation Full time

    About Caelux Corporation:Caelux Corporation stands at the forefront of innovation in perovskite solar cell technology, dedicated to transforming the renewable energy landscape through advanced solutions. We are engaged in the development of large-scale (1x2m) high-efficiency, cost-effective solar technologies and are seeking a skilled Senior Reliability...