Reliability Engineering Specialist
2 weeks ago
Position: Site Reliability Engineer
Company: Charter Global
Contract Type: Temporary
Key Responsibilities:
- Provide guidance to architecture and development teams to enhance application availability, reliability, and performance on a global scale.
- Collaborate with architecture teams to ensure that operability, measurability, and manageability are integrated into business features and enablers.
- Work alongside product owners and managers to establish and track essential metrics that align with Service Level Objectives (SLOs) and Service Level Agreements (SLAs).
- Engage with development team members to diagnose and resolve technical issues.
- Lead Root Cause Analysis for production incidents and other failures within the software, pipeline, or related DevOps processes and technologies.
- Additional tasks may be assigned based on specific role requirements.
Required Qualifications:
- Minimum of 7 years of experience in Automation Programming using one or more scripting languages such as Python, Go, Java, Ruby, Rust, or JavaScript, with a preference for Python and Go. Note that Bash is not considered a programming language.
- At least 7 years of experience utilizing Linux terminal tools and crafting shell scripts in a Linux environment.
- Comprehensive understanding of public cloud service principles.
- In-depth knowledge of Unix/Linux operating systems, including administration (experience with Debian is preferred but not mandatory).
- Strong grasp of networking concepts (e.g., TCP/IP, routing, network architectures, and hardware), storage solutions, and database management systems.
- Extensive experience in debugging, optimizing code, and automating repetitive tasks.
Additional skills and experience may be necessary depending on the specific roles.
-
Infrastructure Reliability Specialist
2 weeks ago
California, Missouri, United States Insight Global Full timePosition OverviewWe are seeking an experienced Infrastructure Reliability Specialist to join our dynamic team. This role is crucial for maintaining the reliability and performance of our systems in a fast-paced environment.Key ResponsibilitiesManage and optimize cloud infrastructure using AWS services, including EKS and IAM.Implement and maintain Kubernetes...
-
Reliability and Maintainability Engineer
2 weeks ago
California, Missouri, United States Amazon Full timePosition Overview:The Reliability and Maintainability Engineer plays a crucial role in ensuring the performance and longevity of systems within Amazon's Kuiper Government Solutions (KGS). This position is focused on enhancing the reliability, availability, and maintainability of both space-based and terrestrial systems.Key Responsibilities:Lead the RAM...
-
Kubernetes Reliability Engineer
1 week ago
California, United States Bayside Solutions Full timeKubernetes Site Reliability EngineerW2 ContractSalary Range: $124,800 - $145,600 per yearLocation: Cupertino, CA - Hybrid RolePosition Overview:As a Kubernetes Site Reliability Engineer, you will play a crucial role in ensuring the reliability and performance of our cloud-based systems. Your primary responsibility will be to maintain high availability,...
-
Kubernetes Reliability Engineer
1 week ago
California, United States Bayside Solutions Full timeKubernetes Site Reliability EngineerW2 ContractSalary Range: $124,800 - $145,600 per yearLocation: Cupertino, CA - Hybrid RoleJob Overview:As a Kubernetes Site Reliability Engineer, you will play a crucial role in managing essential cloud infrastructure to ensure uninterrupted service, facilitate seamless scaling, and enable the deployment of innovative...
-
Reliability Engineering Manager
1 week ago
Sacramento, California, United States Two95 International Inc. Full timePosition: Reliability Engineering Manager Location: Remote Type: Fulltime Salary: Competitive PRIMARY RESPONSIBILITIES: The Reliability Engineering Manager will ensure that reliability strategies are integrated into overarching IT objectives and that performance expectations are clearly articulated. The manager will collaborate with both business and IT...
-
Kubernetes Reliability Engineer
1 week ago
California, United States Bayside Solutions Full timeKubernetes Site Reliability EngineerW2 ContractSalary Range: $124,800 - $145,600 per yearLocation: Cupertino, CA - Hybrid RolePosition Overview:The role involves overseeing essential cloud infrastructures to ensure continuous availability, facilitate seamless scaling, and support the development of new applications and services. We are seeking a driven...
-
Kubernetes Reliability Engineer
1 week ago
California, United States Bayside Solutions Full timeKubernetes Site Reliability EngineerW2 ContractSalary Range: $124,800 - $145,600 per yearLocation: Cupertino, CA - Hybrid RolePosition Overview:The primary responsibility of this role is to oversee critical cloud infrastructure, ensuring consistent uptime, facilitating seamless scaling, and enabling the development of new applications and services. We are...
-
Kubernetes Reliability Engineer
1 week ago
California, United States Bayside Solutions Full timeKubernetes Site Reliability EngineerW2 ContractSalary Range: $124,800 - $145,600 per yearLocation: Cupertino, CA - Hybrid RolePosition Overview:The role involves overseeing essential cloud infrastructures to ensure uninterrupted service, facilitate seamless scaling, and enable the development of new applications and services. We seek a driven engineer who is...
-
Kubernetes Reliability Engineer
1 week ago
California, United States Bayside Solutions Full timeKubernetes Site Reliability EngineerW2 ContractSalary Range: $124,800 - $145,600 per yearLocation: Cupertino, CA - Hybrid RolePosition Overview:The role involves overseeing essential cloud infrastructure to ensure uninterrupted service, facilitate seamless scaling, and enable the development of new applications and services. We seek a driven engineer who is...
-
Cloud Infrastructure Reliability Engineer
2 weeks ago
California, Missouri, United States Insight Global Full timePosition Title: Site Reliability Engineer (AWS/Kubernetes/Python/Terraform)Job Overview:A leading media organization is in search of skilled Site Reliability Engineers to enhance their streaming operations. This role demands extensive expertise in AWS, Kubernetes, Terraform, and Python, contributing to a permanent role within the company.Key...
-
Site Reliability Engineering Manager
3 days ago
Sacramento, California, United States Two95 International Inc. Full timeJob SummaryWe are seeking a highly skilled Site Reliability Engineering Manager to join our team at Two95 International Inc. as a key member of our IT department. The successful candidate will be responsible for ensuring the reliability and efficiency of our IT systems and infrastructure.Key ResponsibilitiesReliability Program Development: Work with the...
-
Principal Site Reliability Engineer
1 week ago
California, Missouri, United States Bitwarden Inc. Full timeAbout BitwardenBitwarden empowers organizations, developers, and individuals to securely manage and share sensitive information. With a transparent, open-source approach to password management, secrets management, and innovations in passwordless and passkey technologies, Bitwarden simplifies the implementation of robust security practices across all online...
-
Senior Site Reliability Engineer-Federal
7 hours ago
California, United States Zscaler Full timeOur Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185 countries. Bring your...
-
Cloud Infrastructure Reliability Engineer
2 weeks ago
California, Missouri, United States Insight Global Full timePosition Overview: A leading media organization is in search of a dedicated team of Site Reliability Engineers to enhance their streaming services. This role demands extensive expertise in cloud technologies, particularly AWS, alongside proficiency in Kubernetes, Terraform, and Python.Key Responsibilities: - Demonstrate robust experience as a Site...
-
Reliability Engineer IV
6 days ago
Baldwin Park, California, United States Caelux Corporation Full timeAbout Caelux CorporationCaelux Corporation is a pioneering leader in the field of perovskite solar cell technology, committed to revolutionizing the renewable energy sector with cutting-edge innovations.We are at the forefront of developing full-scale (1x2m) high-efficiency, cost-effective solar solutions and are looking for a visionary Vice President of...
-
Infrastructure Reliability Specialist
1 week ago
California, United States Charter Global Full timePosition: Site Reliability EngineerCompany: Charter GlobalContract Type: Temporary EngagementKey Responsibilities:Provide guidance to architecture and development teams to enhance application availability, reliability, and performance on a global scale.Collaborate with architecture teams to ensure that operability, measurability, and manageability are...
-
Staff Site Reliability Engineer Cloud Platform
2 months ago
California, United States Zilliz Full timeWhat you will do: Work at the intersection of development and site reliability. Creating SRE tools and systems, as well as supporting existing infrastructure and platforms. Ensure the reliability, availability, and performance of Zilliz’s distributed database systems. Develop and implement strategies for monitoring, incident management, and disaster...
-
Staff Site Reliability Engineer Cloud Platform
1 month ago
California, United States Zilliz Full timeWhat you will do: Work at the intersection of development and site reliability. Creating SRE tools and systems, as well as supporting existing infrastructure and platforms. Ensure the reliability, availability, and performance of Zillizs distributed database systems. Develop and implement strategies for monitoring, incident management, and disaster...
-
Semiconductor Reliability Testing Engineer
2 weeks ago
Milpitas, California, United States Micross Components Full timeMicross Components is a prominent global supplier of specialized electronic components tailored for military, aerospace, medical, and rigorous industrial applications. As a comprehensive source for high-reliability and cutting-edge electronics, Micross offers a diverse range of solutions, including bare die and wafer processing, advanced custom packaging,...
-
Senior Reliability Engineer
1 day ago
Baldwin Park, California, United States Caelux Corporation Full timeAbout Caelux Corporation:Caelux Corporation stands at the forefront of innovation in perovskite solar cell technology, dedicated to transforming the renewable energy landscape through advanced solutions. We are engaged in the development of large-scale (1x2m) high-efficiency, cost-effective solar technologies and are seeking a skilled Senior Reliability...