Principal Site Reliability Engineer

1 month ago


California, United States JobBoard.io Full time

By making evidence the heart of security, we help customers stay ahead of ever-changing cyber-attacks.

Corelight is a cybersecurity company that transforms network and cloud activity into evidence that elite defenders use to proactively hunt for threats, accelerate response to cyber incidents, gain complete network visibility, and create powerful analytics using machine learning and behavioral analysis tools. Easily deployed and available in traditional and SaaS-based formats, Corelight is the fastest-growing Network Detection and Response (NDR) platform in the industry. We are the only NDR platform that leverages the power of open source projects in addition to our own technology to deliver Intrusion Detection (IDS), Network Security Monitoring (NSM), and Smart PCAP solutions. We sell to some of the most sensitive, mission-critical large enterprises and government agencies in the world.

In this role, you will lead the design and architecture of Corelight SaaS Operations, focusing on developing and maintaining the systems, automation, and infrastructure that support the Corelight mission and Corelight SaaS offerings. You will work closely with other teams in Engineering, Customer Success, and our Product organization.

Your Role and Responsibilities

  • Architecture & Design
    • Lead improvements in architecture and design, and facilitate various tests and reviews of our code, products, services, and infrastructure
    • Drive the overall Corelight SaaS Cloud architecture, collaborating closely with Engineering, Product, and other technical leaders
    • Provide expertise and assistance on cloud architecture and APIs
  • SaaS Operations
    • Implement and enhance SaaS Operations, focusing on cost efficiency, monitoring, and change management controls
    • Develop and apply best practices for automation, disaster recovery, and system resilience
    • Evaluate new projects and design changes for security implications, and work with design teams to balance value, impact, and effort
  • Cloud Infrastructure
    • Engage in hands-on, in-depth analysis, review, and design of cloud infrastructure, ensuring high availability, resilience, and adherence to stringent SLOs

Qualifications

  • Education & Experience
    • Bachelor's or Master's degree in Computer Science, Engineering, or a related field
    • 10+ years of experience in cloud infrastructure, cybersecurity, or a related field, with significant experience in AWS/GCP/Azure
  • Technical Skills
    • Deep expertise in AWS services, including but not limited to EC2, S3, RDS, Lambda, ECS/EKS, Glue, EMR, Redshift, OpenSearch, and VPC
    • Proficiency in programming languages such as Python, Golang, SQL, Bash, or Scala
    • Strong experience with big data technologies and frameworks such as Hadoop, Spark, Graph DB, or Kafka
    • Strong experience with data integration tools and ETL processes
    • Experience with Infrastructure as Code (IaC) tools such as Terraform, CloudFormation, or Ansible
    • Knowledge of containerization and orchestration tools like Docker and Kubernetes
    • Strong experience with CI/CD tools like Jenkins, GitLab CI, or CircleCI
    • Experience in architecting, building, and scaling platforms and distributed systems that require high availability and resilience and must meet stringent SLOs
  • Soft Skills
    • Excellent leadership and team management skills
    • Strong problem-solving and analytical skills
    • Exceptional communication and collaboration abilities
    • Ability to work in a fast-paced, dynamic environment and manage multiple priorities effectively

Preferred Qualifications

  • Experience with data security and compliance frameworks
  • AWS Certified Big Data Specialty or other relevant AWS certifications
  • Familiarity with machine learning and data science workflows
  • Experience with observability tools such as Prometheus, Grafana, ELK stack, or Datadog

Location: This position is remote, and the candidate must be in the Pacific Time Zone.

We are proud of our culture and values: driving diversity of background and thought, low-ego results, applied curiosity, and tireless service to our customers and community. Corelight is committed to a geographically dispersed yet connected employee base, with employees working from home and office locations around the world. Fueled by an accelerating revenue stream and investments from top-tier venture capital organizations such as CrowdStrike, Accel, and Insight, we are rapidly expanding our team.

Check us out at www.corelight.com
