Sr Site Reliability Engineer

2 weeks ago


Irvine, CA, United States Ledgent Tech Full time

Job Description

No Corp-to-Corp, No 3rd party firms
.
Hiring on behalf of our client: Sr Site Reliability Engineer (SRE)
??Location: 100% onsite in Irvine, CA (local canddiates will be considered)??Employment Type: Direct-hire
??Salary Range: $150,000 to $180,000 (based on level of experience)

.

Partnered with a client who is at the forefront of the future innovation hub of next-generation networking, IoT smart home products, and software services. Be a part of a pivotal time in propelling the global ventures. Join their mission in shaping a technology-driven future.

They're looking for passionate and experienced Sr Site Reliability Engineers to join the team and play a crucial role in ensuring their cloud platform's security, reliability, scalability, and operational excellence.

.

Responsibilities:

  • Serve as technical contributor for implementing and operating Microservices on Kubernetes cloud-based platforms. Collaborate with the Cloud Technical Development and DevOps teams to deploy services to the Multi-Cloud Platform.
  • Performing Load Tests and Chaos Tests to ensure the scalability and reliability of microservices.
  • Observability for Microservices and cloud platforms like AWS (preferred), or OCI or Azure.
  • Disaster recovery plans in collaboration with the Development and DevOps team.
  • Analyze and resolve production risks caused by insufficient resources, such as node groups, CPU, memory, HPA scheduling, JVM pre-warming, etc.
  • Write and maintain scripts for automation using languages like Python, Go, or Bash.
  • Define and maintain the KPIs (SLA/SLO/SLI) for all cloud microservices with development teams to better understand the business.
  • Security and compliance standards, including ISO27001, SOC2, and GDPR.
  • Incident response efforts to troubleshoot and resolve production issues quickly.
  • Perform post-incident analysis to identify root causes and potential workarounds/solutions.
  • Other duties as assigned
.

Requirements:
  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • 7+ years of experience as a Site Reliability Engineer.
  • Proficiency in programming and scripting in Python.
  • Hands-on experience within a heavy Cloud environment in SRE, DevOps, cloud operations, and cloud security best practices.
  • Strong experience in Kubernetes, Microservices architecture environment.
  • Strong knowledge of security technologies, including Identity and access management, Network security, Application security, and Data protection.
  • Strong problem-solving and analytical skills, with the ability to work independently and as part of a team.
  • Experience in developing and maintaining technical documentation and implementing compliance requirements.

.

.

All qualified applicants will receive consideration for employment without regard to race, color, national origin, age, ancestry, religion, sex, sexual orientation, gender identity, gender expression, marital status, disability, medical condition, genetic information, pregnancy, or military or veteran status. We consider all qualified applicants, including those with criminal histories, in a manner consistent with state and local laws, including the California Fair Chance Act, City of Los Angeles' Fair Chance Initiative for Hiring Ordinance, and Los Angeles County Fair Chance Ordinance. For unincorporated Los Angeles county , to the extent our customers require a background check for certain positions, the Company faces a significant risk to its business operations and business reputation unless a review of criminal history is conducted for those specific job positions.

Job Reference: JN -122025-410529

  • Irvine, CA, United States Amazon Full time

    Description Prime Video is a first-stop entertainment destination offering customers a vast collection of premium programming in one app available across thousands of devices. Prime members can customize their viewing experience and find their favorite movies, series, documentaries, and live sports - including Amazon MGM Studios-produced series and movies;...


  • Irvine, CA, United States Amazon Full time

    Description Prime Video is a first-stop entertainment destination offering customers a vast collection of premium programming in one app available across thousands of devices. Prime members can customize their viewing experience and find their favorite movies, series, documentaries, and live sports - including Amazon MGM Studios-produced series and movies;...


  • Irvine, CA, United States Amazon Full time

    Description Prime Video is a first-stop entertainment destination offering customers a vast collection of premium programming in one app available across thousands of devices. Prime members can customize their viewing experience and find their favorite movies, series, documentaries, and live sports - including Amazon MGM Studios-produced series and movies;...


  • Irvine, CA, United States Amazon Full time

    Description Prime Video is a first-stop entertainment destination offering customers a vast collection of premium programming in one app available across thousands of devices. Prime members can customize their viewing experience and find their favorite movies, series, documentaries, and live sports - including Amazon MGM Studios-produced series and movies;...


  • Irvine, CA, United States Cox Full time

    The Site Reliability Engineer - Incident Response is a critical enterprise-level role responsible for accelerating incident resolution and enhancing the overall incident management process. This individual partners with engineering teams during active incidents to troubleshoot issues using monitoring and logging tools, and post-incident, delivers...


  • Irvine, CA, United States Cox Full time

    The Site Reliability Engineer - Incident Response is a critical enterprise-level role responsible for accelerating incident resolution and enhancing the overall incident management process. This individual partners with engineering teams during active incidents to troubleshoot issues using monitoring and logging tools, and post-incident, delivers...


  • Irvine, CA, United States TP-Link North America, Inc. Full time

    At the forefront of the future of connected living, TP-Link's Systems Inc. R&D Center in Irvine, Southern California's innovation hub, spearheads research and development of next-generation networking, IoT smart home products, and software services. Our team of passionate engineers are constantly innovating, engineering solutions that transform the end user...


  • Irvine, CA, United States TP-Link North America, Inc. Full time

    At the forefront of the future of connected living, TP-Link's Systems Inc. R&D Center in Irvine, Southern California's innovation hub, spearheads research and development of next-generation networking, IoT smart home products, and software services. Our team of passionate engineers are constantly innovating, engineering solutions that transform the end user...


  • Irvine, CA, United States TP-Link North America, Inc. Full time

    At the forefront of the future of connected living, TP-Link's Systems Inc. R&D Center in Irvine, Southern California's innovation hub, spearheads research and development of next-generation networking, IoT smart home products, and software services. Our team of passionate engineers are constantly innovating, engineering solutions that transform the end user...


  • Irvine, CA, United States TP-Link North America, Inc. Full time

    At the forefront of the future of connected living, TP-Link's Systems Inc. R&D Center in Irvine, Southern California's innovation hub, spearheads research and development of next-generation networking, IoT smart home products, and software services. Our team of passionate engineers are constantly innovating, engineering solutions that transform the end user...