Cloud Native/Serverless Reliability Engineer

4 weeks ago


Sunnyvale, United States Alibaba Cloud Full time

Job Overview

The Alibaba Cloud Cloud Native Serverless Team is a leading innovation force within Alibaba Cloud, dedicated to empowering developers and enterprises with cutting-edge serverless technologies. Focused on building scalable, cost-efficient, and fully managed serverless solutions, the team drives the evolution of cloud-native architectures by abstracting infrastructure complexity and enabling seamless integration with modern application development paradigms. They deliver industry-leading serverless solutions that compete with AWS Lambda and other global cloud providers.

Responsibilities

  • Cloud Product Operations & Reliability: Oversee stability, performance tuning, and high-availability architecture for serverless components to ensure 24/7 reliability.
  • Containerized Lifecycle Management: Manage deployments, auto-scaling, upgrades, and resource optimization in serverless environments.
  • Incident Response & Root Cause Analysis: Lead troubleshooting of incidents related to serverless and cloud products, and develop diagnostic tools using Go/Rust.
  • Automation & Operational Excellence: Build automation tools and implement chaos engineering, capacity planning, and failover mechanisms.
  • Collaboration & Documentation: Work with teams on architecture design, create technical documentation, and standardize serverless operations.

Minimum Qualifications

  • Bachelor's degree or higher in Computer Science with 3+ years in SRE/serverless operations.
  • Deep understanding of SRE principles, reliability metrics, and diagnosing distributed system failures.
  • Excellent communication skills for cross-team collaboration and documentation.
  • Experience modifying cloud product source code for performance optimization; serverless experience preferred.
  • Kubernetes certifications (CKA/CKAD) or equivalent cloud provider certifications.

Additional Information

The pay range at the start of employment is expected to be between $104,400 and $171,000/year, with variations based on experience, location, and other factors. The position is at-will, with potential salary modifications.



  • Sunnyvale, CA, United States Alibaba Cloud Full time

    Alibaba Cloud Native Message Middleware Team is responsible for message products, including RocketMQ and other messaging products. We are committed to creating a more stable, user-friendly, streaming, and large-scale messaging platform for the future. Cloud Product Operations & Reliability: Oversee stability maintenance, performance tuning, and...


  • Sunnyvale, CA, United States Alibaba Cloud Full time $104,400 - $171,000 per year

    Mission of the Cloud Intelligence Group SRE Team: The mission of the Cloud Intelligence Group SRE (Site Reliability Engineering) Team is to ensure the stability of production environments, enterprise-grade cloud data reliability, and service continuity for the Cloud Intelligence Group. Our greatest challenge lies in guaranteeing uninterrupted business...


  • Sunnyvale, United States Ericsson Full time

    A leading technology provider is seeking a Senior Software Development Engineer to work on AIOps systems. This role involves collaborating with various teams to enhance monitoring applications and developing innovative solutions. Candidates need 8+ years of experience in software development, particularly in Python or Golang, and a strong background in...


  • Sunnyvale, CA, United States Ericsson Full time

    A leading technology provider is seeking a Senior Software Development Engineer to work on AIOps systems. This role involves collaborating with various teams to enhance monitoring applications and developing innovative solutions. Candidates need 8+ years of experience in software development, particularly in Python or Golang, and a strong background in...

