Principal Cloud Engineering and Production Operations Engineer
3 weeks ago
A10 Networks, Inc

The Principal Cloud and Production Operations Engineer serves as the senior technical authority responsible for architecting, automating, and optimizing hybrid and cloud-native production environments that power critical customer-facing services and enterprise applications. This role combines deep cloud infrastructure expertise with strong production reliability and operational engineering skills. The Principal Engineer acts as both architect and hands-on builder, ensuring scalability, resilience, and security across multi-cloud and on-prem environments.

Reporting to the Associate Director of IT and Infrastructure, this position will collaborate closely with Engineering, DevOps, Security, and IT Operations to drive a culture of automation, observability, and continuous improvement across the production ecosystem.

Key Responsibilities

Cloud Architecture and Engineering
  Design, implement, and maintain cloud and hybrid infrastructure supporting production workloads, enterprise systems, and CI/CD pipelines
  Lead the adoption of infrastructure-as-code (IaC) using Terraform, CloudFormation, or similar tools to enable repeatable, auditable, and secure deployments
  Architect scalable and fault-tolerant solutions across OCI, AWS, Azure, and on-prem data centers, ensuring high availability and cost efficiency
  Evaluate emerging cloud services and technologies for applicability to business needs and long-term scalability goals

Production Operations and Reliability
  Serve as the technical lead for production operations, ensuring uptime, performance, and reliability of customer-facing and internal systems
  Develop and maintain observability frameworks leveraging metrics, logs, and traces to ensure proactive detection and rapid response
  Partner with engineering teams to implement SRE-inspired practices, including service level objectives (SLOs), error budgets, and post-incident reviews
  Drive root cause analysis, performance tuning, and continuous improvement of production services

Automation and CI/CD Enablement
  Collaborate with DevOps and application engineering teams to build and optimize automated deployment pipelines supporting frequent, low-risk releases
  Integrate security and compliance checks into CI/CD workflows to ensure production readiness and alignment with internal standards
  Design self-healing infrastructure and automated rollback mechanisms to reduce operational risk
  Ensure secure and reliable configuration management and environment orchestration using tools such as Ansible, Chef, or Puppet

Operational Governance and Collaboration
  Establish and enforce operational best practices for monitoring, patching, and change management across production systems
  Lead production readiness reviews for new releases and large-scale changes
  Collaborate with the Security and Compliance teams to ensure systems adhere to policy, hardening standards, and regulatory requirements
  Participate in and occasionally lead on-call rotations for critical production systems, ensuring rapid triage and resolution

Leadership and Mentorship
  Act as a technical mentor to cloud and infrastructure engineers, fostering a culture of knowledge sharing and engineering excellence
  Lead architectural reviews, design sessions, and capacity planning discussions
  Serve as a trusted advisor to management on cloud modernization, resilience engineering, and cost optimization strategies

Qualifications
  Bachelor’s degree in Computer Science, Information Systems, or related field; Master’s preferred
  10+ years of experience in cloud and infrastructure engineering, including 3+ years in a senior or principal role
  Expertise with OCI (preferred), AWS and/or Azure cloud services, including networking, compute, storage, and identity management
  Proven experience managing production-scale environments supporting mission-critical applications and services
  Strong proficiency in:
    Infrastructure-as-code (Terraform, CloudFormation)
    CI/CD and DevOps toolchains (Jenkins, GitLab, ArgoCD)
    Container orchestration (Kubernetes, Docker)
    Monitoring and observability platforms (Prometheus, Grafana, Datadog, ELK)
    Scripting and automation (Python, Bash, PowerShell)
  Solid understanding of security, compliance, and networking principles in hybrid environments
  Exceptional analytical, problem-solving, and incident management skills
  Demonstrated ability to lead complex, cross-functional initiatives from concept to execution

Preferred Experience
  Experience in high-availability SaaS or networking environments
  Knowledge of FinOps, cost optimization, and multi-cloud governance frameworks
  Familiarity with Zero Trust, identity federation, and cloud access security models
  Exposure to AI/ML infrastructure or data-driven pipelines is a plus

Why Join Us

This is a hands-on leadership opportunity to define the next generation of cloud and production operations within a high-impact technology environment. The Principal Cloud and Production Operations Engineer will directly influence the reliability, speed, and scalability of the company’s global technology platforms, ensuring operational excellence and innovation.

A10 Networks is an equal opportunity employer and a VEVRAA federal subcontractor. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability status, protected veteran status, or any other characteristic protected by law. A10 also complies with all applicable state and local laws governing nondiscrimination in employment.

Hybrid.

Targeted compensation guideline: $140,000 - $185,000. Compensation will vary based on a number of factors, including market demand for specific skills, role type, job level, and individual qualifications. Final salary offers are determined by considerations including, but not limited to, subject matter expertise, demonstrated skill level, relevant experience, geographic location, education, certifications, and training.
-
San Francisco, United States A10 Networks Full time
Principal Cloud Engineering and Production Operations Engineer
The Principal Cloud and Production Operations Engineer serves as the senior technical authority responsible for architecting, automating, and optimizing hybrid and cloud-native...
-
San Jose, United States A10 Networks Full time
Principal Cloud Engineering and Production Operations Engineer
The Principal Cloud and Production Operations Engineer serves as the senior technical authority responsible for architecting, automating, and optimizing hybrid and cloud-native production environments that power critical customer-facing services and enterprise applications. This role combines...
-
Principal Cloud
3 weeks ago
San Francisco, United States A10 Networks, Inc Full time
A leading technology company in San Francisco is looking for a Principal Cloud Engineering and Production Operations Engineer to architect and optimize their cloud environments. This role demands expertise in cloud infrastructure, automation, and production reliability. The ideal candidate will have over 10 years of relevant experience and proficiency in key...
-
Principal Cloud Platform Engineer
7 days ago
San Francisco, California, United States Stratitech Services LLC Full time $120,000 - $200,000 per year
Principal Cloud Platform Engineer (Serverless / AWS)
Location: SF Bay Area
Employment Type: Contract to Hire, Onsite 2 Days a Week in San Bruno
About the Role
StratITech Services is seeking a Principal Cloud Platform Engineer (Serverless / AWS) for a confidential client building next-generation consumer and IoT technology. This role is software-centric, focused on...
-
San Francisco, California, United States A10 Networks, Inc Full time
This role acts as a hands-on technical lead, driving cloud engineering initiatives, automating infrastructure, and ensuring high-availability and performance across customer-facing systems. The Lead Engineer will collaborate with IT, DevOps, and Software Engineering teams to build secure, scalable environments that support continuous delivery and rapid...
-
Principal Software Engineer, Crusoe Cloud
2 weeks ago
San Francisco, United States Epoch Biodesign Full time
Location: San Francisco, CA - US
Employment Type: Full time
Location Type: On-site
Department: Cloud Engineering
Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI, without sacrificing scale, speed, or sustainability. Be part of the AI...
-
Principal Platform Security Engineer
4 weeks ago
San Francisco, CA, United States Nifty Gateway Studio Full time
Principal Platform Security Engineer (Cloud/K8S)
Gemini is a global crypto and Web3 platform founded by Cameron and Tyler Winklevoss in 2014, offering a wide range of simple, reliable, and secure crypto products and services to individuals and institutions in over 70 countries. Our mission is to unlock the next era of financial, creative, and personal...
-
Principal Platform Security Engineer
3 weeks ago
San Francisco, CA, United States Nifty Gateway Studio Full time
Principal Platform Security Engineer (Cloud/K8S)
If the following job requirements and experience match your skills, please ensure you apply promptly.
Location: New York, New York; San Francisco, California
About the Company
Gemini is a global crypto and Web3 platform founded by Cameron and Tyler Winklevoss in 2014, offering a wide range of simple,...
-
Principal Platform Security Engineer
4 weeks ago
San Francisco, CA, United States Gemini Full time
About the Company
Gemini is a global crypto and Web3 platform founded by Cameron and Tyler Winklevoss in 2014, offering a wide range of simple, reliable, and secure crypto products and services to individuals and institutions in over 70 countries. Our mission is to unlock the next era of financial, creative, and personal freedom by providing trusted access...
-
San Jose, United States A10 Networks Full time
Lead Cloud Engineering and Production Operations Engineer
Locations: San Jose, California
Time type: Full time
Posted on: Posted Yesterday
Job requisition ID: R-101214
This role acts as a hands-on technical lead, driving...