Senior Software Engineer
4 days ago
In 2012, Lambda started with a crew of AI engineers publishing research at top machine-learning conferences. We began as an AI company built by AI engineers. That hasn't changed. Today, we're on a mission to be the world's top AI computing platform. We equip engineers with the tools to deploy AI that is fast, secure, affordable, and built to scale. Whether they need powerhouse GPU hardware on-site or the flexibility of cloud-based solutions, we've got the horsepower to make it happen. Lambda's AI Cloud has been adopted by the world's leading companies and research institutions including Anyscale, Rakuten, The AI Institute, and multiple enterprises with over a trillion dollars of market capitalization. Our goal is to make computation as effortless and ubiquitous as electricity.
If you'd like to build the world's best deep learning cloud, join us.
*Note: This position requires presence in our San Francisco office location 4 days per week; Lambda's designated work from home day is currently Tuesday.
About the RoleWe are seeking a Senior Software Engineer to join our Managed Kubernetes (Mk8s) team. This is a hybrid role that blends deep software engineering capabilities with Site Reliability Engineering (SRE) principles. You will play a crucial role in shaping the architecture, reliability, and automation of our Kubernetes-based infrastructure, which powers mission-critical workloads across our global platform.
What You'll DoSoftware Engineering
- Design, build, and maintain scalable control plane services, operators, and custom controllers for Kubernetes.
- Develop automation for cluster lifecycle management (provisioning, upgrades, patching, deletion).
- Develop internal tools, APIs, and command-line interfaces (CLIs) that enable customers and ML/AI teams to deploy and monitor inference services effectively.
- Write resilient systems that gracefully handle failure across large-scale distributed environments.
SRE & Operations
- Define and implement Service-Level Objectives (SLOs) and Service-Level Indicators (SLIs) for Kubernetes services, workloads, and the platform.
- Dive into systems at a low level to solve unique cluster problems and write up your findings.
- Assist customers with high-level Kubernetes questions and integration with applications, storage, and authentication.
- Assist with initial cluster build-outs and validation to help identify failed hardware before customer delivery.
- Work closely with our HPC Ops and Datacenter Ops teams on issues that require lower-level expertise or cross-functional solutions.
- Participate in a well-managed, sustainable on-call rotation.
- Have 6+ years of experience in software engineering or SRE roles, 3+ years leading large-scale complex projects, or tech lead.
- Experience tuning Kubernetes internals and writing operators (CRDs, CSI, CNI, etc.).
- Strong programming skills in Go and Python; experience with GitOps (e.g., ArgoCD), Helm, and Kubernetes operators.
- Experience operating Kubernetes clusters in production environments (e.g., EKS, GKE, on-prem).
- Deep understanding of SRE principles: incident response, chaos engineering, scaling, and reliability.
- Proficiency in observability tools (Prometheus, Grafana, FluentBit, etc.).
- Experience with infrastructure-as-code tools (Terraform, Pulumi) and CI/CD pipelines.
- Solid knowledge of Linux systems, networking, containers, and cloud infrastructure.
- Deep Kubernetes expertise.
- Experience with user-level restrictions and hardening (e.g. AppArmor).
- Experience with HPC clusters, environments & tooling.
- Experience with large-scale AI/ML training clusters.
- Experience with machine learning/AI frameworks.
- Expertise with hybrid or multi-cloud Kubernetes environments.
- Familiarity with GPU, Infiniband, or high-performance computing on K8s.
- Past contributions to CNCF projects or Kubernetes SIGs a plus.
If you don't meet all of these requirements but believe you may be a good fit, please still apply and provide a cover letter that helps us understand your experience and readiness for this role.
Salary Range Information
Based on market data and other factors, the annual salary range for this position is $255,000 - $405,000. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.
About Lambda
- Founded in 2012, ~350 employees (2024) and growing fast.
- We offer generous cash & equity compensation.
- Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove.
- We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability.
- Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG.
- Health, dental, and vision coverage for you and your dependents.
- Commuter/Work from home stipends for select roles.
- 401k Plan with 2% company match (USA employees).
- Flexible Paid Time Off Plan that we all actually use.
A Final Note:
You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.
Equal Opportunity Employer
Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.
-
Senior Software Engineer
1 week ago
San Diego, CA, United States Top Engineer Full timeTOP ENGINEER JOB POST!!! CONFIDENTIAL SEARCH FOR AN AEROSPACE LEADER Industry: Aerospace / Defense / Software Degree: BS in CS, CE, or EE (MS Preferred) Experience: 5-15 Years Role: Senior Embedded Software Engineer Join an established company with great technology to design and develop high-reliability embedded software for cutting-edge, space-based...
-
Senior Software Engineer
1 week ago
San Diego, CA, United States Top Engineer Full timeTOP ENGINEER JOB POST!!! CONFIDENTIAL SEARCH FOR AN AEROSPACE LEADER Industry: Aerospace / Defense / Software Degree: BS in CS, CE, or EE (MS Preferred) Experience: 5-15 Years Role: Senior Embedded Software Engineer Join an established company with great technology to design and develop high-reliability embedded software for cutting-edge, space-based...
-
Senior Software Engineer, Platform
4 days ago
San Francisco, CA, United States BEACON SOFTWARE COMPANY Full timeBeacon Software is a permanent capital holding company which acquires and grows essential businesses. We are a profitable series B+ firm that combines great technologists, operators and M&A professionals to accelerate the scale of the ambition of the dozens of businesses we own and operate. We are supported by capital from tier-1 venture capital, crossover,...
-
Senior Software Engineer
9 hours ago
San Francisco, CA, United States Cypress HCM Full timeGet AI-powered advice on this job and more exclusive features. This range is provided by Cypress HCM. Your actual pay will be based on your skills and experience talk with your recruiter to learn more. Base pay range $80.00/hr - $88.00/hr Direct message the job poster from Cypress HCM We have an exciting opportunity for a Senior Software Engineer with the...
-
Senior ASIC Engineer
2 weeks ago
San Jose, CA, United States Top Engineer Full timeTOP ENGINEER JOB POST!!! Confidential Search for International Employer Industry: Electronics / Semiconductors Degree: BSEE Required (MSEE Preferred) Experience: 10+ years with Full ASIC/SoC Lifecycle CUTTING-EDGE CUSTOM ASICs & SOCs FOR EMERGING TECHNOLOGIES Role: Senior ASIC Engineer - ARM-Based Systems Join a cutting-edge developer of custom ASICs...
-
Senior Machine Learning Engineer
2 weeks ago
San Francisco, CA, United States Top Engineer Full timeTOP ENGINEER JOB POST!!! Confidential Search for International Employer Industry: Social Commerce / AI Technology Degree: BS in Computer Science or Mathematics from Top 40 University Experience: 4-8 years in Production ML Systems AI-POWERED SOCIAL COMMERCE REVOLUTION Role: Senior Machine Learning Engineer - Multimodal AI Join a leading partner in social...
-
Senior Software Engineer
2 days ago
San Francisco, CA, United States Peregrine Corporation Full timeJoin to apply for the Senior Software Engineer role at Peregrine. Peregrine supports public safety agencies across the country, empowering public servants to improve operations and make better decisions. Today, our technology is used by customers to serve more than 30 million Americans. We are a team of public service entrepreneurs who are passionate about...
-
Senior Software Engineer
2 weeks ago
San Francisco, CA, United States Amadeus Search Full timeSenior Software Engineer - Generative AI Full-Time | On-Site | San Francisco, CA Salary: $190,000 - $280,000 + equity (flexible) Visa: Green card and visa support available Hiring: 2 engineers About the Role We're hiring a Senior Software Engineer to join a high-growth AI startup focused on building cutting-edge generative AI tools for the legal domain and...
-
Senior Software engineer
1 week ago
San Francisco, CA, United States Zenex Partners Full timeSenior Software Engineer Location:- San Francisco, CA Duration:- 6+ Months Pay rate:- $83.33/hr W2 Responsibilities of Sr. Software Engineer Build reliable & scalable backend services to build secure, compelling & easy-to-use homes for stable assets in the crypto economy. Passionate about building an open financial system that brings the world together....
-
Senior Software engineer
2 days ago
San Francisco, CA, United States Zenex Partners Full timeSenior Software Engineer Location:- San Francisco, CA Duration:- 6+ Months Pay rate:- $83.33/hr W2 Responsibilities of Sr. Software Engineer Build reliable & scalable backend services to build secure, compelling & easy-to-use homes for stable assets in the crypto economy. Passionate about building an open financial system that brings the world together....