Staff AI Ops Engineer

2 weeks ago


San Jose, California, United States | Calix | Full time

Calix provides the cloud, software platforms, systems and services required for communications service providers to simplify their businesses, excite their subscribers and grow their value.

Calix is where passionate innovators come together with a shared mission: to reimagine broadband experiences and empower communities like never before. As a true pioneer in broadband technology, we ignite transformation by equipping service providers of all sizes with an unrivaled platform, state-of-the-art cloud technologies, and AI-driven solutions that redefine what's possible. Every tool and breakthrough we offer is designed to simplify operations and unlock extraordinary subscriber experiences through innovation.

Calix is seeking a highly skilled Staff AI Ops Engineer with hands-on GCP experience to join our cutting-edge AI/ML team. In this role, you will be responsible for building, scaling, and maintaining the infrastructure that powers our machine learning and generative AI applications. You will work closely with data scientists, ML engineers, and software developers to ensure our ML/AI systems are robust, efficient, and production-ready.

This is a remote-based position that can be located anywhere in the United States or Canada.

Key Responsibilities

  • Design, implement, and maintain scalable infrastructure for ML and GenAI applications
  • Deploy, operate, and troubleshoot production ML/GenAI pipelines/services
  • Build and optimize CI/CD pipelines for ML model deployment and serving
  • Scale compute resources across CPU/GPU architectures to meet performance requirements
  • Implement container orchestration with Kubernetes
  • Architect and optimize cloud resources on GCP for ML training and inference
  • Set up and maintain runtime frameworks and job management systems (Airflow, Kubeflow, MLflow, etc.)
  • Establish monitoring, logging and alerting for systems observability
  • Optimize system performance and resource utilization for cost efficiency
  • Develop and enforce AIOps best practices across the organization

Qualifications

  • Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience)
  • 8+ years of overall software engineering experience
  • 3+ years of focused experience in DevOps/AIOps or similar ML infrastructure roles
  • Proficiency in infrastructure as code (IaC) using Terraform
  • Strong experience with containerization and orchestration using Docker and Kubernetes
  • Demonstrated expertise in cloud infrastructure management on GCP
  • Proficiency with workflow management tools such as Airflow and Kubeflow
  • Strong CI/CD expertise with experience implementing automated testing and deployment pipelines
  • Experience scaling distributed compute architectures across CPU and GPU resources
  • Solid understanding of system performance optimization techniques
  • Experience implementing comprehensive observability solutions for complex systems
  • Knowledge of monitoring and logging tools (Prometheus, Grafana, ELK stack)
  • Strong proficiency in Python
  • Familiarity with ML frameworks such as PyTorch and ML platforms like Vertex AI
  • Excellent problem-solving skills and ability to work independently
  • Strong communication skills and ability to work effectively in cross-functional teams

The base pay range for this position varies based on the geographic location. More information about the pay range specific to candidate location and other factors will be shared during the recruitment process. Individual pay is determined based on location of residence and multiple factors, including job-related knowledge, skills and experience.

San Francisco Bay Area:
156, ,700 USD Annual

All Other US Locations:
136, ,000 USD Annual

As part of the total compensation package, this role may be eligible for a bonus.

