Cloud-Native Inference Infrastructure Engineer

2 weeks ago


San Jose, United States ByteDance Full time

A leading technology company in San Jose seeks a Software Engineer specialized in Inference Infrastructure to design and maintain cloud-native systems. The ideal candidate has experience in building large-scale ML infrastructure and proficiency in major programming languages like Go or Python. Join a globally collaborative environment and contribute to innovative AI solutions while enjoying competitive salary and comprehensive benefits.
#J-18808-Ljbffr



  • San Jose, United States Pangleglobal Full time

    Software Engineer Graduate (Inference Infrastructure) - 2026 Start (PHD)Location: San JoseTeam: TechnologyEmployment Type: RegularThe Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance’s Core Compute Infrastructure organization,...

  • Software Engineer

    3 days ago


    San Jose, CA, United States ByteDance Full time

    Responsibilitie About the Team The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms that power microservices, big data, distributed...

  • Software Engineer

    1 day ago


    San Jose, CA, United States ByteDance Full time

    Responsibilitie About the Team The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms that power microservices, big data, distributed...


  • San Jose, CA, United States ByteDance Full time

    Responsibilitie About the Team The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms that power microservices, big data, distributed...


  • San Jose, CA, United States ByteDance Full time

    Responsibilitie About the Team The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms that power microservices, big data, distributed...


  • San Jose, CA, United States ByteDance Full time

    Responsibilitie About the Team The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms that power microservices, big data, distributed...


  • San Jose, CA, United States ByteDance Full time

    Responsibilitie About the Team The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms that power microservices, big data, distributed...


  • San Mateo, United States Fireworks AI Full time

    About Fireworks AI At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We’ve been independently benchmarked as the leader in LLM inference speed and are driving cutting-edge innovation through projects like our own function...


  • San Jose, CA, United States eTeam Full time

    Cloud-Native Security Engineers are responsible for securing cloud-native applications and infrastructure across public, private, or hybrid cloud environments. They work closely with DevOps and development teams to implement security best practices in CI/CD pipelines, containerized environments (e.g., Docker, Kubernetes), and cloud platforms (e.g., AWS,...


  • San Jose, CA, United States ByteDance Full time

    Responsibilitie About the Team The Compute Infrastructure - Orchestration & Scheduling team uses Kubernetes and Serverless technologies to build a large, reliable, and efficient compute infrastructure. This infrastructure powers hundreds of large-scale clusters globally, with over millions of online containers and offline jobs daily, including AI and LLM...