Cloud-Native Inference Infrastructure Engineer
2 weeks ago
A leading technology company in San Jose seeks a Software Engineer specialized in Inference Infrastructure to design and maintain cloud-native systems. The ideal candidate has experience in building large-scale ML infrastructure and proficiency in major programming languages like Go or Python. Join a globally collaborative environment and contribute to innovative AI solutions while enjoying competitive salary and comprehensive benefits.
-
Software Engineer Graduate
4 weeks ago
San Jose, United States Pangleglobal Full time. Software Engineer Graduate (Inference Infrastructure) - 2026 Start (PhD). Location: San Jose. Team: Technology. Employment Type: Regular. The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance’s Core Compute Infrastructure organization,...
-
Software Engineer
3 days ago
San Jose, CA, United States ByteDance Full time. Responsibilities. About the Team: The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms that power microservices, big data, distributed...
-
Software Engineer
1 day ago
San Jose, CA, United States ByteDance Full time. Responsibilities. About the Team: The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms that power microservices, big data, distributed...
-
Software Engineer Intern
3 days ago
San Jose, CA, United States ByteDance Full time. Responsibilities. About the Team: The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms that power microservices, big data, distributed...
-
Software Engineer Graduate
2 days ago
San Jose, CA, United States ByteDance Full time. Responsibilities. About the Team: The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms that power microservices, big data, distributed...
-
Software Engineer Graduate
17 hours ago
San Jose, CA, United States ByteDance Full time. Responsibilities. About the Team: The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms that power microservices, big data, distributed...
-
Software Engineer Intern
3 days ago
San Jose, CA, United States ByteDance Full time. Responsibilities. About the Team: The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance's Core Compute Infrastructure organization, responsible for designing and operating the platforms that power microservices, big data, distributed...
-
Software Engineer, Cloud Infrastructure
4 weeks ago
San Mateo, United States Fireworks AI Full time. About Fireworks AI: At Fireworks, we’re building the future of generative AI infrastructure. Our platform delivers the highest-quality models with the fastest and most scalable inference in the industry. We’ve been independently benchmarked as the leader in LLM inference speed and are driving cutting-edge innovation through projects like our own function...
-
Cloud-Native Security Engineers
1 week ago
San Jose, CA, United States eTeam Full time. Cloud-Native Security Engineers are responsible for securing cloud-native applications and infrastructure across public, private, or hybrid cloud environments. They work closely with DevOps and development teams to implement security best practices in CI/CD pipelines, containerized environments (e.g., Docker, Kubernetes), and cloud platforms (e.g., AWS,...
-
Software Engineer - Compute Infrastructure
3 days ago
San Jose, CA, United States ByteDance Full time. Responsibilities. About the Team: The Compute Infrastructure - Orchestration & Scheduling team uses Kubernetes and Serverless technologies to build a large, reliable, and efficient compute infrastructure. This infrastructure powers hundreds of large-scale clusters globally, with millions of online containers and offline jobs daily, including AI and LLM...