Platform Engineer, Model Shaping
3 weeks ago
About Model ShapingThe Model Shaping team at Together AI works on products and research for tailoring open foundation models to downstream applications. We build services that allow machine learning developers to choose the best models for their tasks and further improve these models using domain-specific data. In addition to that, we develop new methods for more efficient model training and evaluation, drawing inspiration from a broad spectrum of ideas across machine learning, natural language processing, and ML systems.About the RoleAs a Platform Engineer at Model Shaping, you will work on the foundational layers of Together’s platform for model customization and evaluation. You will design the infrastructure and backend services that will allow us to sustainably and reliably scale the systems powering production workflows launched by our users, as well as internal research experiments.You will operate in a cross-functional environment, collaborating with other engineers and researchers in the team to improve the infrastructure based on the needs of projects they work on. You will also interact with other engineering teams at Together (such as Commerce, Data Engineering, and Cloud Infrastructure) to integrate the services developed by Model Shaping with systems developed by those teams.ResponsibilitiesDesign and build Together’s systems and infrastructure for model customization, including user-facing features and internal improvementsContribute to reliability improvements for the platform, participating in an on-call rotation and improving processes for incident responseCreate and improve internal tooling for deployment, continuous integration, and observabilityBuild a job orchestration platform spanning multiple data centers, supporting a highly heterogeneous hardware landscapePartner with teams developing internal services, co-designing these services and incorporating them in systems built by Model ShapingRequirements3+ years of experience in building infrastructure or backend components of production servicesComfortable with the fundamentals of Linux environments and modern container/orchestration stacks (e.g., Docker and Kubernetes)Strong software engineering background in Python or GoExperienced with infrastructure automation tools (Terraform, Ansible), monitoring/observability stacks (Prometheus, Grafana), and CI/CD pipelines (GitHub Actions, ArgoCD)Skilled with analyzing non-trivial issues of complex software systems and documenting your findingsHave cloud environment (e.g., AWS/GCP/Azure) administration experience, preferably with a hybrid bare-metal/cloud environmentStrong communication skills, willing to document systems and processes and collaborate with peers of varying technical expertiseStand-out experienceDeveloping large-scale production systems with high reliability requirementsPipeline orchestration frameworks (e.g., Kubeflow, Argo Workflows, Flyte)Managing GPU workloads on HPC clusters, ideally with hands-on experience in operating NVIDIA’s networking stack (e.g., NCCL, Mellanox firmware, GPUDirect RDMA)Deployment of services for AI training or inferenceMaintaining or contributing to open-source projectsAbout Together AITogether AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, RedPajama, SWARM Parallelism, and SpecExec. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure.CompensationWe offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is $200,000 - $290,000. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge.Equal OpportunityTogether AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.Please see our privacy policy at https://www.together.ai/privacy #J-18808-Ljbffr
-
Platform Engineer, Model Shaping
3 weeks ago
San Francisco, United States Together AI Full timeAbout Model Shaping The Model Shaping team at Together AI works on products and research for tailoring open foundation models to downstream applications. We build services that allow machine learning developers to choose the best models for their tasks and further improve these models using domain-specific data. In addition to that, we develop new methods...
-
Platform Engineer, Model Shaping
4 weeks ago
San Francisco, United States Together AI Full timeAbout Model Shaping The Model Shaping team at Together AI works on products and research for tailoring open foundation models to downstream applications. We build services that allow machine learning developers to choose the best models for their tasks and further improve these models using domain-specific data. In addition to that, we develop new methods...
-
Platform Engineer, Model Shaping
2 weeks ago
San Francisco, United States Together Full timeThe Model Shaping team at Together AI works on products and research for tailoring open foundation models to downstream applications. We build services that allow machine learning developers to choose the best models for their tasks and further improve these models using domain-specific data. In addition to that, we develop new methods for more efficient...
-
Software Engineer, Scientific Models
4 weeks ago
San Francisco, United States Benchling Full timeBiotechnology is rewriting life as we know it, from the medicines we take, to the crops we grow, the materials we wear, and the household goods that we rely on every day. But moving at the new speed of science requires better technology. Benchling's mission is to unlock the power of biotechnology. The world's most innovative biotech companies use Benchling's...
-
Software Engineer, Scientific Models
2 weeks ago
San Francisco, United States Benchling Full timeBiotechnology is rewriting life as we know it, from the medicines we take, to the crops we grow, the materials we wear, and the household goods that we rely on every day. But moving at the new speed of science requires better technology. Benchling's mission is to unlock the power of biotechnology. The world's most innovative biotech companies use Benchling's...
-
Software Engineer, Scientific Models
3 days ago
San Francisco, CA, United States Benchling Full timeBiotechnology is rewriting life as we know it, from the medicines we take, to the crops we grow, the materials we wear, and the household goods that we rely on every day. But moving at the new speed of science requires better technology. Benchling's mission is to unlock the power of biotechnology. The world's most innovative biotech companies use Benchling's...
-
Software Engineer, Scientific Models
2 weeks ago
San Francisco, CA, United States Benchling Full timeBiotechnology is rewriting life as we know it, from the medicines we take, to the crops we grow, the materials we wear, and the household goods that we rely on every day. But moving at the new speed of science requires better technology. Benchling's mission is to unlock the power of biotechnology. The world's most innovative biotech companies use Benchling's...
-
Software Engineer, Scientific Models
5 days ago
San Francisco, CA, United States Benchling Full timeBiotechnology is rewriting life as we know it, from the medicines we take, to the crops we grow, the materials we wear, and the household goods that we rely on every day. But moving at the new speed of science requires better technology. Benchling's mission is to unlock the power of biotechnology. The world's most innovative biotech companies use Benchling's...
-
San Francisco, United States Serval Full timeA progressive tech company in San Francisco is seeking a Senior Fullstack Engineer to play a pivotal role in shaping their AI platform for IT teams. The successful candidate will build foundational technology, ensuring user experience and establishing best practices within the engineering team. A degree in Computer Science and 4+ years of engineering...
-
Senior Machine Learning Engineer
2 weeks ago
San Francisco, United States Top Engineer Full timeTOP ENGINEER JOB POST!!! Confidential Search for International Employer Industry: Social Commerce / AI Technology Degree: BS in Computer Science or Mathematics from Top 40 University Experience: 4-8 years in Production ML Systems AI-POWERED SOCIAL COMMERCE REVOLUTION Role: Senior Machine Learning Engineer - Multimodal AI Join a leading partner in social...