Large Scale System Architect

2 days ago


San Francisco, California, United States CentML Full time
About This Opportunity

We are seeking an experienced system architect to join our team and lead the design and development of our large-scale machine learning platform. As a key member of our team, you will be responsible for designing and building a unified solution that brings our innovative in-house technologies into a single, cohesive platform.

Job Responsibilities:
  • Designing and developing the CentML platform.
  • Developing solutions for scheduling large-scale ML training and inference workloads on GPU clusters over multiple cloud service providers.
  • Communicating with our product teams and defining use cases, as well as developing methodology and benchmarks to evaluate different approaches.
Requirements:
  • Bachelor's degree in Computer Science, Computer Engineering, or relevant technical field, or equivalent practical experience. A graduate degree with research experience is a plus.
  • Experience building large-scale systems from scratch. Prior experience in container-based deployment systems like Kubernetes is a big plus.
  • Strong coding skills in at least one of Python and C++.
  • Solid fundamentals in other computer science and computer engineering topics: algorithms and data structures, operating systems, computer architecture, etc.
Compensation Package:

$150,000 - $220,000 per year, depending on location and experience, plus benefits including employee stock options, best-in-class medical and dental benefits, parental leave top-up for 6 months, professional development budget, and flexible vacation time.



  • San Francisco, California, United States Delphina Full time

    About the PositionWe are seeking an experienced Large Scale Systems Architect to join our founding team at Delphina. As a key early hire, you will partner closely with our team on the direction of our product and drive critical technical decisions.Key ResponsibilitiesDevelop platforms that enable scientists, researchers, developers to run ML jobs easily and...


  • San Jose, California, United States Tik Tok Full time

    About This RoleAs a Large-Scale Recommendation Architect, you will play a crucial role in designing and implementing a storage solution for offline data in our recommendation system, serving over a billion users. Your primary objectives will be system reliability, uninterrupted service, and seamless performance. You will work with the team to create a...


  • San Jose, California, United States Tik Tok Full time

    About the RoleWe are looking for a highly skilled Senior Backend Software Engineer to join our User Growth Team. As a key member of the team, you will be responsible for designing and developing large-scale software systems that power TikTok's apps, leveraging data to inform product decisions and drive business outcomes.Your Key ResponsibilitiesDesign and...


  • San Francisco, California, United States Amazon Web Services, Inc. Full time

    Job Description: We are seeking an experienced Cloud Architect to design and implement large-scale In-Memory Database solutions on Amazon Web Services (AWS). The ideal candidate will have a strong background in distributed systems, NoSQL databases, and cloud computing.

  • Technical Lead

    7 days ago


    San Francisco, California, United States Salesforce Inc Full time

    Job OverviewSalesforce is seeking a skilled Technical Lead - Large Scale System Engineering to join our team. In this role, you will be responsible for designing, developing, and maintaining large-scale systems that are reliable, scalable, and secure.About SalesforceWe're Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM....


  • San Francisco, California, United States World Labs Full time

    World Labs is pioneering the development of Large World Models to enable AI systems that understand and interact with complex 3D environments. As a member of our team, you will design, implement, and optimize large-scale distributed data pipelines for AI model training. We're seeking candidates with strong engineering skills and expertise in data processing...


  • San Francisco, California, United States OpenAI Full time

    We are seeking an experienced Senior Software Engineer to lead our Data Acquisition team. The ideal candidate will have a strong background in large-scale distributed systems and data processing.The successful candidate will own and lead engineering projects in the area of data acquisition, including web crawling, data ingestion, and search. They will...


  • San Francisco, California, United States HMBL Full time

    Unleash Your Potential as a Technical LeaderHMBL is an innovative executive search and technical recruiting agency that takes a strategic approach to sourcing top talent. We leverage data and analysis to deliver exceptional results. If you're a driven individual looking to make a meaningful impact, we may have the perfect opportunity for you.We're seeking a...


  • San Francisco, California, United States Amazon Web Services, Inc. Full time

    Job Overview:Award-winning Amazon Web Services, Inc. seeks a seasoned Cloud Architect to join our team as an In-Memory Database Specialist Solutions Architect.About the Role:This is a unique opportunity to work with cutting-edge technology and be part of a global organization that pioneers cloud computing. As an In-Memory Database Specialist Solutions...


  • San Francisco, California, United States essential AI Full time

    At Essential AI, we are seeking a highly skilled and experienced AI Systems Architect to join our team. This role will be responsible for designing and implementing large-scale enterprise deployments of our AI-powered solutions.Company OverviewWe believe that building delightful end-user experiences requires innovating across the stack - from UX to models...


  • San Mateo, California, United States Verkada Full time

    Opportunity to Innovate and GrowAs a Backend Engineer at Verkada, you will have the opportunity to innovate and grow with our rapidly expanding company. You will work on large-scale systems, collaborate with cross-functional teams, and develop cutting-edge software products.Main Responsibilities:Build, test, and operate highly scalable, available, and...


  • San Diego, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Senior Big Data Engineer to join our team at Apple. As a key member of our Software Engineering group, you will be responsible for designing and developing large-scale data solutions that drive business growth and efficiency.The ideal candidate will have a strong background in architecting, designing, and...


  • San Jose, California, United States HireIO Inc Full time

    Your OpportunityHireIO Inc is seeking a seasoned Machine Learning Engineer Lead to drive the innovation and growth of our digital advertising products. As a key member of our team, you will lead the design and implementation of large-scale ad systems that power millions of transactions daily. We are looking for a talented individual who has a strong...


  • San Mateo, California, United States META Full time

    About MetaMeta is a technology company that builds products that help people connect, find communities and grow businesses. Our software engineering teams build the core technologies that power our products and enable users to access them from anywhere in the world.Salary RangeWe offer a competitive salary range of $173,000 - $250,000 per year, plus bonus...


  • San Francisco, California, United States HireIO Inc Full time

    We are looking for a highly skilled Large Scale Model Developer to join our team at HireIO Inc. In this role, you will be responsible for the research and application of large-scale models, exploring new applications and solutions for related technologies in the fields of search, recommendation, advertising, content creation, and customer service.The...


  • San Francisco, California, United States Rippling Full time

    About the JobWe are seeking a skilled Backend Engineer to join our team and contribute to the development of large-scale systems. As a key member of our HRIS team, you will be responsible for designing, building, and maintaining the core services used in many products. You will work closely with other teams across Rippling to support and scale our products...


  • San Jose, California, United States HireIO Inc Full time

    At HireIO Inc, we're looking for a Machine Learning Engineer - Digital Advertising to join our team. As part of our dynamic team, you'll play a key role in developing and optimizing large-scale ad systems.The successful candidate will have a strong background in computer science and experience with machine learning, natural language processing, and computer...


  • San Francisco, California, United States Early Warning® Full time

    About the RoleWe are seeking a seasoned Cloud Engineering Lead to join our team at Early Warning Services, LLC.This is a senior technical individual contributor position responsible for leading and mentoring large-scale engineering projects across the organization. As a trusted expert in cloud engineering, you will drive best practices, architectures, and...


  • San Francisco, California, United States HireIO Inc Full time

    Job OpportunityWe are looking for a talented Large Scale Model Developer to join our team in the USA/California/SF Bay Area/San Jose. This role comes with a salary of $160,000 per year.ResponsibilitiesDevelop and optimize large-scale language models to extreme levels.Construct data, tune instructions, align preferences, and optimize models.Implement relevant...


  • San Jose, California, United States HireIO Inc Full time

    Job DescriptionWe are seeking a Large-Scale System Development Engineer to join our team at HireIO Inc. As a Large-Scale System Development Engineer, you will design, develop, and deploy large-scale systems that meet the needs of our customers.The ideal candidate will have a strong background in computer science, with a minimum of 4 years of experience in...