DevOps Infrastructure Engineer

5 months ago


Cupertino, United States ETCHED LLC Full time
About Etched

Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely deep chain-of-thought reasoning.

DevOps Infrastructure Engineer

Designing and writing software for new ASICs is hard, and requires a huge amount of software and tooling. It is even more challenging for model-specific ASICs, as it is important for them to hit the market at the right time, and thus moving fast is essential.

You will drive adoption of cutting-edge tooling, to improve the speed and reliability of our toolchains. You will help us innovate to do better than the industry norm, by running massively parallel CI jobs, specifying and building our own fully-redundant SSD-only server infrastructure, and making sure these tools run automatically and reliability.

You will work with an IT contracting firm to do the day-to-day maintenance and installation - while you must be knowledgeable enough about IT to work with this firm, most of your time will be spent designing new toolchains entirely

The scope and title of this role can be modified for exceptional candidates.

Representative projects
• Spec out a server using a 6 GHz desktop CPU to speed up single-threaded workloads
• Decide if moving our servers to the cloud/a colo facility makes sense to improve uptime
• Set up networking infrastructure to allow Jupyter notebook users to connect to our

servers, without waiting for them to be restarted.
• Parallelize our CI stack to run on dozens of different machines at once, designing a

policy to avoid unnecessary CI failures if a machine goes down.

You may be a good fit if you
• Are highly technical
• Strong knowledge of Linux, containerization, CI/CD, and programming languages such

as Python/C++. You will be asked coding questions during your interview.
• Proven ability to lead technical teams and mentor junior members
• Have 4+ years of experience with either infrastructure engineering or software

development
• Experience debugging complex hardware and software issues with server infrastructure

Strong candidates may also have experience with
• In-depth understanding of workflows used in the semiconductor industry, especially those involving Synopsys and Cadence EDA tooling and Verilator
• Proficiency with cloud computing technology and experience working with a Big 3 Cloud
• Experience monitoring and installing datacenter hardware
• In-depth understanding of workflows used in the semiconductor industry,

We encourage you to apply even if you do not believe you meet every single qualification.

How we're different:

Etched believes in the Bitter Lesson. We think most of the progress in the AI field has come from using more FLOPs to train and run models, and the best way to get more FLOPs is to build model-specific hardware. Larger and larger training runs encourage companies to consolidate around fewer model architectures, which creates a market for single-model ASICs.

We are a fully in-person team in Cupertino, and greatly value engineering skills. We do not have boundaries between engineering and research, and we expect all of our technical staff to contribute to both as needed.

Benefits:
  • Full medical, dental, and vision packages, with 100% of premium covered, 90% for dependents
  • Housing subsidy of $2,000/month for those living within walking distance of the office
  • Daily lunch and dinner in our office
  • Relocation support for those moving to Cupertino


  • Cupertino, United States ETCHED LLC Full time

    About Etched Etched is building AI chips that are hard-coded for individual model architectures. Our first product (Sohu) only supports transformers, but has an order of magnitude more throughput and lower latency than a B200. With Etched ASICs, you can build products that would be impossible with GPUs, like real-time video generation models and extremely...


  • Cupertino, California, United States ETCHED LLC Full time

    About Etched">Etched LLC is a pioneer in building custom AI chips, aiming to create a market for single-model ASICs by encouraging companies to consolidate around fewer model architectures. Our first product, Sohu, only supports transformers but has a significantly higher throughput and lower latency than traditional GPUs.">As a DevOps Infrastructure...


  • Cupertino, California, United States ETCHED LLC Full time

    **Company Overview:** Etched LLC is a revolutionary company that's changing the game in AI chips. We're building model-specific hardware that's hard-coded for individual model architectures, making it possible to train and deploy AI models at unprecedented speeds. We're looking for a skilled DevOps Infrastructure Solutions Specialist to join our team! As a...


  • Cupertino, California, United States ETCHED LLC Full time

    About Etched">E-topped, a company that creates advanced chips for AI models, is looking for an experienced DevOps Infrastructure Engineer. With Etched's cutting-edge technology, our products can process data at an order of magnitude faster than traditional GPUs, enabling the creation of innovative products like real-time video generation models and deep...


  • Cupertino, California, United States ETCHED LLC Full time

    About UsEtched LLC is a leading developer of AI chips that are hard-coded for individual model architectures. Our mission is to provide innovative solutions for the AI community by pushing the boundaries of what is possible with traditional hardware.Job TitleWe are seeking a highly skilled DevOps Infrastructure Engineer to join our team. The ideal candidate...

  • DevOps Engineer

    5 days ago


    Cupertino, United States MindSource Full time

    Mindsource is seeking a Devops Engineer for one of our Direct Clients . If interested, please drop your resume to akhil@mindsource.comTitle: Devops Engineer Location: Cupertino, CA or Austin, TX (Locals Only)Long Term Contract We’re looking for someone with hands-on data platform experience (Spark, Iceberg, Hive) in the following technologies:AWS services:...

  • DevOps Engineer

    3 weeks ago


    Cupertino, United States MindSource Full time

    Mindsource is seeking a Devops Engineer for one of our Direct Clients . If interested, please drop your resume to akhil@mindsource.comTitle: Devops Engineer Location: Cupertino, CA or Austin, TX (Locals Only)Long Term Contract We’re looking for someone with hands-on data platform experience (Spark, Iceberg, Hive) in the following technologies:AWS services:...


  • Cupertino, California, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Cloud Infrastructure Architect to lead the design and implementation of new tools, pipelines, and automation in our AWS organization. The ideal candidate will have experience with cloud computing, DevOps practices, and leading engineering teams.The Role:Design and implement new tools and pipelines to automate...


  • cupertino, United States MindSource Full time

    Mindsource is seeking a Devops Engineer for one of our Direct Clients . If interested, please drop your resume to akhil@mindsource.comTitle: Devops Engineer Location: Cupertino, CA or Austin, TX (Locals Only)Long Term Contract We’re looking for someone with hands-on data platform experience (Spark, Iceberg, Hive) in the following technologies:AWS services:...


  • Cupertino, California, United States Apple Full time

    Job Summary">We're seeking an experienced silicon validation software engineer to join our Cupertino-based team. The successful candidate will have a strong background in software development and a passion for automation and devops. Your primary responsibility will be to design and implement automated testing tools and infrastructure that serve hundreds of...


  • Cupertino, California, United States ETCHED LLC Full time

    About Etched">At Etched LLC, we're revolutionizing AI computing by building custom chips that outperform traditional GPUs. Our goal is to empower innovators with the tools they need to push the boundaries of what's possible.">As a DevOps Infrastructure Engineer, you'll play a critical role in designing and implementing software for new ASICs, driving...


  • Cupertino, California, United States ETCHED LLC Full time

    **Job Description:** We're seeking a highly skilled Cloud Native DevOps Architect to join our team! As a key member of our infrastructure engineering group, you will design and implement cutting-edge tooling to improve the speed and reliability of our toolchains. The ideal candidate will have 4+ years of experience in infrastructure engineering or software...


  • Cupertino, California, United States Apple Full time

    Company Overview">Cupertino, California is home to Apple's silicon validation team. This team plays a critical role in ensuring the quality and reliability of Apple's hardware products. As a member of this team, you will have the opportunity to work on cutting-edge technology and collaborate with talented engineers from around the world.Salary">The base pay...


  • Cupertino, California, United States ETCHED LLC Full time

    About the RoleThis is an exciting opportunity to join Etched LLC as a DevOps Infrastructure Engineer. The successful candidate will have a strong background in Linux, containerization, CI/CD, and programming languages such as Python/C++. They will also have proven ability to lead technical teams and mentor junior members, as well as 4+ years of experience...


  • Cupertino, California, United States Apple Full time

    As a key member of the Apple Services Engineering team, we are seeking an exceptional Engineering Program Manager (EPM) to lead our efforts in building and running services that millions of customers use every day.About the RoleWe are looking for a dynamic EPM to partner with teams across Apple to deliver innovative product solutions that shape the future of...

  • DevOps Engineer

    1 month ago


    Cupertino, United States Diverse Lynx Full time

    10 Years of experience required (Must) Experience in at-least one Container Orchestration Tool - Kubernetes / Docker Swarm / Nomad-Consul Experience in Devops Tools like Jenkins and Docker. Experience with Scripting. Should have ability to automate repeated tasks. (Python preferred) Experience with Server-Side technologies such as Nginx, Haproxy, Apache....


  • Cupertino, California, United States Apple Full time

    Apple is a leader in delivering innovative technology solutions. As a Senior Site Reliability Engineer, you'll play a critical role in ensuring the reliability and scalability of our cloud services.In this role, you'll be responsible for designing and implementing innovative solutions to accelerate our ability to deliver thousands of applications reliably...


  • Cupertino, California, United States CEREBRAS SYSTEMS INC. Full time

    Job Title: AI Infrastructure Network EngineerCerebras Systems is seeking an experienced AI Infrastructure Network Engineer to join our team.Key Responsibilities:Manage and optimize end-to-end network performance of complex AI infrastructure, including servers and switches.Evaluate and recommend servers, switches, and routers for next-generation...


  • Cupertino, California, United States Apple Full time

    About Us">At Apple, we're passionate about creating innovative products that delight our customers. Our silicon validation team is responsible for ensuring the quality and reliability of our hardware products. We're looking for a skilled silicon validation software engineer to join our team and help us deliver exceptional results.Salary Range">The estimated...

  • Software Engineer III

    3 weeks ago


    Cupertino, California, United States Amazon Full time

    We are looking for a seasoned DevOps Engineer to join our team at Amazon. As a key contributor, you will be responsible for building and maintaining scalable infrastructure solutions to support the development and deployment of machine learning models on our cloud platform.**Key Responsibilities:**- Design and implement efficient data processing pipelines...