Systems Development Engineer III, Annapurna Labs Infrastructure

4 weeks ago


Round Rock TX, United States Annapurna Labs (U.S.) Inc. Full time
Job Description

Annapurna Labs, our organization within AWS, is responsible for building innovation in silicon and software for AWS customers. With development centers in the U.S. and Israel, Annapurna is at the forefront of innovation by combining cloud scale with the world’s most talented engineers. Our team covers multiple disciplines including silicon engineering, hardware design and verification, software, and operations. Because of our teams’ breadth of talent, we’ve been able to improve AWS cloud infrastructure in networking and security with products such as AWS Nitro, Enhanced Network Adapter (ENA), and Elastic Fabric Adapter (EFA), in compute with AWS Graviton and F1 EC2 Instances, in machine learning with AWS Neuron, Inferentia and Trainium ML Accelerators, and in storage with scalable NVMe.
As part of Annapurna Labs team, you’ll have the opportunity to invent the next generation of cloud computing infrastructure. You’ll experience what it’s like to work in a fast-paced, innovative, and start-up like environment filled with some of the brightest minds in the industry. The work we do is not only cutting-edge and internet-scale but also deeply important to our customers. We design and build every component of our hardware and software to come together into products that our customers use for accelerated computing: either Machine Learning acceleration, or FPGA acceleration. We get our hands dirty, from creating our own silicon, ensuring our hardware is functional and healthy, and managing the full lifecycle of our systems at the huge scale and complexity of AWS.

If you want a career that makes an impact, allows you to invent, and have first-hand visibility into how your implementations delight customers, then we have a role for you.
If you're interested in being on a team that is "building a complete product" from inception to delighted customers, Annapurna is a fantastic choice.
Join us in creating the most advanced Machine Learning Accelerators in the world

Key job responsibilities
As a technical leader of the Cloud-Scale Machine Learning Acceleration Infrastructure team you’ll be responsible for architecting and leading development of the infrastructure used by our engineering teams. Our customers, the engineering teams, building hardware/software running in our data centers which are custom designed machine learning products: AWS Inferentia2 and Trainium.
You will need to lead across teams to develop and execute in-depth infrastructure development plans that enables the engineering development of the Machine Learning Acceleration product family. You will dive deep to solve critical infrastructure issues involving networking, high performance compute clusters, infrastructure automation of hardware/software/firmware testing, and ASIC/EDA development. You will execute and scale the next generation of cloud infrastructure based on cloud frameworks and AWS services. You will own design reviews for infrastructure development and partner with AWS service teams and vendors. You will influence within your team, your customers and AWS service teams to help drive and develop the technical implementation for overall system designs. You will identify and implement process improvements which improve your team’s agility and operations, including improvements to design, automation, development, test or operations. You will define new mechanisms that execute system health monitoring, diagnostics, repair, and automation. You will develop, document and update operational runbooks as you participate in on-call rotations.

A day in the life
Each day you will work with the best engineers in the industry to develop Machine Learning Accelerators. On-site in Austin, Texas, you will be apart of the team that develops custom silicon and you will own the infrastructure that enables this innovation. Take a look inside our labs to see what you will learn at Annapurna Labs:

We are open to hiring candidates to work out of one of the following locations:

Austin, TX, USA
BASIC QUALIFICATIONS- 5+ years of programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby experience
- 3+ years of non-internship professional software development experience
- 5+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience
- 5+ years of deploying and operating in a Linux/Unix environment experience
- 3+ years of systems design, software development, operations, automation, and process improvement experience
- Experience leading the design, build and deployment of complex and performant (reliable and scalable) software solutions in production
- 3+ years of systems development in an IT or data center environment experience
- Experience with debugging complex issues with HW/SW, networking and storage systems
- Experience with operations of large scale infrastructure deployments including improving operational excellence
PREFERRED QUALIFICATIONS- Knowledge of engineering practices and patterns for the full software/hardware/networks development life cycle, including coding standards, code reviews, source control management, build processes, testing, certification, and livesite operations
- Experience taking a leading role in building complex software or computing infrastructure that has been successfully delivered to customers
- Experience writing technical documents, project plans and progress reports to leadership and to stakeholders
- Experience with AWS Cloud Infrastructure deployments using CDK
- Experience with IT security software/tools/standards

Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit
  • IT Engineer

    1 week ago


    Round Rock, United States K2 Staffing Full time

    Job Description Job Description Summary Our client is a leading IT Solutions Company located in Round Rock, TX and they are in need of a Level III IT Engineer . A qualified candidate would have both proven experience with technology and outstanding personal communication skills. You should enjoy building solutions that leverage technology to meet a...

  • IT Engineer

    2 weeks ago


    Round Rock, United States K2 Staffing, LLC Full time

    Summary Our client is a leading IT Solutions Company located in Round Rock, TX and they are in need of a Level III IT Engineer . A qualified candidate would have both proven experience with technology and outstanding personal communication skills. You should enjoy building solutions that leverage technology to meet a client’s business needs. Duties &...

  • IT Engineer

    4 weeks ago


    Round Rock, United States K2 Staffing, LLC Full time

    Job DescriptionJob DescriptionSummaryOur client is a leading IT Solutions Company located in Round Rock, TX and they are in need of a Level III IT Engineer. A qualified candidate would have both proven experience with technology and outstanding personal communication skills. You should enjoy building solutions that leverage technology to meet a client’s...

  • IT Engineer

    3 weeks ago


    Round Rock, United States K2 Staffing Full time

    Job DescriptionJob DescriptionSummary  Our client is a leading IT Solutions Company located in Round Rock, TX and they are in need of a Level III IT Engineer. A qualified candidate would have both proven experience with technology and outstanding personal communication skills. You should enjoy building solutions that leverage technology to meet a...


  • Round Rock, United States Kratos Unmanned Systems Full time

    5-D Systems, a KRATOS Company, is seeking a highly motivated candidate who will provide model-based systems engineering technical contribution and leadership. This position is full-time and requires the ability to successfully work in a team environment. The focus is supported to unmanned, manned, and optionally piloted aircraft platforms. The successful...


  • Round Rock, United States Kratos Unmanned Systems Full time

    5-D Systems, a KRATOS Company, is seeking a highly motivated candidate who will provide model-based systems engineering technical contribution and leadership. This position is full-time and requires the ability to successfully work in a team environment. The focus is supported to unmanned, manned, and optionally piloted aircraft platforms. The successful...

  • Systems Engineer

    3 weeks ago


    Round Rock, United States Kratos Defense Full time

    Bring your talents to the 5-D Systems Engineering Team for Summer 2024! 5-D Systems, Inc., a KRATOS Company, has exciting Systems Engineering Internship opportunities for current undergraduate (Software and Junior Senior level) and graduate students. As a Systems Engineering Intern, you will work with a team of talented and experienced defense industry...


  • Little Rock, United States Misoenergy Full time

    If you are unable to complete this application due to a disability, contact this employer to ask for an accommodation or an alternative application process. Power System Operations Engineer III/Senior Active - Full Time Employee -SE Little Rock, AR, US 2 days ago Requisition ID: 2387 MISO offers a comprehensive benefits package available on your first day of...


  • Little Rock, United States Midcontinent Independent System Operator Full time

    MISO offers a comprehensive benefits package available on your first day of employment. Position Location: Little Rock, AR Are you an electrical engineer looking to make an impact in a control room that oversees the reliable operations of an electric grid serving 45 million people? Do you have interest in working in a fast-paced environment where you will be...


  • Round Rock, United States ARM Full time

    Job DescriptionJob Overview:Arm is dedicated to empowering the success of our partners through significant investments. This commitment extends to hands-on collaboration to optimize their codebases, enhancing performance on ARM architecture. As Arm's market share expands rapidly, our partners leverage our expertise and strengths to deliver unparalleled...


  • Little Rock, United States Midcontinent Independent System Operator Full time

    MISO offers a comprehensive benefits package available on your first day of employment. Position Location: Little Rock, AR Are you an electrical engineer looking to make an impact in a control room that oversees the reliable operations of an electric grid serving 45 million people? Do you have interest in working in a fast-paced environment where you will...


  • Little Rock, United States MIDCONTINENT INDEPENDENT SYSTEM OPERATOR INC Full time

    Job DescriptionJob DescriptionMISO offers a comprehensive benefits package available on your first day of employment.Position Location: Little Rock, ARAre you an electrical engineer looking to make an impact in a control room that oversees the reliable operations of an electric grid serving 45 million people? Do you have interest in working in a fast-paced...


  • Rock Hill, United States 3D Systems Full time

    *WHO WE ARE: * *More than 30 years ago, 3D Systems launched the 3D printing industry and has been leading additive manufacturing innovation ever since. Today, our diverse, global workforce brings innovation, performance, and reliability to every interaction - empowering our customers to create physical products at a digital pace. 3D Systems solutions address...

  • Software Engineer III

    2 weeks ago


    Round Rock, United States Toppan Photomasks Full time

    Join Our Team: Toppan Photomasks, Inc. is looking for a qualified Software Engineer to join our team and play a key role in shaping the future of our software solutions for one of the largest semiconductor tooling providers in the world. Located in Round Rock, Texas, Toppan Photomasks focuses on providing a great place to work for its employees and a great...


  • Round Rock, TX, United States Dell Full time

    Ensures that Dell’s Cloud Service delivers the service performance, reliability, and availability expected by our customers and internal client groups. SRE teams are continuously improving the management automation of our Cloud Service, with the objective of enabling industrialized fleet management at scale. Join us to do the best work of your career...


  • Round Rock, United States Kratos Defense Full time

    5-D Systems, a KRATOS Company, is seeking a highly motivated candidate who will provide model-based systems engineering technical contribution and leadership.This position is full-time and requires the ability to successfully work in a team environment. The focus is supported to unmanned, manned, and optionally piloted aircraft platforms. The successful...


  • Round Rock, United States Kratos Defense Full time

    5-D Systems, a KRATOS Company, is seeking a highly motivated candidate who will provide model-based systems engineering technical contribution and leadership. This position is full-time and requires the ability to successfully work in a team environment. The focus is supported to unmanned, manned, and optionally piloted aircraft platforms. The successful...


  • Fort Worth, TX, United States Softworld Inc Full time

    ***Due to the nature of the work being performed US Citizenship is required*** Job Title: Cloud Infrastructure Engineer Job Location: Fort Worth TX 76101 Onsite Requirements: Experience with Azure Cloud Infrastructure Engineering. Perform Risk, Issue and Opportunity (RIO) development and tracking with Digital Enterprise SQL database experience,...


  • Round Rock, United States ShiftCode Analytics Full time

    Intevriew : Video Visa : All apart from H1b, CPT and TN Hybrid in Hopkinton, MA or Round Rock, TX from Day-1. Will be onsite 2-3 days a week at either location. Dell will heavily favor consultants who already live near Hopkinton or Round Rock. Afterward they will look at consultants who need to relocate according to their current distance to either city....


  • Sunnyvale, TX, United States Google Full time

    Minimum qualifications:Bachelor's degree or equivalent practical experience. 5 years of experience with software development in one or more programming languages (e.g., Python, C, C++, Java, JavaScript). 5 years of experience in a technical leadership role; overseeing projects, with 5 years of experience in a people management, supervision/team leadership...