Software Development Engineer, Annapurna Labs, Machine Learning Fleet Operations

5 days ago


Austin, Texas, United States Amazon Full time
About the Role

We are seeking a highly skilled Software Development Engineer to join our Machine Learning Fleet Operations team at Annapurna Labs, a part of AWS Utility Computing. As a member of this team, you will play a critical role in supporting the development and management of Compute, Database, Storage, Platform, and Productivity Apps services in AWS.

Our team is responsible for maintaining an exceptionally high quality bar for our fleet of advanced machine learning server products. We perfect the customer experience by developing scalable software for rapid incident response times and data visualization as well as diving deep into hardware issues as they arise.

Key Responsibilities
  • Member of a team responsible for system remediation, operational excellence, and customer experience on bleeding edge ML products
  • Utilize data to root cause hardware failures and identify live trends on the most complex systems in AWS
  • Implement and improve system level testing across the product lifecycle
  • Develop software which can be maintained, improved upon, documented, tested, and reused
  • Dive deep on issues at the intersection of hardware and software
About the Team

Our team is dedicated to supporting new members. We have a broad mix of experience levels and tenures, and we're building an environment that celebrates knowledge-sharing and mentorship.

Our senior members enjoy one-on-one mentoring and thorough, but kind, code reviews. We care about your career growth and strive to assign projects that help our team members develop your engineering expertise so you feel empowered to take on more complex tasks in the future.

What We Offer

We value work-life harmony and strive for flexibility as part of our working culture. Achieving success at work should never come at the expense of sacrifices at home.

We're continuously raising our performance bar as we strive to become Earth's Best Employer. That's why you'll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.

Basic Qualifications
  • 2+ years of non-internship professional software development experience
  • 1+ years of designing or architecting (design patterns, reliability and scaling) of new and existing systems experience
  • 1+ years of administrative experience in networking, storage systems, operating systems and hands-on systems engineering experience
  • Knowledge of systems engineering fundamentals (networking, storage, operating systems)
  • Experience programming with at least one modern language such as C++, C#, Java, Python, Golang, PowerShell, Ruby
  • Experience with Linux/Unix
Preferred Qualifications
  • Experience building services using AWS products


  • Austin, Texas, United States Annapurna Labs (U.S.) Inc. Full time

    About Annapurna Labs (U.S.) Inc.Annapurna Labs is a leading innovator in hardware/software co-design, pushing the boundaries of technology not only in Amazon Web Services (AWS) but across the industry.Job SummaryWe are seeking a skilled Software Engineer to join our Release and Automation Software Team. As a key member of our team, you will design and build...


  • Austin, Texas, United States Annapurna Labs (U.S.) Inc. Full time

    About the RoleWe are seeking a skilled Software Engineer to join our Release and Automation team at Annapurna Labs (U.S.) Inc. As a key member of our team, you will be responsible for designing and building services and automations to improve the releases and operations of our Machine Learning servers.Key ResponsibilitiesUnderstand the Machine Learning...

  • asic power engineer

    4 days ago


    Austin, Texas, United States Annapurna Labs Full time

    About Annapurna LabsAnnapurna Labs is a leading innovator in cloud-scale machine learning acceleration. Our team designs and optimizes hardware in our data centers, including AWS Inferentia, our custom-designed machine learning inference datacenter server.Job SummaryWe are seeking an experienced ASIC Design Engineer to join our Cloud-Scale Machine Learning...


  • Austin, Texas, United States Annapurna Labs (U.S.) Inc. Full time

    Annapurna Labs (U.S.) Inc. is a leader in providing a robust, scalable, and cost-effective cloud infrastructure that supports numerous enterprises globally. We are at the forefront of machine learning and artificial intelligence services, catering to the diverse needs of our clients. Our team is currently on the lookout for seasoned Hardware Test Engineers,...


  • Austin, Texas, United States Annapurna Labs (U.S.) Inc. Full time

    About the RoleWe are seeking an experienced Design Verification Engineer to join our Cloud-Scale Machine Learning Acceleration team at Annapurna Labs (U.S.) Inc. As a member of this team, you will be responsible for the design and validation of machine learning hardware in our data centers.Key ResponsibilitiesVerify and validate that our hardware and...


  • Austin, Texas, United States Annapurna Labs (U.S.) Inc. Full time

    Design Verification SpecialistWe are seeking a highly skilled Design Verification Specialist to join our team at Annapurna Labs (U.S.) Inc. in the field of cloud server platforms.As a Design Verification Specialist, you will play a key role in the validation of machine learning hardware in our data centers. Your responsibilities will include verifying and...


  • Austin, Texas, United States Annapurna Labs (U.S.) Inc. Full time

    About the RoleWe are seeking an experienced Design Verification Engineer to join our team at Annapurna Labs (U.S.) Inc. as a key member of the Cloud-Scale Machine Learning Acceleration team.Key ResponsibilitiesVerify and validate that our hardware and software solutions achieve their desired functionality.Develop and execute multi-faceted verification and...


  • Austin, Texas, United States Triunity Software Full time

    MLOps Engineer Job DescriptionWe are seeking an experienced MLOps Engineer to join our team at Triunity Software. As a key member of our team, you will be responsible for designing and implementing large-scale data pipelines and engineering infrastructure to support our clients' enterprise machine learning systems.Key Responsibilities:Design and create data...


  • Austin, Texas, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Serdes/PCIE Phy Expert to join our team at Annapurna Labs, a leading provider of cloud computing solutions. As a Serdes/PCIE Phy Expert, you will be responsible for designing and implementing innovative next-generation machine learning chips and servers.Key ResponsibilitiesDesign and implement Serdes/PCIE PHY and...

  • Serdes Phy Expert

    3 days ago


    Austin, Texas, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Serdes/PCIE Phy Expert to join our team at Annapurna Labs, a leading provider of cloud computing solutions. As a Serdes/PCIE Phy Expert, you will be responsible for designing and implementing innovative next-generation machine learning chips and servers.Key ResponsibilitiesDesign and implement Serdes/PCIE PHY and...


  • Austin, Texas, United States eBay Inc. Full time

    About the RoleeBay Inc. is a global leader in ecommerce, and we're committed to pushing boundaries and leaving our mark as we reinvent the future of ecommerce for enthusiasts.We're seeking a motivated Software Engineer with a strong background in software development and hands-on experience in machine learning to join our Global Payments and Risk team.Key...


  • Austin, Texas, United States eBay Inc. Full time

    About the RoleeBay Inc. is a global leader in ecommerce, and we're committed to pushing boundaries and leaving our mark as we reinvent the future of ecommerce for enthusiasts.We're seeking a motivated Software Engineer with a strong background in software development and hands-on experience in machine learning to join our Global Payments and Risk team.Key...


  • Austin, Texas, United States HP Development Company, L.P. Full time

    About UsHP Development Company, L.P. is a leading technology company that has been driving innovation since its inception in 1939. With a portfolio that spans printing, personal computing, software, and services, we serve over 1 billion customers in over 170 countries. Our commitment to fostering a diverse and inclusive workplace has enabled us to attract...


  • Austin, Texas, United States HP Development Company, L.P. Full time

    About the RoleWe are seeking a highly skilled Machine Learning Engineer to join our dynamic full-stack team at HP Development Company, L.P. As a key member of our team, you will be responsible for designing, training, and integrating advanced machine learning capabilities to create impactful solutions deployable across diverse platforms.Key...


  • Austin, Texas, United States Diversity Talent Scouts- Executive Search Firm Full time

    **Job Opportunity**We are seeking a highly skilled Machine Learning Engineer to contribute to the advancement of AI/ML integration and adoption across our organization. This role plays a pivotal part in driving innovation and excellence in machine learning, software engineering, and data science.The ideal candidate will possess a strong foundation in machine...


  • Austin, Texas, United States Purple Drive Full time

    About the RolePurple Drive is seeking a highly skilled and experienced Principal Software Engineer to join our team. As a key member of our engineering team, you will be responsible for designing, developing, and deploying robust and scalable AI solutions using Python and.NET.Key ResponsibilitiesDevelop and maintain large-scale software systems using Python...


  • Austin, Texas, United States Procore Technologies Full time

    Job Title: Director, Machine LearningWe are seeking a seasoned leader to spearhead our Machine Learning Engineering and Applied Science team at Procore Technologies. As a Director, Machine Learning, you will be responsible for building and scaling a high-performing team of ML engineers and scientists who design, develop, and deploy impactful machine learning...


  • Austin, Texas, United States Amazon Full time

    About the RoleWe are seeking a highly skilled Systems Development Engineer to join our Annapurna Labs Infrastructure team in Austin, Texas. As a key member of our team, you will be responsible for designing and supporting enterprise-scale infrastructure for our Machine Learning Acceleration product family.Key ResponsibilitiesDesign and implement...


  • Austin, Texas, United States Xometry Full time

    Job Title: Senior Manager, Machine Learning EngineeringXometry is seeking a highly skilled Senior Manager, Machine Learning Engineering to lead our machine learning engineering team. As a key member of our engineering organization, you will be responsible for designing, developing, and deploying machine learning models and systems that drive business...


  • Austin, Texas, United States Visa Full time

    About the RoleWe are seeking a highly skilled Senior Machine Learning Engineer to join our team at Visa. As a key member of our Data and AI Platform technology organization, you will play a critical role in developing and implementing advanced machine learning solutions that drive strategic growth and operational efficiency.Key ResponsibilitiesDesign,...