Cloud Inference Optimization Specialist

3 weeks ago


Seattle, Washington, United States Amazon Full time

About the Role

Amazon is seeking a highly skilled Cloud Inference Optimization Specialist to join our Generative AI science team in AWS Bedrock. As part of this team, you will have the opportunity to impact millions of our customers by researching and building innovative algorithms that optimize the inference engine for foundation models.

About the Team

AWS Utility Computing (UC) provides product innovations that set AWS services and features apart in the industry. Our team works alongside a supportive and collaborative mix of scientists and engineers to research and develop state-of-the-art technology for inference optimization. We are committed to making cloud computing easier and more accessible to businesses worldwide.

Your Responsibilities

  • Develop large-scale machine learning systems, applying expertise in profiling, debugging, system performance, and scalability.
  • Own problems end-to-end, innovate to bridge gaps, and be willing to learn what you do not yet know.
  • Prioritize projects based on business needs and technical challenges.

BASIC QUALIFICATIONS

  • PhD in CS, ML, or a related field, or a Master's degree with equivalent years of experience.
  • Experience programming in Java, C++, Python, or a related language.
  • Patents or publications at top-tier peer-reviewed conferences or journals.

PREFERRED QUALIFICATIONS

  • Deep understanding of GPU architectures and experience optimizing for different hardware platforms and systems.
  • Experience training and fine-tuning Large Language Models (LLMs), or experience with various inference engines.

About Amazon

At Amazon, we value diverse experiences. If your career is just starting, hasn't followed a traditional path, or includes alternative experiences, don't let it stop you from applying. We strive to create an inclusive workplace where everyone feels valued and supported.

Compensation Package: The estimated salary for this role is $250,000 - $350,000 per year, including bonus and benefits. Actual compensation may vary based on factors such as location, experience, and performance.



  • Seattle, Washington, United States Salesforce, Inc. Full time

    At Salesforce, we believe that the future of business is powered by innovation and technology. As a Cloud Optimization Specialist - Senior FinOps Expert, you will play a critical role in driving our company's transformation journey in the public cloud. About the Role: We are seeking an experienced and highly skilled professional to join our team as a Cloud...


  • Seattle, Washington, United States Amazon Full time

    Join Our Team: We are excited to announce that Amazon is seeking highly skilled Inference Applications Software Engineers to join our team working on AWS Neuron. This role involves developing and optimizing distributed inference solutions using Python, PyTorch or JAX, and collaborating with cross-functional teams to drive business decisions. Responsibilities: ...


  • Seattle, Washington, United States Amazon Full time

    Company Overview: Amazon, a global leader in e-commerce and technology, is revolutionizing the way people shop and live. As a Senior Applied Scientist, Campaign Measurement, you will be part of the Campaign Measurement & Optimization (CMO) organization, responsible for developing cutting-edge measurement and optimization models to drive marketing investment...


  • Seattle, Washington, United States InterSources Full time

    Job Description: As an AWS and Azure Optimization Specialist at InterSources Inc., you will be responsible for managing and optimizing cloud infrastructure on AWS and Azure platforms. You will analyze cloud usage and spending patterns to identify opportunities for cost optimization and develop strategies to reduce cloud costs without compromising performance...


  • Seattle, Washington, United States Amazon Web Services, Inc. Full time

    Job Summary: $250,000 - $350,000 per year. As the Head of Customer Optimization & Acceleration at Amazon Web Services (AWS), you will lead a team of experts responsible for simplifying the customer experience by addressing complex operational issues as customers adopt cloud technologies. Your primary goal will be to drive cost savings and maximize the value of...


  • Seattle, Washington, United States TikTok Full time

    Media Platform Team Overview: The Media Platform team is a critical component of TikTok's global infrastructure, responsible for optimizing app experience related to performance for our users. We collaborate with all teams in the video creation and consumption ecosystem to provide end-to-end optimization solutions. Job Summary: We're seeking a highly experienced...


  • Seattle, Washington, United States ZipRecruiter Full time

    Unlock AI Innovation as a Cloud Solutions Architect: We are seeking a highly skilled Cloud Solutions Architect to join our dynamic team. In this role, you will play a crucial part in shaping the adoption of AI technology and working with cutting-edge cloud and AI infrastructure. The ideal candidate will have at least 5 years of experience in a similar technical...


  • Seattle, Washington, United States CGL Consulting Co., Ltd Full time

    Job Summary: CGL Consulting Co., Ltd seeks a skilled Cloud Infrastructure Specialist to manage and operate cloud infrastructure across AWS or Azure and Kubernetes environments, ensuring optimal performance and resource allocation. The ideal candidate will have hands-on experience with AWS or Azure and Kubernetes, strong proficiency in Linux, networking, and...


  • Seattle, Washington, United States Apple Full time

    Job Overview: Apple is seeking a Cloud Networking Specialist to join our team. This role involves designing and developing cutting-edge traffic proxies that power our services at an unprecedented scale, serving hundreds of millions of users globally. About the Team: We are a dynamic team of engineers working on cutting-edge projects to enhance our network stack...


  • Seattle, Washington, United States Amazon Full time

    About the Job: We are seeking a skilled Senior Software Developer to join our team working on distributed inference solutions for AWS Neuron. This role involves developing and optimizing code for high-performance applications using Python, PyTorch or JAX, and collaborating with cross-functional teams to drive business decisions. Responsibilities: Develop and...


  • Seattle, Washington, United States Amazon Full time

    About the Role: We are seeking a highly skilled Senior Robotics Optimization Engineer to join our team at Amazon's Frontier AI & Robotics department. As a key member of our science team, you will play a vital role in transforming cutting-edge research into high-performance production systems. Your primary focus will be on optimizing large-scale transformer...


  • Seattle, Washington, United States Lavendo Full time

    About Lavendo: Lavendo is a publicly traded tech company at the forefront of the AI revolution, building full-stack infrastructure for the global AI industry. With a team of over 500 skilled engineers, they are creating an ecosystem where the next generation of AI breakthroughs can flourish. Their AI-centric cloud platform offers a true hyperscale experience...


  • Seattle, Washington, United States F5 Networks Full time

    Job Title: Cloud Platform Operations Specialist. About the Role: We are seeking a skilled Cloud Platform Operations Specialist to join our team at F5 Networks. As a key member of our operations team, you will be responsible for ensuring the optimal performance, availability, and security of our critical platforms. Responsibilities: Manage user access and monitor...

  • AWS Cloud Engineer

    3 weeks ago


    Seattle, Washington, United States Amazon Full time

    Company Overview: AWS provides the most comprehensive and widely used cloud platform, empowering businesses to innovate and grow. With over 750 instances available, customers can choose from the latest processors, storage, networking, operating systems, and purchase models to meet their workload needs. We are the first major cloud provider to support Intel,...

  • AWS Cloud Engineer

    2 weeks ago


    Seattle, Washington, United States Amazon Full time

    About Amazon Elastic Compute Cloud (Amazon EC2): With over 750 instances to choose from, Amazon EC2 offers the broadest and deepest compute platform. We support Intel, AMD, and Arm processors, providing on-demand EC2 Mac instances and 400 Gbps Ethernet networking. Our price performance for machine learning training is unbeatable, and we offer the lowest cost...


  • Seattle, Washington, United States Softpath System Full time

    Job Title: Cloud Infrastructure Specialist. Job Summary: We are seeking a skilled Cloud Infrastructure Specialist to join our team at Softpath System. The ideal candidate will have expertise in Azure, AKS, and Terraform, with a strong understanding of data and infrastructure as code. Key Responsibilities: Design, deploy, and manage infrastructure solutions using...


  • Seattle, Washington, United States F5 Networks Full time

    Job Title: Enterprise Cloud Platform Operations Specialist. About F5 Networks: We strive to bring a better digital world to life at F5 Networks. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. Job Summary: The Enterprise Cloud Platform Operations Specialist is...


  • Seattle, Washington, United States Amazon Full time

    About Amazon: Amazon Elastic Compute Cloud (Amazon EC2) offers the broadest and deepest compute platform, with over 750 instances and choice of the latest processor, storage, networking, operating system, and purchase model to help you best match the needs of your workload. We are the first major cloud provider that supports Intel, AMD, and Arm processors, the...


  • Seattle, Washington, United States Apple Full time

    At Apple, we're seeking an exceptional Senior Machine Learning Optimization Engineer to join our team in Seattle, Washington. This highly collaborative role involves leading efforts in identifying bottlenecks and optimizing our model inference stack. You will work closely with diverse teams to diagnose performance issues and develop innovative solutions that...


  • Seattle, Washington, United States Apple Full time

    About the Role: We are seeking a Machine Learning Optimization Engineer to join our MIND team at Apple. This is an exciting opportunity to work on innovative projects that involve HW/SW co-design for efficient inference. Key Responsibilities: You will research and develop ML optimization techniques for efficient on-device ML, materialize ideas/concepts, and...