LLMOps Engineer

1 month ago


Westerville, Ohio, United States Vertiv Full time
Job Summary

As a Cloud/Gen AI LLMOps Engineer, you will play a pivotal role in designing and maintaining the infrastructure and pipelines for cutting-edge Large Language Models (LLMs), collaborating closely with Generative AI Architects. Your expertise in automating and streamlining the LLM lifecycle will be instrumental in ensuring the efficiency, scalability, and reliability of Generative AI models in production.

Key Responsibilities
  • Develop and execute Machine Learning (ML)/LLM pipelines specifically for Large Language Models, encompassing data acquisition, pre-processing, model training/tuning, deployment, and monitoring.
  • Utilize automation tools such as GitOps, CI/CD pipelines, and containerization technologies (Docker, Kubernetes) to streamline ML/LLM tasks across the Large Language Model lifecycle.
  • Establish robust monitoring and alerting systems to track Large Language Model performance, data drift, and other key metrics, proactively identifying and resolving issues.
  • Perform truth analysis to assess the accuracy and effectiveness of Large Language Model outputs, comparing them to known, accurate data.
  • Collaborate closely with infrastructure, DevOps teams, and Generative AI Architects to optimize model performance and resource utilization.
  • Oversee and maintain cloud infrastructure (AWS, Azure) specifically for Large Language Model workloads, ensuring cost-efficiency and scalability.
  • Stay current with the latest advancements in ML/LLM Ops, integrating these developments into generative AI platforms and processes.
  • Communicate effectively with both technical and non-technical stakeholders, providing updates on the performance and status of Large Language Models.
Requirements
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
  • At least 5 years of experience as an ML engineer within public cloud platforms.
  • Strong programming skills in Python and/or other languages.
  • Expertise in cloud platforms (AWS, Azure) for ML workloads, MLOps, DevOps, or Data Engineering.
  • Proven experience in MLOps, LLMOps, or related roles, with hands-on experience deploying and managing machine learning and large language model pipelines.
  • Familiarity with generative AI applications and domains such as content creation, data augmentation, style transfer.
  • Strong knowledge of Generative AI architectures and methods, including chunking, vectorization, context-based retrieval and search, working with Large Language Models such as Open AI GPT, Llama2, Llama3, Mistral, etc.
About Vertiv

Vertiv is a global critical infrastructure and data center technology company, ensuring customers' vital applications run continuously by bringing together hardware, software, analytics, and ongoing services. Our portfolio includes power, cooling, and IT infrastructure solutions and services that extend from the cloud to the edge of the network. Headquartered in Columbus, Ohio, USA, Vertiv employs around 20,000 people and does business in more than 130 countries. Visit https://www.vertiv.com/ to learn more.


  • LLMOps Engineer

    3 weeks ago


    Westerville, Ohio, United States Vertiv Full time

    Job SummaryAs a seasoned LLMOps Engineer, you will play a pivotal role in architecting and maintaining cutting-edge Large Language Models (LLMs) infrastructure and pipelines, collaborating closely with Generative AI Architects.This role will be based at Vertiv's Westerville, OH - HQ location.Key Responsibilities:Conceptualize and Develop ML/LLM Pipelines:...

  • LLMOps Engineer

    6 months ago


    Westerville, United States Vertiv Full time

    Job Summary As an LLMOps Engineer - Cloud/Gen AI , you will play a crucial role in building and maintaining the infrastructure and pipelines for cutting-edge Large Language Models (LLMs), working closely with Generative AI Architects. Your expertise in automating and streamlining the LLM lifecycle will be instrumental in ensuring the efficiency,...