Senior AI DevOps/SRE
1 day ago
Description We are currently seeking an experienced Senior AI DevOps/SRE to join our team. In this pivotal role, you will collaborate closely with data scientists and software developers to ensure seamless integration and optimize the operational efficiency of our AI deployments. Your expertise will be pivotal in deploying, maintaining, and scaling our cutting-edge AI solutions, encompassing LLMs and RAG systems. As a key team member, you will spearhead both traditional DevOps responsibilities and innovative approaches to MLOps. Your proactive involvement will be essential in driving the success of our AI initiatives and maximizing their impact across the organization. #EasyApply Responsibilities Implement and maintain CI/CD pipelines for AI and machine learning projects, ensuring robust deployment strategies and continuous integration Monitor and ensure the reliability, availability, and performance of AI applications, particularly those involving LLMs and RAG Collaborate with AI research teams to operationalize machine learning models and systems efficiently Develop and enforce best practices for version control, configuration management, and testing of AI-driven software solutions Utilize MLOps tools such as Kubeflow, MLflow, or TensorFlow Extended (TFX) to streamline the machine learning lifecycle from experimentation to production Implement monitoring solutions that track both system metrics and model performance to facilitate proactive issue resolution Participate in on-call rotations to support the operational health of critical systems, employing SRE principles to meet service-level objectives (SLOs) and reduce downtime Requirements Bachelors degree in Computer Science, Engineering, or a related field Proven experience as a DevOps Engineer or SRE, with a strong background in software development and automation Expertise in deployment and management of LLMs, including technologies like RAG Proficient in CI/CD tools (Jenkins, GitLab CI, CircleCI) and infrastructure as code (Terraform, Ansible) Solid knowledge of container orchestration technologies (Kubernetes, Docker) Familiarity with MLOps tools and practices to support machine learning lifecycle management Nice to have Experience with cloud services (AWS, GCP, Azure), particularly in AI/ML deployments Background in monitoring tools like Prometheus, Grafana, and ELK stack Understanding of Python, particularly in data science and machine learning contexts Certification in Kubernetes, AWS/GCP/Azure, or similar technologies We offer We connect like-minded people : Delivering innovative solutions to industry leaders, making a global impact Enjoyable working environment, whether it is the vibrant office or the comfort of your own home Opportunity to work abroad for up to two months per year Relocation opportunities within our offices in 50+ countries Corporate and social events We invest in your growth : Leadership development, career advising, soft skills and well-being programs Certifications, including GCP, Azure and AWS Unlimited access to LinkedIn Learning, Get Abstract, O'Reilly, Cloud Guru Free English classes with certified teachers We cover it all : Participation in the Employee Stock Purchase Plan Monetary bonuses for engaging in the referral program Comprehensive medical & family care package Five trust days per year (sick leave without a medical certificate) Benefits package (sports activities, a variety of stores and services) is a team of innovators united by a passion for technology. The dynamic and inclusive culture we embrace helps positively impact our communities, clients, and employees. Here you will collaborate with multi-national teams, contribute to numerous cutting-edge projects, deliver the most creative solutions, and have an opportunity to learn. Our people are at the heart of our success, and we are proud to provide talents with a solid ground to develop and grow.
-
Director, SRE
1 day ago
Georgia, United States MAPFRE Full timeDirector, SRE & IT Operations at MAPFRE USA Are you a visionary leader with a passion for resilience, automation, and innovation? MAPFRE USA is seeking a dynamic Director of Site Reliability Engineering (SRE) and IT Operations to architect and drive our infrastructure strategy into the future. About the Role Reporting to the SVP of IT Infrastructure,...
-
Director, SRE
4 days ago
Georgia, United States MAPFRE Full timeDirector, SRE & IT Operations at MAPFRE USA Are you a visionary leader with a passion for resilience, automation, and innovation? MAPFRE USA is seeking a dynamic Director of Site Reliability Engineering (SRE) and IT Operations to architect and drive our infrastructure strategy into the future. About the Role Reporting to the SVP of IT Infrastructure,...
-
Senior AWS DevOps Engineer
3 days ago
georgia, United States Epam Full timeDescription We are looking for a Senior DevOps Engineer with expertise in AWS Cloud administration and automation to strengthen our team. In this role, you'll engineer and operate our automated deployment framework, implement DevOps strategies and participate in service design and planning. #LI-MDA7 Responsibilities Implement a DevOps strategy with...
-
Senior GCP DevOps Engineer
2 weeks ago
georgia, United States Epam Full timeDescription We are looking for a Senior DevOps Engineer with production expertise in Google Cloud Platform (GCP) and a passion for building and maintaining efficient cloud infrastructure. In this role, you will architect, implement, and manage GCP solutions that ensure seamless performance, scalability, and reliability. #LI-MDA7 #EasyApply Responsibilities...
-
Senior Cloud Engineer
1 week ago
georgia, United States Epam Full timeDescription If you are an experienced professional passionate about DevOps and looking for a challenging and rewarding role, join our team as a Senior Cloud Engineer / DevOps Engineer . At EPAM, you will be involved in a project focused on transitioning existing Microservices hosted in AWS. Your expertise will be pivotal in modernizing our infrastructure...
-
Senior Azure DevOps Engineer
6 days ago
georgia, United States Epam Full timeDescription You are an industry visionary with a passion for designing complex Cloud solutions. You want to transform businesses so they may operate and grow successfully in the cloud-first world. If it sounds like you join us as a Senior Azure DevOps Engineer to work with highly skilled teams across the globe. You will use the latest technologies of the...
-
Lead GCP DevOps Engineer
3 days ago
georgia, United States Epam Full timeDescription We are looking for a Lead DevOps Engineer with advanced production expertise in Google Cloud Platform to join our team. In this role, you'll engage with cloud technologies and AI tools, manage complex projects and be recognized as an expert in CI/CD, public clouds and infrastructure engineering. #LI-MDA7 Responsibilities Control and support...
-
Senior Data DevOps Engineer
7 days ago
georgia, United States Epam Full timeDescription We are looking for a Senior Data DevOps to join EPAM and contribute to a project for a large customer in the e-commerce/fashion industry. As a Senior Data DevOps in Data Platform, you will focus on maintaining and implementing new features to the data transformation architecture, which is the backbone of the Customer's analytical data platform....
-
Data Solution Architect
23 hours ago
georgia, United States Epam Full timeDescription We are seeking an innovative Data Solution Architect/AI Architect to join our expanding team. In this role, you'll play a vital part in crafting solutions that meet diverse requirements and needs. Collaborating closely with teams of data scientists and machine learning engineers, you'll be at the center of driving innovation. Your expertise will...
-
Manager - Secure Data
2 days ago
Georgia, United States Boston Consulting Group Full timeLocations: Boston | Atlanta | LondonWho We AreBoston Consulting Group partners with leaders in business and society to tackle their most important challenges and capture their greatest opportunities. BCG was the pioneer in business strategy when it was founded in 1963. Today, we help clients with total transformation-inspiring complex change, enabling...