Highly Skilled Platform Engineer Wanted for AI Ops Excellence
3 weeks ago
We are seeking a highly skilled and motivated individual to manage and optimize the operational performance of OpenAI GPT and related AI platforms across our enterprise.
The ideal candidate will have a blend of expertise in AI/ML model operations, DevOps, infrastructure management, and strong problem-solving skills to support mission-critical AI applications.
This role is responsible for ensuring efficient, scalable, and secure operations for AI-based products and services, including large language models (LLMs) and custom GPT solutions.
Key Responsibilities:
AI/LLM Operations & Monitoring:
- Manage the day-to-day operations of OpenAI GPT models and other AI/ML platforms.
- Implement automated monitoring and alerting for model performance, drift, and infrastructure health.
- Ensure high availability, reliability, and scalability of deployed GPT models across the enterprise.
- Optimize resource allocation and scaling for large model deployments, ensuring cost-effectiveness.
Automation & CI/CD Pipelines:
- Design and maintain automated CI/CD pipelines for rapid deployment of AI/ML models.
- Collaborate with data science and engineering teams to streamline model retraining and updates.
- Integrate MLOps tools and platforms (e.g., Kubeflow, MLflow, or other AI orchestration tools).
Security & Compliance:
- Implement and manage security policies around data privacy, model access, and infrastructure security.
- Ensure AI platforms adhere to enterprise-level compliance and governance standards.
- Identify and mitigate risks related to AI model vulnerabilities and data usage.
Infrastructure Management:
- Administer cloud-based infrastructure (e.g., Azure) used for AI/ML model deployment.
- Handle model orchestration, scaling, and optimization in containerized environments (Kubernetes, Docker).
- Support hybrid cloud/on-prem infrastructure setups where required.
Collaboration & Stakeholder Management:
- Work closely with data scientists, AI engineers, and product teams to align AI Ops activities with business goals.
- Serve as the central point of contact for troubleshooting AI-related issues, providing root-cause analysis, and addressing performance bottlenecks.
- Document operational workflows, best practices, and post-mortem analyses for continuous improvement.
Proactive Issue Resolution:
- Use predictive analytics and anomaly detection techniques to prevent AI platform issues before they impact the business.
- Lead incident management for Ai platform disruptions and resolve operational issues in a timely manner.
Experience:
We are looking for an individual with 5+ years of experience in AI Ops, MLOps, DevOps, or platform operations. The ideal candidate should have proven expertise with AI/ML platforms, especially OpenAI GPT, other LLMs, or enterprise-grade AI services.
Technical Expertise:
- Hands-on experience with cloud platforms, preferably Azure, for AI/ML deployments.
- Proficiency with AI frameworks and libraries (TensorFlow, PyTorch, etc.).
- Experience with CI/CD tools (Jenkins, GitLab, CircleCI) and infrastructure-as-code (Terraform, Ansible).
- Familiarity with containerization (Docker, Kubernetes) and orchestration tools.
- Understanding of AI model lifecycle management, versioning, and governance.
Skills:
- Strong scripting/programming skills (Python, Bash, etc.).
- Analytical and problem-solving mindset with the ability to address complex operational issues.
- Excellent communication skills to engage with cross-functional teams and present solutions to stakeholders.
- Experience in managing high-performance, distributed systems.
-
Generative AI Platform Lead
4 days ago
Frisco, Texas, United States T-Mobile Full time
Company Overview: At T-Mobile, we invest in our employees' growth and development, providing them with the tools and resources needed to succeed in their careers. Our company culture values collaboration, innovation, and customer satisfaction. We are looking for a highly skilled Generative AI Platform Lead to join our team. The ideal candidate will have a...
-
AI Operations Engineer
3 weeks ago
Frisco, Texas, United States EXOSOFT TECH Inc. Full time
Job Summary: We are seeking a highly skilled Cloud Platform Specialist to join our team at ExoSoft Tech Inc. in the role of AI Operations Engineer. This is a unique opportunity to work with cutting-edge technology and contribute to the development of innovative AI solutions.
-
Platform Operations Leader
4 weeks ago
Frisco, Texas, United States Omni Inclusive Full time
About the Role: We are seeking a Platform Operations Leader to join our team at Omni Inclusive. As a key member of our Platform Operations department, you will play a critical role in establishing roadmaps, designs, and implementing Platform Ops (Artificial Intelligence for IT Operations) for private cloud, and machine learning capabilities to automate and...
-
Platform Architect for Generative AI
3 weeks ago
Frisco, Texas, United States The Hartford Full time
Job Description: We are seeking a seasoned Principal Platform Engineer to join our team and contribute to the development of our Generative AI capability. As a key member of our team, you will be responsible for designing and building scalable, secure, and reliable solutions to support our Generative AI reference architecture. Key Responsibilities: Design and...
-
Software Engineer
3 weeks ago
Frisco, Texas, United States The Hartford Full time
Company Overview: The Hartford is a leading insurance company that goes beyond traditional policies and coverages. We believe in making a meaningful impact and empowering individuals to achieve their goals. Our team is dedicated to shaping the future of Generative AI and creating innovative solutions for our internal customers. About Us: Nearly 19,000 employees...
-
Enterprise AI Strategist
5 hours ago
Frisco, Texas, United States T-Mobile Full time
Job Overview: T-Mobile is a leader in the telecommunications industry, providing innovative wireless communication services to millions of customers. We are committed to investing in our employees, providing them with the tools and resources needed to succeed. Our Total Rewards Package reflects this commitment, offering competitive base salaries and...
-
AI/ML Technical Expert
3 weeks ago
Frisco, Texas, United States Futran Tech Solutions Pvt. Ltd. Full time
About the Position: Futran Tech Solutions Pvt. Ltd. is seeking a highly skilled AI/ML Technical Expert to lead the development of cutting-edge AI and ML solutions. We are looking for an experienced professional with a strong background in machine learning, data science, and software engineering. The ideal candidate will have a deep understanding of MLOps...
-
Cloud Engineer for Large Language Models
3 weeks ago
Frisco, Texas, United States The Hartford Full time
About the Role: We are looking for a highly skilled Principal Platform Engineer to join our team and help us build the foundation of our Generative AI capability. As a key member of our team, you will be responsible for designing and building scalable, secure, and reliable solutions to support our Generative AI reference architecture. Responsibilities: Design and...
-
Platform Operations Leader
19 hours ago
Frisco, Texas, United States Omni Inclusive Full time
About the Role: Omni Inclusive is seeking a Platform Operations Leader to join our team. The successful candidate will be responsible for establishing roadmaps, designs, and supporting the implementation of Platform Ops (Artificial Intelligence for IT Operations) for private cloud and machine learning capabilities to automate and streamline operational...
-
Tax and Accounting AI Content Strategist
2 days ago
Frisco, Texas, United States Thomson Reuters Full time
About the Role: In this position as a Tax and Accounting AI Content Strategist, you will be collaborating with internal and external teams to support product enhancements and initiatives related to AI and tax technology solutions. You will design and facilitate content experiments, including evaluation metrics and success criteria. Key...
-
Platform Security Engineer
2 days ago
Frisco, Texas, United States Omni Inclusive Full time
Role Summary: This is an exciting opportunity for a skilled Security Engineer (API) to join our team at Omni Inclusive. As a key member of our platform engineering group, you will play a vital role in ensuring the security and integrity of our cloud-based systems. You will be responsible for analyzing, designing, and implementing tool and service designs within...
-
AI Innovation Lead
4 days ago
Frisco, Texas, United States Disability Solutions Full time
Disability Solutions Overview: T-Mobile is committed to creating a culture of inclusion and accessibility, providing equal opportunities for individuals with disabilities to grow and thrive in their careers. Generative AI Role: The Principal Architect for Generative AI Solutions will play a critical role in designing and integrating generative AI technologies...
-
Enterprise Security Specialist
2 days ago
Frisco, Texas, United States Omni Inclusive Full time
Key Responsibilities: We are looking for an Enterprise Security Specialist to join our team at Omni Inclusive. The successful candidate will establish roadmaps, designs, and support the implementation of Platform Ops (Artificial Intelligence for IT Operations) for private cloud and machine learning capabilities to automate and streamline operational...
-
Vault Platform Architect
2 days ago
Frisco, Texas, United States Omni Inclusive Full time
Job Description: As a Vault Platform Architect at Omni Inclusive, you will be responsible for designing, deploying, and maintaining a robust vault platform that ensures the confidentiality, integrity, and availability of sensitive data. This includes experience with HashiCorp Vault, CyberArk, or similar PAM, secrets, and certificate management platforms, as well...
-
Generative AI Solutions Lead
20 hours ago
Frisco, Texas, United States T-MOBILE USA, Inc. Full time
Job Responsibilities:
Design and Integration: Design and oversee the integration of generative AI technologies into platforms and applications.
Technical Leadership: Guide and influence technical teams in the development and integration of AI technologies.
Solution Architecture: Create technical blueprints and solutions that integrate generative AI...
-
Senior Enterprise Architect, AI Innovation
20 hours ago
Frisco, Texas, United States T-MOBILE USA, Inc. Full time
Company Overview: T-MOBILE USA, Inc. is a leading provider of wireless communications services in the United States. We invest in our employees and offer a competitive total rewards package that includes a base salary and compensation plan, annual stock grant, employee stock purchase plan, 401(k), and access to free money coaches. Job Description: We are seeking...
-
Innovation Leader for AI Solutions
4 days ago
Frisco, Texas, United States Thomson Reuters Full time
We are seeking an experienced innovation leader to guide the delivery of cutting-edge AI solutions from concept to production. As a Manager of Research Engineering at Thomson Reuters Labs, you will foster a culture of innovation, growth, and continuous improvement. About the Role: In this opportunity, you will: Own the delivery of business-critical features for...
-
Frisco, Texas, United States T-Mobile Full time
About the Role: The Principal Architect for Generative AI Solutions at T-Mobile will be responsible for leading the development and implementation of AI-driven solutions to enhance operational efficiency and innovation. This role requires a strong understanding of AI technologies and their applications in enterprise settings, as well as excellent communication...
-
Frisco, Texas, United States Advanced Dynamics Corp Full time
Advanced Dynamics Corp, a leading AmLaw 200 firm with a strong presence in the US, Ireland, and Mexico, is seeking a seasoned Labor and Employment Associate Attorney to join their Collin County, TX office. The ideal candidate will have at least 4 years of experience in labor and employment law, with a proven track record of counseling and representing...
-
Senior Engineer
2 days ago
Frisco, Texas, United States Comerica Full time
We are looking for a Senior Engineer - Web Platform Development to join our team, responsible for designing and developing complex web applications using modern technologies. About the Role: This position will focus on the following technologies: Java, React, Terraform, Jenkins, Bitbucket, and AWS (both EC2 and Serverless configurations). The ideal candidate...