Cloud Solutions Architect: High-Performance Computing Expert

3 days ago


Dallas, Texas, United States | Lavendo | Full time
About Lavendo

Lavendo is at the forefront of the AI revolution, providing cutting-edge infrastructure that's reshaping the landscape of artificial intelligence. Our mission is to democratize access to world-class AI infrastructure, enabling organizations of all sizes to turn bold AI ambitions into reality.

Job Description:

We're seeking a skilled Cloud Solutions Architect (Remote) to play a key role in designing, implementing, and maintaining large-scale machine learning (ML) training and inference workflows for clients. As a Cloud Solutions Architect, you'll provide expert, hands-on guidance to help clients achieve optimal ML pipeline performance and efficiency.

Responsibilities:
  • Design and implement scalable ML training and inference workflows using Kubernetes and Slurm, focusing on containerization (e.g., Docker) and orchestration.
  • Work with data scientists and engineers to optimize ML model training and inference performance.
  • Develop and expand a library of ready-to-deploy, standardized training and inference solutions by designing, deploying, and managing Kubernetes and Slurm clusters for large-scale ML training.
  • Integrate Kubernetes and Slurm with popular ML frameworks such as TensorFlow, PyTorch, or MXNet, ensuring seamless execution of distributed ML training workloads.
  • Develop monitoring and logging tools to track distributed training performance, identify bottlenecks, and troubleshoot issues.
  • Create automation scripts and tools to streamline ML training workflows, leveraging technologies like Ansible, Terraform, or Python.
Requirements:
  • At least 3 years of experience in MLOps, DevOps, or a related field.
  • Strong experience with Kubernetes and containerization (e.g., Docker).
  • Experience with cloud providers like AWS, GCP, or Azure.
  • Familiarity with Slurm or other distributed computing frameworks.
  • Proficiency in Python, with experience in ML frameworks such as TensorFlow, PyTorch, or MXNet.
  • Knowledge of ML model serving and deployment.
  • Familiarity with CI/CD pipelines and tools like Jenkins, GitLab CI/CD, or CircleCI.
  • Experience with monitoring and logging tools like Prometheus, Grafana, or the ELK Stack.
  • Solid understanding of distributed computing principles, parallel processing, and job scheduling.
  • Experience with automation tools like Ansible or Terraform.
Attributes for Success:
  • Passion for AI and transformative technologies.
  • A genuine interest in optimizing and scaling ML solutions for high-impact results.
  • Results-driven mindset and problem-solver mentality.
  • Adaptability and ability to thrive in a fast-paced startup environment.
  • Comfortable working with an international team and diverse client base.
  • Communication and collaboration skills, with experience working in cross-functional teams.
Benefits:
  • A highly competitive salary range of $130,000-$175,000 per year (negotiable based on experience and skills).
  • Full medical benefits and life insurance: 100% coverage for health, vision, and dental insurance for employees and their families.
  • 401(k) match program with up to a 4% company match.
  • Stock options plan in a publicly traded company.
  • PTO and paid holidays.
  • Flexible remote work environment.
  • Reimbursement of up to $85/month for mobile and internet.
  • Work with state-of-the-art AI and cloud technologies, including the latest NVIDIA GPUs (H100, L40S, with H200 and Blackwell chips coming soon).
  • Be part of a team that operates one of the most powerful commercially available supercomputers.
  • Contribute to sustainable AI infrastructure with energy-efficient data centers that recover waste heat to warm nearby residential buildings.

