AI/HPC Infrastructure Specialist

5 days ago


Emory, United States DSO National Laboratories Full time
Job Title: AI/HPC Infrastructure Engineer

At DSO National Laboratories, we are seeking an experienced AI/HPC Infrastructure Engineer to join our dynamic team.

Job Summary

We are looking for a skilled engineer to design, implement, and manage the infrastructure that supports our AI initiatives. As an AI Infrastructure Engineer, you will play a crucial role in ensuring the optimal performance and reliability of our AI models.

Key Responsibilities
  • Design and implement scalable and efficient on-premise AI infrastructure solutions to train and serve large AI models.
  • Collaborate with cross-functional teams to identify and address performance bottlenecks, latency issues, and scalability challenges in AI infrastructure.
  • Establish robust monitoring systems to track the health, performance, and utilization of AI infrastructure components.
  • Implement security measures and best practices to protect AI infrastructure and data.
  • Work closely with cross-functional teams to understand their requirements and provide technical guidance.
Requirements
  • Degree in Computer Engineering / Computer Science/ Artificial Intelligence
  • Familiarity with cluster management tools like Bright, data processing frameworks (e.g., Apache Spark, Apache Beam), machine learning frameworks (e.g., TensorFlow, PyTorch), networking for HPC applications, containerization technologies (e.g., Docker, Kubernetes) and HPC scheduling
Preferred Qualifications
  • Experience in optimizing infrastructure for performance, scalability, and cost-efficiency.
  • Demonstrated ability to analyse complex problems, propose innovative solutions, and implement them effectively.
About Us

DSO National Laboratories is Singapore's largest defence research and development (R&D) organisation, with the critical mission to develop technological solutions to sharpen the cutting edge of Singapore's national security.

We offer a dynamic and challenging work environment, with opportunities for career growth and development. If you are a motivated and talented individual who is passionate about AI and HPC infrastructure, we encourage you to apply for this exciting opportunity.



  • Emory, United States DSO National Laboratories Full time

    AI/HPC Infrastructure EngineerAt DSO National Laboratories, we are seeking an experienced AI/HPC Infrastructure Engineer to join our dynamic team.Key Responsibilities:Design, implement, and manage infrastructure that supports AI initiativesCollaborate with cross-functional teams to design scalable and efficient on-premise AI infrastructure solutionsIdentify...


  • Emory, United States TechnipFMC Full time

    Job Title: Infrastructure EngineerWe are seeking an experienced IT Infrastructure Engineer to join our Information & Digital Services (IDS) department in Singapore.Job Summary:The successful candidate will be responsible for maintaining, monitoring, and supporting our infrastructure environment, including data centers, physical servers, virtual...


  • Emory, United States TechnipFMC Full time

    Job Title: Infrastructure EngineerWe are seeking an experienced IT Infrastructure Engineer to join our Information & Digital Services (IDS) department in Singapore.Job SummaryThe successful candidate will be responsible for maintaining, monitoring, and supporting our infrastructure environment, including data centers, physical servers, virtual...

  • Azure System Engineer

    4 weeks ago


    Emory, United States Randstad Full time

    About the JobYou will be responsible for maintaining and implementing system policies and procedures, ensuring IT compliance, and providing system engineering support to the manager/team.Key Responsibilities:Maintain and implement system policies and proceduresImplement and maintain the system, including changesMonitor the systems and ensure IT...