Current jobs related to Senior HPC Engineer - San Diego - ASRC Federal Holding Company


  • San Jose, California, United States Advanced Micro Devices, Inc. Full time

    Job Summary: We are seeking a highly skilled Senior HPC Infrastructure Engineer to join our team at Advanced Micro Devices, Inc. as a Principal Engineer, HPC EDA Infrastructure GRID. This is a critical role that will require the successful candidate to establish and maintain AMD's technological leadership position in HPC EDA infrastructure in the semiconductor...


  • San Jose, California, United States Cadence Design Systems Full time

    At Cadence Design Systems, we are seeking a skilled Senior Staff Systems Engineer to enhance our team. This position is ideal for an experienced individual with a robust background in systems engineering and administration. Position: Senior Staff Systems Engineer. Location: Not specified. Key Responsibilities: Facilitate customer implementations and guarantee...

  • HPC Engineer

    1 month ago


    San Fernando, United States Northrop Grumman Full time

    Northrop Grumman Classified Solution is seeking a Staff HPC Engineer to join our dynamic team of technical professionals in Northridge, California. Must have: Current DoD Secret clearance, adjudicated within the past 5 years. Basic qualifications for a Staff HPC Engineer level (T05): Associate’s degree with 16 years of experience, or a bachelor’s...

  • HPC engineer

    2 months ago


    San Jose, California, United States Zealogics LLC Full time

    Job Responsibilities: Candidates should have good domain knowledge in High-Performance Computing, scripting languages (Shell, Python), Linux administration, operating systems (Linux, Windows), and computer networking. Experience with distributed file systems (Lustre/NFS), virtualization, and containerization is a plus. Configuration and maintenance of the HPC computer...

  • HPC engineer

    2 months ago


    San Jose, United States Zealogics.com Full time

    Job Description: Job Responsibilities: Candidates should have good domain knowledge in High-Performance Computing, scripting languages (Shell, Python), Linux administration, operating systems (Linux, Windows), and computer networking. Experience with distributed file systems (Lustre/NFS), virtualization, and containerization is a plus. Configuration and...


  • San Jose, United States Advanced Micro Devices, Inc. Full time

    Overview: WHAT YOU DO AT AMD CHANGES EVERYTHING. We care deeply about transforming lives with AMD technology to enrich our industry, our communities, and the world. Our mission is to build great products that accelerate next-generation computing experiences, the building blocks for the data center, artificial intelligence, PCs, gaming and embedded....


  • San Jose, United States Blue Signal Search Full time

    Our client, a leading provider of HPC Solutions for over three decades, offering a comprehensive range of workstations, servers, and clusters, is seeking a Head of HPC Solution Architecture. This senior leadership role demands strong expertise in end-to-end systems solutioning, particularly in the Artificial Intelligence (AI) space. The primary focus is on...


  • San Jose, United States Conductor Full time

    Principal Engineer, HPC Interconnect Architecture San Jose, California, United States Please Note: To provide the best candidate experience with our high application volumes, we limit applications to a total of 10 over 6 months. Advancing the World’s Technology Together Our technology solutions power the tools you use every day--including smartphones,...


  • San Jose, California, United States Super Micro Computer Full time

    Job Summary: The Senior Manager of Solution Engineering is tasked with overseeing all facets of server rack integration operations. This position entails managing a dedicated team, collaborating with various departments, and ensuring the smooth integration of server racks in alignment with the organization's objectives and quality benchmarks. Key...


  • San Jose, California, United States Super Micro Computer Full time

    Job Summary: The Senior Manager of Solution Engineering is tasked with supervising all facets of server rack integration processes. This position entails leading a team, collaborating with various departments, and ensuring the flawless integration of server racks in alignment with organizational objectives and quality benchmarks. The Senior Solution Manager...


  • San Jose, California, United States Super Micro Computer Full time

    Job Summary: The Senior Manager of Solution Engineering will oversee all facets of server rack integration operations. This position entails leading a team, collaborating with various departments, and ensuring the effective integration of server racks in alignment with organizational objectives and quality benchmarks. Key Responsibilities: As a Senior Manager,...


  • San Jose, United States Advanced Micro Devices, Inc. Full time

    Establish and maintain AMD's technological leadership position in HPC EDA infrastructure in the semiconductor industry and represent AMD to the outside technical community, partners, and vendors. THE PERSON: You have a passion for scaling and optimizing...


  • San Jose, California, United States Super Micro Computer Full time

    Job Overview: The Senior Manager of Solution Engineering is tasked with supervising all facets of server rack integration operations. This position entails leading a team, collaborating with various departments, and ensuring the flawless integration of server racks in alignment with the organization's objectives and quality benchmarks. Key Responsibilities: The...


  • San Jose, California, United States Super Micro Computer Full time

    Job Summary: The Senior Manager of Solution Engineering will oversee all facets of server rack integration operations. This position requires effective management of a dedicated team, collaboration with various departments, and ensuring the smooth integration of server racks aligned with organizational objectives and quality benchmarks. Key...


  • San Jose, California, United States Super Micro Computer Full time

    Job Overview: The Senior Manager of Solution Engineering is tasked with overseeing the comprehensive operations of server rack integration. This pivotal role includes managing a dedicated team, collaborating with various departments, and ensuring the effective integration of server racks in alignment with the organization's objectives and quality...


  • San Jose, California, United States Super Micro Computer Full time

    Job Req ID: 24131. About Supermicro: Supermicro stands as a premier provider of cutting-edge server, storage, and networking solutions tailored for Data Center, Cloud Computing, Enterprise IT, Hadoop/Big Data, Hyperscale, HPC, and IoT/Embedded clients globally. Recognized as the #5 fastest growing entity among the Silicon Valley Top 50 technology firms, our...

  • Senior CAD Engineer

    4 weeks ago


    San Diego, United States InnoPhase IoT Full time

    About InnoPhase IoT: If you are keen to work with a bunch of brilliant people with various backgrounds, if you share the same value of working smart and celebrating successes, if you have enthusiasm for big technology in a small company, if your goals are to learn and experience different aspects of work, not just singing the same song every day, you'll find...


  • San Jose, California, United States Super Micro Computer Full time

    Job Overview: Job Req ID: 24450. About Supermicro: Supermicro stands as a premier provider of cutting-edge server, storage, and networking solutions tailored for Data Centers, Cloud Computing, Enterprise IT, Big Data, Hyperscale, High-Performance Computing (HPC), and IoT/Embedded markets globally. Recognized as the #5 fastest growing entity among the Silicon...


  • San Jose, California, United States Super Micro Computer Full time

    Job Overview: As a Senior Product Engineer at Supermicro, you will play a pivotal role in shaping the future of data center and server technologies. Your expertise will be crucial in developing proof of concepts and delivering technical presentations that highlight the unique features of Supermicro products at various industry events. Key Responsibilities: Stay...

Senior HPC Engineer

3 months ago


San Diego, United States ASRC Federal Holding Company Full time

ASRC Federal is searching for a Senior HPC Engineer to support InuTeq LLC. This role is fully telework.

ASRC Federal InuTeq provides High Performance Computing services throughout the HPC lifecycle, covering computational requirements, architecture, acquisition, and operations, to federal government customers. Our employees embrace innovation and are committed to a culture of continuous, standards-driven process improvement and assimilation of industry best practices. We are seeking to fill a role that primarily provides development for Supercomputing Batch Scheduling, with secondary support for Supercomputing Systems Administration, for our NASA NACS High Performance Computing (HPC) contract.

Summary: The successful candidate will be an active supporting member of the ASRC Federal team, reporting directly to the Manager of the Application Performance and Productivity (APP) group and matrixed directly to the Supercomputing Systems Team Manager. An individual at this skill level should have demonstrated extensive experience working with common HPC batch schedulers (e.g., PBS, Slurm, or Moab/Torque) while helping users of HPC resources with the various issues they might have getting applications to run efficiently. This individual should demonstrate experience installing, maintaining, and upgrading HPC systems. The individual, along with the entire HPC team, will be engaged in the day-to-day operations and support of the HPC resources. Activities may include system patching, OS upgrades, deploying new systems, writing scripts, and troubleshooting system issues on the HPC system. The ability to interact with users to determine symptoms, and then reproduce their issues to isolate the causes, is a critical skill for this work. There will also be activities in testing, benchmarking, user tool scripting, and analyzing trouble tickets to find patterns indicating system or user education issues.
Duties and Responsibilities:

  • Designs, deploys, and maintains HPC clusters with 2000+ nodes with InfiniBand and 100+ petabytes of data storage in production.
  • Writes and shepherds scalable feature designs through the entire software development process, from requirements and use cases to release.
  • Designs and develops scripts for system administration, monitoring, and usage reporting.
  • Modifies existing software to correct errors and/or improve performance.
  • Designs and develops scripts for system regression and performance testing (file systems (Lustre), scheduler (PBS), interconnect (HDR/NDR, Slingshot), high availability, etc.).
  • Troubleshoots, isolates, and resolves application, system, and other technical problems (hardware, software, and network).
  • Understands research use cases, researches and deploys new technologies, and defines cost, performance, and other trade-offs.
  • Manages and maintains tools for configuration management (HPCM, Ansible, and Git), resource management, scheduling, and all necessary aspects of HPC in accordance with best practices.
  • Researches, deploys, and manages networking and security infrastructure, including development of policies and procedures.
  • Assists in developing and writing proposals and publications.
  • Creates and provides clear documentation.
  • Mentors junior staff and cross-trains peers.
  • Provides after-hours/weekend support as required.

Moderate Supercomputing System Administration that contributes to:

  • Day-to-day operations of the Linux HPC clusters and storage systems
  • Proactively monitoring, analyzing, and correcting system issues
  • Development of scripts to automate repetitive tasks or tools to enhance support of the HPC systems
  • System performance analysis and tuning
  • Building, installing, and supporting user-requested software
  • Supporting evaluation and assessment of new HPC technology
  • Resolving user-reported issues and managing support ticket requests in Remedy

Requirements:

  • Bachelor’s degree in computer science or related field
  • Strong computer science background with in-depth systems-level knowledge in operating systems and networking
  • A minimum of 10 years of experience administering HPC systems and scheduling software (PBS, Slurm, or Moab/Torque)
  • A minimum of 10 years of experience in systems programming in heterogeneous, multi-platform HPC environments
  • Strong ability to analyze, debug, and maintain the integrity of an existing code base
  • Demonstrated equivalent of 5 years of Linux/UNIX user support experience and hands-on experience with administration of Linux systems
  • Experience working with HPC applications and proficiency in at least C, C++, or Fortran
  • Superior scripting skills and excellent attention to detail; proficiency in at least Python, Perl, or Bash
  • Strong ability to interact with customers to understand needs, elicit requirements, and get feedback on prototype solutions
  • Excellent communication and people skills; excellent time management and organizational skills
  • Experience with system configuration management tools (e.g., Puppet, Chef, Ansible)
  • Experience with revision control software (e.g., CVS, SVN, Git)
  • Track record of delivering commercial-quality software on schedule through multiple release cycles
  • Proficiency at technical writing

Preferred Skills:

  • Proficiency with analysis and problem-solving skills for debugging and optimization of applications
  • Familiarity/proficiency with OpenMP and Message Passing Interface (MPI) programming
  • Experience with Lustre and InfiniBand
  • Experience with cloud technologies (AWS, Azure, GCP), OpenStack, or Kubernetes is a plus

ASRC Federal and its Subsidiaries are Equal Opportunity / Affirmative Action employers. All qualified applicants will receive consideration for employment without regard to race, gender, color, age, sexual orientation, gender identification, national origin, religion, marital status, ancestry, citizenship, disability, protected veteran status, or any other factor prohibited by applicable law.