Senior High Performance Computing System Administrator

1 month ago


New York, United States | Icahn School of Medicine at Mount Sinai | Full time

Strength Through Diversity

Groundbreaking science. Advancing medicine. Healing made personal.

Roles & Responsibilities:

The Scientific Computing and Data group at the Icahn School of Medicine at Mount Sinai partners with scientists to accelerate scientific discovery. To achieve these aims, we support a cutting-edge high-performance computing and data ecosystem along with MD/PhD-level support for researchers. The group is composed of a high-performance computing team, the research clinical data warehouse team and a research data services team.


The Senior HPC Administrator, High Performance Computational and Data Ecosystem, is responsible for a computational and data science ecosystem for researchers at Mount Sinai. This ecosystem includes high-performance computing (HPC) systems, clinical research databases, and a software development infrastructure for local and national projects. To meet Sinai’s scientific and clinical goals, the Senior Administrator has a good technical understanding of computational, data and software development systems along with a strong focus on customer service for researchers. The Senior HPC Administrator is an expert troubleshooter and productive team member who leads projects to effective and efficient completion independently, with little to no supervision. This position reports to the Director for Computational & Data Ecosystem in Scientific Computing. Specific responsibilities are listed below.


Responsibilities


  • Designs, deploys and maintains Scientific Computing’s computational and data science ecosystem, including ~30,000 cores with high-bandwidth, low-latency interconnects, GPUs, large shared-memory nodes, databases, scientific workflows and 30+ petabytes of storage in production, clinical data warehouse and software development environments.
  • Leads the troubleshooting, isolation and resolution of all technical issues (including application, system, hardware, software, and network). Actively monitors the systems.
  • Maintains, tunes and manages computational, data, cloud technologies and workflow systems for ISMMS researchers, scientists and their external collaborators. Defines and deploys a comprehensive computational and data vision. Identifies and communicates system advantages/disadvantages and tradeoffs.
  • Designs, develops and implements system administration tasks, including hardware and software configuration, configuration management, system monitoring (including the development and maintenance of regression tests), usage reporting, system performance (file systems, scheduler, interconnect, high availability, etc.), security, networking and metrics.
  • Collaborates effectively with research and hospital system IT, compliance, HIPAA, security and other departments to ensure compliance with all regulations and Sinai policies.
  • Participates in the integration of HPC resources with laboratory equipment such as sequencers, and with clinical and research data resources and systems. Incorporates and links data and compute resources.
  • Researches, deploys, optimizes and actively monitors resource management and scheduling software and policies. Designs, tunes, manages and upgrades parallel file systems, storage and data-oriented resources.
  • Researches, deploys and manages security infrastructure, including development of policies and procedures.
  • Maintains all necessary aspects of HPC in accordance with best practices. Develops and implements backup policies.
  • Prepares and manages budgets for hardware, software and maintenance. Participates in chargeback/fee recovery analysis and provides suggestions to make operations sustainable.
  • Assists in developing and writing system design for research proposals. Creates and provides clear documentation.
  • Works effectively and productively with other team members within the group and across Mount Sinai.
  • Performs related duties as assigned or requested.
  • Provides after hours support for critical system and production issues.
  • Answers and resolves user tickets.


Qualifications:


  • Bachelor's degree in computer science, engineering or another scientific field. Master's or PhD preferred
  • 8+ years (more preferred) of progressive HPC system administration and operations experience, preferably in a Red Hat/CentOS Linux batch HPC cluster environment
  • Must be an expert troubleshooter, a team player and customer focused
  • Experience with a job scheduler such as LSF or Slurm, and with parallel file systems and storage
  • Experience with networking and security
  • Experience with configuration management systems such as xCAT, Puppet and/or Ansible
  • Experience with databases and web services
  • Experience with InfiniBand and Gigabit Ethernet
  • Experience in an academic or research community environment
  • Scripting and programming experience
  • Experience with Cloud Computing
  • Ability to multitask effectively in a dynamic environment
  • Excellent communication skills, analytical ability, strong judgment and management skills, and the ability to work effectively as a liaison between research and technology teams.
  • Strong written, oral, and interpersonal communication skills

Preferred Experience

  • Advanced degree
  • Experience with GPFS, LSF, TSM, and InfiniBand (IB) and Ethernet networking
  • Experience with databases and web services is highly preferred


Strength Through Diversity


The Mount Sinai Health System believes that diversity, equity, and inclusion are key drivers for excellence. We share a common devotion to delivering exceptional patient care. When you join us, you become a part of Mount Sinai’s unrivaled record of achievement, education, and advancement as we revolutionize medicine together. We invite you to participate actively as a part of the Mount Sinai Health System team by:


  • Using a lens of equity in all aspects of patient care delivery, education, and research to promote policies and practices to allow opportunities for all to thrive and reach their potential.
  • Serving as a role model confronting racist, sexist, or other inappropriate actions by speaking up, challenging exclusionary organizational practices, and standing side-by-side in support of colleagues who experience discrimination.
  • Inspiring and fostering an environment of anti-racist behaviors among and between departments and co-workers.


At Mount Sinai, our leaders strive to learn, empower others, and embrace change to further advance equity and improve the well-being of staff, patients, and the organization. We expect our leaders to embrace anti-racism, create a collaborative and respectful environment, and constructively disrupt the status quo to improve the system and enhance care for our patients. We work hard to create an inclusive, welcoming and nurturing work environment where all feel they are valued, belong and are able to advance professionally.


Explore more about this opportunity and how you can help us write a new chapter in our history.


About the Mount Sinai Health System:


Mount Sinai Health System is one of the largest academic medical systems in the New York metro area, with more than 43,000 employees working across eight hospitals, more than 400 outpatient practices, more than 300 labs, a school of nursing, and a leading school of medicine and graduate education. Mount Sinai advances health for all people, everywhere, by taking on the most complex health care challenges of our time: discovering and applying new scientific learning and knowledge; developing safer, more effective treatments; educating the next generation of medical leaders and innovators; and supporting local communities by delivering high-quality care to all who need it. Through the integration of its hospitals, labs, and schools, Mount Sinai offers comprehensive health care solutions from birth through geriatrics, leveraging innovative approaches such as artificial intelligence and informatics while keeping patients’ medical and emotional needs at the center of all treatment. The Health System includes approximately 7,400 primary and specialty care physicians; 13 joint-venture outpatient surgery centers throughout the five boroughs of New York City, Westchester, Long Island, and Florida; and more than 30 affiliated community health centers. We are consistently ranked by U.S. News & World Report's Best Hospitals, receiving high "Honor Roll" status, and are highly ranked: No. 1 in Geriatrics and top 20 in Cardiology/Heart Surgery, Diabetes/Endocrinology, Gastroenterology/GI Surgery, Neurology/Neurosurgery, Orthopedics, Pulmonology/Lung Surgery, Rehabilitation, and Urology. New York Eye and Ear Infirmary of Mount Sinai is ranked No. 12 in Ophthalmology. U.S. News & World Report’s “Best Children’s Hospitals” ranks Mount Sinai Kravis Children's Hospital among the country’s best in several pediatric specialties. The Icahn School of Medicine at Mount Sinai is ranked No. 14 nationwide in National Institutes of Health funding and in the 99th percentile in research dollars per investigator according to the Association of American Medical Colleges. Newsweek’s “The World’s Best Smart Hospitals” ranks The Mount Sinai Hospital as No. 1 in New York and in the top five globally, and Mount Sinai Morningside in the top 20 globally.


The Mount Sinai Health System is an equal opportunity employer. We comply with applicable Federal civil rights laws and do not discriminate, exclude, or treat people differently on the basis of race, color, national origin, age, religion, disability, sex, sexual orientation, gender identity, or gender expression. We are passionately committed to addressing racism and its effects on our faculty, staff, students, trainees, patients, visitors, and the communities we serve. Our goal is for Mount Sinai to become an anti-racist health care and learning institution that intentionally addresses structural racism.


EOE Minorities/Women/Disabled/Veterans


