Lead Platform Engineer

4 days ago


Boston, United States Attis Full time

Lead Platform Engineer - High Performance Computing / HPC / Linux / Python / MLOps OverviewAn experienced Lead Platform Engineer (HPC) is required to build and operate the foundational systems for a mission-critical environmental forecasting platform. This is a role for a true "engineer's engineer", someone who possesses a deep, first-principles understanding of the tools they use and thrives on solving complex operational challenges at a massive scale. You will be responsible for the high-performance infrastructure that deploys, runs, and maintains data-heavy machine learning applications, directly impacting the ability to model and predict complex Earth systems. Why Join?Work on a Mission That Matters: Apply your engineering skills to a tangible, real-world scientific challenge. This isn't just another tech job; your work will contribute to a more accurate understanding of our planet's environment.Solve Unique Technical Problems: You will face fascinating challenges in data scale and processing that require deep technical thinking. This is an opportunity to build and maintain robust systems designed to handle petabyte-scale datasets and advanced AI/ML workloads.A Culture of Technical Excellence: Join a small, brilliant team of experts who value technical rigor and collaboration. This is an environment where your expertise will be challenged, respected, and crucial to the organization's success.Build the Foundation: As a key member of the operations team, you will have a significant impact on the reliability, scalability, and technical direction of the company's core machine learning applications. Relocation assistance and a hybrid work model are available for their offices in the Golden, CO, or greater Boston, MA areas.Package: A generous salary up to $230k + equity + 15% bonus + full benefits package. The CompanyMy client is a venture-backed, deep-tech company operating at the intersection of aerospace and large-scale data analytics. By leveraging a proprietary constellation of satellites, the company captures vast, unique datasets about the Earth's atmosphere. This data is the lifeblood of their business, enabling them to generate some of the world's most accurate and high-resolution environmental predictions for a range of global industries. The RoleYour primary responsibility is the operational excellence of the core AI/ML applications and computing infrastructure. This is a hands-on builder and operator role where you will be "in the weeds" of complex systems.Design, set up, and maintain the compute and storage infrastructure in both cloud and physical environments.Ensure the reliability, performance, and scalability of data-heavy, high-performance computing (HPC) applications.Deploy, run, and maintain large-scale AI/ML models, bridging the gap between model development and production.Manage and manipulate datasets of 20 TB or larger, ensuring data integrity and availability.Collaborate closely with machine learning scientists to understand their computational needs and provide a rock-solid foundation for their work. The Essential Requirements3+ years in a DevOps, MLOps, or a similar technical position with a focus on high-performance computing and data-intensive workloads.A profound foundational understanding of the tools you use, including the details of how they work "under the hood."Proven, hands-on experience managing and manipulating datasets of 20 TB or larger.Experience running and maintaining AI/ML foundation models or other existing large-scale AI/ML models in a production environment.Strong foundational knowledge of Python and a deep, practical understanding of Linux.Fluent with version control processes and systems, such as Git.Strong collaboration skills and the ability to work effectively in a high-performing team. What Will Make You Stand Out1+ years of experience fine-tuning and validating existing AI/ML models.Direct experience working with large-scale geographic or geospatial data sets.A demonstrable interest and passion for acquiring new skills in the AI/ML, HPC, and satellite data domains. If you are interested in this role, please apply with your resume through this site. SEO Keywords for SearchMLOps Engineer, Machine Learning Operations, DevOps Engineer, AI/ML Infrastructure, High-Performance Computing, HPC Engineer, Data Engineer, Python, Linux, Big Data, Geospatial Data, Satellite Data, Weather Forecasting, AI Operations, Cloud Infrastructure Engineer, Site Reliability Engineer, SRE, Data Infrastructure, Petabyte Scale Data, Machine Learning Deployment, AI Model Operations, Technical Operations, Scientific Computing, Infrastructure Engineer. DisclaimerAttis Global Ltd is an equal opportunities employer. No terminology in this advert is intended to discriminate on any of the grounds protected by law, and all qualified applicants will receive consideration for employment without regard to age, sex, race, national origin, religion or belief, disability, pregnancy and maternity, marital status, political affiliation, socio-economic status, sexual orientation, gender, gender identity and expression, and/or gender reassignment. M/F/D/V. We operate as a staffing agency and employment business. More information can be found at attisglobal.com.



  • Boston, United States TetraScience, Inc. Full time

    TetraScience is the Scientific Data and AI company. We are catalyzing the Scientific AI revolution by designing and industrializing AI-native scientific data sets, which we bring to life in a growing suite of next gen lab data management solutions, scientific use cases, and AI-enabled outcomes.TetraScience is the category leader in this vital new market,...


  • Boston, United States TetraScience Full time

    TetraScience is the Scientific Data and AI Cloud company. We are catalyzing the Scientific AI revolution by designing and industrializing AI-native scientific data sets, which we bring to life in a growing suite of next gen lab data management solutions, scientific use cases, and AI-enabled outcomes.TetraScience is the category leader in this vital new...


  • Boston, United States Company 1 - The Manufacturers Life Insurance Company Full time

    We are seeking a Lead Data Platform Solutions Engineer to join our Global Data Governance Technology team. This is an exciting opportunity to lead the design, strategy, and execution of robust, cloud-native infrastructure and data governance solutions that power enterprise-wide compliance and data quality initiatives. You will leverage modern technologies...


  • Boston, United States Ensono Full time

    A leading managed services provider is seeking a Software Engineer Lead to guide their engineering teams and develop enterprise applications. Responsibilities include mentoring, defining best practices, and overseeing API integrations with platforms like ServiceNow and Snowflake. The ideal candidate has strong Python coding skills and experience in...


  • Boston, United States Klaviyo Inc. Full time

    Lead Software Platform Engineer - Observability At Klaviyo, we value the unique backgrounds, experiences and perspectives each Klaviyo (we call ourselves Klaviyos) brings to our workplace each and every day. We believe everyone deserves a fair shot at success and appreciate the experiences each person brings beyond the traditional job requirements. If...


  • Boston, MA, United States Wellington Management Full time

    About Us Wellington Management offers comprehensive investment management capabilities that span nearly all segments of the global capital markets. Our investment solutions, tailored to the unique return and risk objectives of institutional clients in more than 60 countries, draw on a robust body of proprietary research and a collaborative culture that...


  • Boston, MA, United States Wellington Management Full time

    About Us Wellington Management offers comprehensive investment management capabilities that span nearly all segments of the global capital markets. Our investment solutions, tailored to the unique return and risk objectives of institutional clients in more than 60 countries, draw on a robust body of proprietary research and a collaborative culture that...


  • Boston, MA, United States Wellington Management Full time

    About Us Wellington Management offers comprehensive investment management capabilities that span nearly all segments of the global capital markets. Our investment solutions, tailored to the unique return and risk objectives of institutional clients in more than 60 countries, draw on a robust body of proprietary research and a collaborative culture that...


  • Boston, United States Wellington Management Full time

    1 month ago Be among the first 25 applicants Get AI-powered advice on this job and more exclusive features. Pay found in job post Retrieved from the description. Base pay range $100,000.00/yr - $225,000.00/yr About Us Wellington Management offers comprehensive investment management capabilities that span nearly all segments of the global capital markets. Our...


  • Boston, United States Liberty Mutual Insurance Full time

    A leading insurance company in Boston is seeking a Manager, Engineering/Tech Lead to oversee the Guidewire PolicyCenter platform. The role involves managing technical teams, ensuring the platform's scalability and reliability, and collaborating across various departments to meet business goals. Candidates should possess strong technical leadership, extensive...