Site Reliability Engineer
3 weeks ago
About the Position
We are looking for experienced engineers who understand AI systems and are excited about becoming global leaders in a completely novel field. We need people who can work independently as part of a small team.
You will be responsible for building the industry’s first end-to-end AI evaluation platform, starting with an offline evaluation harness and web platform. You’ll also help forge the foundation of our company’s engineering team and company culture.
We are a small team passionate about technology, research, and the extraordinary things we can build when we combine the two. We’re also fun, collaborative, and have meaningful lives outside of work.
You might be a good fit if you’ve worked as an ML Engineer, Infrastructure Engineer, or SRE. We are building a team in Boston, Massachusetts, and preference will be given to candidates who can join us in our office 2 days per week.
What you’ll do:
- Work closely with the engineering team and lead early product development, including designing and implementing a foundational evaluation platform.
- Balance trade-offs between performance and usability for our initial customers. We want our platform to be easy to use, and for customers to get results fast.
- Ship to learn: iterate, experiment, and test ideas to move us along our path to product-market fit.
- Bootstrap our initial development tooling, infrastructure, and deployment processes.
- Collaborate with external researchers, design partners, and early customers.
Requirements:
- More than 5 years of full-time industry experience building and operating production systems in a modern cloud / enterprise setting, using tools like Python, Terraform / Pulumi, or Kubernetes / Lambda, with security in mind
- You’re a self-starter who is comfortable with ambiguity and open-ended technical challenges
- You can own projects end-to-end, and effectively collaborate with teammates
- You can balance building high quality, secure software with prioritizing company goals
- Experience with different cloud providers like AWS, GCP, or Azure
Nice to have (but not required):
- Experience with ML systems, particularly high scale distributed inference for modern LLMs
- Experience building user-facing data, ML, or analytics products
- Experience working at an early stage startup
Benefits
- Competitive salary & equity stake in the company
- Medical, dental, and vision insurance
- 401k benefits with company match
- Unlimited PTO policy & a holiday shutdown during the last week of the year
- Paid commuter benefits for employees working hybrid in Boston
- Weekly team lunches
-
Site Reliability Engineer
2 days ago
Boston, United States Biofourmis Full time
Position Overview: Biofourmis is seeking a talented and experienced Site Reliability Engineer to join our dynamic global team. As a Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, scalability, and performance of our digital health platform. You will collaborate closely with cross-functional teams to design,...
-
Engineering Manager
2 days ago
Boston, United States New Balance Full time
Who We Are: Since 1906, New Balance has empowered people through sport and craftsmanship to create positive change in communities around the world. We innovate fearlessly, guided by our core values and driven by the belief that conventions were meant to be challenged. We foster a culture in which every associate feels welcomed and respected, where leaders...
-
Site Reliability Engineer
20 hours ago
Boston, MA, United States Biofourmis Full time
Position Overview: Biofourmis is seeking a talented and experienced Site Reliability Engineer to join our dynamic global team. As a Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, scalability, and performance of our digital health platform. You will collaborate closely with cross-functional teams to design,...
-
Reliability Engineer
2 days ago
Boston, United States Sequoia Biotech Consulting Full time
Responsibilities: The GxP Reliability Engineer will provide reliability engineering support for all facilities, utilities systems and equipment including analytical instrumentation, R&D lab support equipment and systems. This role will facilitate the deployment of Maintenance and Reliability Best Practices for new and existing equipment, facilities, and...
-
Site Reliability Engineer
2 days ago
Boston, United States BlueSkyClarity Full time
Site Reliability Engineer (Kubernetes, Microservices, Operations), Boston, MA, Downtown & Metro West Market. Compensation: Commensurate with experience, bonus, equity, benefits additional, EOE. Candidates must be a U.S. citizen or national, refugee, asylum, or lawful permanent resident. H1b...
-
Oracle: Principal Site Reliability Engineer
21 hours ago
Boston, MA, United States Soteriare Full time
Locations: Merrimack, NH; Boston, MA. Time type: Full time. Posted 5 days ago. Job requisition ID: 2093756.
Job Description: As a member of the TechOps SRE team, you'll work closely with our engineering partners to help enable and drive initiatives from design to implementation. This is a phenomenal opportunity to have a direct impact on the...
-
Lead Site Reliability Engineer
2 days ago
Boston, United States Dice Full time
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Motion Recruitment Partners, LLC, is seeking the following. Apply via Dice today! We are partnered with a dynamic startup poised to revolutionize data management, competing with established players. They are looking for a Senior Site Reliability Engineer to join...
-
Lead Site Reliability Engineer
1 day ago
Boston, United States Motion Recruitment Partners LLC Full time
We are partnered with a dynamic startup poised to revolutionize data management, competing with established players. They are looking for a Senior Site Reliability Engineer to join their growing DevOps team to ensure the reliability and performance of their highly scalable systems. You will work closely with software engineers to automate tooling and migrate...
-
Lead Site Reliability Engineer
2 weeks ago
Boston, United States Motion Recruitment Full time
We are partnered with a dynamic startup poised to revolutionize data management, competing with established players. They are looking for a Senior Site Reliability Engineer to join their growing DevOps team to ensure the reliability and performance of their highly scalable systems. You will work closely with software engineers to automate tooling and migrate...
-
Principal Site Reliability Engineer
2 weeks ago
Boston, United States Apollo Solutions Full time
Principal DevOps Engineer/SRE
Apollo Solutions have partnered with a disruptive early stage AI/ML start-up backed by top tier venture capital. In this role, you will be working closely with their founders and founding engineers to ensure fast, secure and reliable features can be delivered as well as building their infrastructure to feature massive...
-
Principal Site Reliability Engineer
1 week ago
Boston, United States Apollo Solutions Full time
Principal DevOps Engineer/SRE
If you are interested in applying for this job, please make sure you meet the following requirements as listed below. Apollo Solutions have partnered with a disruptive early stage AI/ML start-up backed by top tier venture capital. In this role, you will be working closely with their founders and founding engineers to ensure...
-
Reliability Engineer
20 hours ago
Boston, MA, United States Sequoia Biotech Consulting Full time
Responsibilities: The GxP Reliability Engineer will provide reliability engineering support for all facilities, utilities systems and equipment including analytical instrumentation, R&D lab support equipment and systems. This role will facilitate the deployment of Maintenance and Reliability Best Practices for new and existing equipment, facilities, and...
-
Senior Lead Site Reliability Engineer
2 weeks ago
Boston, United States Motion Recruitment Partners, LLC Full time
Job Description: We are working with a software company specializing in email marketing, automation, and customer relationship management for e-commerce businesses. They have gained popularity with e-commerce businesses, particularly those looking to harness the power of data-driven email marketing and automation to increase customer engagement, retention,...
-
Senior Lead Site Reliability Engineer
2 weeks ago
Boston, United States Motion Recruitment Full time
Job Description: We are working with a software company specializing in email marketing, automation, and customer relationship management for e-commerce businesses. They have gained popularity with e-commerce businesses, particularly those looking to harness the power of data-driven email marketing and automation to increase customer engagement, retention,...
-
Lead Site Reliability Engineer
21 hours ago
Boston, MA, United States Dice Full time
Dice is the leading career destination for tech experts at every stage of their careers. Our client, Motion Recruitment Partners, LLC, is seeking the following. Apply via Dice today! We are partnered with a dynamic startup poised to revolutionize data management, competing with established players. They are looking for a Senior Site Reliability Engineer to join...
-
Boston, United States Klaviyo Full time
At Klaviyo, we value the unique backgrounds, experiences and perspectives each Klaviyo (we call ourselves Klaviyos) brings to our workplace each and every day. We believe everyone deserves a fair shot at success and appreciate the experiences each person brings beyond the traditional job requirements. If you’re a close but not exact match with the...
-
Senior Site Reliability Engineer
2 weeks ago
Boston, Massachusetts, United States Marriott Full time
Job Number: Job Category: Information Technology. Location: Marriott International HQ, 7750 Wisconsin Avenue, Bethesda, Maryland, United States. Schedule: Full-Time. Located Remotely? Y. Relocation? N. Position Type: Management.
JOB SUMMARY: Lead role in the Monitoring and Performance Management function at Marriott. Performs detailed performance analysis of...
-
Senior Software Engineer
1 month ago
Boston, United States Alarm.com Full time
Job Description: Senior Software Engineer (Site Reliability Engineer)
Do you love working with the latest technologies? Excited about helping maintain, improve, and scale an environment that supports millions of customers and IoT devices? Passionate about code at scale? If the above holds true for you, then we would love to talk to you! Alarm.com...