Site Reliability Engineer

3 weeks ago


Boston, United States Intelletec Limited Full time

About the Position

We are looking for experienced engineers who understand AI systems, and are excited about becoming global leaders in a completely novel field. We need people that can work independently as part of a small team.

You will be responsible for building the industry’s first end-to-end AI evaluation platform, starting with an offline evaluation harness and web platform. You’ll also help forge the foundation of our company’s engineering team and company culture.

We are a small team passionate about technology, research, and the extraordinary things we can build when we combine the two. We’re also fun, collaborative, and have meaningful lives outside of work.

You might be a good fit if you’ve worked as an ML Engineer, Infrastructure Engineer, or SRE. We are building a team in Boston, Massachusetts, and preference will be given to candidates who can join us in our office 2 days per week.



Do you have the skills to fill this role Read the complete details below, and make your application today.

What you’ll do:

  • Work closely with the engineering team and lead early product development, including designing and implementing a foundational evaluation platform.
  • Balance trade-offs for performance and usability for our initial customers. We want our platform to be easy to use, and for customers to get results fast.
  • Ship to learn: iterating, experimenting, and testing ideas to move us along our path to product-market fit.
  • Bootstrap our initial development tooling, infrastructure, and deployment processes
  • Collaborate with external researchers, design partners, and early customers.

Requirements:

  • > 5 years of full time industry experience building and operating production systems in a modern cloud / enterprise setting, using tools like Python, terraform / pulumi, or kubernetes / lambda — with security in mind
  • You’re a self-starter who is comfortable with ambiguity and open-ended technical challenges
  • You can own projects end-to-end, and effectively collaborate with teammates
  • You can balance building high quality, secure software with prioritizing company goals
  • Experience with different cloud providers like AWS, GCP, or Azure

Nice to have (but not required):

  • Experience with ML systems, particularly high scale distributed inference for modern LLMs
  • Experience building user-facing data, ML, or analytics products
  • Experience working at an early stage startup

Benefits

  • Competitive salary & equity stake in the company
  • Medical, dental, and vision insurance
  • 401k benefits with company match
  • Unlimited PTO policy & holiday shutdown last week of the year
  • Paid commuter benefits for employees working hybrid in Boston
  • Weekly team lunches


  • Boston, United States Biofourmis Full time

    Position Overview: Biofourmis is seeking a talented and experienced Site Reliability Engineer to join our dynamic global team. As a Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, scalability, and performance of our digital health platform. You will collaborate closely with cross-functional teams to design,...

  • Engineering Manager

    2 days ago


    Boston, United States New Balance Full time

    Who We Are: Since 1906, New Balance has empowered people through sport and craftsmanship to create positive change in communities around the world. We innovate fearlessly, guided by our core values and driven by the belief that conventions were meant to be challenged. We foster a culture in which every associate feels welcomed and respected, where leaders...


  • Boston, MA, United States Biofourmis Full time

    Position Overview: Biofourmis is seeking a talented and experienced Site Reliability Engineer to join our dynamic global team. As a Site Reliability Engineer (SRE), you will play a critical role in ensuring the reliability, scalability, and performance of our digital health platform. You will collaborate closely with cross-functional teams to design,...


  • Boston, United States Sequoia Biotech Consulting Full time

    Responsibilities The GxP Reliability Engineer will provide reliability engineering support for all facilities, utilities systems and equipment including analytical instrumentation, R&D lab support equipment and systems. This role will facilitate the deployment of Maintenance and Reliability Best Practices for new and existing equipment, facilities, and...


  • Boston, United States BlueSkyClarity Full time

    Site Reliability Engineer (Kubernetes, Microservices, Operations) Apply Site Reliability Engineer (Kubernetes, Microservices, Operations), Boston, MA, Downtown & Metro West Market Compensation Commensurate with experience, bonus, equity, benefits additional, EOE Candidates must be a U.S. citizen or national, refugee, asylum, or lawful permanent resident. H1b...


  • Boston, MA, United States Soteriare Full time

    Apply locations Merrimack, NH Boston, MA time type Full time posted on Posted 5 Days Ago job requisition id 2093756 Job Description: As a member of the TechOps SRE team, you'll work closely with our engineering partners to help enable and drive initiatives from design to implementation. This is a phenomenal opportunity to have a direct impact on the...


  • Boston, United States Dice Full time

    Dice is the leading career destination for tech experts at every stage of their careers. Our client, Motion Recruitment Partners, LLC, is seeking the following. Apply via Dice today! We are partnered with a a dynamic startup poised to revolutionize data management, competing with established players. They are looking for a Senior Site Reliability to join...


  • Boston, United States Motion Recruitment Partners LLC Full time

    We are partnered with aa dynamic startup poised to revolutionize data management, competing with established players. They are looking for a Senior Site Reliability to join their grown DevOps team to ensure the reliability and performance of their highly scalable systems. You will work closely with software engineers to automate tooling and migrate...


  • Boston, United States Motion Recruitment Full time

    We are partnered with a a dynamic startup poised to revolutionize data management, competing with established players. They are looking for a Senior Site Reliability to join their grown DevOps team to ensure the reliability and performance of their highly scalable systems. You will work closely with software engineers to automate tooling and migrate...


  • Boston, United States Apollo Solutions Full time

    Principal DevOps Engineer/SRE Apollo Solutions have partnered with a disruptive early stage AI/ML start-up backed by top tier venture capital. In this role, you will be working closely with their founders and founding engineers to ensure fast, secure and reliable features can be delivered as well as building their infrastructure to feature massive...


  • Boston, United States Apollo Solutions Full time

    Principal DevOps Engineer/SRE Apollo Solutions have partnered with a disruptive early stage AI/ML start-up backed by top tier venture capital. In this role, you will be working closely with their founders and founding engineers to ensure fast, secure and reliable features can be delivered as well as building their infrastructure to feature massive...


  • Boston, United States Apollo Solutions Full time

    Principal DevOps Engineer/SRE Apollo Solutions have partnered with a disruptive early stage AI/ML start-up backed by top tier venture capital. In this role, you will be working closely with their founders and founding engineers to ensure fast, secure and reliable features can be delivered as well as building their infrastructure to feature massive...


  • Boston, United States Apollo Solutions Full time

    Principal DevOps Engineer/SRE If you are interested in applying for this job, please make sure you meet the following requirements as listed below. Apollo Solutions have partnered with a disruptive early stage AI/ML start-up backed by top tier venture capital. In this role, you will be working closely with their founders and founding engineers to ensure...

  • Reliability Engineer

    20 hours ago


    Boston, MA, United States Sequoia Biotech Consulting Full time

    Responsibilities The GxP Reliability Engineer will provide reliability engineering support for all facilities, utilities systems and equipment including analytical instrumentation, R&D lab support equipment and systems. This role will facilitate the deployment of Maintenance and Reliability Best Practices for new and existing equipment, facilities, and...


  • Boston, United States Motion Recruitment Partners, LLC Full time

    Job Description We are working with a software company specializing in email marketing, automation, and customer relationship management for e-commerce businesses. They have gained popularity with e-commerce businesses, particularly those looking to harness the power of data-driven email marketing and automation to increase customer engagement, retention,...


  • Boston, United States Motion Recruitment Full time

    Job Description We are working with a software company specializing in email marketing, automation, and customer relationship management for e-commerce businesses. They have gained popularity with e-commerce businesses, particularly those looking to harness the power of data-driven email marketing and automation to increase customer engagement, retention,...


  • Boston, MA, United States Dice Full time

    Dice is the leading career destination for tech experts at every stage of their careers. Our client, Motion Recruitment Partners, LLC, is seeking the following. Apply via Dice today! We are partnered with a a dynamic startup poised to revolutionize data management, competing with established players. They are looking for a Senior Site Reliability to join...


  • Boston, United States Klaviyo Full time

    At Klaviyo, we value the unique backgrounds, experiences and perspectives each Klaviyo (we call ourselves Klaviyos) brings to our workplace each and every day. We believe everyone deserves a fair shot at success and appreciate the experiences each person brings beyond the traditional job requirements. If you’re a close but not exact match with the...


  • Boston, Massachusetts, United States Marriott Full time

    Job Number Job Category Information Technology Location Marriott International HQ, 7750 Wisconsin Avenue, Bethesda, Maryland, United States Schedule Full-Time Located Remotely? Y Relocation? N Position Type Management JOB SUMMARY Lead role in the Monitoring and Performance Management function at Marriott. Performs detailed performance analysis of...


  • Boston, United States Alarm.com Full time

    Job DescriptionJob DescriptionSenior Software Engineer (Site Reliability Engineer)Do you love working with the latest technologies? Excited about helping maintain, improve, and scale an environment that supports millions of customers and IoT devices? Passionate about code at scale?If the above holds true for you, then we would love to talk to you! Alarm.com...