Director of Site Reliability Engineering

1 week ago


New York, NY, United States Jobot Full time

Are you a hands-on leader within the DevOps/SRE space? Are you a supporter of responsible AI adoption & cybersecurity? This opportunity requires an incredibly versatile SRE to handle both hands on & strategic initiatives while leading a team

This Jobot Job is hosted by: Craig Rosecrans
Are you a fit? Easy Apply now by clicking the "Apply" button and sending us your resume.
Salary: $200,000 - $260,000 per year

A bit about us:

We are seeking a dynamic and innovative Director of Site Reliability Engineering to join our growing team. This role is pivotal in maintaining the stability and efficiency of our cutting-edge technology services, ensuring that our systems are always online and performant. The successful candidate will be responsible for leading a talented team of engineers, developing and implementing site reliability best practices, and driving continuous improvement initiatives. This is an exciting opportunity to be at the forefront of technology, working in a fast-paced, innovative environment where your work will have a direct impact on our business and customers.

Why join us?

  • Competitive Base Salary + Stock options
  • Company paid health plan for employees
  • Flexible Hours
  • Very generous PTO
  • Dental and Vision, FSA, HSA
  • Small team, autonomy
  • Many more great perks


Job Details

We are seeking a dynamic and innovative Director of Site Reliability Engineering to join our growing team. This role is pivotal in maintaining the stability and efficiency of our cutting-edge technology services, ensuring that our systems are always online and performant. The successful candidate will be responsible for leading a talented team of engineers, developing and implementing site reliability best practices, and driving continuous improvement initiatives. This is an exciting opportunity to be at the forefront of technology, working in a fast-paced, innovative environment where your work will have a direct impact on our business and customers.

Responsibilities:

1. Lead, mentor, and manage a high-performing team of Site Reliability Engineers.
2. Develop and implement best practices for system reliability, scalability, operability, and performance.
3. Collaborate with engineering teams to define service level objectives, ensure we are exceeding them, and implement strategies to improve upon them.
4. Drive the design and deployment of our multi-region architectures and on-prem deployments.
5. Utilize your expertise in K8 and CloudFormation to automate and innovate.
6. Oversee compliance with frameworks such as FedRAMP and SOC 2.
7. Develop a deep understanding of our AI/ML Infrastructure to ensure optimal performance and reliability.
8. Work closely with other teams to identify and correct bottlenecks in the delivery process.
9. Spearhead incident management, ensuring swift resolution, comprehensive post-mortem investigations, and effective preventative measures.

Qualifications:

1. Bachelor's degree in Computer Science, Engineering, or related field.
2. Minimum of 5 years of experience in Site Reliability Engineering leadership & 10+ years of SRE/Infrastructure/DevOps experience
3. Proven leadership experience managing high-performing engineering teams.
4. Extensive experience with K8, CloudFormation, and multi-region architectures.
5. In-depth understanding of compliance frameworks such as FedRAMP and SOC 2.
6. Prior experience in a startup environment is highly desirable.
7. Proficiency in AI/ML Infrastructure and on-prem deployments.
8. Exceptional problem-solving skills and attention to detail.
9. Excellent communication and interpersonal skills.
10. Proven ability to thrive in a fast-paced, dynamic environment.

Join us in this exciting role where you can make a significant impact. We are committed to fostering a culture of innovation, teamwork, and professional growth. If you are a driven, results-oriented leader with a passion for technology and a knack for problem-solving, we would love to hear from you.

Interested in hearing more? Easy Apply now by clicking the "Apply" button.

Jobot is an Equal Opportunity Employer. We provide an inclusive work environment that celebrates diversity and all qualified candidates receive consideration for employment without regard to race, color, sex, sexual orientation, gender identity, religion, national origin, age (40 and over), disability, military status, genetic information or any other basis protected by applicable federal, state, or local laws. Jobot also prohibits harassment of applicants or employees based on any of these protected categories. It is Jobot's policy to comply with all applicable federal, state and local laws respecting consideration of unemployment status in making hiring decisions.

Sometimes Jobot is required to perform background checks with your authorization. Jobot will consider qualified candidates with criminal histories in a manner consistent with any applicable federal, state, or local law regarding criminal backgrounds, including but not limited to the Los Angeles Fair Chance Initiative for Hiring and the San Francisco Fair Chance Ordinance.

Information collected and processed as part of your Jobot candidate profile, and any job applications, resumes, or other information you choose to submit is subject to Jobot's Privacy Policy, as well as the Jobot California Worker Privacy Notice and Jobot Notice Regarding Automated Employment Decision Tools which are available at .

By applying for this job, you agree to receive calls, AI-generated calls, text messages, or emails from Jobot, and/or its agents and contracted partners. Frequency varies for text messages. Message and data rates may apply. Carriers are not liable for delayed or undelivered messages. You can reply STOP to cancel and HELP for help. You can access our privacy policy here:

  • New York, NY, United States Writer Corporation Full time

    About this role We are looking for a foundational member of the Cloud infrastructure team at WRITER. This role will involve contributing to the development and implementation of our Site reliability engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of WRITER's critical systems, taking a...


  • New York, NY, United States Writer Corporation Full time

    About this role We are looking for a foundational member of the Cloud infrastructure team at WRITER. This role will involve contributing to the development and implementation of our Site reliability engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of WRITER's critical systems, taking a...


  • New York, NY, United States Writer Corporation Full time

    About this role We are looking for a foundational member of the Cloud infrastructure team at WRITER. This role will involve contributing to the development and implementation of our Site reliability engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of WRITER's critical systems, taking a...


  • New York, NY, United States Writer Corporation Full time

    About this role We are looking for a foundational member of the Cloud infrastructure team at WRITER. This role will involve contributing to the development and implementation of our Site reliability engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of WRITER's critical systems, taking a...


  • New York, NY, United States Kraken Full time

    Help us use technology to make a big green dent in the universe! Kraken powers some of the most innovative global developments in energy. We're a technology company focused on creating a smart, sustainable energy system. From optimising renewable generation, creating a more intelligent grid and enabling utilities to provide excellent customer experiences,...


  • New York, NY, United States Kraken Full time

    Help us use technology to make a big green dent in the universe! Kraken powers some of the most innovative global developments in energy. We're a technology company focused on creating a smart, sustainable energy system. From optimising renewable generation, creating a more intelligent grid and enabling utilities to provide excellent customer experiences,...


  • New York, NY, United States Patreon Full time

    Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans and build a lasting business including: paid memberships, free memberships, community chats, live video, and selling to fans directly with one-time purchases....


  • New York, NY, United States Patreon Full time

    Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans and build a lasting business including: paid memberships, free memberships, community chats, live video, and selling to fans directly with one-time purchases....


  • New York, NY, United States Patreon Full time

    Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans and build a lasting business including: paid memberships, free memberships, community chats, live video, and selling to fans directly with one-time purchases....


  • New York, NY, United States Elliot Partnership Full time

    Site Reliability Engineer - (Linux & Python/Go) New York, NY (Hybrid, 3 days in office) Highly competitive compensation package Join an elite technology and research group at the forefront of global finance, where world-class engineering and quantitative research converge to solve some of the most complex problems in any industry. Their teams are composed...