Director, Site Reliability Engineering

2 weeks ago


New York, New York, United States NBCUniversal Full time $189,592 - $220,000
Company Description

NBCUniversal is one of the world's leading media and entertainment companies. We create world-class content, which we distribute across our portfolio of film, television, and streaming, and bring to life through our theme parks and consumer experiences. We own and operate leading entertainment and news brands, including NBC, NBC News, MSNBC, CNBC, NBC Sports, Telemundo, NBC Local Stations, Bravo, USA Network, and Peacock, our premium ad-supported streaming service. We produce and distribute premier filmed entertainment and programming through Universal Filmed Entertainment Group and Universal Studio Group, and have world-renowned theme parks and attractions through Universal Destinations & Experiences. NBCUniversal is a subsidiary of Comcast Corporation.

Our impact is rooted in improving the communities where our employees, customers, and audiences live and work. We have a rich tradition of giving back and ensuring our employees have the opportunity to serve their communities. We champion an inclusive culture and strive to attract and develop a talented workforce to create and deliver a wide range of content reflecting our world.

Comcast NBCUniversal has announced its intent to create a new publicly traded company ('Versant') comprised of most of NBCUniversal's cable television networks, including USA Network, CNBC, MSNBC, Oxygen, E, SYFY and Golf Channel along with complementary digital assets Fandango, Rotten Tomatoes, GolfNow, GolfPass, and SportsEngine. The well-capitalized company will have significant scale as a pure-play set of assets anchored by leading news, sports and entertainment content. The spin-off is expected to be completed during 2025.

Job Description
  • As a member of NBCUniversal's Production Software Engineering team, responsible for leading and performing custom architectural design, implementation, monitoring, and maintenance for a portfolio of production application environments.
  • Responsible for hands-on configuration and support as well as managing the work of other architects and engineers.
  • Work closely with our Principal Software Engineer on technical architecture and design based on customer product requirements, translating product requirements to technical designs and implementations.
  • Collaborate with cross-functional team members such as Scrum Leads, Software Engineers, QA Engineers, UX Designers, Product Managers, other Architects & Site Reliability Engineers (Contractors and/or Staff), and third-party vendors.
  • Effectively delegate responsibilities to team members, mentoring and providing them with repeatable processes, and verifying the quality of their work.
  • Utilize metrics to measure accomplishments and monitors progress, ensuring milestones and projects are completed on-time.
  • Communicate progress and the impact of solutions in technical terms to technology partners and in business terms to business partners.
  • Establish a reputation as the subject matter expert for every tech stack used in Production Software Engineering applications and how they all fit together while keeping current with new technologies, developing innovative technical ideas, and generating proposals.
  • Work with product teams to learn business objectives, development teams to plan platform needs, QA to understand test strategy, and SRE on environments and deployments.
  • Participate in Scrums, demos, and other Agile ceremonies and ensure accurate and timely status updates to the team.
  • Serve as primary interface with the NBCU Cyber Security team for all security-related initiatives, patching, remediations, etc.
  • Hands-on commissioning, configuration, administration, documentation, and support for all on-prem & cloud (AWS) environments (Servers, Storage, Databases, Networking, Security, etc.).
  • Technical impact analysis, implementation, and monitoring of all cyber, technology audit, enterprise engineering, & IT (Databases, Monitoring, etc.) activities related to Production Software Engineering applications and platforms.
  • Create and manage CI/CD pipelines using tool likes Cloud Formation, Foreman, Jenkins, Nexus, Rundeck, Ansible, and Puppet.
  • Lead implementation of monitoring and reporting framework using tools like Grafana, Influx, Graylog/Splunk, Selenium, New Relic, and Icinga.
  • Recognize and identify potential technical impacts of enterprise change controls which could affect our applications and customers.
  • Help improve performance, scalability, and reliability.
  • Build and maintain distributed infrastructure and automation.
  • Solve problems quickly and automates processes for the future.
  • Direct management of other engineers and architects (Contractors and/or Staff). 24x7x365 availability for production outages, emergencies, and deployments.
  • 100% telecommuting is permitted for this role.
Qualifications
  • Bachelor's degree in Computer Science, Information Technology, or related field (or foreign degree equivalent), plus 10 years of experience as a Software Architect, in the job offered, or in a related occupation.
  • The position requires each of the following skills, which must have been gained through 10 years of experience:
  1. Hands-on systems engineering experience on Linux/Unix platforms;
  2. Experience with technical leadership and people management;
  3. Experience with Continuous Delivery and SDLC practices;
  4. DevOps principles, experience with operational tools (Ansible or Puppet or Chef, Terraform) and best practices for infrastructure (on-prem or cloud) and software deployment;
  5. Operational experience with large scale applications;
  6. Experience with NoSQL data stores (MarkLogic, MongoDB, Cassandra, DynamoDB, Couchbase, PostgreSQL, etc.);
  7. Experience with a broad range of enterprise technologies;
  8. Experience building real-time, large-scale, low-latency distributed systems;
  9.  Experience with Agile tools like Jira, GitHub or similar.
  • The position requires each of the following skills, which must have been gained through eight (8) years of experience:
  1. Experience using AWS Cloud in a production environment;
  2. Experience with AWS IAM, EC2, RDS, S3, Lambda, batch and step functions.

This position is eligible for company sponsored benefits, including medical, dental and vision insurance, 401(k), paid leave, tuition reimbursement, and a variety of other discounts and perks. Learn more about the benefits offered by NBCUniversal by visiting the Benefits page of the Careers website.

Salary range: $189,592 - $220,000 per year

Full-time: 40 hours/week

Additional Information

As part of our selection process, external candidates may be required to attend an in-person interview with an NBCUniversal employee at one of our locations prior to a hiring decision. NBCUniversal's policy is to provide equal employment opportunities to all applicants and employees without regard to race, color, religion, creed, gender, gender identity or expression, age, national origin or ancestry, citizenship, disability, sexual orientation, marital status, pregnancy, veteran status, membership in the uniformed services, genetic information, or any other basis protected by applicable law. 

If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access as a result of your disability. You can request reasonable accommodations by emailing [email protected].

For LA County and City Residents Only:  NBCUniversal will consider for employment  qualified applicants with criminal histories, or arrest or conviction records, in a manner  consistent with relevant legal requirements, including the City of Los Angeles' Fair Chance Initiative For Hiring Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, where applicable.

Business Segment: Operations & Technology

  • New York, New York, United States WRITER Full time

    About WRITERWRITER is where the world's leading enterprises orchestrate AI-powered work. Our vision is to expand human capacity through superintelligence. And we're proving it's possible – through powerful, trustworthy AI that unites IT and business teams together to unlock enterprise-wide transformation. With WRITER's end-to-end platform, hundreds of...


  • New York, New York, United States Kraken Full time

    Help us use technology to make a big green dent in the universeKraken powers some of the most innovative global developments in energy.We're a technology company focused on creating a smart, sustainable energy system. From optimising renewable generation, creating a more intelligent grid and enabling utilities to provide excellent customer experiences, our...


  • New York, New York, United States Luma Financial Technologies Full time

    About the roleAt Luma, our Site Reliability Engineer (SRE) team keeps our platform reliable, secure, and lightning fast. They own everything from AWS infrastructure and Kubernetes clusters to CI/CD pipelines, monitoring, and alerting. If you're passionate about tackling big challenges, automating at scale, and making systems more resilient, we'd love to have...


  • New York, New York, United States Talkspace Corporate Full time $88,000 - $131,000

    At Talkspace, we are committed to fostering a diverse, equitable, inclusive, and belonging-centered workplace where everyone can thrive while making a difference in mental health. Want to help over two million people receive quality mental healthcare? Come join our mission of getting therapy in the hands of everyoneWe are looking for an experienced Site...


  • New York, New York, United States Madronich Dr Robert Full time

    At Talkspace, we are committed to fostering a diverse, equitable, inclusive, and belonging-centered workplace where everyone can thrive while making a difference in mental health. Want to help over two million people receive quality mental healthcare? Come join our mission of getting therapy in the hands of everyoneWe are looking for an experiencedSite...


  • New York, New York, United States Justworks Full time

    Company Description We work hard. We're helping businesses get off the ground, by simplifying payroll, benefits, payments and compliance needs, enabling them to focus on what matters. We're data driven and never stop inventing new ways to improve.Working at Justworks, you'll enjoy company happy hours, a welcoming and casual environment, great benefits, and...


  • New York, New York, United States S&P Global Full time

    About the Role:Grade Level (for internal use):09Site Reliability Engineer – Datadog Specialist The Team: The IT Operations team at S&P Dow Jones Indices (S&P DJI) is tasked with owning and maintaining the Production IT systems that underpin S&P DJI's index platforms and applications, ensuring their high availability. The team prioritizes service...


  • New York, New York, United States Unify Full time

    About UnifyUnify was founded January 17th, 2023 by Austin Hughes and Connor Heggie. Prior to Unify, Austin led Ramp's growth product team focused on new customer acquisition, and Connor was a machine learning research engineer at Scale AI. The rest of our team comes from companies like Airbnb, Spotify, Bridgewater and LinkedIn.Our mission is to build the...


  • New York, New York, United States MSCI Inc. Full time

    Your Team ResponsibilitiesThe MSCI office is offering an excellent opportunity for a Production Support Engineer to join the Real Estate Site Reliability Engineering team.The successful candidate will join a team of IT production engineers and will report to the Production Team Lead. The role requires close co-operation with the development and...

  • site director

    18 hours ago


    New York, New York, United States Lensa Full time

    Lensa is a career site that helps job seekers find great jobs in the US. We are not a staffing firm or agency. Lensa does not hire directly for these jobs, but promotes jobs on LinkedIn on behalf of its direct clients, recruitment ad agencies, and marketing partners. Lensa partners with DirectEmployers to promote this job for City of New York. Clicking...