Director, Site Reliability Engineering
3 weeks ago
Production Software Engineering Architect
As a member of NBCUniversal's Production Software Engineering team, you will be responsible for leading and performing custom architectural design, implementation, monitoring, and maintenance for a portfolio of production application environments.
Responsible for hands-on configuration and support as well as managing the work of other architects and engineers.
Work closely with our Principal Software Engineer on technical architecture and design based on customer product requirements, translating product requirements to technical designs and implementations.
Collaborate with cross-functional team members such as Scrum Leads, Software Engineers, QA Engineers, UX Designers, Product Managers, other Architects & Site Reliability Engineers (Contractors and/or Staff), and third-party vendors.
Effectively delegate responsibilities to team members, mentoring and providing them with repeatable processes, and verifying the quality of their work.
Use metrics to measure accomplishments and monitor progress, ensuring milestones and projects are completed on time.
Communicate progress and the impact of solutions in technical terms to technology partners and in business terms to business partners.
Establish a reputation as the subject matter expert for every tech stack used in Production Software Engineering applications and how they all fit together while keeping current with new technologies, developing innovative technical ideas, and generating proposals.
Work with product teams to learn business objectives, development teams to plan platform needs, QA to understand test strategy, and SRE on environments and deployments.
Participate in Scrums, demos, and other Agile ceremonies and ensure accurate and timely status updates to the team.
Serve as primary interface with the NBCU Cyber Security team for all security-related initiatives, patching, remediations, etc.
Hands-on commissioning, configuration, administration, documentation, and support for all on-prem & cloud (AWS) environments (Servers, Storage, Databases, Networking, Security, etc.).
Technical impact analysis, implementation, and monitoring of all cyber, technology audit, enterprise engineering, & IT (Databases, Monitoring, etc.) activities related to Production Software Engineering applications and platforms.
Create and manage CI/CD pipelines using tools like CloudFormation, Foreman, Jenkins, Nexus, Rundeck, Ansible, and Puppet.
Lead implementation of monitoring and reporting framework using tools like Grafana, Influx, Graylog/Splunk, Selenium, New Relic, and Icinga.
Recognize and identify potential technical impacts of enterprise change controls which could affect our applications and customers.
Help improve performance, scalability, and reliability.
Build and maintain distributed infrastructure and automation.
Solve problems quickly and automate processes for the future.
Direct management of other engineers and architects (Contractors and/or Staff).
24x7x365 availability for production outages, emergencies, and deployments.
100% telecommuting is permitted for this role.
Bachelor's degree in Computer Science, Information Technology, or related field (or foreign degree equivalent), plus 10 years of experience as a Software Architect, in the job offered, or in a related occupation.
The position requires each of the following skills, which must have been gained through 10 years of experience:
Hands-on systems engineering experience on Linux/Unix platforms;
Experience with technical leadership and people management;
Experience with Continuous Delivery and SDLC practices;
DevOps principles, experience with operational tools (Ansible or Puppet or Chef, Terraform) and best practices for infrastructure (on-prem or cloud) and software deployment;
Operational experience with large scale applications;
Experience with NoSQL data stores (MarkLogic, MongoDB, Cassandra, DynamoDB, Couchbase, PostgreSQL, etc.);
Experience with a broad range of enterprise technologies;
Experience building real-time, large-scale, low-latency distributed systems;
Experience with Agile tools like Jira, GitHub or similar.
The position requires each of the following skills, which must have been gained through eight (8) years of experience:
Experience using AWS Cloud in a production environment;
Experience with AWS IAM, EC2, RDS, S3, Lambda, Batch, and Step Functions.
This position is eligible for company sponsored benefits, including medical, dental and vision insurance, 401(k), paid leave, tuition reimbursement, and a variety of other discounts and perks. Learn more about the benefits offered by NBCUniversal by visiting the Benefits page of the Careers website.
Salary range: $189,592 - $220,000 per year
Full-time: 40 hours/week
As part of our selection process, external candidates may be required to attend an in-person interview with an NBCUniversal employee at one of our locations prior to a hiring decision.
NBCUniversal's policy is to provide equal employment opportunities to all applicants and employees without regard to race, color, religion, creed, gender, gender identity or expression, age, national origin or ancestry, citizenship, disability, sexual orientation, marital status, pregnancy, veteran status, membership in the uniformed services, genetic information, or any other basis protected by applicable law.
If you are a qualified individual with a disability or a disabled veteran, you have the right to request a reasonable accommodation if you are unable or limited in your ability to use or access nbcunicareers.com as a result of your disability. You can request reasonable accommodations by emailing AccessibilitySupport@nbcuni.com.
For LA County and City Residents Only: NBCUniversal will consider for employment qualified applicants with criminal histories, or arrest or conviction records, in a manner consistent with relevant legal requirements, including the City of Los Angeles' Fair Chance Initiative For Hiring Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act, where applicable.
-
Director of Site Reliability Engineering
2 weeks ago
New York, United States Jobot Full time
Director of Site Reliability Engineering. Base pay range: $200,000 - $260,000 per year. We are seeking a dynamic and innovative Director of Site Reliability Engineering to join our growing team. This role is pivotal in maintaining the stability and efficiency of our cutting-edge technology services, ensuring that our systems are always online and...
-
Director of Site Reliability
2 weeks ago
New York, United States Jobot Full time
A leading tech company is seeking a Director of Site Reliability Engineering in New York. The role involves leading a team to maintain exceptional system performance and implementing best practices for site reliability. Candidates should have substantial experience in SRE and demonstrated leadership in engineering teams, especially in innovative and dynamic...
-
Site reliability engineer
1 week ago
New York, United States WRITER Full time
Base pay range: $157,700.00/yr - $277,800.00/yr
About This Role
We are looking for a foundational member of the Cloud infrastructure team at WRITER. This role will involve contributing to the development and implementation of our Site reliability engineering (SRE) program. The ideal candidate...
-
Site Reliability Engineer
1 week ago
New York, United States Upward Trend Full time
This range is provided by Upward Trend. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more.
Base pay range: $130,000.00/yr - $300,000.00/yr
Site Reliability Engineer
Major global hedge fund
New York
Our client, a multibillion AUM hedge fund headquartered in the United States, is looking for a Site Reliability...
-
Site Reliability Engineer
1 week ago
New York, United States Quantitative Systems Full time
This range is provided by Quantitative Systems. Your actual pay will be based on your skills and experience; talk with your recruiter to learn more.
Base pay range: $250,000.00/yr - $300,000.00/yr
Senior Site Reliability Engineer – Trading Systems
This isn’t a support role. It’s an...
-
Site Reliability Engineer
1 week ago
New York, United States STAND 8 Technology Consulting Full time
Site Reliability Engineer – Contract (Hybrid)
STAND 8 Technology Consulting invites a Site Reliability Engineer (SRE) to design, build, and maintain reliable, scalable, and high-performance systems. This hybrid role is based in New York, NY, with onsite work 4 days per week.
Compensation
Hourly range: $73.00/hr – $83.00/hr (base pay range). Individual...
-
Engineering Manager, Site Reliability
2 weeks ago
New York, New York, United States Reddit, Inc. Full time
Reddit is a community of communities. It's built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 116 million daily active unique visitors, Reddit is one of...
-
Site Reliability Engineer
5 hours ago
New York, United States Patreon, Inc. Full time
Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans and build a lasting business including: paid memberships, free memberships, community chats, live video, and selling to fans directly with one-time purchases....
-
Site reliability engineer
2 weeks ago
New York, United States writer.com Full time
About this role
We are looking for a foundational member of the Cloud infrastructure team at WRITER. This role will involve contributing to the development and implementation of our Site reliability engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of WRITER’s critical systems, taking a...
-
Site Reliability Engineer
1 week ago
New York, United States MIO Partners, Inc. Full time
MIO Partners, Inc. (MIO) provides proprietary investment products to McKinsey’s retirement plan and partners and offers independent, high-quality financial advice to McKinsey’s partners. We manage a wide array of investment vehicles with significant...