Reliability Engineering Expert
2 days ago
We are seeking a highly skilled Site Reliability Manager to join our team at Macmillan Learning.
About the Role:- This role involves managing incidents, optimizing system performance, and ensuring operational excellence through automation and monitoring strategies.
- Lead incident management processes, ensuring swift resolution and communication during outages. Conduct root cause analyses and implement preventive measures.
- Design and maintain robust monitoring systems for internal and third-party applications, establishing SLIs, SLOs, and SLAs.
- Automate operational tasks and develop self-healing systems to reduce manual intervention.
- Collaborate with cross-functional teams and vendors to maintain system performance and address potential reliability issues proactively.
- Provide leadership in system performance reporting, ensuring proactive communication with stakeholders on system health, ongoing initiatives, incident updates, and post-resolution analysis.
- Expertise with monitoring tools (e.g., Splunk, Azure Monitor) and cloud platforms (e.g., Azure, AWS).
- Familiarity with ITIL frameworks and advanced automation practices.
- Strong scripting skills (e.g., Python, Bash) and familiarity with Infrastructure as Code tools.
- Excellent problem-solving and communication skills.
- Proven experience (5+ years) in Site Reliability Engineering, DevOps, or related fields.
- Experience with Service Now and Pager Duty (or similar).
- Experience managing SaaS platforms like Google Workspace.
The estimated salary for this position is $120,000 - $130,000 per year.
-
Reliability Engineering Expert
2 days ago
New York, New York, United States City of New York Full timeAbout the RoleThe City of New York is seeking a highly experienced Reliability Engineering Expert to join its Bureau of Wastewater Treatment. As a key member of the team, this individual will be responsible for implementing reliability-centered maintenance (RCM) practices across 14 wastewater treatment plants and two collection facilities.The Reliability...
-
Reliability Expert for Scalable Systems
2 weeks ago
New York, New York, United States Peloton Cycle Full timeAt Peloton, we view our platform as a product that unlocks the speed of development and learning. Our extraordinary platform allows us to scale easily, enabling our engineers to focus on new features and capabilities. A key to crafting an excellent platform is data-driven insights and understanding where we should focus our attention to create the best...
-
Site Reliability Engineering Expert
2 weeks ago
New York, New York, United States Fidelity Information Services Full timeCompany OverviewFidelity Information Services is a leading provider of financial services and technology solutions. Our mission is to empower our clients with innovative and reliable systems.SalaryThe estimated annual salary for this position is $31,200.Job DescriptionWe are seeking an experienced Site Reliability Engineer to join our team. As a key member...
-
Reliability Engineering Team Lead
3 weeks ago
New York, New York, United States Capital One Full timeOverviewAchieve high system reliability as a Lead Reliability Engineer at Capital One, joining our team of innovators to drive business success with technical excellence. Collaborate with cross-functional teams to design and implement robust solutions that ensure seamless services.About the RoleWe are seeking an experienced Reliability Engineer who can guide...
-
Expert Endodontist Opportunity in Manhattan, NY
3 weeks ago
New York, New York, United States Expert Dental Full timeAbout Expert Dental">We are a dentist-owned, multi-specialty dental group with offices in Midtown Manhattan and Tribeca. Our modern, state-of-the-art offices feature the latest dental technologies.Our team of board-certified specialists includes Oral Surgery, Endodontics, Periodontics, Orthodontics, and Pediatric Dentistry providers who work together to...
-
Distributed Systems Reliability Expert
2 weeks ago
New York, New York, United States Cockroach Labs Full timeCockroach Labs is the creator of CockroachDB, a cloud-native, distributed SQL database that scales fast, survives anything, and thrives anywhere. Our mission is to simplify how businesses build and operate world-changing applications.About the RoleYou will oversee our production system, ensuring stable and scalable infrastructure as we deliver CockroachDB to...
-
Foundation Expert
2 days ago
New York, New York, United States JPCL Engineering Full timeJob DescriptionWe are seeking a highly skilled Professional Engineer to perform special inspections specifically focused on pile installations. The successful candidate will play a crucial role in ensuring the integrity and safety of foundation systems by conducting thorough inspections and providing expert evaluations.About the RoleThis is a key position...
-
Expert Enterprise Network Engineer
2 weeks ago
New York, New York, United States CyberTec Full timeJob Overview:A challenging role has emerged at CyberTec for a highly skilled Senior Network Architect to lead the design, implementation, and management of cutting-edge enterprise networks. The ideal candidate will have extensive experience with Cisco ASR, Cisco Nexus, Cisco ACI, and VMware NSX, as well as advanced load balancing technologies (F5/NSX ALB)...
-
New York, New York, United States Formation Bio Full timeAbout Formation BioFormation Bio is a cutting-edge biotech company that utilizes AI and technology to revolutionize the drug development process. Founded in 2016, the company has established itself as a leader in the industry by developing innovative solutions to accelerate drug development and clinical trials.The company partners with pharmaceutical...
-
Pile Installation Expert
3 weeks ago
New York, New York, United States JPCL Engineering Full timeWe are seeking a highly skilled and experienced Pile Installation Expert to join our team at JPCL Engineering. The successful candidate will play a crucial role in ensuring the structural integrity of foundation systems by conducting thorough inspections and providing expert evaluations.About the RoleThis position is focused on performing special inspections...
-
Reliability Engineering Specialist
2 weeks ago
New York, New York, United States Palantir Technologies Full timeA World-Changing Technology CompanyAbout UsAt Palantir Technologies, we develop and deploy software solutions that empower our partners to make data-driven decisions and drive meaningful outcomes.The RoleWe are seeking a skilled Reliability Engineer to join our team. As a Product Reliability Engineer, you will play a critical role in ensuring the stability...
-
Senior Foundation Engineer
3 weeks ago
New York, New York, United States JPCL Engineering Full timeFoundation Specialist Position at JPCL EngineeringWe are seeking an experienced Senior Foundation Engineer to join our team in New York. The successful candidate will be responsible for conducting thorough inspections and providing expert evaluations on pile installations, ensuring the integrity and safety of foundation systems.About the JobThis is a...
-
Reliability Engineering Leadership Position
3 weeks ago
New York, New York, United States Capital One Full timeCapital One Reliability Engineer RoleWe are seeking a skilled Lead Reliability Engineer to join our team at Capital One. As a key member of our engineering group, you will play a critical role in designing and implementing reliable systems that meet the needs of our customers.About the JobCollaborate with Agile teams to design, develop, test, implement, and...
-
Senior Full Stack Software Engineer
3 weeks ago
New York, New York, United States Ness Digital Engineering Full timeNess Digital Engineering is a leading digital engineering firm offering comprehensive digital advisory services through scaled engineering capabilities. As your trusted tech partner, we help businesses thrive in the digital economy by combining cutting-edge strategy and technology with our core engineering expertise.We are seeking an experienced Senior Full...
-
Reliability Solutions Architect
3 weeks ago
New York, New York, United States Capital One Full timeJoin Our Mission to Revolutionize TechnologyWe are seeking a highly skilled Reliability Solutions Architect to join our team at Capital One. As a key member of our technology department, you will play a vital role in designing and implementing technical solutions that improve system reliability and efficiency.About the Role:Collaborate with Agile teams to...
-
Reliability Engineering Leadership Position
2 weeks ago
New York, New York, United States Capital One Full timeOverviewCapital One is a leading financial institution seeking a seasoned reliability engineer to drive process improvements and influence the strategic direction of our technology teams.Salary Range:The estimated annual salary for this role in New York City (Hybrid On-Site) is $201,400 - $229,900. Candidates hired to work in other locations will be subject...
-
New York, New York, United States Apollo Solutions Full timeApollo Solutions is seeking a highly skilled Reliable Infrastructure Engineer to join their team. This role involves working closely with other engineers to ensure fast, secure, and reliable features are delivered, as well as building the company's infrastructure to support massive scalability.Responsibilities:Lead technical discussions on cloud...
-
Bridge Engineer Expert
3 weeks ago
New York, New York, United States City of New York Full timeJob Title: Bridge Engineer ExpertJob Summary:We are seeking an experienced Bridge Engineer Expert to join our team at the City of New York. As a key member of our Department of Transportation, you will be responsible for managing bridge projects and ensuring they are completed on time and within budget.Key Responsibilities:* Manage bridge projects from...
-
Site Reliability Engineer
23 hours ago
New York, New York, United States Motion Recruitment Full timeSite Reliability Engineer OpportunityWe are seeking a highly skilled Site Reliability Engineer to join our client's innovative team in New York City. In this challenging role, you will optimize and automate infrastructure using your expertise in cloud infrastructure, CI/CD pipelines, and automation.Job Responsibilities- Oversee the full GCP infrastructure...
-
Site Reliability Engineering Lead
2 days ago
New York, New York, United States Insight Global Full timeCompany Overview">Insight Global is committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We believe everyone matters and strive to create equal opportunities for all.About the Job">We are looking for a talented Site Reliability Engineer to join our team. The successful candidate will...