Site Reliability Engineer SRE
1 week ago
We are seeking a Site Reliability Engineer to support the Eagle Access platform and Vault in Azure within a highly available, client?facing environment. This role will be responsible for first-level monitoring, incident response, and service recovery, ensuring the performance and stability of critical applications. The ideal candidate brings strong communication skills, a collaborative mindset, and technical experience across Linux, SQL, Python, and monitoring tools.
This position is based out of Boston, MA, working Tuesday through Saturday from 1:00?PM 9:00?PM ET, with a shift adjustment to 12:00?PM 8:00?PM ET during daylight savings. The first several months will focus on knowledge transfer and shadowing, with the expectation of handling Saturday shifts independently once fully ramped.
Key Responsibilities- Incident Monitoring & Response: Act as the first line of defense for service disruptions, database efficiency issues, and performance incidents; analyze problems and provide recommendations or escalates when necessary.
- Ticket Management: Monitor and resolve ServiceNow tickets related to system alerts, incidents, and user issues.
- Knowledge Transfer: Participate in structured onboarding by shadowing senior team members; gradually transition into independent ownership of shift coverage.
- Escalation Management: Escalate complex issues to senior staff when appropriate, collaborating with leads including Pierre and other escalation points.
- Linux Navigation: Perform basic Linux functions such as log review, file editing, and process management; contribute to efficiency improvements.
- Database Support: Utilize SQL to assist with troubleshooting, analysis, and efficiency-related incidents.
- Scripting & Automation: Support senior team members in developing scripts; contribute to improving automation and operational processes over time.
- Cross-Team Communication: Collaborate with peers and internal stakeholders, including non-technical users, explaining issues in clear, laymans terms when required.
- Monitoring & Tools: Leverage monitoring platforms (e.g., Nagios, Splunk, Azure Monitor, Elastic) to track system health, detect issues, and proactively prevent downtime.
- Team Collaboration: Work effectively in a flexible, evolving environment while fostering strong team relationships and knowledge sharing.
- Minimum of 3+ years of experience in a technology-related position with demonstrated success.
- Hands?on experience with Linux command line navigation and troubleshooting.
- Experience with relational databases and SQL query languages (SQL required).
- Exposure to ServiceNow or similar ticketing systems.
- Strong interpersonal and communication skills; ability to collaborate within a team and communicate effectively with both technical and non-technical audiences.
- Willingness to provide L1/L2 support, including weekend coverage and extended resolution timelines for complex cases.
- Strong attention to detail with the ability to follow structured processes and procedures.
- Bachelors degree in a technology-related discipline preferred.
- Python scripting experience.
- Familiarity with Azure cloud environments.
- Experience with enterprise monitoring tools such as SiteScope, Nagios, Splunk, Elastic, Oracle OEM, or Azure Monitor.
- Prior exposure to performance troubleshooting in large-scale systems.
- Experience supporting client-facing environments where clear communication is required.
This is a unique opportunity to be part of a fast?paced, collaborative operations team supporting award?winning, industry?leading data and analytics solutions. You will play a key role in ensuring system reliability and availability while growing your technical skills across cloud platforms, databases, monitoring, and automation in a dynamic, evolving environment.
Awards- Americas Most Innovative Companies, Fortune, 2024
- Worlds Most Admired Companies, Fortune 2024
- Human Rights Campaign Foundation, Corporate Equality Index, 100% score, 2023?2024
- Best Places to Work for Disability Inclusion, Disability: IN 100% score, 2023?2024
- Most Just Companies, Just Capital and CNBC, 2024
- Dow Jones Sustainability Indices, Top performing company for Sustainability, 2024
- Bloombergs Gender Equality Index (GEI), 2023
Pay Rate Range
Min Pay Rate: 41.45 Max Pay Rate: 51.82 USD hourly
Additional NotesApplications will be accepted on an ongoing basis.
This posting is for a contract assignment with Tundra Technical Solutions to provide services to Bank of New York (BNY). This is not a full?time employment opportunity. Candidates selected for this role will be engaged as contractors for the specified duration of the project. For any inquiries regarding the terms of the contract or engagement, please contact Tundra Technical Solutions directly.
Benefits InformationOptional benefits offering include medical, dental, vision and retirement benefits via Tundra Technical Solutions.
#J-18808-Ljbffr-
Principal Site Reliability Engineer
1 week ago
Boston, MA, United States General Motors Full timeJob Description Remote : Reporting where work can/needs to be performed / collaboration should happen. If the person lives w/n 50 miles of such a location, they are expected to come in three times a week. If they do not live withing 50 miles of any of those locations, they don't need to report in. The rapid adoption of advanced software in vehicles marks a...
-
Principal Site Reliability Engineer
1 week ago
Boston, MA, United States General Motors Full timeJob Description Remote : Reporting where work can/needs to be performed / collaboration should happen. If the person lives w/n 50 miles of such a location, they are expected to come in three times a week. If they do not live withing 50 miles of any of those locations, they don't need to report in. The rapid adoption of advanced software in vehicles marks a...
-
SRE Manager
1 week ago
Boston, MA, United States Insight Global Full timeAre you a highly motivated Site Reliability Engineer Manager looking to enter a rapidly growing environment? We are looking for senior software engineers to join our Site Reliability Engineering team to provide tooling & guidance to our customer's product engineers to ensure productivity & success. The SRE team is responsible for servicing our customer's end...
-
SRE Manager
1 week ago
Boston, MA, United States Insight Global Full timeAre you a highly motivated Site Reliability Engineer Manager looking to enter a rapidly growing environment? We are looking for senior software engineers to join our Site Reliability Engineering team to provide tooling & guidance to our customer's product engineers to ensure productivity & success. The SRE team is responsible for servicing our customer's end...
-
Lead Site Reliability Engineer
2 weeks ago
Boston, MA, United States Oracle Full timeJob Description Oracle Health (OHAI) is a leader in generative AI for healthcare, focusing on cutting-edge cloud services that streamline healthcare operations. Our EHR and Clinical AI Agent platforms help healthcare providers reduce manual tasks and improve patient care. We are expanding our OCI Cloud Operations team and are seeking a Principal Site...
-
Lead Director Engineering, SRE
1 day ago
Boston, MA, United States CVS Health Full timeAt CVS Health, we're building a world of health around every consumer and surrounding ourselves with dedicated colleagues who are passionate about transforming health care. As the nation's leading health solutions company, we reach millions of Americans through our local presence, digital channels and more than 300,000 purpose-driven colleagues - caring for...
-
Lead Director Engineering, SRE
4 days ago
Boston, MA, United States CVS Health Full timeAt CVS Health, we're building a world of health around every consumer and surrounding ourselves with dedicated colleagues who are passionate about transforming health care. As the nation's leading health solutions company, we reach millions of Americans through our local presence, digital channels and more than 300,000 purpose-driven colleagues - caring for...
-
Senior Software Engineer
2 weeks ago
Boston, MA, United States Veeva Systems Full timeVeeva Systems is a mission-driven organization and pioneer in industry cloud, helping life sciences companies bring therapies to patients faster. As one of the fastest-growing SaaS companies in history, we surpassed $2B in revenue in our last fiscal year with extensive growth potential ahead. At the heart of Veeva are our values: Do the Right Thing, Customer...
-
Site Reliability Engineer DevOps | REMOTE
1 week ago
Boston, MA, United States Oracle Full timeJob Description Are you a creative person who loves a challenge? Solve the complex puzzles you've been dreaming of as our Engineer. If you have a passion for innovation in tech, we want you on our team! Thrive in this crucial automation role. Oracle is a technology leader that's changing how the world does business. We're looking for an experienced and...
-
Senior Site Reliability Engineer
1 day ago
Boston, MA, United States Oracle Full timeJob Description Are you a creative person who loves a challenge? Solve the complex puzzles you've been dreaming of as our Engineer. If you have a passion for innovation in tech, we want you on our team! Thrive in this crucial automation role. Oracle is a technology leader that's changing how the world does business. We're looking for an experienced and...