SRE Manager

2 months ago


Raleigh, United States Ally Full time

**General information**

**Ref #** 17885

**Remote?** No

**Ally and Your Career**


Ally Financial only succeeds when its people do - and that's more than some cliché people put on job postings. We live this stuff! We see our people as, well, people - with interests, families, friends, dreams, and causes that are all important to them. Our focus is on the health and safety of our teammates as well as work-life balance and diversity and inclusion. From generous benefits to a variety of employee resource groups, we strive to build paths that encourage employees to stretch themselves professionally. We want to help you grow, develop, and learn new things. You're constantly evolving, so shouldn't your opportunities be, too?

**The Opportunity**

Are you passionate about ensuring the reliability and scalability of complex systems? Do you thrive on implementing efficient solutions to prevent and resolve incidents? We are seeking a talented and motivated Site Reliability Engineer (SRE) to join our dynamic team.

At Ally, you get a startup feel, but experience the benefits of a company that's worked out the kinks and is fulfilling its purpose. We're always evolving and see that as a good thing. From owning our work to seeing its impact in the real world, our team is relentless in finding new ways technology can help make experiences better and help people. We are problem solvers, we value diverse thinking, we support one another, and we challenge ourselves to think bigger in the journey to deliver customer-obsessed tech solutions. To read more about what our tech team does, be sure to visit our tech blog at ally.tech.

**The Work Itself**

* Manage the SRE team, including Ally employees and contractors.

* Collaborate with cross-functional teams to design, build, and maintain robust, scalable, and fault-tolerant systems.

* Work closely with development teams and architects to advocate for reliability best practices during the application development lifecycle.

* Design and implement monitoring and alerting to provide real-time visibility into user experience and system health and performance.

* Monitor and analyze system performance, proactively identifying potential issues and implementing solutions to ensure optimal performance and reliability.

* Develop and maintain automated tools and processes to streamline operational tasks and reduce manual interventions.

* Participate in incident response and post-mortems, contributing to continuous improvement efforts.

* Conduct capacity planning and resource optimization to handle growing demands on our infrastructure.

* Continuously research and evaluate new technologies and practices to enhance the reliability and efficiency of our systems.


**The Skills You Bring**

* Bachelor's degree in Computer Science, Engineering, or related fields preferred (or equivalent practical experience)

* Strong verbal and written communication skills

* 2-4 years of overall experience managing an SRE or DevOps team with an observability workload

* 2-4 years of Agile management, owning SRE roadmaps and deliverables using Scrum/Kanban

* 2-4 years of delivering projects alongside a constant flow of side intake and production response workloads

* Experience presenting to leadership, collaborating effectively, and communicating technical concepts to non-technical business stakeholders

* 5+ years of proven experience as a Site Reliability Engineer or in a similar role in a production environment

* 5+ years of experience with AWS services (ASG, Fargate, Lambda, Aurora, DynamoDB, ALB/NLB)

* 5+ years of working experience with CI/CD pipelines (GitLab) and developing infrastructure-as-code (Terraform, Ansible, etc.)

* Working knowledge of observability platforms such as Splunk, Dynatrace, Datadog, Sumo Logic, or New Relic

* Working experience designing observability for enterprise applications

* Working knowledge of containers in ECS, EKS, or Kubernetes

* Experience with system administration and DevOps

* Development experience, along with experience on cloud and physical servers

* Understanding of and experience working with business, product, and engineering teams to develop SLIs, SLOs, and SLAs


Other Skills & Experience Desired

* Strong knowledge of Linux/Unix systems and network protocols

* Experience with distributed systems and microservices architecture

* Proficiency in programming or scripting languages such as Python, Java, or Bash

* Hands-on experience with monitoring and logging tools (Dynatrace, CloudWatch, Prometheus, Grafana, etc.)

* Familiarity with cybersecurity best practices and principles

* Certifications in AWS

* Ability to lead triage calls including working across multiple divisions to resolve issues.


**How We'll Have Your Back**


Ally's compensation program offers market-competitive base pay and pay-for-performance incentives (bonuses) based on achieving personal and company goals. But Ally's total compensation - or total rewards - extends beyond your paycheck and is designed to support and enrich your personal and professional life, including:

* Time Away: competitive holiday and flexible paid-time-off, including time off for volunteering and voting.

* Planning for the Future: plan for the near and long term with an industry-leading 401K retirement savings plan with matching and company contributions, student loan and 529 educational assistance programs, tuition reimbursement, and other financial well-being programs.

* Supporting your Health & Well-being: flexible health and insurance options including dental and vision, pre-tax Health Savings Account with employer contributions and a total well-being program that helps you and your family stay on track physically, socially, emotionally, and financially.

* Building a Family: adoption, surrogacy, and fertility support as well as parental and caregiver leave, back-up child and adult/elder day care program and childcare discounts.

* Work-Life Integration: other benefits including LifeMatters Employee Assistance Program, subsidized and discounted Weight Watchers program and other employee discount programs.

Who We Are:

Ally Financial is a customer-centric, leading digital financial services company with passionate customer service and innovative financial solutions. We are relentlessly focused on "Doing it Right" and being a trusted financial-services provider to our consumer, commercial, and corporate customers. For more information, visit www.ally.com.

Ally is an equal opportunity employer committed to diversity and inclusion in the workplace. All qualified applicants will receive consideration for employment without regard to age, race, color, sex, religion, national origin, disability, sexual orientation, gender identity or expression, pregnancy status, marital status, military or veteran status, genetic disposition or any other reason protected by law.

Where permitted by applicable law, must have received or be willing to receive the COVID-19 vaccine by date of hire to be considered, if not currently employed by Ally.

We are committed to working with and providing reasonable accommodation to applicants with physical or mental disabilities. For accommodation requests, email us at work@ally.com. Ally will not discriminate against any qualified individual who is capable of performing the essential functions of the job with or without reasonable accommodation.

**_Base Pay Range:_**

**Emerging:** 110,000

**Experienced:** 145,000

**Expert:** 180,000

Incentive Compensation: This position is eligible to participate in our annual incentive plan.


