Reliability Solutions Engineer

2 weeks ago


Raleigh, North Carolina, United States Ally Full time
General Overview

Ref # 17854

Remote? No

About Ally and Your Career

At Ally Financial, we believe that our success is intrinsically linked to the well-being of our employees. We prioritize our team's health, safety, and overall work-life balance, fostering an environment that embraces diversity and inclusion.

We offer a variety of employee resource groups and generous benefits, encouraging our team members to pursue professional growth and development. As you evolve, so should your career opportunities.


The Opportunity

Are you driven by the challenge of maintaining the reliability and scalability of intricate systems? Do you excel at devising effective solutions to mitigate and resolve incidents? We are on the lookout for a skilled and enthusiastic Reliability Solutions Engineer to join our innovative team.

At Ally, you will experience the dynamic atmosphere of a startup while enjoying the stability of a well-established company. We are committed to continuous improvement and innovation.

Our team is dedicated to problem-solving, valuing diverse perspectives, and supporting one another as we strive to deliver technology solutions that prioritize customer satisfaction.


Key Responsibilities
  • Collaborate with cross-functional teams to design, develop, and sustain robust, scalable, and fault-tolerant systems.
  • Engage with development teams and architects to promote reliability best practices throughout the application development lifecycle.
  • Design and implement monitoring and alerting systems to ensure real-time visibility into user experience and system performance.
  • Monitor and assess system performance, proactively identifying potential issues and implementing solutions for optimal reliability.
  • Develop and maintain automated tools and processes to enhance operational efficiency and minimize manual tasks.
  • Participate in incident response and post-incident reviews, contributing to ongoing improvement initiatives.
  • Conduct capacity planning and resource optimization to accommodate increasing demands on our infrastructure.
  • Continuously research and assess new technologies and methodologies to improve system reliability and efficiency.

Qualifications
  • Bachelor's degree in Computer Science, Engineering, or a related field preferred (or equivalent practical experience).
  • Strong verbal and written communication skills.
  • Ability to work collaboratively in a team environment and convey technical concepts to non-technical stakeholders.
  • Proven experience (1+ years) as a Reliability Solutions Engineer or in a similar role within a production environment.
  • Experience with AWS services (ASG, Fargate, Lambda, Aurora DB, Dynamo DB, ALB/NLB) for at least 1 year.
  • Familiarity with CI/CD pipelines (Gitlab) and infrastructure-as-code tools (Terraform, Ansible, etc.) for at least 1 year.
  • Working knowledge of observability platforms such as Splunk, Dynatrace, Datadog, Sumo Logic, or New Relic.
  • Experience with containerization technologies (ECS, EKS, or Kubernetes).
  • Strong understanding of system administration and DevOps practices.
  • Development experience with cloud and physical servers.

Additional Skills & Experience
  • Solid knowledge of Linux/Unix systems and networking protocols.
  • Experience with distributed systems and microservices architecture.
  • Proficiency in programming or scripting languages such as Python, Java, or Bash.
  • Hands-on experience with monitoring and logging tools (DynaTrace, CloudWatch, Prometheus, Grafana, etc.).
  • Familiarity with cybersecurity best practices.
  • AWS certifications are a plus.
  • Ability to lead triage calls and collaborate across divisions to resolve issues.

Compensation and Benefits

Ally offers a competitive compensation package that includes base pay and performance-based incentives. Our total rewards extend beyond salary, encompassing:

  • Time Off: Flexible paid time off, including holidays and volunteer days.
  • Retirement Planning: An industry-leading 401K plan with matching contributions and educational assistance programs.
  • Health & Well-being: Comprehensive health insurance options, including dental and vision, along with wellness programs.
  • Family Support: Adoption, surrogacy, and fertility assistance, as well as parental leave.
  • Work-Life Integration: Employee assistance programs and wellness initiatives.

About Ally

Ally Financial is a leading digital financial services provider, dedicated to delivering exceptional customer service and innovative financial solutions. We are committed to "Doing it Right" and serving our customers with integrity.

For more information about our company culture and values, please visit our website.


Ally is an equal opportunity employer, committed to fostering a diverse and inclusive workplace. All qualified applicants will receive consideration for employment without regard to any protected status.



  • Raleigh, North Carolina, United States Veradigm® Full time

    Welcome to Veradigm. Our mission is to be the most trusted provider of innovative solutions that empower all stakeholders across the healthcare continuum to deliver world-class outcomes. Our vision is a connected community of health that spans continents and borders. With the largest community of clients in healthcare, Veradigm is able to deliver an...


  • Raleigh, North Carolina, United States Veradigm® Full time

    Welcome to Veradigm. Our mission is to be the most trusted provider of innovative solutions that empower all stakeholders across the healthcare continuum to deliver world-class outcomes. Our vision is a connected community of health that spans continents and borders. With the largest community of clients in healthcare, Veradigm is able to deliver an...


  • Raleigh, North Carolina, United States HighCloud Solutions Full time

    Job Title: Expert Voice EngineerWe are seeking an experienced Voice Engineer to join our team at HighCloud Solutions. As a Voice Engineer, you will be responsible for designing, implementing, and managing Cisco Unified Communications systems across our organization.Key Responsibilities:System Design & Implementation:Design, deploy, and manage Cisco Unified...


  • Raleigh, North Carolina, United States Cisco Full time

    About the RoleCisco is seeking a highly skilled Network Reliability Engineer to join our Network Engineering & Operations team. As a key member of our team, you will be responsible for designing, building, deploying, and operating our internal network interconnects supporting client networks inside Cisco.Key ResponsibilitiesDesign and implement network...


  • Raleigh, North Carolina, United States Biogen Idec Full time

    Job OverviewPosition SummaryThe Senior Reliability Engineer plays a crucial role in applying Reliability Engineering principles to enhance the design specifications and operational efficiency of essential assets throughout the organization. This position involves the development of analytical techniques to assess the reliability of components, machinery, and...


  • Raleigh, North Carolina, United States Associates Systems LLC Full time

    Essential Qualifications for Site Reliability Engineer:As part of your responsibilities and interactions with defense programs, you must be a US citizen capable of obtaining and maintaining a DoD Secret Security Clearance.A Bachelor’s degree in Computer Science, Engineering, Applied Mathematics, or a similar technical discipline is required, along with 7-9...


  • Raleigh, North Carolina, United States Biogen Idec Full time

    Job OverviewAbout the PositionThe Senior Reliability Engineer is responsible for implementing Reliability Engineering principles to enhance design specifications and operational efficiency of essential assets throughout the organization. This role involves developing analytical techniques to assess the reliability of components, machinery, and processes. The...


  • Raleigh, North Carolina, United States Ally Full time

    About the RoleWe are seeking a highly skilled Senior Cloud Reliability Engineer to join our dynamic team at Ally. As a key member of our infrastructure team, you will be responsible for ensuring the reliability and scalability of our complex systems.Key ResponsibilitiesCollaborate with cross-functional teams to design, build, and maintain robust, scalable,...


  • Raleigh, North Carolina, United States Ally Full time

    General InformationReference Number: 17885Remote Work: NoAbout Ally and Your CareerAt Ally Financial, our success is intrinsically linked to the success of our employees. We prioritize the well-being of our team members, recognizing their diverse interests, families, and aspirations. Our commitment to work-life balance, health, and inclusivity is reflected...


  • Raleigh, North Carolina, United States Citrix Systems Inc Full time

    Location: Fully on-site in Raleigh, NC.About Our TeamAre you passionate about working in a dynamic and agile environment? If you thrive in a setting that encourages innovation and collaboration, we want to hear from you. Our team is embarking on an exciting journey as we transition back to our roots, focusing on our SaaS offerings and positioning ourselves...


  • Raleigh, North Carolina, United States Enviva Full time

    About EnvivaThe Enviva team is committed to a shared vision for a sustainable energy future. As a leading global energy company, we specialize in providing eco-friendly wood bioenergy solutions. We are recognized as the largest producer of sustainable wood pellets, offering a low-carbon alternative to traditional fossil fuels.Position OverviewReporting to...


  • Raleigh, North Carolina, United States Enviva Full time

    About EnvivaThe Enviva team is united by a common goal: to foster a renewable energy future. As a rapidly expanding, mission-driven global energy firm, we focus on providing sustainable wood bioenergy solutions. We are recognized as the largest producer of sustainable wood pellets, offering a low-carbon alternative to traditional fossil fuels.Position...


  • Raleigh, North Carolina, United States Celonis Full time

    About Celonis: Celonis stands as the global frontrunner in Process Mining technology and is recognized as one of the fastest-growing SaaS companies worldwide. We are dedicated to harnessing the potential of data and intelligence to enhance productivity within business operations, and we invite you to be a part of this journey. Role Overview: Join a...


  • Raleigh, North Carolina, United States Optima Engineering Full time

    About the RoleAt Optima Engineering, we are seeking a skilled and dedicated professional to join our team as a Plumbing and Fire Protection Engineer. As a key member of our team, you will be responsible for delivering high-quality engineering solutions to our clients.Key ResponsibilitiesDesign and Development: Support the proposal process by providing...


  • Raleigh, North Carolina, United States Booz Allen Hamilton Full time

    Position Overview:In the evolving landscape of cloud technology, the role of a site reliability engineer is pivotal. Your expertise in creating robust platforms that cater to client requirements while leveraging the advantages of containerization, both in cloud environments and on-premises, is essential. Imagine utilizing your engineering acumen to enhance...


  • Raleigh, North Carolina, United States Focused HR Solutions Full time

    Our client is seeking a seasoned AWS Solution Architect / Engineer to join their growing team. This role demands a technical leader who can guide the implementation of cloud solutions across various domains, ensuring alignment with business objectives and architectural best practices.Responsibilities:Provide technical leadership in areas such as Business...


  • Raleigh, North Carolina, United States Zolon Tech Solutions, Inc. Full time

    Title : Infrastructure Solutions EngineerLocation : Remote/HybridDuration : 6 MonthsJob Overview:This role involves the development of innovative solutions utilizing scripting languages such as PowerShell and VBScript. The candidate will be responsible for identifying and addressing challenges related to Microsoft Workplace Modernization initiatives,...


  • Raleigh, North Carolina, United States Ally Full time

    About the RoleWe are seeking a highly skilled Senior Cloud Reliability Engineer to join our dynamic team at Ally. As a key member of our cloud infrastructure team, you will be responsible for ensuring the reliability and scalability of our complex systems.Key ResponsibilitiesCollaborate with cross-functional teams to design, build, and maintain robust,...


  • Raleigh, North Carolina, United States Booz Allen Hamilton Full time

    Position Overview:In today's digital landscape, the ability to effectively utilize cloud technology is paramount. As a Cloud Infrastructure Reliability Engineer, you possess the expertise to create robust platforms that cater to client requirements while leveraging the advantages of containerization, both in cloud environments and on-premises. Your...


  • Raleigh, North Carolina, United States EMCOR Government Services Full time

    EMCOR Government Services is a leading provider of mechanical engineering solutions, focusing on the design and manufacturing of controlled environmental chambers tailored for various industries. With a legacy of over a century of expertise, our engineering team is dedicated to delivering the most reliable and innovative solutions in the market.As a part of...