Site Reliability Engineer

1 week ago


O'Fallon, United States RIT Solutions, Inc. Full time
Title: Senior BizOps Engineer (SRE - Site Reliability Engineer)
Location: O'Fallon, Missouri (Main Campus) --- LOCAL CANDIDATES ONLY 3 days a week onsite in office
Duration: 24 months
Glider: Software Engineer (Java, SQL, Cloud, Oracle, Microservices, Linux and Bash Scripting)

  • What are your top 3 required technical skills?
    1. Application Frameworks (e.g., Spring Boot) and Computing Architectures (e.g., Distributed, Mainframe, Cloud).
    2. Observability - logging, monitoring, alerting and dashboarding tools, standards and response, etc.
    3. Incident and Knowledge Management - Incident Communications, etc.
  • What are a couple of desired/nice to have technical skills?
    1. Operational Readiness - Chaos Engineering, Production Readiness, etc.
    2. DevOps - Continuous delivery/deployment; Configuration as Code, etc.
  • What soft skills would you like to see in a candidate?
    1. Critical Thinking and Problem Solving
    2. Curious, with a Bias Towards Action to Avoid or Mitigate Risks
  • Job Description Summary

    "The RiskPS BizOps team is looking for a Site Reliability Engineer who can help us solve problems, implement automation, and leverage best practices.
    • Are you a born problem solver who loves to figure out how something works?
    • Are you a detail-oriented individual who enjoys complex problem solving?
    • Do you love determining the correct actions required to fix a problem?
    • Do you have a low tolerance for manual work and look to automate everything you can?

    Business Operations is leading the Site Reliability Engineering (SRE) transformation at Client through our tooling and by being an advocate for change & standards throughout the development, quality, release, and product organizations. We need team members with an appetite for change and pushing the boundaries of what can be done with automation. Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must.

    Mission

    The role of business operations is to be the production readiness steward for the platform. This is accomplished by closely partnering with developers to design, build, implement, and support technology services. A business operations engineer will ensure operational criteria like system availability, capacity, performance, monitoring, self-healing, and deployment automation are implemented throughout the delivery process. Business Operations plays a key role in leading the Site Reliability Engineering (SRE) transformation at Client through our tooling and by being an advocate for change and standards throughout the development, quality, release, and product organizations.

    We accomplish this transformation through supporting daily operations with a hyper focus on triage and then root cause by understanding the business impact of our products. The goal of every biz ops team is to shift left to be more proactive and upfront in the development process, and to proactively manage production and change activities to maximize customer experience and increase the overall value of supported applications. Biz Ops teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments. A biz ops focus is also on streamlining and standardizing traditional application specific support activities and centralizing points of interaction for both internal and external partners by communicating effectively with all key stakeholders.

    Ultimately, the role of biz ops is to align Product and Customer Focused priorities with Operational needs. We regularly review our run state, not only from an internal perspective but also to provide a feedback loop to our development partners on how we can improve the customer experience of our applications.

    Responsibilities

    For all team members:
    • Engage in and improve the whole lifecycle of services, from inception and design through deployment, operation and refinement.
    • Analyze ITSM activities of the platform and provide a feedback loop to development teams on operational gaps or resiliency concerns.
    • Support services before they go live through activities such as system design consulting, capacity planning and launch reviews.
    • Maintain services once they are live by measuring and monitoring availability, latency and overall system health.
    • Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.
    • Practice sustainable incident response and blameless postmortems.
    • Take a holistic approach to problem solving by connecting the dots during a production event across the various technology stacks that make up the platform, to optimize mean time to recover.
    • Work with a global team spread across tech hubs in multiple geographies and time zones.
    • Share knowledge and mentor junior resources.

    Qualifications
    • BS degree in Computer Science or related technical field involving coding (e.g., physics or mathematics), or equivalent practical experience.
    • Bachelor's degree in Information Technology, Computer Science or equivalent work experience.
    • Experience in a financial environment preferred.
    • Analytical/problem solving and planning skills.
    • The ability to organize, multi-task and prioritize work based on current business needs.
    • Strong communication skills, both verbal and written.
    • Strong relationship skills, collaborative skills and customer service skills.
    • 1-3 years of experience in the following is required: UNIX, scripting, Oracle, and SQL.
    • We support many different stakeholders. Experience in dealing with difficult situations and making decisions with a sense of urgency is needed.
    • Experience in one or more of the following is preferred: C, C++, Java, Python, Go, Perl or Ruby.
    • Interest in designing, analyzing and troubleshooting large-scale distributed systems.
    • We need team members with an appetite for change and pushing the boundaries of what can be done with automation. Experience in working across development, operations, and product teams to prioritize needs and to build relationships is a must.
    • Ability to work with little or no supervision.
    • Minor travel may be required."


  • O'Fallon, United States EPITEC Full time

    REQUIRED Glider Test: Software Engineer (Java, SQL, Cloud, Oracle, Microservices, Linux and Bash Scripting). The RiskPS BizOps team is looking for a Site Reliability Engineer who can help us solve problems, implement automation, and leverage best practices. Typical workday is on call (i.e., monitoring alerts and production support), implementing Change...


  • O'Fallon, United States MasterCard Full time

    Overview: We're a global leader in the payments industry, and we're looking for a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability and performance of our cloud-based systems.


  • O'Fallon, United States Pinnacle Group Full time

    About the Role: Pinnacle Group is looking for a highly skilled Site Reliability Lead to join our Quality Assistance Command Center team. As a Reliability Engineer, you will be responsible for ensuring the stability and resilience of our systems and infrastructure. You will work closely with our development teams to identify areas for improvement and implement...


  • O'Fallon, MO , USA, United States MasterCard Full time

    Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our Enterprise Data Accessibility team at Mastercard. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, fault-tolerance, and scalability of our cloud-based systems and services. Key Responsibilities: Design, build, and operate large-scale,...


  • O'Fallon, United States Pinnacle Group Full time

    Overview: Pinnacle Group is seeking a highly skilled Reliability Engineer to join our Quality Assistance Command Center team. As a Site Reliability Engineering Lead, you will be responsible for contributing to the day-to-day planning, design, execution, and reporting of resiliency testing. You will bring industry experience and thought leadership skills to...


  • O'Fallon, United States RIT Solutions, Inc. Full time

    About the Role: At RIT Solutions, Inc., we're looking for a talented Site Reliability Expert to help us solve complex problems and improve our overall system reliability. In this role, you'll work closely with cross-functional teams to analyze ITSM activities, identify areas for improvement, and implement automation scripts to streamline processes. Key...


  • O'Fallon, United States MasterCard Full time

    Reliability and Efficiency Expert. Job Description: At Mastercard, we are seeking a highly experienced Sr. Reliability Engineer to join our Transaction Switching (Authorization) Business Operations team. In this role, you will be responsible for ensuring the reliability and efficiency of our services by designing and implementing service management...


  • O'Fallon, United States Pinnacle Group Full time

    Pinnacle Group is seeking a highly skilled Reliability Engineer to join our team. With a strong background in system architecture and cloud technologies, this individual will be responsible for designing and implementing resiliency strategies to ensure the stability and availability of our systems. Key Responsibilities: Develop and implement resiliency testing...


  • O'Fallon, United States MasterCard Full time

    Join Our Team: We are looking for a highly skilled Senior BizOps Engineer to join our team. As a member of the Business Operations group, you will contribute to designing and implementing innovative solutions to ensure the reliability of our platforms. About the Job: Develop and maintain cloud solutions on Azure, GCP, or AWS. Collaborate with development teams to...


  • O'Fallon, United States Pinnacle Group Full time

    Pinnacle Group Job Description. Overview: The Pinnacle Group is looking for a skilled Reliability Engineer to join our team. This role will focus on ensuring the reliability and stability of our cloud infrastructure, working closely with cross-functional teams to identify and resolve potential issues. Job Responsibilities: Design and implement resiliency testing...


  • O'Fallon, United States MasterCard Full time

    About the Role: The Business Operations team is seeking an experienced Senior Business Operations Site Reliability Engineer to lead the platform's reliability efforts. As a trusted advisor, you will collaborate with stakeholders to design, implement, and maintain scalable systems. Requirements: Bachelor's degree in Computer Science or related field. Coding...


  • O'Fallon, United States MasterCard Full time

    Our Purpose Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we’re helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation,...


  • O'Fallon, United States Pinnacle Group Full time

    About the Role: We are seeking a talented Site Resiliency Expert to join our team at Pinnacle Group. As a key member of our infrastructure team, you will be responsible for ensuring the reliability and stability of our systems. This is an excellent opportunity to work with a dynamic team who are passionate about delivering quality solutions. Key...


  • O'Fallon, United States MasterCard Full time

    Job Overview: We are seeking a highly skilled Service Management Reliability Engineer to join our team at Mastercard. As a key member of our infrastructure operations team, you will be responsible for ensuring the reliability and resilience of our systems and infrastructure. About You: To be successful in this role, you will need to have a strong background in...


  • O'Fallon, United States Pinnacle Group Full time

    Overview Avanti Command Center Quality Assistance BizOps team is looking for a Reliability Engineer who combines strategic thought leadership skills, a strong development & automation background and sound business judgment. As a Site Reliability Engineering Lead, you will actively contribute to the day-to-day planning, design, execution, and reporting of...

  • DevOps Engineer

    4 weeks ago


    O'Fallon, United States PRI Global Full time

    Looking for local consultants only, as per the client guidelines. The requirement is as follows. Job title: DevOps Engineer. Duration: 24 months. Location: O'Fallon, MO - Hybrid. The role of business operations is to be the production readiness steward for the platform. This is accomplished by closely partnering with developers to design, build, implement, and support...

  • DevOps Engineer

    21 hours ago


    O'Fallon, United States PRI Global Full time

    Looking for local consultants only, as per the client guidelines. The requirement is as follows. Job title: DevOps Engineer. Duration: 24 months. Location: O'Fallon, MO - Hybrid. The role of business operations is to be the production readiness steward for the platform. This is accomplished by closely partnering with developers to design, build, implement, and support...



  • O'Fallon, United States MasterCard Full time

    Job Title: Bizops Engineer Leader. As a highly skilled Bizops Engineer Leader, you will play a crucial role in ensuring the stability and health of our platform. You will foster developer-run ownership and empower developers to build resilient products. We are seeking an experienced professional with a strong technical background, excellent problem-solving...

  • Lead, Bizops Engineer

    4 weeks ago


    O'Fallon, United States MasterCard Full time

    Our Purpose Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we’re helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation,...