Current jobs related to Site Reliability Engineer - Huntsville, Alabama - McGraw-Hill Education

  • Reliability Engineer

    3 weeks ago


    Huntsville, Alabama, United States TriMas Corporation Full time

    Maintenance Planner/Reliability EngineerThis role is responsible for supporting the maintenance day shift team in planning, failure analysis, and reliability analysis. The ideal candidate will have a strong background in mechanical engineering and experience with maintenance management systems.Key Responsibilities:Plan and coordinate maintenance...

  • Reliability Engineer

    3 weeks ago


    Huntsville, Alabama, United States TriMas Corporation Full time

    Maintenance Planner/Reliability Engineer Job SummaryThe primary purpose of this position is to support the current maintenance day shift team members in the areas of planning, failure analysis report, reliability analysis, and follow up on corrective actions. The secondary purpose of this position is to research advance technologies and project...

  • Reliability Engineer

    3 weeks ago


    Huntsville, Alabama, United States Blue Origin Full time

    Reliability Engineer - Engines & AvionicsAt Blue Origin, we're pushing the boundaries of space exploration and development of reusable, safe, and low-cost space vehicles and systems. We're seeking a highly skilled Reliability Engineer to join our team in the Engines business unit, where our focus is on the design, development, manufacturing, and testing of...


  • Huntsville, Alabama, United States iQuasar Full time

    Job Title: Reliability Systems EngineerJob Summary: iQuasar LLC is seeking a skilled Reliability Systems Engineer to support the Marshall Space Flight Center Safety Mission and Assurance Services contract. The ideal candidate will have experience with quantitative fault tree analysis and risk assessment.Responsibilities:Prepare and evaluate integrating logic...

  • RAM Engineer

    4 weeks ago


    Huntsville, Alabama, United States Waltonen Full time

    Job Title: RAM EngineerWaltonen Engineering is seeking a highly skilled RAM Engineer to join our team. As a RAM Engineer, you will be responsible for planning, executing, and maintaining Reliability, Availability, and Maintainability (RAM) testing and evaluation strategies.Key Responsibilities:Develop and implement RAM planning and testing strategies to...


  • Huntsville, Alabama, United States United States Army Futures Command Full time

    Job Summary:This position is located in the Systems Readiness Directorate/ RAM Division of the United States Army Futures Command. The ideal candidate will serve as a Reliability, Availability, and Maintainability (RAM) Engineering and System Assessment Division Stockpile Reliability Program (SRP) Subject Matter Expert (SME).Key Responsibilities: Provide SRP...


  • Huntsville, Alabama, United States TriMas Corporation Full time

    Job Summary:The primary purpose of this position is to support the current maintenance day shift team members in the areas of planning, failure analysis report, reliability analysis, and follow up on corrective actions. The secondary purpose of this position is to research advance technologies and project management.Key Responsibilities:• Plan maintenance...


  • Huntsville, Alabama, United States Leidos Full time

    Job Title: Principal Manufacturing EngineerLeidos is seeking a highly skilled Principal Manufacturing Engineer to lead the development and execution of manufacturing plans for our growing defense programs. As a key member of our team, you will be responsible for creating unique manufacturing solutions that support the warfighter.Key Responsibilities:Lead the...


  • Huntsville, Alabama, United States Radiance Technologies Full time

    Radiance Technologies Job DescriptionWe are seeking a highly skilled Senior Reliability and Maintainability Engineer to join our team at Radiance Technologies. As a key member of our engineering team, you will be responsible for designing, establishing, and implementing a robust Reliability and Maintainability (RandM) program to ensure the integrated space...


  • Huntsville, Alabama, United States CFD Research Corp. Full time

    We are seeking a highly skilled and experienced individual to join our team as an Electronics Design and Reliability Specialist. The ideal candidate will have expertise in the design and characterization of electronics operating in diverse environments.The successful candidate will be responsible for developing and applying analytical and behavioral modeling...

  • Data Engineer

    1 month ago


    Huntsville, Alabama, United States SOS International LLC Full time

    About the RoleSOS International LLC is seeking a highly skilled Data Engineer to join our analytics team working on an innovative MLOps workload leveraging cutting-edge technologies and supporting a government customer in Huntsville, Alabama.This role will be responsible for delivering automation to key national security missions interacting with...


  • Huntsville, Alabama, United States CFD Research Corporation Full time

    Job OverviewWe are seeking a highly skilled Electronics Design and Reliability Specialist to join our team at CFD Research Corporation. The ideal candidate will have expertise in the design and characterization of electronics operating in diverse environments.Key Responsibilities:Develop and apply analytical and behavioral modeling techniques to analyze...


  • Huntsville, Alabama, United States CDG, Inc. Full time

    Job OverviewCDG, Inc. is seeking a skilled Civil Site Development Project Manager to lead our team in Huntsville, AL.The ideal candidate will have experience in civil site planning and design, utility design and permitting, storm water management design and permitting, and erosion control design and permitting.Key Responsibilities:Effectively utilize...

  • Senior Data Engineer

    3 weeks ago


    Huntsville, Alabama, United States SOSi Full time

    Job SummarySOSi is seeking a highly skilled Senior Data Engineer to join our team in Huntsville, Alabama. As a key member of our MLOps Team, you will be responsible for designing, building, and maintaining ETL processes, configuring storage systems for efficiency and effectiveness, and architecting and developing services to enable MLOps.Key...


  • Huntsville, Alabama, United States CFD Research Corp. Full time

    We are seeking a highly skilled Electronics Design and Reliability Specialist to join our team at CFD Research Corp. The ideal candidate will have expertise in the design and characterization of electronics operating in diverse environments.Key Responsibilities:Develop and apply analytical and behavioral modeling techniques to analyze circuit/system level...

  • Data Engineer

    4 weeks ago


    Huntsville, Alabama, United States SOSi Full time

    About the RoleSOSi is seeking a highly skilled Data Engineer to join our analytics team working on an innovative MLOps workload leveraging cutting-edge technologies and supporting a government customer in Huntsville, Alabama.Key ResponsibilitiesDesign and develop data pipelines, ETL processes, and storage systems to enable machine learning...

  • Systems Engineer III

    4 weeks ago


    Huntsville, Alabama, United States Blue Origin Full time

    Job SummaryWe are seeking a highly skilled Systems Engineer to join our team at Blue Origin. As a Systems Engineer, you will be responsible for the design, development, and integration of complex systems, including liquid rocket engines and propulsion systems.Key Responsibilities:Participate in the development of system requirements and interfacesImplement...


  • Huntsville, Alabama, United States Leidos Full time

    Job Summary:The Leidos Defense Systems Sector is seeking a talented Junior Systems Engineer to join a diverse team of systems engineers within the Enduring Shield Program. This position requires a strong background in systems engineering, reliability, and electrical engineering.Key Responsibilities:Perform RAM analyses such as reliability prediction, Mean...


  • Huntsville, Alabama, United States Howard Technology Solutions Full time

    About the RoleThis is a unique opportunity to join Howard Technology Solutions as an AV Technical Site Manager. As a key member of our team, you will be responsible for overseeing all on-site audio/visual activities during installation, integration, and maintenance projects.Key ResponsibilitiesLead and manage on-site AV installation and integration projects...

  • Systems Engineer III

    3 weeks ago


    Huntsville, Alabama, United States Blue Origin Full time

    At Blue Origin, we're pushing the boundaries of space exploration and development. As a Systems Engineer III, you'll play a critical role in designing and developing the next generation of reusable rockets and spacecraft systems.This role is part of the Blue Origin Engines business unit, where our focus is on the design, development, manufacturing, and...

Site Reliability Engineer

2 months ago


Huntsville, Alabama, United States McGraw-Hill Education Full time
About the Role

We are seeking a highly skilled Site Reliability Engineer to join our team at McGraw Hill Education. As a key member of our cloud engineering team, you will play a critical role in designing and maintaining high-capacity systems that ensure the reliability, performance, and security of our customer platforms.

Key Responsibilities
  • Design, deploy, and manage automation tools in a DevOps model to enhance predictability, accelerate time-to-market, and ensure repeatability, traceability, and transparency of infrastructure automation (infrastructure-as-code, monitoring-as-code).
  • Collaborate with product development teams to optimize systems for reliability and performance, while managing AWS costs and using optimization tools to maximize ROI and meet Service Level Objectives.
  • Continuously learn and stay updated on the AWS ecosystem through participation in game day scenarios, professional conferences, and other development opportunities.
  • Ownership of the reliability, uptime, system security, cost, capacity, resiliency, and performance of applications and platforms, while leading data-driven initiatives to enhance stability and improve service levels.
  • Ensure that the architecture and deployment models are adequately designed to meet SLA commitments.
  • Act as the primary contact during major incidents, resolving issues and managing on-call alarms.
  • Maintain and enhance telemetry systems to improve visibility into application performance and business metrics, ensuring operational workloads are effectively managed.
Requirements
  • Minimum of 5 years of applicable Site Reliability Engineering (SRE) experience.
  • Hands-on experience with following technologies is required: Cloud and Infrastructure as a Code: AWS (CloudFront, S3, EC2, ECS, SES, SQS, SNS, Load Balancing, VPC, Config, Systems Manager, Lambda, API Gateway, DB services) and Terraform.
  • Programming and Containerization: Python, Golang, Bash, Ansible, and AWS ECS.
  • Security and web platforms: Rapid7, WAF, Apache.
  • Apache Tomcat, Angular.
  • Config Management and provisioning: Ansible, Packer.
  • Telemetry: NewRelic, CloudWatch, DataDog.
  • DevSecOps: Artifactory, Jenkins, CircleCI, SonarQube, Jfrog X-Ray, Control Tower, GitHub.
Why McGraw Hill Education?

At McGraw Hill Education, we are committed to creating a culture of curiosity and innovation. As a Site Reliability Engineer, you will have the opportunity to own your growth and develop as we do. Our team is passionate about delivering exceptional, reliable services, and we are looking for talented individuals who share our vision.

The pay range for this position is between $124,350 - $155,000 annually, however, base pay offered may vary depending on job-related knowledge, skills, experience, and annual bonus plan may be provided as part of the compensation package, in addition to a full range of medical and/or other benefits, depending on the position offered.