Site Reliability Engineer

2 weeks ago


Bloomington, Illinois, United States Steampunk Full time
Overview:
Steampunk is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our infrastructure and operations team, you will be responsible for ensuring the reliability, availability, and performance of our cloud-based systems and services.

Responsibilities:
Design and implement scalable and efficient cloud infrastructure solutions to meet the needs of our clients. Collaborate with development teams to optimize the performance and resilience of services through code improvements, architectural enhancements, and performance tuning. Develop and maintain monitoring and alerting mechanisms to proactively identify potential issues before they impact users. Automate repetitive tasks and enhance operational efficiency using automation tools and scripts. Collaborate with cross-functional teams to embed reliability best practices into the software development process. Provide mentorship and training to team members on SRE principles and practices. Lead the development and implementation of incident response procedures to ensure timely and effective resolution of issues. Foster a culture of continuous improvement by conducting thorough post-incident reviews and implementing preventative measures.

Requirements:
Master's degree and 8 years of experience; OR Bachelor's degree and 10 years of IT experience. Eligible to obtain and maintain a government security clearance. Knowledge and experience with Agile and DevSecOps methodologies. Experience in system engineering in one or more areas including telecommunications concepts, computer languages, operating systems, database/Data Base Management System (DBMS) and middleware. Experience with source code and binary repository products and techniques, infrastructure and cloud management tools, log management and analysis tools, automation and configuration management tools, and monitoring and alerting tools. Preferred qualifications include knowledge and experience with NewRelic and/or other AIOps platforms, programming skills in Javascript, Ruby, and/or Go, experience with Nginx, HAProxy, Docker, Kubernetes, or similar technologies, and experience with messaging systems, collaboration software, application-based firewall and proxy server(s), and operating systems. Experience with Linux and Windows operating systems, along with scripting tools and techniques such as Bash, CSH, KSH, ZSH, etc. and/or Powershell. Experience with monitoring and alerting tools such as Prometheus, Grafana, and Datadog.

About Steampunk:
Steampunk is a change agent in the Federal contracting industry, bringing new thinking to clients in the Homeland, Federal Civilian, Health, and DoD sectors. Through our human-centered delivery methodology, we are fundamentally changing the expectations our Federal clients have for true shared accountability in solving their toughest mission challenges. As an employee-owned company, we focus on investing in our employees to enable them to do the greatest work of their careers and rewarding them for outstanding contributions to our growth. If you want to learn more about our story, visit our website. We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law. Steampunk participates in the E-Verify program.

  • Bloomington, Illinois, United States Capital One Financial Corp Full time

    Job Title: Lead Platform Engineer, Site Reliability EngineeringCapital One Financial Corp is seeking a highly skilled Lead Platform Engineer, Site Reliability Engineering to join our team. As a key member of our engineering organization, you will be responsible for designing, developing, and implementing scalable and reliable cloud-based systems.Key...


  • Bloomington, Illinois, United States Steampunk Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Steampunk. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining the reliability and performance of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement infrastructure optimization...


  • Bloomington, Illinois, United States Steampunk Full time

    About the RoleWe are seeking a highly skilled Sr. Site Reliability Engineer to join our team at Steampunk. As a key member of our team, you will be responsible for ensuring the reliability, availability, and performance of our cloud-based systems and infrastructure.Key ResponsibilitiesConduct in-depth analyses of infrastructure to identify areas for...


  • Bloomington, Illinois, United States Steampunk Full time

    About the RoleWe are seeking a highly skilled Sr. Site Reliability Engineer to join our team at Steampunk. As a key member of our organization, you will play a critical role in ensuring the reliability, availability, and performance of our cloud-based systems and infrastructure.Key ResponsibilitiesCollaborate with development teams to design and implement...


  • Bloomington, Illinois, United States Steampunk Full time

    OverviewSteampunk is seeking a highly skilled Sr. Site Reliability Engineer to join our team. As a key member of our organization, you will play a critical role in ensuring the reliability, availability, and performance of our cloud-based systems and infrastructure.ResponsibilitiesCollaborate with development and operations teams to design, implement, and...


  • Bloomington, Illinois, United States Steampunk Full time

    About the RoleWe are seeking a highly skilled Sr. Site Reliability Engineer to join our team at Steampunk. As a key member of our infrastructure team, you will be responsible for ensuring the reliability, availability, and performance of our cloud-based systems and services.Key ResponsibilitiesDesign and implement infrastructure optimization strategies to...


  • Bloomington, Illinois, United States Capital One Full time

    Capital One Shopping - Sr Lead Site Reliability EngineerLocations:US Remote, United States of AmericaSr Lead Site Reliability Engineer - Back End, Shopping (Remote-Eligible)Overview:Capital One Shopping is seeking a Sr Lead Site Reliability Engineer to join our dynamic remote-first engineering team. As a key member of our team, you will be responsible for...


  • Bloomington, Illinois, United States Censys Full time

    About CensysCensys is a leading cybersecurity company that empowers security teams with internet visibility and intelligence. Our mission is to bring transparency and trustworthiness to the world's security landscape.Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our Infrastructure and Ops platform team. As a key member...


  • Bloomington, Illinois, United States Booz Allen Hamilton Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Booz Allen Hamilton. As a key member of our engineering team, you will be responsible for designing, building, and maintaining scalable and resilient systems that meet the needs of our clients.Key Responsibilities:Design and implement scalable and resilient...


  • Bloomington, Illinois, United States Capital One Full time

    Job SummaryCapital One is seeking a highly skilled Platform Engineer to join our team. As a Platform Engineer, you will be responsible for designing, developing, and deploying cloud-based infrastructure and applications.Key ResponsibilitiesWork with product owners to understand desired application capabilities and testing scenariosContinuously improve...


  • Bloomington, Illinois, United States Censys Full time

    About CensysCensys is a leading cybersecurity company that empowers security teams with internet visibility and intelligence. Our mission is to bring transparency and trustworthiness to the world's security landscape.Job SummaryWe are seeking a talented Senior Site Reliability Engineer to join our Infrastructure and Ops platform team. As a key member of our...


  • Bloomington, Illinois, United States MITRE Full time

    Reliability and Maintainability EngineerAt MITRE, we're committed to tackling our nation's toughest challenges and creating a fulfilling life for our employees. As a Reliability and Maintainability Engineer, you'll play a critical role in developing innovative solutions for electronic prototypes designed for advanced sensing, communication, and navigation...


  • Bloomington, Illinois, United States Capital One Financial Corp Full time

    About the Role:We are seeking a highly skilled Senior Software Engineer, Site Reliability to join our team at Capital One Financial Corp. As a key member of our engineering team, you will be responsible for designing, developing, testing, implementing, and supporting technical solutions in full-stack development tools and technologies.Key...


  • Bloomington, Illinois, United States The MITRE Corporation Full time

    Job SummaryWe are seeking a highly skilled Reliability and Maintainability Engineer to join our team at The MITRE Corporation. As a key member of our Mechanical and Reliability Systems and Prototype Development Department, you will be responsible for leading reliability, maintainability, and availability (RMA) engineering analyses on programs for various...


  • Bloomington, Illinois, United States PharmEng Technology Americas Full time

    Job OpportunityPharmEng Technology Americas is seeking a seasoned Reliability Engineering Manager to join our team. As a key member of our engineering team, you will be responsible for ensuring the process and process equipment meet compliance, safety, and business requirements on a day-to-day basis.The ideal candidate will have superior skills and...


  • Bloomington, Illinois, United States Capital One Full time

    Transformative Cloud Engineer OpportunityCapital One is seeking a highly skilled Cloud Engineer to join our team and drive innovation in cloud computing. As a Cloud Engineer, you will be responsible for designing, building, and maintaining scalable cloud-based systems that meet the needs of our customers.Key Responsibilities:Lead the development of...


  • Bloomington, Illinois, United States Zachary Piper Solutions Full time

    Piper Companies is seeking a highly skilled Cloud Reliability Engineer to support a world-leading data analytics product and service provider. The Cloud Reliability Engineer will be responsible for ensuring the reliability and availability of our cloud platforms, tackling complex problems, and driving improvements to enhance performance and scalability.Key...


  • Bloomington, Illinois, United States Booz Allen Hamilton Full time

    Job Title: AWS Site Reliability EngineerWe are seeking a highly skilled AWS Site Reliability Engineer to join our team at Booz Allen Hamilton. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.About the Role:This is an exciting opportunity to work with a talented...


  • Bloomington, Illinois, United States Zachary Piper Solutions Full time

    Piper Companies is seeking a Cloud Reliability Specialist to support a world-leading data analytics product & service provider. The Cloud Reliability Specialist will be expected to provide automation, cloud optimization, security implementation, and compliance support.Responsibilities of the Cloud Reliability Specialist include: Maintain the reliability and...


  • Bloomington, Illinois, United States Booz Allen Hamilton Full time

    Job Title: AWS Site Reliability EngineerWe are seeking a highly skilled AWS Site Reliability Engineer to join our team at Booz Allen Hamilton. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.About the Role:This is an exciting opportunity to work with a talented...