Site Reliability Engineer

1 month ago


Chicago, United States Oneview Healthcare Full time
Job DescriptionJob DescriptionSalary:

Position Overview: 

Site Reliability Engineers support and smooth functioning of the Oneview system for our hospital customers, using their advanced technical and coding skills. People in this role will be former systems administrators or operation engineers with strong coding skills. Career development in this role includes developing the skills to advance into Operational Management or Software Engineering. 

 

Key Responsibilities: 

Maintenance:  

  • Maintaining IaC repository and provisions new cloud environments using Terraform. 
  • Maintaining the tooling required to deliver repeatable and reliable deployment of software releases 
  • Deploying software releases to  internal Dev & QA environments using Octopus. 
  • Implementing, validating and operating disaster recovery procedures. 
  • Performing infrastructure patching and upgrades to infrastructural software components; OS, RDBMS, middleware. 

Monitoring and optimization: 

  • Configuring and utilizing monitoring tools to ensure the Oneview system is running efficiently and effectively, and proactively identify any issues. 
  • Identify, validate, and implement optimizations to infrastructure, or identify optimizations to software. 

Configuration management:  

  • Automating repetitive infrastructure and software configuration or maintenance steps using automation tools. 

Troubleshooting: 

  • Diagnose and resolve the most complex technical issues, as needed during system implementation and operation. 

Documentation and reporting: 

  • Writing, updating, and using documentation, including runbooks/playbooks  
  • Document changes, updates, and resolution of technical issues. 
  • Timely escalation of blockers or issues to management. 
  • Contribute to the Technical Operations and Engineering knowledge bases. 

Communication and collaboration: 

  • Work closely with other teams including Account Management, Professional Services, Customer Support and Engineering to ensure the smooth functioning of the Oneview system. 
  • Use communication skills to collaborate effectively with customers, partners and colleagues as required. 

On-site:

  • Traveling to customer sites when needed. 

 

Qualifications: 

Professional and educational:  

  • 5 years relevant work experience in systems administration, IT operations or operation engineering. 
  • A formal qualification in Information Technology, Computer Science or similar. 
  • Experience of working with sensitive data (PII & PHI). 

Technical knowledge:  

  • Hands-on experience supporting SaaS products. 
  • Strong coding skills. 

Communication skills:  

  • The ability to communicate effectively with end-users, stakeholders, and team members is essential. 
  • Fluent written and spoken English is mandatory. 

Experience with tools and technologies:  

  • Cloud: Azure Keyvault, Application Gateways, Network Security Groups, Azure App Config.  
  • CaC: Chef, Ansible, Puppet or similar  
  • Azure DevOps, Jenkins, Gitlab or other CI solutions also welcome.  
  • Containers: Docker, Kubernetes, Helm, Istio, Aquasec, Nexus IQ and associated security and standards tooling.  
  • CI/CD: Experience of full scale SDLC; Code Quality (Sonarqube or similar), Testing (BDD, Cucumber, TDD, NUnit, Specflow, Newman, Selenium), Build (Msbuild, Docker, Nuget), Publish (Nexus, ACR, Azure Artifacts), Deploy (Terraform, Chef, Docker, Kubernetes, Helm, Octopus)  
  • SRE: Experience of Canary, Blue/Green, Zero downtime deployments.   
  • Security: Experience of segregated secrets management for applications. 
  • Monitoring: Datadog, Splunk, APM and RUM, Pester 
  • Imaging: Packer, FOG, WDS, DSC 
  • Experience with task automation and configuration management scripting (e.g. PowerShell, Java Scripting). 
  • Intergration tools: Mirth, RabbitMQ, Service Bus 

 

General requirements: 

  • Ability to work as part of a team or to work unsupervised and take responsibility for the completion of tasks. 
  • Excellent organizational, prioritization and problem-solving skills. 
  • Detail oriented. 
  • Applies analytical thinking. 
  • An enthusiastic and respectful manner with customers and colleagues. 
  • Availability to provide support to customers outside of core hours as and when required by clients including at points going on client sites as required. 


  • Chicago, United States Resource Logistics Full time

    Role: Site Reliability Engineer Location: Chicago, IL Hire Type: Full-time Responsibilities: Expertise with Monitoring, Alerting, Reliability Engineering & Observability Experience with Splunk, SignalFx or similar Tools Ability to create Log ingestions, Identify Metrics and KPIs App, Platform, Infra Logging & Alerting Best practices Creating Dashboards,...


  • Chicago, Illinois, United States Calabitek Full time

    Job DescriptionPosition: Site Reliability EngineerLocation: RemoteExperience: 10+ yearsThis position is responsible for ensuring application observability, maintenance, and support. The role involves identifying and implementing proactive preventive measures, evaluating, and recommending techniques, practices, or technologies that align with business...


  • Chicago, United States Definity First Full time

    We are seeking a skilled and motivated Site Reliability Engineer (SRE) to join our dynamic team. As an SRE at Definity First, you will play a crucial role in ensuring the reliability, scalability, and performance of our systems. You will collaborate with cross-functional teams to design, build, and maintain our infrastructure, and you'll have the opportunity...


  • Chicago, Illinois, United States Calabitek Full time

    Job OverviewPosition: Site Reliability EngineerLocation: Chicago, IL (Local Candidates Preferred)Experience: 10+ YearsThis position is crucial for ensuring application observability, ongoing maintenance, and robust support. The role involves identifying and implementing proactive preventive measures, as well as evaluating and recommending techniques,...


  • Chicago, Illinois, United States National Black MBA Association Full time

    About the RoleThis is a strategic and transformation-focused role within the National Black MBA Association's Global Technology organization. As a Manager of Site Reliability Engineering, you will play a key part in ensuring the reliable and efficient operation of our security services.Key Responsibilities:Design and drive monitoring, alerting, and ticket...


  • Chicago, Illinois, United States National Black MBA Association Full time

    About the RoleThis is a strategic and transformation-focused role within the National Black MBA Association's Global Technology organization. As a Manager of Site Reliability Engineering, you will play a key part in ensuring the reliable and efficient operation of our security services.**Key Responsibilities:**Design and drive monitoring, alerting, and...


  • Chicago, Illinois, United States Oak Street Health Full time

    Transformative Role at Oak Street HealthWe are seeking a skilled Site Reliability Engineer to collaborate with our software engineering teams in implementing monitoring and alerting solutions, designing performance tests, and automating tasks to enhance efficiency.Key ResponsibilitiesDesign and implement telemetry, monitoring, and alerting systems to ensure...


  • Chicago, United States Cleo Full time

    Site Reliability Engineer At Cleo, we make doing business easy! Cleo is an established software company with a start-up feel. We have awesome products, which go hand in hand with our awesome culture! We are devoted to our people and pride ourselves on creating a fun, laid-back, but fast-paced work environment. Not only do we work hard, we play hard. We have...


  • Chicago, United States Saxon Global Full time

    Northern Trust Site Reliability Engineer (Azure) Location : Downtown Chicago - Onsite 2 days/week - 181 W Madison St Duration : 12+ month contract w/extension/conversion Overview The Goals Driven Wealth Management platform is a showcase product for Northern Trusts Wealth Management business and we must demonstrate our ability to deliver and...


  • Chicago, United States AmericanEagle.com Full time

    Americaneagle.com is a family-owned web design, development, and digital marketing agency with a passionate belief in the power of technology to positively transform business practices. Our focus is on helping customers grow and achieve success in the digital space. We cover a variety of different industries, including eCommerce, associations & nonprofits,...


  • Chicago, Illinois, United States The Hartford Full time

    Senior Site Reliability EngineerAt The Hartford, we are committed to making a significant impact as an insurance provider that transcends traditional coverages and policies. Being part of our team means you have the opportunity to achieve your professional aspirations while assisting others in reaching theirs. Join us as we work towards shaping the...


  • Chicago, United States PDSSOFT Full time

    8 Months Contract Only Locals within an hour's drive distance Chicago, IL, US, 60602 Must have 10+ yrs of IT experience Work Model: Hybrid Anchor Days: Monday, Wednesday, Friday Hours: 8:30am - 5pm CST Job Post Title Site Reliability/DevOps Engineer Job Post Summary Seeking a Site Reliability/DevOps Engineer to gather and analyze metrics to assist in...


  • Chicago, United States PDSSOFT Full time

    8 Months Contract Only Locals within an hour's drive distance Chicago, IL, US, 60602 Must have 10+ yrs of IT experience Work Model: Hybrid Anchor Days: Monday, Wednesday, Friday Hours: 8:30am - 5pm CST Job Post Title Site Reliability/DevOps Engineer Job Post Summary Seeking a Site Reliability/DevOps Engineer to gather and analyze metrics to assist...


  • Chicago, Illinois, United States Gusto Full time

    About GustoGusto is a modern, online people platform that helps small businesses take care of their teams. On top of full-service payroll, Gusto offers health insurance, 401(k)s, expert HR, and team management tools. Today, Gusto offices in Denver, San Francisco, and New York serve more than 300,000 businesses nationwide. Our mission is to create a world...


  • Chicago, United States Saxon Global Full time

    Site Reliability Engineer (SRE) - (Azure, Systems background) Client: Lexis Nexis Location: REMOTE Rate: $62 C2C Duration: 1 Year Notes: Azure, Systems background experience •BSc Engineering/Computer Science or relevant experience. •Proven background working in a technical, IT related position. •Desirable -Azure Certifications ...


  • Chicago, United States Oak Street Health Full time

    Company: Oak Street Health Title: Engineer II, Site Reliability Engineer Location: Chicago Role Description: As a Site Reliability Engineer, you will be instrumental to the stability and performance of a new kind of platform for healthcare, one built specifically for the clinical team. From design to implementation, you will partner with our stellar software...


  • Chicago, United States Outdefine Full time

    As a skilled professional seeking career growth, you deserve access to the best job opportunities available. Join Outdefine's Trusted community today and apply to premier job openings with leading enterprises globally. Set your own rate, keep all your pay, and enjoy the benefits of a fee-free experience. Site Reliability Engineer Uber Freight Software 500+...


  • Chicago, Illinois, United States Donato Technologies, Inc Full time

    Job OverviewPosition Title: DevOps EngineerCompany: Donato Technologies, IncWork Model: HybridOnsite Days: Tuesday - ThursdayContract Duration: 6 MonthsPosition SummaryWe are in search of a skilled DevOps Engineer to partner with our Application Development teams in delivering innovative business solutions through agile methodologies while effectively...


  • Chicago, United States McDonald's Full time

    McDonald’s new growth strategy, Accelerating the Arches, encompasses all aspects of our business as the leading global omni-channel restaurant brand. As the consumer landscape shifts we are using our competitive advantages to further strengthen our brand. One of our core growth strategies is to Double Down on the 3Ds (Delivery, Digital and Drive Thru)....


  • Chicago, United States McDonald's Global Technology Full time

    Job DescriptionCompany Description:McDonald's new growth strategy, Accelerating the Arches, encompasses all aspects of our business as the leading global omni-channel restaurant brand. As the consumer landscape shifts we are using our competitive advantages to further strengthen our brand. One of our core growth strategies is to Double Down on the 3Ds...