Other current jobs related to this field are listed below.


  • Chicago, Illinois, United States Calabitek Full time

    Job Description: Position: Site Reliability Engineer; Location: Remote; Experience: 10+ years. This position is responsible for ensuring application observability, maintenance, and support. The role involves identifying and implementing proactive preventive measures, and evaluating and recommending techniques, practices, or technologies that align with business...


  • Chicago, Illinois, United States Calabitek Full time

    Job Overview: Position: Site Reliability Engineer; Location: Chicago, IL (Local Candidates Preferred); Experience: 10+ years. This position is crucial for ensuring application observability, ongoing maintenance, and robust support. The role involves identifying and implementing proactive preventive measures, as well as evaluating and recommending techniques,...


  • Chicago, Illinois, United States The Hartford Full time

    Senior Site Reliability Engineer. At The Hartford, we are committed to making a significant impact as an insurance provider that transcends traditional coverages and policies. Being part of our team means you have the opportunity to achieve your professional aspirations while assisting others in reaching theirs. Join us as we work towards shaping the...


  • Chicago, United States Resource Logistics Full time

    Role: Site Reliability Engineer; Location: Chicago, IL; Hire Type: Full-time. Responsibilities: expertise with monitoring, alerting, reliability engineering, and observability; experience with Splunk, SignalFx, or similar tools; ability to create log ingestions and identify metrics and KPIs; app, platform, and infra logging and alerting best practices; creating dashboards,...


  • Chicago, United States Definity First Full time

    We are seeking a skilled and motivated Site Reliability Engineer (SRE) to join our dynamic team. As an SRE at Definity First, you will play a crucial role in ensuring the reliability, scalability, and performance of our systems. You will collaborate with cross-functional teams to design, build, and maintain our infrastructure, and you'll have the opportunity...


  • Chicago, United States Resource Logistics Full time

    Role: Site Reliability Engineer; Location: Chicago, IL; Hire Type: Full-time. Expertise with monitoring, alerting, reliability engineering, and observability; experience with Splunk, SignalFx, or similar tools; ability to create log ingestions and identify metrics and KPIs; app, platform, and infra logging and alerting best practices; creating dashboards, event correlation,...


  • Chicago, Illinois, United States DASH2 Full time

    Overview: DASH2 is seeking skilled technical professionals at various levels who are eager to challenge themselves in delivering top-tier SaaS solutions. We provide a stimulating environment that encourages growth, adaptability, and the consistent application of your expertise. Our clients depend on us during critical moments, and our engineering team is...


  • Chicago, United States Oneview Healthcare Full time

    Job Description: Position Overview: Site Reliability Engineers support the smooth functioning of the Oneview system for our hospital customers, using their advanced technical and coding skills. People in this role will be former systems administrators or operations engineers with strong coding skills. Career development in this role...


  • Chicago, Illinois, United States Gusto Full time

    About Gusto: Gusto is a modern, online people platform that helps small businesses take care of their teams. On top of full-service payroll, Gusto offers health insurance, 401(k)s, expert HR, and team management tools. Today, Gusto offices in Denver, San Francisco, and New York serve more than 300,000 businesses nationwide. Our mission is to create a world...


  • Chicago, Illinois, United States Itron, Inc. Full time

    Itron is revolutionizing how utilities and cities manage energy and water. We are committed to creating a more sustainable, resourceful world. Join us. Job Family Summary: Plans, designs, develops, and tests software systems or applications for software enhancements and new products, including cloud-based or internet-related tools. Evaluates reliability of...


  • Chicago, United States AmericanEagle.com Full time

    Americaneagle.com is a family-owned web design, development, and digital marketing agency with a passionate belief in the power of technology to positively transform business practices. Our focus is on helping customers grow and achieve success in the digital space. We cover a variety of different industries, including eCommerce, associations & nonprofits,...


  • Chicago, United States Saxon Global Full time

    Site Reliability Engineer (SRE) - (Azure, Systems background). Client: Lexis Nexis; Location: Remote; Rate: $62 C2C; Duration: 1 year. Notes: Azure, systems background experience. • BSc Engineering/Computer Science or relevant experience. • Proven background working in a technical, IT-related position. • Desirable: Azure certifications ...


  • Chicago, United States Oak Street Health Full time

    Company: Oak Street Health; Title: Engineer II, Site Reliability Engineer; Location: Chicago. Role Description: As a Site Reliability Engineer, you will be instrumental to the stability and performance of a new kind of platform for healthcare, one built specifically for the clinical team. From design to implementation, you will partner with our stellar software...


  • Chicago, United States Outdefine Full time

    As a skilled professional seeking career growth, you deserve access to the best job opportunities available. Join Outdefine's Trusted community today and apply to premier job openings with leading enterprises globally. Set your own rate, keep all your pay, and enjoy the benefits of a fee-free experience. Site Reliability Engineer Uber Freight Software 500+...


  • Chicago, United States McDonald's Global Technology Full time

    Job Description: Company Description: McDonald's new growth strategy, Accelerating the Arches, encompasses all aspects of our business as the leading global omni-channel restaurant brand. As the consumer landscape shifts, we are using our competitive advantages to further strengthen our brand. One of our core growth strategies is to Double Down on the 3Ds...


  • Chicago, United States McDonald's Full time

    McDonald’s new growth strategy, Accelerating the Arches, encompasses all aspects of our business as the leading global omni-channel restaurant brand. As the consumer landscape shifts we are using our competitive advantages to further strengthen our brand. One of our core growth strategies is to Double Down on the 3Ds (Delivery, Digital and Drive Thru)....


  • Chicago, United States Oak Street Health Full time

    Description: Company: Oak Street Health; Title: Engineer II, Site Reliability Engineer; Location: Chicago. Role Description: As a Site Reliability Engineer, you will be instrumental to the stability and performance of a new kind of platform for healthcare, one built specifically for the clinical team. From design to implementation, you will partner with our...


  • Chicago, United States DASH2 Full time

    Summary: We are looking for technical team members at all levels who want to push themselves to deliver best-in-market SaaS solutions. We offer a challenging environment where you will have to grow, adapt, and use your skills consistently. Our customers rely on us in the moments that matter. Engineering delivers on that promise. You can read the bullets...

  • Reliability Engineer

    1 month ago


    Chicago, United States Mondelez International Full time

    Reliability Engineer. Are You Ready to Make It Happen at Mondelēz International? Join our Mission to Lead the Future of Snacking. Make It With Pride. Your goal will be to ensure that the site manufacturing and support activities run without interruption, without any facilities shortages, and without any related issues. You will achieve 100% compliance...

Tier-2 Site Reliability Engineer

2 months ago


Chicago, United States Bank of America Full time

Job Description:

About Us:

At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection.

Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities, and shareholders every day. One of the keys to driving Responsible Growth is being a great place to work for our teammates around the world. We’re devoted to being a diverse and inclusive workplace for everyone. We hire individuals with a broad range of backgrounds and experiences and invest heavily in our teammates and their families by offering competitive benefits to support their physical, emotional, and financial well-being.

Bank of America believes both in the importance of working together and offering flexibility to our employees. We use a multi-faceted approach for flexibility, depending on the various roles in our organization.

Working at Bank of America will give you a great career with opportunities to learn, grow and make an impact, along with the power to make a difference. Join us.

Position Summary:

Bank of America Network Services has a need to recruit a Software Site Reliability Engineer (SRE) to support production operations of network automation solutions. Cisco Robot is one such solution. The Cisco Robot solution is composed of three key Cisco automation tools: Network Service Orchestrator (NSO), Business Process Automation (BPA), and Configuration Workflow Manager (CWM).

The technology areas of focus for the Level 2 Cisco Robot SRE include:

• Cisco CMSP, IBM Watson, and Prometheus monitoring tools for solution component monitoring and event management
• Usage of the BMC Remedy change and incident management system
• Strong familiarity with networking routing and switching protocols, Data Center knowledge and Access network solutions, and an understanding of networking security technologies
• Strong working knowledge of the following Cisco software products: Network Service Orchestrator (NSO), Business Process Automation (BPA), Configuration Workflow Manager (CWM)
• Working knowledge of microservices-based software architecture
• Working knowledge of Kubernetes and OpenShift
• Knowledge of virtualization and cloud (VMware, OpenStack) and database (MongoDB, Postgres) technologies
• Hands-on experience with the Python programming language
• Knowledge of software integration via SOAP/RESTful APIs
• Hands-on experience with network and software configuration tools such as Ansible and Chef/Puppet
• Orchestration skillsets and a foundational knowledge of cloud computing, virtualization, and storage solutions are desirable

The work is always in alignment with the current and approved Network Services Standards, Incident and Problem Management Policies & Procedures, and the governance and management policies set forth by the firm. This position will interface directly with internal stakeholders and external suppliers/providers, architecture, product engineering, product management, and business management. At times, the post holder might be required to interface with various levels of senior management. Strong communication and problem-solving skills are essential. The candidate must be able to work on their own and also contribute in team settings of various sizes and locations. Adherence to and use of standards, product sets, templates, systems, and artifacts are important to the success of the individual, the department, and the firm at large. The ROBOT Support Engineer will be considered a subject matter expert in their field and is expected to stay current with various technologies, organizational goals, and industry trends to drive end-to-end value.

Primary Skill: Virtualization

Required Skills:

• Ensure that Cisco Robot and other network automation production systems are operational in accordance with stated service objectives
• Perform continuous monitoring and event management of Robot production systems
• Manage Robot incidents, including solving problems, triaging complex incidents, and managing end-user incident-related communications
• Write operational playbooks to improve monitoring posture and resolve issues
• Feed more complex requirements to the DevOps teams
• Support business continuity tabletop exercises
• Level 2 escalation point for operational support of End User Access Network Cisco Wired/Wireless LAN, Palo Alto CloudGenix (SD-WAN)
• Technical areas of focus include but are not limited to end-user WAN, LAN, WLAN, SD-WAN, MPLS
• Proactive network reviews, including routine testing of disaster recovery scenarios and identification of vulnerabilities and opportunities for improvement in observability across the network stack
• Mentorship of Production Services Specialists and technical leadership within the team
• Work with senior team members to validate impacts and communicate technical status updates to all stakeholders
• Participate in the documentation of application flows, upstream/downstream impacts during outages, the customer experience in failure scenarios, and contacts for various support needs, and ensure appropriate runbooks and wikis are up to date and available for use during triage
• Work ad-hoc reports and offline incidents at the direction of the senior team members or leadership
• Promote and enforce production governance during triage/testing and fix efforts, exercising judgment within defined procedures and practices to determine appropriate action
• Adhere to design standards and global design authority processes and procedures
• Assemble professional documents based on existing templates and provide accurate work descriptions with assumptions and caveats

Desired Skills:

• Foundational knowledge of routing and switching protocols
• Foundational knowledge of Industry Data Center and Enterprise access network solutions
• Foundational knowledge of Cisco Data Center Compute platforms, such as UCS Blade & Rack Servers
• Foundational knowledge of Cisco Data Center platforms, including Cisco Nexus, Catalyst switches, ASR routers
• Broad understanding of and/or experience with L2-L3 networking, data center, and security technology, sufficient to understand customer solutions, topologies, and interactions with higher networking layers
• 3+ years of experience with other network technologies: WAN, MAN, LAN, Optical, Routing, Switching, Firewall, Proxy/Threat Prevention, DDI, Load Balancing, and AAA
• 2+ years of Cloud or SDN knowledge and experience
• Experience with SDN: Cisco ACI, VMware NSX, Arista CloudVision
• Experience with SD-WAN, preferably on CloudGenix
• Ability to solve network issues and isolate problems
• Understanding of Incident & Change Management process
• Network Automation/Orchestration skillsets in frameworks and toolsets, preferably Tail-f NCS / NSO
• Network Programmability skillsets in Software Defined Networking (SDN), REST APIs, NETCONF, YANG, JSON, and XML
• Foundational knowledge of Cisco and Industry Cloud computing (e.g. OpenStack, VMware and AWS), Data Center, Virtualization, Storage and Networking solutions is desirable
• Programming understanding in Python and exposure to microservices architecture
• Basic administration of MongoDB and/or Postgres
• Experience with container management
• Hands-on experience with the Linux operating system and scripting
• Experience in networking-related disciplines within a design, implementation, or operations role
• Relevant industry certifications in Network Technologies
• Experience working in an Agile environment
• Experience of working within Financial Services (Insurance, Banking, Investment Banking)

Shift: 1st shift (United States of America)

Hours Per Week: 40