Senior Site Reliability Engineer

2 weeks ago


Kansas City, United States Gorilla Logic Full time

Gorilla Logic Overview Gorilla Logic provides nearshore Agile teams to Fortune 500 and SMB companies, bringing unparalleled expertise in the delivery of full-stack web, mobile, and enterprise applications. Our highly collaborative Agile Gorillas are uniquely qualified to implement complex software initiatives. With offices in the United States, Costa Rica, Colombia, and Mexico, Gorilla Logic helps clients gain competitive advantages to achieve results faster.

Job Opening: Senior Site Reliability Engineer (SRE) Gorilla Logic is looking for a Senior Site Reliability Engineer (SRE) responsible for automation, instrumentation, and stability of our client's platforms to achieve operational health and performance. Our environment will require you to work effectively with your teammates, of course. But your real success will be measured by how well you couple critical thinking with self-motivation, enthusiasm, and determination.

Responsibilities

Focus on platform monitoring, analytics, observability, dashboarding, and alerting

Combine sysadmin and development skills to automate Platform infrastructure and operations

Responsible for core SRE tenants of availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning

Focus on Platform infrastructure to optimize existing systems and eliminating work through automation that defines clear processes for a given operation/business solution

Sustain scalable and highly reliable software systems for infrastructure and operations

Applying a software engineering mindset to systems administration

Partner with development teams to bring reliability into the development cycle

Partner with engineering teams to identify and instrument SLAs and SLOs

Must have the ability to work in a dynamic, fast-paced environment

Strong communication skills to interact with Agile team members

Good analytical thinking and problem-solving skills

Technical Requirements

Bachelor's degree in Computer Science or related field (or equivalent experience)

5+ years of web application development background or DevOps

3+ years of experience as a site reliability engineer

Primary programming language skill in Python

2+ years of working in Azure, Azure DevOps (ADO), CI/CD, and Pipelines

Experience with Dynatrace for monitoring, observability, and security

Extensive experience with infrastructure monitoring and performance tools

A proactive approach to spotting problems, solving complex problems, identifying areas for improvement, and performance bottlenecks

Demonstrated track record of maintaining and building large scale distributed systems

Experience with TCP/IP networking protocols and security principles

Proficient in scripting, coding, and deployment automation

Bonus Skills Experience with dynamic resource orchestration frameworks (Docker, Kubernetes), Linux background with both Debian and Ubuntu, Familiar with Jenkins, Spinnaker, Artifactory, Terraform, Datadog, and Sumologic. Familiar with web technologies such as HTTP, TLS, REST, Nginx, and HAProxy.

#J-18808-Ljbffr



  • Kansas City, United States Gorilla Logic Full time

    Gorilla Logic: Mid-Level Site Reliability Engineer (SRE) Gorilla Logic provides nearshore Agile teams to Fortune 500 and SMB companies, bringing unparalleled expertise in the delivery of full-stack web, mobile, and enterprise applications. Our highly collaborative Agile Gorillas are uniquely qualified to implement complex software initiatives. With offices...


  • Jersey City, United States Hispanic Technology Executive Council Full time

    At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities and shareholders every day. One of the keys to driving Responsible Growth is being a great place to work for our teammates...


  • Foster City, United States Zoox Full time

    Foster City, CA • Full-time Staff/Senior Staff Site Reliability Engineer Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service from...


  • Jersey City, United States BCforward Full time

    Job Title: Site Reliability Engineer (AWS) (SRE) Type: W2 (Strictly No C2C and no sponsorship available) Location: Jersey City or Plano or Delaware (Hybrid) Duration: 9 Months Contract to hire Hybrid Model: 3 Days onsite 2 days remote a. Skillset AWS, Big Data, Spark, Python, Shell / Perl Scripting, Control-M, Autosys. Grafana, AppDynamics, APICA b....


  • Arizona City, United States Openlane Full time

    Job Description: Site Reliability Engineer (f.k.a. Platform Engineer) for CarsArrive Network, Inc. located in Mesa, AZ. Provide daily, hands-on assistance to maintain and advance the build process to ensure reliability and optimum integration with Continuous Integration/Continuous Delivery (CI/CD) and Release Management. Work with the development,...


  • Oklahoma City, United States BJ's Wholesale Club Full time

    Lead Site Reliability Engineer page is loaded Lead Site Reliability Engineer Apply locations BJ's Club Support Center Marlborough, MA #5997 time type Full time posted on Posted 2 Days Ago job requisition id R147855 Join our team of more than 34,000 team members, supporting our members and communities in our Club Support Center, 235+ clubs and eight...


  • Jersey City, United States Pinnacle Group, Inc. Full time

    W2 only - Preferred Citizen or Green Card Holder Contract to Hire Must Have: AWS Certification7-8 years of experience and 2 years of AWS expTools: Grafana, DataDogDatabase: MySQL or Oracle-Unix, Linux, Shell Scripting, LAN, NFS-Python, Go Lang, Terraform, Jenkins -Docker, Kubernetes Site Reliability Engineer (AWS) (SRE)Roles and Responsibilities:• Design,...


  • Jersey City, United States Pinnacle Group, Inc. Full time

    W2 only - Preferred Citizen or Green Card Holder Contract to Hire Must Have: AWS Certification7-8 years of experience and 2 years of AWS expTools: Grafana, DataDogDatabase: MySQL or Oracle-Unix, Linux, Shell Scripting, LAN, NFS-Python, Go Lang, Terraform, Jenkins -Docker, Kubernetes Site Reliability Engineer (AWS) (SRE)Roles and Responsibilities:• Design,...


  • Jersey City, United States Pinnacle Group, Inc. Full time

    W2 only - Preferred Citizen or Green Card Holder Contract to Hire Must Have: AWS Certification7-8 years of experience and 2 years of AWS expTools: Grafana, DataDogDatabase: MySQL or Oracle-Unix, Linux, Shell Scripting, LAN, NFS-Python, Go Lang, Terraform, Jenkins -Docker, Kubernetes Site Reliability Engineer (AWS) (SRE)Roles and Responsibilities:• Design,...


  • Foster City, United States Zoox Full time

    Zoox is looking for an experienced leader to lead our Site Reliability Engineering team. Infrastructure is key in building, validating, and running our autonomous driving software, and the team you’ll be running supports it all. In this highly impactful role, you will closely work with partners in many teams including the driving AI teams, safety...


  • Jersey City, New Jersey, United States Devexperts Full time

    Company DescriptionDevexperts has been working for nearly two decades consulting and developing for the financial industry. We solve complex technological challenges facing the most well-respected financial institutions worldwide.By becoming a part of Devexperts, you'll become a part of a company that fosters self-improvement and actively seeks...


  • Redwood City, United States Attain Full time

    About Attain Built for consumers and companies, alike. In a world driven by data, we believe consumers and businesses can coexist. Our founders had a vision to empower consumers to leverage their greatest asset-their data-in exchange for modern financial services. Built with this vision in mind, our platform allows consumers to access savings tools, earned...


  • Jersey City, United States Ben Aris Full time

    Job Summary: The Senior Reliability Engineer will identify and resolve mechanical issues, improving quality, capacity, and reliability. Collaborating with onsite technical teams, corporate networks, and vendors, you'll strive to improve plant reliability and performance while applying technical knowledge to analyze causes of long-term reliability issues,...


  • Jersey City, United States Ben Aris LLC Full time

    Job DescriptionJob DescriptionJob Summary: The Senior Reliability Engineer will identify and resolve mechanical issues, improving quality, capacity, and reliability. Collaborating with onsite technical teams, corporate networks, and vendors, you'll strive to improve plant reliability and performance while applying technical knowledge to analyze causes of...


  • Redwood City, California, United States C3 Full time

    We are looking for a Site Reliability Engineer to join our team at our HQ in Redwood City, CA.Responsibilities:Maximize system uptime and availability, ensuring functional and performance SLAs.Establish end-to-end monitoring and alerting on all critical aspects.Solve complex problems for critical services and build automation to prevent problem...


  • Jersey City, New Jersey, United States tapwage Full time

    There's nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.As a Site Reliability Engineer III at JPMorgan Chase within the Digital Private Markets /Aumni (A JP Morgan Chase Company), you will solve complex...


  • Gibson City, United States International Flavors & Fragrances Full time

    International Fragrances & Flavors Inc (IFF) is looking for an experienced Reliability Engineer for our Gibson City, Illinois Nourish site. The Reliability Engineer at Gibson City provides the technical expertise on all matters of reliability enginee Reliability Engineer, Liability, Reliability, Engineer, Reliability, Technical Support, Manufacturing


  • Foster City, United States Zoox Full time

    Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service from designing systems that are easy to maintain and fault-tolerant through...


  • Foster City, United States Zoox Full time

    Zoox is looking for a site reliability engineer who will be responsible for measuring and maintaining the uptime of the many services critical to the development process for autonomous vehicles. In this role, you will be heavily involved in all phases of rolling out a service from designing systems that are easy to maintain and fault-tolerant through...


  • Jersey City, United States Devexperts Full time

    Devexperts has been working for nearly two decades consulting and developing for the financial industry. We solve complex technological challenges facing the most well-respected financial institutions worldwide.By becoming a part of Devexperts, you’ll become a part of a company that fosters self-improvement and actively seeks out-of-the-box ideas. Our...