Site Reliability Engineer

2 weeks ago


Columbus, United States Vision It US Full time
Job DescriptionJob Description

We are looking for an adventurous Senior Site Reliability Engineer who loves AWS technologies. You will be a member of an engineering team where collaboration and innovation are a key focus. As part of this team you will design, build, deploy, and monitor software and infrastructure that delivers new features to the market. Be prepared to explore new technologies and design concepts as an integral part of your job.

What you will be doing
Partner with engineering, security, and product teams to keep our services reliable, available, fast and cost efficient
Build tools and automation that eliminates repetitive tasks, minimizes downtime, achieves human free operations, and provides self-service solutions to product development teams
Design, build and operate large-scale production systems hosted within our on-prem and AWS hosting environments
Lead technology initiatives that drive scalability and reliability improvements
Advocate and implement reliable design patterns (e.g. circuit breakers, graceful degradation)
Share an on-call rotation with your team and respond to incidents; lead triage efforts and provide needed status updates

Skills and Qualifications:
7+ years of industry experience
4 years of full stack software engineering experience in one or more of the following programming languages: Java, Go, C# or Python
3+ years deploying, operating, and debugging server software on Linux. Comfortable diagnosing and resolving common system issues.
Deep experience implementing infrastructure as code with Terraform
You have designed, built, and operated highly available AWS ECS, EKS or independent K8s clusters.
Strong knowledge of common AWS technologies like ELB, CloudFront, EC2, RDS, ElastiCache, S3, ElasticSearch, IAM and Route 53
You have participated in a 24x7 on-call rotation with your team and responded to incidents
Proficient with APM, infrastructure and log aggregation tooling to monitor system health and customer experience (e.g. New Relic, OpenTelemetry, Cloudwatch, Sumologic, ELK)
A proven track record of diagnosing and fixing time sensitive and critical production issues
Experience developing and maintaining ci/cd pipelines (e.g. jenkins, circleci, git, gitflow, sonarqube, blue/green)

Big Pluses
Ansible, Cloudformation, Packer
Database administration skills (AWS Aurora, MySQL, Postgres, Oracle)
Have leveraged deployment strategies such as blue-green and canary
Experience building RESTful services and/or web applications
Experience automating software deployments and following a continuous delivery and deployment model
Experience with system analysis and troubleshooting in large-scale Linux environment

People who have been successful in this role:
Passionate and adept at software development and/or system engineering
Love to understand how new technologies and architectures work, educate coworkers and channel their knowledge into improving system reliability and performance
Continuously learning about application scalability, availability, reliability, and security
Intensely curious about how complex distributed systems operate and fail at scale
Think freely and independently, and are ready to share their views
Eager to learn from mistakes and socialize the lessons learned
Like to take ownership of infrastructure components and leading projects


Required Skills : Terraform,Java
Additional Skills : AWS Engineer,DevOps EngineerThis is a high PRIORITY requisition. This is a PROACTIVE requisition

  • Columbus, United States Huntington Bancshares, Inc. Full time

    The Site Reliability Engineer provides technical and consultative support on the most complex technical matters. Responsibilities:Extensive expertise within production environments (AWS/ On Premise), covering security, deployment, automation, and ser Reliability Engineer, Liability, Reliability, Reliability, Engineer, Technical Support, Technology, Banking


  • Columbus, United States V-Soft Consulting Group Full time

    Job Title: Site Reliability Engineer Location: Columbus OH/Hybrid 3 days onsite Duration: 3+ month CTH Contract W2 Role Required Skills: SRE background for 4+ years, AWS and EC2 and Lambda DynamoDB, python or java, they are moving towards containers (Kubernetes/docker) Job Description Maintain the production environment by monitoring availability and taking...


  • Columbus, United States V-Soft Consulting Group Full time

    Job Title: Site Reliability Engineer Location: Columbus OH/Hybrid 3 days onsite Duration: 3+ month CTH Contract W2 Role Required Skills: SRE background for 4+ years, AWS and EC2 and Lambda DynamoDB, python or java, they are moving towards containers (Kubernetes/docker) Job Description Maintain the production environment by monitoring availability and taking...


  • Columbus, United States Vision It US Full time

    Job Description Job Description We are looking for an adventurous Senior Site Reliability Engineer who loves AWS technologies. You will be a member of an engineering team where collaboration and innovation are a key focus. As part of this team you will design, build, deploy, and monitor software and infrastructure that delivers new features to the market. Be...


  • Columbus, United States Huntington Bancshares, Inc. Full time

    Description Summary: The Site Reliability Engineer provides technical and consultative support on the most complex technical matters. Responsibilities: Extensive expertise within production environments (AWS/On Premise), covering security, deployment, automation, and serverless technologies. Apply deep knowledge of SRE principles to ensure the scalability...


  • Columbus, United States V-Soft Consulting Group, Inc. Full time

    Job Title: Site Reliability EngineerLocation: Columbus OH/Hybrid 3 days onsiteDuration: 3+ month CTHContract W2 RoleRequired Skills: SRE background for 4+ years, AWS and EC2 and Lambda DynamoDB, python or java, they are moving towards containers (Kubernetes/docker)Job DescriptionMaintain the production environment by monitoring availability and taking a...


  • Columbus, United States JobRialto Full time

    Description: Maintain the production environment by monitoring availability and taking a holistic view of system health Ensure highly resilient, low latency, business continuity designs in multi regions application deployments Build software and systems to manage platform infrastructure and applications Improve reliability, quality, and time-to-market of our...


  • Columbus, United States JobRialto Full time

    Description: Maintain the production environment by monitoring availability and taking a holistic view of system health Ensure highly resilient, low latency, business continuity designs in multi regions application deployments Build software and systems to manage platform infrastructure and applications Improve reliability, quality, and time-to-market of our...


  • Columbus, United States Saxon Global Full time

    Duties and Responsibilities: Maintain the production environment by monitoring availability and taking a holistic view of system health Ensure highly resilient, low latency, business continuity designs in multi regions application deployments Build software and systems to manage platform infrastructure and applications Improve reliability, quality, and...


  • Columbus, United States Global Payments Full time

    Site Reliability Engineer I page is loaded Site Reliability Engineer I Apply locations Columbus, Georgia, USA time type Full time posted on Posted Today job requisition id R0050363 Every day, Global Payments makes it possible for millions of people to move money between buyers and sellers using our payments solutions for credit, debit, prepaid and merchant...

  • Reliability Engineer

    2 weeks ago


    Columbus, United States AkzoNobel N.V. Full time

    Leads reliability initiatives in line with site and business needs, tracking all stages to deliver on production, quality, health/safety/environmental, and cost improvement targets. Interfaces with other departments to establish a reliability-centere Reliability Engineer, Liability, Reliability, Continuous Improvement, Equipment Maintenance, Reliability,...


  • Columbus, United States Central Point Partners Full time

    Description: Contract to hire 1-2 teams interview or in person if possible Great communication & written skills JOB DESCRIPTION Summary: The Programmer/Analyst-Senior modifies existing software/application programs, which are typically more complex in nature, or writes new programs to support user and management needs. Duties and...


  • Columbus, Ohio, United States Sunrun Full time

    Everything we do at Sunrun is driven by a determination to transform the way we power our lives. We know that starts at the individual employee level. We strive to foster an environment you can thrive in through our commitment to diversity, inclusion and belonging. A renewable energy revolution is beginning to blossom into the world's largest industrial...


  • Columbus, United States AkzoNobel Full time

    Select how often (in days) to receive an alert: Reliability Engineer Date: May 15, 2024 Location: Columbus, OH, US Company: AkzoNobel We’ve been pioneering a world of possibilities to bring surfaces to life for well over 200 years. As experts in making coatings, there’s a good chance you’re only ever a few meters away from one of our products. Our...


  • Columbus, Ohio, United States AkzoNobel Full time

    We've been pioneering a world of possibilities to bring surfaces to life for well over 200 years. As experts in making coatings, there's a good chance you're only ever a few meters away from one of our products. Our world class portfolio of brands – including Dulux, International, Sikkens and Interpon – is trusted by customers around the globe. We're...


  • Columbus, United States AkzoNobel Full time

    We’ve been pioneering a world of possibilities to bring surfaces to life for well over 200 years. As experts in making coatings, there’s a good chance you’re only ever a few meters away from one of our products. Our world class portfolio of brands – including Dulux, International, Sikkens and Interpon – is trusted by customers around the globe....


  • Columbus, United States Job Juncture Full time

    Respected poultry processor in Perry, GA is seeking a Sr. Reliability Engineer to lead the implementation, continuous improvement, and sustainability of reliability systems and processes within the manufacturing site in order to deliver a strategic reliability-centered maintenance approach to optimizing asset life and productivity through documented asset...


  • Columbus, United States Remotely Full time

    This is a remote position. Site Reliability Engineer - US Residents Only, 1 year experience, remote) Team Remotely Inc. is a staffing and recruitment agency that offers a comprehensive solution for talent acquisition, including sourcing, vetting, pay rolling, and managing talent. Whether you need contract staffing, direct hire, direct sourcing, talent pools,...


  • Columbus, United States Infinity Consulting Solutions Full time

    We have partnered with our client in search of an Application Support Engineer.Application Support Roles & Responsibilities: Application monitoring infrastructure using Splunk or Dynatrace, servers, databases, distributed batch jobs and supporting sustained resiliency, disaster recovery and high availability events Triage Distributed and Mainframe...


  • Columbus, United States Austin Allen Company, LLC Full time

    Reliability Engineer Mechanical Engineer Salary up to $120,000 + Bonus + Benefits + Paid Relocation to either the Mid-West or the Southern USA **Preferred Industry Background: Chemicals, Pulp & Paper, Sheeted Plastics, Non-Wovens, Paper Converting & etc.** As the reliability engineer, you would be responsible for managing the preventative and predictive...