DevOps - Site Reliability Engineer ( SRE)

2 weeks ago


Atlanta, United States Resource Informatics Group Inc Full time
Job DescriptionJob Description

 

Role: Site Reliability Engineer

Location: Atlanta, GA

Duration: 12 months

Rate: $market All Inclusive

 

Job Description:

 

  • This Software Engineer will be part of the Site Reliability Engineering (SRE) team.
  • The SRE team is an innovative team devoted to providing automated solutions and services for Cox Automotive to measure, evaluate and plan for visible, reliable application delivery and maintenance.
  • As a member of the SRE team, you will work with development teams to help create automated pipelines and solutions required for continuous delivery in an Agile Dev/Ops environment.
  • The tools and use-cases are diverse, and our challenge is to increase the development velocity by optimizing various parts of the pipeline and increase application stability.
  • This is an opportunity to create automation, monitoring, and pipelines to improve deploy and response time across the board.

 

We are looking for engineers who are passionate about infrastructure as code and continuous deployment to build scalable and highly reliable applications.

If you love to figure out how all the pieces are put together and if automation and building tools to monitor and manage your applications sounds interesting to you, we want to talk to you.

 

What you will do:

  • Automate anything and everything (Infrastructure build out, testing, deploying, monitoring, etc)
  • Design and assist in the authoring of software tools that reliably manage application delivery
  • Design and assist in the setup and maintenance of application monitoring and alerting
  • Engage with Development/Capability Teams to ensure best practices are implemented
  • Improve predictability and reliability of software releases, workflows and operating software.
  • Reduce application deployment windows by leading company towards a Continuous Deployment environment
  • Reduce mean time to recovery (MTTR) by helping troubleshoot, monitor, alert, and automating recovery.

 

The skills we require:

  • Python, Ruby, Go or other systems programming (moderate skills required)
  • Experience with configuration management systems (Octopus, Chef, Puppet)
  • Experience rolling out redundant, mission-critical applications in a highly available production environment
  • Experience with version control systems (Git or SVN)
  • Experience with Cloud Computing platforms (Amazon AWS, Kubernetes, Heroku, etc)
  • Experience with continuous integration tools (Jenkins, CircleCI, etc), Artifactory (or Nexus)
  • Excellent written communication, problem solving, and process management skills
  • Desire to work in a fast paced, evolving, growing, dynamic environment

 

The skills we prefer:

  • Linux system engineering expertise
  • VMWare, VirtualBox experience.
  • Experience supporting Ruby or Java applications - Experience supporting Database Server infrastructure (MySQL, Postgres, etc)
  • Networking Knowledge
  • Experience with Hashicorp tools (Vagrant, Terraform, Packer, etc), Linux Containers (docker, rocket)
  • Experience with Java build tools such as Ant, Maven, Gant, or Gradle
  • Experience with agile development, continuous integration and automated testing
  • Experience with dashboarding, monitoring


  • Atlanta, United States Resource Informatics Group Full time

    Job Description Job Description Role: Site Reliability Engineer Location: Atlanta, GA Duration: 12 months Rate: $market All Inclusive Job Description: This Software Engineer will be part of the Site Reliability Engineering (SRE) team. The SRE team is an innovative team devoted to providing automated solutions and services for Cox Automotive to measure,...


  • Atlanta, United States Scicom Infrastructure Services Full time

    Job DescriptionJob DescriptionSalary:  We are seeking a talented Site Reliability Engineer (SRE) with expertise in Datadog monitoring, Microsoft Azure cloud platform, and experience with Salesforce CPQ (Configure, Price, Quote) to join our dynamic team. The SRE will play a key role in ensuring the reliability, scalability, and performance of our cloud-based...

  • SRE Lead

    3 weeks ago


    Atlanta, United States MAU Workforce Solutions Full time

    3Ci is seeking a Site Reliability Engineer Lead to join their Client's Certification and Deployment management team. This role involves working with stakeholders to define SLOs and SLIs, develop the overall SRE strategy and roadmap, and ensure the reliability and performance of systems. Required Education and Experience Experience as an SRE Lead with a...


  • Atlanta, United States Hermeus Full time

    Hermeus is seeking a Senior Software or Site Reliability Engineer to join the Information Team and take charge of building infrastructure for our business-critical development and test facilities to help accelerate our iteration speed on hardware development and testing. You will work closely with avionics and aerospace engineering teams to design and...


  • Atlanta, United States Highbrow Full time

    System Reliability Engineer (SRE) 1 —> 3 to 5 years experience Location :- Kansas City, Mi or Atlanta, GA or Dallas, Texas Job Description: We are seeking an experienced System Reliability Engineer (SRE) 1 to join our team. The ideal candidate will have 3 to 5 years of relevant experience and will play a crucial role in ensuring the reliability,...


  • Atlanta, United States Highbrow Full time

    System Reliability Engineer (SRE) 2 —> 5 to 7 years experience Location :- Kansas City, Mi or Atlanta, GA or Dallas, Texas Job Description: We are seeking an experienced System Reliability Engineer (SRE) 2 to join our team. The ideal candidate will have 5 to 7 years of relevant experience and will play a crucial role in ensuring the reliability,...


  • Atlanta, United States Hansen Technologies Full time

    About The Role If you are an experienced Site Reliability Engineer join our team in Pune location to become a driving force in ensuring the reliability, performance, and scalability of our systems. As an SRE, you'll be more than just a technical expert, you’ll be a creative problem solver with exceptional customer relationship skills. Your primary mission...

  • GCP SRE

    5 days ago


    Atlanta, GA, USA, United States Diverse Lynx Full time

    Analyze the maturity of the platforms, identify the gaps / missing capabilities and draft architectural solutions and automation opportunities Rollout SRE practices across the platforms, propose solutions and automation opportunities Work with dev, platform, engineering and COE teams to finalize cloud migration & monitoring strategies, recommendations,...


  • Atlanta, United States XomegaITInc. Full time

    Job DescriptionJob DescriptionPerformance/SRE EngineerLocation: Atlanta, GA (Onsite from Day 1)Duration: Long term contractSkillset:Bachelor's degree in engineering with 10+ years of experience in Performance Engineering and Site Reliability Management (SRE).Experience with monitoring production environment for Performance, Availability and holistic view...


  • Atlanta, United States New Relic Full time

    Your opportunity We are an SRE Team who focuses on the customer experience by improving reliability of New Relic's streaming services and making it easy for the development teams creating those services to do reliability right. We concentrate on services built to stream customer data, currently limited to the Kafka platform but later may be expanded to...


  • Atlanta, Georgia, United States New Relic Full time

    Your opportunityWe are an SRE Team who focuses on the customer experience by improving reliability of New Relic's streaming services and making it easy for the development teams creating those services to do reliability right. We concentrate on services built to stream customer data, currently limited to the Kafka platform but later may be expanded to other...


  • Atlanta, United States CNA Search Full time

    Functions and Responsibilities Manage production environments by monitoring availability and taking a holistic view of system health Automate reliability, quality, and repeatability of cloud environments Proactively ensure the highest levels of systems and infrastructure availability Responsible for maintaining tools/systems/platforms for cloud service...


  • Atlanta, GA, United States Smarsh Full time

    Atlanta / PortlandDivisions – Enterprise Engineering /Full-Time /HybridWe are seeking a Sr. Site Reliability Engineer II to join our growing Enterprise Engineering Team. What will you do?Attend and actively participate in team ceremonies (stand-ups, retros, and planning meetings). Coordinate efforts with globally dispersed teams. Respond to incidents...


  • Atlanta, United States iScale Solutions Full time

    Job Description This is a remote position. Key Responsibilities: Design, implement, and maintain highly available and scalable infrastructure on AWS cloud platform. Develop and manage Infrastructure as Code (IaC) using Terraform for provisioning and managing cloud resources. Implement containerization strategies using Docker for packaging and deploying...

  • DevOps Engineer

    1 week ago


    Atlanta, United States Damco Solutions Full time

    We are searching for a decisive and insightful DevOps engineer to join our reputable company. The DevOps engineer will be involved in various stages of each product's lifespan and should remain abreast of technological advancements to promote efficiency. You should also keep track of customer reviews to enhance marketability. To ensure success as a DevOps...

  • AWS Devops Engineer

    2 days ago


    Atlanta, United States Saxon Global Full time

    Hello, Hope you are doing great. Please let me know if you are interested with below position. Position - AWS DevOps Engineer Location - Atlanta, GA (Remote) Job Type - Long Term Contract Job Description: Experience in designing and deploying dynamically scalable, highly available, fault-tolerant, and reliable applications on AWS Expert knowledge in AWS...


  • Atlanta, Georgia, United States Intercontinental Exchange Holdings, Inc. Full time

    Overview: Job Purpose   Our DevOps Engineers apply software engineering practices to build, run and maintain the software and infrastructure required for distributed fault-tolerant systems. DevOps Engineer ensures that the reliability and uptime of our systems aligns with the needs of the system’s user base and optimizes the capacity, performance, and...


  • Atlanta, United States Resource Logistics Full time

    Job Title/Role Akamai CDN DevOps Engineer Atlanta, GA or Frisco, TX Contract Mandatory Skills Akamai - content delivery networks (CDNs) with DevOps practices, AWS, Kubernetes, CI/CD pipelines, Jenkins, Docker, and Ansible. Client Interview Needed for Selection (Yes / No) Yes Job Description Position Overview We are seeking a talented Akamai CDN...

  • DevOps Engineer

    2 weeks ago


    Atlanta, United States Remotely Full time

    This is a remote position. DevOps Engineer(1 year experience, remote) Be part of our future! This job posting builds our talent pool for potential future openings. We'll compare your skills and experience against both current and future needs. If there's a match, we'll contact you directly. No guarantee of immediate placement, and we only consider...

  • DevOps Engineer

    1 week ago


    Atlanta, United States Global Payments Inc. Full time

    Responsibilities:Lead the design and implementation of DevOps practices and principles across the organization.Develop and maintain CI/CD pipelines for automating the build, test, and deployment of applications.Build and deploy software releases and patchesActively troubleshoot any issues that arise during testing, catching and solving issues before...