Reliability and Scalability Specialist

1 week ago


Atlanta, Georgia, United States Highbrow Full time
Site Reliability Engineer (SRE) at Highbrow

We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team at Highbrow. As an SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our systems.

Key Responsibilities
  • Collaborate with Cross-Functional Teams: Work closely with Application and DevOps teams to ensure seamless integration and communication, fostering a culture of collaboration and open communication.
  • Design and Implement CI/CD Pipelines: Develop and implement Continuous Integration, Continuous Delivery, and Continuous Deployment pipelines to automate build, test, and deployment processes.
  • Automate Deployment and Testing: Develop and troubleshoot Jenkins scripts to automate deployment and testing processes, ensuring efficient and reliable delivery of software.
  • Manage Cloud Infrastructure: Manage and maintain cloud infrastructure using Terraform, AWS CloudFormation, and Python, ensuring scalability and reliability.
  • Containerization and Orchestration: Work with container systems like Docker and Kubernetes to ensure efficient deployment and scaling of applications.
Requirements
  • Expertise in AWS and Terraform: Possess in-depth knowledge of AWS services, including EC2, S3, CloudTrail, VPC, EBS, RDS, ELB, Route 53, CloudWatch, CloudFormation, and Lambda.
  • Experience with GitLab CI/CD: Have experience with GitLab CI/CD, Step Functions, Redis Cache, and DynamoDB.
  • Proficiency in Linux and DevOps: Possess proficiency in Linux, DevOps, and containerization using Kubernetes and Docker.
  • Knowledge of Additional Technologies: Have knowledge of Tomcat, Git, Jenkins, YAML, Kafka, Oracle, Shell Scripting, Java, XML, Splunk, APPD, Prometheus, and Grafana.
Desired Qualifications
  • Experience with SQL and Cassandra: Have experience with SQL and Cassandra.
  • Knowledge of MQ and Message Queuing Systems: Possess knowledge of MQ and message queuing systems.
  • Experience with Cloud Platforms: Have experience with AWS and Azure cloud platforms.


  • Atlanta, Georgia, United States Jobs for Humanity Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Jobs for Humanity. As a key member of our Platform Service Delivery team, you will play a critical role in ensuring the stability, reliability, and scalability of our payment infrastructure.Key ResponsibilitiesParticipate in day-to-day operations, including changes...


  • Atlanta, Georgia, United States Workday Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Workday. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and availability of our customer environments.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure solutionsDevelop and maintain scripts...

  • DevOps Engineer

    7 days ago


    Atlanta, Georgia, United States Motion Recruitment Full time

    About Motion RecruitmentMotion Recruitment is a leading provider of engineering talent to the insurance industry. We are currently seeking a skilled DevOps/Site Reliability Engineer to join our client's team in Atlanta.Job SummaryWe are looking for a highly experienced Site Reliability Engineer to collaborate with our development and operations teams to...


  • Atlanta, Georgia, United States Jobs for Humanity Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our innovative Platform Service Delivery team at FIS Global. As a key member of our team, you will play a critical role in ensuring the high stability, reduced Service Downtime, and improved Quality of Service for our clients.Key ResponsibilitiesParticipate in day-to-day...


  • Atlanta, Georgia, United States Channel Personnel Services Full time

    Job OverviewChannel Personnel Services is seeking a highly skilled Reliability Specialist to join our team. As a key member of our organization, you will play a critical role in ensuring the operability, maintainability, and reliability of our equipment, processes, and systems.Key ResponsibilitiesAsset Risk Management: Identify and manage asset reliability...


  • Atlanta, Georgia, United States Austin Allen Company Full time

    Senior Reliability Specialist / Maintenance and Electrical **Required Industry Background: Wood Products, OSB, Plywood, Lumber, or related forestry sectors. SALARY: Up to $195,000 + Bonus + Benefits + Travel Expenses REMOTE POSITION WITH TRAVEL Travel Requirements: This will depend on your home-based location. You will be spending your time in the southern &...


  • Atlanta, Georgia, United States T-Mobile Full time

    At T-Mobile, we prioritize your growth. Our comprehensive Total Rewards Package guarantees that our employees receive the same exceptional care we extend to our customers. All team members benefit from a competitive base salary and a robust compensation package - this is our Total Rewards. Employees have access to various wealth-building opportunities...


  • Atlanta, Georgia, United States Channel Personnel Services Full time

    Job DescriptionAs an Asset Reliability Specialist, you will play a crucial role in identifying and managing risks associated with asset reliability that may negatively impact operational efficiency. Your responsibilities will include:Developing and maintaining industry standards for materials, equipment, and spare parts.Systematically defining, designing,...


  • Atlanta, Georgia, United States Motion Recruitment Full time

    About Motion RecruitmentMotion Recruitment is a leading provider of engineering talent to prominent insurance firms in the Atlanta area.Job SummaryWe are seeking a skilled DevOps/Senior Site Reliability Engineer to join our client's engineering team. This is a full-time position offering a hybrid work model at their Atlanta office.About the RoleThis is an...


  • Atlanta, Georgia, United States Advansys Full time

    Job Title: Site Reliability Engineer Company: Advansys Location: Remote Duration: Long term Position Overview: We are looking for a proficient Site Reliability Engineer who will play a pivotal role in enhancing the dependability, efficiency, and accessibility of our software solutions. Key Responsibilities: Ensure the stability and performance of software...


  • Atlanta, Georgia, United States Advansys Full time

    About the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Advansys. As a key member of our infrastructure team, you will be responsible for maintaining and improving the reliability, performance, and availability of our software systems.Key Responsibilities:Maintain and improve the reliability, performance, and availability...


  • Atlanta, Georgia, United States BLM GROUP USA CORPORATION Full time

    Job SummaryWe are seeking a highly skilled Maintenance and Reliability Specialist to join our team at BLM GROUP USA CORPORATION. As a key member of our Service department, you will be responsible for performing preventive maintenance of our equipment at customer sites, ensuring timely resolutions to customer issues, and maintaining a positive client...


  • Atlanta, Georgia, United States Motion Recruitment Full time

    Job Title: Site Reliability Engineer 2/Azure DeveloperAbout the Role:Motion Recruitment is seeking a highly skilled Site Reliability Engineer 2/Azure Developer to join our client's engineering team. As a key member of the team, you will be responsible for ensuring the reliability, scalability, and performance of their systems.Key Responsibilities:Collaborate...


  • Atlanta, Georgia, United States FIS Global Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our global team at FIS Global. As a Site Reliability Engineer, you will play a critical role in ensuring the scalability, high availability, performance, stability, and reliability of our software applications.Key ResponsibilitiesFocusing on scalability, high availability,...

  • Reliability Engineer

    5 hours ago


    Atlanta, Georgia, United States National Black MBA Association Full time

    Job DescriptionJob Title: Reliability EngineerJob Summary:We are seeking a highly skilled Reliability Engineer to join our team at the National Black MBA Association. As a Reliability Engineer, you will be responsible for designing, implementing, and managing systems to ensure high availability and performance of production services.Key...


  • Atlanta, Georgia, United States Ultimate Software Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Ultimate Software. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and efficiency of our cloud-based services.Key ResponsibilitiesDesign and implement scalable and reliable cloud infrastructure solutionsDevelop and maintain...


  • Atlanta, Georgia, United States Ultimate Software Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Ultimate Software. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and efficiency of our cloud-based services.Key ResponsibilitiesDesign and implement scalable and reliable cloud infrastructure solutionsDevelop and maintain...


  • Atlanta, Georgia, United States Motion Recruitment Full time

    Job Title: Senior Cloud Reliability EngineerJob Type: Full-timeLocation: Atlanta, GAJob Description:We are seeking a highly skilled Senior Cloud Reliability Engineer to join our team at Motion Recruitment. As a Senior Cloud Reliability Engineer, you will be responsible for designing, implementing, and maintaining the company's cloud infrastructure, ensuring...


  • Atlanta, Georgia, United States Cox Enterprises Full time

    This Software Engineer will be part of the Site Reliability Engineering (SRE) team. The SRE team is an innovative team devoted to providing automated solutions and services for Cox Automotive to measure, evaluate and plan for visible, reliable application delivery and maintenance. As a member of the SRE team, you will work with development teams to help...


  • Atlanta, Georgia, United States FIS Full time

    Job SummaryFIS is seeking a highly skilled Site Reliability Engineer to join our innovative Platform Service Delivery team. As an SRE, you will play a critical role in ensuring the high stability, reduced Service Downtime, and improved Quality of Service for our clients.About the Role:Participate and lead the day-to-day operations of the team for changes and...