Reliability Engineering Specialist

4 days ago


Atlanta, Georgia, United States VDart Inc Full time
About the Role

VDart Inc is seeking a skilled Site Reliability Engineer to join our team. In this position, you will be responsible for designing, implementing, and maintaining highly available and scalable cloud-based systems using Kubernetes and AWS services.

Key Responsibilities:
  • Cloud Infrastructure Management: Manage Kubernetes clusters in production and staging environments, ensuring high availability and efficient resource utilization.
  • AWS Services: Develop and maintain automated workflows using AWS cloud services (EC2, S3, RDS, EKS, Lambda) to build, manage, and scale cloud-native infrastructure.
  • CI/CD Pipeline Support: Build, optimize, and maintain CI/CD pipelines to enable seamless code delivery and deployments using tools like Jenkins or GitLab CI.
  • Monitoring & Observability: Implement and maintain monitoring, alerting, and logging solutions using tools such as Prometheus, Grafana, CloudWatch, or ELK stack to ensure system health and availability.
  • Incident Response: Lead and support incident response efforts, conduct root cause analysis, and run post-incident reviews to improve system resilience.
  • Performance Optimization: Identify and resolve performance bottlenecks, improve system efficiency, and ensure applications and infrastructure are optimized for both cost and performance.

Requirements:
  • Kubernetes Expertise: Strong expertise in managing and scaling Kubernetes clusters, including experience with Kubernetes networking, storage, and multi-cluster architectures.
  • AWS Cloud Expertise: Proficiency with AWS services such as EC2, S3, EKS, RDS, VPC, Lambda, IAM, CloudWatch, and others. Experience with AWS best practices for scalability, security, and cost management.
  • CI/CD Pipelines: Experience building and maintaining continuous integration and continuous deployment (CI/CD) pipelines using Jenkins, GitLab CI, or similar tools.
  • Scripting & Automation: Proficiency in scripting languages such as Python, Bash, or Go to automate operational tasks and improve workflows.

Compensation:
We offer an estimated salary range of $120,000 - $180,000 per year, depending on qualifications and location.

  • Atlanta, Georgia, United States Channel Personnel Services Full time

    Job Description: Channel Personnel Services seeks a skilled Reliability Specialist to identify and manage asset reliability risks that could adversely affect plant or business operations. Key Responsibilities: Develop and maintain plant standards that influence the selection of materials, equipment, and spare parts. Design, develop, monitor, and refine an Asset...


  • Atlanta, Georgia, United States Channel Personnel Services Full time

    Job Summary: At Channel Personnel Services, we are looking for a skilled Reliability Specialist to join our team. In this role, you will be responsible for identifying and managing asset reliability risks that could adversely affect plant or business operations. You will develop and maintain plant standards that influence the selection of materials, equipment,...


  • Atlanta, Georgia, United States Truist Inc Full time

    Job Summary: We are seeking a skilled Technical Systems Support Specialist to join our team as a Site Reliability Engineer. This role is responsible for providing day-to-day support for business-critical systems, ensuring operational stability, and quickly resolving incidents.


  • Atlanta, Georgia, United States Motion Recruitment Full time

    Company Overview: Motion Recruitment is partnering with a leading provider of fraud protection in financial institutions to find an exceptional Site Reliability Engineer. This opportunity is perfect for someone who thrives in a dynamic environment and loves building and maintaining top-notch products and applications. Salary: The estimated salary range for this...


  • Atlanta, Georgia, United States CLX Engineering Full time

    Overview: CLX Engineering is seeking an experienced Airport Infrastructure Management Specialist to join our team. This role will play a critical part in the success of our airport projects, acting as an extension of staff with our clients. As an Airport Infrastructure Management Specialist, you will work independently and communicate with CLX Engineering's...


  • Atlanta, Georgia, United States Diligente Technologies Full time

    We are Diligente Technologies, a dynamic organization seeking a highly experienced Principal Database Administrator/Reliability Engineer to join our team. This role offers the opportunity to work on cutting-edge SaaS technologies and impactful projects that cater to enterprises and users worldwide. Key Responsibilities: Database Infrastructure...


  • Atlanta, Georgia, United States Resource Informatics Group Inc Full time

    We are looking for a skilled Site Reliability Engineer to join our team at Resource Informatics Group Inc in Atlanta, GA. As a member of the SRE team, you will play a crucial role in ensuring the reliability and availability of our applications. Key Responsibilities: Automate infrastructure build-out, testing, deploying, monitoring, and other tasks to...


  • Atlanta, Georgia, United States NOVA Engineering Full time

    About NOVA Engineering and Environmental: We are a leading provider of environmental consulting, geotechnical engineering, and construction materials testing and inspection services. Our company was founded in 1996 and has since expanded to serve clients throughout the southeastern United States. As an Environmental Permitting Specialist at NOVA, you will play...


  • Atlanta, Georgia, United States Citizens Full time

    Citizens, a leading financial institution, is seeking an experienced Site Reliability Engineer to join our team. This role involves designing and implementing cutting-edge observability platforms that ensure the reliability and uptime of our critical systems. Job Summary: We are looking for a highly skilled engineer who can collaborate with cross-functional...


  • Atlanta, Georgia, United States Meade Engineering Full time

    About the Role: We are seeking a highly experienced Electrical Substation Engineer to join our team at Meade Engineering, Inc. in Phoenix, Arizona or Austin, Texas. This is a fantastic opportunity for a skilled engineer to work on large-scale data center projects, designing and implementing high/medium voltage electrical distribution systems. Job...


  • Atlanta, Georgia, United States Channel Personnel Services Full time

    Job Overview: We are seeking an experienced Reliability Solutions Expert to join our team at Channel Personnel Services. Key Responsibilities: Develop and maintain plant standards that influence the selection of materials, equipment, and spare parts. Create and refine Asset Maintenance Plans that include value-added preventive maintenance tasks and predictive...


  • Atlanta, Georgia, United States Southeastern Engineering, Inc. Full time

    Job Overview: Southeastern Engineering, Inc. is seeking a highly skilled Technical Construction Support Specialist to join our team. Estimated Salary: $45,000 - $60,000 per year. Responsibilities: This role will support higher level inspectors and engineers on assignments relating to highway and bridge construction inspection. Key responsibilities...


  • Atlanta, Georgia, United States KU Kids Deanwood Full time

    Job Overview: KU Kids Deanwood is seeking a reliable transportation and furniture assembly specialist to join our team. This role involves transporting furniture and items to customers while assisting with delivery and assembling furniture on-site. Estimated Salary: $40,000 - $60,000 per year (dependent on experience). Responsibilities: Driver...


  • Atlanta, Georgia, United States 4P Consulting Inc. Full time

    Job Description: We are seeking a highly skilled Distribution Systems Engineer Specialist to join our team at 4P Consulting Inc. About the Role: This is a full-time position that requires strong technical engineering skills and a solid understanding of power systems, electrical system operations, and distribution systems. The successful candidate will be...


  • Atlanta, Georgia, United States HCL Technologies Ltd. Full time

    At HCL Technologies Ltd., we're seeking a seasoned Reliability Product Manager to join our team. In this role, you'll have the opportunity to make a significant impact on our products and services. About the Role: We're looking for a professional with 8+ years of experience who can help us lead the Reliance Team. The ideal candidate will have big ideas and strategies to...


  • Atlanta, Georgia, United States Centennial Farms Dairy Full time

    About the Role: We are seeking an experienced Senior Plant Maintenance Engineer to lead our engineering and maintenance functions at Centennial Farms Dairy. As a key member of our team, you will be responsible for directing and controlling all plant engineering and maintenance activities to optimize equipment efficiency and overall facility...


  • Atlanta, Georgia, United States Garver Engineering Full time

    At Garver Engineering, we are seeking a highly skilled Bridge Engineer to join our Transportation Design Center in Atlanta, Georgia. As a key member of our team, you will play a crucial role in delivering bridge and structural projects that meet the highest standards of quality and safety. Key Responsibilities: Perform a wide range of design tasks from basic...


  • Atlanta, Georgia, United States KPMG Full time

    KPMG is a leading advisory firm seeking a Senior Specialist, DevOps Engineer to join our Performance Transformation group in our Deal Advisory and Strategy practice. Job Overview: The ideal candidate will have a strong background in software development and experience with DevOps practices. The successful candidate will collaborate with solution architects to...


  • Atlanta, Georgia, United States Scicom Infrastructure Services Full time

    Job Overview: We are seeking a skilled Cloud Resilience Specialist to enhance the reliability and robustness of our cloud-based systems. In this role, you will be responsible for designing and implementing chaos engineering practices to identify weaknesses in our infrastructure and applications. Key Responsibilities: Chaos Engineering Strategies: Develop and...

  • AI Engineer

    4 days ago


    Atlanta, Georgia, United States Enaar Group Full time

    We are seeking a seasoned AI Engineer to join our team as a Financial Risk Specialist at the Enaar Group in Atlanta, GA. About the Role: This is an exciting opportunity for an experienced machine learning engineer to play a pivotal role in guiding our clients towards data-driven decisions, enhanced investment strategies, and secure financial transactions. Key...