Site Reliability Engineer

3 months ago


Alpharetta, United States Bakkt LLC Full time
Job Description

About Us

Founded in 2018, Bakkt builds technology that connects commerce.

Our vision is to connect the digital economy by offering one ecosystem for cryptocurrency and digital assets, loyalty, and commerce. We enable our partners and clients to deliver new opportunities to their customers through SaaS and API solutions that unlock crypto and drive loyalty, powering engagement and performance.

Come build with us.

As a Site Reliability Engineer, you will be responsible for closely monitoring our production environments, swiftly addressing issues, and applying creative solutions to ensure the seamless operation of our platform. You will utilize your natural curiosity and strong problem-solving skills to investigate and resolve technical issues across our applications, services, databases & infrastructure.

Responsibilities

  • Observability:
    1. Implement and manage robust monitoring systems to continuously track the functional and non-functional health and performance of our production systems.
    2. Proactively identify anomalies and potential issues before they impact our clients.
  • Client Support:
    1. Partner with software engineering, project management, and customer success teams to respond to client requests and support inquiries.
    2. Work closely with our clients to provide support during integration, and ensure a positive experience.
  • Incident Management:
    1. Lead escalation remediations by working across multiple teams, such as software engineering, DevOps, and project management, for web applications and services running in a 24/7, always-on cloud platform environment.
    2. Participate in an on-call rotation to address and resolve critical incidents outside of regular business hours.
  • Operations:
    1. Execute and develop operational procedures necessary for service requests and incident response.
    2. Maintain critical platform support knowledge, such as customer contact lists, vendor escalation procedures, scheduled job inventories, and operational playbooks.
    3. Support planning and execution of production changes and software releases.
  • Automation:
    1. Develop scripts and tools to automate repetitive tasks, streamline workflows, and improve the efficiency of the production support process.
    2. Assist in the automation of customer operational tasks and ensure alignment with business requirements for customer-facing processes such as customer order reconciliation.
    3. Ensure timely execution of scheduled and repeatable processes such as periodic system validations, daily triage, system monitoring and event log management.
  • Continuous Improvement:
    1. Actively participate in process improvement initiatives, suggesting enhancements to observability, logging strategies, incident response procedures, and support workflows.

Requirements

  • A bachelor’s degree in Computer Science, Information Technology or equivalent
  • 5+ years of application support and production support experience supporting cloud-based platforms using an SRE support model.
  • Proven track record in a production support/SRE role, demonstrating your ability to monitor and troubleshoot complex systems in highly available production environments.
  • Experience with common development tools and practices, including Java-based Spring Boot environments and source control tools such as Git, in a team environment.
  • Demonstrated ability to understand application logs and to support various monitoring and visualization tools (e.g., AlertSite, Logstash, Datadog).
  • Excellent communication skills, both written and verbal, for effective interaction with technical and non-technical stakeholders.
  • Self-starter who can work independently and effectively across functional team environments.
  • Proven ability to learn new IT technologies and disciplines.

Preferred

  • Ability to read and interpret Java, Angular, SQL and other software coding languages
  • Experience with GCP, Google Kubernetes Engine, Google Compute Engine
  • Experience with n-tier web and services application architectures and with Java-based Spring Boot and Tomcat environments
  • Working knowledge of SQL Server
  • Experience with JIRA or other Service Desk tools
  • Experience with multiple OS platforms (Linux, Windows)
  • Experience with MongoDB and a scripting language such as Python

Bakkt is devoted to having diversity in its workforce and is proud to be an equal opportunity employer. Bakkt does not make any employment decisions based on race, color, religion, sex, national origin, veteran status, disability, age, sexual orientation, gender identity, or any other characteristic protected by law.