Site Reliability Engineer

3 weeks ago


Alpharetta, United States BAKKT LLC Full time

About Us

Founded in 2018, Bakkt builds technology that connects commerce. Our vision is to connect the digital economy by offering one ecosystem for cryptocurrency and digital assets, loyalty, and commerce. We enable our partners and clients to deliver new opportunities to their customers through SaaS and API solutions that unlock crypto and drive loyalty, powering engagement and performance. Come build with us.

As a Site Reliability Engineer, you will be responsible for closely monitoring our production environments, swiftly addressing issues, and applying creative solutions to ensure the seamless operation of our platform. You will use your natural curiosity and strong problem-solving skills to investigate and resolve technical issues across our applications, services, databases, and infrastructure.

Responsibilities

Observability: Implement and manage robust monitoring systems to continuously track the functional and non-functional health and performance of our production systems. Proactively identify anomalies and potential issues before they impact our clients.

Client Support: Partner with software engineering, project management, and customer success teams to respond to client requests and support inquiries. Work closely with our clients to provide support during integration and ensure a positive experience.

Incident Management: Lead escalation remediation by working across multiple teams, such as software engineering, DevOps, and project management, for web applications and services running in a 24/7, always-on cloud platform environment. Participate in an on-call rotation to address and resolve critical incidents outside of regular business hours.

Operations: Execute and develop the operational procedures necessary for service requests and incident response. Maintain critical platform support knowledge, such as customer contact lists, vendor escalation procedures, scheduled job inventories, and operational playbooks. Support planning and execution of production changes and software releases.

Automation: Develop scripts and tools to automate repetitive tasks, streamline workflows, and improve the efficiency of the production support process. Assist in the automation of customer operational tasks and ensure alignment with business requirements for customer-facing processes such as customer order reconciliation. Ensure timely execution of scheduled and repeatable processes such as periodic system validations, daily triage, system monitoring, and event log management.

Continuous Improvement: Actively participate in process improvement initiatives, suggesting enhancements to observability, logging strategies, incident response procedures, and support workflows.

Requirements

  • A bachelor’s degree in Computer Science, Information Technology, or equivalent.
  • 5+ years of application support and production support experience supporting cloud-based platforms using an SRE support model.
  • Proven track record in a production support/SRE role, demonstrating your ability to monitor and troubleshoot complex systems in highly available production environments.
  • Experience with common development tools and practices, including Java-based Spring Boot environments and source control tools such as Git in a team environment.
  • Demonstrated ability to understand application logs and to support various monitoring and visualization tools (e.g. AlertSite, Logstash, Datadog).
  • Excellent communication skills, both written and verbal, for effective interaction with technical and non-technical stakeholders.
  • Self-starter who can work independently and effectively across functional team environments.
  • Proven ability to learn new IT technologies and disciplines.

Preferred

  • Ability to read and interpret Java, Angular, SQL, and other software coding languages.
  • Experience with GCP, Google Kubernetes Engine, and Google Compute Engine.
  • Experience with n-tier web and services application architectures and with Java-based Spring Boot and Tomcat environments.
  • Working knowledge of SQL Server.
  • Experience with JIRA or other service desk tools.
  • Experience with multiple OS platforms (Linux, Windows).
  • Experience with Mongo and a scripting language such as Python.

Bakkt is devoted to having diversity in its workforce and is proud to be an equal opportunity employer. Bakkt does not make any employment decisions based on race, color, religion, sex, national origin, veteran status, disability, age, sexual orientation, gender identity, or any other characteristic protected by law.



  • Alpharetta, Georgia, United States Advansys Full time

    Position: Senior Site Reliability Engineer. We are seeking a highly skilled Senior Site Reliability Engineer to enhance our cloud infrastructure and ensure optimal system performance. Key Responsibilities: Oversee the availability and reliability of systems across both cloud-native environments (AWS, GCP) and hybrid setups. Implement strategies to improve system...


  • Alpharetta, Georgia, United States Advansys Full time

    Position: Senior Site Reliability Engineer. We are seeking a highly skilled Senior Site Reliability Engineer to oversee the reliability and performance of our systems. This role is crucial for ensuring optimal uptime across diverse environments, including cloud-native and hybrid infrastructures. Key Responsibilities: Oversee system uptime across cloud-based...


  • Alpharetta, Georgia, United States Advansys Full time

    Position: Senior Site Reliability Engineer. We are seeking a highly skilled Senior Site Reliability Engineer to oversee the reliability and performance of our systems. The ideal candidate will be responsible for: Ensuring system uptime across both cloud-native environments (AWS, GCP) and hybrid infrastructures. Implementing best practices for system monitoring...


  • Alpharetta, Georgia, United States Advansys Full time

    Position: Senior Site Reliability Engineer. We are seeking a highly skilled Senior Site Reliability Engineer to oversee the reliability and performance of our systems. The ideal candidate will be responsible for: Ensuring high availability and uptime of systems across cloud-native environments, including AWS and GCP. Implementing best practices for system...


  • Alpharetta, United States Equifax Inc. Full time

    Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. This Intern - Site Reliability Engineer 12-week Internship Program is a gateway to full-time career paths for current university students who are...


  • Alpharetta, United States Bakkt LLC Full time

    About Us: Founded in 2018, Bakkt builds technology that connects commerce. Our vision is to connect the digital economy by offering one ecosystem for cryptocurrency and digital assets, loyalty, and commerce. We enable our partners and clients to deliver new opportunities to their customers through SaaS and API solutions that unlock...


  • Alpharetta, United States Equifax Full time

    Manage system(s) uptime across cloud-native (AWS, GCP) and hybrid architectures. Build infrastructure as code (IaC) patterns that meet security and engineering standards using one or more technologies (Terraform, scripting with cloud CLI, and program...


  • Alpharetta, United States Equifax Full time

    Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale,...


  • Alpharetta, United States Equifax Full time

    About the Role: At Equifax, we're seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our engineering organization, you'll play a critical role in ensuring the reliability and performance of our large-scale, distributed systems. Key Responsibilities: System Uptime Management: Manage system uptime across cloud-native (AWS, GCP)...




  • Alpharetta, United States Insight Global Full time

    Insight Global is seeking a Cloud Reliability Engineer to enhance their team of DevOps and Site Reliability Engineers (SREs) dedicated to supporting their advanced tools. This team is currently overseeing a multitude of environments. The ideal candidate will possess substantial experience in AWS, Linux/Windows systems, and troubleshooting methodologies. This...


  • Alpharetta, United States Equifax Full time

    Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you...


  • Alpharetta, United States Andritz Full time

    About the Role: We are seeking an experienced Electrical and Controls Engineer to join our team in Alpharetta, GA. As a key member of our Automation & Digitalization division, you will be responsible for designing, developing, and delivering high-quality electrical and control engineering projects that improve plant productivity, reliability, and...


  • Alpharetta, United States Equifax Full time

    Equifax is where you can power your possible. If you want to achieve your true potential, chart new paths, develop new skills, collaborate with bright minds, and make a meaningful impact, we want to hear from you. Synopsis of the role: This is a 12-week internship program and it is a gateway to full-time career paths for current university students. The...

  • Reliability Specialist

    3 months ago


    Alpharetta, United States ANDRITZ AG Full time

    Every day, ANDRITZ continues to deliver successful innovative solutions to our customers globally. Why are we so successful? Because we are passionate and love what we do! We are at the forefront of future engineering technologies, with solutions that ensure the success of our clients in key industries that are shaping the future of the world we live in. Job...


  • Alpharetta, United States ANDRITZ AG Full time

    Every day, ANDRITZ continues to deliver successful innovative solutions to our customers globally. Why are we so successful? Because we are passionate and love what we do! We are at the forefront of future engineering technologies, with solutions that ensure the success of our clients in key industries that are shaping the future of the world we live...

  • Field Engineer

    3 months ago


    Alpharetta, United States Synerfac Full time

    We are looking for self-motivated and dependable Field Engineers to join the team. This individual will be mobile throughout the entire shift, will drive from site to site in a company vehicle, and will be provided with a gas card, tablet, and phone. Qualifications: Be on the job and ready to work at the designated project start time with the ability to...


  • Alpharetta, United States Kraftpowercon Inc Full time

    We want to power a better world – are you ready for the challenge? KraftPowercon is a global leader in industrial power conversion. Our innovative solutions, products, and services have helped customers since 1935. Are you the right person for this very rewarding challenge? If yes, you will join a company whose purpose is...

  • Automation Engineer

    3 weeks ago


    Alpharetta, United States Andritz Full time

    Every day, ANDRITZ continues to deliver successful innovative solutions to our customers globally. Why are we so successful? Because we are passionate and love what we do! We are at the forefront of future engineering technologies, with solutions that ensure the success of our clients in key industries that are shaping the future of the world we live in. Our...

  • Automation Engineer

    4 weeks ago


    Alpharetta, United States ANDRITZ AG Full time

    Every day, ANDRITZ continues to deliver successful innovative solutions to our customers globally. Why are we so successful? Because we are passionate and love what we do! We are at the forefront of future engineering technologies, with solutions that ensure the success of our clients in key industries that are shaping the future of the world we live in. Our...