Senior Site Reliability Engineer

3 weeks ago


Atlanta, Georgia, United States Boomi Inc Full time

About Boomi and What Makes Us Special

We're a fast-growing company that's changing the world by connecting everyone to everything, anywhere.

Our award-winning, intelligent integration and automation platform helps organizations power the future of business.

At Boomi, you'll work with world-class people and industry-leading technology.

We're looking for trailblazers with an entrepreneurial spirit who can solve challenging problems, make a real impact, and want to be part of building something big.

As a Senior Site Reliability Engineer, you'll be responsible for developing sophisticated systems and software based on the customer's business goals, needs, and general business environment.

You'll work with product management, other engineering teams, customer success, and support on developing cutting-edge new product features and enhancements across various areas of Boomi offerings.

Key Responsibilities:

  • Be an active member of an Agile team, collaboratively realizing features through the software development lifecycle.
  • Design, build, and maintain infrastructure as code that enables provisioning and maintenance of Boomi's infrastructure.
  • Participate actively in detecting, remediating, and reporting on Production incidents, ensuring the SLAs/SLOs are defined and met.
  • Participate in an on-call rotation to ensure coverage for planned/unplanned events.
  • Engage with other Engineering organizations to implement processes, identify improvements, and drive consistent results.
  • Work with your SRE and other engineering counterparts for building more scalable, resilient, and reliable systems.
  • Collaborate with Engineering organizations to build and automate tooling.
  • Implement best practices on Observability and build monitoring that alerts on symptoms rather than on outages.
  • Improve operational processes (such as deployments and upgrades) to make them as simple as possible.
  • Plan the growth of Boomi's infrastructure.
  • Work independently with a minimal level of guidance from technical leadership.
  • Mentor other Boomi engineers, including design collaboration and code reviews.

Requirements:

  • Possess a strong passion for SRE, DevOps, Automation, and infrastructure platforms.
  • Expert in developing Ansible playbooks and automation for Infrastructure as code using CloudFormation templates.
  • A grasp of Cloud Native concepts, containerization best practices, and security awareness in Cloud will be a strong plus.
  • Expert in defining, measuring, and improving Reliability Metrics.
  • Strong understanding in implementing observability practices (Monitoring, Logging, Distributed Tracing etc.) preferably using Splunk and New Relic.
  • Strong understanding and working experience with AWS/Azure.
  • Ability to design and implement APIs for use by internal teams.
  • Strong understanding of CI/CD workflows.
  • Experience with agile collaboration tools, such as JIRA and Confluence.
  • Experience with Web Services technologies including REST, SOAP, and WSDL.

Desired Experience:

  • 7+ years experience in the software engineering industry, with experience supporting large-scale SaaS and Cloud-based software solutions in production.
  • Certified in Cloud (AWS/Azure/GCP), experience in using services such as virtual machines, containers, and databases.
  • Experience in Ansible, Terraform, Python, and JavaScript.
  • Familiarity using AWS technologies such as CloudFormation, S3, ECS, EKS, and EC2.
  • Security awareness in the Cloud will be a strong plus.
  • Experience in Observability, creating dashboards for SLA/SLI/SLO.
  • Basic understanding of Application Integration and/or Data Integration (ETL).

We take pride in our culture and core values and are committed to being a place where everyone can be their true, authentic self.

Our team members are our most valuable resources, and we look for and encourage diversity in backgrounds, thoughts, life experiences, knowledge, and capabilities.

All employment decisions are based on business needs, job requirements, and individual qualifications.

Boomi strives to create an inclusive and accessible environment for candidates and employees.

If you need accommodation during the application or interview process, please submit a request.



  • Atlanta, Georgia, United States Jonas Software UK Full time

    About the Role:We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Jonas Software UK. As a key member of our technical operations team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Atlanta, Georgia, United States Microsoft Corporation Full time

    We are seeking a highly skilled Senior Site Reliability Engineer to join our Windows Servicing and Delivery team at Microsoft Corporation.The ideal candidate will have a strong background in software engineering, network engineering, or systems administration, with a proven track record of delivering high-quality solutions that meet customer needs.As a...


  • Atlanta, Georgia, United States STORD Full time

    About the RoleStord is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our SRE team, you will be responsible for designing and implementing scalable, efficient, and secure infrastructure and platform solutions.You will collaborate with cross-functional teams to deliver high-quality products and services to our...


  • Atlanta, Georgia, United States SIDEARM Sports Full time

    Job SummaryAt SIDEARM Sports, we're seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our SRE team, you'll play a critical role in ensuring the reliability, availability, and performance of our live services, which impact millions of customers across the entertainment space.Key ResponsibilitiesCollaborate with...


  • Atlanta, Georgia, United States Microsoft Corporation Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Microsoft Corporation. As a key member of our Windows Servicing and Delivery team, you will be responsible for ensuring the reliability and performance of our product offerings, including Windows client, Windows Update, and Windows Autopatch.Key Responsibilities...


  • Atlanta, Georgia, United States Cox Communications Full time

    About the RoleThis is an exciting opportunity to join our team as a Senior Site Reliability Engineer. As a key member of our Manheim Logistics SRE team, you will play a crucial role in designing and maintaining AWS infrastructure and deployment pipelines for our 15+ development teams.We are looking for a highly skilled and experienced engineer who can work...


  • Atlanta, Georgia, United States Pyramid Consulting Full time

    Job SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Pyramid Consulting, Inc. This is a contract opportunity with long-term potential and is located in Atlanta, GA.Key ResponsibilitiesDesign and implement SLOs / SLIs / error budgets and manage reliability for infrastructure and applicationsProven experience with...


  • Atlanta, Georgia, United States Microsoft Corporation Full time

    About the RoleMicrosoft Corporation is seeking a highly skilled Senior Site Reliability Engineering Manager to lead the delivery of critical features in Office 365 government cloud offerings. As a key member of the Office 365 team, you will be responsible for combining your passion for quality, reliability, and creativity to drive evolution in the continuous...


  • Atlanta, Georgia, United States Pyramid Consulting Full time

    Pyramid Consulting is seeking a talented Senior Site Reliability Engineer to join our team. This is a contract opportunity with long-term potential and is located in a major US city. The successful candidate will have a strong background in setting SLOs / SLIs / error budgets and managing reliability for infrastructure and applications.Key...


  • Atlanta, Georgia, United States Ditto Job Board Full time

    Job Title: Site Reliability EngineerAt Ditto, we're on a mission to unleash the full power of edge devices by removing all the plumbing required to build amazing applications. As a Site Reliability Engineer, you'll play a critical role in helping us achieve this goal.About the RoleWe're seeking a highly skilled Site Reliability Engineer to join our Federal...


  • Atlanta, Georgia, United States JobRialto Full time

    Job SummaryThe Site Reliability Engineer is responsible for ensuring the availability, scalability, and performance of critical services and systems. This role requires expertise in OpenShift and CloudFormation, along with a deep understanding of site reliability principles, container technologies, monitoring tools, and automation.Key ResponsibilitiesEnsure...


  • Atlanta, Georgia, United States Navtech Full time

    Job Title: Site Reliability EngineerJob Description:We are seeking a highly skilled Site Reliability Engineer to join our team at Navtech. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our production systems.Key Responsibilities:Provide L4 technical support for production 24x7Design and...


  • Atlanta, Georgia, United States Geotab Full time

    About GeotabGeotab is a global leader in IoT and connected transportation, certified as a Great Place to WorkTM. We are a company of diverse and talented individuals who work together to help businesses grow and succeed, and increase the safety and sustainability of our communities.Our team is growing, and we're looking for people who follow their passion,...


  • Atlanta, Georgia, United States Della Infotech Full time

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at Della Infotech. As a key member of our DevOps team, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure using AWS...


  • Atlanta, Georgia, United States Kobiton Full time

    About the RoleKobiton is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and scalability of our systems and services.You will work closely with development and operations teams to build and maintain robust infrastructure, automate...

  • Senior Civil Engineer

    3 weeks ago


    Atlanta, Georgia, United States Sevan Multi-Site Solutions Full time

    Job SummaryWe are seeking a highly skilled Senior Civil Engineer to join our team at Sevan Multi-Site Solutions. As a Senior Civil Engineer, you will be responsible for providing civil engineering services within a dedicated design team, working alongside a group of individuals to deliver high-quality projects.Key ResponsibilitiesPrepare complete, accurate,...


  • Atlanta, Georgia, United States IRIS Consulting Corporation Full time

    Job DescriptionWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at IRIS Consulting Corporation. As a key member of our Retail, Site Reliability Engineering team, you will be responsible for establishing and maintaining the reliability of our cloud-based infrastructure and applications.Key Responsibilities:Design and implement...


  • Atlanta, Georgia, United States Everbridge Full time

    About the Role:We are seeking a highly skilled Senior Data Reliability Engineer to join our team at Everbridge. As a key member of our Database Reliability Engineering team, you will be responsible for ensuring the overall service quality and availability of our data solutions.Key Responsibilities:Own operational availability, security, performance,...


  • Atlanta, Georgia, United States Cynet Systems Full time

    Job Description:We are seeking a highly skilled Site Reliability Engineer to join our team at Cynet Systems. The ideal candidate will have a strong background in application development, architecture, and consulting, with a proven track record of performing assessments and providing roadmaps with project plans.The successful candidate will have a good...


  • Atlanta, Georgia, United States Resource Informatics Group Inc Full time

    Job OverviewWe are seeking a highly skilled Site Reliability Engineer to join our team at Resource Informatics Group Inc. As a key member of our SRE team, you will be responsible for designing and implementing automated solutions to ensure the reliability and scalability of our applications.Key Responsibilities:Design and implement automated pipelines and...