Lead Site Reliability Engineer

10 hours ago


Atlanta, Georgia, United States Bose Full time
Job Title: Lead Site Reliability Engineer

At Bose, we're passionate about making sound matter. Our Information Technology team is dedicated to delivering valuable and reliable business and technology solutions. We're seeking a Lead Site Reliability Engineer to join our team and lead the way in ensuring the reliability and performance of our systems.

Key Responsibilities:
  • Lead and mentor a team of Site Reliability Engineers, providing guidance and support to ensure the team's success.
  • Foster a culture of collaboration, continuous improvement, and innovation within the team.
  • Define and communicate clear goals and objectives for the SRE team, aligning with overall business objectives.
  • Develop and execute strategies to improve system reliability, availability, and performance.
  • Drive the adoption of best practices and standards for SRE across the organization.
  • Participate in and lead strategic planning for capacity management, disaster recovery, and infrastructure investments.
  • Lead post-incident reviews to identify root causes and implement preventive measures.
  • Develop and enforce incident response procedures and runbooks.
  • Collaborate with engineering and architecture teams to design scalable and resilient system architectures.
  • Optimize system performance and reliability through proactive monitoring, tuning, and enhancements.
  • Evaluate and implement new technologies and tools to improve system capabilities and efficiency.
  • Drive the automation of operational processes to improve efficiency and reduce manual intervention.
  • Oversee the development and maintenance of tools for deployment, monitoring, and configuration management.
  • Promote the use of Infrastructure-as-Code (IaC) and Continuous Integration/Continuous Deployment (CI/CD) practices.
  • Lead efforts in capacity planning to ensure infrastructure can support current and future business needs.
  • Design and implement scaling strategies to handle variations in demand and growth.
  • Monitor and optimize resource utilization to balance performance and cost-effectiveness.
  • Work closely with cross-functional teams, including development, operations, and product management, to ensure alignment on reliability and performance goals.
  • Communicate effectively about system status, performance metrics, and ongoing improvements to stakeholders.
  • Provide technical guidance and support to other teams as needed.
  • Ensure thorough documentation of systems, processes, and procedures.
  • Create and maintain operational runbooks, knowledge base articles, and training materials.
  • Share knowledge and best practices with the team and organization through training sessions and workshops.
Requirements:
  • Advanced proficiency in scripting and programming languages such as Python, Go, Bash, or Java.
  • Extensive experience with monitoring and observability tools (e.g., Prometheus, Grafana, Datadog).
  • In-depth knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes).
  • Strong familiarity with cloud platforms (e.g., AWS, Azure, Google Cloud).
  • Expertise in configuration management and Infrastructure-as-Code tools (e.g., Terraform, Ansible).
  • Strong understanding of networking, distributed systems, and databases.
  • Proven ability to lead and manage technical teams effectively.
  • Excellent problem-solving, analytical, and communication skills.
Experience Requirements:
  • Experience: 5+ years of experience in Site Reliability Engineering, Systems Engineering, or related roles, with at least 2 years in a leadership or management capacity.
Education/Certification Requirements:
  • Education: Bachelor's degree in Computer Science, Engineering, or a related field. Advanced degree or relevant certifications (e.g., AWS Certified DevOps Engineer, Google Professional DevOps Engineer) preferred.

Bose is an equal opportunity employer that is committed to inclusion and diversity. We evaluate qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity, genetic information, national origin, age, disability, veteran status, or any other legally protected characteristics.

Please note, the company's pay transparency is available at Bose Pay Transparency. We are committed to working with and providing reasonable accommodations to individuals with disabilities. If you need a reasonable accommodation because of a disability for any part of the application or employment process, please send an e-mail to [email protected] and let us know the nature of your request and your contact information.



  • Atlanta, Georgia, United States Bose Full time

    Job DescriptionBose is seeking a highly skilled Lead Site Reliability Engineer to join our Information Technology team. As a key member of our team, you will be responsible for leading and mentoring a team of Site Reliability Engineers, providing guidance, support, and performance evaluations.Key ResponsibilitiesLead, mentor, and manage a team of Site...


  • Atlanta, Georgia, United States Bose Full time

    Job DescriptionBose is seeking a highly skilled Site Reliability Engineering Lead to join our Information Technology team. As a key member of our team, you will be responsible for leading and managing a team of Site Reliability Engineers, providing guidance, support, and performance evaluations.Key ResponsibilitiesFoster a culture of collaboration,...


  • Atlanta, Georgia, United States Inficare Full time

    Job Title: Site Reliability Engineer LeadThis position is for a hands-on Site Reliability Engineer Lead, focused on providing robust, secure, and scalable services for a diverse set of applications across on-premise and cloud environments. You will contribute to the strategy and delivery of the team, as well as managing the day-to-day workload. This role...


  • Atlanta, Georgia, United States Datum Technologies Group Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Datum Technologies Group. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Implement and improve monitoring, alerting,...


  • Atlanta, Georgia, United States ACL Digital Full time

    Job Title: Site Reliability EngineerWe are seeking a skilled Site Reliability Engineer to join our team at ACL Digital. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems and applications.Key Responsibilities:Implement and improve monitoring, alerting, and logging...


  • Atlanta, Georgia, United States Navtech Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Navtech Inc. in Atlanta or St. Louis. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our production systems.Key Responsibilities:Provide L4 technical support for production...


  • Atlanta, Georgia, United States Tekwissen Full time

    Job Title: Site Reliability EngineerAt TekWissen Group, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the stability, availability, and performance of our cloud-based systems.Key Responsibilities:Provide consulting services to improve system stability,...


  • Atlanta, Georgia, United States Tata Consultancy Services Full time

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at Tata Consultancy Services. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our mission-critical services.Key ResponsibilitiesAutomate Infrastructure and Testing: Automate infrastructure needs,...


  • Atlanta, Georgia, United States T-Mobile US, Inc. Full time

    About the RoleWe're looking for a talented Site Reliability Engineer to join our team at T-Mobile US, Inc. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our systems and services.Key ResponsibilitiesDesign, implement, and maintain scalable and reliable systems and servicesCollaborate with...


  • Atlanta, Georgia, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerJob Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based systems and applications.Key Responsibilities:Design, implement, and maintain monitoring tools,...


  • Atlanta, Georgia, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design, implement, and maintain monitoring tools, alerts,...


  • Atlanta, Georgia, United States Tata Consultancy Services Full time

    Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to join our team at Tata Consultancy Services. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our infrastructure and applications.Key ResponsibilitiesDesign, develop, and support tools, services, and applications to...


  • Atlanta, Georgia, United States Jobs for Humanity Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our innovative Platform Service Delivery team at FIS Global.About the Role:As a Site Reliability Engineer, you will be responsible for ensuring the high stability, reduced Service Downtime, and improved Quality of Service for FIS clients. You will work with...


  • Atlanta, Georgia, United States Lorven Technologies Full time

    Job Title: Site Reliability EngineerLorven Technologies is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and reliable cloud...


  • Atlanta, Georgia, United States MTech Systems Full time

    Job Title: Site Reliability EngineerAt MTech Systems, we are committed to delivering high-quality software solutions that meet the evolving needs of our customers. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and efficiency of our applications.Key Responsibilities:Performance Management: Identify and...


  • Atlanta, Georgia, United States Next Level Business Services, Inc. Full time

    Job Title: Site Reliability EngineerNext Level Business Services, Inc. is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems and applications.Key Responsibilities:Design, implement, and maintain...


  • Atlanta, Georgia, United States Calsoft Labs Inc. Full time

    Job Title: Site Reliability EngineerJob Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at Calsoft Labs Inc. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design and develop scalable and reliable...


  • Atlanta, Georgia, United States GeorgiaTEK Systems Inc. Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at GeorgiaTEK Systems Inc. in Minneapolis, MN or Atlanta, GA. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design,...


  • Atlanta, Georgia, United States GeorgiaTEK Systems Inc. Full time

    Job Title: Site Reliability EngineerGeorgiaTEK Systems Inc. is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • Atlanta, Georgia, United States Geotab Full time

    Job Title: Site Reliability EngineerGeotab is a global leader in IoT and connected transportation, and we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based infrastructure. You will work closely with our development team to...