Senior Site Reliability Engineer

4 days ago


Atlanta, Georgia, United States Learfield Full time
About Learfield

Learfield is a leading media and technology services company in intercollegiate athletics, unlocking the value of college sports for brands and fans through an omnichannel platform with innovative content and commerce solutions for fan engagement.

Job Title: Senior Site Reliability Engineer

We are seeking an experienced Senior Site Reliability Engineer to join our team. As a key member of our SRE team, you will be responsible for ensuring the reliability, availability, and performance of our live services.

Key Responsibilities:
  • Work in cross-discipline teams to ensure service reliability, availability, and performance
  • Collaborate with our domain engineering and Site Reliability Engineering teams to architect and maintain live services
  • Plan and forecast service capacity and demand, analyze software performance, and tune systems and software
  • Resolve mission-critical incidents and build automation that prevents recurrence and eliminates toil
  • Identify root causes of production issues and recommend permanent fixes
  • Set up and improve monitoring (metrics, logs, alerts, etc.) to identify issues quickly
  • Develop effective documentation, tooling, and alerts to identify and address risks
  • Actively contribute solutions that keep our environment secure; review compliance and internal scans, and work with development teams to stay ahead of security vulnerabilities
  • Develop runbooks for the Level I NOC team to reduce MTTD/MTTR for alerts
  • Participate in the on-call rotation with other members of the Site Reliability Engineering team
Requirements:
  • Experience with Linux container technologies (Docker, Kubernetes)
  • Experience with public and private clouds: GCP, OpenStack, AWS, and/or Azure
  • Understanding of cloud orchestration frameworks (Terraform, Kubernetes, Argo CD, Spinnaker, etc.) and their role in IT transformation
  • 5+ years' experience working with Linux systems and related tooling (kernel, shell, system libraries, file systems, client-server protocols, etc.)
  • Ability to read and write code fluently in C#, Python, or Go
  • Deep understanding of the software development lifecycle, including Git-based CI/CD pipelines
  • Networking: experience with network theory and protocols, e.g. TCP/IP, UDP, DNS, HTTP, TLS, and load balancing
  • Strong experience in distributed systems architectures: layered, event-driven, service mesh, etc.
  • Familiarity with distributed message buses such as Kafka or Confluent
  • Strong interpersonal and communication skills
What We Offer:
  • Approximate base pay range: $110,000 to $130,000
  • Annual discretionary bonus and/or sales compensation
  • Full spectrum of benefits for eligible employees including Medical, Dental, Vision, Health Savings Account, Life Insurance and Other Insurance Plans, Flexible Paid Time Off (including Parental Leave), Paid Holidays, 401(k), and Short/Long Term Disability

Learfield is an Equal Opportunity Employer: Female / Minority / Disability / Protected Veteran / Sexual Orientation / Gender Identity.



  • Atlanta, Georgia, United States Cox Communications Full time

    About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our Manheim Logistics SRE team. As a key member of our team, you will be responsible for designing and maintaining AWS infrastructure and deployment pipelines for our 15+ development teams. Key Responsibilities: Design and implement scalable and reliable infrastructure...


  • Atlanta, Georgia, United States SIDEARM Sports Full time

    About SIDEARM Sports: SIDEARM Sports is a leading provider of technology solutions for collegiate athletic programs. Our team of experts is dedicated to delivering innovative and reliable solutions that meet the evolving needs of our clients. Job Summary: We are seeking an experienced Senior Site Reliability Engineer to join our team. As a key member of our SRE...


  • Atlanta, Georgia, United States Learfield Full time

    About Learfield: Learfield is a leading media and technology services company in intercollegiate athletics, unlocking the value of college sports for brands and fans through an omnichannel platform with innovative content and commerce solutions for fan engagement. Job Summary: We are seeking an experienced Senior Site Reliability Engineer to join our team. As a...


  • Atlanta, Georgia, United States Collabera Full time

    Job Title: Senior Site Reliability Engineer. We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Collabera. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining scalable and reliable systems across our platforms. Key Responsibilities: Partner with Application Development...


  • Atlanta, Georgia, United States PagerDuty Full time

    About the Role: PagerDuty is seeking a highly skilled Senior Site Reliability Engineer to join our SRE-Platform team. As a key contributor, you will play a crucial role in building, maintaining, and scaling the Kubernetes platform that powers PagerDuty. Key Responsibilities: Triage and troubleshoot production issues, ensuring the overall health of the...


  • Atlanta, Georgia, United States Greenlight Full time

    About Greenlight: Greenlight is a leading family fintech company dedicated to empowering parents to raise financially savvy kids. Our mission is to create a better, brighter future for the next generation. Job Summary: We are seeking a Senior Site Reliability Engineer to join our team. As a key member of our Engineering organization, you will play a critical role...


  • Atlanta, Georgia, United States SIDEARM Sports Full time

    About Us: SIDEARM Sports is a leading provider of technology solutions for collegiate athletic programs. We're a passionate team of technologists, creatives, and strategists dedicated to delivering exceptional products and services to our partners and their fans. Job Summary: We're seeking an experienced Senior Site Reliability Engineer to join our team. As a key...


  • Atlanta, Georgia, United States Genesis10 Full time

    Job Title: Senior Site Reliability Engineer. Genesis10 is seeking a Senior Site Reliability Engineer to join our team in Atlanta, GA. This is a 12+ month contract position. About the Role: We are looking for a highly skilled Senior Site Reliability Engineer to join our team. The successful candidate will be responsible for managing and optimizing data streaming...


  • Atlanta, Georgia, United States Cox Enterprises Full time

    About the Role: Cox Automotive is seeking a highly skilled Senior Site Reliability Engineer to join our Manheim Logistics SRE team. As a key member of our team, you will be responsible for designing and maintaining AWS infrastructure and deployment pipelines for our 15+ development teams. Key Responsibilities: Design and implement scalable and reliable...


  • Atlanta, Georgia, United States SIDEARM Sports Full time

    About Us: SIDEARM Sports is a leading provider of technology solutions for collegiate athletic programs. We're a passionate team of technologists, creatives, and strategists dedicated to delivering exceptional products and services. Job Description: We're seeking an experienced Senior Site Reliability Engineer to join our team. As a key member of our SRE team,...


  • Atlanta, Georgia, United States PagerDuty Full time

    About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our SRE-Platform team at PagerDuty. As a key contributor, you will be responsible for building, maintaining, and scaling our Kubernetes platform, which powers our digital operations management solutions. Key Responsibilities: Triage and troubleshoot production issues,...


  • Atlanta, Georgia, United States Cox Communications Full time

    About the Role: Cox Automotive is seeking a highly skilled Senior Site Reliability Engineer to join our Manheim Logistics SRE team. As a key member of our team, you will be responsible for designing and maintaining AWS infrastructure and deployment pipelines for our 15+ development teams. Key Responsibilities: Design and implement scalable and reliable AWS...


  • Atlanta, Georgia, United States Greenlight Full time

    Job Description: Greenlight is a leading fintech company on a mission to help parents raise financially smart kids. We serve over 6 million parents and kids with our award-winning banking app for families. Our platform allows parents to automate allowance, manage chores, set flexible spend controls, and invest for their family's future. Kids and teens learn to...


  • Atlanta, Georgia, United States Cox Enterprises Full time

    About the Role: Cox Automotive is seeking a highly skilled Senior Site Reliability Engineer to join our Manheim Logistics SRE team. As a key member of our team, you will be responsible for designing and maintaining AWS infrastructure and deployment pipelines for our 15+ development teams. Key Responsibilities: Design and implement scalable and reliable AWS...


  • Atlanta, Georgia, United States T-Mobile US, Inc. Full time

    About the Role: We're looking for a talented Site Reliability Engineer to join our team at T-Mobile US, Inc. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our systems and services. Key Responsibilities: Design, implement, and maintain scalable and reliable systems and services. Collaborate with...


  • Atlanta, Georgia, United States Lorven Technologies Full time

    Job Title: Sr Site Reliability Engineer. Job Summary: Lorven Technologies is seeking a highly skilled Sr Site Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for driving cross-team initiatives that improve Delta engineering practices, increasing accountability, and delivering increased uptime and...


  • Atlanta, Georgia, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer. Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based systems and applications. Key Responsibilities: Design, implement, and maintain monitoring tools,...


  • Atlanta, Georgia, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based systems. Key Responsibilities: Design, implement, and maintain monitoring tools, alerts,...


  • Atlanta, Georgia, United States Tata Consultancy Services Full time

    Job Description: We are seeking a highly skilled Site Reliability Engineer to join our team at Tata Consultancy Services. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our infrastructure and applications. Key Responsibilities: Design, develop, and support tools, services, and applications to...