Senior Site Reliability Engineer

3 days ago


New York, New York, United States AEG Full time
Job Title: Senior Site Reliability Engineer

We are seeking a highly skilled Senior Site Reliability Engineer to join our team at AEG. As a key member of our technical operations team, you will be responsible for leading and mentoring our SRE and TechOps teams, with a focus on automation to drive accountability, efficiency, and continuous improvement.

Key Responsibilities:
  • Build and Maintain Observability Frameworks: Develop and implement observability frameworks to monitor the health and performance of our services, ensuring uptime and reliability.
  • Incident Response and On-Call Support: Be the first line of defense in troubleshooting and resolving incidents without relying on runbooks, using strong problem-solving skills.
  • API Testing: Perform thorough API testing for published content using tools like Postman and Cypress to ensure accuracy and performance.
  • Infrastructure as Code: Utilize Terraform for managing infrastructure, including ServiceNow integrations, and automate workflows.
  • Monitoring and Logging: Leverage Datadog, or equivalent tools such as New Relic or Splunk, to set up monitoring, logging, and alerting systems.
  • Collaboration and Communication: Work closely with cross-functional teams, including developers, operations, and product managers, to ensure seamless integration and deployment of services.
  • AWS Resources Management: Manage and optimize AWS resources, including EKS and ECS, to ensure scalability and cost-efficiency.
  • CI/CD Pipeline Management: Use GitLab pipelines for continuous integration and deployment, ensuring smooth and automated delivery of code changes.
  • Integration: Integrate tools like ServiceNow with Slack or Asana to streamline workflows and enhance team communication.
Requirements:
  • 7+ years of experience: Proven background in architecting and managing cloud solutions (AWS, Azure, Google Cloud), along with hands-on experience in complex technology operations environments, including infrastructure, network, security, and incident management.
  • Cloud Expertise: Experience with AWS is mandatory; familiarity with GCP and Azure are a plus.
  • Programming Languages: Proficiency in Python; familiarity with Go, React/React Native is a plus.
  • Infrastructure as Code: Experience with Terraform.
  • API Data Quality checks and Frontend Testing: Hands-on experience with Cypress, Postman, and monitoring tools like Datadog (or equivalents like New Relic or Splunk).
  • Cloud Infrastructure: Strong understanding of AWS services, particularly EKS and ECS.
  • CI/CD Pipelines: Experience with GitLab for managing pipelines and automating deployments.
  • Observability: Expertise in setting up and maintaining observability frameworks to monitor and improve system reliability.
  • Troubleshooting: Excellent problem-solving and analytical abilities.
What We Offer:
  • Competitive Salary: $130,000 - $155,000.
  • Total Rewards Package: Comprehensive medical, dental, and vision benefits, as well as a suite of programs to promote well-being, including a $500 Wellness Reimbursement.
  • Career & Professional Development: On-the-job training, feedback, and ongoing educational opportunities to continue your personal and professional development.
  • Employee Engagement: Office perks, discounts, and employee events that go beyond the traditional paycheck to make you feel part of our team and inspire you to elevate the game.


  • New York, New York, United States Hudson River Trading Full time

    Job Title: Senior IT Site Reliability EngineerHudson River Trading (HRT) is a leading financial services company that utilizes a scientific approach to trading. We are seeking a highly skilled Senior IT Site Reliability Engineer to join our team.Job Summary:The Senior IT Site Reliability Engineer will be responsible for ensuring the availability and...


  • New York, New York, United States Hudson River Trading Full time

    Senior IT Site Reliability EngineerHudson River Trading (HRT) is a leading financial services company that leverages a scientific approach to trading. We are seeking a highly skilled Senior IT Site Reliability Engineer to join our IT Solutions Delivery team.This team is responsible for developing and maintaining the corporate productivity stack for the...


  • New York, New York, United States Peloton Full time

    Peloton Interactive, Inc. - Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team in New York, NY. As a key member of our organization, you will play a critical role in building and maintaining a monitor-able, performant, reliable, and highly scalable deployment platform.Key...


  • New York, New York, United States Peloton Full time

    Peloton Interactive, Inc. - Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team in New York, NY.Job Duties:Collaborate with cross-functional teams to design, build, and maintain a scalable and reliable deployment platform.Develop and implement observability and monitoring solutions to ensure...


  • New York, New York, United States BioSpace, Inc. Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our Product Engineering team at BioSpace, Inc. As a key member of our team, you will play a critical role in shaping the future of drug development by enabling the organization to ship more reliable products faster, without toil or roadblocks, to accelerate the drug...


  • New York, New York, United States Hudson River Trading Full time

    Senior IT Site Reliability EngineerHudson River Trading (HRT) is a leading financial services company that leverages a scientific approach to trading. We are seeking a highly skilled Senior IT Site Reliability Engineer to join our IT Solutions Delivery team.This team is responsible for developing and maintaining the corporate productivity stack for the...


  • New York, New York, United States Peloton Full time

    About the RolePeloton is seeking a highly skilled Senior Site Reliability Engineer to join our team in New York. As a key member of our Site Reliability Engineering team, you will be responsible for designing, implementing, and maintaining our deployment platform to ensure high availability, scalability, and performance.Key ResponsibilitiesCollaborate with...


  • New York, New York, United States Clear Corporate Services LLC Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at CLEAR Corporate Services LLC. As a key member of our Engineering and Product pillar, you will play a critical role in establishing our SRE function and driving the reliability and scalability of our innovative systems.Key ResponsibilitiesEmbed within our...


  • New York, New York, United States Clear Full time

    About the RoleWe're seeking a highly skilled Senior Site Reliability Engineer to join our team at CLEAR. As a key member of our Engineering and Product pillar, you will play a crucial role in establishing our SRE function and driving the reliability and scalability of our innovative systems.Key ResponsibilitiesEmbed within our Engineering and Product team to...


  • New York, New York, United States Clear Corporate Services LLC Full time

    About the RoleWe're seeking a seasoned Senior Site Reliability Engineer to spearhead the establishment of our SRE function. As a key member of our Engineering and Product team, you'll drive the development and implementation of highly reliable and scalable systems that support our growing identity platform.Key ResponsibilitiesEmbed within an Engineering and...


  • New York, New York, United States Clear Corporate Services LLC Full time

    About the RoleWe're seeking a seasoned Senior Site Reliability Engineer to spearhead the establishment of our SRE function. As a key member of our Engineering and Product team, you'll drive the development and implementation of innovative systems that support our growing identity platform.Key ResponsibilitiesEmbed within an Engineering and Product pillar to...


  • New York, New York, United States Hudson River Trading Full time

    Senior IT Site Reliability EngineerHudson River Trading is seeking a seasoned IT Site Reliability Engineer to join our IT Solutions Delivery team.This team is responsible for developing and maintaining the corporate productivity stack for the entire firm, both on-prem and in the cloud.As a Senior IT SRE, you will ensure the availability and reliability of...


  • New York, New York, United States Clear Corporate Services LLC Full time

    About the RoleWe're seeking a seasoned Senior Site Reliability Engineer to spearhead the establishment of our SRE function. As a key member of our Engineering and Product pillar, you'll drive the development of innovative systems that support our growing identity platform.Key ResponsibilitiesEmbed within an Engineering and Product pillar to deeply understand...


  • New York, New York, United States Talented Hires Full time

    About Talented HiresTalented Hires is a dynamic and ambitious Series A startup leading the charge in generative AI for language processing. Our vision is to revolutionize how machines understand and generate human language, unlocking new possibilities for communication and interaction.Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled...


  • New York, New York, United States Podium Full time

    About PodiumPodium is a leading provider of review management, communication, marketing, and payments solutions for local businesses. Our mission is to help local businesses win by providing innovative technology that drives growth and success.The RoleWe are seeking a Senior Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will...


  • New York, New York, United States Podium Full time

    About PodiumPodium is a leading provider of review management, communication, marketing, and payments solutions for local businesses. Our mission is to help these businesses thrive by providing them with the tools and resources they need to succeed.The RoleWe are seeking a Senior Site Reliability Engineer to join our team. As a Site Reliability Engineer, you...


  • New York, New York, United States Major League Soccer Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Major League Soccer. As a key member of our technical operations team, you will be responsible for ensuring the reliability, performance, and scalability of our cloud-based infrastructure.Key Responsibilities:Design and implement...


  • New York, New York, United States BioSpace, Inc. Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our Product Engineering team at Formation Bio. As a key member of our team, you will play a critical role in shaping the future of drug development by enabling our organization to ship more reliable products faster, without toil or roadblocks.ResponsibilitiesCollaborate...


  • New York, New York, United States JobRialto Full time

    Job Title: Senior Site Reliability EngineerJobRialto is seeking a highly skilled Senior Site Reliability Engineer to join our team. As a key member of our engineering team, you will play a critical role in maintaining the reliability, availability, and performance of our systems and services.This position requires a unique blend of software engineering,...


  • New York, New York, United States Formation Bio Full time

    About Formation BioFormation Bio is a pioneering tech and AI-driven pharma company revolutionizing the drug development process. By leveraging cutting-edge technology and innovative approaches, we aim to accelerate the discovery and delivery of new medicines to patients.The RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our...