Current jobs related to Site Reliability Engineering Manager - New York, New York - Insight Global


  • New York, New York, United States City National Bank Full time

    Job SummaryCity National Bank is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key ResponsibilitiesImplement solutions that improve stability, security, scalability,...


  • New York, New York, United States Hebbia Full time

    About HebbiaHebbia is a cutting-edge technology company that empowers users to collaborate with AI on each step and validate responses. Our mission is to put capable AI in the hands of 1 billion people by 2030.Job DescriptionWe are seeking a highly skilled Site Reliability Engineer to contribute to building systems that optimize the uptime and reliability of...


  • New York, New York, United States Syndio Full time

    Job Title: Cloud and Site Reliability Engineering ManagerAt Syndio, we're seeking a highly skilled Cloud and Site Reliability Engineering Manager to join our team. As a key member of our engineering leadership, you will be responsible for defining and driving a vision for our cloud platform, ensuring it is scalable, reliable, and secure.About the RoleThis is...


  • New York, New York, United States City of New York Full time

    Job Title: Site Reliability Engineering ManagerThe City of New York is seeking a highly skilled Site Reliability Engineering Manager to join our team. As a key member of our Applications Division, you will be responsible for managing and mentoring a team of support engineers to ensure the availability, monitoring, performance, efficiency, change management,...


  • New York, New York, United States Hudson River Trading Full time

    Hudson River Trading is seeking a Senior IT Site Reliability Engineer to develop and maintain the corporate productivity stack.This role requires a deep understanding of Linux operating systems and application administration, as well as proficiency in Python and configuration management/IaC.Responsibilities include managing on-premise containerized web...


  • New York, New York, United States Tenth Mountain Full time

    Lead Site Reliability EngineerAt Tenth Mountain, we're committed to helping veterans transition into rewarding civilian careers. As a Lead Site Reliability Engineer, you'll play a critical role in ensuring the reliability and availability of our Payments infrastructure.Key Responsibilities:Provide 24/5 round-the-clock support for the Payments team, covering...


  • New York, New York, United States Formation Bio Full time

    About Formation BioFormation Bio is a pioneering tech and AI-driven pharma company that is revolutionizing the drug development process. By leveraging cutting-edge technology and innovative approaches, we are accelerating the discovery and development of new medicines.Our mission is to bring new treatments to patients faster and more efficiently, and we are...


  • New York, New York, United States Nationstaff Full time

    About This RoleWe are seeking a skilled Site Reliability Engineer to join our team at Nationstaff. As a Site Reliability Engineer, you will be responsible for building and maintaining continuous integration and deployment platforms, automating programmatic tasks, and deploying applications. You will also be responsible for configuration management,...


  • New York, New York, United States Tik Tok Full time

    About Site Reliability Engineering at TikTokTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. As a Site Reliability Engineer at TikTok, you will play a critical role in ensuring the reliability and scalability of our systems.Responsibilities Develop and maintain automation procedures to...


  • New York, New York, United States Nationstaff Full time

    About This RoleWe are seeking a talented Site Reliability Engineer to join our team at Nationstaff. As a Site Reliability Engineer, you will be responsible for building and maintaining continuous integration and deployment platforms, automating programmatic tasks, deploying applications, and monitoring and maintaining the uptime of our platform.Key...


  • New York, New York, United States City of New York Full time

    Job DescriptionThe City of New York is seeking a highly skilled Site Reliability Engineering Manager to join our team. As a key member of our Applications Division, you will be responsible for managing and mentoring a team of support engineers focused on application availability, monitoring, performance, efficiency, change management, and capacity planning...


  • New York, New York, United States Formation Bio Full time

    About Formation BioFormation Bio is a tech and AI-driven pharma company that's revolutionizing the industry with its efficient drug development processes.With a strong focus on innovation, the company has built technology platforms, processes, and capabilities to accelerate all aspects of drug development and clinical trials.By partnering with pharma...


  • New York, New York, United States Oakland Search Full time

    Senior Site Reliability EngineerTitle: Senior Site Reliability EngineerLocation: Manhattan, New York City (3 days in)Comp: $200,000 - $350,000 basic salary + highly competitive performance bonusesLevel: Junior to Senior hiresIndustry: Finance, Trading, Hedge fund, Capital Markets, QuantWe're looking for Software Reliability or Site Reliability Engineers to...


  • New York, New York, United States Citadel Enterprise Americas Services LLC Full time

    Job SummaryCitadel Enterprise Americas Services LLC is seeking a skilled Site Reliability Engineer to join our team. As a key member of our technical operations team, you will be responsible for ensuring the reliability and performance of our trading applications. This is a challenging and rewarding role that requires a strong understanding of software...


  • New York, New York, United States Tik Tok Full time

    About the RoleTikTok is seeking a skilled Site Reliability Engineer to join our U.S. Data Security team. As a key member of our team, you will be responsible for ensuring the reliability and scalability of our software systems.Responsibilities:Collaborate with infrastructure, product, and platform engineering teams to design and deploy scalable and secure...


  • New York, New York, United States Diverse Lynx Full time

    Job Title: Site Reliability Engineer - Cloud Expert Job Summary: We are seeking a highly skilled Site Reliability Engineer with expertise in cloud engineering to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based systems. Responsibilities: *...


  • New York, New York, United States RADAR Full time

    About the RoleWe're seeking a skilled Site Reliability Engineer to join our team at Radar. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining our production infrastructure. Your expertise in Terraform, AWS, and Docker will be crucial in ensuring the high availability and scalability of our systems.The StackWe...


  • New York, New York, United States Major League Soccer Full time

    Job Title: Senior Site Reliability EngineerWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Major League Soccer. As a key member of our technical operations team, you will be responsible for ensuring the reliability, performance, and scalability of our cloud-based infrastructure.Key Responsibilities:Design and implement...


  • New York, New York, United States Alloy Full time

    About the RoleAlloy is seeking a skilled Site Reliability Engineer to join our Infrastructure Team. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining our cloud infrastructure to ensure high uptime and reliability.Key ResponsibilitiesDesign and implement scalable and secure cloud infrastructure using...


  • New York, New York, United States Array Full time

    About the RoleArray is a financial innovation platform that empowers digital brands, financial institutions, and fintechs to deliver compelling consumer products to market faster. Our suite of credit and identity monitoring tools, privacy protection, and financial ads marketplace via embeddable widgets or a clean, modern API help drive revenue and increase...

Site Reliability Engineering Manager

1 month ago


New York, New York, United States Insight Global Full time
Job Description

Insight Global is seeking a seasoned Manager of Site Reliability Engineering to lead our team of advanced Site Reliability Engineers. As a key member of our engineering organization, you will be responsible for designing, deploying, and maintaining our production systems, ensuring their reliability, scalability, and performance.

You will play a critical role in driving continuous improvement initiatives, monitoring system performance, troubleshooting issues, and ensuring timely incident response, root cause analysis, and problem resolution. Your expertise in SRE practices and experience with the listed technologies will enable you to effectively guide the team towards achieving operational excellence.

Key Responsibilities
  • Lead a team of Site Reliability Engineers in designing, deploying, and maintaining production systems
  • Ensure the reliability, scalability, and performance of our infrastructure
  • Drive continuous improvement initiatives to enhance system performance and reliability
  • Monitor system performance, troubleshoot issues, and ensure timely incident response
  • Collaborate with engineering teams to onboard them onto our platform systems
Requirements
  • 10+ years of SRE experience with 3 years of experience leading and managing production teams and systems
  • Expertise in Ansible, Concourse CI, Jenkins, Github Actions, EKS (Kubernetes), Linux Administration, terraform
  • Understanding of SRE principles, including reliability, scalability, availability, and performance
  • Proficient in scripting and automation (e.g., Python, Bash, and GO)
  • Experience with infrastructure-as-code (IaC) tools, configuration management, and CI/CD pipelines
  • Knowledge of cloud platforms (e.g., AWS, Azure, or Google Cloud) and containerization technologies (e.g., Docker)
About Insight Global

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment without regard to race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances.