Site Reliability Engineer

4 weeks ago


New York, New York, United States Hebbia Full time
About Hebbia

Hebbia is a technology company that enables users to collaborate with AI at each step and validate its responses. Our mission is to put capable AI in the hands of 1 billion people by 2030.

Job Description

We are seeking a highly skilled Site Reliability Engineer to contribute to building systems that optimize the uptime and reliability of our platform. The successful candidate will be responsible for owning our deployment pipelines, building and maintaining our continuous integration and continuous deployment (CI/CD) systems, ensuring the reliability and performance of our services, enhancing our observability, supporting our local development environments, and bolstering our security posture.

Responsibilities
  • Manage deployment pipelines to facilitate smooth and efficient software releases.
  • Implement and maintain observability solutions for monitoring system performance and reliability.
  • Support local development environments to optimize developer workflows.
  • Work with development teams to ensure infrastructure aligns with project requirements.
  • Contribute to improving the security of our infrastructure by assisting with proactive measures and audits.
  • Develop and maintain automation scripts and tools to enhance operational efficiency.
  • Troubleshoot and resolve infrastructure and application issues to minimize downtime and maintain smooth operations.
  • Evaluate and integrate new technologies to enhance the scalability, reliability, and security of our infrastructure.
Requirements
  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
  • 4+ years of software development experience at a venture-backed startup or top technology firm.
  • Proven experience as a Site Reliability Engineer, DevOps Engineer, or similar role.
  • Strong expertise in managing CI/CD pipelines and deployment automation.
  • Proficiency in cloud platforms such as AWS, Azure, or Google Cloud.
  • Solid understanding of containerization and orchestration technologies such as Docker and Kubernetes.
  • Experience with monitoring and observability tools such as Datadog, Prometheus, Grafana, or similar.
  • Knowledge of infrastructure-as-code (IaC) tools such as Terraform or CloudFormation.
  • Familiarity with security best practices and tools for infrastructure and application security.
  • Excellent problem-solving skills and the ability to troubleshoot complex issues.
  • Strong communication skills and the ability to work effectively in a collaborative environment.
  • A proactive and self-motivated approach to learning and adopting new technologies.
  • Passion for continuous improvement and operational excellence.
Compensation

The salary range for this position is $160,000 to $215,000. Adjustments outside this range may be considered for candidates whose qualifications differ significantly from those outlined in the job description. This role is also eligible for our equity plan and benefits program, which includes comprehensive health, dental, and vision coverage, retirement benefits, daily catered lunch, and unlimited PTO.


