Infrastructure Reliability Engineer

2 weeks ago


New York, New York, United States Radar Full time
Job Overview

Position Summary

We are seeking Infrastructure Reliability Engineers to enhance our production systems. Radar operates a high-volume, data-centric platform managing over 1 billion API requests daily. Our services are utilized by more than 100 million devices globally. We maintain a multi-availability zone setup, with a key focus on advancing our deployment strategy to encompass multiple regions.

Technology Stack:

Our infrastructure is managed using Terraform with TypeScript (CDKTF).

Deployments are conducted on AWS utilizing Docker containers.

Data storage is handled through MongoDB on Atlas.

We implement blue-green and canary deployment strategies via CircleCI CI/CD.

Production monitoring is conducted through CloudWatch, Honeycomb, Pingdom, and PagerDuty.

DNS management is facilitated by CloudFlare.

Most engineers participate in the on-call rotation.

Our primary programming languages include TypeScript and Rust.

Data processing pipelines are developed using Airflow and Scala Spark.

We actively support OpenStreetMaps, MapLibre, and OpenAddresses.

Work Culture:

Our engineering team primarily consists of former technical co-founders or previous Radar interns from prestigious institutions. Engineers at Radar typically fall into two categories: those with Staff-level expertise in a specific technology stack or those with Multi-Stack capabilities at any level. The term 'Multi-Stack' reflects the diverse areas our engineers may engage with, including Mobile and Data engineering. While expertise in all areas is not required, a willingness to learn and adapt across different stacks is essential.

We prioritize rapid delivery and customer engagement. Our commitment to a comprehensive location infrastructure product is matched by our belief that customer insights are invaluable. Although communication via Slack is integral to our operations, in-person collaboration at our headquarters is often the most efficient way to achieve our goals. Weekly planning sessions occur on Mondays in small groups, utilizing Linear for project management. Each project is overseen by an Engineering lead, an executive, and a Go-to-Market lead, with engineers responsible for identifying needs, engaging with customers, and ensuring their success.

One of our most impactful practices is 'Walk A Mile,' which emphasizes understanding user experiences by engaging directly with our SDK in real-world scenarios. We believe in the importance of shipping significant updates weekly.

Hiring Process:

Following an initial conversation with our CTO, suitable candidates will be invited for an in-person interview at our headquarters. This session will involve collaborative system design challenges and coding exercises to develop a simple application. Candidates will also meet one of our co-founders to discuss our operational approach in greater detail.

Key Responsibilities:

  • Develop and maintain core Radar infrastructure using Terraform deployed on AWS via Docker.
  • Contribute to critical initiatives aimed at achieving 99.999% uptime, multi-region deployment, and optimizing cloud costs.
  • Ensure your contributions impact hundreds of millions of devices.
  • Participate in the on-call rotation (approximately once every 1-2 months).
  • Engage with Radar customers and prospects, gather feedback, and integrate it into your work to drive their success.

Qualifications:

  • Experience managing production environments on AWS using Terraform.
  • Familiarity with multi-region infrastructure achieving five nines availability.
  • Proficient in managing large, sharded MongoDB clusters with significant read-write operations.
  • Background in a high-growth startup environment.
  • Interest in customer interaction and contributing to their success.

Preferred Qualifications:

  • Previous experience as a technical co-founder.
  • Experience with high-throughput, data-intensive applications.

Collaboration:

  • Work alongside Tim Julien, CTO.
  • Collaborate with our Engineering, Customer Success, Sales Engineering, and Sales teams.
  • Engage directly with our customers and prospects.

Benefits:

  • Competitive salary and equity options.
  • Comprehensive medical, dental, and vision coverage with full premium support.
  • 401(k) plan with generous employer matching.
  • Unlimited paid time off.
  • Paid parental leave.
  • Weekly catered meals at our office.
  • Complimentary CitiBike membership (for eligible locations).
  • Monthly fitness reimbursements and wellness programs.

Compensation:

The salary range for this full-time position is between $225,000 - $275,000 annually, with opportunities for performance-based bonuses and incentives. Additionally, Radar offers a competitive equity plan with stock options, providing employees with a meaningful stake in the company's success.

About Radar:

Radar provides location infrastructure for a wide array of products and services. Our geofencing SDKs and maps APIs are utilized by companies such as Vercel, Panera, and T-Mobile to deliver location-based experiences across hundreds of millions of devices globally.

We are dedicated to fostering an inclusive workplace that values diversity and equal opportunity for all candidates.



  • New York, New York, United States Alloy Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Infrastructure Team at Alloy. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key ResponsibilitiesDesign, implement, and maintain scalable and highly available...


  • New York, New York, United States Alloy Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Infrastructure Team at Alloy. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining our cloud infrastructure to ensure high uptime and reliability.Key ResponsibilitiesDesign and implement scalable and secure cloud infrastructure...


  • New York, New York, United States Radar Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Radar, a leading provider of location infrastructure for every product and service. As a Site Reliability Engineer, you will play a critical role in designing, implementing, and maintaining our production infrastructure, ensuring high availability, scalability, and...


  • New York, New York, United States Kyndryl Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Cloud Infrastructure team at Kyndryl. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and security of our cloud-based services.Key ResponsibilitiesDesign and Implement Monitoring and Logging Systems: Develop and...


  • New York, New York, United States Hebbia Full time

    About HebbiaHebbia is a cutting-edge technology company that specializes in developing Artificial General Intelligence (AGI) solutions. Our mission is to empower users to collaborate with AI on complex tasks and validate responses, rather than blindly trusting them.Job DescriptionAs a highly skilled Site Reliability Engineer, you will play a critical role in...


  • New York, New York, United States Radar Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Radar, a leading provider of location infrastructure for every product and service. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key ResponsibilitiesDesign,...


  • New York, New York, United States FILD Search, LLC Full time

    About the Role:Are you an Infrastructure Reliability Specialist dedicated to enhancing user experiences for a vast audience? Do you thrive on ensuring system uptime, high availability, and effective disaster recovery for international platforms? If so, this opportunity may be for you.We are collaborating with a leading entity in the sports and entertainment...


  • New York, New York, United States Russell Tobin & Associates Full time

    Job Description:As a Site Reliability Engineer at Russell Tobin & Associates, you will play a critical role in ensuring the reliability and scalability of our cloud infrastructure. We are seeking a highly skilled and experienced engineer to join our team and contribute to the design, implementation, and maintenance of our cloud-based systems.Key...


  • New York, New York, United States Radar Full time

    About the RoleRadar is a high-throughput, data-intensive application handling 1 billion+ API calls per day. We're seeking a skilled Site Reliability Engineer to work on our production infrastructure.Key ResponsibilitiesDesign and implement scalable cloud infrastructure using Terraform and AWSCollaborate with cross-functional teams to ensure 99.999%...


  • New York, New York, United States Squarespace Full time

    About the RoleSquarespace is seeking an experienced Senior Site Reliability Engineer to join our Compute team. As a key member of our infrastructure engineering team, you will play a critical role in ensuring the reliability and scalability of our system.Key ResponsibilitiesDesign and implement scalable and reliable infrastructure solutions to support our...


  • New York, New York, United States Open Systems Technologies Full time

    Position Overview:A prominent financial organization is in search of a talented Platform Infrastructure Engineer. The successful candidate will possess a wealth of experience in DevOps, TechOps, or Site Reliability Engineering (SRE), particularly with a robust understanding of AWS technologies. This position provides an attractive compensation package and...


  • New York, New York, United States FLOAT LLC Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at FLOAT LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our infrastructure, enabling our engineering teams to focus on delivering high-quality software to our customers.Key ResponsibilitiesContinuously...


  • New York, New York, United States Diverse Lynx Full time

    About the Role:Diverse Lynx is seeking a highly skilled Cloud Reliability Engineer to join our team. As a Cloud Reliability Engineer, you will be responsible for ensuring the reliability and efficiency of our cloud-based systems.Key Responsibilities:Design and implement automated workflows to reduce TOIL and improve system reliabilityDevelop and maintain...


  • New York, New York, United States Formation Bio Full time

    Advancements in AI and drug discovery are creating more candidate drugs than the industry can progress because of the high cost and time of clinical trials. Recognizing that this development bottleneck may ultimately limit the number of new medicines that can reach patients, Formation Bio, founded in 2016 as TrialSpark Inc., has built technology platforms,...


  • New York, New York, United States Fidelity Information Services Full time

    Position: Cloud Infrastructure EngineerCompany: Fidelity Information ServicesJoin a pioneering team focused on Site Reliability EngineeringAssist in transitioning core payment systems to a cloud-native microservices frameworkCollaborate with development teams across multiple global locationsDesign, implement, and sustain services for the organizationChampion...


  • New York, New York, United States Open Systems Technologies Full time

    Job DescriptionCompany: Open Systems TechnologiesJob Title: Cloud Infrastructure EngineerLocation: RemoteJob Type: Full-timeAbout Us: Open Systems Technologies is a leading provider of cloud-based solutions, dedicated to delivering innovative and reliable technology services to our clients.Job Summary: We are seeking an experienced Cloud Infrastructure...


  • New York, New York, United States Hudson River Trading Full time

    About the RoleHudson River Trading (HRT) is seeking a highly skilled Senior IT Site Reliability Engineer to join our IT Solutions Delivery team. As a key member of our team, you will be responsible for ensuring the availability and reliability of our corporate productivity stack, both on-prem and in the cloud.Key ResponsibilitiesManage on-premise...


  • New York, New York, United States Instabase Full time

    At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index Ventures, our...


  • New York, New York, United States Hudson River Trading Full time

    Company OverviewHudson River Trading (HRT) is a leader in algorithmic trading, utilizing advanced technology and innovative strategies to excel in the financial markets.Position SummaryWe are seeking a Senior IT Site Reliability Engineer to enhance our IT Solutions Delivery team. This team is pivotal in developing and sustaining the corporate productivity...


  • New York, New York, United States Hudson River Trading Full time

    Company OverviewHudson River Trading (HRT) employs a scientific methodology in trading financial instruments. We have established one of the most advanced computing environments dedicated to research and development in the field of algorithmic trading.Position SummaryWe are seeking a Senior IT Site Reliability Engineer to enhance our expanding IT Solutions...