Site Reliability Engineer

2 weeks ago


New York, New York, United States PulsePoint Full time
About the Role

PulsePoint is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our multi-datacenter and hybrid Linux environments.

Key Responsibilities
  • Manage large-scale Linux infrastructure to ensure maximum uptime
  • Perform performance and reliability testing, including reviewing configuration, software choices/versions, and hardware specs
  • Advance our technology stack with innovative ideas and new creative solutions
  • Participate in capacity management of core systems and services, application analysis, and performance and security tuning
  • Provide operational support of systems and build automation to remediate and address the root cause
  • Create strategies for long-term permanent fixes to critical production incidents
  • Maintain documentation, build tooling, and create alerts to identify and address infrastructure reliability
  • Proactively identify system anomalies
  • Collaborate with the security team on new initiatives and ongoing changes
Requirements
  • Good attitude, positive and friendly demeanor
  • Thorough understanding of Linux (we use CentOS and Rocky Linux in production)
  • Deep understanding of Puppet stack (roles & profiles, Hiera, PuppetDB)
  • Experience with Foreman
  • Knowledge of git and ability to resolve merge conflicts
  • Experience with Jenkins CI
  • Experience administering SQL/NoSQL databases (MySQL, PostgreSQL, MongoDB, ES, Redis, Memcached)
  • Ability to work with Cassandra database clusters from installation through troubleshooting and maintenance
  • Experience with scalable infrastructure monitoring solutions such as Icinga, Prometheus, ELK, Graphite
  • Strong scripting and automation skills using languages like Ruby, Python, Bash
  • Understanding of networking concepts (TCP/IP stack, DNS, PKI, CDN, load balancing)
  • Experience with on-prem/bare metal servers operation
  • Knowledge of virtualization solutions - KVM
  • Experience with container technologies such as Docker, Containerd
  • Diverse experience with IT Security-related best practices in the SRE context
  • Willing and able to work East Coast U.S. hours (9am-6pm EST)
Preferred Qualifications
  • Knowledge of K8s and its ecosystem
  • Experience in AdTech or High-Frequency Trading a plus
  • Hands-on experience with cloud platforms (AWS and GCP)
What We Offer
  • Comprehensive healthcare with medical, dental, and vision options, and 100%-paid life & disability insurance
  • 401(k) Match
  • Generous paid vacation and sick time
  • Paid parental leave & adoption assistance
  • Annual tuition assistance
  • Better Yourself Wellness program
  • Commuter benefits and commuting subsidy
  • Group volunteer opportunities and fun events
  • A referral bonus program

PulsePoint is an Equal Opportunity/Affirmative Action employer and does not discriminate on the basis of race, ancestry, color, religion, sex, gender, age, marital status, sexual orientation, gender identity, national origin, medical condition, disability, veterans status, or any other basis protected by law.



  • New York, New York, United States Lorven Technologies Full time

    Job Title: Site Reliability EngineerLorven Technologies is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and reliable cloud...


  • New York, New York, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design and implement automated workflows to reduce TOIL and...


  • New York, New York, United States Phaxis Full time

    Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Phaxis. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining our critical infrastructure platforms.Key Responsibilities:Design and implement scalable and resilient servicesCollaborate with engineering teams to...


  • New York, New York, United States FLOAT LLC Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Float LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud infrastructure, enabling our engineering teams to focus on delivering high-quality software to our customers.Key...


  • New York, New York, United States Unreal Gigs Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Unreal Gigs. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems.Key Responsibilities:Design, implement, and maintain scalable infrastructure solutions to support...


  • New York, New York, United States Alchemy Full time

    About the RoleAlchemy is seeking a highly skilled Site Reliability Engineer to join our Infrastructure team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our globally used developer platform.Key ResponsibilitiesDesign, deploy, and continuously improve the infrastructure supporting...


  • New York, New York, United States ADP Full time

    About ADPADP is a global leader in HR technology, offering the latest AI and machine learning-enhanced payroll, tax, HR, benefits, and more. We believe our people make all the difference in cultivating an inclusive, down-to-earth culture that welcomes ideas, encourages innovation, and values belonging.Job DescriptionWe are seeking a Site Reliability Engineer...


  • New York, New York, United States Motion Recruitment Full time

    Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Motion Recruitment. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our systems, as well as collaborating with cross-functional teams to drive innovation and improvement.Key Responsibilities:Design,...


  • New York, New York, United States Diverse Lynx Full time

    Job Title: SRE - Site Reliability EngineerJob Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.Key Responsibilities:Design and implement automated workflows to reduce TOIL and...


  • New York, New York, United States Braze Full time

    About the RoleWe're seeking a highly skilled Site Reliability Engineer to join our team at Braze. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our internal-facing services and platforms.Key ResponsibilitiesPartner with Braze's engineering teams to architect products that effectively utilize...


  • New York, New York, United States Unreal Gigs Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our tech startup, Unreal Gigs, specializing in infrastructure and authorization solutions.As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability, availability, and performance of our systems. Your responsibilities will include designing,...


  • New York, New York, United States Apollo Solutions Full time

    Site Reliability EngineerApollo Solutions is partnering with a pioneering artificial intelligence business that is revolutionizing the use of AI/ML in gaming and security.The company is working closely with government contracts and gaming console companies and is seeking a Site Reliability Engineer to join their growing team.The Site Reliability Engineer...


  • New York, New York, United States Alchemy Full time

    About the RoleAlchemy is seeking a highly skilled Site Reliability Engineer to join our Infrastructure team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our globally used developer platform.Our mission is to empower builders with the tools they need to create exceptional on-chain products....


  • New York, New York, United States Grafbase, Inc. Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Engineering team at Grafbase, Inc.As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems and services.Key ResponsibilitiesCollaborate with cross-functional teams to develop and deploy software...


  • New York, New York, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok is a leading destination for short-form mobile video, inspiring creativity and bringing joy to millions of users worldwide.Our mission is to provide a secure and reliable platform for users to express themselves, learn, and be entertained.Role OverviewWe are seeking a skilled Site Reliability Engineer to join our U.S....


  • New York, New York, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok is a leading destination for short-form mobile video, inspiring creativity and bringing joy to millions of users worldwide.Our mission is to provide a secure and reliable platform for users to express themselves, learn, and be entertained.Site Reliability Engineering at TikTokAs a Site Reliability Engineer at TikTok, you...


  • New York, New York, United States FLOAT LLC Full time

    About the RoleWe're seeking a highly skilled Site Reliability Engineer to join our team at Float LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key ResponsibilitiesDesign, implement, and maintain scalable and highly available cloud...


  • New York, New York, United States Fourier Ltd Full time

    Site Reliability EngineerFourier Ltd is seeking a skilled Site Reliability Engineer to join our technical operations team. As a Site Reliability Engineer, you will play a critical role in ensuring the superior performance and availability of our production applications throughout the development cycle.Key Responsibilities:Configure and manage multiple...


  • New York, New York, United States Alchemy Full time

    About AlchemyAlchemy is a world-class developer platform that aims to make building on the blockchain easy. Our mission is to bring blockchain to a billion people, and we're committed to delivering high-quality products that delight our customers.The RoleWe're seeking an experienced Infrastructure Engineer to join our team and help us design, deploy, and...


  • New York, New York, United States Alloy Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Infrastructure Team at Alloy. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key ResponsibilitiesDesign, implement, and maintain scalable and highly available cloud...