Site Reliability Engineer DevOps

2 weeks ago


New York, United States PEX Full time

SITE RELIABILITY ENGINEER

SUMMARY:
Since 2006 PEX has been on a steady march to build and evolve a solution that helps improve the way organizations operate in order to make them more efficient, more nimble, and more competitive. PEX has evolved into a robust, secure SaaS solution with a deep suite of workforce spend management capabilities, advanced card controls, real-time visibility into card usage, and improved reconciliation processes. More importantly, we are providing a better, more effective solution for thousands of companies and hundreds of thousands of people in the workforce. We work each day to find new ways we can help our clients operate more efficiently.

Our environment is a mix of Windows and Linux machines that reside on-premise and in the cloud. It is crucial that all work is performed under strict adherence to PCI DSS requirements, and our environment is required to be available 24x7.

WHO YOU ARE:
As a Site Reliability Engineer, you will be responsible for planning, production, and engagement with software developers and infrastructure engineers to integrate software development and delivery.

WHAT YOU’LL DO:
● Architectural oversight and ownership of web delivery stack - from the server/service to the end-user.
● Continuous improvement of system and application monitoring and automation
● Ensuring sufficient monitoring of infrastructure, systems, and application availability, performance, and capacity
● Ensuring sufficient monitoring of the availability, latency, scalability, and efficiency of all services
● Promoting availability and stability in a 24/7 high-availability environment
● Participating in an on-call rotation

REQUIRED SKILLS & QUALIFICATION
● Strong experience with Linux and at least one programming language (e.g. Python, Go, Ruby)
● Experience with containerization and orchestration technologies such as Docker and Kubernetes
● Experience with cloud infrastructure (e.g. Azure, AWS, GCP) as well as Infrastructure-as-Code tooling (e.g. Terraform) and CI/CD practices.
● Familiarity with monitoring, tracing, and logging tools (e.g. Zabbix, SumoLogic), including concepts such as SLI/SLO and error budgets.
● Strong problem-solving skills and ability to troubleshoot complex issues
● Strong communication skills and ability to work well in a team
● Experience with incident management and incident response
● Strong understanding of networking protocols and concepts
● Understanding of security concepts and best practices
● Strong understanding of system performance metrics and how to interpret them
● Ability to operate individually and as part of a team.



  • New York, United States Apollo Solutions Full time

    Site Reliability Engineer - Web3 Apollo Solutions have partnered with an innovative web3 start-up backed by top-tier venture capital with a strong runway. They are looking to revolutionize the way we think about the application of web3 and have already made significant inroads into the gaming, entertainment and finance industries. In this role, you will...


  • New York, United States Apollo Solutions Full time

    Site Reliability Engineer Apollo Solutions have partnered with a groundbreaking artificial intelligence business who are making major developments in how we use AI/ML for gaming/security. They are working closely with government contracts as well as gaming console companies and are now searching for an SRE to join their growing team. The Site Reliability...


  • New York, United States EVONA Full time

    Join Our Client's Team as a Site Reliability Engineer (SRE) Are you passionate about ensuring the reliability and stability of cutting-edge infrastructure? Do you thrive in collaborative environments where your ideas are valued and your contributions make a real impact? If so, we invite you to apply for the position of Site Reliability Engineer (SRE) with...


  • New York, United States The Greene Group Full time

    A major financial services company in NYC is growing its team rapidly, and they are looking for a Senior DevOps Engineer / Site Reliability Engineer who can join. If you’re passionate about high-availability, reliability, automation, we’d be excited to talk to you. Some technologies we are currently using: Linux, Docker,...


  • New York, United States InterEx Group Full time

    Senior Site Reliability Engineer PRIMARY ACCOUNTABILITIES: Improve the reliability of mission critical solutions, applications, and platforms; Software development for enterprises; Continuous improvement identification and implementation; Manage risks and resolve issues that affect applications; Lead efforts to troubleshoot and/or debug issues in any...


  • New York, United States developrec Full time

    SRE Lead/Manager | San Diego, CA | Full-time Role Overview: As the Engineering Manager for Site Reliability, you'll lead the charge in transitioning to cloud-based solutions while ensuring the stability of our existing systems for our rapidly growing user base, currently standing at around one million. You'll spearhead our cloud infrastructure strategy...


  • New York, United States InterEx Group Full time

    Senior Site Reliability Engineer PRIMARY ACCOUNTABILITIES: Improve the reliability of mission critical solutions, applications, and platforms; Software development for enterprises; Continuous improvement identification and implementation; Manage risks and resolve issues that affect applications; Lead efforts to troubleshoot and/or debug issues in any...


  • New York, United States Citadel Securities Americas Services LLC Full time

    Site Reliability Engineer (Citadel Securities Americas Services LLC - New York, NY); Multiple positions available: Collaborate with cross-functional teams, including trading, quantitative, and software engineering teams, to support and enhance Citadel's core suite of trading applications with the latest, most cutting edge technology in order to proactively...


  • New York, New York, United States Particle Health Full time

    At Particle Health, our mission is to unlock the power of medical records in an intelligent platform that focuses healthcare back on the patient. Our energy is spent connecting to people's diverse sets of medical data, making that data useful in different settings, and designing an effortless way to share that information with any organization a person...


  • New York, United States InterEx Group Full time

    ROLE: Senior Site Reliability Engineer PRIMARY ACCOUNTABILITIES: Improve the reliability of mission-critical solutions, applications, and platforms; Software development for enterprises; Continuous improvement identification and implementation; Manage risks and resolve issues that affect applications; Lead efforts to troubleshoot and/or debug issues in...


  • New York, United States PEX Full time

    SITE RELIABILITY ENGINEER SUMMARY: Since 2006 PEX has been on a steady march to build and evolve a solution that helps improve the way organizations operate in order to make them more efficient, more nimble, and more competitive. PEX has evolved into a robust, secure SaaS solution with a deep suite of workforce spend management capabilities,...


  • New York, United States PEX Full time

    SITE RELIABILITY ENGINEER SUMMARY: Since 2006 PEX has been on a steady march to build and evolve a solution that helps improve the way organizations operate in order to make them more efficient, more nimble, and more competitive. PEX has evolved into a robust, secure SaaS solution with a deep suite of workforce spend...


  • New York, United States STONE Resource Group Full time

    Note: We are NOT able to work with 3rd party vendors OR on a C2C basis for this position. Overview STONE Resource Group is partnered with a leading company in the Financial Services Industry looking to add a Site Reliability Engineer to their team in Boise, ID. This is a hybrid, contract-to-hire opportunity offering growth potential and advanced technology...


  • New York, United States Nationstaff Full time

    About This Role We are seeking a talented Site Reliability Engineer with experience in building and maintaining continuous integration, automating programmatic tasks, deploying applications, configuration management, and monitoring and maintaining the uptime of the platform. The Site Reliability Engineer will be an expert in Linux, is passionate about open...


  • New York, United States Instabase Full time

    At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index Ventures, our...


  • New York, United States Tickets.com Full time

    Tickets.com, an MLB company, delivers innovative, cutting-edge technologies to enable frictionless and unforgettable fan experiences in venues across the globe. Together with MLB, Tickets.com is changing the landscape of the live sports and entertainment industry, delivering new digital venue and ticketing experiences to millions of fans. Our Technology team...


  • New York, United States InterEx Group Full time

    ROLE: Senior Site Reliability Engineer PRIMARY ACCOUNTABILITIES: Improve the reliability of mission-critical solutions, applications, and platforms; Software development for enterprises; Continuous improvement identification and implementation; Manage risks and resolve issues that affect applications; Lead efforts to troubleshoot and/or debug issues in any...


  • New York, United States ICTerGezocht Full time

    Location: Amsterdam. Vacancy in brief: Ever thought about the millions who use the ABN AMRO app or website every month? We aim to make their experience secure, personal, and smooth. As a Site Reliability Engineer, you'll have a crucial role in achieving this, working with a diverse team to design and implement top-notch systems. Our grid is a place of...