Site Reliability Engineer DevOps

3 weeks ago


New York, United States PEX Full time
Job DescriptionJob Description

SITE RELIABILITY ENGINEER 

SUMMARY: 

Since 2006 PEX has been on a steady march to build and evolve a solution that helps improve the way organizations operate in order to make them more efficient, more nimble, and more competitive. 

PEX has evolved into a robust, secure SaaS solution with a deep suite of workforce spend management capabilities, advanced card controls, real-time visibility into card usage, and improved reconciliation processes. More importantly, we are providing a better, more effective solution for thousands of companies and hundreds of thousands of people in the workforce. We work each day to find new ways we can help our clients operate more efficiently. 

Our environment is a mix of Windows and Linux machines that reside on-premise and in the cloud. It is crucial that all work is performed under strict adherence to PCI DSS requirements, and our environment is required to be available 24x7. 

WHO YOU ARE: 

As a Site Reliability Engineer, you will be responsible for planning, production, and engagement with software developers and infrastructure engineers to integrate software development and delivery. 

WHAT YOU’LL DO: 

● Architectural oversight and ownership of web delivery stack - from the server/service to the end-user. 

● Continuous improvement of system and application monitoring and automation

● Ensuring sufficient monitoring of infrastructure, systems, and application availability, performance, and capacity 

● Ensuring sufficient monitoring of the availability, latency, scalability, and efficiency of all services 

● Promoting availability and stability in a 24/7 high-availability environment

● Participating in an on-call rotation

REQUIRED SKILLS & QUALIFICATION 

● Strong experience with Linux and at least one programming language (e.g. Python, Go, Ruby) 

● Experience with containerization and orchestration technologies such as Docker and Kubernetes 

● Experience with cloud infrastructure (e.g. Azure, AWS, GCP) as well as Infrastructure-as-Code tooling (e.g. Terraform) and CI/CD practices. 

● Familiarity with monitoring, tracing, and logging tools (e.g. Zabbix, SumoLogic), including concepts such as SLI/SLO and error budgets. 

● Strong problem-solving skills and ability to troubleshoot complex issues

● Strong communication skills and ability to work well in a team 

● Experience with incident management and incident response 

● Strong understanding of networking protocols and concepts 

● Understanding of security concepts and best practices 

● Strong understanding of system performance metrics and how to interpret them

● Ability to operate individually and as part of a team. 


 

Powered by JazzHR

d7iSo9yC3Z



  • New York, United States InterEx Group Full time

    Senior Site Reliability Engineer PRIMARY ACCOUNTABILITIES Improve the reliability of mission critical solutions, applications, and platforms Software development for enterprises Continuous improvement identification and implementation Manage risks and resolve resolves issues that affect applications Lead efforts to troubleshoot and/or debug issues in any...


  • New York, United States InterEx Group Full time

    Senior Site Reliability Engineer PRIMARY ACCOUNTABILITIES Improve the reliability of mission critical solutions, applications, and platforms Software development for enterprises Continuous improvement identification and implementation Manage risks and resolve resolves issues that affect applications Lead efforts to troubleshoot and/or debug issues in any...


  • New York, United States developrec Full time

    SRE Lead/Manager | San Diego, CA | Full-time Role Overview: As the Engineering Manager for Site Reliability, you'll lead the charge in transitioning to cloud-based solutions while ensuring the stability of our existing systems for our rapidly growing user base, currently standing at around one million. You'll spearhead our cloud infrastructure strategy...


  • New York, United States InterEx Group Full time

    Senior Site Reliability EngineerPRIMARY ACCOUNTABILITIESImprove the reliability of mission critical solutions, applications, and platformsSoftware development for enterprisesContinuous improvement identification and implementationManage risks and resolve resolves issues that affect applicationsLead efforts to troubleshoot and/or debug issues in any...


  • New York, United States Citadel Securities Americas Services LLC Full time

    Site Reliability Engineer (Citadel Securities Americas Services LLC - New York, NY); Multiple positions available: Collaborate with cross-functional teams, including trading, quantitative, and software engineering teams, to support and enhance Citadel's core suite of trading applications with the latest, most cutting edge technology in order to proactively...


  • New York, United States PEX Full time

    ​ SITE RELIABILITY ENGINEER SUMMARY: Since 2006 PEX has been on a steady march to build and evolve a solution that helps improve the way organizations operate in order to make them more efficient, more nimble, and more competitive. PEX has evolved into a robust, secure SaaS solution with a deep suite of workforce spend management capabilities, advanced...


  • New York, United States Gallery Systems Full time

    Job Summary: Job Description: We are seeking a Site Reliability Engineer (SRE) with 3-5 years experience to join our team at Gallery Systems. The SRE will play a critical role in overseeing the reliability, performance, and scalability of our systems in a Microsoft/Linux environment. The ideal candidate will bring expertise and best practices from previous...


  • New York, United States InterEx Group Full time

    ROLE: Senior Site Reliability Engineer PRIMARY ACCOUNTABILITIES Improve the reliability of mission-critical solutions, applications, and platforms Software development for enterprises Continuous improvement identification and implementation Manage risks and resolve resolves issues that affect applications Lead efforts to troubleshoot and/or debug issues in...


  • New York, United States Synergis Full time

    SRE/Dynatrace Lead Contract to hire - W2 Remote - Candidate MUST be in Georgia or Alabama Job Description The Site Reliability Engineer Lead will work with stakeholders to define SLOs and SLIs as well as develop the overall SRE strategy and roadmap. The ideal candidate will develop a depth of understanding of how all the systems work together, how they fail,...


  • New York, United States PEX Full time

    Job DescriptionJob Description​SITE RELIABILITY ENGINEER SUMMARY: Since 2006 PEX has been on a steady march to build and evolve a solution that helps improve the way organizations operate in order to make them more efficient, more nimble, and more competitive. PEX has evolved into a robust, secure SaaS solution with a deep suite of workforce spend...


  • New York, United States Nationstaff Full time

    About This Role We are seeking a talented Site Reliability Engineer with experience in building and maintaining continuous integration, automating programmatic tasks, deploying applications, configuration management, and monitoring and maintaining the uptime of the platform. The Site Reliability Engineer will be an expert in Linux, is passionate about open...


  • New York, United States Nationstaff Full time

    About This Role We are seeking a talented Site Reliability Engineer with experience in building and maintaining continuous integration, automating programmatic tasks, deploying applications, configuration management, and monitoring and maintaining the uptime of the platform. The Site Reliability Engineer will be an expert in Linux, is passionate about open...


  • New York, United States InterEx Group Full time

    ROLE: Senior Site Reliability EngineerPRIMARY ACCOUNTABILITIESImprove the reliability of mission-critical solutions, applications, and platformsSoftware development for enterprisesContinuous improvement identification and implementationManage risks and resolve resolves issues that affect applicationsLead efforts to troubleshoot and/or debug issues in any...


  • New York, New York, United States Sesame Workshop Full time

    Sesame Workshop is seeking a Junior Site Reliability Engineer. Sesame Workshop is an independent nonprofit organization dedicated to helping children grow smarter, stronger, and kinder. This role is within the Digital Media Engineering (DME) group which is part of the Technology and Engineering department and will help provide support for our diverse media...


  • New York, United States Sesame Workshop Full time

    Job Description Sesame Workshop is seeking a Junior Site Reliability Engineer. Sesame Workshop is an independent nonprofit organization dedicated to helping children grow smarter, stronger, and kinder. This role is within the Digital Media Engineering (DME) group which is part of the Technology and Engineering department and will help provide support for our...


  • New York, United States Hale Recruiting Full time

    Summary - Site Reliablity Engineer (For one of the Big 4 Sports &Entertainment League) Our client is enhancing the landscape of the live sports and entertainment industry. They are striving to deliver innovative, cutting-edge technologies to enable safe, unforgettable fan experiences across the globe. They are assembling a world-class technology team to...

  • Devops Engineer

    4 weeks ago


    New York, United States Huntress Talent Full time

    The ideal candidate is self-driven, data-driven, and can work in a distributed team. This professional hold strong knowledge of Site Reliability Engineering and DevOps methodologies related to Delivery solutions & Platform Automation.In this role you will be part of the Site Reliability team, sharing your experience in the field with our Delivery, Support,...

  • Devops Engineer

    2 weeks ago


    New York, United States Huntress Talent Full time

    Job DescriptionJob DescriptionThe ideal candidate is self-driven, data-driven, and can work in a distributed team. This professional hold strong knowledge of Site Reliability Engineering and DevOps methodologies related to Delivery solutions & Platform Automation.In this role you will be part of the Site Reliability team, sharing your experience in the field...


  • New York, United States Unreal Gigs Full time

    Job Summary We are in search of a Site Reliability Engineer to join our tech startup specializing in infrastructure and authorization solutions. As a Site Reliability Engineer, you'll be pivotal in ensuring the reliability, availability, and performance of our systems. Your role will involve designing, implementing, and maintaining scalable infrastructure...


  • New York, United States Unreal Gigs Full time

    Job DescriptionJob DescriptionJob SummaryWe are in search of a Site Reliability Engineer to join our tech startup specializing in infrastructure and authorization solutions. As a Site Reliability Engineer, you'll be pivotal in ensuring the reliability, availability, and performance of our systems. Your role will involve designing, implementing, and...