Site Reliability Engineer

2 weeks ago


New York, United States EVONA Full time

Join Our Client's Team as a Site Reliability Engineer (SRE)

Are you passionate about ensuring the reliability and stability of cutting-edge infrastructure? Do you thrive in collaborative environments where your ideas are valued and your contributions make a real impact? If so, we invite you to apply for the position of Site Reliability Engineer (SRE) with our client's dynamic team.

About Our Client: Our client is a leading organization in the technology sector, committed to pushing the boundaries of innovation. Their team of skilled DevOps engineers works tirelessly to optimize systems for peak reliability, and they are seeking a motivated individual to join them in this endeavor.

Job Responsibilities: Design and implement strategies to enhance system reliability. Utilize tools such as Datadog, AWS, and Kubernetes to monitor and optimize system performance. Develop custom monitoring and alerting solutions to proactively address issues. Adhere to and contribute to SRE best practices to ensure infrastructure stability and resilience. Participate in on-call rotations to promptly respond to incidents.

Qualifications: Required: Bachelor’s degree in Computer Science, Engineering, or related field, or equivalent practical experience. 2+ years of experience with cloud platforms, particularly AWS, and container orchestration tools like Kubernetes. Familiarity with monitoring and observability tools such as Datadog, Prometheus, Grafana, etc. Basic understanding of networking, distributed systems, and microservices architecture. Proficiency in scripting and automation using Python or similar languages. Strong problem-solving skills and ability to troubleshoot effectively. Excellent communication and collaboration skills. Desired: Exposure to DevOps methodologies. Familiarity with infrastructure as code tools like Terraform. Knowledge of CI/CD pipelines and related tools. Basic understanding of containerization technologies such as Docker. Certifications in AWS, Kubernetes, or related areas are a plus.

Join Our Client's Team: If you are ready to contribute to a dynamic team environment and play a pivotal role in shaping the reliability of infrastructure, we encourage you to apply. Competitive compensation and opportunities for professional growth are offered.

How to Apply: Please submit your resume and a cover letter outlining your qualifications and experience relevant to the role. Our client looks forward to reviewing your application.

Equal Opportunity Employer: Our client is an equal opportunity employer and is committed to diversity in the workplace. They encourage applications from all qualified individuals, regardless of race, ethnicity, gender, sexual orientation, age, religion, disability, or any other characteristic protected by law.



  • New York, United States Apollo Solutions Full time

    Site Reliability Engineer - Web3 Apollo Solutions have partnered with an innovative web3 start-up backed by top tier venture capital with a strong runway. They are looking to revolutionize the way way we with about the application of web3 and have already made significant inroads into the gaming, entertainment and finance industries. In this role, you will...


  • New York, United States Unreal Gigs Full time

    Job DescriptionJob DescriptionJob SummaryWe are in search of a Site Reliability Engineer to join our tech startup specializing in infrastructure and authorization solutions. As a Site Reliability Engineer, you'll be pivotal in ensuring the reliability, availability, and performance of our systems. Your role will involve designing, implementing, and...


  • New York, United States Unreal Gigs Full time

    Job Summary We are in search of a Site Reliability Engineer to join our tech startup specializing in infrastructure and authorization solutions. As a Site Reliability Engineer, you'll be pivotal in ensuring the reliability, availability, and performance of our systems. Your role will involve designing, implementing, and maintaining scalable infrastructure...


  • New York, United States Apollo Solutions Full time

    Site Reliability Engineer Apollo Solutions have partnered with a groundbreaking artifical inteligence business who are making major developments in how we use AI/ML for gaming/security. They are working closely with government contracts as well as gaming consoles companys and are now searching for an SRE to join their growing team. The Site Reliability...


  • New York, United States developrec Full time

    SRE Lead/Manager | San Diego, CA | Full-time Role Overview: As the Engineering Manager for Site Reliability, you'll lead the charge in transitioning to cloud-based solutions while ensuring the stability of our existing systems for our rapidly growing user base, currently standing at around one million. You'll spearhead our cloud infrastructure strategy...


  • New York, United States The Cypress Group Full time

    Job Title: Manager of Site Reliability EngineeringCompany Overview:Join a leading global payment processing firm revolutionizing the industry through cutting-edge technology and a commitment to excellence. We are dedicated to providing seamless, secure, and efficient payment solutions to clients worldwide. As we continue to expand and innovate, we are...


  • New York, United States The Cypress Group Full time

    Job Title: Manager of Site Reliability EngineeringCompany Overview:Join a leading global payment processing firm revolutionizing the industry through cutting-edge technology and a commitment to excellence. We are dedicated to providing seamless, secure, and efficient payment solutions to clients worldwide. As we continue to expand and innovate, we are...


  • New York, United States The Cypress Group Full time

    Job Title: Manager of Site Reliability EngineeringCompany Overview:Join a leading global payment processing firm revolutionizing the industry through cutting-edge technology and a commitment to excellence. We are dedicated to providing seamless, secure, and efficient payment solutions to clients worldwide. As we continue to expand and innovate, we are...


  • New York, United States The Cypress Group Full time

    Job Title: Manager of Site Reliability EngineeringCompany Overview:Join a leading global payment processing firm revolutionizing the industry through cutting-edge technology and a commitment to excellence. We are dedicated to providing seamless, secure, and efficient payment solutions to clients worldwide. As we continue to expand and innovate, we are...


  • New York, United States The Cypress Group Full time

    Job Title: Manager of Site Reliability Engineering Company Overview: Join a leading global payment processing firm revolutionizing the industry through cutting-edge technology and a commitment to excellence. We are dedicated to providing seamless, secure, and efficient payment solutions to clients worldwide. As we continue to expand and innovate, we are...


  • New York, United States The Cypress Group Full time

    Job Title: Manager of Site Reliability EngineeringCompany Overview:Join a leading global payment processing firm revolutionizing the industry through cutting-edge technology and a commitment to excellence. We are dedicated to providing seamless, secure, and efficient payment solutions to clients worldwide. As we continue to expand and innovate, we are...


  • New York, United States ICTerGezocht Full time

    Locatie Amsterdam Vacature in het kort Ever thought about the millions who use the ABN AMRO app or website every month? We aim to make their experience secure, personal, and smooth. As a Site Reliability Engineer, you'll have a crucial role in achieving this, working with a diverse team to design and implement top-notch systems. Our grid is a place of...


  • New York, United States InterEx Group Full time

    Senior Site Reliability Engineer PRIMARY ACCOUNTABILITIES Improve the reliability of mission critical solutions, applications, and platforms Software development for enterprises Continuous improvement identification and implementation Manage risks and resolve resolves issues that affect applications Lead efforts to troubleshoot and/or debug issues in any...


  • New York, United States Old Mission Capital Full time

    Old Mission is a global proprietary trading firm that leverages state-of-the-art technology and research to identify and execute profitable trading strategies across multiple asset classes around the world. Our offices in Chicago, New York, and London are all composed of naturally-curious individuals who thrive in a team environment and constantly strive for...


  • New York, United States Instabase Full time

    At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry.  With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index Ventures, our...


  • New York, United States InterEx Group Full time

    Senior Site Reliability EngineerPRIMARY ACCOUNTABILITIESImprove the reliability of mission critical solutions, applications, and platformsSoftware development for enterprisesContinuous improvement identification and implementationManage risks and resolve resolves issues that affect applicationsLead efforts to troubleshoot and/or debug issues in any...


  • New York, United States Citadel Securities Americas Services LLC Full time

    Site Reliability Engineer (Citadel Securities Americas Services LLC - New York, NY); Multiple positions available: Collaborate with cross-functional teams, including trading, quantitative, and software engineering teams, to support and enhance Citadel's core suite of trading applications with the latest, most cutting edge technology in order to proactively...


  • New York, United States STONE Resource Group Full time

    Note: We are NOT able to work with 3rd party vendors OR on a C2C basis for this position. Overview STONE Resource Group is partnered with a leading company in the Financial Services Industry looking to add a Site Reliability Engineer to their team in Boise, ID. This is a hybrid, contract-to-hire opportunity offering growth potential and advanced technology...


  • New York, United States STONE Resource Group Full time

    Note: We are NOT able to work with 3rd party vendors OR on a C2C basis for this position. Overview STONE Resource Group is partnered with a leading company in the Financial Services Industry looking to add a Site Reliability Engineer to their team in Boise, ID. This is a hybrid, contract-to-hire opportunity offering growth potential and advanced technology...


  • New York, United States Nationstaff Full time

    About This Role We are seeking a talented Site Reliability Engineer with experience in building and maintaining continuous integration, automating programmatic tasks, deploying applications, configuration management, and monitoring and maintaining the uptime of the platform. The Site Reliability Engineer will be an expert in Linux, is passionate about open...