Site Reliability Engineer

2 weeks ago


New York, United States Unreal Gigs Full time
Job Description

Job Summary

We are in search of a Site Reliability Engineer to join our tech startup specializing in infrastructure and authorization solutions. As a Site Reliability Engineer, you'll be pivotal in ensuring the reliability, availability, and performance of our systems. Your role will involve designing, implementing, and maintaining scalable infrastructure solutions to support our expanding customer base. This presents an exciting opportunity to thrive in a dynamic environment and contribute to the success of a company revolutionizing authorization systems globally.

Responsibilities

  • Design, implement, and maintain scalable infrastructure solutions for projects, products, and customers.
  • Monitor and analyze system performance, addressing bottlenecks and issues for optimal performance and reliability.
  • Automate infrastructure deployment and configuration management processes.
  • Enhance system reliability, security, and efficiency through proactive monitoring, capacity planning, and performance tuning.
  • Resolve complex infrastructure and application issues in production and test environments.
  • Collaborate with software engineering teams to develop resilient, scalable, and secure systems.
  • Participate in on-call rotation and promptly respond to production incidents.
  • Document system configurations, troubleshooting procedures, and operational guidelines.

Requirements

  • Proven experience as a Site Reliability Engineer or in a similar role.
  • Strong grasp of networking, operating systems, and cloud infrastructure.
  • Proficiency in Site Reliability Engineering, System Design, and Distributed Computing.
  • Familiarity with programming languages and runtimes such as Node.js, Java, Python, Ruby, and Go.
  • Knowledge of containerization technologies like Docker and Kubernetes.
  • Experience with infrastructure-as-code tools like Terraform and Pulumi.
  • Familiarity with monitoring and logging tools such as Prometheus, Grafana, and the ELK stack.
  • Experience with relational databases and, ideally, exposure to distributed SQL databases such as Google Cloud Spanner or CockroachDB.
  • Proficiency in Git and GitHub.
  • Experience with continuous integration and deployment systems.
  • Strong problem-solving and troubleshooting abilities.
  • Excellent communication and collaboration skills.

Benefits

  • Competitive salary package with potential for performance-based bonuses.
  • Comprehensive health and wellness benefits, including medical, dental, and vision coverage.
  • Flexible working hours and remote work options.
  • Opportunities for professional development and career growth.
  • Collaborative and inclusive work environment fostering innovation and creativity.
  • Regular team-building activities and events to promote camaraderie and team cohesion.


  • New York, United States developrec Full time

    SRE Lead/Manager | San Diego, CA | Full-time Role Overview: As the Engineering Manager for Site Reliability, you'll lead the charge in transitioning to cloud-based solutions while ensuring the stability of our existing systems for our rapidly growing user base, currently standing at around one million. You'll spearhead our cloud infrastructure strategy...


  • New York, United States InterEx Group Full time

    Senior Site Reliability Engineer. Primary accountabilities: improve the reliability of mission-critical solutions, applications, and platforms; software development for enterprises; continuous improvement identification and implementation; manage risks and resolve issues that affect applications; lead efforts to troubleshoot and/or debug issues in any...


  • New York, United States The Judge Group, LLC Full time

    Contract: 6+ months | Hybrid: Riverwoods, IL | W2 only, no C2C. Job Responsibilities: Guide full stack developers on the importance of SRE principles. Analyze, design, and deploy new functionality and enhancements with high quality (security, reliability, operations) to production. Build new and analyze current monitoring for applications for...


  • New York, United States Citadel Securities Americas Services LLC Full time

    Site Reliability Engineer (Citadel Securities Americas Services LLC - New York, NY); Multiple positions available: Collaborate with cross-functional teams, including trading, quantitative, and software engineering teams, to support and enhance Citadel's core suite of trading applications with the latest, most cutting-edge technology in order to proactively...


  • New York, United States Nationstaff Full time

    About This Role: We are seeking a talented Site Reliability Engineer with experience in building and maintaining continuous integration, automating programmatic tasks, deploying applications, configuration management, and monitoring and maintaining the uptime of the platform. The Site Reliability Engineer will be an expert in Linux and passionate about open...


  • New York, United States Gallery Systems Full time

    Job Summary: We are seeking a Site Reliability Engineer (SRE) with 3-5 years of experience to join our team at Gallery Systems. The SRE will play a critical role in overseeing the reliability, performance, and scalability of our systems in a Microsoft/Linux environment. The ideal candidate will bring expertise and best practices from previous...


  • New York, United States Hale Recruiting Full time

    Summary - Site Reliability Engineer (For one of the Big 4 Sports & Entertainment Leagues) Our client is enhancing the landscape of the live sports and entertainment industry. They are striving to deliver innovative, cutting-edge technologies to enable safe, unforgettable fan experiences across the globe. They are assembling a world-class technology team to...


  • New York, United States Sesame Workshop Full time

    Sesame Workshop is seeking a Junior Site Reliability Engineer. Sesame Workshop is an independent nonprofit organization dedicated to helping children grow smarter, stronger, and kinder. This role is within the Digital Media Engineering (DME) group, which is part of the Technology and Engineering department, and will help provide support for our...


  • New York, United States Mondrian Alpha Full time

    A leading systematic multi-strat fund is seeking an experienced site reliability engineer to join a team of senior engineers to focus on varying platforms throughout the business. SREs here combine software and systems engineering experience to build, maintain, and improve systems that power the company's investment strategies. The right candidate will come...


  • New York, United States InterEx Group Full time

    Role: Senior Site Reliability Engineer. Primary accountabilities: improve the reliability of mission-critical solutions, applications, and platforms; software development for enterprises; continuous improvement identification and implementation; manage risks and resolve issues that affect applications; lead efforts to troubleshoot and/or debug issues in...


  • New York, United States PEX Full time

    SITE RELIABILITY ENGINEER SUMMARY: Since 2006, PEX has been on a steady march to build and evolve a solution that helps improve the way organizations operate in order to make them more efficient, more nimble, and more competitive. PEX has evolved into a robust, secure SaaS solution with a deep suite of workforce spend management capabilities, advanced...


  • New York, United States Fourier Ltd Full time

    Join a growing team to support, maintain, and improve their automated trading systems. You'll be working in a fast-paced, agile trading environment at the world's most successful hedge fund. This company looks for the most talented engineers on the market and rewards them accordingly. Build and...