Site Reliability Engineer

1 month ago


New York, United States Russell Tobin & Associates Full time



What are we looking for in our Site Reliability Engineer?

SRE - Prod Support

NY, NY (Hybrid 3 days)

$50 – 55 /hr. W2

Contract to Permanent

Job Description:

Responsibilities:

  • Monitor, resolve system errors, disruptions. Document resolution. Manage Incident as per ITIL lifecycle. Liaise with upstream data providers to resolve issues. Respond to and solve inquiries and operations requested by users. Document, Review, handling and resolution steps for support scenarios.
  • Prepare and present stability reports and presentations. Analyze Alert and Stability trends and make recommendations. Investigate root cause of the issues, Inform and educate developers about the cause so that developers and mitigate the root cause.
  • Automate (1) Resolution of common problems (2) Routine investigations (3) Routine user requests using scripts or available programming platform. Lead reliability or business driven projects. Perform reliability engineering
  • You will work closely with engineering/development teams to design, build, and maintain systems and help them decide on products to use, schema design and query tuning.
  • You will troubleshoot issues across the entire stack: hardware, software, application and network.
  • You will mentor other SREs on standard methodology for everything from monitoring to troubleshooting complex code and database issues.
  • You will identify and drive opportunities to improve automation for the company; scope and create automation for deployment, management and visibility of our services.
  • Represent the SRE organization in design reviews and operational readiness exercises for new and existing services.
  • Participate in on-call rotation and periodic conference calls with other specialists from other time zones.

Required Experience:

  • Hands on experience with UNIX
  • Hands on experience with SQL based database
  • Three Tier Support experience with DBs such as IBM, DB2, Sybase, Mongo, Green Plum, KDB
  • Excellent analytical and communication skills
  • Ability to prioritize and willingness to take ownership
  • Problem solving mindset and solution enabler
  • Great Problem trouble shooting and debug ability
  • Familiar with Financial Products like Equity and Fixed Income, securities, different type of risks in an investment bank, Trade flow
  • Should be able to contribute in system design and architecture with strong database knowledge.

Desired Skills

  • Knowledge of Automation Related activities using scripting languages such as Python, Bash, Perl, Ruby
  • Experience using Enterprise Tools such as App Dynamic, Grafana, Splunk, Dynatrace
  • Awareness of, and ability to reason about modern software & systems architectures, including load-balancing, queueing, caching, distributed systems failure modes generally, micro services, and so on.
  • Deep understanding of operating system level concepts such as processes, memory allocation, and the network stack, understanding of how applications are affected by the above, and ability to debug same.
  • Generally speaking, practical experience running large scale online systems is always an advantage.


Russell Tobin offers eligible employee’s comprehensive healthcare coverage (medical, dental, and vision plans), supplemental coverage (accident insurance, critical illness insurance and hospital indemnity), 401(k)-retirement savings, life & disability insurance, an employee assistance program, legal support, auto, home insurance, pet insurance and employee discounts with preferred vendors.


#CB



  • New York, United States Automatic Data Processing Full time

    ADP is hiring a Site Reliability Engineer. Do you thrive in a challenging environment, love production systems, curious by nature with a thirst for pushing the limits? Are you inspired by transformation and making an impact on the lives of millions o Reliability Engineer, Liability, Reliability, Engineer, Reliability, Operations, Manufacturing


  • New York, United States Unreal Gigs Full time

    Job DescriptionJob DescriptionJob SummaryWe are in search of a Site Reliability Engineer to join our tech startup specializing in infrastructure and authorization solutions. As a Site Reliability Engineer, you'll be pivotal in ensuring the reliability, availability, and performance of our systems. Your role will involve designing, implementing, and...


  • New York, United States Unreal Gigs Full time

    Job DescriptionJob DescriptionJob SummaryWe are in search of a Site Reliability Engineer to join our tech startup specializing in infrastructure and authorization solutions. As a Site Reliability Engineer, you'll be pivotal in ensuring the reliability, availability, and performance of our systems. Your role will involve designing, implementing, and...


  • New York, United States Unreal Gigs Full time

    Job Summary We are in search of a Site Reliability Engineer to join our tech startup specializing in infrastructure and authorization solutions. As a Site Reliability Engineer, you'll be pivotal in ensuring the reliability, availability, and performance of our systems. Your role will involve designing, implementing, and maintaining scalable infrastructure...


  • New York, United States RedTech Recruitment Full time

    Site Reliability Engineer – Graduates consideredWe are excited to be able to offer this Site Reliability Engineer role working for an industry-leading software company. This company has won several awards and is pioneering in their machine learning technology. Founded 8 years ago, with a team of 150 brilliant engineers, they are already renowned as having...


  • New York, United States Hyperion Industries Full time

    Company DescriptionJoin us on an exhilarating mission at Hyperion, a VC-backed startup working with Tim Hwang, CEO of FiscalNote (NYSE: NOTE). Our co-founders, with their extensive AI and engineering backgrounds from Google, Amazon, Workday, and Instacart are leading the charge. Our mission is to revolutionize Site Reliability Engineering (SRE) with an...


  • New York, United States Hyperion Industries Full time

    Company DescriptionJoin us on an exhilarating mission at Hyperion, a VC-backed startup working with Tim Hwang, CEO of FiscalNote (NYSE: NOTE). Our co-founders, with their extensive AI and engineering backgrounds from Google, Amazon, Workday, and Instacart are leading the charge. Our mission is to revolutionize Site Reliability Engineering (SRE) with an...


  • New York, United States Mondrian Alpha Full time

    An industry leading systematic trading fund is seeking highly skilled Site Reliability Engineers to join a team responsible for engineering and supporting the companies critical infrastructure platforms. This team also handles the centralized development infrastructure and works alongside engineering teams across the business assure the optimal route of...


  • New York, United States ICTerGezocht Full time

    Locatie Amsterdam Vacature in het kort Ever thought of how many people log in to the app or Internet Banking website each month? Over five million! The objective of the Personal Banking Grid is to ensure that each visit is not only secure but also a personal and smooth experience. As a Site Reliability Engineer, you play a key role in this mission. You will...


  • New York, United States Instabase Full time

    At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index Ventures, our...


  • New York, United States InterEx Group Full time

    Senior Site Reliability Engineer PRIMARY ACCOUNTABILITIES Improve the reliability of mission critical solutions, applications, and platforms Software development for enterprises Continuous improvement identification and implementation Manage risks and resolve resolves issues that affect applications Lead efforts to troubleshoot and/or debug issues in any...


  • New York, New York, United States Instabase Full time

    At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index Ventures, our...


  • New York, United States Hebbia Full time

    About Hebbia The user interface for AGI - Hebbia is AI that works the way you work. Designed to be generally capable- it can tackle even the most complex tasks, citing answers over any amount of sources. By showing its work, Hebbia empowers users to collaborate with AI on each step and validate responses instead of blindly trusting them. Our mission is to...


  • New York, New York, United States Astir IT Solutions, Inc. Full time

    Position: Senior Site Reliability EngineerLocation: Onsite in NJContract Duration: Long-term EngagementCompensation: $50 per hourNote: No OPT/CPT candidates will be considered.We are seeking a highly skilled Senior Site Reliability Engineer (SRE) with subject matter expertise. The ideal candidate will possess exceptional communication skills and the...


  • New York, New York, United States Streaming Talent Full time

    Streaming Talent is seeking a highly skilled Site Reliability Engineer to join our client's US team. As a key member of the Site Reliability Team, you will be responsible for ensuring the smooth operation of the company's Content Delivery Network.The ideal candidate will have a strong background in cloud technologies, with experience working with Kubernetes...


  • New York, New York, United States Astir IT Solutions, Inc. Full time

    Position: Senior Site Reliability EngineerLocation: Onsite in New JerseyContract Duration: Long-termCompensation: $50 per hourThis role requires a highly skilled individual with a strong background in Site Reliability Engineering. The ideal candidate will possess exceptional communication abilities and the confidence to engage with executive-level teams.Key...


  • New York, United States InterEx Group Full time

    Senior Site Reliability EngineerPRIMARY ACCOUNTABILITIESImprove the reliability of mission critical solutions, applications, and platformsSoftware development for enterprisesContinuous improvement identification and implementationManage risks and resolve resolves issues that affect applicationsLead efforts to troubleshoot and/or debug issues in any...


  • New York, New York, United States Astir IT Solutions, Inc. Full time

    Position: Senior Site Reliability EngineerLocation: Onsite in New JerseyContract Duration: Long-term EngagementCompensation: $50 per hourThis role requires a highly skilled individual with a strong background in Site Reliability Engineering. The ideal candidate will possess exceptional communication abilities and the confidence to engage with executive-level...


  • New York, New York, United States Astir IT Solutions, Inc. Full time

    Position: Senior Site Reliability EngineerLocation: Onsite in New JerseyContract Duration: Long-term EngagementCompensation: $50 per hourThis role requires a highly skilled individual with a strong background in Site Reliability Engineering. The ideal candidate will possess:Exceptional communication skills, with the ability to engage confidently with...


  • New York, New York, United States Astir IT Solutions, Inc. Full time

    Position: Senior Site Reliability EngineerLocation: Onsite in New JerseyContract Duration: Long-term EngagementCompensation: $50 per hourThis role requires a seasoned professional with a strong background in Site Reliability Engineering. The ideal candidate will possess exceptional communication skills and the confidence to engage with executive-level...