Reliability Engineer

2 weeks ago


New York, New York, United States Russell Tobin & Associates Full time



What are we seeking in our Reliability Engineer?

Position: SRE - Production Support

Location: Hybrid

Compensation: $50 – 55 /hr. W2

Employment Type: Contract to Permanent

Job Overview:

Key Responsibilities:

  • Oversee and rectify system errors and disruptions. Document resolutions and manage incidents following the ITIL lifecycle. Collaborate with upstream data providers to address issues. Respond to user inquiries and operational requests, documenting and reviewing support scenarios.
  • Compile and present stability reports and analyses. Evaluate alert and stability trends, providing recommendations. Investigate root causes of issues and educate developers to mitigate these causes.
  • Automate the resolution of common issues, routine investigations, and user requests through scripting or available programming platforms. Lead projects focused on reliability and business needs.
  • Collaborate closely with engineering and development teams to design, build, and maintain systems, assisting in product selection, schema design, and query optimization.
  • Troubleshoot issues across the entire technology stack, including hardware, software, applications, and networks.
  • Mentor fellow SREs on best practices for monitoring and troubleshooting complex code and database challenges.
  • Identify and pursue opportunities to enhance automation within the organization; scope and develop automation for deployment, management, and visibility of services.
  • Represent the SRE team in design reviews and operational readiness assessments for both new and existing services.
  • Participate in on-call rotations and periodic conference calls with specialists across different time zones.

Required Qualifications:

  • Practical experience with UNIX systems.
  • Hands-on experience with SQL-based databases.
  • Experience in Three Tier Support with databases such as IBM, DB2, Sybase, Mongo, Green Plum, KDB.
  • Strong analytical and communication skills.
  • Ability to prioritize tasks and take ownership of responsibilities.
  • Problem-solving mindset with a focus on enabling solutions.
  • Excellent troubleshooting and debugging capabilities.
  • Familiarity with financial products, including equity and fixed income, securities, and various investment risks.
  • Capability to contribute to system design and architecture with robust database knowledge.

Preferred Skills:

  • Experience with automation activities using scripting languages such as Python, Bash, Perl, or Ruby.
  • Familiarity with enterprise tools like App Dynamic, Grafana, Splunk, and Dynatrace.
  • Understanding of modern software and systems architectures, including load balancing, queuing, caching, and distributed systems.
  • In-depth knowledge of operating system concepts such as processes, memory allocation, and the network stack, with the ability to debug related issues.
  • Practical experience managing large-scale online systems is advantageous.


Russell Tobin offers eligible employees comprehensive healthcare coverage, including medical, dental, and vision plans, as well as supplemental coverage options, 401(k) retirement savings, life and disability insurance, an employee assistance program, legal support, and various insurance options.



#CB


  • Reliability Engineer

    1 month ago


    New York, New York, United States IFF Family of Companies Full time

    IFF in Newark, DE is seeking a Maintenance and Reliability Engineer to join our team The Newark Site is aligned to the pharma solutions business and is a leading manufacturer of microcrystalline cellulose and other pharmaceutical business excipient grade materials. Our site is also a key supplier to the Food & Beverage markets. The Newark Site has an...


  • New York, New York, United States Hyperion Industries Full time

    Company OverviewAt Hyperion Industries, we are on a transformative journey to redefine the landscape of Site Reliability Engineering (SRE). Our leadership team, comprising seasoned professionals from renowned tech giants, is dedicated to creating an AI-enhanced platform that autonomously manages incidents and elevates system uptime and reliability. We thrive...


  • New York, New York, United States Hyperion Industries Full time

    Company OverviewAt Hyperion Industries, we are on a transformative journey to redefine the field of Site Reliability Engineering (SRE). Backed by venture capital and led by visionary leaders with rich backgrounds in AI and engineering from renowned tech giants, we are committed to developing an innovative platform that autonomously manages incidents and...


  • New York, New York, United States Diverse Lynx Full time

    About the Role:Diverse Lynx is seeking a highly skilled Cloud Reliability Engineer to join our team. As a Cloud Reliability Engineer, you will be responsible for ensuring the reliability and efficiency of our cloud-based systems.Key Responsibilities:Design and implement automated workflows to reduce TOIL and improve system reliabilityDevelop and maintain...


  • New York, New York, United States Diverse Lynx Full time

    About the Role:Diverse Lynx is seeking a highly skilled Cloud Reliability Engineer to join our team. As a Cloud Reliability Engineer, you will be responsible for ensuring the reliability and efficiency of our cloud-based systems.Key Responsibilities:Design and implement automated workflows to reduce TOIL and improve system reliabilityDevelop and maintain...


  • New York, New York, United States Hyperion Industries Full time

    Company OverviewAt Hyperion Industries, we are on a transformative journey, spearheaded by a team of experts with rich backgrounds in AI and engineering from leading tech firms. Our goal is to innovate Site Reliability Engineering (SRE) through an advanced AI platform designed to autonomously manage and resolve incidents, thereby enhancing uptime and...


  • New York, New York, United States Instabase Full time

    At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index Ventures, our...


  • New York, New York, United States ASK Consulting Full time

    Job DescriptionImportant Note: All candidates must be directly contracted by ASK Consulting on their payroll and cannot be subcontracted. We are unable to provide sponsorship at this moment.Position: System Reliability EngineerContract Duration: 12+ monthsCompensation: $59.00/hr. on c2cRole Overview:This position focuses on maintaining the stability and...


  • New York, New York, United States Astir IT Solutions, Inc. Full time

    Position: Senior Site Reliability EngineerLocation: Onsite in NJContract Duration: Long-term EngagementCompensation: $50 per hourNote: No OPT/CPT candidates will be considered.We are seeking a highly skilled Senior Site Reliability Engineer (SRE) with subject matter expertise. The ideal candidate will possess exceptional communication skills and the...


  • New York, New York, United States Astir IT Solutions, Inc. Full time

    Position: Senior Site Reliability EngineerLocation: Onsite in New JerseyContract Duration: Long-termCompensation: $50 per hourThis role requires a highly skilled individual with a strong background in Site Reliability Engineering. The ideal candidate will possess exceptional communication abilities and the confidence to engage with executive-level teams.Key...


  • New York, New York, United States Astir IT Solutions, Inc. Full time

    Position: Senior Site Reliability EngineerLocation: Onsite in New JerseyContract Duration: Long-term EngagementCompensation: $50 per hourThis role requires a highly skilled individual with a strong background in Site Reliability Engineering. The ideal candidate will possess exceptional communication abilities and the confidence to engage with executive-level...


  • New York, New York, United States Astir IT Solutions, Inc. Full time

    Position: Senior Site Reliability EngineerLocation: Onsite in New JerseyContract Duration: Long-term EngagementCompensation: $50 per hourThis role requires a seasoned professional with a strong background in Site Reliability Engineering. The ideal candidate will possess exceptional communication skills and the confidence to engage with executive-level...


  • New York, New York, United States Streaming Talent Full time

    Streaming Talent is seeking a highly skilled Site Reliability Engineer to join our client's US team. As a key member of the Site Reliability Team, you will be responsible for ensuring the smooth operation of the company's Content Delivery Network.The ideal candidate will have a strong background in cloud technologies, with experience working with Kubernetes...


  • New York, New York, United States FLOAT LLC Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at FLOAT LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our infrastructure, enabling our engineering teams to focus on delivering high-quality software to our customers.Key ResponsibilitiesContinuously...


  • New York, New York, United States Astir IT Solutions, Inc. Full time

    Position: Senior Site Reliability EngineerLocation: Onsite in New JerseyContract Duration: Long-term EngagementCompensation: $50 per hourThis role requires a highly skilled individual with a strong background in Site Reliability Engineering. The ideal candidate will possess:Exceptional communication skills, with the ability to engage confidently with...


  • New York, New York, United States Russell Tobin & Associates Full time

    What are we seeking in our Reliability Engineer?Position: SRE - Production SupportLocation: Hybrid Work ModelCompensation: $50 – 55 /hr. W2Employment Type: Contract to PermanentJob Overview:Key Responsibilities:Oversee and rectify system errors and disruptions, ensuring thorough documentation of resolutions. Manage incidents in accordance with ITIL best...


  • New York, New York, United States Astir IT Solutions, Inc. Full time

    Position: Senior Site Reliability EngineerLocation: Onsite in New JerseyContract Duration: Long-termCompensation: $50 per hourThis role requires a highly skilled individual with a proven track record in Site Reliability Engineering. The ideal candidate will possess:Exceptional communication abilities and the confidence to engage with executive-level...


  • New York, New York, United States Hudson River Trading Full time

    About the RoleHudson River Trading (HRT) is seeking a highly skilled Senior IT Systems Reliability Engineer to join our IT Solutions Delivery team. This team is responsible for developing and maintaining the corporate productivity stack for the entire firm, both on-prem and in the cloud.Key ResponsibilitiesManage on-premise containerized web services and...


  • New York, New York, United States Hudson River Trading Full time

    Company OverviewHudson River Trading (HRT) employs a scientific methodology in trading financial instruments. We have established one of the most advanced computing environments dedicated to research and development in the field of algorithmic trading.Position SummaryWe are seeking a Senior IT Site Reliability Engineer to enhance our expanding IT Solutions...


  • New York, New York, United States Hudson River Trading Full time

    Company OverviewHudson River Trading (HRT) is a leader in algorithmic trading, utilizing advanced technology and innovative strategies to excel in the financial markets.Position SummaryWe are seeking a Senior IT Site Reliability Engineer to enhance our IT Solutions Delivery team. This team is pivotal in developing and sustaining the corporate productivity...