Site Reliability Engineer

4 days ago


New York, NY, United States Software Guidance and Assistance, Inc. Full time

Software Guidance & Assistance, Inc., (SGA), is searching for an pplication Support and SRE Engineer for a CONTRACT assignment with one of our premier Financial Services clients in New York, NY .

The team supports strategic initiatives like modernization, containerization, observability, SRE, DevOps and automation.

This team also partners with central technology teams like Infrastructure, Security, Network, Database and procurement to design and deliver solutions. The team leverages a variety of tools for platform design and application troubleshooting as they also provide elevated level 4 production support to the application operations teams.

Responsibilities :

  • Involved in application support, application server administration, technical troubleshooting of infrastructure and user incidents
  • Incorporate Site Reliability Engineering practices into the day-to-day role by developing automated solutions to long-standing problems to ensure minimal downtime and reduce toil
  • Experience with web architecture implementation including performance, availability, scalability, and disaster recovery planning.
  • Experience with monitoring and alerting tools, configuring application monitors using industry standard monitoring tools, as well as developing customized monitoring solutions
  • Revisit SRE Metrics and confirm against the firm and department goals
  • Identify areas for improvement including automation, toil reduction, resiliency and observability across the platforms and help build up the knowledge and documentation for the team
  • Partner with other teams such as enterprise infrastructure, networking, security, storage, and database and data center to roll out application platforms successfully as per the design.
  • Produce reusable infrastructure designs patterns and periodically review / refresh the patterns.
  • Support vendor / vendor technology onboarding following the Morgan Stanley best practices and security blueprint.
  • pply technical skills to automate daily support functions, improve system stability, support hygiene initiatives and deliver innovation that creates efficiency and consistency.
  • Occasional weekend availability and on-call work on a rotation basis.
Required Skills:
  • 7 to 12 years in a similar role of hands-on application / middleware specialist.
  • Strong infrastructure knowledge in Linux / Unix, Databases, Storage and Networking technologies.
  • Hands-on experience with containers and container orchestration platforms OpenShift / Kubernetes
  • Experience with scripting in Python and Shell
  • Hands-on experience of web servers (Apache / Nginx), application integration, configuration, and troubleshooting.
  • Clear concept of load balancer, web proxies and storage platforms like NAS / SAN from an implementation perspective only.
  • Familiar with basic security practices to ensure secure hosting solutions, including single sign-on (SSO) and standard encryption protocols.
  • Prior experience managing large web-based n-tier applications in secure environments on cloud
  • Strong knowledge SRE Principles with grasp over tools / approach to apply them
  • Strong infrastructure knowledge in Storage, Networking and Databases
  • Experience in troubleshooting Application Issues and Managing Incidents
  • Exposure to tools like Prometheus, Grafana, and Open Telemetry framework
  • Excellent verbal and written communication skills.
Preferred Skills:
  • Prior experience of working in a global financial organization is an advantage
  • Exposure and experience with data pipeline technologies such as Kafka, Redis and Airflow
  • Exposure to Big Data platforms like Hadoop / Cloudera and ELK Stack
  • Capacity planning and performance tuning exercise
  • Identity management protocols like OIDC / OAuth, SAML, LDAP integration
  • Cloud Application and infrastructure knowledge is a plus.
  • Experience in Cloud / Distributed computing technology or certification is a plus
SGA is a technology and resource solutions provider driven to stand out. We are a women-owned business. Our mission: to solve big IT problems with a more personal, boutique approach. Each year, we match consultants like you to more than 1,000 engagements. When we say let's work better together, we mean it. You'll join a diverse team built on these core values: customer service, employee development, and quality and integrity in everything we do. Be yourself, love what you do and find your passion at work. Please find us at https://sgainc.com/ .

SGA is an Equal Opportunity Employer and does not discriminate on the basis of Race, Color, Sex, Sexual Orientation, Gender Identity, Religion, National Origin, Disability, Veteran Status, Age, Marital Status, Pregnancy, Genetic Information, or Other Legally Protected Status. We are committed to providing access, equal opportunity, and reasonable accommodation for individuals with disabilities in employment, and our services, programs, and activities. Please visit our company EEO page to request an accommodation or assistance regarding our policy.

  • New York, NY, United States Writer Corporation Full time

    About this role We are looking for a foundational member of the Cloud infrastructure team at WRITER. This role will involve contributing to the development and implementation of our Site reliability engineering (SRE) program. The ideal candidate will ensure the reliability, scalability, performance, and security of WRITER's critical systems, taking a...


  • New York, NY, United States Jobot Full time

    Are you a hands-on leader within the DevOps/SRE space? Are you a supporter of responsible AI adoption & cybersecurity? This opportunity requires an incredibly versatile SRE to handle both hands on & strategic initiatives while leading a team! This Jobot Job is hosted by: Craig Rosecrans Are you a fit? Easy Apply now by clicking the "Apply" button and sending...


  • New York, NY, United States Patreon Full time

    Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans and build a lasting business including: paid memberships, free memberships, community chats, live video, and selling to fans directly with one-time purchases....


  • New York, NY, United States Patreon Full time

    Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans and build a lasting business including: paid memberships, free memberships, community chats, live video, and selling to fans directly with one-time purchases....


  • New York, NY, United States Patreon Full time

    Patreon is a media and community platform where over 300,000 creators give their biggest fans access to exclusive work and experiences. We offer creators a variety of ways to engage with their fans and build a lasting business including: paid memberships, free memberships, community chats, live video, and selling to fans directly with one-time purchases....


  • New York, NY, United States Elliot Partnership Full time

    Site Reliability Engineer - (Linux & Python/Go) New York, NY (Hybrid, 3 days in office) Highly competitive compensation package Join an elite technology and research group at the forefront of global finance, where world-class engineering and quantitative research converge to solve some of the most complex problems in any industry. Their teams are composed...


  • New York, NY, United States Elliot Partnership Full time

    Site Reliability Engineer - (Linux & Python/Go) New York, NY (Hybrid, 3 days in office) Highly competitive compensation package Join an elite technology and research group at the forefront of global finance, where world-class engineering and quantitative research converge to solve some of the most complex problems in any industry. Their teams are composed...


  • New York, NY, United States Elliot Partnership Full time

    Site Reliability Engineer - (Linux & Python/Go) New York, NY (Hybrid, 3 days in office) Highly competitive compensation package Join an elite technology and research group at the forefront of global finance, where world-class engineering and quantitative research converge to solve some of the most complex problems in any industry. Their teams are composed...


  • New York, NY, United States Magnite Full time

    Senior Site Reliability Engineer New York City, NY Hybrid Schedule (M/F remote, T/W/TH in-office) East Coast Remote Considerations in FL, GA, MA, NJ, NY, NC, PA, SC, VA At Magnite, we cultivate an environment of continuous growth and collaboration. Our work impacts what millions of people read, watch, and buy, and we're looking for people to help us tackle...


  • New York, NY, United States Magnite Full time

    Senior Site Reliability Engineer New York City, NY Hybrid Schedule (M/F remote, T/W/TH in-office) East Coast Remote Considerations in FL, GA, MA, NJ, NY, NC, PA, SC, VA At Magnite, we cultivate an environment of continuous growth and collaboration. Our work impacts what millions of people read, watch, and buy, and we're looking for people to help us tackle...