Current jobs related to Site Reliability Engineer - New York - Hale Recruiting
-
Site Reliability Engineer
4 weeks ago
New York, New York, United States Lorven Technologies Full timeJob Title: Site Reliability EngineerLorven Technologies is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and reliable cloud...
-
Site Reliability Engineer
2 weeks ago
New York, New York, United States CapB InfoteK Full timeJob Title: Site Reliability EngineerAbout the Role:At CapB InfoteK, we're seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:• Develop and build low-level component...
-
Site Reliability Engineer
4 weeks ago
New York, New York, United States Phaxis Full timeSite Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Phaxis. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining our critical infrastructure platforms.Key Responsibilities:Design and implement scalable and resilient servicesCollaborate with engineering teams to...
-
Site Reliability Engineer
4 weeks ago
New York, New York, United States Diverse Lynx Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Key Responsibilities:Design and implement automated workflows to reduce TOIL and...
-
Site Reliability Engineer
2 weeks ago
New York, New York, United States Lorven Technologies Full timeJob Title: Site Reliability EngineerLorven Technologies is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly available...
-
Site Reliability Engineer
2 weeks ago
New York, New York, United States Lorven Technologies Full timeJob Title: Site Reliability EngineerLorven Technologies is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain infrastructure automation...
-
Site Reliability Engineer
1 month ago
New York, New York, United States FLOAT LLC Full timeAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Float LLC. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud infrastructure, enabling our engineering teams to focus on delivering high-quality software to our customers.Key...
-
Site Reliability Engineer
4 weeks ago
New York, New York, United States Unreal Gigs Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Unreal Gigs. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, availability, and performance of our systems.Key Responsibilities:Design, implement, and maintain scalable infrastructure solutions to support...
-
Site Reliability Engineer
4 weeks ago
New York, New York, United States Alchemy Full timeAbout the RoleAlchemy is seeking a highly skilled Site Reliability Engineer to join our Infrastructure team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our globally used developer platform.Key ResponsibilitiesDesign, deploy, and continuously improve the infrastructure supporting...
-
Site Reliability Engineer
1 week ago
New York, New York, United States Insight Global Full timeJob SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a Site Reliability Engineer, you will be responsible for ensuring the uptime and reliability of our production and non-production environments. You will work closely with our development teams to build and maintain the infrastructure and applications...
-
Site Reliability Engineer
2 weeks ago
New York, New York, United States Insight Global Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Insight Global. As a Site Reliability Engineer, you will be responsible for ensuring the uptime and reliability of our production and non-production environments.Key Responsibilities:Monitor availability and system health to ensure optimal...
-
Site Reliability Engineer
4 weeks ago
New York, New York, United States ADP Full timeAbout ADPADP is a global leader in HR technology, offering the latest AI and machine learning-enhanced payroll, tax, HR, benefits, and more. We believe our people make all the difference in cultivating an inclusive, down-to-earth culture that welcomes ideas, encourages innovation, and values belonging.Job DescriptionWe are seeking a Site Reliability Engineer...
-
Site Reliability Engineer
1 month ago
New York, New York, United States Motion Recruitment Full timeSite Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Motion Recruitment. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our systems, as well as collaborating with cross-functional teams to drive innovation and improvement.Key Responsibilities:Design,...
-
Site Reliability Engineer
1 week ago
New York, New York, United States Cynet Systems Full timeJob Title: Site Reliability EngineerJob Summary:Cynet Systems is seeking a highly skilled Site Reliability Engineer to lead the development and implementation of geospatial application performance monitoring strategies. The ideal candidate will have a strong background in Site Reliability Engineering (SRE) and proven experience in using Dynatrace for...
-
Site Reliability Engineer
1 week ago
New York, New York, United States Phaxis Full timeSite Reliability EngineerWe are seeking an experienced Site Reliability Engineer to join our team at Phaxis. As a Site Reliability Engineer, you will be responsible for designing and building scalable and resilient systems, collaborating with engineering teams to advocate for optimal system use, and managing our centralized development infrastructure.Key...
-
Site Reliability Engineer
3 weeks ago
New York, New York, United States Diverse Lynx Full timeJob Title: SRE - Site Reliability EngineerJob Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based systems.Key Responsibilities:Design and implement automated workflows to reduce TOIL and...
-
Site Reliability Engineer
4 weeks ago
New York, New York, United States Braze Full timeAbout the RoleWe're seeking a highly skilled Site Reliability Engineer to join our team at Braze. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our internal-facing services and platforms.Key ResponsibilitiesPartner with Braze's engineering teams to architect products that effectively utilize...
-
Site Reliability Engineer
3 weeks ago
New York, New York, United States Apollo Solutions Full timeSite Reliability EngineerApollo Solutions is partnering with a pioneering artificial intelligence business that is revolutionizing the use of AI/ML in gaming and security.The company is working closely with government contracts and gaming console companies and is seeking a Site Reliability Engineer to join their growing team.The Site Reliability Engineer...
-
Site Reliability Engineer
1 month ago
New York, New York, United States Unreal Gigs Full timeJob SummaryWe are seeking a highly skilled Site Reliability Engineer to join our tech startup, Unreal Gigs, specializing in infrastructure and authorization solutions.As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability, availability, and performance of our systems. Your responsibilities will include designing,...
-
Site Reliability Engineer
2 weeks ago
New York, New York, United States Grafbase, Inc. Full timeAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Engineering team at Grafbase, Inc. As a Site Reliability Engineer, you will play a crucial role in ensuring the reliability, availability, and performance of our systems and services.You will collaborate with cross-functional teams to design, implement, and maintain...
Site Reliability Engineer
5 months ago
Summary - Site Reliablity Engineer (For one of the Big 4 Sports &Entertainment League)
Our client is enhancing the landscape of the live sports and entertainment
industry. They are striving to deliver innovative, cutting-edge technologies to enable safe,
unforgettable fan experiences across the globe. They are assembling a world-class technology team to build and support platforms and products that anticipate these emerging opportunities.
The Data(base) Reliability Engineer will join the infrastructure team while also working
alongside league team members and be responsible for the following areas:
- Uptime, High Availability and Disaster recovery planning
- Incident response
- Optimization of data stores
- Identify SLIs and define SLOs
- Observability tooling
- Debugging running systems and providing tools to assist runtime debugging
- Optimizations for cost control
- Ability to interface with all levels of employees
- Ability to work both independently with little supervision and in a team environment
- Ensures availability, security, integrity, and recovery of data, pipelines and data stores.
- Define and configure relevant database metrics to ensure observability
- Create and maintain dashboards and reports to visualize database performance and health
- Create monitoring and alerting to trigger on error conditions, degradation symptoms and defined
- SLOs, as well as outages
- Develops and implements data store maintenance plans, including performing integrity checks,
- Updating statistics and monitoring security and hardware resource utilization
- Work with peers to roll out changes to production environments and help mitigate and prevent
- Data-related production incidents
- Work on automation of data store infrastructure and help engineering succeed by providing
self-service tools
- Resolves performance, capacity, replication, and other distributed data, pipeline and data store issues
- Support and debug data production issues across services and levels of the stack
- Provide timely incident response and participate in on-call rotations
- Continuously identify opportunities for process improvement and automation to enhance
database performance, reliability, and efficiency
- Prioritize unblocking your teammates, collaboration and knowledge sharing
Qualifications:
To perform this job successfully, an individual must be able to perform the Duties and Responsibilities (Duties) above satisfactorily and meet the requirements below. The requirements listed below are representative of the minimum knowledge, skill, and/or ability required. Reasonable accommodations will be made to enable individuals with disabilities to perform the essential functions of the job.
Education and/or Experience: Required:
- Minimum of a bachelor’s degree in Computer Science, MIS or related degree and five (5) years of relevant experience including software or reliability engineering, database administration, datastore programming experience or combination of education, training and experience.
- Ability to communicate clearly and effectively strong opinions on how to use technologies such as cloud, microframeworks, DevOps, automation, and observability tools
- Demonstrable experience engineering automation of triggers, alerts, and remediation
- Have written code in a compiled language that runs in production somewhere
- Experience in Oracle 19c, Postgres, Mongo, Change Data Capture, data and data store monitoring, management and support
- Experience with OLTP, OLAP as well as PL/SQL code development and tuning
- Experience in Linux OS and shell scripting
- Extensive experience in performance tuning and analysis
- Strong ITIL principles are a plus
- Capacity planning for all aspects of a data store system (storage, compute, memory, etc.)
- Understanding of networking and connectivity and how it relates to a data store environment
- Excellent problem solving and troubleshooting skills
- Ability to work non-standard shifts including nights and/or weekend on-call responsibilities
- Dedicated to continuous improvement of yourself and our SRE/DBRE capabilities
- Key Technical Traits
- APIs and microservices: REST, Web, Graph
- Database Solutions – Oracle, MYSQL, MSSQL, CloudSQL, NoSQL
- Cloud Providers: Oracle Cloud Infrastructure, Google Cloud Platform, AWS
- Real-time log/event monitoring – DataDog, Stackdriver, Oracle Enterprise Manager, Oracle Cloud
- Monitoring, SolarWinds, Splunk, SumoLogic, OpenTelemetry
- Scripting: PL/SQL, Shell
- Secured Access and control – Okta SSO and MFA, MS Active Directory, DataSafe
- Software Development tools – Jira, GIT, Jenkins, ArgoCD, Terraform
- Compliance: PCI DSS, SSAE18/SOC 1