Reliability Engineer
4 weeks ago
At Goldman Sachs, we're seeking a skilled Site Reliability Engineer to join our team in Dallas. As an SRE, you'll be responsible for ensuring the availability and reliability of our firm's critical platform services.
Key responsibilities include:
- Developing and supporting automation tooling to improve platform reliability and team productivity
- Providing critical day-to-day support for our massive-scale, distributed system
- Assessing monitoring and alert signals to determine impact and risk to the business
Requirements include:
- BS degree in Computer Science or related technical field
- Proficiency in one or more programming languages, including Go, Python, C, C++, Java, Perl, Ruby, or shell scripting
- Experience with algorithms, data structures, and software design
Preferred qualifications include experience with distributed systems design, maintenance, and troubleshooting, as well as strong interpersonal skills and a drive for ownership.
We're committed to fostering a diverse and inclusive workplace, and we believe that our employees' unique perspectives and experiences make us better at what we do.
Learn more about our culture, benefits, and career opportunities at Goldman Sachs.
-
Site Reliability Engineer
4 weeks ago
Dallas, Texas, United States Diverse Lynx Full timeJob Title: Site Reliability EngineerWe are seeking a skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.**Key Responsibilities:*** Design, implement, and maintain scalable and reliable cloud...
-
Site Reliability Engineer
4 weeks ago
Dallas, Texas, United States The Goldman Sachs Group Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for ensuring the availability and reliability of our firm's most critical platform services.Key Responsibilities:Develop and implement incident management processes to ensure...
-
Senior Site Reliability Engineer
2 weeks ago
Dallas, Texas, United States Capgemini Full timeSite Reliability Engineer Job DescriptionWe're seeking an experienced Site Reliability Engineer to join our team at Capgemini. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability, scalability, and performance of our cloud infrastructure.Key Responsibilities:Design and implement scalable and reliable cloud...
-
Site Reliability Engineer
3 weeks ago
Dallas, Texas, United States Diamondpick Full timeThe roleDiamondpick is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, reliability, and performance of our services and platforms in a highly transactional 24x7 environment.Key Responsibilities:Monitor application performance and take steps to improve...
-
Site Reliability Engineer
3 weeks ago
Dallas, Texas, United States Glow Networks Full timeSite Reliability Engineer (SRE for Datacenter)At Glow Networks, we are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. As an SRE, you will be responsible for ensuring the reliability and performance of our datacenter infrastructure. Responsibilities:Data monitoring and alerting, data quality assurance, and anomaly...
-
Site Reliability Engineer
4 weeks ago
Dallas, Texas, United States Motion Recruitment Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Motion Recruitment Partners. As a key member of our infrastructure team, you will be responsible for ensuring the reliability, performance, and scalability of our systems.Key Responsibilities:Develop and implement tools to monitor key metrics of...
-
Site Reliability Engineer
4 weeks ago
Dallas, Texas, United States ThemeSoft Full timeRole: SRE ArchitectLocation: Dallas, TXDescription:Foster a culture of reliability and efficiency by sharing best practices, approaches, and documentation across engineering teams.Automate manual tasks and system components to increase operational efficiency and reduce downtime.Troubleshoot and resolve complex issues in cloud-based SaaS and on-premise...
-
Site Reliability Engineer
3 weeks ago
Dallas, Texas, United States Mastech Digital Full timeAbout the Role:We are seeking a skilled Site Reliability Engineer to join our team at Mastech Digital. As a Site Reliability Engineer, you will be responsible for ensuring the smooth operation of our IT systems and infrastructure.Key Responsibilities:Administration and troubleshooting in Linux and WindowsPatching and basic scripting skills (PowerShell,...
-
Senior Site Reliability Engineer
3 weeks ago
Dallas, Texas, United States Veradigm Full timeWelcome to Veradigm, where our mission is to transform health through innovative solutions. We are seeking a highly skilled Senior Site Reliability Engineer to join our team and help us achieve our goals.As a Senior Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining robust, scalable, and reliable systems. You will...
-
Senior Cloud Reliability Engineer
4 weeks ago
Dallas, Texas, United States VIZIO Full timeAbout the RoleVIZIO is seeking a Senior Cloud Reliability Engineer to join our team. As a key member of our DevSecOps Engineering team, you will play a crucial role in enhancing the availability, performance, and security of our cloud services.Key ResponsibilitiesAvailability and Performance Optimization: Ensure that Vizio services deliver seamless...
-
Reliability Analysis Engineer
3 weeks ago
Dallas, Texas, United States SRI Tech Solutions Inc Full timeJob Title: Reliability Analysis EngineerJob Summary: We are seeking a skilled Reliability Analysis Engineer to join our team at SRI Tech Solutions Inc.Key Responsibilities:* Design for reliability through a statistical approach* Identify process conditions and possible related loading conditions or environmental factors that lead to failure modes* Analyze...
-
Senior Site Reliability Engineer
4 weeks ago
Dallas, Texas, United States Saxon Global Full timeJob Summary:We are seeking a skilled Site Reliability Engineer to ensure the reliability, availability, and performance of our production systems. As an SRE, you will work closely with cross-functional teams to design and implement tools and processes to automate deployment, observability, and troubleshooting of our applications and infrastructure.This...
-
Infrastructure Site Reliability Engineer
4 weeks ago
Dallas, Texas, United States CVS Health Full timeJob SummaryAt CVS Health, we're committed to delivering exceptional healthcare experiences for our customers. As an Infrastructure Site Reliability Engineer, you'll play a critical role in designing, implementing, and managing the infrastructure systems and tools that enable reliability and performance of our technology platforms.Key ResponsibilitiesManage...
-
Site Reliability Engineer
4 weeks ago
Dallas, Texas, United States Motion Recruitment Partners LLC Full timeJob Title: Site Reliability Engineer - AzureJob Description:Motion Recruitment Partners LLC is seeking a highly skilled Site Reliability Engineer - Azure to join their team. The ideal candidate will have a strong background in monitoring and recovery of data systems, with experience in Azure and cloud infrastructure.Key Responsibilities:Develop and utilize...
-
Site Reliability Engineer
3 weeks ago
Dallas, Texas, United States Bayone Full timeJob Title: Site Reliability Engineer - Cloud ExpertOverview:Bayone is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining highly available and scalable applications deployed in Azure. You will work closely with development teams to ensure...
-
Lead Site Reliability Engineer
4 weeks ago
Dallas, Texas, United States Goldman Sachs Full timeAbout This RoleAt Goldman Sachs, we're committed to building and running large-scale, massively distributed, fault-tolerant systems. Our Site Reliability Engineering (SRE) team is responsible for ensuring the availability and reliability of our firm's most critical platform services, meeting the requirements of our internal and external...
-
Reliability Specialist
4 weeks ago
Dallas, Texas, United States Mass Staffing Projects Full timeReliability Engineer OpportunityMass Staffing Projects is seeking a skilled Reliability Engineer to support our client's mining operations in the Free State region.Key Responsibilities:Lead the maintenance of equipment to ensure optimal performance and reliabilityDevelop and implement asset management strategies to minimize downtime and maximize...
-
Site Reliability Engineer, VP
4 weeks ago
Dallas, Texas, United States The Goldman Sachs Group Full timeAbout the RoleWe are seeking a highly skilled Site Reliability Engineer, VP to join our team at The Goldman Sachs Group. As a key member of our engineering team, you will be responsible for ensuring the reliability and scalability of our systems.ResponsibilitiesDesign and implement robust systems to manage hundreds of thousands of compute coresDevelop and...
-
Site Reliability Engineer
4 weeks ago
Dallas, Texas, United States Goldman Sachs Full timeAbout the RoleWe are seeking a talented Site Reliability Engineer to join our team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining the firm's cloud infrastructure. You will work closely with our development team to ensure the smooth operation of our systems and services.Key...
-
Cloud Platform Reliability Engineer
4 weeks ago
Dallas, Texas, United States RELQ TECHNOLOGIES LLC Full timeJob OverviewAt RELQ TECHNOLOGIES LLC, we're seeking a seasoned Cloud Platform Reliability Engineer to join our team. This role requires a minimum of 10+ years of experience in defining and implementing Monitoring solutions for large enterprises.The ideal candidate will have extensive knowledge of Observability and Application Performance Monitoring best...