Senior Engineer SIte Reliability
4 weeks ago
Senior Engineer Site Reliability
Dell Technologies customers rely on our products and services to drive progress. So, we take the service we provide extremely seriously. Service Delivery is all about making sure our technical solutions help clients fulfil their priorities, challenges and initiatives. As trusted advisors, we build in-depth knowledge of what each client wants to achieve. Then we make sure the services delivered by Dell Technologies deliver on all our promises. We also work closely with Sales and Global Services colleagues to develop strategic account growth plans, and to identify and pursue sales opportunities.
Join us to do the best work of your career and make a profound social impact as a Senior Engineer - Site Reliability Engineering on our Service Delivery Team in Austin, Texas.
What you'll achieve
The Senior Engineer- Site Reliability Engineering supporting Artificial Intelligence/Machine Learning/High Performance Compute Solutions, Service Delivery will be responsible for providing the primary management, administration, support, and ongoing maintenance of customer Platforms within a 24x7x365 datacenter environment. This is a technical leadership role. The ideal candidate will play a crucial role in managing and supporting complex solutions and platforms for our prestigious Fortune 100 clients.
The role will be expected to work in a positive and collaborative fashion with fellow team members, senior engineering/architect staff, vendors, and customers. The Senior Engineer will assist with process maturation, development, technical standards creation, and drive operational excellence through consistent delivery and best practices.
You will:
Serve as the technical expert in deploying, upgrading, troubleshooting Artificial Intelligence/Machine Learning/High Performance Compute Solutions platforms
Manage and maintain container platform (Kubernetes, OpenShift) infrastructure, including installation, configuration, and upgrades and optimize system performance, capacity, and availability of the environment
Act in the capacity of a Senior SRE/DevOps
5-7 years years of hands on experience working in an infrastructure managed services environment, supporting complex engineered solution in production with Artificial Intelligence/Machine Learning/High Performance Compute Systems and Platforms, Converged/Hyper-Converged infrastructure along with fluency in AI/ML pipelines, Nvidia GPU optimization, InfiniBand networking, Machine Learning operating systems such as cnvrg.io, Compute Orchestration Platform such as runai etc
Experience with cluster provisioning and resource schedulers
Programming experience with Python, Go, Ruby, Shell Scripts, PowerShell along with hands on experience with ELK, Prometheus, Grafana, Ansible, Git, or similar technologies
Expertise in Kubernetes, OpenShift, Docker, Container Networking, and Cloud Native Platform/Applications
Strong Networking Fundamentals along with Converged Infra (CI)/Hyper Converged Infa (HCI) Management Certification along with hands-on experience with Amazon Kubernetes Service (AKS), Amazon EKS, Google Kubernetes Engine (GKE), Rancher
Desirable Requirements
BE or MS in Computer Science or Computer Engineering or acceptable combination of equivalent industry experience will be considered
Certified Kubernetes/OpenShift Admin, NSX T Certification
Who we are
We believe that each of us has the power to make an impact. That's why we put our team members at the center of everything we do. If you're looking for an opportunity to grow your career with some of the best minds and most advanced tech in the industry, we're looking for you.
Dell Technologies is a unique family of businesses that helps individuals and organizations transform how they work, live and play. Join us to build a future that works for everyone because Progress Takes All of Us.
Dell Technologies is committed to the principle of equal employment opportunity for all employees and to providing employees with a work environment free of discrimination and harassment. Read the full Equal Employment Opportunity Policy here .
Dell's Flexible & Hybrid Work Culture
At Dell Technologies, we believe our best work is done when flexibility is offered.
We know that freedom and flexibility are crucial to all our employees no matter where you are located and our flexible and hybrid work style allows team members to have the freedom to ideate, be innovative, and drive results their way. To learn more about our work culture, please visit our locations page.
-
Senior Engineer, Site Reliability Engineering
2 weeks ago
Chicago, Illinois, United States Balyasny Asset Management Full timeWe are looking for a Senior Site Reliability Engineer who can cultivate our SRE philosophy, processes, and technologies from the ground up.As a Senior Site Reliability Engineer within the Platform group, you will lay the groundwork for our SRE infrastructure. Your role will entail driving standards and fostering adoption across our technology teams, whilst...
-
Senior Engineer, Site Reliability Engineering
2 months ago
Chicago, United States Balyasny Asset Management Full timeWe are looking for a Senior Site Reliability Engineer who can cultivate our SRE philosophy, processes, and technologies from the ground up. As a Senior Site Reliability Engineer within the Platform group, you will lay the groundwork for our SRE infrastructure. Your role will entail driving standards and fostering adoption across our technology teams, whilst...
-
Senior Engineer, Site Reliability Engineering
4 weeks ago
Chicago, United States Balyasny Asset Management Full timeWe are looking for a Senior Site Reliability Engineer who can cultivate our SRE philosophy, processes, and technologies from the ground up. As a Senior Site Reliability Engineer within the Platform group, you will lay the groundwork for our SRE infrastructure. Your role will entail driving standards and fostering adoption across our technology teams, whilst...
-
Senior Site Reliability Engineer
1 week ago
Chicago, United States DASH2 Full timeThe Senior/Principal Site Reliability Engineer is responsible for ensuring our SaaS products are fast, stable and optimized for our customers. SRE's here take on availability, performance, managing change, monitoring, response and are guardians of non-functional requirements.You either have an infrastructure background with a programmatic, automated mindset...
-
Senior Site Reliability Engineer
2 weeks ago
Chicago, United States DASH2 Full timeThe Senior/Principal Site Reliability Engineer is responsible for ensuring our SaaS products are fast, stable and optimized for our customers. SRE’s here take on availability, performance, managing change, monitoring, response and are guardians of non-functional requirements.You either have an infrastructure background with a programmatic, automated...
-
Azure Site Reliability Engineer
4 weeks ago
Chicago, Illinois, United States Motion Recruitment Full timeA financial company is looking for senior level Site Reliability Engineers to join their team in troubleshooting applications and managing their Azure environment. This will be a contract-to-hire position that is hybrid 3 days a week in the Chicago area. Expertise in Terraform, YAML, and Azure infrastructure is mandatory. This company is a global leader in...
-
Azure Site Reliability Engineer
2 months ago
Chicago, Illinois, United States Motion Recruitment Full timeA financial company is looking for senior level Site Reliability Engineers to join their team in troubleshooting applications and managing their Azure environment. This will be a contract-to-hire position that is hybrid 3 days a week in the Chicago area. Expertise in Terraform, YAML, and Azure infrastructure is mandatory. This company is a global leader in...
-
Senior Site Reliability Engineer
3 weeks ago
Chicago, United States Adyen Full timeThis is Adyen Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the culture...
-
Senior Site Reliability Engineer
4 weeks ago
Chicago, United States Adyen Full timeThis is Adyen Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the culture...
-
Senior Site Reliability Engineer
2 weeks ago
Chicago, Illinois, United States Adyen Full timeThis is AdyenAdyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the culture...
-
Senior Site Reliability Engineer
4 weeks ago
Chicago, United States Adyen Full timeThis is Adyen Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. For our teams, we create an environment with opportunities for our people to succeed, backed by the culture...
-
Site Reliability Engineer
4 weeks ago
Chicago, United States Diverse Lynx Full timeJob Title: Site Reliability Engineer Location: Chicago - IL (Remote) Employment: Contract JobSummary: Experienced Senior Cloud Engineer with 6-10 years of experience in AWS and AWS Cloud Formation. Must have domain skill experience in Payer. Required Skills Technical Skills: Dynatrace, Python, AWS, AWS Cloud Watch, Glue and Lambda Roles & Responsibilities...
-
Site Reliability Engineer
4 weeks ago
Chicago, United States Diverse Lynx Full timeJob Title: Site Reliability Engineer Location: Chicago - IL (Remote) Employment: Contract JobSummary: Experienced Senior Cloud Engineer with 6-10 years of experience in AWS and AWS Cloud Formation. Must have domain skill experience in Payer. Required Skills Technical Skills: Dynatrace, Python, AWS, AWS Cloud Watch, Glue and Lambda Roles & Responsibilities...
-
Senior Site Reliability Engineer
2 months ago
Chicago, United States Deere & Company Full timeAdvanced Options 28 open jobs. Use your resume to get matched with the right job. Senior Platform Engineer (Chicago, Visa Sponsorship available) Reliability Engineer Dubuque, Iowa, United States Reliability Engineer Dubuque, Iowa, United States Senior Software Engineer - DevOps eCommerce (Chicago) SOFTWARE ENGINEER (Chicago, IL or Moline, IL - Hybrid) SAP...
-
Site Reliability Engineer
2 weeks ago
Chicago, Illinois, United States Spectraforce Technologies Full timeTitle: Senior Associate Software Engineer/Senior Lead Software EngineerLocation: Chicago, IL Onsite 3 days per weekDuration: 6 Month Contract to Hire Must Haves:5-8+ years of overall software engineering experience 4-6+ years in Site Reliability Engineering Experience developing, supporting, and managing cloud technologies Experience working with...
-
Site Reliability Engineer
2 weeks ago
Chicago, Illinois, United States Spectraforce Technologies Full timeTitle :Senior Associate Software Engineer/Senior Lead Software Engineer Location :Chicago, IL Onsite 3 days per week Duration : 6 Month Contract to Hire Must Haves: 5-8+ years of overall software engineering experience 4-6+ years in Site Reliability Engineering Experience developing, supporting, and managing cloud technologies Experience working with...
-
Site Reliability Engineer
1 month ago
Chicago, United States Saxon Global Full timeNorthern Trust Site Reliability Engineer (Azure) Location : Downtown Chicago - Onsite 2 days/week - 181 W Madison St Duration : 12+ month contract w/extension/conversion Overview The Goals Driven Wealth Management platform is a showcase product for Northern Trusts Wealth Management business and we must demonstrate our ability to deliver and...
-
Site Reliability Engineer
4 weeks ago
Chicago, United States Saxon Global Full timeNorthern Trust Site Reliability Engineer (Azure) Location : Downtown Chicago - Onsite 2 days/week - 181 W Madison St Duration : 12+ month contract w/extension/conversion Overview The Goals Driven Wealth Management platform is a showcase product for Northern Trusts Wealth Management business and we must demonstrate our ability to deliver and...
-
Site Reliability Engineer
21 hours ago
Chicago, United States Synergy Interactive Full timeAs a Site Reliability Engineer you will play a critical role in ensuring the reliability, scalability, and performance of our infrastructure and applications. You will collaborate closely with our engineering, operations, and development teams to design, implement, and maintain robust systems and processes that support our mission-critical services. Key...
-
Site Reliability Engineer
3 days ago
Chicago, United States Synergy Interactive Full timeAs a Site Reliability Engineer you will play a critical role in ensuring the reliability, scalability, and performance of our infrastructure and applications. You will collaborate closely with our engineering, operations, and development teams to design, implement, and maintain robust systems and processes that support our mission-critical services.Key...