Director, Edge Operations/SRE
2 days ago
PSR Associates is a consulting and talent solutions firm that connects qualified IT professionals with great opportunities. Whether you're looking for a contract or permanent position, we can help you find the right fit for your skills and experience. We have a team of experienced recruiters who know the IT industry inside and out, and we work with you every step of the way to ensure a smooth and successful transition. PSR Connecting Talent, Crafting Success.
Director, Edge Operations/SRE
Chicago, IL, US, 60607
Company Description:
The client's growth strategy, encompasses all aspects as the leading global omni-channel brand. As the consumer landscape shifts our client is using competitive advantages to further strengthen their brand. One of the core growth strategies is to Double Down on the 3Ds (Delivery, Digital and Drive Thru). Our client will accelerate technology innovation, so 65M+ customers a day will experience a fast, easy experience.
Exploring new and innovative ways to serve millions of customers, using AI, robotics and emerging tech. Combine that with an unparalleled global scale, and reshaping all areas of the business, industry and every community.
Department Overview
We are seeking a Director, Site Reliability Engineering and Operations to lead all aspects of a distributed team running the Edge platform in 100+ countries. This role is critical to ensuring the performance, reliability, and availability of edge computing infrastructure at scale.
The ideal candidate will bring deep technical expertise in Edge Infrastructure, On-Prem Cloud, and/or Google Cloud Platform (GCP), along with strong leadership skills to guide a high-performing team and collaborate effectively with strategic partners. This is an outstanding opportunity to drive operational excellence and continuous improvement across one of the largest edge deployments in the world.
Accountabilities and Responsibilities:
- Lead 24x7x365 operations of edge infrastructure, ensuring high availability, reliability, and efficient performance.
- Build, mentor, and lead a team of SREs, engineers and operations specialists across multiple geographies.
- Supervise incident response, root cause analysis, and resolution processes for edge-related outages or degradations.
- Encourage the utilization of SRE procedures, encompassing SLIs/SLOs, error budgets, and incident management.
- Collaborate with Platform Engineering and Application teams to build and deploy scalable, resilient systems.
- Monitor platform capacity and performance and collaborate with the Platform Engineering team to forecast and plan for future edge capacity/performance needs.
- Develop and maintain monitoring, alerting, and observability to proactively detect and resolve issues.
- Lead initiatives to automate day to day operational tasks and reduce toil.
- Collaborate with Edge Platform Vendor and Outsourced Service Provider to guarantee SLAs are achieved and consistently improved.
- Ensure all edge operations align with security guidelines and meet relevant regulatory and compliance standards.
- Engage with Platform Engineering, Application, Segment, and Market teams to coordinate edge operations with business objectives.
- Collaborate with Market Operations and other Global Tech Operations teams to implement modifications on the edge platform based on global and market-specific change protocols.
- Cultivate an environment of continuous improvement, teamwork, and operational efficiency among team members.
- Qualifications
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- 10+ years of experience in infrastructure, On-Prem/Public Cloud, or SRE roles, with at least 5 years in a leadership capacity.
- Shown experience leading edge platforms, or hybrid cloud environments.
- Strong knowledge of Hardware, Kubernetes, CI/CD pipelines, and infrastructure as code (e.g., Terraform, Ansible).
- Experience with monitoring and observability tools (e.g., Prometheus, Grafana, ELK, New Relics).
- Experience with driving/leading automation initiative to reduce toil and improve efficiency.
- Excellent communication, leadership, and cross-functional collaboration skills.
- Experience in leading and forging partnerships with Vendors and Managed service partners to deliver business value.
- Demonstrable background in guiding Operations/SRE team within a sophisticated multinational corporation.
- Strong knowledge and experience with GCP and/or AWS cloud Infrastructure
*** Please note that any false information on your resume or application could lead to the offer being withdrawn or even termination after hire.***
-
Director, Edge Operations
6 days ago
Chicago, IL, United States Invitus Strategy Solutions LLC Full timeInvictus Strategy & Solutions is a Service-Disabled Veteran-Owned Small Business (SDVOSB) providing strategic workforce solutions to mission-critical government and commercial operations. From cleared federal programs to complex industrial projects, we deliver top-tier professionals who drive performance, safety, and results. Our commitment to operational...
-
AWS SRE Architect
2 weeks ago
Chicago, IL, United States Navtech Full timeHi, My name is Kevin Smith, and I am a staffing specialist at Navtech USA. I have an open opportunity that you may be a good fit for. If this sounds like something you would be interested in, please get in touch with me as soon as possible at kevin@navtechusa.com with your most recent resume, your ideal time and number for communication, and the expected pay...
-
AWS SRE Architect
7 days ago
Chicago, IL, United States Navtech Full timeHi, My name is Kevin Smith, and I am a staffing specialist at Navtech USA. I have an open opportunity that you may be a good fit for. If this sounds like something you would be interested in, please get in touch with me as soon as possible at kevin@navtechusa.com with your most recent resume, your ideal time and number for communication, and the expected pay...
-
Lead Site Reliability Engineer
2 weeks ago
Chicago, IL, United States EPAM Systems Inc Full timeAt EPAM, we're not just building software - we're engineering excellence. We're looking for a Lead Site Reliability Engineer (SRE) with a passion for performance, precision, and proactive problem-solving to join a high-impact team supporting a leading sell-side trading environment. This role is ideal for someone who thrives in fast-paced financial systems,...
-
Junior DevOps
2 weeks ago
Chicago, IL, United States Comcast Full timeFreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we're making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can...
-
Junior DevOps
2 weeks ago
Chicago, IL, United States Comcast Full timeFreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we're making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can...
-
Junior DevOps
6 days ago
Chicago, IL, United States Comcast Full timeFreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we're making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can...
-
Junior DevOps
2 days ago
Chicago, IL, United States Comcast Full timeFreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we're making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can...
-
Junior DevOps
1 week ago
Chicago, IL, United States Comcast Full timeFreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we're making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can...
-
Director of Operations
1 week ago
Chicago, IL, United States BLACK - Building Leadership And Community Knowledge Full timeDirector of Operations will be responsible for coordinating and supervising each initiative. The Director reports to the Board of Directors and the Board President and/or Chairman. The Director chairs the Executive Committee to ensure each initiative serves the needs of the community where it exists and to monitor, develop and implement policy and procedures...