Lead Site Reliability Engineer

1 month ago


Remote, Oregon, United States Comcast Advertising Full time

FreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we're making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can insert advertisements around the world.
Job Summary
Responsible for planning and designing new software and web applications. Analyzes, tests and assists with the integration of new applications. Oversees As the Site Reliability Engineer (SRE), you will be responsible for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning for the FreeWheel platforms. You will engage in designing, analyzing and troubleshooting large-scale distributed systems, debugging /optimizing code, and automating routine tasks. You will be part of a team consisting of a healthy mix between software and technology infrastructure backgrounds, provide subject matter expertise, resolve complex break/fix scenarios and engage broader teams as necessary, partner with engineering, vendors and client services to deliver successful technical solutions. You shall work with limited supervision and direction while executing associated functions and responsibilities, follows operational practices and independently determines/develops approaches for non-routine solutions.
Job Description
Core Responsibilities

  • Be responsible for reliability and technical operations of FreeWheel TV Platform Ad-Serving component(s).
  • Lead technical solutions in measuring and improving reliability, quality and efficiency of FreeWheel platforms.
  • Lead in a variety of complex analytical duties in the planning, deployment, testing and evaluation of FreeWheel products.
  • Possesses in-depth working knowledge of FreeWheel platforms, infrastructure, internal processes, and teams/partners.
  • Support FreeWheel powered live events such as Super Bowl, Olympic Games, March Madness, and FIFA World Cup.
  • Plug into software release cycle, work closely with developers and tech leads to ensure software releases are well designed, planned, implemented, released, and monitored.
  • Lead in design and implementation in authoring infrastructure as code with best practices, tool use, and quality assurance.
  • Lead technical solutions for infrastructure and application management, monitoring, and operations with standardization and automation focus.
  • Leverages engineering methodologies and technical knowledge in specific areas of focus.
  • Lead code level debugging on issues escalated to the team.
  • Lead on-call shifts, incident prevention, response, and retrospect.
  • Advocate for engineering and technical operations procedures, policies, processes and SRE best practices.
  • Partner with developers and vendors to identify and drive improvements including production quality, operational efficiency, engineering productivity.
  • Provide support and influence for the Cybersecurity program needs such as patching, vulnerability cleanup, secure server configuration, testing and validation, technical controls implementation and cybersecurity incident remediation efforts.
  • Provides training and coaching to peers and more junior SRE team members.
  • Consistent exercise of independent judgment and discretion in matters of significance.
  • Regular, consistent and punctual attendance. Must be able to work nights and weekends, variable schedule(s) and overtime as necessary.
  • Other duties and responsibilities as assigned.

Minimum Requirements

  • Bachelor's degree in computer science, a related engineering field, or equivalent practical experience.
  • Prior 7 years of experience in software engineering with one of programming languages: Python, Golang, JavaScript.
  • Prior 5 years of technical operation experience for business-critical application(s) over public cloud (AWS specific is a big plus) services: VPC, subnets, network access control lists, security groups, EC2 instances, S3 buckets, IAM, Route 53, Lambda.
  • Prior 5 years of experience with SDLC tools: Containers, Kubernetes, Docker, Salt / Ansible / Chef / Puppet, Jenkins, Git.
  • Prior experience of Linux administration, network security, and system infrastructure.
  • Excellent communication and collaboration, within/across team(s) and continents.
  • Work / Shift Timings: Selected candidate will be expected to work Eastern Standard hours & be able to work on weekend during on-call rotation schedule: usually 12 hours a day including weekend.

Preferred requirements

  • Prior experience in supporting business-critical services before they go live through activities such as system design consulting, developing software platforms and frameworks, capacity planning and launch reviews.
  • Technical leadership and influence demonstrated in focused product/tech areas and practices.
  • Prior experience in providing technical solutions at an internet company.

Employees at all levels are expected to:

  • Understand our Operating Principles; make them the guidelines for how you do your job.
  • Own the customer experience - think and act in ways that put our customers first, give them seamless digital options at every touchpoint, and make them promoters of our products and services.
  • Know your stuff - be enthusiastic learners, users and advocates of our game-changing technology, products, and services, especially our digital tools and experiences.
  • Win as a team - make big things happen by working together and being open to new ideas.
  • Be an active part of the Net Promoter System - a way of working that brings more employee and customer feedback into the company - by joining huddles, making call backs and helping us elevate opportunities to do better for our customers.
  • Drive results and growth.
  • Respect and promote inclusion & diversity.
  • Do what's right for each other, our customers, investors and our communities.

Disclaimer:

  • This information has been designed to indicate the general nature and level of work performed by employees in this role. It is not designed to contain or be interpreted as a comprehensive inventory of all duties, responsibilities, and qualifications.

Comcast is proud to be an equal opportunity workplace. We will consider all qualified applicants for employment without regard to race, color, religion, age, sex, sexual orientation, gender identity, national origin, disability, veteran status, genetic information, or any other basis protected by applicable law. Comcast will consider for employment qualified applicants with criminal histories in a manner consistent with the requirements of applicable law, including the Los Angeles Fair Chance Initiative for Hiring Ordinance and the San Francisco Fair Chance Ordinance.
Salary:
National Pay Range: $112,151.21 USD-$262,854.41 USD
Comcast intends to offer the selected candidate base pay within this range, dependent on job-related, non-discriminatory factors such as experience. The application window is 30 days from the date job is posted, unless the number of applicants requires it to close sooner or later.
Additionally, Comcast provides best-in-class Benefits to eligible employees. We believe that benefits should connect you to the support you need when it matters most, and should help you care for those who matter most. That's why we provide an array of options, expert guidance and always-on tools, that are personalized to meet the needs of your reality - to help support you physically, financially and emotionally through the big milestones and in your everyday life. Please visit the compensation and benefits summary on our careers site for more details.
Education
Bachelor's Degree
While possessing the stated degree is preferred, Comcast also may consider applicants who hold some combination of coursework and experience, or who have extensive related professional experience.
Relevant Work Experience
10 Years +



  • Remote, Oregon, United States Henry Meds Full time

    About Henry Meds:Tens of millions of Americans are unable to manage their chronic conditions with commercial medications. Using specialized compounded formulas tailored to individual patient needs, Henry helps people who have been left behind by the commercial market, all while remaining easy, accessible, and affordable. Our customers get access to the care...


  • Remote, Oregon, United States Abarca Full time

    What you'll doIn a few words...Abarca is igniting a revolution in healthcare. We built our company on the belief that with smarter technology we are redefining pharmacy benefits, but this is just the beginning...Our Site Reliability Engineering team leverages software engineering and infrastructure operations to create highly reliable and scalable software...


  • Remote, Oregon, United States Hypori Inc. Full time

    Hypori Inc, a leading provider of SaaS cybersecurity solutions, is transforming secure mobility for federal and commercial customers, including the United States Army. Hypori's secure virtual workspace enables users to access critical data and apps from any mobile device without compromising user privacy. From commercial IP to national security level intel,...


  • Remote, Oregon, United States Xero Full time

    Xero is a beautiful, easy-to-use platform that helps small businesses and their accounting and bookkeeping advisors grow and thrive. At Xero, our purpose is to make life better for people in small business, their advisors, and communities around the world. This purpose sits at the centre of everything we do. We support our people to do the best work of their...


  • Remote, Oregon, United States Business Wire Full time

    Business Wire, a Berkshire Hathaway company, is the global market leader in press release distribution and regulatory disclosure. We are on a mission to redefine how organizations connect with their audiences - and that's just the beginningOrganizations, large and small, depend on us to accurately publicize market-moving news and multimedia, and generate...


  • Remote, Oregon, United States Brooksource Full time

    Job DescriptionBrooksource is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems and applications.Key Responsibilities:Linux System Administration: Manage and optimize Linux-based systems and servers to...


  • Remote, Oregon, United States Neon Full time

    Neon is aiming to be the go-to platform if you need a serverless Postgres with additional features like branching and scaling, to name a couple. Currently we are serving 750k databases and we want to grow that number, along with delivering more features without compromising from reliability and scalability. This is where our SRE team comes into the...


  • Remote, Oregon, United States Circonus Full time

    As a Site Reliability Engineer (SRE) at Circonus, you will be responsible for keeping Circonus SaaS and on-premise customers up and running as well as improving the automation, scalability, and performance of systems. This is an unparalleled opportunity to grow on a small, collaborative, and friendly team with established leadership in the field of SRE. A...


  • Remote, Oregon, United States Own Company Full time

    Own is the leading data platform trusted by thousands of organizations to protect and activate SaaS data to transform their businesses. Own empowers customers to ensure the availability, security and compliance of mission-critical data, while unlocking new ways to gain deeper insights faster. By partnering with some of the world's largest SaaS ecosystems...


  • Remote, Oregon, United States DFIN Full time

    Donnelley Financial Solutions (DFIN) is a leader in risk and compliance solutions, providing insightful technology, industry expertise and data insights to clients across the globe. We're here to help you make smarter decisions with insightful technology, industry expertise and data insights at every stage of your business and investment lifecycles. As...


  • Remote, Oregon, United States Katmai Full time

    ABOUT KATMAIKatmai is pioneering the future of virtual experiences and hybrid work. The platform brings people together inside an easy-to-navigate 3D environment, enabling natural communication & collaboration, spontaneous interactions, and a sense of place that's been missing from the digital world. The simplicity of the user experience means no headsets...


  • Remote, Oregon, United States Sparksoft Corporation Full time

    Join us at Sparksoft, where we're not just another tech company—we're a catalyst for change. Our mission isn't just to offer IT solutions; it's to revolutionize the way you work. Here, passion isn't just a buzzword; it's the fuel behind groundbreaking ideas and transformative technologies. We serve a wide range of government clients, delivering impact...


  • Remote, Oregon, United States Ankura Full time

    Ankura is a team of excellence founded on innovation and growth.Practice Overview:Ankura is a team of excellence founded on innovation and growth. Technology Services at Ankura provide Technology platforms, solutions and support services globally and across the company in a scalable, performant, and secure manner. The Technology Services teams play a vital...


  • Remote, Oregon, United States Tyk Full time

    DescriptionWho are Tyk, and what do we do?The Tyk API Management platform is helping to drive the connected world and power new products and services. We're changing the way that organisations connect any number of their systems and services. Whether internal, external, public or highly encrypted systems, Tyk helps businesses drive value across the retail,...


  • Remote, Oregon, United States Symbotic Full time

    Company OverviewSymbotic is at the forefront of transforming supply chain logistics through its advanced A.I.-driven robotic technology platform. Our intelligent software coordinates sophisticated robots within a high-density, comprehensive system, revolutionizing warehouse automation to enhance efficiency, speed, and adaptability.Position SummaryThe Lead...


  • Remote, Oregon, United States Dutchie Full time

    About DutchieFounded in 2017, Dutchie is a comprehensive technology platform powering dispensary operations, while providing consumers with safe and easy access to cannabis. Dutchie aims to further support the positive societal change the cannabis industry brings to the world through wellness benefits, social justice, and empowering local communities through...


  • Remote, Oregon, United States Symbotic Full time

    About UsSymbotic is at the forefront of transforming supply chain logistics through our advanced A.I.-driven robotic technology platform. Our intelligent software coordinates sophisticated robots within a comprehensive system, revolutionizing warehouse automation to enhance efficiency, speed, and adaptability. Position OverviewThe Lead Site Installation...

  • Engineering Lead

    1 month ago


    Remote, Oregon, United States Alloy Automation Full time

    Alloy Automation (YC W20) is more than just a tech startup - we're building the integration infrastructure that everyone from fast growing startups to Fortune 500's rely on to launch and manage their integrations – at scale. Our engineering team delivers a best in class, incredible experience for our customers who range from global brands like Burberry...


  • Remote, Oregon, United States Sargent & Lundy Full time

    Position Overview Sargent & Lundy's Government Services Division is at the forefront of engineering design and advisory services, providing essential support to management and operational contractors associated with U.S. Department of Energy (DOE) sites and national laboratories. Our focus includes aiding the DOE Environmental Management Directorate and...


  • Remote, Oregon, United States GE Full time

    Job Description SummaryThe I&C Systems Design Engineer is responsible for design and analysis of I&C systems for nuclear power plant applications.Job DescriptionResponsible for Plant I&C Systems design activities that support:GE's BWRX-300 Small Modular Reactor (SMR) and/or Gen-IV reactor technologies including Natrium and ARC sodium fast reactors...