Senior Staff Engineer, Observability Engineer

4 weeks ago


Mountain View, United States Coupang Full time

We exist to wow our customers. We know we’re doing the right thing when we hear our customers say, “How did we ever live without Coupang?” Born out of an obsession to make shopping, eating, and living easier than ever, we’re collectively disrupting the multi-billion-dollar e-commerce industry from the ground up. We are one of the fastest-growing e-commerce companies that established an unparalleled reputation for being a dominant and reliable force in South Korean commerce.

We are proud to have the best of both worlds — a startup culture with the resources of a large global public company. This fuels us to continue our growth and launch new services at the speed we have been since our inception. We are all entrepreneurial surrounded by opportunities to drive new initiatives and innovations. At our core, we are bold and ambitious people that like to get our hands dirty and make a hands-on impact. At Coupang, you will see yourself, your colleagues, your team, and the company grow every day.

Our mission to build the future of commerce is real. We push the boundaries of what’s possible to solve problems and break traditional tradeoffs. Join Coupang now to create an epic experience in this always-on, high-tech, and hyper-connected world.

Summary: As a Sr. Staff Back-end Engineer within the Site Reliability organization, you will be working with large scale cloud infrastructure handling billions of metrics and peta-bytes of logs and metrics.

You will leverage this data to help internal teams to monitor service reliability and predict/prevent incidents. You have the opportunity to build the next generation Observability Platform based on Kubernetes and other OSS solutions, as well as building software components from scratch. You would work directly with various engineering teams in Coupang, influence them with SRE principles and best practices and see your impact directly.

Key Responsibilities:

Design, implement, and maintain observability solutions such as monitoring, alerting, logging, and tracing across various platforms, applications, and infrastructure.

Collaborate with cross-functional teams, including software engineers, SREs, and infrastructure teams, to identify and define observability requirements.

Develop and implement best practices for creating and maintaining effective monitoring, alerting, and telemetry systems.

Evaluate and recommend industry-leading observability tools and technologies to improve system visibility and reliability.

Define and track key performance indicators (KPIs) and service-level objectives (SLOs) related to system availability, performance, and reliability.

Assist in the troubleshooting and resolution of complex incidents and problems by analyzing data from observability tools.

Provide guidance and mentorship to other engineers on observability principles, practices, and tools.

Conduct ongoing evaluations of observability systems and identify opportunities for improvements and optimizations.

Drive the standardization and simplification of observability processes, tools, and frameworks across the organization.

Contribute to the development of training materials, documentation, and runbooks for observability systems and practices.

Essential Qualifications:

Bachelor's Degree in Computer Science, Engineering, or a related technical field.

Strong experience in implementing and managing observability solutions in large-scale, complex environments.

Deep knowledge of monitoring, alerting, and logging systems and tools, such as Prometheus, Grafana, Elastic Stack, Datadog, or New Relic.

Familiarity with distributed tracing technologies, such as Jaeger or Zipkin.

Experience with cloud-based infrastructure, including AWS, Azure, or Google Cloud Platform.

Strong understanding of DevOps and SRE practices, including continuous integration, continuous delivery, and infrastructure as code (IaC).

Proficiency in scripting languages, such as Python, Bash, or Ruby.

Excellent communication and collaboration skills, with the ability to work with teams across different functions and technical domains.

Strong problem-solving and analytical skills, with a focus on data-driven decision-making.

A proven track record of leading and delivering successful observability projects and initiatives.

Preferred Qualifications:

Experience with containerization and orchestration technologies, such as Docker and Kubernetes.

Familiarity with application performance management (APM) tools, such as Dynatrace or AppDynamics.

Professional certifications in cloud platforms, monitoring tools, or related technologies.

Pay & Benefits

Our compensation reflects the cost of labor across several US geographic markets. At Coupang, your base pay is one part of your total compensation.

The base pay for this position ranges from $159,000/year in our lowest geographic market to $324,000/year

in our highest geographic market. Pay is based on several factors including market location and may vary depending on job-related knowledge, skills, and experience.

General Description of All Benefits

Medical/Dental/Vision/Life, AD&D insurance

Flexible Spending Accounts (FSA) & Health Savings Account (HSA)

Long-term/Short-term Disability

Employee Assistance Program (EAP) program

401K Plan with Company Match

18-21 days of the Paid Time Off (PTO) a year based on the tenure

12 Public Holidays

Paid Parental leave

Pre-tax commuterbenefits

MTV - [Free] Electric Car Charging Station

General Description of Other Compensation

“Other Compensation” includes, but is not limited to, bonuses, equity, or other forms of compensation that wouldbe offered to the hired applicant in addition to their established salary range orwage scale.

Coupang is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to actual or perceived race (including traits historically associated with race, including but not limited to hair texture and protective hair styles), color, religion, religious creed (including religious dress and grooming practices), sex or gender (including pregnancy, childbirth, breastfeeding, and medical conditions related to pregnancy, childbirth or breastfeeding), gender identity, gender expression, sexual orientation, ,ancestry, national origin (including language use restrictions), age (40 and over), physical or mental disability, medical condition, genetic information, HIV/AIDS or Hepatitis C status, family status (including but not limited to marital or domestic partnership status), military or veteran status, use of a trained dog guide or service animal, political activities or affiliations, ancestry, citizenship, family and medical leave status, status as a victim of any violent crime, or any other characteristic or class protected by the laws or regulations in the locations where we operate.Coupang is also committed to providing a safe work environment for its employees and its consumers.If you need assistance and/or a reasonable accommodation in the application of recruiting process due to a disability, please contact us at

usrecruiting@coupang.com

#J-18808-Ljbffr



  • Mountain View, United States Startup Full time

    At Databricks, we are inspired by allowing data teams to solve the world’s toughest problems, from security threat detection to cancer drug development. We do this by building and running the world’s best data and AI infrastructure platform, so our customers can focus on the high value challenges that are central to their own missions. Our engineering...


  • Mountain View, United States CareerBuilder Full time

    Staff Infrastructure and Observability Engineer (Mountain View, CA) Department: Behaviors, Execution and Foundation Employment Type: Full Time Location: Mountain View, CA Reporting To: Angela Tan Description We're SmartThings, one of the leading IoT ecosystems in the world, creating the most effortless way for anyone to create a smart home. As a wholly owned...


  • Mountain View, United States Elastic Full time

    Elastic is a free and open search company that powers enterprise search, observability, and security solutions built on one technology stack that can be deployed anywhere. From finding documents to monitoring infrastructure to hunting for threats, Elastic makes data usable in real-time and at scale. Thousands of organizations worldwide, including Barclays,...


  • Mountain View, United States SmartThings Full time

    Job DescriptionJob DescriptionDescriptionWe’re SmartThings, one of the leading IoT ecosystems in the world, creating the most effortless way for anyone to create a smart home. As a wholly owned subsidiary of Samsung, our corporate offices are based in Minneapolis and the Bay Area.More than 270 million people worldwide use SmartThings to control and manage...


  • Mountain View, United States SmartThings Full time

    Job DescriptionJob DescriptionDescriptionWe’re SmartThings, one of the leading IoT ecosystems in the world, creating the most effortless way for anyone to create a smart home. As a wholly owned subsidiary of Samsung, our corporate offices are based in Minneapolis and the Bay Area.More than 270 million people worldwide use SmartThings to control and manage...


  • Mountain View, United States SmartThings Full time

    Job DescriptionJob DescriptionDescriptionWe’re SmartThings, one of the leading IoT ecosystems in the world, creating the most effortless way for anyone to create a smart home. As a wholly owned subsidiary of Samsung, our corporate offices are based in Minneapolis and the Bay Area.More than 270 million people worldwide use SmartThings to control and manage...


  • Mountain View, United States SmartThings Full time

    Job DescriptionJob DescriptionDescriptionWe’re SmartThings, one of the leading IoT ecosystems in the world, creating the most effortless way for anyone to create a smart home. As a wholly owned subsidiary of Samsung, our corporate offices are based in Minneapolis and the Bay Area.More than 270 million people worldwide use SmartThings to control and manage...


  • Mountain View, United States SmartThings Full time

    Job DescriptionJob DescriptionDescriptionWe’re SmartThings, one of the leading IoT ecosystems in the world, creating the most effortless way for anyone to create a smart home. As a wholly owned subsidiary of Samsung, our corporate offices are based in Minneapolis and the Bay Area.More than 270 million people worldwide use SmartThings to control and manage...

  • Senior Staff Engineer

    4 weeks ago


    Mountain View, United States PredictSpring Full time

    Job Overview As Senior Staff Engineer in the DevOps team, you will build and support applications and infrastructure enabling teams to configure, deploy, operate, and monitor the mission-critical services powering offered by the PredictSpring Cloud platform that serves the world's leading brands and retailers for their Modern POS Platform. You will work in a...

  • Senior Staff Engineer

    2 weeks ago


    Mountain View, United States PredictSpring Full time

    Job Overview As Senior Staff Engineer in the DevOps team, you will build and support applications and infrastructure enabling teams to configure, deploy, operate, and monitor the mission-critical services powering offered by the PredictSpring Cloud platform that serves the world's leading brands and retailers for their Modern POS Platform. You will work in a...


  • Mountain View, United States Coupang Full time

    We exist to wow our customers. We know we’re doing the right thing when we hear our customers say, “How did we ever live without Coupang?” Born out of an obsession to make shopping, eating, and living easier than ever, we’re collectively disrupting the multi-billion-dollar e-commerce industry from the ground up. We are one of the fastest-growing...


  • Mountain View, United States Motion Recruitment Full time

    An enterprise E-Commerce company located in Mountain View, California is looking for a Senior Staff Site Reliability Engineer to add to their growing team. This individual will be highly focused on building out observability tooling and scaling it across the organization. This will involve working cross-functionally with developer, infrastructure and SRE...

  • Senior Staff Engineer

    2 months ago


    Mountain View, United States PredictSpring Full time

    Job Overview As Senior Staff Engineer in the DevOps team, you will build and support applications and infrastructure enabling teams to configure, deploy, operate, and monitor the mission-critical services powering offered by the PredictSpring Cloud platform that serves the world's leading brands and retailers for their Modern POS Platform. You will work in a...


  • Mountain View, United States PredictSpring Full time

    Job Overview As Senior Staff Engineer in the DevOps team, you will build and support applications and infrastructure enabling teams to configure, deploy, operate, and monitor the mission-critical services powering offered by the PredictSpring Cloud platform that serves the worlds leading brands and retailers for their Modern POS Platform. You will work in a...

  • Senior Staff Engineer

    2 months ago


    Mountain View, United States PredictSpring Full time

    Job OverviewAs Senior Staff Engineer in the DevOps team, you will build and support applications and infrastructure enabling teams to configure, deploy, operate, and monitor the mission-critical services powering offered by the PredictSpring Cloud platform that serves the world's leading brands and retailers for their Modern POS Platform. You will work in a...

  • Senior Staff Engineer

    2 months ago


    Mountain View, United States PredictSpring Full time

    Job OverviewAs Senior Staff Engineer in the DevOps team, you will build and support applications and infrastructure enabling teams to configure, deploy, operate, and monitor the mission-critical services powering offered by the PredictSpring Cloud platform that serves the world's leading brands and retailers for their Modern POS Platform. You will work in a...

  • Senior Staff Engineer

    3 weeks ago


    Mountain View, United States PredictSpring Full time

    Job Overview As Senior Staff Engineer in the DevOps team, you will build and support applications and infrastructure enabling teams to configure, deploy, operate, and monitor the mission-critical services powering offered by the PredictSpring Cloud platform that serves the world's leading brands and retailers for their Modern POS Platform. You will work in a...

  • Senior Staff Engineer

    3 weeks ago


    Mountain View, United States PredictSpring Full time

    Job OverviewAs Senior Staff Engineer in the DevOps team, you will build and support applications and infrastructure enabling teams to configure, deploy, operate, and monitor the mission-critical services powering offered by the PredictSpring Cloud platform that serves the world's leading brands and retailers for their Modern POS Platform. You will work in a...


  • Mountain View, United States PredictSpring Full time

    Job OverviewAs Senior Staff Engineer in the DevOps team, you will build and support applications and infrastructure enabling teams to configure, deploy, operate, and monitor the mission-critical services powering offered by the PredictSpring Cloud platform that serves the world's leading brands and retailers for their Modern POS Platform. You will work in a...


  • Mountain View, United States PredictSpring Full time

    Job Overview As a Senior Staff Software Engineer - Front End, you will design and develop features on our CMS platform -- a platform on which the world's leading brands and retailers are building their Modern POS. You will play an integral role in shaping the direction of our product and bringing new features to market. Skills & Experience Strong knowledge...