Site Reliability Engineer

3 hours ago


Mountain View, California, United States Samsung Electronics America North America Full time
Site Reliability Engineer - DevOps Infrastructure

At Samsung Ads, we're transforming the advertising landscape with cutting-edge technology. As a Site Reliability Engineer - DevOps Infrastructure, you'll play a crucial role in ensuring the reliability, scalability, and performance of our advertising technology platform.

Key Responsibilities:
  • Design and implement scalable and resilient infrastructure solutions for our advertising technology platform.
  • Collaborate with development teams to integrate DevOps best practices into the software development lifecycle.
  • Implement and maintain CI/CD pipelines to automate software delivery and deployment processes.
  • Monitor, troubleshoot, and optimize system performance to ensure high availability and reliability.
  • Estimate capacity requirements and project future growth.
  • Work closely with security teams to implement and enforce best practices for infrastructure security.
  • Participate in on-call rotations to provide 24/7 support for critical systems.
  • Continuously evaluate and implement new technologies to enhance the efficiency of our infrastructure.
  • Serve as the infrastructure and operations subject matter expert for the development team.
  • Plan for future capacity and growth, including disaster recovery and business continuity planning (BCP).

Qualifications:
  • Typically requires at least 8 years of related experience and a Bachelor's degree; or 6 years and a Master's degree; or a PhD with 3 years.
  • Strong understanding of cloud technologies (e.g., AWS, Azure, GCP) and expertise managing cloud native applications on Kubernetes.
  • Proven experience with containerization and orchestration tools (e.g., Docker, Kubernetes).
  • Expertise in automation and scripting (e.g., Terraform, Ansible, Python) and infrastructure as code (IaC) tools.
  • Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
  • Excellent communication and leadership skills.
  • Knowledge of ad tech is preferred.

At Samsung Ads, we're committed to creating a dynamic and innovative work environment that fosters growth and collaboration. If you're passionate about ensuring the reliability and scalability of cutting-edge advertising technology, we invite you to join our team.

Compensation for this role is expected to be between $180,000 and $200,000. Actual pay will be determined considering factors such as relevant skills and experience, and comparison to other employees in the role.



  • Mountain View, California, United States Optomi Full time

    Job Title: Site Reliability Engineer. Optomi, in partnership with a large consulting firm, is seeking an experienced Site Reliability Engineer for their remote team. This position requires a versatile, highly motivated individual capable of supplying frontline technical and operational support to our Site Reliability teams. As a vital part of the Reliability...


  • Mountain View, California, United States Moveworks Full time

    About Moveworks: Moveworks is a leading AI startup that provides a universal AI copilot for search and automation across all business applications. Our mission is to empower employees to work faster and more efficiently by eliminating repetitive support issues and delivering instant knowledge. Job Description: We are seeking a highly skilled Site Reliability...


  • Mountain View, California, United States Moveworks Full time

    About Moveworks: Moveworks is a leading AI-powered automation platform that helps businesses streamline their operations and improve employee productivity. Our innovative technology enables employees to find information and get support in one place, reducing costs and increasing efficiency. Job Description: We are seeking a highly skilled Site Reliability...


  • Mountain View, California, United States Atlassian Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Atlassian. As a Site Reliability Engineer, you will play a critical role in ensuring the performance, reliability, and scalability of our cloud-based services. Responsibilities: Design, implement, and maintain scalable and reliable cloud infrastructure. Collaborate with...


  • Mountain View, California, United States Optomi Full time

    Optomi's Site Reliability Engineer Opportunity: We are seeking a skilled Site Reliability Engineer to join our team at Optomi, in partnership with a large consulting firm. This role requires a versatile and highly motivated individual who can provide frontline technical and operational support to our Site Reliability teams. Key Responsibilities: Collaborate with...


  • Mountain View, California, United States TikTok Full time

    About the Role: We are seeking a skilled Site Reliability Engineer to join our Applied Machine Learning (AML) team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining highly available, scalable, and fault-tolerant systems. Responsibilities: Design and develop large-scale systems that meet the needs of our AML...


  • Mountain View, California, United States Synopsys Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our Platform Team at Synopsys. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our engineering environment. You will work closely with our development teams to design, implement, and operate scalable and efficient...


  • Mountain View, California, United States Groq Full time

    Job Title: Principal Site Reliability Engineer. We are seeking a highly skilled Principal Site Reliability Engineer to join our team at Groq. As a Principal Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our tools and services for provisioning and managing the full lifecycle of Groq hardware and...


  • Site Reliability Engineer

    38 minutes ago


    Mountain View, California, United States Atlassian Full time

    About the Role: We're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the performance and reliability of our services. You will work closely with our teams to identify and resolve issues, and develop solutions to improve our systems. Key Responsibilities: Investigate...


  • Mountain View, California, United States TikTok Full time

    Job Title: Site Reliability Engineer, Edge. At TikTok, we're committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe, and so does our workplace. About the Role: We're seeking a highly skilled Site Reliability Engineer to join our Edge team. As a...


  • Mountain View, California, United States TikTok Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our AML team, where you will play a critical role in designing, building, and maintaining highly available, scalable, and fault-tolerant systems. Responsibilities: Design and develop large-scale systems that meet the needs of our users. Monitor and analyze system performance,...


  • Mountain View, California, United States Groq Full time

    Job Title: Senior Site Reliability Engineer. We are seeking a highly skilled Senior Site Reliability Engineer to join our team at Groq. As a key member of our infrastructure operations team, you will be responsible for ensuring the reliability, scalability, and performance of our tools and services. Key Responsibilities: Design and implement scalable and...


  • Mountain View, California, United States Groq Full time

    Unlock the Power of AI with Groq: We're on a mission to democratize access to AI, and we need your expertise to make it happen. As a Senior Site Reliability Engineer at Groq, you'll play a critical role in ensuring the reliability, scalability, and performance of our tools and services. Key Responsibilities: Design and implement scalable and reliable...


  • Mountain View, California, United States Moveworks Full time

    About Moveworks: Moveworks is a leading AI startup that provides a universal AI copilot for search and automation across all business applications. Our mission is to empower employees to work faster and more efficiently by eliminating repetitive support issues and delivering instant knowledge. Job Description: We are seeking a highly skilled Staff Site...


  • Mountain View, California, United States TikTok Full time

    About the Role: We are seeking a skilled Site Reliability Engineer to join our AML team, where you will play a critical role in designing, building, and maintaining highly available, scalable, and fault-tolerant systems. Responsibilities: Design and implement large-scale systems to ensure high availability and scalability. Monitor and analyze system performance,...


  • Mountain View, California, United States TikTok Full time

    About TikTok U.S. Data Security: TikTok is a leading destination for short-form mobile video, inspiring creativity and bringing joy to millions of users worldwide. Our mission is to empower creators and communities to express themselves authentically, while ensuring the security and integrity of our platform. Job Summary: We are seeking a highly skilled Site...


  • Mountain View, California, United States Groq Full time

    Unlock the Power of AI with Groq: At Groq, we're revolutionizing the AI economy by making processing power more accessible, faster, and more affordable. Our Language Processing Unit (LPU) outpaces the GPU in speed, power, efficiency, and cost-effectiveness, empowering a world where AI is universally accessible. Join Our Mission: We're seeking a Senior Site...