Site Reliability Engineer

4 weeks ago


Los Angeles, California, United States eTek IT Services, Inc. Full time
Job OverviewAs a Site Reliability Engineer at eTek IT Services, Inc., you will play a vital role in ensuring the reliability, performance, and scalability of our infrastructure and applications. This is a unique opportunity to work with cutting-edge technology and make a significant impact on our organization's success.Key Responsibilities
  • Design and implement automation solutions to streamline processes and improve efficiency
  • Develop and maintain monitoring tools to ensure system health and performance
  • Participate in on-call rotations and handle incident response, troubleshooting, and resolution
  • Create and maintain scripts for operational tasks and automation
  • Conduct capacity planning and manage system scalability
  • Collaborate with development teams to improve system reliability and performance
  • Deploy and maintain cloud services and infrastructure
  • Define and implement service level objectives and indicators
  • Ensure security best practices are followed in all aspects of infrastructure and services
  • Perform system and application performance tuning and capacity forecasting
  • Conduct post-incident reviews and implement preventive measures
  • Participate in the design and implementation of disaster recovery plans
  • Document procedures, configurations, and processes
  • Contribute to the continuous improvement of processes and tools
Requirements
  • Bachelor's degree in Computer Science, Engineering, or a related field
  • Proven experience in a Site Reliability Engineer or similar role
  • Strong understanding of software development, system administration, and networking
  • Proficiency in scripting (e.g., Python, Shell, Perl)
  • Experience with monitoring and alerting tools (e.g., Nagios, Datadog, Prometheus)
  • Expertise in cloud services and infrastructure (e.g., AWS, GCP, Azure)
  • Knowledge of containerization and orchestration technologies (e.g., Docker, Kubernetes)
  • Experience with CI/CD pipelines and configuration management tools (e.g., Jenkins, Ansible)
  • Solid understanding of TCP/IP, HTTP, DNS, and other network protocols
  • Ability to analyze and troubleshoot complex systems and applications
  • Experience with incident management and on-call responsibilities
  • Familiarity with security best practices and tools
  • Excellent communication and collaboration skills
  • Certifications such as AWS Certified SysOps Administrator or Google Professional Cloud DevOps Engineer is a plus
  • Continuous learning and self-improvement mindset


  • Los Angeles, California, United States ICON Consultants, Inc. Full time

    Job Title: Site Reliability EngineerAt ICON Consultants, Inc., we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our datacenter infrastructure.Responsibilities:Data Monitoring and Alerting: Design and implement data...


  • Los Angeles, California, United States City National Bank Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer to join our team at City National Bank. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform.Key ResponsibilitiesImplement solutions that improve stability, security,...


  • Los Angeles, California, United States eTek IT Services, Inc. Full time

    Job OpportunityWe are seeking a highly skilled Site Reliability Engineer to join our team at eTek IT Services, Inc.Job SummaryAs a Site Reliability Engineer, you will be responsible for ensuring the high availability and scalability of our cloud-based systems. You will work closely with our development team to design, implement, and maintain reliable systems...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability Principal EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Principal Engineer to join our team. As a key member of our cloud operations team, you will be responsible for ensuring the reliability, scalability, and maximum uptime of our systems in the data center or cloud platform.Key...


  • Los Angeles, California, United States Loft Orbital Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Loft Orbital. As a key member of our Site Reliability team, you will be responsible for ensuring the reliability, scalability, and maintainability of our ground segment infrastructure.Key ResponsibilitiesCollaborate with development, operations, and IT teams to...


  • Los Angeles, California, United States City National Bank Full time

    Job Title: Site Reliability Principal EngineerAt City National Bank, we're seeking a highly skilled Site Reliability Principal Engineer to join our team. As a key member of our engineering organization, you will be responsible for ensuring the reliability, scalability, and maximum uptime of our software platforms.Key Responsibilities:Architect solutions to...


  • Los Angeles, California, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S.This new, security-first division was created to bring heightened focus and governance to our data protection policies and content assurance protocols...


  • Los Angeles, California, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok is a leading destination for short-form mobile video, inspiring creativity and bringing joy to users worldwide. Our mission is to empower creators and communities to express themselves authentically.Our TeamThe U.S. Data Security team is responsible for protecting sensitive data and information, ensuring the highest...


  • Los Angeles, California, United States Tik Tok Full time

    About the Role:This is a Site Reliability Engineer position, focusing on the data pipeline reliability for the Video Platform team in USDS.Data SREs monitor data and keep production batch and real-time processing jobs up and running with the highest level of availability, ensuring our users have the freshest, complete, and correct data...


  • Los Angeles, California, United States Abbott Laboratories company Full time

    About the RoleAbbott is a global healthcare leader that helps people live more fully at all stages of life. Our portfolio of life-changing technologies spans the spectrum of healthcare, with leading businesses and products in diagnostics, medical devices, nutritionals and branded generic medicines.As a Senior Site Reliability Engineer, you will play a...


  • Los Angeles, California, United States Tik Tok Full time

    About the RoleTikTok is the leading destination for short-form mobile video, and our mission is to inspire creativity and bring joy. As a Site Reliability Engineer on our Video Platform team, you will play a critical role in ensuring the reliability and stability of our video system, which provides excellent experiences for billions of users around the...


  • Los Angeles, California, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S., responsible for protecting user data and ensuring the security of the TikTok platform.ResponsibilitiesGain a deep understanding of the TikTok...


  • Los Angeles, California, United States ICON Consultants, Inc. Full time

    **Job Title:** Site Reliability Engineer - Datacenter Expert**Location:** Remote**Pay Rate:** $100/hour + benefits**Assignment Length:** 3-month W2 Contract**Industry:** TechnologyThe ideal candidate will have experience with system operations and running large-scale, massively distributed infrastructure.Responsibilities:Data monitoring and alerting, data...


  • Los Angeles, California, United States Tik Tok Full time

    About TikTok U.S. Data SecurityTikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security (USDS) is a subsidiary of TikTok in the U.S.Our focus is on providing oversight and protection of the TikTok platform and U.S. user data, so millions of Americans can continue turning to TikTok...

  • Reliability Engineer

    4 weeks ago


    Los Angeles, California, United States Apple Full time

    Job SummaryWe are seeking a skilled Reliability Engineer to join our Audio Hardware Reliability Engineering team at Apple. As a key member of our team, you will be responsible for ensuring the durability and reliability of our products.Your primary focus will be on developing and implementing creative reliability tests, quantifying reliability risk, and...

  • Reliability Engineer

    4 weeks ago


    Los Angeles, California, United States Apple Full time

    Job SummaryWe are seeking a highly skilled Reliability Engineer to join our Apple Audio Hardware Reliability Engineering team. As a key member of our team, you will be responsible for ensuring the durability and reliability of our audio hardware products.Key ResponsibilitiesDevelop and implement creative reliability tests on new hardware programsQuantify...

  • Reliability Engineer

    4 weeks ago


    Los Angeles, California, United States Blue Origin Full time

    Job SummaryBlue Origin is seeking a highly skilled Reliability Engineer - Engines & Avionics to join our team. As a key member of our Engines business unit, you will be responsible for developing and implementing reliability strategies to ensure the safe and efficient operation of our engines and avionics systems.Key Responsibilities:Develop and implement...


  • Los Angeles, California, United States Blue Origin Full time

    Job SummaryBlue Origin is seeking a highly skilled Senior Reliability Engineer to join our team in Seattle, WA. As a key member of our Engines business unit, you will be responsible for developing and implementing reliability solutions for our next-generation rockets.Key ResponsibilitiesIdentify and analyze reliability requirements for our engine control...


  • Los Angeles, California, United States Abbott Full time

    About the RoleAbbott is a global healthcare leader that helps people live more fully at all stages of life. Our portfolio of life-changing technologies spans the spectrum of healthcare, with leading businesses and products in diagnostics, medical devices, nutritionals and branded generic medicines.As a Senior Cloud Reliability Engineer, you will work onsite...


  • Los Angeles, California, United States Czinger Full time

    Job OverviewCzinger Vehicles is a pioneering company in the automotive industry, pushing the boundaries of innovation and sustainability. We're seeking a highly skilled Senior Vehicle Reliability Engineer to join our team and contribute to the development of high-performance, sustainable vehicles.Key ResponsibilitiesIdentify and resolve critical issues and...