Site Reliability Engineer

4 weeks ago


Los Angeles, California, United States eTek IT Services, Inc. Full time

Job Description

At eTek IT Services, Inc., we are seeking a highly skilled Site Reliability Engineer - Cloud Infrastructure Expert to join our team. This role is crucial in ensuring the reliability, scalability, and performance of our infrastructure and applications.

Key Responsibilities:

  • Design and implement monitoring and alerting systems to ensure high availability and performance of services.
  • Develop automation tools for system provisioning, configuration management, and application deployment.
  • Collaborate with cross-functional teams to ensure that new software and systems are production-ready.
  • Perform capacity planning and manage infrastructure capacity efficiently.
  • Conduct root cause analysis of production issues and implement preventive measures.
  • Participate in on-call rotations and respond to system emergencies.
  • Ensure compliance with security and regulatory standards in all aspects of the infrastructure.
  • Contribute to the continuous improvement of the reliability and performance of systems and applications.
  • Implement best practices for cloud infrastructure and services.
  • Lead initiatives to optimize system performance and stability.
  • Conduct periodic testing of disaster recovery and failover systems.
  • Document system configurations, processes, and procedures.
  • Assist in evaluating new technologies and methods to improve reliability and performance.

Required Qualifications:

  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • 3+ years of experience in a site reliability engineering role.
  • Proficiency in Linux system administration and troubleshooting.
  • Strong programming skills in Python, Shell scripting, or other scripting languages.
  • Experience with cloud platforms such as AWS, GCP, or Azure.
  • Expertise in building and maintaining scalable, high-performance systems.
  • Knowledge of containerization and orchestration technologies (Docker, Kubernetes).
  • Hands-on experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK).
  • Ability to design and implement automated solutions for infrastructure and application deployment.
  • Excellent troubleshooting and problem-solving skills.
  • Understanding of networking concepts and protocols.
  • Strong communication and collaboration skills.
  • Relevant certifications (e.g., AWS Certified DevOps Engineer, Google Professional Cloud DevOps Engineer) a plus.



  • Los Angeles, California, United States City National Bank Full time

    Job Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at City National Bank. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and maximum uptime of our systems in the Data Center or Cloud Platform. Key Responsibilities: Implement solutions that improve stability, security,...


  • Los Angeles, California, United States Tik Tok Full time

    About the Role: This is a Site Reliability Engineer position, focusing on the data pipeline reliability for the Video Platform team in USDS. Data SREs monitor data and keep production batch and real-time processing jobs up and running with the highest level of availability, ensuring our users have the freshest, complete, and correct data...


  • Los Angeles, California, United States Tik Tok Full time

    About the Role: TikTok is the leading destination for short-form mobile video, and our mission is to inspire creativity and bring joy. As a Site Reliability Engineer on our Video Platform team, you will play a critical role in ensuring the reliability and stability of our video system, which provides excellent experiences for billions of users around the...


  • Los Angeles, California, United States ICON Consultants, Inc. Full time

    **Job Title:** Site Reliability Engineer - Datacenter Expert. **Location:** Remote. **Pay Rate:** $100/hour + benefits. **Assignment Length:** 3-month W2 Contract. **Industry:** Technology. The ideal candidate will have experience with system operations and running large-scale, massively distributed infrastructure. Responsibilities: Data monitoring and alerting, data...

  • Reliability Engineer

    1 month ago


    Los Angeles, California, United States Blue Origin Full time

    Job Summary: Blue Origin is seeking a highly skilled Reliability Engineer - Engines & Avionics to join our team. As a key member of our Engines business unit, you will be responsible for developing and implementing reliability strategies to ensure the safe and efficient operation of our engines and avionics systems. Key Responsibilities: Develop and implement...


  • Los Angeles, California, United States Blue Origin Full time

    Job Summary: Blue Origin is seeking a highly skilled Senior Reliability Engineer to join our team in Seattle, WA. As a key member of our Engines business unit, you will be responsible for developing and implementing reliability solutions for our next-generation rockets. Key Responsibilities: Identify and analyze reliability requirements for our engine control...


  • Los Angeles, California, United States Abbott Full time

    About the Role: Abbott is a global healthcare leader that helps people live more fully at all stages of life. Our portfolio of life-changing technologies spans the spectrum of healthcare, with leading businesses and products in diagnostics, medical devices, nutritionals and branded generic medicines. As a Senior Cloud Reliability Engineer, you will work onsite...


  • Los Angeles, California, United States Czinger Full time

    Job Overview: Czinger Vehicles is a pioneering company in the automotive industry, pushing the boundaries of innovation and sustainability. We're seeking a highly skilled Senior Vehicle Reliability Engineer to join our team and contribute to the development of high-performance, sustainable vehicles. Key Responsibilities: Identify and resolve critical issues and...

  • Reliability Engineer

    4 weeks ago


    Los Angeles, California, United States Apple Full time

    Job Summary: As a Reliability Engineer at Apple, you will be responsible for ensuring the durability and reliability of our audio hardware products. This involves partnering with diverse engineering teams to develop and implement creative reliability tests, quantify reliability risk, and advise the executive team on the best path forward. Key Responsibilities...


  • Los Angeles, California, United States Omni Inclusive Full time

    The Senior Systems Reliability Engineer at Omni Inclusive will play a crucial role in elevating SRE practices, promoting new technologies, and solving complex problems. This position requires a software engineering approach to architect, design, automate, monitor, and build applications at scale. This includes operating and engineering software with close...

  • Reliability Engineer

    4 weeks ago


    Los Angeles, California, United States Blue Origin Full time

    Reliability Engineer - Space Propulsion Systems: At Blue Origin, we're pushing the boundaries of space exploration and development. As a Reliability Engineer - Space Propulsion Systems, you'll play a critical role in ensuring the reliability and safety of our space propulsion systems. Your expertise will help us design, develop, and test engines and propulsion...

  • Reliability Engineer

    4 weeks ago


    Los Angeles, California, United States Blue Origin Full time

    Job Summary: Blue Origin is seeking a highly skilled Reliability Engineer to join our team in Seattle, WA. As a key member of our Engines business unit, you will be responsible for designing, developing, and testing engines and propulsion systems for commercial, civil, national security, and human spaceflight applications. Key Responsibilities: Identify...


  • Los Angeles, California, United States Blue Origin Full time

    Space System Reliability Specialist: At Blue Origin, we're working to develop reusable, safe, and low-cost space vehicles and systems. As a Space System Reliability Specialist, you'll be part of the Safety, Quality, and Mission Assurance team, focused on monitoring and assessing processes that guide Blue Origin's design, manufacturing, and operations. This role...


  • Los Angeles, California, United States KPFF Consulting Engineers Full time

    Job Description: Experienced Civil Engineer - Site Development Projects. KPFF Consulting Engineers is seeking a motivated and experienced civil engineer to join our team. The selected candidate will be responsible for civil engineering design of site development projects, including rough and precise grading, water, sewer, storm drain, hydrology and hydraulic...


  • Los Angeles, California, United States Dunhill Professional Search Full time

    Job Summary: The Dunhill Professional Search team is seeking a skilled Reliability Engineer to join our team. As a key member of our Technical Operations team, you will be responsible for developing and maintaining tools, alerts, and dashboards to support the monitoring of application health and performance. This role requires a strong analytical mindset and...

  • Reliability Manager

    4 weeks ago


    Los Angeles, California, United States NBCUniversal Full time

    Job Summary: The Maintenance Reliability Manager provides critical support to ensure the reliability and maintainability of theme park attractions, driving uptime and equipment availability. This role involves providing technical expertise to the maintenance team, leading Root Cause Failure Analysis, and implementing corrective actions to address complex...

  • IT Network Engineer

    1 month ago


    Los Angeles, California, United States Crystal Stairs Full time

    Job Title: IT Network Engineer. Crystal Stairs, Inc. is seeking an experienced IT Network Engineer to support our network infrastructure. The ideal candidate will ensure network connectivity is highly available, reliable, and secure. Responsibilities: Manage the agency's LAN/WAN network infrastructure across all remote sites. Configure routers and switches to...


  • Los Angeles, California, United States iHeartMedia Full time

    Job Title: Regional Broadcast Engineer. We are seeking a highly skilled Regional Broadcast Engineer to join our team at iHeartMedia. As a Regional Broadcast Engineer, you will be responsible for designing, configuring, deploying, and maintaining various types of electronic equipment for optimum transmission and/or broadcast performance with scalability,...


  • Los Angeles, California, United States Netflix Full time

    At Netflix, we're looking for a highly skilled and motivated leader to strengthen the reliable delivery of games on our service. This role is part of the Game Operations team, where our mission is to support the Games business by identifying opportunities and establishing best-in-class infrastructure and processes. In the Game Reliability Manager role, you'll...


  • Los Angeles, California, United States Xylem Full time

    Xylem is seeking an experienced Process Engineer to optimize industrial water treatment plants in the Western US area. This role involves troubleshooting and optimizing complex water and waste water treatment processes by working with operations personnel and customers, reviewing process data, performing pilots and experiments to interpret and understand the...