Site Reliability Engineer

2 weeks ago


Los Angeles, United States Nexus IT Group Full time
Responsibilities:

  • Oversees and maintains the organization's installed systems, networks, and infrastructure to ensure that it complies with corporate policies and standard operating procedure (SOP).
  • Guarantees the maximum degree of system and infrastructure accessibility.
  • Installs, sets up, and tests system management tools, application software, and operating systems.
  • Create, schedule, and deploy network communications systems automatically.
  • As needed, provide documentation to guarantee that each site network has access to accurate and up-to-date information.
  • Carries out warranty and assistance tasks.
  • Arranges and carries out system automation as needed to improve productivity.
  • Manages the creation of specialized hardware and software specifications.
  • Willing to keep abreast on security best practices and guide their implementation
  • Works together with other experts to guarantee excellent deliverables that adhere to organizational standards, rules, and procedures.
  • To fix problems that customers have reported, run diagnostics.
  • Handles work processes, optimization techniques, and risk management tools in assigned projects to ensure effective completion in accordance with stakeholder needs.
Requirements:

  • Possess knowledge of automation tools such as Chef, Ansible, CFEngine, Puppet, or Salt
  • Strong scripting abilities (shell scripts, Perl, Ruby, and Python, for example)
  • At least three years of expertise setting up enterprise-level Linux systems in a highly networked setting.
  • Demonstrated proficiency setting up, configuring, and debugging Linux and UNIX-based infrastructures.
  • Excellent communication skills, both written and oral
  • Strong work ethic
  • Proven background in managing and optimizing the performance of application stacks (such as Tomcat, Apache, Nginx, HAProxy, or Envoy/Varnish).
  • Knowledge of virtualization and containerization, such as with KVM/QEmu, VMware, and Proxmox clusters.
  • Monitoring system (Prometheus, Zabbix, etc.) experience
  • A bachelor's or master's degree in computer science, engineering, or a similar field
  • Incident management response experience


  • Los Angeles, United States Metric LLC Full time

    Role DescriptionThis is a full-time remote role for a Site Reliability Engineer at Metric LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, availability, and performance of our systems and services. Your day-to-day tasks will include monitoring and troubleshooting production issues, designing and implementing...

  • Site Reliability Engineer

    Found in: Appcast US C2 - 2 weeks ago


    Los Angeles, United States Metric LLC Full time

    Role DescriptionThis is a full-time remote role for a Site Reliability Engineer at Metric LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, availability, and performance of our systems and services. Your day-to-day tasks will include monitoring and troubleshooting production issues, designing and implementing...

  • Site Reliability Engineer

    Found in: Appcast Linkedin GBL C2 - 2 weeks ago


    Los Angeles, United States Metric LLC Full time

    Role DescriptionThis is a full-time remote role for a Site Reliability Engineer at Metric LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, availability, and performance of our systems and services. Your day-to-day tasks will include monitoring and troubleshooting production issues, designing and implementing...


  • Los Angeles, United States BayOne Solutions Full time

    Position: Site Reliability Engineer Location: Los Angeles, CA Duration: 6+ Months Pay Range: $85/hr - 90/hr on W2 If your skills, experience, and qualifications match those in this job overview, do not delay your application. Site Reliability Engineer It is an exciting time to be part of SIE’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs...


  • Los Angeles, United States developrec Full time

    SRE Lead/Manager | San Diego, CA | Full-time Role Overview: As the Engineering Manager for Site Reliability, you'll lead the charge in transitioning to cloud-based solutions while ensuring the stability of our existing systems for our rapidly growing user base, currently standing at around one million. You'll spearhead our cloud infrastructure strategy...


  • Los Angeles, United States Insight Global Full time

    Duration: 6 month with possible extensionLocation: Remote but need to sit in CaliforniaWork Authorization: W2Must-haves- 3-5 years of experience working as a Site Reliability Engineer- 3+ years of experience working in an AWS environmento Specifically: EC2, S3 Buckets, EKS, Security Groups, and Cloud Formation- 3-5 years of experience building CI/CD...


  • Los Angeles, United States Insight Global Full time

    Duration: 6 month with possible extension Location: Remote but need to sit in California Work Authorization: W2 Must-haves - 3-5 years of experience working as a Site Reliability Engineer - 3+ years of experience working in an AWS environment o Specifically: EC2, S3 Buckets, EKS, Security Groups, and Cloud Formation - 3-5 years of experience building CI/CD...


  • Los Angeles, United States forhyre.com Full time

    Job Description Job Description Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas of development and are interested in continuing to improve our platform through the ever-changing technology landscape. To be successful in this role You'll have the opportunity to design and implement major infrastructure...

  • Civil Engineer

    4 days ago


    Los Angeles, United States DK Engineer Corp Full time

    Job DescriptionJob Description About the job: We are currently looking for junior level (0-3 years experience) and experienced engineers (4-8 years experience) to join our team. Civil Engineers are responsible for the design of Civil plans including erosion control (SWPPP), grading and drainage, wet utilities (water, sewer, storm drain), stormwater treatment...


  • Los Angeles, United States Forhyre Full time

    Job DescriptionJob DescriptionWe are looking for someone that is generalist at heart, one who is curious, appreciates complexity, knows or wants to learn when to step back and when to dive deep. We call this role a Cloud Service Reliability Engineer. The Cloud Service Reliability Engineer will be responsible for effective design, execution, and maintenance...

  • Uncapped Games

    1 week ago


    Los Angeles, United States Tencent Full time

    Work Mode: Onsite Responsibilities: Description: Seeking the opportunity to build a game from scratch and create a global impact? Uncapped Games is seeking a talented and enthusiastic Site Reliability Engineer to join our new AAA team. The ideal candidate is well-versed in operating and improving distributed online systems, eager to automate & optimize...

  • Mechanical Engineer, Reliability Engineering

    Found in: Jooble US O C2 - 3 weeks ago


    Los Angeles, CA, United States Lotte Chemical USA Corporation Full time

    Develops and/or reviews fixed equipment repair plans, rerates and modifications, including calculations and drawings § Performs Level 1 and 2 Fitness for Service per API 579 § Provides engineering support to maintenance organization executing repairs or improvements in Fixed Equipment § Provides engineering support to API Inspectors reviewing...

  • Senior Site Reliability Engineer, CORE

    Found in: beBee jobs US - 3 weeks ago


    Los Gatos, California, United States Netflix Full time

    "At Netflix, we strive to bring joy to people across the world through amazing stories. As we grow internationally, we are continually enhancing our cloud-based infrastructure to improve our performance, scalability, and reliability.The SRE team's goal is to ensure customer joy by successfully managing risk and minimizing impact across Netflix. We do this...

  • Senior Site Reliability Engineer, CORE

    Found in: beBee S US - 2 weeks ago


    Los Gatos, California, United States Netflix Full time

    "At Netflix, we strive to bring joy to people across the world through amazing stories. As we grow internationally, we are continually enhancing our cloud-based infrastructure to improve our performance, scalability, and reliability.The SRE team's goal is to ensure customer joy by successfully managing risk and minimizing impact across Netflix. We do this...


  • Los Angeles, United States Xscape Photonics Inc Full time

    We are seeking a skilled Laser Reliability and Failure Analysis Engineer to join our team. The successful candidate will be responsible for assessing the reliability of semiconductor lasers through various testing and analysis methods. Key responsibilities include performing Failure Modes and Effects Analysis (FMEA), defining reliability tests, and...


  • Los Angeles, United States Taleo Full time

    Careers That Change Lives In this exciting role as a Senior Test Engineer in the in the product design verification & reliability engineering group of the Mechanical Engineering department , you will be responsible for product verification and reliability test planning/ designing, developing, and implementing testing methods and equipment for new product...

  • Engineer - Instrument

    2 weeks ago


    Los Angeles, United States Westlake Chemical Corporation Full time

    SUMMARY Provides technical support to operations, maintenance, and capital project groups to ensure improvements in instrument reliability and to resolve repetitive instrument-related problems DUTIES AND RESPONSIBILITIES May include, but are not limited to, the following: Plans and directs work, provides technical training, establishes goals and objectives...

  • Platform Engineer

    2 weeks ago


    Los Angeles, United States Smile Full time

    Platform Engineer - Platform Guild Smile.io is the world’s largest loyalty platform, providing easy-to-use reward programs that help to scale ecommerce brands and transform one-time sales into repeat, loyal customers. Over 100,000 brands use Smile to turn transactional purchases into passionate repeat shoppers. Smile.io is seeking a highly skilled and...

  • Solar Site Surveyor

    7 days ago


    Los Angeles, United States US POWER Full time

    **About Us**: US Power Solar Company is seeking motivated and reliable individuals to join our team as Solar Site Surveyors. As a SunPower Elite dealer and a leading provider of solar energy solutions, we are dedicated to delivering high-quality services to our customers while making a positive impact on the environment. As a Solar Site Surveyor, you will...

  • Process Engineer

    5 days ago


    Los Angeles, United States Hiring Now! Full time

    Our client is currently seeking a PROCESS ENGINEER for their manufacturing facility in the Salt Lake City, UT area.RESPONSIBILITIES:*Work closely with the Operations Management team and Site Director to assess existing processes.*Ensure processes comply with safety and quality standards.*Perform process simulations and troubleshooting issues.*Perform...