Site Reliability Engineer

3 weeks ago


San Francisco, United States Telestream, LLC Full time
Job DescriptionJob Description

About Us:


Welcome to the forefront of innovation at Telestream, an industry leading digital video delivery company. We are a dynamic and forward-thinking organization committed to leveraging cutting-edge cloud technologies to drive our success. If you're ready to be part of a collaborative team and contribute to our journey of continuous innovation and growth, we encourage you to apply.


Job Description:


As a Cloud Infrastructure Engineer, you will play a pivotal role in maintaining, enhancing, and modernizing our highly scalable cloud infrastructure. We value individuals who excel technically and thrive in an innovative and collaborative environment. This role will focus on enhancing the reliability, scalability, and security of our cloud infrastructure.


Necessary Qualifications:


  • Proficiency in managing and optimizing cloud environments, with hands-on experience running AWS in production environments.
  • Expertise in implementing Infrastructure as Code using tools such as Ansible, Terraform, and Terragrunt, for the management and provisioning of cloud systems.
  • Proficiency in configuring and managing CI/CD pipelines supporting DevOps and all stages of the software life-cycle.
  • Hands-on experience in Container technologies including Kubernetes, Docker, and Helm.
  • Practical experience with monitoring tools and alerting systems.
  • Knowledge of security best practices, including IAM, encryption protocols, vulnerability assessment, and compliance.
  • Expertise in networking concepts including TCP/IP, routing, DNS, load balancing, and security protocols
  • Strong coding and scripting skills

Responsibilities:


  • Ensure the ongoing reliability, performance, and security of existing project infrastructure.
  • Address maintenance needs and implement improvements in collaboration with multiple teams.
  • Proactively monitor, respond to, and resolve infrastructure issues.
  • Participate in an on-call rotation to provide timely support and minimize downtime.
  • Design, implement, and optimize modern cloud infrastructure to support growing needs.
  • Implement SRE and DevOps best practices to enhance scalability, reliability, and efficiency.
  • Create and maintain comprehensive and effective documentation
  • Implement and uphold security best practices to safeguard infrastructure assets
  • Establish and maintain effective monitoring systems to analyze system metrics, optimize performance, and troubleshoot issues.
  • Monitor cloud infrastructure costs and identify opportunities to optimize cloud resources
  • Lead migration, deployment and upgrade efforts to transition to new infrastructure while avoiding business impact.


Job Posted by ApplicantPro


  • San Francisco, United States Vertisystem Full time

    Duration: 6 months contractPay rate: $90/hr on W2Job Summary:It is an exciting time to be part of the organization’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make the organization highly reliable, scalable, operable and...


  • San Francisco, United States Vertisystem Full time

    Duration: 6 months contractPay rate: $90/hr on W2Job Summary:It is an exciting time to be part of the organization’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make the organization highly reliable, scalable, operable and...


  • San Francisco, United States Vertisystem Full time

    Duration: 6 months contract Pay rate: $90/hr on W2 Job Summary: It is an exciting time to be part of the organizations CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make the organization highly reliable, scalable, operable and...


  • San Francisco, California, United States Observable Full time

    Observable is seeking a full-time infrastructure and site reliability engineer to help improve, administrate, and grow Observable systems as we scale to meet our customer's needs.What you will doPerform site reliability and ops work for Observable production and staging environments. (Manage servers Tweak WAF rules Optimize SQL queries And more)Design and...


  • San Francisco, United States hims & hers Full time

    About the Role: We are seeking a Site Reliability Engineer to help build a reliable web experience for our users. We believe that moving fast is our competitive advantage, and enables us to better serve our users. We also know that the faster we move, the more likely we are to break things. You Will: Design and implement SRE practices ensuring availability,...


  • San Diego, United States ObjectWin Technology Full time

    Job Title: Site Reliability Engineer Location: San Diego, CA or Remote in CA Duration: 6 Months Description: It is an exciting time to be part of SIEs CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make PlayStation highly reliable,...


  • San Diego, United States ACL Digital Full time

    W2 Contract/ Local candidates only Job Title: Site Reliability Engineer Location: San Diego, CA (Open to other locations in California) Job Description: It is an exciting time to be part of SIEs CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team...


  • San Diego, United States ACL Digital Full time

    W2 Contract/ Local candidates onlyJob Title: Site Reliability EngineerLocation: San Diego, CA (Open to other locations in California)Job Description:It is an exciting time to be part of SIE’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team...


  • San Diego, United States ACL Digital Full time

    W2 Contract/ Local candidates onlyJob Title: Site Reliability EngineerLocation: San Diego, CA (Open to other locations in California)Job Description:It is an exciting time to be part of SIE’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team...


  • San Diego, United States ACL Digital Full time

    W2 Contract/ Local candidates only Job Title: Site Reliability Engineer Location: San Diego, CA (Open to other locations in California) Is this the role you are looking for If so read on for more details, and make sure to apply today. Job Description: It is an exciting time to be part of SIE’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs...


  • San Diego, United States Talent Software Services Full time

    Site Reliability Engineer - Senior (NE) Job Summary: Talent Software Services is in search of a Site Reliability Engineer - Senior (NE) for a contract position in San Diego, CA. The opportunity will be one year with a strong chance for a long-term extension. Po...


  • San Francisco, United States Best Secret Full time

    About BestSecretGroup We are a leading European members-only online destination for premium and luxury off-price fashion. Partnering with over 3,000 international brands, our tech-focused mindset and strong commitment to sustainability drives a truly unique experience for our members. With almost 100 years of experience behind us, and a major tech...


  • San Diego, United States PEAK Technical Staffing USA Full time

    Hiring Senior Site Reliability Engineer; primary responsibilities will include contributing to the implementation and delivery of the end-to-end automation platform, to support continuous integration and continuous delivery (CI/CD), with a focus on developer self-service capabilities. NOTE: Must have build out experience with Kubernetes. This position...


  • San Diego, United States PEAK Technical Staffing USA Full time

    Hiring Senior Site Reliability Engineer;primary responsibilities will include contributing to the implementation and delivery of the end-to-end automation platform, to support continuous integration and continuous delivery (CI/CD), with a focus on developer self-service capabilities. NOTE: Must have build out experience with Kubernetes.This position...


  • San Diego, United States Talent Software Services Full time

    Site Reliability Engineer - Senior (NE) Job Summary: Talent Software Services is in search of a Site Reliability Engineer - Senior (NE) for a contract position in San Diego, CA. The opportunity will be one year with a strong chance for a long-term extension. Position Summary: As a member of the CICD and Cloud Reliability team you'll work at the heart of the...


  • San Francisco, United States Sunrun Full time

    Everything we do at Sunrun is driven by a determination to transform the way we power our lives. We know that starts at the individual employee level. We strive to foster an environment you can thrive in through our commitment to diversity, inclusion and belonging. A renewable energy revolution is beginning to blossom into the world's largest industrial...


  • San Diego, CA, United States Talent Software Services Full time

    Site Reliability Engineer - Senior (NE) Job Summary: Talent Software Services is in search of a Site Reliability Engineer - Senior (NE) for a contract position in San Diego, CA. The opportunity will be one year with a strong chance for a long-term extension. Position Summary: As a member of the CICD and Cloud Reliability team you'll work at the heart of...


  • San Diego, CA, United States Talent Software Services Full time

    Site Reliability Engineer - Senior (NE) Job Summary: Talent Software Services is in search of a Site Reliability Engineer - Senior (NE) for a contract position in San Diego, CA. The opportunity will be one year with a strong chance for a long-term extension. Position Summary: As a member of the CICD and Cloud Reliability team you'll work at the heart of...


  • San Francisco, United States Geico - Government Employees Insurance Company Full time

    Have strong technical expertise and leadership, you are able to lead from the trenches and have proven knowledge in your field Be able to drive infrastructure as code and show proficiency in field-appropriate programming languages, lead by example ?W Reliability Engineer, Manager, Liability, Network Operations, Reliability, Engineer, Technology, Insurance

  • Reliability Engineer

    3 weeks ago


    San Francisco, United States OpenAI Full time

    Join the engineering teams that bring OpenAI's ideas safely to the world!! The Applied Engineering team works across research, engineering, product, and design to bring OpenAI's technology to consumers and businesses. We seek to learn from deployment and distribute the benefits of AI, while ensuring that this powerful tool is used responsibly and safely....