Site Reliability Engineering

4 weeks ago


San Francisco, United States Forhyre Full time
Job Description

Forhyre is looking for engineers who can bring unique perspectives and innovative ideas to all areas of development and are interested in continuing to improve our platform through the ever-changing technology landscape.

To be successful in this role

  • You'll have the opportunity to design and implement major infrastructure components, systems, and developer-friendly capabilities to improve the availability, scalability, latency, and efficiency of our services
  • You will provide technical leadership to cross-functional engineering, infrastructure, and product teams, and evangelize cloud best practices while building a culture of reliability and observability
  • Engage in and improve the end-to-end lifecycle of software development, from inception and design through deployment, operation, and refinement of a highly distributed system running in the public cloud
  • Serve as a subject matter expert on the SRE mindset, best practices, and cloud-native principles
  • Scale systems sustainably through automation to improve reliability and velocity
  • Assist with all aspects of operational security and compliance
  • Run software performance analysis and system tuning
  • Design and implement tools to collect data from various sources and provide actionable insights
  • Participate in critical incident management and timely post-mortems of production incidents to drive practices around blameless analysis, resolution, and continuous improvement work with cross-functional teams
  • Develop the rest of the team by conducting code reviews, providing mentorship, pairing, and training opportunities

Qualifications & Skills

  • We are looking for a Principal SRE with proven experience running distributed systems at scale in production
  • You have 15+ years of relevant experience gained in the same or a similar role
  • Strong knowledge of container orchestration (preferably Kubernetes) and networking technology
  • Hands-on experience in one or more languages, such as Node.js, Python, Go, Perl, Ruby, and Bash
  • Experience with SOA, microservices architecture, API management, and enterprise system integrations
  • Strong production experience with cloud infrastructure (AWS, Azure, and Google Cloud)
  • Strong sense of ownership, and an ability to drive tasks to completion
  • Experience developing and monitoring distributed systems
  • Experience working in an Agile environment, with great collaboration skills


  • San Francisco, United States Vertisystem Full time

    Duration: 6 months contract Pay rate: $90/hr on W2 Job Summary: It is an exciting time to be part of the organization’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make the organization highly reliable, scalable, operable and...


  • San Francisco, United States Apollo Solutions Full time

    Principal Site Reliability Engineer Apollo Solutions have partnered with a groundbreaking Fintech start-up backed by top tier venture capital. They are looking to significantly disrupt how we view, store and invest our personal finance and have already made significant waves in the industry. The Principal Site Reliability Engineer will be working closely...


  • San Francisco, United States Resource Informatics Group Full time

    Job Title: Site Reliability Engineer. Work Location: San Francisco, CA (Hybrid after showing successful engagement). Duration: 18+ months. Most important skills: 10 years of Oracle database administration experience on large production environments; hands-on database skills, especially around database and system troubleshooting and administration; GoldenGate setup,...


  • San Francisco, United States Cypress HCM Full time

    Site Reliability Engineer (Grafana). Responsibilities: Collaborate with Service Owners and Observability Leaders to develop a strategy for monitoring the technology stack using Grafana. Initiate data ingestion by deploying Telegraf and exporters (if necessary), utilizing discovery to feed data into Grafana Mimir. Establish initial...


  • San Francisco, California, United States Observable Full time

    Observable is seeking a full-time infrastructure and site reliability engineer to help improve, administer, and grow Observable systems as we scale to meet our customers' needs. What you will do: Perform site reliability and ops work for Observable production and staging environments (manage servers, tweak WAF rules, optimize SQL queries, and more). Design and...


  • San Francisco, United States hims & hers Full time

    About the Role: We are seeking a Site Reliability Engineer to help build a reliable web experience for our users. We believe that moving fast is our competitive advantage, and enables us to better serve our users. We also know that the faster we move, the more likely we are to break things. You Will: Design and implement SRE practices ensuring availability,...


  • San Francisco, United States Alembic Limited Full time

    About Us Alembic applies cutting-edge algorithms and composite AI solutions to provide a new approach for marketing data analytics. Unlike tools that only provide correlation, only Alembic provides true causation, giving organizations across sector and industry the ability to quantify the value of every marketing activity and maximize future marketing...


  • San Francisco, United States Indotronix International Corporation Full time

    Pay Rate: W2 rate $61.75. Looking in the PST time zone; preferred to be local to SF and willing to go into the office occasionally, but okay with remote (needs to have a high work ethic!). Lead DevOps/Site Reliability Engineer: looking for a resource more senior in the DevOps space, with a leaning toward site reliability engineering. Docker containers,...


  • San Diego, United States TalentBurst Full time

    SENIOR SITE RELIABILITY ENGINEER. Location: San Diego, CA 92127 - 100% onsite (San Diego site preferred, open to other sites located in San Francisco 94107, San Mateo 94404, Los Angeles 90045 or Aliso Viejo 92656). Duration: 6 months. W2 acceptable. It is an exciting time to be part of Continuous Integration/Continuous Deployment (CI/CD) and Cloud Site...


  • San Diego, United States ACL Digital Full time

    W2 Contract/ Local candidates only Job Title: Site Reliability Engineer Location: San Diego, CA (Open to other locations in California) Job Description: It is an exciting time to be part of SIE’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE...


  • San Diego, United States ACL Digital Full time

    W2 Contract/ Local candidates only. Job Title: Site Reliability Engineer. Location: San Diego, CA (Open to other locations in California). Is this the role you are looking for? If so, read on for more details, and make sure to apply today. Job Description: It is an exciting time to be part of SIE’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs...


  • San Jose, United States Myriad Consulting Inc Full time

    This role is also open to junior (3+ yoe) candidates and SRE leads (7+ yoe). The Site Reliability Engineering (SRE) team combines software and systems engineering to build and run large-scale, massively distributed, and fault-tolerant systems. In our team, you'll have the opportunity to manage the complex challenges of scale, while using expertise in coding,...


  • San Diego, United States Talent Software Services Full time

    Site Reliability Engineer - Senior (NE) Job Summary: Talent Software Services is in search of a Site Reliability Engineer - Senior (NE) for a contract position in San Diego, CA. The opportunity will be one year with a strong chance for a long-term extension. Po...


  • San Francisco, California, United States Zetachain Full time

    We are seeking a Sr. Site Reliability Engineer to join our team and run critical infrastructure for our blockchain and web applications. You'll learn to deploy and maintain a fleet of RPC and validator nodes for multiple blockchain networks. You'll also provide guidance and expertise to development teams to ensure their applications follow modern best...


  • San Diego, United States PEAK Technical Staffing USA Full time

    Hiring a Senior Site Reliability Engineer; primary responsibilities will include contributing to the implementation and delivery of the end-to-end automation platform, to support continuous integration and continuous delivery (CI/CD), with a focus on developer self-service capabilities. NOTE: Must have build-out experience with Kubernetes. This position...

