Site Reliability Engineer

4 weeks ago


San Ramon, United States LaSalle Network Full time

LaSalle Network has partnered with a well-established software provider that's based in San Ramon, CA, who's in need of a well-rounded, Site Reliability Engineer (SRE) - Grafana Observability - with a strong background in Grafana and related tools such as Prometheus and Telegraf. The ideal candidate will play a crucial role in accelerating the transition of the observability stack into the Grafana ecosystem.
This role is a contract opportunity with the potential to extend or convert to a full time employee, and is operating under a remote work model.

Site Reliability Engineer (SRE) - Grafana Observability Responsibilities:

  • Collaborate with Service owners and Observability leaders to devise a strategy for monitoring the technology stack using Grafana
  • Deploy Telegraf and exporters as necessary and utilize discovery to ingest data into Grafana Mimir
  • Create alert rules and enable alerting in Grafana through self-service
  • Develop initial dashboards to monitor the health and capacity of services
  • Provide documentation and training to service owners for a smooth handover


Site Reliability Engineer (SRE) - Grafana Observability Requirements:

  • 3+ years of experience as a Site Reliability Engineer utilizing the Grafana platform
  • Proficiency in Grafana, including dashboarding best practices and writing for widgets and alert rules
  • Familiarity with Grafana Mimir or equivalent tools like Thanos or Cortex
  • Strong expertise in Prometheus and Telegraf
  • Experience with Ansible for writing playbooks and deploying/configuring services
  • Proficiency in using Git (GitLab) for managing self-services as code
  • Broad knowledge of various technology stacks and transitioning monitoring to the Grafana ecosystem
  • Experience working with modern operating systems such as Centos and Ubuntu


If you are interested in this opportunity and meet the qualifications, please apply today

Thank you,

Branden Luna
Team Lead - Technology Services
LaSalle Network

LaSalle Network is an Equal Opportunity Employer m/f/d/v.

LaSalle Network is the leading provider of direct hire and temporary staffing services. For over two decades, LaSalle has helped organizations hire faster and connect top talent with opportunities, from entry-level positions to the C-suite. With units specializing in Accounting and Finance, Administrative, Marketing, Technology, Supply chain, Healthcare Revenue Cycle, Call Center, Human Resources and Executive Search. LaSalle offers staffing and recruiting solutions to companies of all sizes and across all industries.

LaSalle Network is the premier staffing and recruiting firm, earning over 100 culture, revenue and industry-based awards from major publications and having its company experts regularly contribute insights on retention strategies, hiring trends and hiring challenges, and more to national news outlets. LaSalle Network offers temporary Field Employees benefit plans including medical, dental and vision coverage. Family Medical Leave, Worker's compensation, Paid Leave and Sick Leave are also provided. View a full list of our benefits here:

LNPW



  • San Ramon, California, United States LaSalle Network Full time

    LaSalle Network has partnered with a well-established software provider that's based in San Ramon, CA, who's in need of a well-rounded, Site Reliability Engineer (SRE) – Grafana Observability – with a strong background in Grafana and related tools such as Prometheus and Telegraf. The ideal candidate will play a crucial role in accelerating the transition...


  • San Ramon, United States The LaSalle Group Full time

    LaSalle Network has partnered with a well-established software provider that's based in San Ramon, CA, who's in need of a well-rounded, Site Reliability Engineer (SRE) - Grafana Observability - with a strong background in Grafana and related tools such as Prometheus and Telegraf. The ideal candidate will play a crucial role in accelerating the transition of...


  • San Ramon, United States The LaSalle Group Full time

    LaSalle Network has partnered with a well-established software provider that's based in San Ramon, CA, who's in need of a well-rounded, Site Reliability Engineer (SRE) - Grafana Observability - with a strong background in Grafana and related tools such as Prometheus and Telegraf. The ideal candidate will play a crucial role in accelerating the transition of...


  • San Ramon, United States LaSalle Network Full time

    LaSalle Network has partnered with a well-established software provider that’s based in San Ramon, CA, who’s in need of a well-rounded, Site Reliability Engineer (SRE) – Grafana Observability – with a strong background in Grafana and related tools such as Prometheus and Telegraf. The ideal candidate will play a crucial role in accelerating the...


  • San Ramon, United States LaSalle Network Full time

    LaSalle Network has partnered with a well-established software provider that’s based in San Ramon, CA, who’s in need of a well-rounded, Site Reliability Engineer (SRE) – Grafana Observability – with a strong background in Grafana and related tools such as Prometheus and Telegraf. The ideal candidate will play a crucial role in accelerating the...


  • San Ramon, United States LaSalle Network Full time

    LaSalle Network has partnered with a well-established software provider that's based in San Ramon, CA, who's in need of a well-rounded, Site Reliability Engineer (SRE) - Grafana Observability - with a strong background in Grafana and related tools such as Prometheus and Telegraf. The ideal candidate will play a crucial role in accelerating the transition of...


  • San Ramon, United States LaSalle Network Full time

    LaSalle Network has partnered with a well-established software provider that's based in San Ramon, CA, who's in need of a well-rounded, Site Reliability Engineer (SRE) - Grafana Observability - with a strong background in Grafana and related tools such as Prometheus and Telegraf. The ideal candidate will play a crucial role in accelerating the transition of...


  • San Ramon, United States LaSalle Network Full time

    LaSalle Network has partnered with a well-established software provider that's based in San Ramon, CA, who's in need of a well-rounded, Site Reliability Engineer (SRE) - Grafana Observability - with a strong background in Grafana and related tools such as Prometheus and Telegraf. The ideal candidate will play a crucial role in accelerating the transition of...


  • San Diego, United States ObjectWin Technology Full time

    Job Title: Site Reliability Engineer Location: San Diego, CA or Remote in CA Duration: 6 Months Description: It is an exciting time to be part of SIE's CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make PlayStation highly reliable,...


  • San Diego, California, United States PEAK Technical Staffing USA Full time

    Hiring Senior Site Reliability Engineer; primary responsibilities will include contributing to the implementation and delivery of the end-to-end automation platform, to support continuous integration and continuous delivery (CI/CD), with a focus on developer self-service capabilities.NOTE:Must have build out experience with Kubernetes. This position requires...


  • San Diego, United States ObjectWin Technology Full time

    Job Title: Site Reliability Engineer Location: San Diego, CA or Remote in CA Duration: 6 Months Description: It is an exciting time to be part of SIE’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make PlayStation highly...


  • San Diego, United States ObjectWin Technology Full time

    Job Title: Site Reliability Engineer Location: San Diego, CA or Remote in CA Duration: 6 Months Description: It is an exciting time to be part of SIE’s CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make PlayStation highly...


  • San Diego, United States ObjectWin Technology Full time

    Job Title: Site Reliability Engineer Location: San Diego, CA or Remote in CA Duration: 6 Months Description: It is an exciting time to be part of SIE's CICD and Cloud Site Reliability Engineering (SRE) team. SREs operate right at the intersection of Software Engineering and Infrastructure Engineering. The SRE team strives to make PlayStation highly reliable,...


  • San Jose, United States IBM Full time

    ENGINEERING Site Reliability Engineer, IBM Corporation, San Jose, CA (Up to 40% telecommuting permitted): Work with development teams to enable a continuous integration environment that sustains high productivity levels and emphasizes defect prevention techniques. Manage delivery pipeline....


  • San Mateo, California, United States eTek IT Full time

    Position : Site Reliability EngineerLocation : San Mateo, CARequired Skills Must Haves: 3 to 5 years exp. Kubernetes, DataDog, cloud services, large scale systems, AWS&GCP, minor Azure GKE, home strung clusters on prem, and AKS (Very Small), EKS Consistent upgrades across all the clusters and clouds Nice to Have: Gaming experience bonusAdditional SkillsJob...


  • San Francisco, United States Apollo Solutions Full time

    Principal Site Reliability Engineer Apollo Solutions have partnered with a groundbreaking Fintech start-up backed by top tier venture capital. They are looking to significantly disrupt how we view, store and invest our personal finance and have already made significant waves in the industry. The Principal Site Reliability Engineer will be working closely...


  • San Diego, United States TalentBurst Full time

    SENIOR SITE RELIABILITY ENGINEERLocation: San Diego, CA 92127 - 100% onsite (San Diego site preferred, open to other sites located in San Francisco 94107, San Mateo 94404, Los Angeles 90045 or Aliso Viejo 92656)Duration: 6 months **W2 Acceptable It is an exciting time to be part of Continuous Integration/Continuous Deployment (CI/CD) and Cloud Site...


  • San Diego, United States TalentBurst Full time

    SENIOR SITE RELIABILITY ENGINEER Location: San Diego, CA 92127 - 100% onsite (San Diego site preferred, open to other sites located in San Francisco 94107, San Mateo 94404, Los Angeles 90045 or Aliso Viejo 92656) Duration: 6 months **W2 Acceptable It is an exciting time to be part of Continuous Integration/Continuous Deployment (CI/CD) and Cloud Site...


  • San Francisco, United States Apollo Solutions Full time

    Principal Site Reliability Engineer Apollo Solutions have partnered with a groundbreaking Fintech start-up backed by top tier venture capital. They are looking to significantly disrupt how we view, store and invest our personal finance and have already made significant waves in the industry. The Principal Site Reliability Engineer will be working closely...


  • San Francisco, California, United States Apollo Solutions Full time

    Principal Site Reliability Engineer Apollo Solutions have partnered with a groundbreaking Fintech start-up backed by top tier venture capital. They are looking to significantly disrupt how we view, store and invest our personal finance and have already made significant waves in the industry. The Principal Site Reliability Engineer will be working closely...