Lead DevOps Engineer

4 weeks ago


Bellevue, United States JobRialto Full time
Job Overview:

We are seeking a highly skilled Telemetry Engineer to join our dynamic team. The ideal candidate will have expert knowledge of Prometheus, Grafana, and Git. This role involves developing and managing telemetry for large-scale datasets and implementing strategies to reduce Mean Time to Resolution (MTTR).

Must Have:

Prometheus Proficiency: Develop, configure, and maintain monitoring solutions using Prometheus. Must have in-depth knowledge of Prometheus metrics, alerts, and query language.

Grafana: Design and implement dashboards in Grafana for real-time data visualization and monitoring. Customize and extend Grafana as per the project requirements.

Telemetry Skills: Create scalable telemetry solutions using Prometheus and Grafana to monitor and analyze large-scale datasets.

Reducing MTTR: Experience in developing telemetry solutions focused on reducing Mean Time to Resolution, enhancing system reliability and performance.

Git: Strong understanding of Git.

Good to have:

Thanos: Proficient knowledge of Thanos components, with demonstrated hands-on experience in configuring, deploying, and managing multiple Thanos components.

Python: Practical proficiency in Python scripting is a key requirement.

Qualifications:

- Bachelor's degree in Computer Science, Information Technology, or a related field.

- Proven experience as a Telemetry Observability Engineer or similar role.

- Extensive knowledge of Prometheus and Grafana.

- Strong understanding of telemetry and observability principles.

- Excellent analytical and problem-solving skills.

- Strong communication and teamwork abilities.

- Experience with Splunk and Kubernetes is an added advantage.

Comments: SO is for TMO. City: Bellevue. State: WA. Mandatory skills: Kubernetes, Prometheus, Grafana, Thanos. Kindly refer to the SO details in Edge for the detailed JD.

Expectations from this role:

1. Interprets the DevOps Tool/feature/component design to develop/support the same in accordance with specifications

2. Adapts existing DevOps solutions and creates own DevOps solutions for new contexts

3. Codes, debugs, tests, documents and communicates DevOps development stages and the status of DevOps development/support issues

4. Selects appropriate technical options for development, such as reusing, improving or reconfiguring existing components

5. Optimises efficiency, cost and quality of DevOps process, tools and technology development

6. Validates results with user representatives; integrates and commissions the overall solution

7. Helps Engineers troubleshoot issues that are novel/complex and are not covered by SOPs

8. Designs, installs, configures and troubleshoots CI/CD pipelines and software

9. Automates infrastructure provisioning on cloud/on premises with the guidance of architects

10. Provides guidance to DevOps Engineers so that they can support existing components

11. Works with diverse teams using Agile methodologies

12. Facilitates saving measures through automation

13. Mentors A1 and A2 resources

14. Participates in the team's code reviews

Typical performance measures:

1. Quality of deliverables

2. Error rate/completion rate at various stages of SDLC/PDLC

3. Number of components reused

4. Number of domain/technology/product certifications obtained

5. SLA for onboarding and supporting users and tickets

Performance Areas:

Automated components

Deliver components that automate parts of the installation/configuration of software/tools on premises and on cloud

Deliver components that automate parts of the build/deploy for applications

Configured components

Configure a CI/CD pipeline that can be used by application development/support teams

Scripts

Develop/Support scripts (like PowerShell/Shell/Python scripts) that automate installation/configuration/build/deployment tasks

Onboard users

Onboard and extend existing tools to new app dev/support teams

Mentoring

Mentor and provide guidance to peers

Education: Bachelor's Degree
