Lead DevOps Engineer
4 weeks ago
We are seeking a highly skilled Telemetry Engineer to join our dynamic team. The ideal candidate will have expert knowledge in Prometheus, Grafana and Git. This role involves developing and managing telemetry for large-scale datasets and implementing strategies to reduce Mean Time to Resolution (MTTR).
Must Have:
Prometheus Proficiency: Develop, configure, and maintain monitoring solutions using Prometheus. Must have in-depth knowledge of Prometheus metrics, alerts, and query language.
Grafana: Design and implement dashboards in Grafana for real-time data visualization and monitoring. Customize and extend Grafana as per the project requirements.
Telemetry Skills: Create scalable telemetry solutions using Prometheus and Grafana to monitor and analyze large-scale datasets.
Reducing MTTR: Experience in developing telemetry solutions focused on reducing Mean Time to Resolution, enhancing system reliability and performance.
Git: Strong understanding in Git.
Good to have:
Thanos: Proficient knowledge of Thanos components with demonstrated hands-on experience in configuring, deploying, and managing mutliple Thanos components.
Python: Practical proficiency in Python scripting is a key requirement.
Qualifications:
- Bachelor's degree in computer science, Information Technology, or related field.
- Proven experience as a Telemetry Observability Engineer or similar role.
- Extensive knowledge of Prometheus and Grafana.
- Strong understanding of telemetry and observability principles.
- Excellent analytical and problem-solving skills.
- Strong communication and teamwork abilities.
- Experience with Splunk and Kubernetes is an added advantage.
Job Overview: We are seeking a highly skilled Telemetry Engineer to join our dynamic team.
The ideal candidate will have expert knowledge in Prometheus, Grafana and Git.
This role involves developing and managing telemetry for large-scale datasets and implementing strategies to reduce Mean Time to Resolution (MTTR).
Key skills:
Must Have:
Prometheus Proficiency: Develop, configure, and maintain monitoring solutions using Prometheus.
Must have in-depth knowledge of Prometheus metrics, alerts, and query language.
Grafana: Design and implement dashboards in Grafana for real-time data visualization and monitoring.
Customize and extend Grafana as per the project requirements.
Telemetry Skills: Create scalable telemetry solutions using Prometheus and Grafana to monitor and analyze large-scale datasets.
Reducing MTTR: Experience in developing telemetry solutions focused on reducing Mean Time to Resolution, enhancing system reliability and performance. Git: Strong understanding in Git.
Good to have:
Thanos: Proficient knowledge of Thanos components with demonstrated hands-on experience in configuring, deploying, and managing mutliple Thanos components. Python: Practical proficiency in Python scripting is a key requirement. Qualifications: - Bachelor's degree in computer science, Information Technology, or related field. - Proven experience as a Telemetry Observability Engineer or similar role. - Extensive knowledge of Prometheus and Grafana. - Strong understanding of telemetry and observability principles. - Excellent analytical and problem-solving skills. - Strong communication and teamwork abilities. - Experience with Splunk and Kubernetes is an added advantage.
Comments: SO is for TMO City Bellevue State WA Mandatory Skills Kubernetes, Prometheus, Grafana, thanos Kindly refer to SO details in Edge for detailed JD
Expectations from this role:
1. Interprets the DevOps Tool/feature/component design to develop/support the same in accordance with specifications
2. Adapts existing DevOps solutions and creates own DevOps solutions for new contexts
3. Codes, debugs, tests, documents and communicates DevOps development stages/status of DevOps develop/support issues
4. Select appropriate technical options for development such as reusing, improving or reconfiguration of existing components
5. Optimises efficiency, cost and quality of DevOps process, tools and technology development
6. Validates results with user representatives; integrates and commissions the overall solution
7. Helps Engineers troubleshoot issues that are novel/complex and are not covered by SOPs
8. Design, install, configure, troubleshoot CI/CD pipelines and software
9. Able to automate infrastructure provisioning on cloud/in-premises with the guidance of architects
10. Provides guidance to DevOps Engineers so that they can support existing components
11. Work with diverse teams with Agile methodologies
12. Facilitate saving measures through automation
13. Mentors A1 and A2 resources
14. Involved in the Code Review of the team
Typical performance measures: Typical performance measures:
1. Quality of deliverables
2. Error rate/completion rate at various stages of SDLC/PDLC
3. # of components/reused
4. # of domain/technology certification/ product certification obtained
5. SLA for onboarding and supporting users and ticketsPerformance Areas:
Automated components
Deliver components that automat parts to install components/configure of software/tools in on premises and on cloud
Deliver components that automate parts of the build/deploy for applications
Configured components
Configure a CI/CD pipeline that can be used by application development/support teams
Scripts
Develop/Support scripts (like Powershell/Shell/Python scripts) that automate installation/configuration/build/deployment tasks
Onboard users
Onboard and extend existing tools to new app dev/support teams
Mentoring
Mentor and provide guidance to peers
Education: Bachelors Degree
Additional client information:
-
IT|DevOps Engineering
22 hours ago
Bellevue, United States First Tek Full timeKubernetes container platform . Manage cluster life cycle by deploying updates and OS patches in K8s clusters . Build automation using GitOps . Total 7 to 8 years' experience in Onprem systems and DevOps . 4 to 5 Years' experience in managing Kubernetes clusters in enterprise level . Certified Kubernetes Administrator ( CKA certification ) is Mandatory...
-
DevOps Engineer
22 hours ago
Bellevue, United States Intelliswift Software Inc Full timeTop 3 must-have - - Linux systems administration - Deployment automation ( e.g. Jenkins, ansible) - Production support in customer-facing online service Qualifications Masters/bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or related technical field, and two years of experience in software/systems or related. 5+...
-
DevOps Engineer
2 weeks ago
Bellevue, United States Intelliswift Software Full timeTop 3 must-have -- Linux systems administration - Deployment automation ( e.g. Jenkins, ansible) - Production support in customer-facing online serviceQualifications • Master’s/bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or related technical field, and two years of experience in software/systems or related. • 5+...
-
DevOps Engineer
2 weeks ago
Bellevue, United States Intelliswift Software Full timeTop 3 must-have -- Linux systems administration - Deployment automation ( e.g. Jenkins, ansible) - Production support in customer-facing online serviceQualifications • Master’s/bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or related technical field, and two years of experience in software/systems or related. • 5+...
-
DevOps Engineer
2 weeks ago
Bellevue, United States Intelliswift Software Full timeTop 3 must-have -- Linux systems administration - Deployment automation ( e.g. Jenkins, ansible) - Production support in customer-facing online serviceQualifications • Master’s/bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, or related technical field, and two years of experience in software/systems or related. • 5+...
-
DevOps Engineer-Elastic Search Exp
2 weeks ago
Bellevue, United States Zortech Solutions Full timeJob DescriptionJob DescriptionRole: DevOps Engineer-Elastic Search ExpLocation: Bellevue, WA-OnsiteDuration: FulltimeJob Description:Required Experience: Minimum 5 years of direct DevOps, Linux Admin and database experience.Must have Elk/ Elastic Search working experienceResponsibilities:Manage VMs across multiple datacenters and AWS to support dev/test and...
-
DevOps Engineer-Elastic Search/ US- Fulltime
5 days ago
Bellevue, United States Zortech Solutions Full timeJob DescriptionJob DescriptionRole: DevOps Engineer-Elastic Search Exp Location: Bellevue, WA-Onsite Duration: Fulltime Job Description: Required Experience: Minimum 5 years of direct DevOps, Linux Admin and database experience. Must have Elk/ Elastic Search working experience Responsibilities: Manage VMs across multiple datacenters and AWS to support...
-
Site Reliability Engineer
1 week ago
Bellevue, United States Tata Consultancy Services Full timeRoles & Responsibilities: Proven experience as a Site Reliability Engineer or DevOps Engineer focusing on Azure technologies. Strong proficiency in Azure services including Azure DevOps, Azure App Services, Azure Functions, Azure SQL, etc. Proficiency in scripting languages such as PowerShell, Python, or Bash. Excellent problem-solving skills and ability...
-
Site Reliability Engineer
1 week ago
Bellevue, United States Tata Consultancy Services Full timeRoles & Responsibilities:Proven experience as a Site Reliability Engineer or DevOps Engineer focusing on Azure technologies. Strong proficiency in Azure services including Azure DevOps, Azure App Services, Azure Functions, Azure SQL, etc. Proficiency in scripting languages such as PowerShell, Python, or Bash. Excellent problem-solving skills and ability to...
-
Site Reliability Engineer
7 days ago
Bellevue, United States Tata Consultancy Services Full timeRoles & Responsibilities:Proven experience as a Site Reliability Engineer or DevOps Engineer focusing on Azure technologies. Strong proficiency in Azure services including Azure DevOps, Azure App Services, Azure Functions, Azure SQL, etc. Proficiency in scripting languages such as PowerShell, Python, or Bash. Excellent problem-solving skills and ability to...
-
Site Reliability Engineer
1 week ago
Bellevue, United States Tata Consultancy Services Full timeRoles & Responsibilities:Proven experience as a Site Reliability Engineer or DevOps Engineer focusing on Azure technologies. Strong proficiency in Azure services including Azure DevOps, Azure App Services, Azure Functions, Azure SQL, etc. Proficiency in scripting languages such as PowerShell, Python, or Bash. Excellent problem-solving skills and ability to...
-
SDET(Automation) Lead
7 days ago
Bellevue, United States Diverse Lynx Full timeRole name: SDET Lead San Jose, CA Contract Role Description: ? Develop test data and environment specifications. ? Conduct manual and/or automated test procedures. ? Precisely detect and relay defects and system improvements to development teams through defect tracking systems. ? Work closely with developers, product managers, and QA Leads. ? Maintain...
-
Lead Ultrasound Engineer
4 weeks ago
Bellevue, United States United Imaging North America Full timeJob DescriptionJob DescriptionDescription:We are seeking ambitious Ultrasound Engineers to contribute to the design, development, and optimization of imaging modes and features for our ultrasound imaging platforms. This position could be opened at multiple levels including Principal, Lead, and Senior depending upon years of experience in relation to posted...
-
Lead ML Engineer
5 days ago
Bellevue, United States Flexton Inc. Full timeFlexton Inc., Established in 2007, headquarter is in San Jose, CA with development centers in India at multiple locations. We are a leading professional services company offering a unique product mix that extends into Technology, Consulting, Digital and Operations. Flexton has been recognized multiple times by Inc 5000 as the Fastest Growing Company.Lead ML...
-
Lead ML Engineer
5 days ago
Bellevue, United States Flexton Inc. Full timeFlexton Inc., Established in 2007, headquarter is in San Jose, CA with development centers in India at multiple locations. We are a leading professional services company offering a unique product mix that extends into Technology, Consulting, Digital and Operations. Flexton has been recognized multiple times by Inc 5000 as the Fastest Growing Company.Lead ML...
-
Lead ML Engineer
5 days ago
Bellevue, United States Flexton Inc. Full timeFlexton Inc., Established in 2007, headquarter is in San Jose, CA with development centers in India at multiple locations. We are a leading professional services company offering a unique product mix that extends into Technology, Consulting, Digital and Operations. Flexton has been recognized multiple times by Inc 5000 as the Fastest Growing Company.Lead ML...
-
Azure Data Engineer
4 weeks ago
Bellevue, United States Infosys Full timeJob Description :Infosys is seeking an Azure Data Engineer. In this role, you will work directly with the clients to lead critical modules in Data Engineering program which will include metadata management, data catalog, data quality, process/workflow definition implementation, Reporting, Delivery, etc. You will be part of an entrepreneurship and learning...
-
Azure Data Engineer
4 weeks ago
Bellevue, United States Infosys Full timeJob Description :Infosys is seeking an Azure Data Engineer. In this role, you will work directly with the clients to lead critical modules in Data Engineering program which will include metadata management, data catalog, data quality, process/workflow definition implementation, Reporting, Delivery, etc. You will be part of an entrepreneurship and learning...
-
Azure Data Engineer
4 weeks ago
Bellevue, United States Infosys Full timeJob Description :Infosys is seeking an Azure Data Engineer. In this role, you will work directly with the clients to lead critical modules in Data Engineering program which will include metadata management, data catalog, data quality, process/workflow definition implementation, Reporting, Delivery, etc. You will be part of an entrepreneurship and learning...
-
Kubernetes Administrator
2 days ago
Bellevue, United States Software Technology Inc Full timeTitle: Kubernetes Admin Location: Bellevue WA 98004 Duration: Long Term Mandatory Areas Must Have Skills Skill 1 8 Yrs. of Exp Kubernetes container platform Skill 2 8 Yrs. of Exp ,Manage cluster life cycle by deploying updates and OS patches in K8s clusters Skill 3 4 Yrs. of Exp , Total 7 to 8 years' experience in Onprem systems and DevOps Kubernetes...