Senior DevOps Engineer
2 days ago
Required Skills - DevOps, Cloud infrastructure, Kubernetes
Job Duties
- Deliverables Alignment: Develop solutions in line with key deliverables, including metrics collection, dashboards, reliability audits, and runbooks.
- Liaison Role: Act as a primary interface between the development team in Sweden and the US-based customer support team.
- Automation and CI/CD: Build and optimize CI/CD pipelines and scripts to automate generation, testing, deployment, and monitoring of customized builds.
- Observability: Implement and refine monitoring solutions using OpenTelemetry and Grafana for enhanced visibility into system performance.
- Reliability Audits: Conduct reliability audits for existing deployments, document findings, rank issues by criticality, and address concerns through merge requests or escalations.
- Production Support: Provide 24/7 Tier II production support on a rotational basis, handling escalations and minimizing downtime.
- Training and Documentation: Prepare technical training and documentation, including runbooks, playbooks, and onboarding materials for Tier I and Tier II support teams.
- Dashboards and Metrics: Develop Grafana dashboards for approximately 50-70 services, including Kubernetes platform and internal services.
- Issue Resolution: Investigate and resolve issues reported from lower-tier teams, ensuring timely resolution and continuous improvement.
- Game Day Scenarios: Collaborate with teams to plan and execute Game Day scenarios, simulating and preparing for likely system failures.
- Collaboration: Work closely with cross-functional teams to enhance operational efficiency and contribute to system and application improvements.
Job Requirements
- Experience: 8+ years in DevOps, SRE, or similar roles, with a focus on cloud-hosted, microservices-based environments.
- Technologies: Expertise in Kubernetes, AWS EKS, Terraform, ArgoCD, OpenTelemetry, and Grafana.
- DevOps Practices: Strong knowledge of CI/CD, infrastructure-as-code (IaC), and automation frameworks.
- Observability: Proven experience in implementing observability tools and frameworks for metrics collection and system monitoring.
- Incident Management: Background in production support, troubleshooting, and resolving critical system issues.
- Documentation: Strong technical writing skills for creating incident runbooks, playbooks, and support materials.
- On-Call Readiness: Willingness to participate in 24/7 rotational production support, including incident escalation and resolution.
Desired Skills & Experience
- Experience conducting reliability audits and implementing scalable solutions.
- Familiarity with GitOps practices and tools like GitLab.
- Proficiency in building automated remediation for alerts and contributing to infrastructure reliability enhancements.
- Background in supporting SaaS transitions, particularly in customer-facing and revenue-generating environments.
This is a high-priority, proactive requisition.
Background Check : No
Drug Screen : No
-
DevOps Engineer
2 hours ago
Englewood, CO, United States Diverse Lynx Full time
Cloud DevOps Engineer :: Infosys : 116528-1 Must Have Skills: • DevOps • Python • SQL Nice to Have Skills: Detailed Job Description: • Development operations (DevOps) engineers are responsible for the production and ongoing maintenance of a website platform. • They also manage cloud infrastructure and system administration and work with teams to...
-
Amazon Webservices DevOps
3 days ago
Englewood, CO, United States APN Consulting Full time
APN Consulting, Inc. is a progressive IT staffing and services company offering innovative business solutions to improve client business outcomes. We focus on high-impact technology solutions in ServiceNow, Fullstack, Cloud & Data, and AI / ML. Due to our globally expanding service offerings, we are seeking top talent to join our teams and grow with us. We...
-
Amazon Webservices DevOps
2 weeks ago
Englewood, CO, United States APN Consulting Full time
APN Consulting, Inc. is a progressive IT staffing and services company offering innovative business solutions to improve client business outcomes. We focus on high-impact technology solutions in ServiceNow, Fullstack, Cloud & Data, and AI / ML. Due to our globally expanding service offerings, we are seeking top talent to join our teams and grow with us. We...
-
Amazon Webservices DevOps
1 week ago
Englewood, CO, United States APN Consulting Full time
APN Consulting, Inc. is a progressive IT staffing and services company offering innovative business solutions to improve client business outcomes. We focus on high-impact technology solutions in ServiceNow, Fullstack, Cloud & Data, and AI / ML. Due to our globally expanding service offerings, we are seeking top talent to join our teams and grow with us. We...
-
DevOps Engineer
2 weeks ago
Englewood, CO, United States Procyon TS Full time
Job Description Opening / Selling Statement - We are seeking a Mid-Level DevOps Engineer with Site Reliability Engineering (SRE) experience to contribute to the transition of Crew Management Applications to a web-based SaaS model hosted on AWS. The successful candidate will work under the guidance of a Senior DevOps Engineer, supporting critical system...
-
DevOps Engineer
5 days ago
Englewood, CO, United States Procyon TS Full time
Job Description Opening / Selling Statement - We are seeking a Mid-Level DevOps Engineer with Site Reliability Engineering (SRE) experience to contribute to the transition of Crew Management Applications to a web-based SaaS model hosted on AWS. The successful candidate will work under the guidance of a Senior DevOps Engineer, supporting critical system...
-
DevOps Engineer
7 days ago
Englewood, CO, United States Procyon TS Full time
Job Description Opening / Selling Statement - We are seeking a Mid-Level DevOps Engineer with Site Reliability Engineering (SRE) experience to contribute to the transition of Crew Management Applications to a web-based SaaS model hosted on AWS. The successful candidate will work under the guidance of a Senior DevOps Engineer, supporting critical system...
-
DevOps Engineer
1 week ago
Englewood, CO, United States Procyon TS Full time
Job Description Opening / Selling Statement - We are seeking a Mid-Level DevOps Engineer with Site Reliability Engineering (SRE) experience to contribute to the transition of Crew Management Applications to a web-based SaaS model hosted on AWS. The successful candidate will work under the guidance of a Senior DevOps Engineer, supporting critical system...
-
DevOps Engineer
1 week ago
Englewood, CO, United States Procyon TS Full time
Job Description Opening / Selling Statement - We are seeking a Mid-Level DevOps Engineer with Site Reliability Engineering (SRE) experience to contribute to the transition of Crew Management Applications to a web-based SaaS model hosted on AWS. The successful candidate will work under the guidance of a Senior DevOps Engineer, supporting critical system...
-
Englewood, CO, United States Spruce Infotech Full time
POC: Sam Chavez ATTENTION ALL SUPPLIERS!!! READ BEFORE SUBMITTING • UPDATED CONTACT NUMBER and EMAIL ID is a MANDATORY REQUEST from our client for all the submissions • Limited to 1 submission per supplier. Please submit your best. • We prioritize endorsing those with complete and accurate information • Avoid submitting duplicate profiles. We will...