Site Reliability Engineer
3 weeks ago
Job Title: Sr. SRE ( Site Reliability Engineer)
Location: Seattle, WA
Core skills needed -
Azure Clous, AKS – Scalability, monitoring, deployment, check logs, ensure node and pod health.
Databases include - Cassandra, Mongo, PostGres
Data bricks – how to set up data bricks
Databricks Notebooks – There are a lot of jobs on Databricks – experience with Databricks to know how a notebook is created and run - run queries against the database and finding discrepancies and perform fixes.
Based microservices, responsible for deployment, scripting language is python.
Should have an understanding around terraform.
Emphasis on Logs and Monitoring (datadog and splunk)
Summary of Experience
- Requires 10-12 years experience in the IT industry
- Requires 9+ years of software and DevOps development engineering
- Experience in working with cloud environment Azure preferred.
- Experience with Kubernetes, Azure Kubernetes (AKS) preferred.
- Experience with using Kafka, Event Hub, NATS or any messaging broker.
- Experience with Cassandra, PostgresSQL, Mongo, Elastic Search, Cosmos DB
- Experience on Azure DevOps, Jenkins/ Python / Terraform / Ansible
- Experience with Databricks
- Experience with DataDog, Splunk or other logging and APM tools.
- Experience in working with Linux environment.
Summary of Key Responsibilities
• Develop monitoring dashboards
• Configure alerts and automate process for system recovery
• Monitor alerts and take proactive steps to resolve system issues
• Troubleshoot production issues
• Lead production troubleshooting calls
• Responsible for patches and updates on production systems.
• Design and build cutting-edge, multi-micro service solutions to support Starbucks’s growth worldwide.
• Helping CI/CD team during rolling out application and infrastructure globally.
• Participates in a production support rotation that includes pager responsibilities.
• Ability to accurately break down complex application designs into component deliverables and estimate design and development timelines
-
Site Reliability Engineer
2 weeks ago
Seattle, Washington, United States Sogeti Full timeJob Title: Site Reliability EngineerAbout the Role:We are seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure using Azure or...
-
Site Reliability Engineer
2 weeks ago
Seattle, Washington, United States HireIO Inc Full timeJob Title: Site Reliability EngineerHireIO Inc is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our distributed systems.Key Responsibilities:Design and implement scalable and reliable systemsCollaborate with cross-functional...
-
Site Reliability Engineer
1 week ago
Seattle, Washington, United States Oracle Full timeAbout the Role:Oracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, develop, and deploy software to improve the availability, scalability, and efficiency of...
-
Site Reliability Engineer
2 weeks ago
Seattle, Washington, United States Sogeti Full timeSite Reliability Engineer **Job Summary** We are seeking an experienced Site Reliability Engineer to join our team. As a key member of our operations team, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure. **Key Responsibilities** * Design, implement, and maintain scalable and reliable cloud...
-
Site Reliability Engineer
3 weeks ago
Seattle, Washington, United States Apple Full timeJob Title: Site Reliability EngineerAt Apple, we're looking for a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure.About the RoleWe are seeking a talented and motivated individual to join our dynamic...
-
Site Reliability Engineer
1 week ago
Seattle, Washington, United States Oracle Full timeAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Oracle. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure. You will work closely with our development teams to design, implement, and operate large-scale distributed...
-
Site Reliability Engineer
4 weeks ago
Seattle, Washington, United States Apple Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled and motivated Site Reliability Engineer to join our dynamic and growing team at Apple.About the RoleAs a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure.Key ResponsibilitiesDesign, implement,...
-
Site Reliability Engineer
4 weeks ago
Seattle, Washington, United States Tik Tok Full timeAbout TikTok U.S. Data SecurityTikTok U.S. Data Security is a subsidiary of TikTok in the U.S., dedicated to protecting user data and ensuring the security of our platform.ResponsibilitiesWe are seeking a highly motivated and experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the...
-
Site Reliability Engineer
3 weeks ago
Seattle, Washington, United States Apple Full timeJob Title: Site Reliability EngineerAt Apple, we're looking for a skilled Site Reliability Engineer to join our Object Storage SRE team. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability, scalability, and performance of our cloud storage systems.About the RoleWe're seeking a seasoned software and systems engineer with a...
-
Site Reliability Engineer
3 weeks ago
Seattle, Washington, United States Sogeti Full timeSite Reliability EngineerWe are seeking an experienced Site Reliability Engineer to join our team at Sogeti. As a key member of our operations team, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure.Key Responsibilities:Design and implement scalable and reliable cloud infrastructure using Azure or...
-
Site Reliability Engineer
3 weeks ago
Seattle, Washington, United States Nerdshub E Pvt Ltd Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Nerdshub E Pvt Ltd. As a Site Reliability Engineer, you will be responsible for ensuring the health and stability of our production systems, developing monitoring dashboards, and configuring alerts to automate system recovery.Key...
-
Site Reliability Engineer
1 week ago
Seattle, Washington, United States HireIO Inc Full timeJob SummaryAt HireIO Inc, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and reliability of our Ads systems. This includes designing, analyzing, and troubleshooting large-scale distributed systems, as well as developing tools and...
-
Site Reliability Engineer
3 weeks ago
Seattle, Washington, United States Capgemini Full timeJob Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our software systems and infrastructure.Key Responsibilities:Develop, maintain, and configure cloud observability systems (e.g.,...
-
Senior Site Reliability Engineer
1 week ago
Seattle, Washington, United States Diverse Lynx Full timeJob Title: Sr. Site Reliability EngineerLocation: RemoteDuration: 12+ Months contractJob Description:We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the availability, reliability, and performance of our applications and services.You will work...
-
Senior Site Reliability Engineer
3 weeks ago
Seattle, Washington, United States SingleStore Full timeSenior Site Reliability EngineerAt SingleStore, we're seeking a seasoned Senior Site Reliability Engineer to drive our Kubernetes product strategy and help shape the future of our managed service.Key ResponsibilitiesDesign and build elastic Kubernetes clusters across on-prem, AWS, Azure, and Google Cloud environments.Develop and maintain production container...
-
Site Reliability Engineering Manager
3 weeks ago
Seattle, Washington, United States Apple Full timeSite Reliability Engineering ManagerAt Apple, we're looking for a skilled Site Reliability Engineering Manager to join our team. As a Site Reliability Engineering Manager, you will be responsible for leading a team that provides the platform for mission-critical cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and...
-
Site Reliability Engineer
2 weeks ago
Seattle, Washington, United States Tik Tok Full timeAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Data Platform Team at TikTok. As a key member of our team, you will be responsible for designing, building, and operating large-scale, massively distributed services and infrastructures.Key ResponsibilitiesDesign and implement reliable, scalable, and robust big data systems...
-
Site Reliability Engineer
1 month ago
Seattle, Washington, United States Tik Tok Full timeAbout the RoleThis is a Site Reliability Engineer position, focusing on the data pipeline reliability for the Video Platform team in USDS.Data SREs monitor data and keep production batch and real-time processing jobs up and running with the highest level of availability, ensuring our users have the freshest, complete, and correct data...
-
Site Reliability Engineer III
1 month ago
Seattle, Washington, United States F5 Networks Full timeAbout the RoleF5 Networks is seeking a highly skilled Site Reliability Engineer III to join our team. As a Site Reliability Engineer III, you will be responsible for ensuring the reliability, availability, and scalability of our critical systems and SaaS platforms.Key ResponsibilitiesApply modern engineering principles and practices to operational functions...
-
Site Reliability Engineer
2 weeks ago
Seattle, Washington, United States Apple Full timeJob SummaryApple is seeking a highly skilled and motivated Security Site Reliability Engineer (SRE) to join our dynamic and growing team.Key ResponsibilitiesEnsure the security, reliability, and scalability of our systems and infrastructure.Collaborate with cross-functional teams to design, implement, and maintain security measures, incident response...