Site Reliability Engineer NOT open for C2C or sponsorship
3 weeks ago
Job Summary
In this role as an Associate Site Reliability Engineer, you will be an integral member of a dynamic SRE/DevOps team continuously improving our AWS cloud deployment platform, "automation first".
Responsibilities
- Drive team initiatives to continuously refine AWS deployment practices for improved reliability, repeatability and security.
- Work closely with the development teams to automate deployment and configuration of infrastructure.
- Design effective monitoring / alerting (for conditions such as application-errors, high memory usage) and log aggregation approaches (to quickly access logs for troubleshooting, or generate reports for trend analysis) to proactively notify business stakeholders of issues and communicate metrics, working closely with these stakeholders
- Write code and scripts to automate provisioning of AWS services and to configure services, using tools and languages including AWS CLI / API, Terraform, Ansible, Python, Bash
- Configure build pipelines to support automated testing and deployments using tools including Jenkins, CircleCI, GitHub Actions
- Help refine DevSecOps security practices (including regular security patching, minimum-permissions accounts and policies, encrypt-everything) in compliance with Health IT, government and other standards regulations, implement, and verify them, using tools like the AWS security stack (GuardDuty, Systems Manager, Config),, VeraCode, SonarQube, etc. to analyze and verify compliance.
- Document and diagram deployment-specific aspects of architectures and environments, working closely with Software Engineers, Software Engineers in Test, and others in DevOps.
- Troubleshoot issues in production and other environments, applying debugging and problem-solving techniques (e.g., log analysis, non-invasive tests) , working closely with development and product teams.
- 2+ years Cloud administration experience (AWS, Azure, GCP) OR 2+ years software engineering experience in a modern, high-level language (Ruby, Java, Python, etc.)
- Strong experience developing and / or deploying Docker Containers on Kubernetes (Helm, Kustomize, etc)
- Working knowledge of IAC / configuration management tools such as Terraform, Ansible or Puppet.
- Recent experience with setup, configuration and monitoring of RDBMS and NoSQL datastores
- A strong understanding of Linux administration including Bash scripting
- Experience in automation using Go or Python
- Experience with log aggregation tools such as Datadog, ELK, Splunk
- Bachelor's degree in science, technology, engineering or similar field is desired.
- Experience in HIPAA/SOC 2 environments
Job Summary
In this role as an Associate Site Reliability Engineer, you will be an integral member of a dynamic SRE/DevOps team continuously improving our AWS cloud deployment platform, "automation first".
Responsibilities
Drive team initiatives to continuously refine AWS deployment practices for improved reliability, repeatability and security.
Work closely with the development teams to automate deployment and configuration of infrastructure.
Design effective monitoring / alerting (for conditions such as application-errors, high memory usage) and log aggregation approaches (to quickly access logs for troubleshooting, or generate reports for trend analysis) to proactively notify business stakeholders of issues and communicate metrics, working closely with these stakeholders
Write code and scripts to automate provisioning of AWS services and to configure services, using tools and languages including AWS CLI / API, Terraform, Ansible, Python, Bash
Configure build pipelines to support automated testing and deployments using tools including Jenkins, CircleCI, GitHub Actions
Help refine DevSecOps security practices (including regular security patching, minimum-permissions accounts and policies, encrypt-everything) in compliance with Health IT, government and other standards regulations, implement, and verify them, using tools like the AWS security stack (GuardDuty, Systems Manager, Config),, VeraCode, SonarQube, etc. to analyze and verify compliance.
Document and diagram deployment-specific aspects of architectures and environments, working closely with Software Engineers, Software Engineers in Test, and others in DevOps.
Troubleshoot issues in production and other environments, applying debugging and problem-solving techniques (e.g., log analysis, non-invasive tests) , working closely with development and product teams.
Qualifications
2+ years Cloud administration experience (AWS, Azure, GCP) OR 2+ years software engineering experience in a modern, high-level language (Ruby, Java, Python, etc.)
Strong experience developing and / or deploying Docker Containers on Kubernetes (Helm, Kustomize, etc)
Working knowledge of IAC / configuration management tools such as Terraform, Ansible or Puppet.
Recent experience with setup, configuration and monitoring of RDBMS and NoSQL datastores
A strong understanding of Linux administration including Bash scripting
Experience in automation using Go or Python
Experience with log aggregation tools such as Datadog, ELK, Splunk
Preferred Qualifications
Bachelor's degree in science, technology, engineering or similar field is desired.
Experience in HIPAA/SOC 2 environments
-
Seattle, United States Vaco Full timeJob Summary In this role as an Associate Site Reliability Engineer, you will be an integral member of a dynamic SRE/DevOps team continuously improving our AWS cloud deployment platform, "automation first". ResponsibilitiesDrive team initiatives to continuously refine AWS deployment practices for improved reliability, repeatability and security.Work closely...
-
seattle, United States Vaco Full timeJob Summary In this role as an Associate Site Reliability Engineer, you will be an integral member of a dynamic SRE/DevOps team continuously improving our AWS cloud deployment platform, "automation first". ResponsibilitiesDrive team initiatives to continuously refine AWS deployment practices for improved reliability, repeatability and security.Work closely...
-
Site Reliability Engineer III
17 hours ago
Seattle, United States F5 Networks Full timeAt F5, we strive to bring a better digital world to life. Our teams empower organizations across the globe to create, secure, and run applications that enhance how we experience our evolving digital world. We are passionate about cybersecurity, from protecting consumers from fraud to enabling companies to focus on innovation. Everything we do centers around...
-
Site Reliability Engineering Lead
23 hours ago
Seattle, United States DAT Solutions Full timeAbout DAT DATis an award-winning employer of choice and a next-generation SaaS technology company that has been at the leading edge of innovation in transportation supply chain logistics for 45 years. We continue to transform the industry year over year, by deploying a suite of software solutions to millions of customers every day - customers who depend on...
-
Site Reliability Engineer
1 day ago
Seattle, United States UKG (Ultimate Kronos Group) Full timeAbout the Team: Site Reliability Engineers at UKG are critical team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning,...
-
Senior Site Reliability Engineer
1 day ago
Seattle, United States Saxon Global Full timeStarbucks Senior Site Reliability Engineer (Cloud) 8-month contract (Likely extension to 18 month with strong performance) Hybrid - (Must be local to the Seattle area, onsite at Starbucks headquarters 3 days a week with 2 days remote) Job Summary and Mission This position contributes to Starbucks on their Data Platform Services team. This team maintains and...
-
Site Reliability Engineer
3 days ago
Seattle, United States CGL Consulting Co., Ltd Full timeSite Reliability Engineer (SRE)Responsibilities:1. Manage and operate cloud infrastructure across AWS or Azure and Kubernetes environments, ensuring optimal performance and resource allocation.2. Develop and maintain automated systems to monitor, build, and scale environments, ensuring stable and reliable operations.3. Perform capacity planning and implement...
-
Senior Site Reliability Engineer
19 hours ago
Seattle, United States UKG (Ultimate Kronos Group) Full timeAbout the Team: Senior Site Reliability Engineers at UKG are team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning,...
-
Lead Site Reliability Engineer
4 months ago
Seattle, United States Capgemini Full timeLeadSite Reliability Engineer Seattle,WA FTE/Direct hiring with benefits NoRemote - Onsite and Hybrid position fromWA location only Qualification& Skills 8+ years ofexperience in Site Reliability Engineering or related field Develop,maintain and configure cloud observability systems (e.g., Datadog, Splunk,OpenTelemetry, APM, etc.). Buildflexible...
-
Seattle, United States Coupang Full timePrincipal Engineer, Site Reliability EngineeringAt Coupang we are building the future of eCommerce. Born out of an obsession to make shopping, eating, and living easier than ever, we’re collectively disrupting the multi-billion-dollar e-commerce industry from the ground up. We exist to wow our customers. We know we’re doing the right thing when we hear...
-
Senior Site Reliability Engineer
6 months ago
Seattle, United States SingleStore Full timePosition Overview MemSQL is seeking a Senior Site Reliability Engineer to help drive our Kubernetes product strategy surrounding our managed service. You will be at the forefront; crafting the design, building out the collaborated vision, and sustaining your envisioned product strategy. This role will be an integral part of building our managed service...
-
Site Reliability Engineer
19 hours ago
Seattle, United States Sogeti Full timeSite Reliability Engineer (SRE) Direct hiring - FTE w/benefits Seattle, WA / work business PST hours. The SRE will be responsible for ensuring the reliability, scalability, and performance of our software systems and infrastructure. The ideal candidate will have a strong background in systems administration, software engineering, and a deep understanding...
-
seattle, United States Coupang Full timePrincipal Engineer, Site Reliability EngineeringAt Coupang we are building the future of eCommerce. Born out of an obsession to make shopping, eating, and living easier than ever, we’re collectively disrupting the multi-billion-dollar e-commerce industry from the ground up. We exist to wow our customers. We know we’re doing the right thing when we hear...
-
Senior Site Reliability Engineer
23 hours ago
Seattle, United States Tik Tok Full timeResponsibilities TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. TikTok has global offices including Los Angeles, New York, London, Paris, Berlin, Dubai, Singapore, Jakarta, Seoul and Tokyo. Why Join Us Creation is the core of TikTok's purpose. Our platform is built to help imaginations...
-
Staff Site Reliability Engineer
19 hours ago
Seattle, United States Zscaler Full timeAbout Zscaler Serving thousands of enterprise customers around the world including 40% of Fortune 500 companies, Zscaler (NASDAQ: ZS) was founded in 2007 with a mission to make the cloud a safe place to do business and a more enjoyable experience for enterprise users. As the operator of the world's largest security cloud, Zscaler accelerates digital...
-
seattle, United States CGL Consulting Co., Ltd Full timeSite Reliability Engineer (SRE)Responsibilities:1. Manage and operate cloud infrastructure across AWS or Azure and Kubernetes environments, ensuring optimal performance and resource allocation.2. Develop and maintain automated systems to monitor, build, and scale environments, ensuring stable and reliable operations.3. Perform capacity planning and implement...
-
seattle, United States CGL Consulting Co., Ltd Full timeSite Reliability Engineer (SRE)Responsibilities:1. Manage and operate cloud infrastructure across AWS or Azure and Kubernetes environments, ensuring optimal performance and resource allocation.2. Develop and maintain automated systems to monitor, build, and scale environments, ensuring stable and reliable operations.3. Perform capacity planning and implement...
-
Site Reliability Engineer
21 hours ago
Seattle, United States HireIO Inc Full time1. Engage in and improve the whole lifecycle of Ads systems — from system design consulting through to launch reviews, deployment, operation and refinement. 2. Build availability of services deployed across multiple data centers globally. 3. Deliver tools/software to improve the reliability, scalability and operability of services. 4. Measure and monitor...
-
Site Reliability Engineer
21 hours ago
Seattle, United States Tik Tok Full timeResponsibilities About TikTok U.S. Data Security TikTok is the leading destination for short-form mobile video. Our mission is to inspire creativity and bring joy. U.S. Data Security ("USDS") is a subsidiary of TikTok in the U.S. This new, security-first division was created to bring heightened focus and governance to our data protection policies and content...
-
Data Engineer V
1 week ago
Seattle, United States Aditi Consulting Full timeRetail ClientData Engineer V (ONLY W2 - NO SPONSORSHIP)Seattle, WA (ONSITE)12 months ContractPay Range on hourly base: $76/hour to $81/hourRequired Skills• Masters in computer science, mathematics, statistics, economics, or other quantitative fields• Experience with Apache Airflow to design and implement scalable, fault-tolerant data pipelines• Proven...