Site Reliability Engineer NOT open for C2C or sponsorship

3 weeks ago


Seattle, United States Vaco Full time

Job Summary
In this role as an Associate Site Reliability Engineer, you will be an integral member of a dynamic SRE/DevOps team that continuously improves our AWS cloud deployment platform with an "automation first" approach.
Responsibilities
  • Drive team initiatives to continuously refine AWS deployment practices for improved reliability, repeatability and security.
  • Work closely with the development teams to automate deployment and configuration of infrastructure.
  • Design effective monitoring and alerting (for conditions such as application errors and high memory usage) and log aggregation approaches (to quickly access logs for troubleshooting or to generate reports for trend analysis). Work closely with business stakeholders to proactively notify them of issues and communicate metrics.
  • Write code and scripts to automate provisioning and configuration of AWS services, using tools and languages including AWS CLI / API, Terraform, Ansible, Python, and Bash.
  • Configure build pipelines to support automated testing and deployments using tools including Jenkins, CircleCI, and GitHub Actions.
  • Help refine, implement, and verify DevSecOps security practices (including regular security patching, minimum-permissions accounts and policies, and encrypt-everything) in compliance with Health IT, government, and other standards and regulations, using tools like the AWS security stack (GuardDuty, Systems Manager, Config), Veracode, and SonarQube to analyze and verify compliance.
  • Document and diagram deployment-specific aspects of architectures and environments, working closely with Software Engineers, Software Engineers in Test, and others in DevOps.
  • Troubleshoot issues in production and other environments, applying debugging and problem-solving techniques (e.g., log analysis, non-invasive tests), working closely with development and product teams.
Qualifications
  • 2+ years of cloud administration experience (AWS, Azure, GCP) OR 2+ years of software engineering experience in a modern, high-level language (Ruby, Java, Python, etc.)
  • Strong experience developing and / or deploying Docker containers on Kubernetes (Helm, Kustomize, etc.)
  • Working knowledge of IaC / configuration management tools such as Terraform, Ansible or Puppet.
  • Recent experience with setup, configuration and monitoring of RDBMS and NoSQL datastores
  • A strong understanding of Linux administration including Bash scripting
  • Experience in automation using Go or Python
  • Experience with log aggregation tools such as Datadog, ELK or Splunk
Preferred Qualifications
  • Bachelor's degree in science, technology, engineering or similar field is desired.
  • Experience in HIPAA/SOC 2 environments

