Site Reliability Engineer/SRE

2 weeks ago


Nevada City, United States Talent Space Full time
Talent Space is looking for a consulting Site Reliability Engineer/SRE for our SaaS client to support a large Production Environment. Role will address Corporate level requirements rather than be focused on a specific Engineering Group/Team.

Job Description:

As a Cloud Infrastructure Engineer, you will play a pivotal role in maintaining, enhancing, and modernizing our highly scalable cloud infrastructure. We value individuals who excel technically and thrive in an innovative and collaborative environment. This role will focus on enhancing the reliability, scalability, and security of our cloud infrastructure.

Necessary Qualifications:

 
  • Proficiency in managing and optimizing cloud environments, with hands-on experience running AWS in production environments.
  • Expertise in implementing Infrastructure as Code using tools such as Ansible, Terraform, and Terragrunt, for the management and provisioning of cloud systems.
  • Proficiency in configuring and managing CI/CD pipelines supporting DevOps and all stages of the software life-cycle.
  • Hands-on experience in Container technologies including Kubernetes, Docker, and Helm.
  • Practical experience with monitoring tools and alerting systems.
  • Knowledge of security best practices, including IAM, encryption protocols, vulnerability assessment, and compliance.
  • Expertise in networking concepts including TCP/IP, routing, DNS, load balancing, and security protocols
  • Strong coding and scripting skills

Responsibilities:

 
  • Ensure the ongoing reliability, performance, and security of existing project infrastructure.
  • Address maintenance needs and implement improvements in collaboration with multiple teams.
  • Proactively monitor, respond to, and resolve infrastructure issues.
  • Participate in an on-call rotation to provide timely support and minimize downtime.
  • Design, implement, and optimize modern cloud infrastructure to support growing needs.
  • Implement SRE and DevOps best practices to enhance scalability, reliability, and efficiency.
  • Create and maintain comprehensive and effective documentation
  • Implement and uphold security best practices to safeguard infrastructure assets
  • Establish and maintain effective monitoring systems to analyze system metrics, optimize performance, and troubleshoot issues.
  • Monitor cloud infrastructure costs and identify opportunities to optimize cloud resources
  • Lead migration, deployment and upgrade efforts to transition to new infrastructure while avoiding business impact.

Open to All Visa types, but preference given to USC or GC candidates.
Candidates local to Silicon Valley preferred. 
c2c, 1099 OR w2 Payment Terms accepted
100% REMOTE

  • Kansas City, United States Gorilla Logic Full time

    Gorilla Logic: Mid-Level Site Reliability Engineer (SRE) Gorilla Logic provides nearshore Agile teams to Fortune 500 and SMB companies, bringing unparalleled expertise in the delivery of full-stack web, mobile, and enterprise applications. Our highly collaborative Agile Gorillas are uniquely qualified to implement complex software initiatives. With offices...


  • Kansas City, United States Gorilla Logic Full time

    Gorilla Logic Overview Gorilla Logic provides nearshore Agile teams to Fortune 500 and SMB companies, bringing unparalleled expertise in the delivery of full-stack web, mobile, and enterprise applications. Our highly collaborative Agile Gorillas are uniquely qualified to implement complex software initiatives. With offices in the United States, Costa Rica,...


  • Jersey City, United States DevExperts Full time

    Devexperts has been working for nearly two decades consulting and developing for the financial industry. We solve complex technological challenges facing the most well-respected financial institutions worldwide. By becoming a part of Devexperts, you’ll become a part of a company that fosters self-improvement and actively seeks out-of-the-box ideas. Our...


  • Jersey City, United States Veterans Sourcing Group LLC Full time

    Site Reliability Engineer (AWS) (SRE) Jersey City, NJ- onsite 3 days/ week 12 month minimum contract w/ possible full time conversion Roles And Responsibilities Design, code, test, and deliver software to automate manual operational work Troubleshoot priority incidents, facilitate blameless post-mortems, and ensure permanent closure of incidents Engage with...


  • Salt Lake City, United States Technology Search Group, Inc. Full time

    About the job Site Reliability Engineer (SRE) Responsibilities Responsible for collaborating with businesspeople to have a real time understanding of business problems and expected to focus on agile methodology of development. Deliver high quality change within the deadlines. In this role, you will be responsible for coding, testing and delivering high...


  • West Valley City, United States Technology Search Group, Inc. Full time

    About the job Site Reliability Engineer (SRE) Responsibilities Responsible for collaborating with businesspeople to have a real time understanding of business problems and expected to focus on agile methodology of development. Deliver high quality change within the deadlines. In this role, you will be responsible for coding, testing and delivering high...


  • Salt Lake City, United States Global Channel Management Full time

    Requirements for a Junior Support Engineer SRE Position on LinkedIn Skills And Qualifications A minimum of 4 years of relevant experience Proficiency in standard RPE and strong written and verbal communication skills Demonstrated expertise in Linux systems Familiarity with Python for automation tasks Experience in Incident management protocols Willingness to...


  • Oklahoma City, United States BJ's Wholesale Club Full time

    Lead Site Reliability Engineer page is loaded Lead Site Reliability Engineer Apply locations BJ's Club Support Center Marlborough, MA #5997 time type Full time posted on Posted 2 Days Ago job requisition id R147855 Join our team of more than 34,000 team members, supporting our members and communities in our Club Support Center, 235+ clubs and eight...


  • Nevada, United States Redwood Materials, Inc. Full time

    Site Reliability Engineer We are seeking a highly skilled and motivated Site Reliability Engineer to collect requirements, design & implement highly available systems & solutions , coordinate work across multiple teams, drive improvements to existing systems, introduce automation, integrations, and ensure appropriate monitoring & alerting is in place for...


  • Oklahoma City, United States CROSS WEB Full time

    Sabre Corporation is a leading technology provider to the global travel and tourism industry. Headquartered in Southlake, Texas, USA, Sabre operates offices in approximately 60 countries around the world. At Sabre, we make travel happen. Positioned at the center of the business of travel, our platform connects people with experiences that matter in their...


  • Universal City, United States NBC Universal Media, LLC Full time

    Lead 3 SRE resources to perform day to day and embed SRE operations. Provide architectural and technical guidance and mentorship to SRE teams, fostering skill development, and building a strong and capable SRE practices. Oversee teams who are respons Reliability Engineer, Liability, Staff, Software Engineer, Reliability, Reliability, Technology


  • Salt Lake City, United States ARCS Full time

    Join our client's vibrant team in Cape Town as an Intermediate Site Reliability Engineer (SRE II). Operating mostly remotely, their team occasionally collaborates in the office for direct engagement. Your role involves achieving operational excellence through automation tooling (e.g., Terraform). You'll contribute to architectural discussions, keeping your...


  • Jersey City, NJ, United States Trigyn Technologies Inc Full time

    Immediate long-term contract to hire opportunity for Sr. Site Reliability Support Engineer with direct client in Jersey City. Trigyn’s financial services client has an immediate need for a Site Reliability Engineer in Jersey City. This is a long-term contract assignment, that could potentially become a “temp to perm” opportunity for the right...


  • Jersey City, United States The Dignify Solutions, LLC Full time

    10+ years of Development and Operations experience in building and running applications in production that has uptime over 99%. Related experience and/or training; or equivalent combination of education and experience 8+ years of experience as a SRE Architect in running large Reliability & Observability Programs for large, complex infrastructure deployments...


  • Jersey City, United States The Dignify Solutions, LLC Full time

    10+ years of Development and Operations experience in building and running applications in production that has uptime over 99%. Related experience and/or training; or equivalent combination of education and experience 8+ years of experience as a SRE Architect in running large Reliability & Observability Programs for large, complex infrastructure...


  • Jersey City, New Jersey, United States Devexperts Full time

    Company DescriptionDevexperts has been working for nearly two decades consulting and developing for the financial industry. We solve complex technological challenges facing the most well-respected financial institutions worldwide.By becoming a part of Devexperts, you'll become a part of a company that fosters self-improvement and actively seeks...


  • Salt Lake City, United States Diverse Lynx Full time

    Role: Site Reliability Engineer Type: Full time perm Location: Salt Lake City, Utah Annual Salary: Market Standard Responsibilities " Opportunity to drive modern Observability platform that covers Cloud-native and hybrid applications " Able to persuade stakeholders and champion effective techniques through product development " Solid understanding of...


  • Salt Lake City, United States Goldman Sachs Full time

    MORE ABOUT THIS JOB: Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. At Goldman Sachs, SRE is responsible for the availability and reliability of our firm's most critical platform services, and ensures they meet the...


  • Jersey City, United States Pinnacle Group Full time

    Site Reliability Engineer (AWS) Jersey City, NJ 07310 - Onsite (3) Days Hybrid schedule "Must be able to work on W2 without sponsorship. " Contract to Hire Opportunity Must Have : AWS Certification 7-8 years of experience and 2 years of AWS exp Tools: Grafana, DataDog Database: MySQL or Oracle -Unix, Linux, Shell Scripting, LAN, NFS -Python, Go Lang,...


  • Jersey City, United States Pinnacle Group Full time

    Site Reliability Engineer (AWS) Jersey City, NJ 07310 - Onsite (3) Days Hybrid schedule "Must be able to work on W2 without sponsorship. " Contract to Hire Opportunity Must Have : AWS Certification 7-8 years of experience and 2 years of AWS exp Tools: Grafana, DataDog Database: MySQL or Oracle -Unix, Linux, Shell Scripting, LAN, NFS -Python, Go...