Site Reliability Engineer
1 week ago
Site Reliability Engineer (SRE)
Location: Sunnyvale, CA (Day 1 Onsite)
Long Term
We are seeking a highly skilled Site Reliability Engineer (SRE) with a strong background in DevOps practices and expertise in Ansible, AWS, NGINX, load balancing, and related technologies. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our infrastructure and applications while enhancing the overall efficiency of our development and operations teams.
Key Responsibilities
- Design, implement, and manage scalable, reliable, and secure infrastructure using AWS services.
- Develop, automate, and manage CI/CD pipelines to streamline the deployment process.
- Use Ansible for configuration management and infrastructure automation to ensure consistent environments across development, testing, and production.
- Configure, manage, and optimize NGINX as a reverse proxy, load balancer, and web server for high-availability services.
- Design and maintain load balancing strategies to distribute traffic effectively across multiple servers.
- Monitor system performance and availability, troubleshooting and resolving incidents as they occur.
- Implement infrastructure as code (IaC) practices to automate the provisioning of resources.
- Collaborate with development and operations teams to build reliable and efficient systems.
- Conduct root cause analysis of incidents and implement preventive measures.
- Manage logging and monitoring tools to gain insights into system performance and identify bottlenecks.
- Drive efforts to improve system scalability and capacity planning.
Required Skills and Qualifications
- 10+ years of experience as an SRE, DevOps Engineer, or in a similar role.
- Proficiency in Ansible for configuration management and automation.
- Strong hands-on experience with AWS services (EC2, S3, RDS, Route 53, Lambda, etc.).
- Expertise in NGINX configuration and optimization for web servers and load balancing.
- Solid understanding of load balancing concepts, algorithms, and tools.
- Experience with containerization technologies such as Docker and orchestration tools like Kubernetes.
- Familiarity with scripting languages such as Python or Bash (shell scripting).
- Strong understanding of system monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, Splunk).
- Knowledge of networking concepts, including DNS, TCP/IP, firewalls, and VPNs.
- Experience with CI/CD tools such as Jenkins, GitLab CI, or CircleCI.
- Strong problem-solving and analytical skills with attention to detail.
- Excellent communication skills to collaborate with cross-functional teams.
-
Site Reliability Engineer
1 week ago
Sunnyvale, United States Headway Tek Inc Full time Hi, hope you are doing well. Please have a look at the requirement below; if you are interested, please share your updated resume. Job Description: Site Reliability Engineer (SRE) Location: Sunnyvale, CA (Day 1 Onsite) Position type: Full-time Experience Required: Minimum 8-10 years overall, including 5+ years specifically as an SRE. We are looking for only independent...
-
Site Reliability Engineer
8 hours ago
Sunnyvale, United States Diverse Lynx Full time Engineer Experience in infrastructure management, such as managing storage, compute, and network resources by automation of SRE reports. Experience in converting legacy applications to Docker/Kubernetes and deploying them in a cloud environment. Good understanding of enterprise-level vulnerability management. Hands-on experience in monitoring & logging and capacity...
-
Software Engineering Manager, Site Reliability
2 weeks ago
Sunnyvale, United States Apple Full time Software Engineering Manager, Site Reliability Sunnyvale, California, United States Software and Services Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other’s ideas stronger. It’s...
-
Software Engineering Manager, Site Reliability
3 weeks ago
Sunnyvale, United States Apple Inc. Full time Software Engineering Manager, Site Reliability Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other’s ideas stronger. That happens because every one of us shares a belief that we can...
-
Site Reliability Engineer
1 week ago
Sunnyvale, United States Tech Mahindra Full time Greetings! Position Title: Site Reliability Engineer (SRE) Location: Sunnyvale, CA (Day 1 Onsite) No. of positions: 4 Expertise in Ansible, AWS, NGINX, load balancing. We are seeking a highly skilled Site Reliability Engineer (SRE) with a strong background in DevOps practices and expertise in Ansible, AWS, NGINX, load balancing, and related technologies. The...
-
Senior Site Reliability Engineer
1 week ago
Sunnyvale, United States Synopsys Full time We Are: At Synopsys, we drive the innovations that shape the way we live and connect. Our technology is central to the Era of Pervasive Intelligence, from self-driving cars to learning machines. We lead in chip design, verification, and IP integration, empowering the creation of high-performance silicon chips and software content. Join us to transform...
-
Site Reliability Engineer
10 hours ago
Sunnyvale, United States Futran Tech Solutions Pvt. Ltd. Full time Position: SRE Location: Sunnyvale, CA (Onsite) Mode: Full-Time Job Description: Software Development Engineer - SRE (Onsite) Skilled at writing clean, high-performance, and unit-testable code in Java. Proficiency with the architecture, deployment, performance tuning, and troubleshooting of large-scale distributed systems on AWS. Understanding of SRE principles...
-
Nexwave | Site Reliability Engineer
1 week ago
Sunnyvale, United States Nexwave Full time Site Reliability Engineer (SRE) Location: Sunnyvale, CA (Day 1 Onsite) Long Term We are seeking a highly skilled Site Reliability Engineer (SRE) with a strong background in DevOps practices and expertise in Ansible, AWS, NGINX, load balancing, and related technologies. The ideal candidate will be responsible for ensuring the reliability, scalability, and...
-
Tech Mahindra | Site Reliability Engineer
1 week ago
Sunnyvale, United States Tech Mahindra Full time Greetings! Position Title: Site Reliability Engineer (SRE) Location: Sunnyvale, CA (Day 1 Onsite) No. of positions: 4 Expertise in Ansible, AWS, NGINX, load balancing. We are seeking a highly skilled Site Reliability Engineer (SRE) with a strong background in DevOps practices and expertise in Ansible, AWS, NGINX, load balancing, and related technologies. The...
-
Senior Reliability Engineer
4 hours ago
Sunnyvale, United States Figure Full time Figure is an AI Robotics company developing a general-purpose humanoid. Our Humanoid is designed for corporate tasks targeting labor shortages and jobs that are undesirable or unsafe. We are based in Sunnyvale, CA and require 5 days/week in-office collaboration. We are looking for a Senior Reliability Test Engineer in charge of designing and executing test...
-
Sr. Associate Site Reliability Engineer
7 hours ago
Sunnyvale, United States Synopsys Full time We Are: At Synopsys, we drive the innovations that shape the way we live and connect. Our technology is central to the Era of Pervasive Intelligence, from self-driving cars to learning machines. We lead in chip design, verification, and IP integration, empowering the creation of high-performance silicon chips and software content. Join us to transform the...
-
Senior Reliability Test Engineer
2 hours ago
Sunnyvale, United States Figure Full time Figure is an AI Robotics company developing a general-purpose humanoid. Our Humanoid is designed for corporate tasks targeting labor shortages and jobs that are undesirable or unsafe. We are based in Sunnyvale, CA and require 5 days/week in-office collaboration. We are looking for a Senior Reliability Test Engineer to be in charge of designing and executing...
-
Data Center Chief Site Engineer
2 months ago
Sunnyvale, United States DataFlex LLC, The Human Capital & Company Matchmaker Experts Full time Data Center Chief Site Engineer - Critical Facilities - Sunnyvale. Must be local to the Bay Area. Must be authorized to work in the US without sponsorship. Must meet the qualifications as set forth in the job description below: please do not apply if you do not meet the experience for this role. Our client is seeking to add a Chief Engineer who aligns with our...
-
Senior Data Center Infrastructure Engineer
7 days ago
Sunnyvale, California, United States LinkedIn Full time Overview: Data center engineering is a critical field that involves the design, construction, and maintenance of data centers. As a Senior Data Center Infrastructure Engineer at LinkedIn, you will play a vital role in ensuring the stability, reliability, and security of our data center infrastructure. Job Description: This role will be based in Manassas, VA, and...
-
Operations Engineer, Fleet Reliability
3 weeks ago
Sunnyvale, United States CoreWeave Full time Job Description: CoreWeave is the AI Hyperscaler™, delivering a cloud platform of cutting-edge services powering the next wave of AI. The company's technology provides enterprises and leading AI labs with the most performant, efficient and resilient solutions for accelerated computing. Since 2017, CoreWeave has operated a growing footprint...
-
Software Engineer
1 month ago
Sunnyvale, United States Compunnel Inc. Full time Client: Walmart Role: Software Engineer Location: Sunnyvale, CA / 2 days/week Onsite Contract – 12+ months Description: This is a coder position and we are looking for development engineers with experience in at least one of Java or Python. Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively...
-
Software Engineer
2 months ago
Sunnyvale, United States Compunnel Inc. Full time Client: Walmart Role: Software Engineer Location: Sunnyvale, CA / 2 days/week Onsite Contract – 12+ months Description: This is a coder position and we are looking for development engineers with experience in at least one of Java or Python. Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively...