Service Reliability Engineer
2 weeks ago
Job Description
Position: Service Reliability Engineer / Sr. DevOps Engineer
Location: Santa Clara, CA
Duration: 1 Year+
OK with any visa; no OPT, please.
Local consultants only.
The customer will not provide a letter for H-1B candidates. Please check with the candidate and employers before submitting the resume. Face-to-face is mandatory, so please submit local candidates only.
Responsibilities:
Development and Operations (DevOps) subject matter expert for 24x7 SaaS operation
Work hand-in-hand with microservice software developers, architects, and field integration resources to architect and deliver Ericsson's next generation TV platforms.
Contribute to the development of new tools and automation that ensure the service can be optimized and tuned with minimal human intervention.
Accountable for working upstream with microservice developers on monitoring, tools, and architecture to deliver security, reliability, manageability, and availability at scale.
Point of escalation/decision maker on response level of incidents
Participate in the Core SRE on-call roster and respond with command and control incident management during High Pri Events while maintaining internal and external SLAs
Act as Technical Duty Officer who leads resolution effort of the most complex service problems from network layer to the application at scale
Drive Problem Management/Retrospectives ("post mortems")
Contribute strongly to, and help maintain, our knowledge base
Analyze trends and make recommendations in the areas of monitoring, incident and change management, cloud orchestration and support.
Contribute to the future growth of the team by conducting candidate screenings and assessments
Accountable for deploying services to production environments
Technologies:
Experience with Docker and SaltStack, Kubernetes orchestration tools, etc.
Knowledge of MongoDB, Cassandra databases, Kafka, IIS Servers on Azure/AWS/OpenStack
Azure, OpenStack and AWS concepts and APIs
Experience designing, setting up, maintaining, and refining (noise reduction, auditing) monitoring tools such as Prometheus, Prometheus exporters, Kibana, Grafana, Alertmanager, etc.
Demonstrable experience in one or more languages: PowerShell, Python, Bash, C#, .NET
Strong knowledge of TCP/IP networking, DNS, VPNs, HTTP, load-balancers (such as NGINX), highly available microservice architecture, CDNs
Team Foundation Server/Visual Studio, Atlassian suite (Jira, Confluence), Git
Analysis of network, performance, and application issues using tcpdump, Fiddler, and Wireshark.
Qualifications:
Bachelor's Degree in CS, MIS, or equivalent experience
5+ years of relevant experience with Windows/Unix systems fundamentals, monitoring, cloud services, networking, storage, database, and application knowledge
Solid communication skills, both written and verbal. Able to effectively tailor messaging to different audiences: external customers, leadership, technical SMEs, or Tier-1
Previous experience in customer facing roles during high stress situations
Demonstrated skills as an influencer within a previous organization
In-depth knowledge of IT concepts, strategies, and methodologies; Agile knowledge a plus
In-depth knowledge of business operations, objectives, and strategies.
Familiarity with containers (e.g. Docker, rkt) and IaaS (e.g. AWS, Azure, OpenStack).
-
Reliability Engineer
5 days ago
Santa Clara, United States Natron Energy Full time
Natron is seeking a Reliability Engineer to support the development and test of our high-power battery systems for data center UPS and EV charging applications. The occupant of this position will work with the Product Engineering, Reliability, Technology, and Operations teams to develop procedures for accelerated life and abuse testing of battery systems and...
-
Reliability Engineer
1 week ago
Santa Clara, CA, United States Natron Energy Full time
Santa Clara, CA / Operations / Full Time / On-site
Natron is seeking a Reliability Engineer to support the development and test of our high-power battery systems for data center UPS and EV charging applications. The occupant of this position will work with the Product Engineering, Reliability, Technology, and Operations teams to develop procedures for accelerated...
-
Principal Site Reliability Engineer
18 hours ago
Santa Clara, United States Oracle Full time
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design an...
-
Santa Clara, United States QCells Full time
Hanwha Q CELLS Co., Ltd., is one of the world's largest and most recognized photovoltaic manufacturers for its high-performance, high-quality solar cells and modules. It is headquartered in Seoul, South Korea (Global Executive HQ) and Talheim, Germany (Technology & Innovation HQ). Through its growing global business network spanning Europe, North...
-
Senior Reliability Test Engineer
1 week ago
Santa Clara, United States Johnson & Johnson Full time
Johnson & Johnson's Robotic and Digital Solutions (RAD) group is recruiting for a Senior Reliability Test Engineer, located in Santa Clara, CA. Robotics & Digital Solutions is part of Ethicon, Inc., a global leader in surgery with products and solutions found in almost every operating room around the world. Ethicon has made significant contributions to...
-
Reliability Manager
1 week ago
Santa Clara, United States Nvidia Full time
NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI - the next era of computing. NVIDIA is a "learning machine" that constantly evolves by adapting...
-
Sr Director of Quality and Reliability
7 days ago
Santa Clara, United States Natron Energy Full time
We are seeking a highly experienced and dynamic individual to join our team as the Senior Director of Quality and Reliability. In this role, you will play a pivotal leadership role in championing a culture of quality excellence and driving the design and testing processes to ensure product reliability. You will lead a team of talented professionals and...
-
Sr Director of Quality and Reliability
2 days ago
Santa Clara, United States Natron Energy Full time
We are seeking a highly experienced and dynamic individual to join our team as the Senior Director of Quality and Reliability. In this role, you will play a pivotal leadership role in championing a culture of quality excellence and driving the design and testing processes to ensure product reliability. You will lead a team of talented professionals and...
-
Staff Customer Reliability Engineer
4 weeks ago
Santa Clara, California, United States Palo Alto Networks Full time
Job Description: Your Career. Palo Alto Networks Cloud Security Products are the latest in our security platform that bring the power and scale of our products to our customers, and industry. It’s a groundbreaking change in the way the industry views cybersecurity as it relates to our cloud environments, one that is necessary in our mission...
-
Sr Site Reliability Engineer
5 days ago
Santa Clara, United States Palo Alto Networks Full time
Job Description: Your Career. We are looking for a Sr DevOps/SRE to operate in production a large scale GCP cloud running our innovative SaaS cyber-security product, while continuously improving application deployment, monitoring, operability and uptime of the service. The Cortex XDR group specializes in analysis and visualization of complex cyber-data...
-
Sr Director of Quality and Reliability
1 week ago
Santa Clara, CA, United States Natron Energy Full time
Santa Clara, CA / Operations / Full Time / On-site
We are seeking a highly experienced and dynamic individual to join our team as the Senior Director of Quality and Reliability. In this role, you will play a pivotal leadership role in championing a culture of quality excellence and driving the design and testing processes to ensure product reliability. You will...
-
Senior Field Service Engineer
4 days ago
Santa Clara, United States Orion Talent Full time
Position Details: Title: Engineer in Charge (EIC) Location: Santa Clara, CA Shift: Day shift, typically M – F Compensation: $150K depending on experience and qualifications and annualized bonus. Benefits: Medical, dental, vision and 401K (see website for more information). Excellent professional and personal development opportunities in an international...
-
Senior Field Service Engineer
1 month ago
Santa Clara, United States Orion Talent Full time
Position Details: Title: Engineer in Charge (EIC) Location: Santa Clara, CA Shift: Day shift, typically M – F Compensation: $150K depending on experience and qualifications and annualized bonus. Benefits: Medical, dental, vision and 401K (see website for more information). Excellent professional and personal development opportunities in an international...
-
Sr Site Reliability Engineer
1 week ago
Santa Clara, California, United States Palo Alto Networks Full time
Job Description: Your Career. We are looking for a Sr DevOps/SRE to operate in production a large scale GCP cloud running our innovative SaaS cyber-security product, while continuously improving application deployment, monitoring, operability and uptime of the service. The Cortex XDR group specializes in analysis and visualization of complex cyber-data...
-
Principal Software Engineer, Site Reliability
3 weeks ago
Santa Clara, California, United States Palo Alto Networks Full time
Job Description: Your Career. We are looking for a Principal DevOps/SRE to operate in production a large scale GCP cloud running our innovative SaaS cyber-security product, while continuously improving application deployment, monitoring, operability and uptime of the service. The Cortex XDR group specializes in analysis and visualization of complex...
-
Team Lead, Field Service Engineer
4 weeks ago
Santa Clara, United States IMS Nanofabrication Full time
Job Description
Responsibilities: Team Lead of Field Service Engineers (40%)
• Manage the Field Service Engineer Team in Santa Clara, CA.
• Supervise and execute the planning and progress of activities of a diverse team of Field Service Engineers.
• Internally align on Service resources according to tool status and customer requests.
•...
-
Engineering Manager Cloud Services
3 weeks ago
Santa Clara, United States Palo Alto Networks Full time
PALO ALTO NETWORKS is the fastest-growing security company in history. We offer the chance to be part of an important mission: ending breaches and protecting our way of digital life. If you are a motivated, intelligent, creative, and hardworking individual, then this job is for you! We are seeking a development leader to join our exciting team in building...
-
Engineering Manager Cloud Services
16 hours ago
Santa Clara, United States Palo Alto Networks Full time
PALO ALTO NETWORKS is the fastest-growing security company in history. We offer the chance to be part of an important mission: ending breaches and protecting our way of digital life. If you are a motivated, intelligent, creative, and hardworking individual, then this job is for you! We are seeking a development leader to join our exciting team in building...