Staff Site Reliability Engineer
4 weeks ago
We are seeking a highly skilled Cloud Platform SRE Engineer to join our team at Illumio. As a key member of our Operations team, you will be responsible for designing and deploying scalable, reliable, and secure cloud infrastructure. You will work closely with our Platform and Data engineers to deliver the latest Illumio products.
Key Responsibilities:
- Design and deploy scalable, reliable, and secure cloud infrastructure
- Collaborate with Platform and Data engineers to deliver the latest Illumio products
- Define and meet Platform SLOs, capacity utilization, cost visibility, security compliance, etc.
- Drive reliability improvements back into applications
- Mentor and educate team members to aid in strengthening technical expertise
- Curate proper SLI/SLOs to accurately measure or assess error budgets
- Embed with development teams to assist with cloud methodologies when developing products to ensure that the deliverable is as reliable as possible
- Work with development teams to build and strengthen application security and compliance
- Manage high-impact situations that involve technically challenging issues across diverse audiences and drive to find the root cause, mitigate, and identify a solution
- Focus on observability
Requirements:
- Bachelor's degree in Computer Science, Engineering, or related field; or equivalent work experience
- 6+ years of relevant SRE, DevOps, Platform or Infrastructure Engineering experience
- 4+ years in production support role in a fast-paced industry/organization
- Experience deploying, tuning, and maintaining Linux-based, highly available, fault-tolerant web platforms in public cloud providers such as AWS, Azure, and GCP
- Common monitoring, log aggregation, and metrics gathering platforms experience (Icinga, Sensu, Splunk, Telegraf/InfluxDB, et. al.)
- Configuration management & orchestration tools experience like Chef, Ansible, and AWS Services & APIs, or equivalent
- Experience scripting/coding with Python, Java, Ruby, and/or Go
- Experience with MySQL, PostgreSQL, Redis, or similar
- Solid knowledge of Linux operating system, Ubuntu, RHEL, OEL7 is required
- EKS and/or AKS frameworks
- Knowledge/Experience of Incident Management/on-call: PagerDuty
- Knowledge of Database Technologies, Release Management, REST, SRE, etc.
- Load balancers/ Traffic manager knowledge
- Experience working with Kubernetes, Docker, or other virtualization & containerization technologies
- Networking basics and trouble shooting skills
- Good understanding of Production deployment, Distributed Environments required
- Strong problem-solving and operational process skills, attention to detail
- Application support and debugging experience in a dynamic fast-paced production environment
- Experience with SDLC principles, architecture, and operations
- Experience working with senior leadership both inside and outside of engineering
- Ability to manage multiple tasks and competing priorities to deliver projects on schedule
- Azure certifications such as Azure Administrator, Azure Developer, or AWS/GCP certifications are a plus
What We Offer:
- $183,000 USD - $220,000 USD
- A wide range of benefits to our eligible team members, including Medical, Dental, Vision Coverage, Health and Dependent Savings Accounts, Life and Disability Programs, Paid Parental Leave, Voluntary Benefit Programs, Company Sponsored Wellness Program, Wellness Reimbursement Program, Retirement Savings, Equity Opportunities, Paid time off and Paid Holidays, Employee Incentive Program
About Illumio:
Illumio believes that an environment of unique backgrounds, experiences, viewpoints, and individual contributions drives our success and makes us stronger together. We are dedicated to creating and maintaining a diverse culture and emphasizing inclusion and belonging.
-
Site Reliability Engineer
4 weeks ago
Sunnyvale, California, United States Apple Full timeJob DescriptionAt Apple, we're revolutionizing the way people interact with technology. As a Senior Site Reliability Engineer, you'll play a critical role in maintaining and enhancing the reliability of our production systems.Key ResponsibilitiesDesign, develop, and maintain scalable, reliable, and efficient infrastructure.Implement monitoring, alerting, and...
-
Site Reliability Engineer
3 weeks ago
Sunnyvale, California, United States Saxon Global Full timeJob SummaryAs a Site Reliability Engineer at Saxon Global, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based e-commerce and retail platform. You will work closely with our software engineering teams to design, build, and maintain tools that improve the overall system health and availability.Key...
-
Senior Site Reliability Engineer
4 weeks ago
Sunnyvale, California, United States JobRialto Full timeJob Summary:We are seeking a Senior Site Reliability Engineer to join our team on a 12-month contract. The ideal candidate will combine software and systems engineering expertise to build, run, and optimize large-scale, fault-tolerant systems. This role focuses on automation, reliability, scalability, and performance, ensuring that applications have the...
-
Site Reliability Engineer
3 weeks ago
Sunnyvale, California, United States Apple Full timeSite Reliability Engineer - Infrastructure ExpertAt Apple, we're looking for a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in maintaining and enhancing the reliability of our production systems.You will collaborate with engineering teams to design, implement, and monitor infrastructure and...
-
Site Reliability Engineer
4 weeks ago
Sunnyvale, California, United States Diverse Lynx Full timeAs a Site Reliability Engineer at Diverse Lynx LLC, you will be responsible for managing infrastructure resources through automation of SRE reports. You will also convert legacy applications to Docker/Kubernetes and deploy them on cloud environments. Additionally, you will have a good understanding of enterprise-level vulnerability management.Key...
-
Sunnyvale, California, United States AdbaKx Full timeJob Title: Site Reliability Engineer with focus on Network and Security ExpertiseLocation: Sunnyvale, CADuration: 12 MonthsMust Have: NginX, Netscaler, Envoy Load balancing and Traffic management Network securityJob Description:As a Site Reliability Engineer with a focus on Network and Security Expertise, you will be responsible for ensuring the reliability...
-
Staff Engineer, Site Reliability Expert
3 weeks ago
Sunnyvale, California, United States LinkedIn Full timeAbout Traffic SRETraffic is responsible for delivering LinkedIn products and services to everyone on the Internet. Our team operates the edge of LinkedIn's data centers with a massive infrastructure that serves over 1 billion members and millions of requests per second. We develop and manage Layer 4 and Layer 7 network proxies, load balancers, service...
-
Staff Software Engineer
4 weeks ago
Sunnyvale, California, United States Walmart Full timeJob SummaryWe are seeking a highly skilled Staff Software Engineer to join our team at Walmart Global Tech. As a Staff Software Engineer, you will play a key role in driving the development of complex software changes and leading the design of new features. You will work closely with senior and junior teammates to cultivate a reciprocal learning environment...
-
Staff Software Engineer
3 weeks ago
Sunnyvale, California, United States Walmart Full timeAbout the Role:We are seeking a highly skilled Staff Software Engineer to join our team at Walmart Global Tech. As a Staff Software Engineer, you will be responsible for leading the design and implementation of high-performance edge systems, working closely with cross-functional teams to drive innovation and excellence in software development.Key...
-
Data Center Chief Site Engineer
3 weeks ago
Sunnyvale, California, United States DataFlex LLC, The Human Capital & Company Matchmaker Experts Full timeData Center Chief Site EngineerJob SummaryDataFlex LLC, The Human Capital & Company Matchmaker Experts is seeking a highly skilled Data Center Chief Site Engineer to join our team. As a key member of our facility operations team, you will be responsible for the operational management and effective daily oversight and administration of the site's operational...
-
Staff Software Engineer and Tech Lead
4 weeks ago
Sunnyvale, California, United States Uber Full timeAbout the RoleWe are seeking a highly skilled Staff Software Engineer and Tech Lead to join our Guest Products team at Uber. As a key member of our engineering team, you will be responsible for leading the design, development, and deployment of scalable and reliable backend systems.Key ResponsibilitiesLead the design, development, and deployment of scalable...
-
Staff Software Engineer
3 weeks ago
Sunnyvale, California, United States Walmart Global Tech Full timeAbout the Role:We are seeking a highly skilled Staff Software Engineer to join our team at Walmart Global Tech. As a Staff Software Engineer, you will be responsible for leading the design and development of mobile applications using React Native, ensuring scalability, reliability, and performance.Key Responsibilities:Lead the design and development of...
-
Staff Software Engineer
3 weeks ago
Sunnyvale, California, United States Sam's Club Full timeAbout the Role:We are seeking a highly skilled Staff Software Engineer to lead the design and development of mobile applications using React Native. The ideal candidate will have a strong background in software engineering, with a focus on mobile development and a passion for creating scalable, reliable, and high-performance applications.Key...
-
Staff Media Systems Engineer
4 weeks ago
Sunnyvale, California, United States LinkedIn Full timeJob Title: Staff Media Systems EngineerAt LinkedIn, we're seeking a highly skilled Staff Media Systems Engineer to join our Studio Engineering team. As a key member of our team, you'll be responsible for designing and running our global Live Production platform, ensuring seamless execution of large live events and productions for both internal and external...
-
Staff Software Engineer
4 weeks ago
Sunnyvale, California, United States Walmart Full timeJob SummaryWe are seeking a highly skilled Staff Software Engineer to join our team at Walmart Global Tech. As a key member of our engineering team, you will be responsible for designing, developing, and deploying complex software systems that drive business growth and innovation.Key Responsibilities:Lead and participate in medium- to large-scale projects,...
-
Staff Software Engineer, Display Ads
4 weeks ago
Sunnyvale, California, United States Uber Full timeAbout the RoleAs a Staff Software Engineer on the Display Ads team at Uber, you will be at the forefront of building and scaling Uber's advertising products and its underlying platform. Your work will enable Uber to provide personalized and relevant ad experiences, driving both customer satisfaction and revenue growth.Key ResponsibilitiesLead the design,...
-
Staff Software Engineer
4 weeks ago
Sunnyvale, California, United States Walmart Full timeAbout the RoleWe are seeking a highly skilled Staff Software Engineer to join our team at Walmart Global Tech. As a Staff Software Engineer, you will play a key role in shaping the technical direction of our software engineering team and driving innovation in our products and services.Key Responsibilities:Provide technical leadership and guidance to junior...
-
Senior Staff Software Engineer
4 weeks ago
Sunnyvale, California, United States Uber Full timeAbout the RoleThe Movement Engine org has four pillars that power our earners' movement in the physical world through the Uber platform, creating a delightful experience for our riders during all stages of a trip, and delivering food to our eaters in a pleasant and timely manner. These are our areas of focus: Leveraging GPS data and handling real-time...
-
Staff Data Engineer
2 weeks ago
Sunnyvale, California, United States LinkedIn Full timeKey ResponsibilitiesAs a Staff Software Engineer on LinkedIn's Data Science team, you will be responsible for leading the architecture and design of novel data applications, including front-end and back-end development. You will work closely with cross-functional teams to identify business opportunities and build scalable data solutions, establishing...
-
Sunnyvale, California, United States Tesla Full timeThe Field Reliability Engineering team at Tesla is dedicated to improving customer experience by reducing field issues. As a Field Reliability Specialist, you will be responsible for analyzing and resolving issues affecting high voltage systems in our vehicles.This is a hands-on individual contributor role that requires a strong background in electrical...