Site Reliability Developer 3

4 weeks ago


Santa Clara, California, United States Oracle Full time

Job Summary

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services.

Key Responsibilities

  • Work with the Site Reliability Engineering (SRE) team on shared full-stack ownership of a collection of services and/or technology areas.
  • Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services.
  • Own the design and delivery of the mission-critical stack, with a focus on security, capacity, resiliency, scale, and performance.
  • Serve as the authority for end-to-end performance and operability.
  • Partner with development teams in defining and implementing improvements in service architecture.
  • Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio.
  • Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack.
  • Demonstrate clear understanding of automation and orchestration principles.
  • Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs).
  • Use a deep understanding of service topologies and their dependencies to troubleshoot issues and define mitigations.
  • Understand and explain the effect of product architecture decisions on distributed systems.
  • Bring professional curiosity and a desire to develop a deep understanding of services and technologies.

About Us

As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's problems. True innovation starts with diverse perspectives and various abilities and backgrounds.

When everyone's voice is heard, we're inspired to go beyond what's been done before. It's why we're committed to expanding our inclusive workforce that promotes diverse insights and perspectives.

We've partnered with industry leaders in almost every sector, and we continue to thrive after 40+ years of change by operating with integrity.

Oracle careers open the door to global opportunities where work-life balance flourishes. We offer a highly competitive suite of employee benefits designed on the principles of parity and consistency.

We put our people first with flexible medical, life insurance and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We're committed to including people with disabilities at all stages of the employment process.

Oracle is an Equal Employment Opportunity Employer*. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law.

Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.



  • Santa Clara, California, United States Syntricate Technologies Full time

    Job Title: Site Reliability Engineering. We are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based systems. Key Responsibilities: Design and implement scalable and reliable cloud infrastructure using...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job Description: Palo Alto Networks is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining scalable and reliable infrastructure for our cloud-based products. Key Responsibilities: Design and implement scalable and reliable infrastructure for...


  • Santa Clara, California, United States NVIDIA Full time

    As a Senior Manager in Site Reliability Engineering (SRE) at NVIDIA, you will lead a team dedicated to the design, construction, and maintenance of expansive production systems, emphasizing high efficiency and availability. This role spans various domains, including software and systems engineering, cloud-scale storage, data management, and services. SRE...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the Role: We are seeking a highly skilled Senior Staff Site Reliability Engineer to join our CDL/SLS team at Palo Alto Networks. As a key member of our team, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure. Our Infrastructure Platform stack includes Terraform, Kubernetes, GitLab CI/CD, GitOps,...


  • Santa Clara, California, United States NVIDIA Full time

    NVIDIA is a leader in AI, machine learning, and datacenter acceleration. Our company is expanding its leadership into datacenter networking with ethernet switches, NICs, and DPUs. We have continuously reinvented ourselves over two decades, with our invention of the GPU in 1999 sparking the growth of the PC gaming market, redefining modern computer graphics,...


  • Santa Clara, California, United States Palo Alto Networks Full time

    About the Role: We are seeking a highly skilled Senior Staff Site Reliability Engineer to join our team at Palo Alto Networks. As a key member of our Cloud Infrastructure team, you will be responsible for designing, building, and operating reliable and secure cloud infrastructure. Our ideal candidate will have a strong background in cloud computing, with...


  • Santa Clara, California, United States Palo Alto Networks Full time

    Job Description: Your Career. The Global Customer Operation Team is responsible for building products that protect data, workloads, and infrastructure for some of the largest enterprise customers in the world. We help our customers in their journey to the public cloud by ensuring they have the best in class protection. The public cloud market has been growing at a...


  • Santa Clara, California, United States Palo Alto Networks Full time

    We are revolutionizing the cybersecurity landscape with our cloud-delivered security services, and our cloud infrastructure is rapidly expanding globally. We're seeking experienced SREs and software engineers interested in production engineering to help us scale the world's largest enterprise security cloud infrastructure. Palo Alto Networks has transformed...


  • Santa Barbara, California, United States Invoca Full time

    About Invoca: Invoca is a leading provider of AI-powered Conversation Intelligence solutions. With a strong focus on innovation and customer satisfaction, we're shaping the future of customer engagement. Our team is passionate about delivering exceptional results and making a lasting impact. About the Role: We're seeking a highly skilled Site Reliability...


  • Santa Clara, California, United States Blue River Technology Full time

    Job Description: We're Blue River Technology, a team of innovators driven to create intelligent machinery that solves monumental problems for our customers. We empower our customers (farmers, construction crews, and foresters) to implement safer and more sustainable solutions, driving increased profitability with less reliance on scarce labor. We believe...


  • Santa Clara, California, United States XPENG Motors Full time

    Job Title: Senior Staff AI Infrastructure SRE. Xpeng Motors is a leading smart electric vehicle company that designs, develops, and manufactures cutting-edge EVs with advanced Internet, AI, and autonomous driving technologies. We are committed to in-house R&D and intelligent manufacturing to create a better mobility experience for our customers. About the...


  • Santa Clara, California, United States NVIDIA Full time

    At NVIDIA, we're seeking a highly skilled Senior Cloud Reliability Engineer to join our team. As a key member of our Site Reliability Engineering (SRE) team, you'll be responsible for designing, building, and maintaining large-scale production systems with high efficiency and availability. This is a highly specialized discipline that demands knowledge across...


  • Santa Clara, California, United States Nvidia Full time

    Senior Reliability Engineer: NVIDIA is seeking a highly skilled Senior Reliability Engineer to join our team. As a key member of our engineering team, you will be responsible for planning and implementing the qualifications of new NVIDIA products, including IC chips in AI, Mobile, Automotive, Deep Learning, Graphic Processor, and System on Chip sectors. Key...


  • Santa Clara, California, United States Anello Photonics Full time

    About Anello Photonics: Anello Photonics is a leading-edge technology company based in Santa Clara, CA. The company has developed integrated photonic system-on-chip technology for next-generation navigation. ANELLO's SIPHOG™ gyroscope is based on its patented photonic integrated circuit technology. The result is a product that is higher performance, much...


  • Santa Clara, California, United States Paradigm Information Services, Inc. Full time

    Job Summary: We are seeking a highly skilled Site Manager to oversee our client's operations in Santa Clara, CA. As the Field Service Manager, you will be responsible for managing a team of engineers responsible for troubleshooting, maintaining, and adjusting client equipment. Your role will involve ensuring high customer satisfaction, adhering to company...


  • Santa Clara, California, United States NVIDIA Full time

    Job Title: Senior Cloud Reliability Engineer. We are seeking a highly motivated Senior Cloud Reliability Engineer to join our Embedded organization. This team is responsible for automating, deploying, and maintaining infrastructure for various NVIDIA AI workflows and applications such as Metropolis, ACE, and Riva hosted in the cloud. The Senior Cloud Reliability...


  • Santa Clara, California, United States Nino Press Inc Full time

    Job Title: Reliable C Class Driver Wanted. Job Description: We are a growing printing company in need of a reliable and qualified C Class driver with a clean DMV record. The ideal candidate must have the ability to work well in any team setting, be fast-paced, take initiative, be flexible, and dependable. The driver will be responsible for delivering products to...


  • Santa Clara, California, United States Amazon Full time

    About the Role: We are seeking a highly skilled Software Development Engineer to join our Usability & Interfaces team at AWS HealthOmics. As a key member of our team, you will be responsible for developing capabilities that champion how customers interact with our service, including API development, workflow and performance optimizations, and interfacing with...


  • Santa Clara, California, United States Nvidia Full time

    Product Development Engineer: NVIDIA is seeking a highly skilled Product Development Engineer to join our Operations Engineering team. As a key member of this team, you will play a pivotal role in ensuring the successful release of NVIDIA's board and system products to high volume manufacturing. Key Responsibilities: Represent Ops Engineering during architecture...


  • Santa Clara, California, United States AVE by Korman Communities Full time

    Business Development Manager: We are seeking a highly motivated and experienced Business Development Manager to join our team at AVE by Korman Communities. As a Business Development Manager, you will be responsible for creating demand and driving sales results for our fully furnished apartments. This role is ideal for senior-level sales professionals who are...