Principal Site Reliability Engineer

2 weeks ago


Redwood City, United States Oracle Full time

The Background

Oracle’s Fusion Applications group is designing and building the next-gen deployment platform for its suite of software products. We focus on transforming how Software Developers and DevOps engineers build cloud applications for enterprise customers. Our team is building new services to improve developer productivity and automate the process of running cloud services.

You are the builder here. You will be part of a team of intelligent, motivated, and diverse people and given the autonomy and support to do your best work using modern backend technologies, including Kubernetes, Terraform, and more.

We value self-initiated systems engineers who have a passion to learn, build, automate, and roll out infrastructure services globally. Our core values are our foundation and how we deliver excellence. We strive for equity, inclusion, and respect for all. We are committed to the greater good in our products and our actions. We are constantly learning and taking opportunities to grow our careers and ourselves. We challenge each other to stretch beyond our past to build our future.

Our team is fully remote. We currently have members spread across the US, India, and Europe. We practice Scrum and rely heavily on Slack and Zoom for day-to-day communication.

The Role

As a Principal Site Reliability Engineer, you will meet with our customers, both internal and external, to understand their use cases and design automated solutions to reduce the amount of manual work required to operate their services. You will lead the design and implementation of automation, mentor your teammates, and communicate with stakeholders.

A successful candidate will bring a strong focus on our customers, a passion for innovative products, and hands-on experience as a software engineer applying the latest AI technologies.

Ideal Qualifications:

  • Bachelor’s or Master’s degree in Computer Science or equivalent related field experience
  • Strong software engineering and automation skills
  • Hands-on experience with automation technologies such as Ansible and Terraform, and with programming languages such as Golang, Python, Bash, and Java

Career Level - IC4

As a lead Site Reliability Engineer on the team, you will:

  • Use technologies such as Kubernetes, Helm, and Terraform to build and operate highly available, high-performance distributed systems
  • Automate tasks to enable continuous delivery and ensure continuous availability with minimal human overhead
  • Define and drive change management, continuous integration, and deployment best practices
  • Be a technical leader, establish culture, and mentor other engineers
  • Design, implement, and launch innovative solutions that raise the bar in terms of operations

  • Jersey City, New Jersey, United States Fidelity Investments Full time

    Job Title: Principal Site Reliability Engineer. The Role: As a member of the TechOps SRE team at Fidelity Investments, you will work closely with our engineering partners to enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes environments are best-in-class and central to our enterprise-grade infrastructure...


  • Redwood City, United States 1872 Consulting Full time

    Site Reliability Engineer - 100% Remote. Role Summary: Site Reliability Engineers (SREs) are responsible for working with different developer teams to keep our systems running smoothly. They are a blend of pragmatic operators and software craftspeople that apply excellent problem-solving and communication skills to develop or configure tools that will automate,...


  • Jersey City, United States Fidelity Investments Full time

    Job Description: The Role: As a member of the TechOps SRE team, you'll work closely with our engineering partners to help enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes (AWS EKS) environments are best-in-class and central to our enterprise-grade infrastructure strategy. These growing environments...


  • Jersey City, New Jersey, United States Fidelity TalentSource LLC Full time

    Job Description: The Role: As a member of the TechOps SRE team, you will work closely with our engineering partners to help enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes (AWS EKS) environments are best-in-class and central to our enterprise-grade infrastructure strategy. These growing environments...


  • Foster City, California, United States Omega Solutions Inc Full time

    Job Title: Site Reliability Engineer. At Omega Solutions Inc, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our critical platforms and applications. Key Responsibilities: * 8+ years of experience in Site Reliability...


  • Jersey City, New Jersey, United States The Goldman Sachs Group, Inc Full time

    Job Title: Site Reliability Engineer. About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services. Your...


  • Foster City, California, United States Bayone Full time

    Job Summary: At Bayone, we are seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for ensuring the uptime and performance of our large production service. Key Responsibilities: Host OS upgrades; Docker image upgrades; SSL certificate upgrades. Requirements: Bachelor's degree in...


  • Oklahoma City, United States Paycom Payroll Llc Full time

    Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites. Do not wait to apply after reading...


  • Oklahoma City, United States Paycom Payroll Llc Full time

    Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites. Responsibilities: Develop software to...


  • Jersey City, NJ, United States Fidelity Investments Full time

    As a member of the TechOps SRE team, you'll work closely with our engineering partners to help enable and drive initiatives from design to implementation. This is a phenomenal opportunity to have a direct impact on the emerging strategies of our infrastructure and deployments, while at the same time, helping enable the expansion of our business. ...


  • Jersey City, New Jersey, United States The Goldman Sachs Group, Inc Full time

    About the Role: We are seeking a talented Site Reliability Engineer to join our SRE Platforms team at Goldman Sachs. As a Site Reliability Engineer, you will be responsible for designing, developing, and operating distributed systems that provide observability for our mission-critical applications and platform services. Our team is responsible for designing and...


  • Jersey City, New Jersey, United States Syntricate Technologies Full time

    We are seeking a highly skilled AWS Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure, particularly our AWS environment. The ideal candidate will have strong experience with AWS, with a focus on SRE principles...


  • Kansas City, Missouri, United States Datum Technologies Group Full time

    Job Summary: At Datum Technologies Group, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining efficient technology platforms that meet both internal and external customer needs while effectively managing associated risks. Key...


  • Oklahoma City, OK, United States Paycom Payroll Llc Full time

    Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites. Responsibilities: Develop software to...


  • Jersey City, NJ, Hudson County, NJ; New Jersey, United States Fidelity Investments Full time

    As a member of the TechOps SRE team, you'll work closely with our engineering partners to help enable and drive initiatives from design to implementation. This is a phenomenal opportunity to have a direct impact on the emerging strategies of our infrastructure and deployments, while at the same time, helping enable the expansion of our business. ...


  • Kansas City, Missouri, United States Infinite Computer Solutions Full time

    Job Title: Site Reliability Engineer Position. Job Description: We are seeking a skilled Site Reliability Engineer to join our team at Infinite Computer Solutions. Key Responsibilities: * Strong experience with Ansible, Gitlab, deployment, packages, Linux, Unix, Splunk, and Dynatrace is required. * A minimum of 8 to 10 years of experience in a similar role is...


  • Redwood City, California, United States Oracle Full time

    About the Role: We are seeking a highly skilled Senior Principal Engineer to join our Data Services organization. As a key member of our team, you will be responsible for driving the architecture, design, and development of key features of our cloud-native big data services. Key Responsibilities: Drive the architecture, design, and development of key features of...