Site Reliability Developer Join OCI-Ns2

4 weeks ago


Austin, United States Oracle Full time

Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the critically important stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical services and technology areas and guide Development Teams to engineer and add world-class capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate partner concern point for sophisticated or critical issues that have not yet been detailed as Standard Operating Procedures (SOPs). Apply a deep understanding of service topology and their dependencies required to solve issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Professional curiosity and a desire to develop understanding of services and technologies.

As part of the broader Engineering organization, you will be a key part of Oracle National Security Regions change management processes, work with Service Teams on introducing new features into secured regions, and troubleshoot production issues when required. This role is critical to the success of our system engineering, the virtual networking platforms, and the operational teams.

This role will support Oracles Government customers.

Job Responsibilities

Perform maintenance to services running within secured regions through the use of command line tools, change processes, and runbooks.

Deploy code and implement other changes.

Ensure timely resolution and documentation of incidents through bridge calls following guidance for secure communications and company-standard reporting methods.

Monitor the region for faults, alarms, and other errors.

Inform internal teams as required through process and procedure. Help identify and discuss process improvements and tools optimization.

Act as a point of escalation for incidents and other issues arising within the region.

Troubleshoot operational issues on behalf of service teams.

Required Qualifications

Must posses and maintain an active TS/SCI w/Polygraph Government Security clearance

Degree in computer related field, or 1-3 year experience equivalent.

Experience with Linux and infrastructure support and server administration.

Experience with managing secured services across distributed systems and geographies.

Customer focus, passion for delighting customers.

Demonstrable ability to quickly learn new technical domains and then train others.

Great verbal and written communication skills.

Solid understanding of cloud concepts, platforms, and distributed systems.

Preferred Qualifications

Experience with cloud networking and software driven networks.

Experience with coding or scripting using a major scripting language.

Experience deploying code within change management procedures.

Experience participating in or running incident bridges.

Experience in cloud technical support, operations, NOC or similar.

Experience working with government customers.



  • Austin, United States Oracle Full time

    Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the critically important stack, with focus on...


  • Austin, United States Oracle Full time

    Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the critically important stack, with focus on...


  • Austin, United States Oracle Full time

    Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate...


  • Austin, United States Tech Prastish Software Solutions Full time

    The Oracle Cloud Infrastructure (OCI) team at Oracle can provide you the opportunity to build and operate a suite of massive scale, integrated cloud services in a broadly distributed, multi-tenant cloud environment. OCI is committed to providing the best in cloud products that meet the needs of our customers who are tackling some of the world’s biggest...


  • Austin, United States Tech Prastish Software Solutions Full time

    The Oracle Cloud Infrastructure (OCI) team at Oracle can provide you the opportunity to build and operate a suite of massive scale, integrated cloud services in a broadly distributed, multi-tenant cloud environment. OCI is committed to providing the best in cloud products that meet the needs of our customers who are tackling some of the world’s biggest...


  • Austin, United States Procore Technologies Full time

    Job Description What if you could use your technology skills to develop a product that impacts the way communities’ hospitals, homes, sports stadiums, and schools across the world are built? Construction impacts the lives of nearly everyone in the world yet it’s also one of the world’s least digitized industries. That’s why we’re looking for an...


  • Austin, United States Thales Full time

    Location: Austin, United States of America Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become...


  • Austin, United States Thales Full time

    Location: Austin, United States of America Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become...


  • Austin, United States Thales Full time

    Location: Austin, United States of America Thales people architect identity management and data protection solutions at the heart of digital security. Business and governments rely on us to bring trust to the billons of digital interactions they have with people. Our technologies and services help banks exchange funds, people cross borders, energy become...


  • Austin, United States Texas Reliability Entity Full time

    Experienced Energy Reliability Engineer/Analyst Texas Reliability Entity, Inc. (Texas RE) is hiring!The Texas power grid is changing rapidly as economics, technology, and customer demands push the power industry to new limits. At the same time, what used to be low-probability events, such as extreme weather and cybersecurity breaches, are now occurring at a...


  • Austin, Texas, United States Iodine Software Full time

    Director - Site Reliability Engineering Join us. Let's make a direct impact in healthcare. Being an Iodine employee means becoming part of something bigger: using clinical AI echnology to drive smarter healthcare processes and positively impact patient care. Who we are: Iodine is an enterprise AI company that is championing a radical rethink of how to create...


  • Austin, United States Iodine Software Full time

    Director - Site Reliability Engineering Join us. Let's make a direct impact in healthcare. Being an Iodine employee means becoming part of something bigger: using clinical AI echnology to drive smarter healthcare processes and positively impact patient care. Who we are: Iodine is an enterprise AI company that is championing a radical rethink of how to...


  • Austin, Texas, United States Iodine Software Full time

    Director - Site Reliability Engineering Join us. Let's make a direct impact in healthcare. Being an Iodine employee means becoming part of something bigger: using clinical AI echnology to drive smarter healthcare processes and positively impact patient care. Who we are: Iodine is an enterprise AI company that is championing a radical rethink of how to create...


  • Austin, United States SureCo Inc Full time

    Job Type Full-time Description Job Title: Site Reliability Engineer (SRE) Location: Remote (comfortable working in the Pacific Time Zone) SureCo is changing how people in the US take care of their health - in 2020, new regulations went into effect, allowing employers to offer more choice at lower cost for employee health benefits, and SureCo is at the...


  • Austin, United States Oracle Full time

    Unlock the potential of advanced AI technologies with our renowned Speech & Language Organization. As a Principal Member of Technical Staff, you'll lead the charge in developing cutting-edge solutions and groundbreaking innovations that redefine human-machine interaction. We are looking for hands-on engineers with expertise and passion for solving...


  • Austin, United States Oracle Full time

    Unlock the potential of advanced AI technologies with our renowned Speech & Language Organization. As a Principal Member of Technical Staff, you'll lead the charge in developing cutting-edge solutions and groundbreaking innovations that redefine human-machine interaction. We are looking for hands-on engineers with expertise and passion for solving...


  • Austin, United States OBSERVE, LLC Full time

    About Us Observe.AI is the fastest way to boost contact center performance with live conversation intelligence. Built on the most accurate AI engine in the industry, Observe.AI uncovers insights from 100% of customer interactions and maximizes frontline team performance through coaching and end-to-end workflow automation. With Observe.AI , companies can act...


  • Austin, United States Virtu Financial Full time

    Virtu is a leading financial firm that leverages cutting edge technology to deliver liquidity to the global markets and innovative, transparent trading solutions to our clients. As a market maker, Virtu provides deep liquidity that helps to create more efficient markets around the world. Our market structure expertise, broad diversification, and execution...


  • Austin, United States Hispanic Technology Executive Council Full time

    Senior Engineer Site Reliability Dell Technologies customers rely on our products and services to drive progress. So, we take the service we provide extremely seriously. Service Delivery is all about making sure our technical solutions help clients fulfil their priorities, challenges and initiatives. As trusted advisors, we build in-depth knowledge of what...


  • Austin, United States Hispanic Technology Executive Council Full time

    Senior Engineer Site Reliability Dell Technologies customers rely on our products and services to drive progress. So, we take the service we provide extremely seriously. Service Delivery is all about making sure our technical solutions help clients fulfil their priorities, challenges and initiatives. As trusted advisors, we build in-depth knowledge of what...