Site Reliability Developer 3

2 days ago


Reston, Virginia, United States Oracle Full time

At Oracle Cloud Infrastructure (OCI), we build the future of the cloud for Enterprises as a diverse team of fellow creators and inventors. We act with the speed and attitude of a start-up, with the scale and customer-focus of the leading enterprise software company in the world.

Values are OCI's foundation and how we deliver excellence. We strive for equity, inclusion, and respect for all. We are committed to the greater good in our products and our actions. We are constantly learning and taking opportunities to grow our careers and ourselves. We challenge each other to stretch beyond our past to build our future.

You are the builder here. You will be part of a team of really smart, motivated, and diverse people and given the autonomy and support to do your best work. It is a dynamic and flexible workplace where you'll belong and be encouraged.

Site Reliability Developer

Oracle Cloud Infrastructure (OCI) - OCI National Security Regions

Reston, VA/ Seattle, WA/ Austin, TX

OCI National Security Region Networking team is looking for a Senior Site Reliability Engineer. As a Site Reliability Engineer, you will solve interesting technical challenges by defining, designing, deploying, and troubleshooting key Network Automation services focusing on scalability, security, and performance. The role involves software engineering, systems engineering, automation, network operations, and DevOps. You should be comfortable at building complex distributed systems. You will incorporate the ethos of software engineering and apply it to large-scale operational problems. Your primary goals are to create highly reliable and services, platforms, and infrastructure, always thinking about reliability, security, and ultra-scalable software systems to manage operations. When not working on operations, you will be working on software engineering tasks such as design and development of systems that increase reliability, scalability, and reduce operational overhead through automation. You should value simplicity and scale, work comfortably in a collaborative, agile environment, and be excited to learn.

A great software engineer will make all the difference for delivering quality solutions to our customers. Are you passionate about designing, developing, testing and delivering cloud services?  Do you thrive in a fast-paced environment, and want to be an integral part of a truly great team?

Come join us

As a Senior Site Reliability Engineer, you will be responsible for:

  • System Design and Operation:
    • Design and manage distributed Unix-based systems, particularly Oracle Linux.
    • Implement auto-scaling and self-healing infrastructure to ensure uptime and durability.
    • Tune system internals, including kernel parameters, networking, and filesystems, for high performance.
    • Maintain timely OS patching and compliance posture across environments.
    • Integrate systems with enterprise identity services such as Active Directory, LDAP, and Kerberos.
  • Automation and Infrastructure as Code:
    • Develop and maintain infrastructure automation using Ansible and Terraform.
    • Automate deployment pipelines, service configurations, and patch management.
    • Develop scripts and services in Python and Bash to enhance infrastructure delivery workflows.
    • Extend APIs and platform automation to drive efficiency and repeatability.
  • Observability and Incident Response:
    • Develop observability stacks using tools like Prometheus, Grafana, and other open-source telemetry tools.
    • Create dashboards and SLO/SLI-based alerts for real-time monitoring of production systems.
    • Participate in a global 24/7 on-call rotation, leading responses for high-severity incidents.
    • Conduct post-incident analysis (RCA) and drive remediations that improve long-term reliability.
  • Collaboration and Standards:
    • Partner with development teams to embed reliability in deployment pipelines.
    • Help define system architecture standards and maintain robust platform documentation.
    • Mentor engineers in Unix performance, observability, and debugging practices.
    • Champion a culture of automation, resilience, and continuous improvement.


  • Reston, Virginia, United States Oracle Full time

    DescriptionAt Oracle Cloud Infrastructure (OCI), we build the future of the cloud for Enterprises as a diverse team of fellow creators and inventors. We act with the speed and attitude of a start-up, with the scale and customer-focus of the leading enterprise software company in the world.Values are OCI's foundation and how we deliver excellence. We strive...


  • Reston, Virginia, United States Knack Solutions Full time

    ***W2 only***Position: Site Reliability Engineer (SRE)Work Authorization: All Work AuthorizationsLocation: Reston, VAContract: 24 monthsDescription: Site Reliability Engineer (SRE) roles and responsibilitiesThe SRE role bridges the Development Engineer role and the Production Engineer role with a mixture of development, test, deploy, and support skills that...


  • Reston, Virginia, United States ECS Full time

    Job DescriptionECS is seeking aSenior Site Reliability Engineerto workremotely.ECS is seeking talented professionals to join our successful and growing team in building the next-generation Continuous Diagnostics and Mitigation (CDM) Cyber data solution. The CDM Program is the Cybersecurity and Infrastructure Security Agency's (CISA) dynamic approach to...


  • Reston, Virginia, United States Verisign Full time

    Verisign helps enable the security, stability, and resiliency of the internet. We are a trusted provider of internet infrastructure services for the networked world and deliver unmatched performance in domain name system (DNS) services.We are a mission focused, values driven company where each individual can contribute to building a stronger, more secure...


  • Reston, Virginia, United States Palo Alto Networks Full time $120,000 - $179,000

    Company Description Our MissionAt Palo Alto Networks everything starts and ends with our mission:Being the cybersecurity partner of choice, protecting our digital way of life.Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and...


  • Reston, Virginia, United States Muller, Inc. Full time

    Title: Senior Project Manager, Site DevelopmentType: Full time, exemptLocation: Reston, VirginiaReports To: DirectorMuller is a full-service Site Work Contractor based in Northern Virginia. Our unique technology and systems-driven approach has led us to become a leading provider of sustainable civil construction services throughout Maryland, Virginia, and...

  • Site Technical Leader

    2 weeks ago


    Reston, Virginia, United States IBA Proton Therapy Full time $92,400 - $121,000 per year

    MissionResponsible for leading the technical performance of the IBA Proton Therapy System (PTS), maximizing system availability for patient treatment and quality assurance tests, overseeing the technical development and training of the Customer Service Engineers, and planning the execution of preventative and corrective maintenance.The Site Technical leader...

  • Airport Site Lead

    2 weeks ago


    Reston, Virginia, United States Leidos Full time $72,150 - $130,425

    LEIDOS has a direct placement for Leidos Airport Site Lead to provide technical and project management support for commercial airport projectsThis position encompasses the full project lifecycle management as the site lead / project manager.  The position involves, site surveys/validations, risk identification and mitigation, coordination, communication,...


  • Reston, Virginia, United States ICF Full time $81,094 - $137,860

    Join ICF's IT Modernization Team — Where Innovation Meets ImpactLocation: *Candidates residing within a 50-mile radius of Washington, DC, will be required to report onsite daily to a federal agency office in the DC area.  Candidates who reside outside the 50-mile radius will be considered full-time remote and will not be required to report on site daily...

  • Front-End Developer

    1 week ago


    Reston, Virginia, United States ACR Technology Inc Full time

    Location: Reston, VA (Local candidates preferred or willing to relocate)Employment Type: Full-Time / PermanentWe're seeking a passionate Front-End Developer to join our software engineering team in Reston, VA. In this role, you'll design, build, and optimize high-performance, scalable, and user-friendly web applications that power next-generation digital...