Current jobs related to Lead Cloud Reliability Engineer - Montgomery, Alabama - Oracle


  • Montgomery, Alabama, United States Leidos Full time

    Job Summary: Leidos is seeking a skilled Cloud Infrastructure Engineer to join our team in support of the U.S. Air Force Cloud One Architecture and Common Shared Services contract. As a Cloud Infrastructure Engineer, you will be responsible for ensuring the reliability, performance, and scalability of cloud-based applications and infrastructure.Key...

  • Senior Cloud Engineer

    2 weeks ago


    Montgomery, Alabama, United States Oracle Full time

    Job DescriptionOracle is seeking a highly skilled Senior Principal Engineer to join our Cloud Engineering team. As a key member of our team, you will be responsible for designing, implementing, and operating cloud services that enable animation, film, and game development studios to migrate their entire production pipeline to the...

  • Cloud Engineer

    3 weeks ago


    Montgomery, Alabama, United States Leidos Full time

    Cloud One DevOps EngineerLeidos is seeking skilled professionals to fill Cloud One DevOps Engineer roles, with opportunities at various levels. This is an exciting chance to modernize a multi-cloud environment and support critical missions for the U.S. Air Force.Key Responsibilities:Implement DevOps capabilities and tools across multiple cloud...

  • Azure Cloud Engineer

    2 weeks ago


    Montgomery, Alabama, United States TEKsystems co Allegis Group Full time

    Job Title: Azure Cloud EngineerWe are seeking an experienced Azure Cloud Engineer to join our team at TEKsystems c/o Allegis Group. As a Cloud Engineer, you will be responsible for designing, implementing, and maintaining cloud-based systems and infrastructure for our clients.Key Responsibilities:Design and implement cloud-based systems and infrastructure...


  • Montgomery, Alabama, United States SAIC Full time

    SAIC is seeking a highly skilled Cloud Computing Engineer to join the Cloud One Digital Engineering Team.This team is responsible for the architecture, engineering, and sustainment of the AF Cloud Digital Engineering platform currently deployed on AWS Cloud.The ideal candidate will have experience with AWS, Networking, VPNs, IaaS, PaaS, SQL, Jenkins,...


  • Montgomery, Alabama, United States Oracle Full time

    About the RoleWe are seeking a highly skilled and motivated Senior Cloud Engineer to join our team at Oracle. As a Senior Cloud Engineer, you will be responsible for designing, developing, and maintaining large-scale, highly available, cloud-based distributed systems.Key ResponsibilitiesDesign and develop cloud-based distributed systems using modern...


  • Montgomery, Alabama, United States Prime Therapeutics Full time

    Job DescriptionWe are seeking a highly skilled Cloud Infrastructure Engineer to join our team at Prime Therapeutics. As a Cloud Infrastructure Engineer, you will be responsible for designing, implementing, and managing cloud-based infrastructure to support our business applications.Key Responsibilities:Design and implement cloud-based infrastructure using...


  • Montgomery, Alabama, United States Leidos Full time

    Job SummaryLeidos is seeking a skilled Cloud Infrastructure Engineer to join our team in support of the U.S. Air Force Cloud One Architecture and Common Shared Services contract. As a key member of our team, you will play a critical role in modernizing a leading, global-scale multi-cloud environment to ensure system resiliency, security, and cost...

  • Azure Cloud Engineer

    2 weeks ago


    Montgomery, Alabama, United States TEKsystems Full time

    Job Title: Azure Cloud EngineerWe are seeking an experienced Azure Cloud Engineer to join our team at TEKsystems. As a key member of our cloud infrastructure team, you will be responsible for designing, implementing, and maintaining scalable and secure cloud solutions on the Microsoft Azure platform.Key Responsibilities:Design and implement cloud...


  • Montgomery, Alabama, United States Oracle Full time

    Job DescriptionWe are a world-class team of high-calibre security tool services Site Reliability Engineers. Our team is inclusive and diverse, with a full spectrum of experience distributed globally. We have the resources of a large enterprise and the energy of a start-up, working on a critical greenfield software assurance project collaboratively with our...

  • GCP Cloud Engineer

    2 weeks ago


    Montgomery, Alabama, United States Ford Motor Company Full time

    About the RoleWe are seeking a highly skilled GCP Cloud Engineer to join our team at Ford Motor Company. As a key member of our Enterprise Technology Group, you will play a critical part in crafting the future of mobility.Key ResponsibilitiesDesign, build, and maintain GCP infrastructure to support our applications and services.Collaborate with other teams...


  • Montgomery, Alabama, United States Leidos Full time

    Job SummaryLeidos is seeking a highly skilled Cybersecurity Engineer to join our team in support of the U.S. Air Force Cloud One Architecture and Common Shared Services contract. As a Cybersecurity Engineer, you will be responsible for designing, deploying, configuring, operating, and maintaining authorizations and accreditation of the C1 Architecture for...


  • Montgomery, Alabama, United States SAIC Full time

    Job Title: Senior Cloud Computing EngineerSAIC is seeking a highly skilled Senior Cloud Computing Engineer to join our Cloud One Digital Engineering Team. As a key member of our team, you will be responsible for designing, implementing, and maintaining secure cloud infrastructure on AWS.Key Responsibilities:Analyze existing system architecture and recommend...


  • Montgomery, Alabama, United States SAIC Full time

    Job Summary:SAIC is seeking a highly skilled Senior Cloud Computing Engineer to join the Cloud One Digital Engineering Team. The ideal candidate will have experience in designing and implementing secure cloud architectures, developing scripts and workflows for automated deployment, and providing input to design documents and test procedures.Key...


  • Montgomery, Alabama, United States SAIC Full time

    Job Title: Junior AWS Cloud Computing EngineerSAIC is seeking a highly skilled Junior AWS Cloud Computing Engineer to join its Cloud One Digital Engineering DevSecOps Team. This team is responsible for the architecture, engineering, and sustainment of the AF Cloud Digital Engineering platform currently deployed on AWS Cloud.Job Responsibilities:Point of...


  • Montgomery, Alabama, United States Confluent Full time

    About the RoleWe are seeking a highly skilled Staff Software Engineer to lead the technical development of our Stream Governance product at Confluent. As a key member of our engineering team, you will be responsible for designing, architecting, and delivering a cloud-native, multi-tenant service for Stream Governance. This role requires a strong technical...


  • Montgomery, Alabama, United States SAIC Full time

    Job Summary:SAIC is seeking a highly skilled Senior Cloud Computing Engineer to join the Cloud One Digital Engineering Team. The ideal candidate will have a strong background in cloud architecture, engineering, and security, with experience working with AWS cloud services.Key Responsibilities:Analyze existing system architecture and recommend secure design...

  • Data Engineer

    1 week ago


    Montgomery, Alabama, United States Shee Atika Government Services Full time

    Job Title: Data Engineer - Cloud InfrastructureWe are seeking a highly skilled Data Engineer to join our team and contribute to the design and development of our cloud infrastructure. The successful candidate will be responsible for designing and maintaining data infrastructure, building data pipelines, and ensuring data quality and integrity.Key...


  • Montgomery, Alabama, United States SAIC Full time

    Job Title: Senior AWS Cloud EngineerSAIC is seeking a highly skilled Senior AWS Cloud Engineer to join our team. As a key member of our Cloud One Operations and IL 2-6 team, you will be responsible for designing, deploying, and managing cloud-based systems and applications.Key Responsibilities:Design and deploy services within AWS multi-account...


  • Montgomery, Alabama, United States Oracle Full time

    Job DescriptionThe Oracle Cloud Infrastructure (OCI) Security team is responsible for ensuring the security of our cloud infrastructure and services. We are seeking a highly skilled and experienced Cloud Security Engineer to join our team.ResponsibilitiesDesign and implement secure cloud infrastructure and servicesIdentify and mitigate security risks and...

Lead Cloud Reliability Engineer

2 months ago


Montgomery, Alabama, United States Oracle Full time

Position Overview

Our Team

In response to our expanding Cloud initiatives, Oracle has established a pioneering division - Health Data Intelligence Platform. This group is dedicated to product innovation and strategic development for Oracle Health, while creating a comprehensive platform that supports advanced, automated healthcare solutions. This is a fresh venture, cultivated with an entrepreneurial mindset that fosters a vibrant and creative atmosphere. Your expertise will be vital in shaping this exceptional engineering hub with a commitment to excellence.

The Health Data Intelligence Platform presents a unique chance to significantly influence how Oracle Health products transform the healthcare landscape by reshaping the intersection of healthcare and technology.

You will have the opportunity to:

  • Impact billions of lives through our innovative products and services.
  • Develop technology that genuinely makes a difference in the world.
  • Make an immediate contribution to the evolution of technology.
  • Experience limitless growth potential through inspiring work.
  • Collaborate with top professionals in the industry.
  • Thrive in an open, diverse, and productive workplace.

About The Role

This is a distinctive opportunity to join a rapidly evolving and exceptional team tasked with engineering groundbreaking Oracle Cloud technologies and infrastructures that constitute the Oracle Cloud solutions. As a member of the Site Reliability Engineering (SRE) team, you will face continuous challenges and contribute to the daily success of Oracle Cloud, collaborating closely with development partners.

As a Lead Cloud Reliability Engineer, you will tackle intriguing technical challenges by defining, designing, deploying, and optimizing key Oracle Cloud services, platforms, and infrastructures, with a constant focus on reliability, scalability, resilience, security, and performance.

The ideal candidate for this dynamic and visible technical leadership position will possess the skills of a developer, the acumen of a systems and infrastructure expert, and the determination of a proactive problem-solver. These attributes should be complemented by strong communication skills to ensure the success of our Oracle Cloud customers.

Key Responsibilities

  • Service Ownership - You will be an integral part of the SRE team, dedicated to the comprehensive ownership of a suite of services and/or technology domains, in collaboration with our Development partners.
  • Ownership Scope - As an SRE, you will gain a thorough understanding of the complete configuration, technical dependencies, and overall operational characteristics of the production services you oversee. In partnership with your Development colleagues, you will ensure that services are designed and delivered with a critical focus on security, resilience, scalability, and performance. SREs are the ultimate authority and are accountable for the comprehensive performance and operability of the services they manage.
  • Service Design - As Oracle Cloud evolves, you will collaborate with development teams to define and implement enhancements in service architecture, both current and future. As an SRE, you will articulate the technical characteristics of your services and their interdependencies, guiding Development teams to engineer and incorporate premier capabilities into the Oracle Cloud service portfolio.
  • Operations Engineering - You will be equipped to communicate the scale, capacity, security, and performance attributes of the services you manage. You will be a domain expert, capable of understanding and conveying every aspect of your service stack, including:
    • Performance degradation and behavior under load of the services and their dependencies.
    • End-to-end tuning requirements, optimizing resource utilization as load patterns fluctuate.
    • Instrumentation and metrics that accurately depict service behaviors.
    • Scaling requirements and patterns.
    • Resiliency and recoverability, ensuring that backup/restore and disaster recovery capabilities are implemented, tested, and maintained.
    • Security operations and vulnerability remediation, ensuring vulnerabilities are addressed while adhering to corporate and federal security standards.
  • Automation - You will possess a solid understanding of automation and orchestration principles, and will be eager to automate processes wherever possible, while simultaneously reducing technical debt. Automation should be a fundamental aspect of your approach.
  • Prevention - After resolving an issue, you will proactively work on strategies to expedite future resolutions, aiming to prevent recurrence.
  • Technical Expertise - As a service owner, you will be the primary point of contact for complex or critical issues that lack documented Standard Operating Procedures (SOPs) for Level 1 staff. You will typically be called upon during major incidents as a Subject Matter Expert (SME) when the source of a problem is unclear. You will possess a deep understanding of service topology and dependencies necessary to resolve issues and define mitigations.
  • Broad Interests - SREs are a unique blend of system administrators and development engineers, capable of understanding and explaining how product architecture decisions affect the operation of distributed systems. They are driven by professional curiosity and a desire to develop a profound understanding of their services and the technologies they rely upon.
  • Represent SRE - Proactive, self-motivated, customer-focused, organized, and effective communicators are essential. SREs are expected to represent Cloud products and engineering in critical discussions.

Qualifications

Our team operates within the Health Data Intelligence Platform, focusing on cloud operations. We are responsible for developing and maintaining cloud computing services and solutions that enhance our operational efficiency, security, and attention to detail. As a team member, you will collaborate with innovative minds in a supportive environment that prioritizes infrastructure and applications. We empower our team to make advancements that enhance efficiency and productivity in their daily operations, leading to superior external customer product availability and support experiences.

Required Knowledge:

  • Server hardware configuration.
  • Linux internals.
  • Networking and TCP/IP.
  • Standard Internet services, such as DNS, HTTP, etc.
  • Scripting languages, such as Python, Ruby, Bash, etc.
  • Configuration management tools, such as Chef, Ansible, etc.
  • Monitoring and Instrumentation.
  • DevOps toolchain.
  • Cloud computing patterns.
  • IT Security and compliance.
  • 5+ years of experience managing large-scale customer-facing web services.
  • Most importantly, the ability to be a collaborative colleague and a willingness to learn and implement new Cloud technologies as needed.
  • A methodical approach to solving complex problems.

Understanding of:

  • REST APIs.
  • Load balancing technologies, including L7 routing, DNS, and CDN.
  • Knowledge of programming languages, such as Ruby, C++, Java, JavaScript.

Experience with:

  • Databases and big data stores.
  • Container and Container Management technologies, such as Docker and Kubernetes.
  • Defining and documenting technical architecture of complex and highly scalable products.

Required Qualifications:

  • U.S. Citizenship and eligibility for a Federal Security Clearance.
  • 5+ years of relevant technical experience.
  • Effective communication skills and the ability to build rapport with team members.
  • BS or MS or equivalent experience in Computer Science or a related field.

Join Our Team

About Us

As a global leader in cloud solutions, Oracle leverages cutting-edge technology to address contemporary challenges. True innovation stems from diverse perspectives and a variety of skills and backgrounds.

When every voice is valued, we are inspired to exceed previous achievements. This is why we are dedicated to expanding our inclusive workforce that fosters diverse insights and perspectives.

Oracle careers provide access to global opportunities where work-life balance thrives. We offer a competitive suite of employee benefits grounded in principles of fairness and consistency. We prioritize our people with flexible medical, life insurance, and retirement options. We also encourage employees to contribute to their communities through our volunteer initiatives.

We are committed to including individuals with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, please inform us.

Disclaimer:

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, and protected veterans' status, or any other characteristic protected by law.