Current jobs related to Principal Site Reliability Developer - Seattle - Oracle


  • Seattle, Washington, United States Oracle Full time

    About the Role: Oracle is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure. Key Responsibilities: Design, develop, and deploy software to improve the availability, scalability, and efficiency of...


  • Seattle, Washington, United States Oracle Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Oracle. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure. You will work closely with our development teams to design, implement, and operate large-scale distributed...


  • Seattle, Washington, United States HireIO Inc Full time

    Job Title: Site Reliability Engineer. HireIO Inc is seeking a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and performance of our distributed systems. Key Responsibilities: Design and implement scalable and reliable systems; Collaborate with cross-functional...


  • Seattle, Washington, United States Sogeti Full time

    Site Reliability Engineer **Job Summary** We are seeking an experienced Site Reliability Engineer to join our team. As a key member of our operations team, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure. **Key Responsibilities** * Design, implement, and maintain scalable and reliable cloud...


  • Seattle, Washington, United States HireIO Inc Full time

    Job Summary: At HireIO Inc, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the availability, scalability, and reliability of our Ads systems. This includes designing, analyzing, and troubleshooting large-scale distributed systems, as well as developing tools and...


  • Seattle, Washington, United States Sogeti Full time

    Job Title: Site Reliability Engineer. About the Role: We are seeking an experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure. Key Responsibilities: Design and implement scalable and reliable cloud infrastructure using Azure or...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability Engineer. At Apple, we're looking for a skilled Site Reliability Engineer to join our Object Storage SRE team. As a Site Reliability Engineer, you'll play a critical role in ensuring the reliability, scalability, and performance of our cloud storage systems. About the Role: We're seeking a seasoned software and systems engineer with a...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability Engineer. At Apple, we're looking for a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure. About the Role: We are seeking a talented and motivated individual to join our dynamic...


  • Seattle, Washington, United States Nerdshub E Pvt Ltd Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team at Nerdshub E Pvt Ltd. As a Site Reliability Engineer, you will be responsible for ensuring the health and stability of our production systems, developing monitoring dashboards, and configuring alerts to automate system recovery. Key...


  • Seattle, Washington, United States Capgemini Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our software systems and infrastructure. Key Responsibilities: Develop, maintain, and configure cloud observability systems (e.g.,...


  • Seattle, Washington, United States Sogeti Full time

    Site Reliability Engineer. We are seeking an experienced Site Reliability Engineer to join our team at Sogeti. As a key member of our operations team, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure. Key Responsibilities: Design and implement scalable and reliable cloud infrastructure using Azure or...


  • Seattle, Washington, United States Tik Tok Full time

    About TikTok U.S. Data Security: TikTok U.S. Data Security is a subsidiary of TikTok in the U.S., dedicated to protecting user data and ensuring the security of our platform. Responsibilities: We are seeking a highly motivated and experienced Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the...


  • Seattle, Washington, United States Diverse Lynx Full time

    Job Title: Sr. Site Reliability Engineer. Location: Remote. Duration: 12+ months contract. Job Description: We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the availability, reliability, and performance of our applications and services. You will work...


  • Seattle, Washington, United States Hireio, Inc. Full time

    Job Overview: Hireio, Inc. is seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our Ads systems team, you will be responsible for ensuring the reliability, scalability, and operability of our services. Key Responsibilities: Design and implement scalable and reliable systems architecture; Collaborate with cross-functional teams...


  • Seattle, Washington, United States Oracle Full time

    Job Summary: Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems....


  • Seattle, Washington, United States Tik Tok Full time

    About the Role: We are seeking an experienced Site Reliability Engineer to join our USDS Video Platform team at TikTok. As a key member of our team, you will be responsible for ensuring the reliability and scalability of our video system, which serves billions of users worldwide. Responsibilities: Design and implement scalable and reliable systems to support our...


  • Seattle, Washington, United States HireIO Inc Full time

    Job Description: HireIO Inc is seeking a highly skilled Site Reliability Engineer to join our team. The ideal candidate will have a strong background in software development and a passion for ensuring the reliability and scalability of our systems. Key Responsibilities: Design and implement scalable and reliable systems architecture; Collaborate with...


  • Seattle, Washington, United States Apple Full time

    Job Title: Site Reliability Engineer. We are seeking a highly skilled and motivated Site Reliability Engineer to join our dynamic and growing team at Apple. About the Role: As a Site Reliability Engineer, you will play a critical role in ensuring the security, reliability, and scalability of our systems and infrastructure. Key Responsibilities: Design, implement,...


  • Seattle, Washington, United States Sogeti Full time

    Job Title: Lead Site Reliability Engineer Job Summary: We are seeking a highly skilled Lead Site Reliability Engineer to join our team at Sogeti. The successful candidate will be responsible for developing and maintaining cloud observability systems, building monitoring and alerting systems, and optimizing system performance. Key Responsibilities: *...


  • Seattle, Washington, United States Apple Full time

    Site Reliability Engineering Manager. At Apple, we're looking for a skilled Site Reliability Engineering Manager to join our team. As a Site Reliability Engineering Manager, you will be responsible for leading a team that provides the platform for mission-critical cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and...

Principal Site Reliability Developer

2 months ago


Seattle, United States Oracle Full time

We are facing several engineering challenges in critical foundational data-plane services that power the next-gen OCI cloud. This is your opportunity to build innovative solutions from the ground up. These are exciting times, and our team is still young and growing fast, working on ambitious new initiatives such as providing a canonical implementation of core components for data planes through a data-plane runtime framework, developing a remote persistent storage solution with latency and performance comparable to that of a local NVMe drive, and developing standards and tooling to identify critical performance improvements across OCI data planes.

We are looking for a passionate, self-motivated Site Reliability DevOps Engineer who will be responsible for defining and deploying key services with a deep focus on architecture, production operations, capacity planning, performance management, deployment, and release engineering. You will work with multiple cross-functional teams, helping deliver new and outstanding experiences while ensuring reliability and performance. You should value simplicity and scale, work comfortably in a collaborative, agile environment, and be excited to learn.

Responsibilities:

  • With your superb technical, research, and analytical capabilities and demonstrated ability to get the right things done quickly and effectively, you will react to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring for production systems.
  • You will be a strong contributor to the support and development of platform services, including architecture, provisioning, configuration, deployment, and support.
  • You will partner with the distributed team in prototyping new platform services.
  • You will solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence.
  • You will design, write, and deploy software to improve the availability, scalability, and efficiency of the data-plane platform.
  • You will facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.

Career Level - IC4

Preferred Skills and Experience:

  • 7+ years of experience with software engineering practices and IT operations tasks
  • Demonstrated clear understanding of end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services
  • Demonstrated clear understanding of automation and orchestration principles
  • Extensive experience with Linux system administration
  • Experience in container administration and development using Kubernetes, Docker, Mesos, or similar
  • Experience in infrastructure automation through Terraform, Chef, Ansible, Puppet, Packer, or similar
  • Experience with CI/CD pipelines, including version control systems (Git, SVN, etc.), GitLab Runners, and Jenkins
  • Experience in developing scripts to automate software deployments and installations using Python or Bash
  • Experience working with or supporting production, test, and development environments at medium to large scale
  • Knowledge of cloud compute technologies and networking
  • Experience working with fault-tolerant, highly available, high-throughput, distributed, scalable systems
  • Experience operating services in one of the major clouds, such as AWS, OCI, or Azure
  • Experience with C++ build systems is a huge plus