Site Reliability Engineer, Virtualization Expert

2 months ago


Sunnyvale, California, United States Bayside Solutions Full time

Job Summary:

We are seeking a highly skilled Site Reliability Engineer with expertise in virtualization and Linux compute platforms to join our team at Bayside Solutions, Inc.

Key Responsibilities:

  • Deploy services to thousands of servers across multiple data centers simultaneously, ensuring high availability and scalability.
  • Utilize Infrastructure as a Service (IaaS) orchestration tools, including OpenStack and CloudStack, to manage and automate infrastructure provisioning.
  • Develop and maintain robust Linux systems administration skills, with a strong understanding of system virtualization, including Libvirt, QEMU, KVM, and relevant APIs and programming languages.
  • Design and implement load balancing, high availability, and failover solutions to ensure business continuity.
  • Collaborate with cross-functional teams to identify and resolve complex technical issues, leveraging strong problem-solving skills and a deep understanding of OSI model network layers.
  • Manage, scale, and troubleshoot Java applications, with a focus on performance optimization and reliability.
  • Develop and maintain expertise in Kickstart and Bootstrap, with a strong understanding of automation and tool-building principles.
  • Design and implement secure authentication schemes, certificates, and secrets management solutions to protect sensitive data.
  • Develop advanced telemetry and observability solutions to monitor and analyze services at various levels, including API, runtime, infrastructure, and log analysis.
  • Collaborate with the development team to design and implement automation solutions using Golang, with a focus on tool-building and infrastructure automation.

Requirements and Qualifications:

  • 5+ years of experience in Site Reliability Engineering, with a strong focus on virtualization and Linux compute platforms.
  • Expert-level knowledge of IaaS orchestration tools, including OpenStack and CloudStack.
  • Strong understanding of Linux systems administration, including system virtualization, networking, and security.
  • Experience with load balancing, high availability, and failover solutions.
  • Strong problem-solving skills and a deep understanding of OSI model network layers.
  • Experience with Java application management, scaling, and troubleshooting.
  • Expert-level knowledge of Kickstart and Bootstrap, with a strong understanding of automation and tool-building principles.
  • Strong understanding of secure authentication schemes, certificates, and secrets management.
  • Experience with advanced telemetry and observability solutions.
  • Collaborative mindset and excellent communication skills.

Desired Skills and Experience:

SRE, virtualization, Linux, IaaS, OpenStack, CloudStack, Libvirt, QEMU, KVM, Java, Golang, authentication, certificates, secure secret, telemetry, log analysis, automation, load balancer, high availability, Kickstart, Bootstrap



  • Sunnyvale, California, United States Bayside Solutions Full time

    Job Title: Site Reliability Engineer, VirtualizationJob Summary:We are seeking an experienced Site Reliability Engineer with a strong background in virtualization and Linux systems administration to join our team at Bayside Solutions, Inc. The ideal candidate will have a deep understanding of Infrastructure as a Service (IaaS) orchestration tools, including...


  • Sunnyvale, California, United States Apple Full time

    Site Reliability Engineer - Infrastructure ExpertAt Apple, we're looking for a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in maintaining and enhancing the reliability of our production systems.You will collaborate with engineering teams to design, implement, and monitor infrastructure and...


  • Sunnyvale, California, United States Bayside Solutions Full time

    Job SummaryWe are seeking an experienced Site Reliability Engineer to join our team at Bayside Solutions, Inc. The ideal candidate will have a strong background in virtualization and Linux systems administration, with expertise in Infrastructure as a Service (IaaS) orchestration tools, including OpenStack and CloudStack.Key ResponsibilitiesDeploy services to...


  • Sunnyvale, California, United States Apple Full time

    Job DescriptionApple is seeking a highly skilled Site Reliability Engineer to join our dynamic team. As a key member of our engineering team, you will be responsible for designing, implementing, and maintaining large-scale engineering systems to ensure the reliability and scalability of our products and services.Key Responsibilities:Design and implement...


  • Sunnyvale, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our Cloud Network Infrastructure team at Apple. As a Site Reliability Engineer, you will play a critical role in ensuring the high availability, scalability, and resilience of our cloud services.Key ResponsibilitiesDesign and implement large-scale distributed, fault-tolerant,...

  • Virtualization Expert

    2 months ago


    Sunnyvale, California, United States Bayside Solutions Full time

    Job SummaryWe are seeking a highly skilled Site Reliability Engineer with expertise in virtualization and Linux compute platforms to join our team at Bayside Solutions, Inc.Key ResponsibilitiesDeploy services to thousands of servers across multiple data centers simultaneouslyUtilize Infrastructure as a Service (IaaS) orchestration tools, including OpenStack...


  • Sunnyvale, California, United States Saxon Global Full time

    Job Title: Site Reliability EngineerAs a Site Reliability Engineer at Saxon Global, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based e-commerce and retail platforms.Key Responsibilities:Design, develop, and maintain tools to improve the reliability, latency, and scalability of our e-commerce and...


  • Sunnyvale, California, United States Diverse Lynx Full time

    Job DescriptionJob Title: Site Reliability EngineerJob Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud-based infrastructure.Key Responsibilities:Design and implement automated monitoring and...


  • Sunnyvale, California, United States Apple Full time

    Job DescriptionAt Apple, we're looking for a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in maintaining and enhancing the reliability of our production systems.Key ResponsibilitiesDesign, develop, and maintain scalable, reliable, and efficient infrastructure.Implement monitoring, alerting,...


  • Sunnyvale, California, United States Apple Full time

    About the RoleAt Apple, we're looking for a skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in maintaining and enhancing the reliability of our production systems.Key ResponsibilitiesDesign, develop, and maintain scalable, reliable, and efficient infrastructure.Implement monitoring, alerting,...


  • Sunnyvale, California, United States Apple Full time

    Job Title: Senior Site Reliability EngineerAt Apple, we're revolutionizing the way people interact with technology. As a Senior Site Reliability Engineer, you'll play a critical role in maintaining and enhancing the reliability of our production systems.Key Responsibilities:Design, develop, and maintain scalable, reliable, and efficient...


  • Sunnyvale, California, United States Apple Full time

    Job Title: Senior Site Reliability EngineerAt Apple, we're revolutionizing industries with innovative technology and customer experiences. As a Senior Site Reliability Engineer, you'll play a critical role in maintaining and enhancing the reliability of our production systems.Key Responsibilities:Design, develop, and maintain scalable, reliable, and...


  • Sunnyvale, California, United States Saxon Global Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Saxon Global. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our e-commerce and retail platforms.Key ResponsibilitiesDesign, develop, and maintain tools to improve the reliability, latency,...


  • Sunnyvale, California, United States Apple Full time

    Job DescriptionAt Apple, we're revolutionizing the way people interact with technology. As a Senior Site Reliability Engineer, you'll play a critical role in maintaining and enhancing the reliability of our production systems.Key ResponsibilitiesDesign, develop, and maintain scalable, reliable, and efficient infrastructure.Implement monitoring, alerting, and...


  • Sunnyvale, California, United States Apple Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our Manufacturing Systems & Infrastructure (MSI) team at Apple. As a key member of our team, you will play a critical role in maintaining and enhancing the reliability of our production systems.Key ResponsibilitiesDesign, develop, and maintain scalable, reliable, and...


  • Sunnyvale, California, United States Apple Full time

    Job SummaryApple is seeking a highly skilled Senior Site Reliability Engineer to join its Manufacturing Systems & Infrastructure (MSI) team. As a key member of this team, you will play a critical role in maintaining and enhancing the reliability of our production systems. Key ResponsibilitiesDesign, develop, and maintain scalable, reliable, and efficient...


  • Sunnyvale, California, United States Saxon Global Full time

    Job SummaryAs a Site Reliability Engineer at Saxon Global, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based e-commerce and retail platform. You will work closely with our software engineering teams to design, build, and maintain tools that improve the overall system health and availability.Key...


  • Sunnyvale, California, United States Apple Full time

    The Apple Health team is seeking a highly skilled Site Reliability Engineer to join our dynamic group. As a member of our team, you will have the rare and rewarding opportunity to craft upcoming products that will delight and inspire millions of Apple's customers every day.You will enjoy using technology to automate solutions and optimize outcomes focusing...


  • Sunnyvale, California, United States Diverse Lynx Full time

    Job DescriptionAs a Site Reliability Engineer at Diverse Lynx LLC, you will play a critical role in ensuring the reliability and scalability of our infrastructure. We are seeking an experienced professional with a strong background in infrastructure management, automation, and cloud computing.Key ResponsibilitiesDesign and implement automation scripts to...


  • Sunnyvale, California, United States JobRialto Full time

    Job Summary:We are seeking a Senior Site Reliability Engineer to join our team on a 12-month contract. The ideal candidate will combine software and systems engineering expertise to build, run, and optimize large-scale, fault-tolerant systems. This role focuses on automation, reliability, scalability, and performance, ensuring that applications have the...