Site Reliability Engineer

3 days ago


San Leandro, California, United States Omni Inclusive Full time
Job Title: Site Reliability Engineer

We are seeking a highly skilled Site Reliability Engineer to join our team at Omni Inclusive. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our digital platforms.

Key Responsibilities:
  • Design, implement, and maintain scalable and reliable systems
  • Collaborate with cross-functional teams to identify and resolve production issues
  • Develop and maintain monitoring and alerting systems to ensure platform health
  • Work with engineering teams to improve platform metrics and communicate results to stakeholders
  • Stay up-to-date with industry trends and emerging technologies to drive innovation and efficiency
Requirements:
  • 10+ years of experience in software engineering or equivalent
  • Strong background in Java development and SRE principles
  • Experience with cloud infrastructure, including AWS or Azure
  • Knowledge of APM tools, such as Splunk, GCL, ELK, and Grafana
  • Ability to work with engineering teams across the ecosystem to resolve infrastructure challenges
Preferred Qualifications:
  • Experience with distributed storage technologies, such as NFS
  • Knowledge of dynamic resource management frameworks, such as PCF or Kubernetes
  • Proficiency in Shell Scripting and DevOps tools, such as Ansible
  • Experience with API styles, such as SOAP, REST, and Microservices
What We Offer:
  • A dynamic and collaborative work environment
  • Opportunities for professional growth and development
  • A competitive salary and benefits package


  • San Leandro, California, United States Omni Inclusive Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Omni Inclusive. As a key member of our SRE support team, you will be responsible for ensuring the resiliency, performance, and availability of our Digital Sales & Marketing platforms.Key ResponsibilitiesCollaborate with engineering teams to resolve production outages...


  • San Leandro, California, United States VDart Inc Full time

    Job OverviewPosition: Site Reliability EngineerCompany: VDart IncRole Summary:We are seeking a skilled Site Reliability Engineer with a strong background in Java to enhance our platform's performance and reliability. The ideal candidate will have a proven track record in production support and a commitment to optimizing system health.Key...


  • San Francisco, California, United States Resource Informatics Group Full time

    Job Title:Site Reliability EngineerJob Summary:We are seeking a highly skilled Site Reliability Engineer to join our team at Resource Informatics Group. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our large-scale Oracle database systems.Key Responsibilities:Administer and troubleshoot...


  • San Francisco, California, United States Instabase Full time

    About InstabaseInstabase is a cutting-edge AI innovation company that empowers organizations to solve complex unstructured data problems. With a global presence and a customer-centric approach, we deliver top-tier solutions that provide unmatched advantages for everyday business operations.Job Title: Site Reliability EngineerWe are seeking a highly skilled...


  • San Francisco, California, United States Apollo Solutions Full time

    Site Reliability EngineerApollo Solutions has partnered with a pioneering artificial intelligence business that is revolutionizing the use of AI/ML in gaming and security.The company is working closely with government contracts and gaming console companies and is seeking a Site Reliability Engineer to join their growing team.The Site Reliability Engineer...


  • San Francisco, California, United States Perplexity AI Full time

    Site Reliability EngineerPerplexity AI is seeking a skilled Site Reliability Engineer to join our team and contribute to the development of our cutting-edge conversational answer engine.As a Site Reliability Engineer, you will be responsible for designing, implementing, and scaling the infrastructure and systems that support our web and mobile products.Key...


  • San Francisco, California, United States iTCO Solutions Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at iTCO Solutions. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and security of our cloud-based infrastructure.Key Responsibilities:Design and implement operational and infrastructural...


  • San Jose, California, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design and implement automation scripts using...


  • San Jose, California, United States Diverse Lynx Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design and implement automation scripts using shell,...


  • San Francisco, California, United States Wasmer Full time

    About the RoleWe are seeking an exceptional Site Reliability Engineer to join our team at Wasmer. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our Edge computing platform.Key ResponsibilitiesDesign, implement, and maintain scalable and reliable infrastructure solutions for our Edge computing...


  • San Jose, California, United States Syntricate Technologies Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at Syntricate Technologies. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Design, implement, and maintain scalable and highly...


  • San Francisco, California, United States Instabase Full time

    About InstabaseInstabase is a cutting-edge AI innovation company that empowers organizations to solve complex unstructured data problems. With a global presence and a customer-centric approach, we deliver top-tier solutions that provide unmatched advantages for everyday business operations.Job DescriptionWe are seeking a highly skilled Site Reliability...


  • San Francisco, California, United States Instabase Full time

    About InstabaseAt Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry.With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index...


  • San Francisco, California, United States Diverse Lynx Full time

    About the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Diverse Lynx LLC. As a key member of our organization, you will play a critical role in ensuring the reliability and efficiency of our digital infrastructure.Key Responsibilities:Design and implement reliable digital infrastructure solutionsCollaborate with...


  • San Francisco, California, United States Orb Full time

    About OrbOrb is a cutting-edge billing infrastructure company that empowers businesses to unlock their revenue potential. We believe that pricing and billing should not be a barrier to innovation and growth.Role & ImpactAs a Site Reliability Engineer at Orb, you will play a critical role in maintaining and scaling our robust infrastructure, ensuring...

  • Reliability Engineer

    2 weeks ago


    San Francisco, California, United States Diverse Lynx Full time

    Role OverviewWe are seeking a highly skilled Reliability Engineer to join our team at Diverse Lynx LLC. As a key member of our organization, you will be responsible for ensuring the reliability and resilience of our digital systems.Key ResponsibilitiesDesign and implement reliable digital systems and processesCollaborate with cross-functional teams to...


  • San Diego, California, United States ACL Digital Full time

    Job DescriptionDuration: 0-12 monthsJob Summary: We are seeking a highly skilled Site Reliability Engineer to join our team at ACL Digital. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based applications.Key Responsibilities:Hands-on application management and support for AWS...


  • San Francisco, California, United States GRNET Full time

    About GRNETGRNET is a leading provider of advanced network and cloud computing services to academic and research institutions, educational entities, and public sector agencies in Greece.Our ApproachWe adopt a Site Reliability Engineering approach to ensure the reliability, scalability, and efficiency of our infrastructure. Our team is divided into three...


  • San Francisco, California, United States GRNET Full time

    About GRNETGRNET is a leading provider of advanced network and cloud computing services to academic and research institutions, educational entities, and public sector agencies in Greece.Our ApproachWe adopt a Site Reliability Engineering approach to ensure the reliability, scalability, and efficiency of our infrastructure. Our team is divided into three...


  • San Francisco, California, United States Diverse Lynx Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer with 7+ years of experience in Java SRE to join our team at Diverse Lynx LLC. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our systems.Key ResponsibilitiesDesign and implement monitoring and alerting systems to ensure prompt...