Current jobs related to Reliability Engineer - San Francisco, California - Early Warning Services LLC

  • Reliability Engineer

    3 weeks ago


    San Francisco, California, United States SPAN Inc Full time

    About the Role: We are seeking an experienced Electrical Engineer to lead our reliability program for SPAN Inc. through all stages of development. In this role, you will rely on your past experience to develop a comprehensive reliability test program for new products and support any reliability issues that arise in the field. You will work with the electrical,...

  • Reliability Engineer

    4 weeks ago


    San Francisco, California, United States SPAN Inc Full time

    About the Role: SPAN Inc is seeking an experienced Electrical Engineer to lead the reliability program for our company. As a key member of our team, you will be responsible for developing and implementing a comprehensive reliability test program for our products and supporting any reliability issues that arise in the field. Key Responsibilities: Develop and...


  • San Francisco, California, United States Arbitrum Inc Full time

    Reliability Engineer: At Arbitrum Inc, we're on a mission to bring blockchain to a billion people. Our developer platform is designed to make building on the blockchain easy, and we're looking for a skilled Reliability Engineer to join our Infrastructure team. As a Reliability Engineer, you'll collaborate with our engineering team to design, deploy, and...


  • San Francisco, California, United States Alchemy Full time

    About the Role: Alchemy is seeking a highly skilled Infrastructure Reliability Specialist to join our team. As a key member of our Infrastructure department, you will play a critical role in designing, deploying, and continuously improving the infrastructure supporting our globally used developer platform. Your focus will be on enhancing developer productivity...


  • San Francisco, California, United States Unreal Gigs Full time

    Job Title: Site Reliability Engineer. At Unreal Gigs, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the high availability, scalability, and performance of our complex distributed systems. Key Responsibilities: Design and implement monitoring, logging, and alerting...


  • San Francisco, California, United States Roman Health Pharmacy LLC Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Xero. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and scalability of our cloud-based platform. Key Responsibilities: Investigate operational surprises and support teams in post-incident activities. Conduct in-depth incident...


  • San Francisco, California, United States Wasmer Full time

    About the Role: We are seeking an exceptional Site Reliability Engineer to join our team at Wasmer. As a Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining scalable and reliable infrastructure solutions for our Edge computing platform. Key Responsibilities: Design and implement scalable and reliable infrastructure...


  • San Francisco, California, United States Instabase Full time

    About Instabase: At Instabase, we're passionate about democratizing access to cutting-edge AI innovation to enable any organization to solve previously unsolvable unstructured data problems in their industry. With customers representing some of the largest and most complex organizations in the world, and investors like Greylock, Andreessen Horowitz, and Index...


  • San Francisco, California, United States Instabase Full time

    About Instabase: At Instabase, we're passionate about harnessing the power of AI innovation to democratize access to cutting-edge technology and empower organizations to solve complex unstructured data problems. With a strong presence in the market and a talented team, we're committed to delivering top-tier solutions that drive business success. Job...


  • San Francisco, California, United States DaVita Full time

    About the Role: The WEX Site Reliability Engineering team is seeking a skilled Site Reliability Engineer to join our Platform Reliability organization. As a key member of our team, you will be responsible for developing software and solutions focused on observability, incident response, reliability, and performance. You will collaborate with our engineering...


  • San Francisco, California, United States Instabase Full time

    About Instabase: Instabase is a global company with offices in San Francisco, New York, London, and Bengaluru. We're a people-first organization that values experimentation, curiosity, and customer obsession. Job Summary: We're seeking a Site Reliability Engineer to join our Site Reliability and Platform Engineering team. As a key member of our team, you'll be...


  • San Francisco, California, United States Withorb Full time

    About Us: Orb is a cutting-edge technology company on a mission to revolutionize the way businesses approach revenue growth. Our team is passionate about building a robust infrastructure that enables our customers to unlock their full potential. Job Description: We are seeking a highly skilled Site Reliability Engineer to join our team. As a key member of our...


  • San Francisco, California, United States Apollo Solutions Full time

    Site Reliability Engineer: Apollo Solutions has partnered with a pioneering artificial intelligence business that is revolutionizing the use of AI/ML in gaming and security. The company is working closely with government contracts and gaming console companies and is seeking a Site Reliability Engineer to join their growing team. The Site Reliability Engineer...


  • San Francisco, California, United States Wasmer Full time

    About the Role: We are seeking an exceptional Site Reliability Engineer to join our team at Wasmer. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our Edge computing platform. Key Responsibilities: Design, implement, and maintain scalable and reliable infrastructure solutions for our Edge computing...


  • San Francisco, California, United States Roman Health Pharmacy LLC Full time

    About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Xero. As a key member of our Reliability Enablement team, you will play a critical role in ensuring the reliability and performance of our systems. Key Responsibilities: Investigate operational surprises and support teams in post-incident activities. Conduct in-depth...


  • San Francisco, California, United States BaseTen Labs, Inc. Full time

    About BaseTen Labs, Inc.: We're a rapidly growing team of innovators backed by top-tier investors, including IVP, Spark Capital, and Sarah Guo at Conviction. Our mission is to empower machine learning teams at enterprises and AI-native companies to build scalable, reliable, and efficient infrastructure. Job Description: We're seeking a skilled Site Reliability...


  • San Francisco, California, United States SpeedCast Full time

    Job Title: Site Reliability Engineer. At Speedcast, we're seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based communication solutions. Key Responsibilities: Analyze and design continuous...


  • San Francisco, California, United States Orb Full time

    About the Role: Orb is seeking a skilled Site Reliability Engineer to join our team. As a key member of our engineering organization, you will play a critical role in maintaining and scaling our robust infrastructure, ensuring stability, scalability, and performance. You will be responsible for tackling complex engineering challenges, from scaling our data...


  • San Francisco, California, United States Outdefine Full time

    About the Job: We are seeking a highly skilled Site Reliability Engineer to join our team at Outdefine. As a key member of our engineering team, you will be responsible for ensuring the reliability, scalability, and performance of our ecommerce platform. Key Responsibilities: Design and implement scalable and highly available cloud infrastructure using Kubernetes...

Reliability Engineer

2 months ago


San Francisco, California, United States Early Warning Services LLC Full time
About the Role

We are seeking a highly skilled Reliability Engineer to join our team at Early Warning Services LLC. As a key member of our platform operations team, you will be responsible for ensuring the stability, performance, and growth of our critical business platforms.

Key Responsibilities
  • Platform Stability and Performance
    • Partner with Development to ensure the stability and performance of our business platforms.
    • Identify and resolve application performance challenges through monitoring, customer feedback, and team member feedback.
    • Suggest technical solutions to product or ART teams to increase performance and remove customer friction.
  • Documentation and Knowledge Management
    • Document, manage, and support code deployments into upper environments.
    • Create and update documentation focused on applications, services, infrastructures, business requirements, testing, and other processes and/or procedures.
  • Collaboration and Communication
    • Partner with all integrated application teams to identify, clarify, collaborate on, and execute infrastructure upgrades and vulnerability remediation.
    • Provide training and mentorship to Production Reliability Engineers and Production Control Analysts.
    • Work closely with Product Owners, Architecture, Security, Engineering, and other teams to agree upon requirements and priorities.
  • Problem-Solving and Analysis
    • Analyze problems and review multiple alternate solutions, including analysis of advantages and disadvantages, and make decisions.
    • Relate business needs to system capabilities and fully understand the role of the systems and impacts to the business.
  • Process Improvement
    • Identify areas where efficiencies and/or automation could improve processes, reduce or eliminate manual work, reduce risk, and provide a better user experience.
    • Prioritize tasks, stories, and assignments on the team Kanban board, and keep them up to date.
  • Incident Management
    • Assist in triaging, managing, and supporting incident tickets and their corresponding Service Level Agreements.
  • Communication and Reporting
    • Provide or automate the timely delivery of reports and status updates required by projects, leaders, and others.
  • Security and Compliance
    • Support the company's commitment to risk management and to protecting the integrity and confidentiality of our systems and data.
Requirements
  • Education and Experience
    • Minimum of 5 years of related experience.
    • Education and/or experience typically obtained through completion of a bachelor's degree in Computer Science, Business, or related field.
  • Skills and Knowledge
    • Demonstrated experience in development, project management, and requirements gathering for systems performance and reliability.
    • Knowledge of Information Technology Infrastructure Library (ITIL) and Information Technology Service Management (ITSM) disciplines, practices, and procedures.
    • Knowledge and/or experience in financial institution operations related to assigned platform capabilities.
  • Personal Qualities
    • Ability to analyze problems and review multiple alternate solutions.
    • Ability to relate business needs to system capabilities.
    • Strong attention to detail and accuracy.
    • Effective verbal and written communication skills.
Preferred Qualifications
  • Recent experience with Application Support.
  • Experience with Oracle, SQL, Java, and/or Linux.
  • Experience with F5, IBM MQ, AWS, Splunk, and/or AppDynamics is helpful.
  • Amazon Cloud certification.