Reliability Engineer for Enterprise Infrastructure

6 days ago


Redwood City, California, United States Stanford University Full time

Stanford University is seeking a seasoned Service Reliability Engineer to join the Enterprise Technology team in Redwood City, California. This role plays a vital part in ensuring the reliability and scalability of on-premise and cloud systems.

Core Responsibilities:

  • Design and Deployment: Develop highly available hybrid systems on On premise and cloud platforms like AWS and OCI, focusing on Infrastructure-as-a-Service (IaaS) and Platform-as-a-Service (PaaS) offerings.
  • System Management: Manage installations, configurations, and upgrades; troubleshoot outages and incidents.
  • Coding Practices: Implement Infrastructure as Code practices using tools like Terraform to automate cloud infrastructure provisioning and management. Improve operational efficiency by automating routine application tasks using Python and shell scripting.
  • Pipeline Development: Design, implement, and maintain CI/CD pipelines to streamline application deployment processes, ensuring high-quality software delivery.
  • Containerization: Deploy and manage containerized applications using Docker and orchestrate them with Docker Compose or Kubernetes for scalability and resilience.
  • Infrastructure Modernization: Lead efforts to modernize existing infrastructure and applications by integrating new technologies and cloud-native solutions.
  • Capacity Planning: Actively participate in scaling, performance tuning, and capacity planning of Enterprise Stack, including Single Sign-On and SSL keystore management.
  • Security Measures: Conduct application server hardening to enhance security against potential threats.
  • Technical Support: Provide technical support for complex issues by collaborating with all stakeholders to assess current systems, recommend improvements for enhanced performance and scalability. Ensure effective communication regarding system status and operations.
  • Documentation: Create and maintain comprehensive documentation for system configurations, procedures, and best practices to ensure knowledge transfer and compliance.
  • Monitoring Processes: Ensure robust monitoring processes are in place and compliance with production security standards.

Requirements and Qualifications:

  • Diverse Middleware Experience: Experience with diverse middleware technologies on bare metal and Docker containers.
  • Cloud Infrastructure Expertise: Experience with Infrastructure as Code like Terraform and container orchestration utilities.
  • Full-Stack Infrastructure Building: Demonstrate Cloud Infrastructure experience with experience in building full-stack infrastructure for enterprise-ready applications.
  • Data Management Systems: Demonstrated experience in the support requirements of large data management systems including performance analysis and tuning of high-volume, transaction systems.
  • Version Control and CI/CD Tools: Experience with version control systems (Git, SVN) and CI/CD tools.
  • Programming Languages: Proficiency in programming and scripting languages, especially Python and Shell.
  • Linux-Based Systems: Strong working knowledge of Linux-based systems.

Additional Information:

This is a hybrid eligible position. The expected pay range for this position is $152,461 per annum, based on factors such as the scope and responsibilities of the position, qualifications of the selected candidate, departmental budget availability, internal equity, geographic location, and external market pay for comparable jobs.

At Stanford University, base pay represents only one aspect of the comprehensive rewards package. For detailed information on Stanford's extensive range of benefits and rewards offered to employees, visit the Cardinal at Work website.



  • Redwood City, California, United States C3, Inc. Full time

    C3.ai, Inc. is a leading provider of enterprise AI software for accelerating digital transformation.The company's proven C3 AI Platform offers comprehensive services to build enterprise-scale AI applications more efficiently and cost-effectively than alternative approaches.The platform supports the value chain in any industry with prebuilt, configurable,...


  • Redwood City, California, United States Dexterity Full time

    About DexterityWe believe robots can positively transform the world. Our breakthrough technology frees people to do the creative jobs that humans do best by enabling robots to handle repetitive and physically difficult work.At Dexterity, we're starting with warehouse automation, where smarter, more resilient supply chains impact millions of lives and...

  • Cloud Engineer

    6 days ago


    Redwood City, California, United States Box Full time

    Transforming Cloud Infrastructure with KubernetesAt Box, we're leading the way in content management and collaboration. Our mission is to empower customers to transform workflows across their organizations by bringing intelligence to the world of content. We're seeking a talented Senior Software Engineer to join our team and help drive this mission...


  • Redwood City, California, United States Zilliz Full time

    About ZillizZilliz is a pioneering company that specializes in developing next-generation vector database technologies to empower organizations in creating AI applications. As a fast-growing startup, we are dedicated to simplifying data management for AI and making vector databases accessible to every organization.Job DescriptionWe are seeking an experienced...


  • Redwood City, California, United States Zilliz Full time

    At Zilliz, we're revolutionizing the way organizations manage data for AI applications. As a Senior Cloud Infrastructure Engineer, you'll play a crucial role in developing cutting-edge distributed database systems using our innovative data science platforms.About ZillizWe're a fast-growing startup behind the industry's leading vector database company for...


  • Redwood City, California, United States Dexterity Full time

    About DexterityWe are a robotics company building automation systems to perform pick-place-pack tasks in warehouses. Our end-to-end systems use intelligent software to enable human-like dexterity in commodity robot arms.About the RoleWe are hiring a Software Infrastructure Engineer to work closely with our software engineering teams to deploy, manage and...


  • Redwood City, California, United States TigerGraph Full time

    TigerGraph is a pioneering platform for advanced analytics and machine learning on interconnected data. Its proven core technology is the only scalable graph database designed for enterprise environments, supporting fraud detection, customer 360, MDM, IoT, AI, and machine learning applications.Fortune 500 organizations and innovative mid-size companies...


  • Redwood City, California, United States C3 IoT Full time

    Job DescriptionC3.ai, Inc. is a leading provider of enterprise AI software for accelerating digital transformation. The C3 AI Platform offers comprehensive services to build scalable AI applications efficiently and cost-effectively. Our platform supports the value chain in various industries with prebuilt, high-value AI applications for reliability, fraud...


  • Redwood City, California, United States Bear Robotics, Inc. Full time

    Bear Robotics, Inc. is a cutting-edge robotics company focused on developing innovative automation solutions for various industries. Our products, including robot devices, cloud services, and public APIs, are designed to help businesses operate more efficiently and effectively.We are seeking an experienced Data Engineering Technical Lead to lead the design,...


  • Redwood City, California, United States C3 IoT Full time

    Unlock the Power of Enterprise-Scale AI ApplicationsC3.ai, Inc. is a leading provider of Enterprise AI software for accelerating digital transformation. Our comprehensive C3 AI Platform enables businesses to build enterprise-scale AI applications more efficiently and cost-effectively than alternative approaches.The platform supports the entire value chain in...


  • Redwood City, California, United States C3, Inc. Full time

    C3 AI, a leading Enterprise AI software provider, accelerates digital transformation with its proven C3 AI Platform. This platform offers comprehensive services to build enterprise-scale AI applications more efficiently and cost-effectively than alternative approaches.The C3 AI Platform supports the value chain in any industry with prebuilt, configurable,...


  • Redwood City, California, United States C3, Inc. Full time

    Job Overview:C3, Inc. is a leading enterprise AI software provider accelerating digital transformation. Our proven C3 AI Platform supports the value chain in various industries with prebuilt, configurable AI applications for reliability, fraud detection, and more.About the Role:We seek an experienced AI Enterprise Solutions Architect to work with our Data...


  • Redwood City, California, United States TigerGraph Full time

    TigerGraph, a pioneering platform for advanced analytics and machine learning on connected data, seeks an exceptional Enterprise Sales Executive to drive sales growth in the West Coast region. As a core technology leader in scalable graph databases for the enterprise, TigerGraph supports fraud detection, customer 360, MDM, IoT, AI, and machine learning...


  • Redwood City, California, United States Bear Robotics, Inc. Full time

    Senior Software Engineering Manager Job DescriptionWe are seeking a seasoned Senior Software Engineering Manager to lead our Cloud and Data teams, driving the development and scalability of cloud infrastructure, on-premise systems, and data pipelines.Key Responsibilities:Cloud Billing and Permissions: Oversee cloud usage and optimize billing strategies to...


  • Redwood City, California, United States Stanford University Full time

    About the RoleWe are seeking a highly skilled Cybersecurity Expert to join our team in Enterprise Operations at Stanford University. In this role, you will be responsible for analyzing security alerts, assessing and triaging as needed for investigations/incident response.You will participate in a wide range of security programs and services, building and...


  • Redwood City, California, United States Box Full time

    About Box">Box is the leading cloud content management platform, empowering organizations to accelerate their digital transformation. Our mission is to power how the world works together, and we're partnering with top enterprise organizations to achieve this goal.">What We Offer">As a senior member of our sales team, you'll have the opportunity to capture a...


  • Culver City, California, United States Diverse Lynx Full time

    Job Title: Cloud Infrastructure Automation EngineerLocation: Culver City, CA (Onsite)Type: ContractJob Description:We are seeking an experienced Cloud Infrastructure Automation Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing and implementing automated infrastructure provisioning, deployment, and...


  • Foster City, California, United States Zoox Full time

    At Zoox, we're pushing the boundaries of innovation in autonomous vehicles. We're looking for a seasoned Site Reliability Engineer to join our team.About the RoleThis is a key position that involves designing and maintaining the infrastructure required to support our autonomous vehicle fleet. You will be responsible for ensuring the uptime of critical...


  • Redwood City, California, United States C3 AI Full time

    C3 AI, a leading Enterprise AI software provider, seeks an exceptional Software Engineer to build the next generation AI platform. With several petabyte level data volumes in mind, this role is crucial for our company's success.About UsWe accelerate digital transformation with our proven C3 AI Platform, providing comprehensive services to build...


  • Redwood City, California, United States C3 IoT Full time

    C3 AI OverviewWe are a leading provider of Enterprise AI software for accelerating digital transformation. Our C3 AI Platform enables comprehensive services to build enterprise-scale AI applications more efficiently and cost-effectively.Job Summary:The Lead QA Automation Engineer will work with the engineering, product development, QA, operations, and...