Highly Skilled Infrastructure Engineer

1 week ago


Palo Alto, California, United States Snarkify Full time
About the Role

We are seeking a highly skilled and motivated Production Engineer / Site Reliability Engineer / DevOps Specialist to join our team at Snarkify. As a key member of our infrastructure team, you will play a critical role in ensuring the stability, scalability, and performance of our groundbreaking Zero-Knowledge Proof (ZKP) prover network.

Key Responsibilities
  • Infrastructure Management: Understand the architecture and deployment requirements of modern Layer 2 rollup stacks, and take charge of maintaining and enhancing our in-house zkRollup infrastructure to ensure optimal performance and reliability.
  • Cloud Services: Set up and maintain highly available Kubernetes (K8s) clusters across multiple environments, ensuring scalability, resilience, and security for our prover network.
  • Deployment Pipelines: Develop, improve, and manage deployment pipelines for third-party Docker images, ensuring seamless integration and consistent deployment across different clusters.
  • Customer Collaboration: Collaborate with external customers to understand their system architecture and deployment requirements, designing and implementing tailored plans for third-party prover deployments.
  • Monitoring and Logging: Design and implement robust monitoring, logging, and alerting systems to ensure the health and reliability of our decentralized network and hosted services, utilizing tools such as Prometheus, Grafana, and ELK Stack.
  • Cloud Resource Management: Manage cloud services and resources across platforms such as AWS, GCP, and private clusters, optimizing for performance, cost-efficiency, and security in a multi-cloud environment.
  • CI/CD Pipelines: Build and maintain CI/CD pipelines to support continuous integration and delivery for a diverse codebase, including Rust, C++, Python, and other technologies, enabling rapid and reliable software releases.
Requirements
  • Education: Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.
  • Experience: 3+ years of experience in site reliability engineering, DevOps, or a similar role, preferably in leading internet companies.
  • Skills: Expertise in containerization and orchestration technologies, including Docker and Kubernetes; strong experience with cloud platforms (AWS, GCP, Azure) and cloud-native tools and services; knowledge of monitoring and observability tools such as Prometheus, Grafana, and ELK Stack; proficiency with CI/CD tools such as GitHub Actions, CircleCI, or Jenkins.
  • Personal Qualities: Strong problem-solving skills, attention to detail, and a proactive approach to system reliability and scalability; excellent communication and teamwork skills, with the ability to work effectively in a remote, fast-paced, and dynamic environment.
About Snarkify

Snarkify is a cutting-edge technology company that empowers developers to build, deploy, and scale Zero-Knowledge Proof (ZKP) applications. We are passionate about scaling ZKPs for a trustless future and offer a competitive base salary with founding member equity, the opportunity to build the next-generation ZK computing platform, immersion in a team of top-notch global blockchain engineers, a flexible and innovative remote work environment, and room for continuous growth and development in the ZK field.



  • Palo Alto, California, United States stakefish Full time

    About the RoleWe are seeking a highly skilled DevOps Engineer to join our team at stakefish, a leading provider of staking services for blockchain networks. As a DevOps Engineer, you will play a critical role in building and maintaining our blockchain infrastructure, ensuring the security, scalability, and reliability of our systems.Key...


  • Palo Alto, California, United States Palantir Technologies Full time

    A Transformative Organization Palantir develops the premier software for data-centric decision-making and operational efficiency. By connecting the right information to those who require it, our platforms enable our partners to create life-saving medications, anticipate supply chain challenges, locate missing individuals, and much more. The Position We...


  • Palo Alto, California, United States SambaNova Systems Full time

    The age of ubiquitous AI is upon us. Organizations are increasingly leveraging generative AI to extract latent value from their data, streamline operations, minimize expenses, enhance efficiency, and drive innovation to fundamentally reshape their business models and operational frameworks.SambaNova SuiteTM stands as the pioneering full-stack generative AI...


  • Palo Alto, California, United States Wing Inflatables Inc Full time

    About Wing Inflatables IncWing Inflatables Inc is a leading provider of innovative solutions for last mile logistics. Our mission is to create a safe, fast, and sustainable delivery network that meets the needs of our customers. We design, build, and operate our aircraft, and offer delivery services on three continents. Our technology is designed to be easy...


  • Palo Alto, California, United States Palantir Technologies Full time

    About the RoleWe are seeking a highly skilled Backend Software Engineer to join our team at Palantir Technologies. As a key member of our infrastructure team, you will be responsible for designing and developing scalable, secure, and high-performance software systems that power our products.Key ResponsibilitiesDesign and implement performant search and...


  • Palo Alto, California, United States Tesla Full time

    The Supercomputing and AI infrastructure division at Tesla is integral to the high-performance computing and machine learning systems that support our advanced algorithms. This encompasses virtual simulations, hardware for Autopilot, silicon design, and our Dojo supercomputer. As the demand for enhanced data processing and optimized computational resources...


  • Palo Alto, California, United States AppLovin Full time

    About AppLovin AppLovin develops innovative technologies that empower businesses of all sizes to connect with their target audiences. The organization offers comprehensive software and AI solutions designed to help companies engage, monetize, and expand their global reach. For further details about AppLovin, please visit our website. To fulfill this mission,...


  • Palo Alto, California, United States Harmony Full time

    Position: AI Infrastructure Engineer at HarmonyHarmony is an innovative and open blockchain platform dedicated to Web3 applications, particularly in the realms of generative AI and machine learning. We invite you to be part of our mission to revolutionize the global economy through advanced technology and pioneering solutions in the blockchain industry.About...


  • Palo Alto, California, United States Protocol Labs Inc Full time

    Position Overview:As an Infrastructure Program Manager at Protocol Labs, you will be instrumental in steering initiatives and addressing intricate challenges associated with the development, security, and management of expansive infrastructure tailored to the decentralized framework of Web 3.0.Key Responsibilities:In this role, you will:Lead the execution of...


  • Palo Alto, California, United States Lane Clark & Peacock LLP. Full time

    About the RoleThis is a key position within Lane Clark & Peacock LLP's technology team, responsible for leading the development and implementation of cloud infrastructure solutions. The successful candidate will have a strong background in cloud computing, with experience in designing and delivering scalable and secure infrastructure solutions.Key...


  • Palo Alto, California, United States Zycus Inc. Full time

    Zycus Inc. is seeking a Director of IT Infrastructure with extensive expertise in SaaS Infrastructure and a hybrid framework encompassing Data Center Operations, Cloud, and virtualization technologies such as Openshift, Amazon Web Services (AWS), VMWare, and Azure.Key ResponsibilitiesDemonstrated proficiency in managing hybrid environments for Data Center...


  • Palo Alto, California, United States Zycus Inc. Full time

    Zycus Inc. is seeking a Director of IT Infrastructure with substantial expertise in SaaS Infrastructure and a comprehensive understanding of Hybrid Data Center Operations, Cloud technologies, and virtualization platforms such as Openshift, Amazon Web Services (AWS), VMWare, and Azure.Key ResponsibilitiesDemonstrate extensive experience in managing Hybrid...


  • Palo Alto, California, United States Wing Inflatables Inc Full time

    About Wing Inflatables IncWing Inflatables Inc is a leading provider of innovative solutions for last mile logistics. Our mission is to create a safe, fast, and sustainable delivery network that meets the needs of our customers. We design, build, and operate our aircraft, and offer delivery services on three continents.About the RoleWing Inflatables Inc is...


  • Palo Alto, California, United States Zycus Inc. Full time

    Zycus Inc. is seeking a Director of IT Infrastructure with extensive hands-on expertise in SaaS Infrastructure and a comprehensive understanding of Hybrid Data Center Operations, Cloud technologies, and virtualization solutions including Openshift, Amazon Web Services (AWS), VMWare, and Azure.Key ResponsibilitiesDemonstrate significant experience in managing...


  • Palo Alto, California, United States Zycus Inc. Full time

    Zycus Inc. is seeking a Director of IT Infrastructure with extensive hands-on expertise in SaaS Infrastructure and a hybrid model encompassing Data Center Operations, Cloud, and virtualization technologies such as Openshift, Amazon Web Services (AWS), VMWare, and Azure.Key ResponsibilitiesDemonstrate substantial experience in managing a hybrid environment of...


  • Palo Alto, California, United States Amazon Full time

    Machine Learning Engineer II, Search Science and Data Infrastructure Amazon Search creates powerful, customer–focused product search solutions and technologies. Whenever a customer visits an Amazon site worldwide and types in a query or browses through product categories, our systems go to work. We delight customers when we accurately understand their...


  • Palo Alto, California, United States Zycus Inc. Full time

    Zycus Inc. is seeking a Director of IT Infrastructure with extensive hands-on expertise in SaaS Infrastructure and a hybrid framework encompassing Data Center Operations, Cloud, and virtualization technologies such as Openshift, Amazon Web Services (AWS), VMWare, and Azure.Key ResponsibilitiesDemonstrated proficiency in managing hybrid environments of Data...


  • Palo Alto, California, United States Zycus Inc. Full time

    Zycus Inc. is seeking a Director of IT Infrastructure with extensive expertise in SaaS Infrastructure and a comprehensive understanding of Hybrid Data Center Operations, Cloud technologies, and virtualization solutions, including Openshift, Amazon Web Services (AWS), VMWare, and Azure.Key ResponsibilitiesDemonstrate substantial experience in managing Hybrid...


  • Palo Alto, California, United States ATR International Full time

    Job Title: Software Development EngineerAbout the Role:We are seeking a highly skilled Software Development Engineer to join our team at ATR International. As a member of our Software Engineering Group, you will play a key role in designing and developing innovative software solutions that advance businesses and careers.Key Responsibilities:Design and...


  • Palo Alto, California, United States Amazon Full time

    Amazon Search creates powerful, customer-focused product search solutions and technologies. Whenever a customer visits an Amazon site worldwide and types in a query or browses through product categories, our systems go to work. We delight customers when we accurately understand their intent expressed via a query or image, and reflect that understanding...