Platform Engineer

4 weeks ago


San Francisco, California, United States Voltage Park Inc Full time

About Voltage Park On-Demand

Voltage Park's mission is to make AI infrastructure accessible to all. Today, we own 24,000+ H100s and operate 7+ data-centers across the US. We serve customers of all sizes, from small research labs to large enterprises.

We're looking for a skilled Platform Engineer to join our On-Demand team, where you'll help us build a platform that allows customers to flexibly rent out these GPUs for as little or as long as they want.

Our team is small, highly motivated, and focused on engineering excellence. We all operate with a founder mentality; several of us have founded and launched prior businesses.

All team members are hands-on and contribute directly to our team's mission.

If you join us, you'll be an early team member and help us shape:

Our future company culture

Our engineering practices

People that we hire

The direction & focus of our products

Note:

this role is in-person and you must be based in the San Francisco Bay Area to apply. We are not able to provide sponsorship for this position.

Key Responsibilities

Maintain servers & systems integral to our platform's reliability

Develop software - either for automation or for front/back end

Write automation for backend orchestration systems - MaaS; Libvirt; PFsense

Track downtimes and conduct RCA's

Write automation scripts to audit performance anomalies across our fleet of servers

Requirements

5+ years Linux administration (Ubuntu/Debian focus)

Strong experience with Libvirt (KVM) virtualization

Proficient in Python and Bash scripting

Experience with automation tools (preferably Ansible)

Solid networking knowledge

Experience with PostgreSQL

Familiarity with CEPH and NFS storage solutions

Ideal Experiences

Experience with GPU virtualization and PCIe passthrough

Knowledge of Proxmox VE, OpenStack, or OpenNebula

Experience with Docker and Kubernetes

Experience with bare metal automation (e.g., Ubuntu MAAS)

Monitoring experience (Prometheus, Grafana, ELK Stack)

Experience with infrastructure-as-code tools (e.g., Terraform)

Experience with Redis

Experience working with Python (backend), Postgres (database), and React + Tailwind (frontend)

What We're Looking For

You are ambitious and always looking for ways to improve. We operate nearly $1B worth of assets, and the opportunity for impact is limitless.

This role will give you the most responsibilities you've ever had and hold you to higher standards than other companies you've worked at.

Expect to do the best and most impactful work of your career at Voltage Park.

You're focused on impact and don't get lost in the weeds on details that don't matter. You're excited to work on whatever solves the biggest customer problems, not just the coolest technical challenges.

You understand when making 80/20 trade-offs is the right thing to do and never compromise on your high standards when making those tradeoffs.

You have a strong work ethic. As a startup, we are trying to change the world and take on many large, $B+ competitors. Raw hours make a huge difference when facing overwhelming odds. Having a strong work ethic is a competitive advantage.

You take ownership of your initiatives. When you say you'll do something, you get it done without anyone having to check-in on you. You ship fully baked features end-to-end. You're accountable for the deadlines you set, and you figure out a solution if something unexpected occurs.

You make tradeoffs when necessary and are open to new ideas. As a startup, we have to make decisions quickly and often with incomplete information. We also face problems that have no obvious solutions. Sometimes, the best ideas sound crazy at first. You don't dismiss your teammate's ideas and are open to being challenged by others.

Voltage Park is an equal opportunity employer and makes employment decisions on the basis of merit.

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic under federal, state, or local law.

If you require an accommodation during the job application process, please notify your recruiter.


  • AI Platform Engineer

    3 weeks ago


    San Francisco, California, United States Labelbox Full time

    About the RoleLabelbox is seeking a skilled AI Platform Engineer to join our team. As a key member of our engineering organization, you will be responsible for building and maintaining a scalable AI platform that utilizes foundation models for real-world applications.Your Day to DayEnhance and improve Labelbox's core machine learning capabilities, including...


  • San Francisco, California, United States Acceler8 Talent Full time

    Backend Platform EngineerLocation: Flexible/HybridIntroduction:We are seeking a skilled Backend Platform Engineer to join our team at Acceler8 Talent. In this role, you'll have the opportunity to shape our data platform and work with advanced MLOps technologies. If you have a solid engineering background and a knack for developing scalable solutions, this...


  • San Francisco, California, United States ClassDojo Full time

    Job DescriptionWe are seeking a highly skilled Cloud Platform Engineer to join our team at ClassDojo. As a Cloud Platform Engineer, you will be responsible for designing, building, and maintaining our cloud-based infrastructure. This is a unique opportunity to work on a large-scale platform that connects teachers, children, and families globally.Key...


  • San Francisco, California, United States Capital One Full time

    Capital One is seeking a skilled Senior Platform Engineer to drive a major transformation within the company. As a key member of the engineering team, you will be responsible for creating and supporting DevOps tools with emerging technologies.Key Responsibilities:Work with product owners to understand desired application capabilities and testing...


  • San Francisco, California, United States Cruise Full time

    We're Cruise, a self-driving service designed for the cities we love. Our mission is to create a safer, more efficient, and more enjoyable transportation experience for everyone.As a Staff Engineer on our Capacity and Performance Engineering (CPE) Team, you will play a critical role in improving the scalability and efficiency of our cloud infrastructure. You...


  • San Francisco, California, United States Infinitus LLC Full time

    About the Role:We are seeking a highly skilled Platform Software Engineer to join our team at Infinitus LLC. As a key member of our engineering team, you will be responsible for designing and implementing scalable, robust backend systems that meet the needs of our customers.Our ideal candidate will have a strong background in backend development, with...


  • San Francisco, California, United States Chime Full time

    About the RoleWe are seeking a skilled Software Engineer to contribute to the development of our new financial platform. This platform will serve as the backbone for Chime, powering critical financial activities for millions of members.As a Software Engineer, you will be responsible for designing, developing, testing, and deploying components of the...


  • San Francisco, California, United States Cloudflare Inc Full time

    About the RoleWe are seeking a highly skilled Security Platform Engineer to join our team at Cloudflare. As a Security Platform Engineer, you will play a critical role in providing a secure infrastructure for one of the biggest online platforms in the world.Key Responsibilities:Build and manage the PKI that provides trusted certificates to all of our...


  • San Francisco, California, United States Tatari Full time

    About the RoleTatari is revolutionizing TV advertising by combining sophisticated media buying with proprietary analytics. We work with top brands to grow their business using linear and streaming TV ads.As Engineering Manager - Data Platform, you will lead the Data Platform team, working closely with product, data science, and other engineering teams to...


  • San Francisco, California, United States Cloudflare Inc Full time

    About UsAt Cloudflare, we're on a mission to help build a better Internet. Our team runs one of the world's largest networks, powering millions of websites and other Internet properties for customers ranging from individual bloggers to SMBs to Fortune 500 companies.Cloudflare protects and accelerates any Internet application online without adding hardware,...


  • San Francisco, California, United States Labelbox Full time

    About the RoleWe are seeking a highly skilled Senior Backend Engineer to join our Platform engineering team. As a key member of our team, you will be responsible for leading the development of critical backend systems for large-scale distributed data infrastructure.Key Responsibilities:Design, develop, and deploy high-throughput exportsArchitect highly...


  • San Francisco, California, United States Whatnot Full time

    bAbout Whatnot/bbrWe're building the future of ecommerce, bringing together community, shopping and entertainment. Whatnot has something for everyone.brbrWe're innovating in the fast-paced world of live auctions from fashion, beauty, electronics to collectibles like trading cards, comic books, and even live plants.brbrWe're a remote co-located team,...


  • San Francisco, California, United States Capital One Careers Full time

    Job DescriptionCapital One is seeking a highly skilled Platform Engineer to join our team. As a Platform Engineer, you will be responsible for developing and maintaining the Human Resource technology ecosystem at Capital One.Key ResponsibilitiesTranslate business priorities into technical requirements and integrate HR value proposition and strategies to meet...


  • San Francisco, California, United States CyberCube Full time

    Job SummaryWe are seeking a highly skilled Database Platform Engineer to join our team at CyberCube Analytics, Inc. in San Francisco, CA. As a key member of our data infrastructure team, you will be responsible for designing, implementing, and maintaining large-scale data systems.Key ResponsibilitiesDesign and develop data analytics platforms to store,...


  • San Francisco, California, United States Labelbox Full time

    About the RoleAs a Senior Backend Engineer on the Platform engineering team, you will lead the development of key aspects of our core services, including data I/O, streaming, and storage capabilities.In this role, you will drive the evolution of our data infrastructure, focusing on large-scale high-throughput import and export, enabling Labelbox customers to...


  • San Francisco, California, United States KingCom Full time

    Job Description:We are seeking a highly skilled Staff Backend Engineer to join our Ads Engineering team at KingCom. As a key member of our team, you will be responsible for designing and developing highly scalable, highly available, and highly reliable Ads & Monetization platform that handles billions of requests per day.As a Staff Backend Engineer, you will...


  • San Francisco, California, United States Labelbox Full time

    Labelbox is a leading provider of data infrastructure for generative AI. Our comprehensive platform combines on-demand labeling services with a robust data labeling platform. The Boost labeling service is powered by a community of highly-educated experts who span all major languages and advanced subjects. They are available on-demand to rapidly generate new...


  • San Francisco, California, United States Galileo Co. Full time

    About Galileo Co.Galileo Co. is a leading-edge technology company founded in 2021 by a team of seasoned engineering leaders from Google AI and Uber AI. With over $23M in funding and backed by top-tier investors, Galileo Co. is poised to revolutionize the AI/ML industry with its cutting-edge Large Language Model (LLM) evaluation and experimentation...


  • San Francisco, California, United States Aclima Full time

    About the RoleAclima is seeking a skilled Senior Data Platform Engineer to join our dynamic team. As a platform expert, you will be responsible for architecting and developing robust solutions that power our mobile air pollution sensor data platform.Key Responsibilities:Lead the design and architecture of scalable backend systems capable of processing and...


  • San Francisco, California, United States Sephora Full time

    About the Role:We are seeking a highly skilled Lead Data Platform Engineer to join our team at Sephora. As a key member of our technology group, you will be responsible for driving ML initiatives for the enterprise and developing innovative solutions to help our customers discover products.Key Responsibilities:Operationalize ML solutions and work alongside...