Senior Backend Engineer, AI Data Platform
1 day ago
At Labelbox, we're building the critical infrastructure that powers breakthrough AI models at leading research labs and enterprises. Since 2018, we've been pioneering data-centric approaches that are fundamental to AI development, and our work becomes even more essential as AI capabilities expand exponentially.
About LabelboxWe're the only company offering three integrated solutions for frontier AI development:
- Enterprise Platform & Tools: Advanced annotation tools, workflow automation, and quality control systems that enable teams to produce high-quality training data at scale
- Frontier Data Labeling Service: Specialized data labeling through Alignerr, leveraging subject matter experts for next-generation AI models
- Expert Marketplace: Connecting AI teams with highly skilled annotators and domain experts for flexible scaling
- High-Impact Environment: We operate like an early-stage startup, focusing on impact over process. You'll take on expanded responsibilities quickly, with career growth directly tied to your contributions.
- Technical Excellence: Work at the cutting edge of AI development, collaborating with industry leaders and shaping the future of artificial intelligence.
- Innovation at Speed: We celebrate those who take ownership, move fast, and deliver impact. Our environment rewards high agency and rapid execution.
- Continuous Growth: Every role requires continuous learning and evolution. You'll be surrounded by curious minds solving complex problems at the frontier of AI.
- Clear Ownership: You'll know exactly what you're responsible for and have the autonomy to execute. We empower people to drive results through clear ownership and metrics.
As a Backend Engineer, AI Data Platform at Labelbox, you will lead the design and development of our core data infrastructure, powering the seamless flow, storage, and processing of data for our AI platform. Your expertise will drive the evolution of scalable systems—anchored by high-performance databases—to support large-scale workflows, high-throughput data I/O, and streaming capabilities. You'll enable Labelbox customers to efficiently manage and stream data for training next-generation AI models. Owning critical components of our data infrastructure, including database architecture, you'll work end-to-end on projects from design to deployment. Collaborating cross-functionally with stakeholders, you'll transform ideas into robust, scalable solutions that enhance platform adoption and customer success.
Your Impact- Design and build scalable data infrastructure, integrating high-performance databases (relational, NoSQL, cloud-native) with distributed systems for data processing, storage, and streaming.
- Optimize database systems for performance, reliability, and scalability, ensuring efficient data retrieval, indexing, and querying to support AI workflows.
- Develop and maintain data pipelines using distributed queues, message brokers, and job management mechanisms to enable high-throughput import/export operations.
- Collaborate with team members and stakeholders to align data infrastructure with platform goals and customer needs.
- Participate in Sprint Planning, Standups, and related activities to drive data-focused initiatives forward.
- Mentor and guide less experienced engineers, sharing expertise in data infrastructure and database optimization.
- Support the team's area of ownership by working with the Support organization to resolve customer-facing data issues.
- Stay abreast of industry trends in data infrastructure and database technologies, incorporating relevant innovations into our systems.
- Contribute to technical documentation, research publications, blog posts, and presentations at conferences and forums.
- Innovation in AI: Enhance data infrastructure capabilities for an AI platform used by leading AI labs to develop powerful multi-modal large language models (LLMs).
- Bachelor's degree in Computer Science, Data Engineering, or a related field. Advanced degree preferred.
- 4+ years of work experience in a software or data-focused company, with significant expertise in data infrastructure and backend engineering.
- Deep knowledge of designing and managing scalable database systems, including relational databases (e.g., PostgreSQL, MySQL), NoSQL stores (e.g., MongoDB, Cassandra), and cloud-native solutions (e.g., Google Spanner, AWS DynamoDB).
- Strong experience with data infrastructure components such as data pipelines, streaming systems, and storage architectures (e.g., Cloud Buckets, Key-Value Stores).
- Proficiency in optimizing databases for performance (e.g., schema design, indexing, query tuning) and integrating them with broader data workflows.
- Previous experience with distributed systems tools (e.g., queues, message brokers like Kafka or RabbitMQ, job orchestration frameworks) for real-time data processing and other use cases..
- Previous experience with search engines (e.g., ElasticSearch).
- Knowledge of backend development using languages like Python, Java, or TypeScript; familiarity with NodeJS and NestJS is a plus.
- Proficient in data structures, algorithms, and system design for large-scale data management.
- Demonstrated ability to keep up with trends in data infrastructure and database technologies.
- Excellent communication and collaboration skills.
- Strong sense of ownership and ability to thrive in a fast-paced environment.
- Comfortable with ambiguity, breaking down high-level requirements into actionable data infrastructure tasks methodically.
- Resourceful problem-solver with attention to detail, eager to take initiative and deliver results.
- High proficiency in leveraging AI tools for daily development (e.g., Cursor, GitHub Copilot).
- Familiarity with data warehousing solutions (e.g., Snowflake, BigQuery).
- Experience with container orchestration systems (e.g., Kubernetes) for deploying data infrastructure components.
- Experience with one or more public cloud platforms:
- Google Cloud Platform (GCP) (preferred)
- Amazon Web Services (AWS)
- Microsoft Azure
- Understanding of the Data + AI ecosystem and its relevance to large-scale AI platforms.
- Knowledge of memory management and optimization in data-intensive systems.
- Experience with DevOps tools (e.g., ArgoCD, DataDog) for monitoring and managing data infrastructure.
- Previous experience using LLM backed AI services such as from OpenAI, Anthropic, Google, etc. to develop product features.
At Labelbox Engineering, we're building a comprehensive platform that powers the future of AI development. Our team combines deep technical expertise with a passion for innovation, working at the intersection of AI infrastructure, data systems, and user experience. We believe in pushing technical boundaries while maintaining high standards of code quality and system reliability. Our engineering culture emphasizes autonomous decision-making, rapid iteration, and collaborative problem-solving. We've cultivated an environment where engineers can take ownership of significant challenges, experiment with cutting-edge technologies, and see their solutions directly impact how leading AI labs and enterprises build the next generation of AI systems.
Our Technology StackOur engineering team works with a modern tech stack designed for scalability, performance, and developer efficiency:
- Frontend: with Redux, TypeScript
- Backend: , TypeScript, Python, some Java & Kotlin
- APIs: GraphQL
- Cloud & Infrastructure: Google Cloud Platform (GCP), Kubernetes
- Databases: MySQL, Spanner, PostgreSQL
- Queueing / Streaming: Kafka, PubSub
Labelbox strives to ensure pay parity across the organization and discuss compensation transparently. The expected annual base salary range for United States-based candidates is below. This range is not inclusive of any potential equity packages or additional benefits. Exact compensation varies based on a variety of factors, including skills and competencies, experience, and geographical location.
Annual base salary range
$160,000 - $260,000 USD
Life at Labelbox- Location: Join our dedicated tech hubs in San Francisco or Wrocław, Poland
- Work Style: Hybrid model with 2 days per week in office, combining collaboration and flexibility
- Environment: Fast-paced and high-intensity, perfect for ambitious individuals who thrive on ownership and quick decision-making
- Growth: Career advancement opportunities directly tied to your impact
- Vision: Be part of building the foundation for humanity's most transformative technology
We believe data will remain crucial in achieving artificial general intelligence. As AI models become more sophisticated, the need for high-quality, specialized training data will only grow. Join us in developing new products and services that enable the next generation of AI breakthroughs.
Labelbox is backed by leading investors including SoftBank, Andreessen Horowitz, B Capital, Gradient Ventures, Databricks Ventures, and Kleiner Perkins. Our customers include Fortune 500 enterprises and leading AI labs.
Your Personal Data Privacy: Any personal information you provide Labelbox as a part of your application will be processed in accordance with Labelbox's Job Applicant Privacy notice.
Any emails from Labelbox team members will originate from a email address. If you encounter anything that raises suspicions during your interactions, we encourage you to exercise caution and suspend or discontinue communications.
-
Senior Backend Engineer
4 days ago
San Francisco, California, United States David AI Full timeAbout David AIDavid AI is the first audio data research company. We bring an R&D approach to data–developing datasets with the same rigor AI labs bring to models. Our mission is to bring AI into the real world, and we believe audio is the gateway. Speech is versatile, accessible, and human—it fits naturally into everyday life. As audio AI advances and...
-
Senior Software Engineer
2 weeks ago
San Francisco, California, United States Loop AI - Delivery Intelligence Platform Full time $120,000 - $180,000 per yearAbout LoopLoop AI is a San Francisco–based tech company founded in 2022. We provide a Delivery Intelligence Platform for data-driven digital food brands, helping them optimize operations and make informed decisions. As a seed-stage AI startup, we are revolutionizing the restaurant industry by streamlining and automating back-office operations through...
-
Senior Backend Engineer
3 days ago
San Francisco, California, United States Jack & JillExternal ATS Full timeThis is a job that Jill is helping to recruit for on behalf of one of our customers. To apply, speak to Jack.Title: Senior Backend Engineer ($180k-$200k + Equity) at Pioneering AI simulation platformJob Title:Senior Backend EngineerSalary:$180k-$200k + EquityCompany Description:Pioneering AI simulation platformJob Description:Own backend systems and...
-
Senior Product Engineer, AI Data Platform
2 weeks ago
San Francisco, California, United States Labelbox Full timeShape the Future of AIAt Labelbox, we're building the critical infrastructure that powers breakthrough AI models at leading research labs and enterprises. Since 2018, we've been pioneering data-centric approaches that are fundamental to AI development, and our work becomes even more essential as AI capabilities expand exponentially.About LabelboxWe're the...
-
Senior Platform Engineer
2 weeks ago
San Francisco, California, United States Across-AI Full time $140,000 - $170,000 per yearSan Francisco, CA or Vancouver, BC, Canada – Hybrid / Onsite Preferred (3 days a week) Full-timeAbout the RoleWe're looking for a hands-on experienced Platform Engineer to shape and scale the foundation of our AI-driven enterprise platform. You'll be the technical backbone of our infrastructure efforts — working across backend systems, Kubernetes-based...
-
Senior Backend Engineer
2 weeks ago
San Francisco, California, United States Jobs via Dice Full time $120,000 - $200,000 per yearDice is the leading career destination for tech experts at every stage of their careers. Our client, Pivotal Solutions Inc, is seeking the following. Apply via Dice todayOur client, located in San Francisco, CA is looking to hire a Senior Backend Engineer as a permanent full-time employee.I have attached below a detailed job description for your review.If...
-
AI Engineer
2 weeks ago
San Francisco, California, United States Conduit AI Full timeAbout Conduit\ Conduit is the platform for building conversational AI agents focused on hospitality. Our AI agents automate inbound and outbound conversational workflows for to increase conversions, reduce costs, and improve customer satisfaction.Our MissionTo maximize conversational automation for high touch businesses.What Makes Conduit UniqueWe're an AI...
-
Platform Backend Engineer
1 day ago
San Francisco, California, United States Neon Redwood Full timeBackend EngineerThe companyNeon Redwood is a data services consulting company, working on cutting-edge AI and data-driven solutions. We are a team of passionate engineers and data experts, and we are currently looking for a Backend Engineer to join our team and help us develop and expand our data infrastructure and analytics capabilities.The RoleWe are...
-
Senior Backend Engineer
3 days ago
San Francisco, California, United States DRH Search Full timeWe're assisting a well-funded startup with their search for Senior Backend Engineers. They've created a platform marketplace for sports cards and collectibles. They're hiring multiple engineers in this capacity and the role will be onsite in their SF office. What you'll do: Design, build, and maintain scalable backend applications and APIs using ,...
-
Senior Backend Software Engineer
2 weeks ago
San Francisco, California, United States Retell AI Full time $215,000 - $290,000 per yearAbout Retell AiRetell AI is using the first principles to reimagine the call center with cutting edge voice AI.We believe voice is still the most natural way humans communicate, yet it has been trapped in outdated call centers for decades. Our mission is to bring intelligence, empathy, and speed to every phone conversation between businesses and their...