Staff Software Engineer, Data Curation

2 weeks ago


San Francisco, California, United States Altos Labs Full time $200,000 - $320,000 per year

Our Mission
Our mission is to restore cell health and resilience through cell rejuvenation to reverse disease, injury, and the disabilities that can occur throughout life.

For more information, see our website at

Our Value
Our Single Altos Value: Everyone Owns Achieving Our Inspiring Mission.

Diversity at Altos
We believe that diverse perspectives are foundational to scientific innovation and inquiry. At Altos, exceptional scientists and industry leaders from around the world work together to advance a shared mission. Our intentional focus is on Belonging, so that all employees know that they are valued for their unique perspectives. We are all accountable for sustaining a diverse and inclusive environment.

What You Will Contribute To Altos
Use AI agents to make complex research data FAIR (Findable, Accessible, Interoperable, Reusable) so scientists and product teams can ask richer questions, move faster, and advance discovery. Be part of a team using knowledge and data engineering to enable the transition from manual to LLM‑enabled, agentic data ingestion and curation. You'll sit at the intersection of data curation, data engineering, and knowledge engineering. Your job is to automate the ingestion and standardization of multi‑source datasets into governed, searchable, analytics-ready assets, and to model the domain knowledge that ties them together.

Responsibilities

  • Curate and harmonize data. Ingest, profile, clean, normalize, and annotate multi‑modal research datasets (e.g., genomics/transcriptomics, proteomics, imaging/microscopy, CRISPR screens, assay/instrument metadata). Map to controlled vocabularies and standards; manage identifiers, synonyms, and crosswalks.
  • Deliver insights from curated data. Focus on the substance: entities, relationships, and annotations that answer real research and product questions, using public domain assets from Ensembl, GEO, PubMed, OMIM, and OLS, among others. Use pipelines, existing data sources, and storage pragmatically as tools to deliver content and outcomes.
  • Model knowledge to serve decisions. Capture the concepts and links researchers actually use; keep schemas lightweight and purpose‑built. Leverage OBO Foundry ontologies; define with LinkML; align to the Biolink Model; and integrate/serve with platforms such as BioCypher.
  • Quality, governance & AI enablement. Instrument automated checks (tests/expectations), process improvements for data FAIRification, and LLM‑assisted validations; capture provenance/lineage; codify SOPs; and facilitate the migration of processes from manual → automation → agentic (MCP‑integrated) workflows.
  • Serve as a key technical liaison between scientific, data science, and engineering teams, translating complex research needs into scalable and maintainable data solutions.
  • Define and evangelize best practices for data and knowledge engineering across the organization, mentoring junior team members and building reusable, AI-enhanced, enterprise-level components.

Who You Are
Minimum Qualifications

  • PhD in Biological Sciences, Computer Science, Software Engineering, or a related quantitative field, or equivalent technical experience.
  • Candidates should have 8+ years of relevant experience in data curation, ontology/knowledge engineering, or data engineering (or equivalent experience) at a biotechnology company.
  • Mindset: You prioritize data and business objectives over tools; technology is a means to an end.
  • Demonstrably strong Python expertise, particularly in the context of data modeling and processing, with strong skills in both relational (SQL) and graph data stores, and the ability to choose pragmatically between them (e.g., Postgres/Redshift vs. Neo4j/Neptune).
  • Comfortable building pragmatic ETL/ELT workflows in a major cloud (preferably AWS), using orchestration frameworks or AWS-native tools.
  • Active user of AI coding editors such as Cursor, with an active interest in designing and building Model Context Protocol (MCP) applications; motivated to migrate processes from manual → automation → agentic.
  • Mature understanding of data quality, provenance, versioning, and "curation as code," including hands-on use of testing/validation frameworks.

Preferred Qualifications

  • Experience in basic/exploratory life‑science research across multiple modalities (genomics/transcriptomics, proteomics, imaging/microscopy, screening, model organisms); a user of curated content to achieve research/business outcomes.
  • Experience with a data platform such as
  • Experience with vector databases and search (e.g., Weaviate, FAISS, pgvector) and AI/LLM frameworks (e.g., LiteLLM, LangChain, LlamaIndex) for retrieval-augmented generation and agent workflows.
  • Experience with OBO Foundry ontologies and modern frameworks such as LinkML, Biolink, and BioCypher; familiarity with graph database technologies (e.g., Neo4j, AWS Neptune) and semantic standards (OWL, RDF, SPARQL).
  • Experience creating lightweight semantic layers and AI/LLM‑assisted curation workflows (LiteLLM, FastMCP).

The salary range for Redwood City, CA:

  • Staff Software Engineer: $221,850 - $300,150

Exact compensation may vary based on skills, experience, and location.

For UK applicants, before submitting your application:

  • Please read the Altos Labs EU and UK Applicant Privacy Notice.
  • This Privacy Notice is not a contract, express or implied, and it does not set terms or conditions of employment.

Equal Opportunity Employment
We value collaboration and scientific excellence.

We believe that diverse perspectives and a culture of belonging are foundational to scientific innovation and inquiry. At Altos Labs, exceptional scientists and industry leaders from around the world work together to advance a shared mission. Our intentional focus is on Belonging, so that all employees know that they are valued for their unique perspectives. We are all accountable for sustaining an inclusive environment.

Altos Labs provides equal employment opportunities to all employees and applicants for employment, without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Altos prohibits unlawful discrimination and harassment. This policy applies to all terms and conditions of employment, including recruiting, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation and training.

Thank you for your interest in Altos Labs where we strive for a culture of scientific excellence, learning, and belonging.

Note: Altos Labs will not ask you to download a messaging app for an interview or outlay your own money to get started as an employee. If this sounds like your interaction with people claiming to be with Altos, it is not legitimate and has nothing to do with Altos. Learn more about a common job scam


