Data Engineer AI Systems

2 days ago


St Louis, Missouri, United States MAK Technologies LLC Full time

Job Title:
Data Engineer - AI Systems

6 Months

St. Louis, MO Day 1 onsite role

Data Engineer – AI Systems (Databricks)

Primary Skills:
Data Engineer, Databricks, Python, PySpark, AI/ML

We'rebuilding intelligent, Databricks-powered AI systems that structure and activate information from diverse enterprise sources (Confluence, OneDrive, PDFs, andmore). As a
Data Engineer
, you'll design and optimize the data pipelinesthat transform raw and unstructured content into clean, AI-ready datasets formachine learning and generative AI agents.

You'llcollaborate with a cross-functional team of Machine Learning Engineers,Software Developers, and domain experts to create high-quality data foundationsthat power Databricks-native AI agents and retrieval systems.

KeyResponsibilities

  • Develop Scalable Pipelines:
    Design, build, and maintain high-performance ETL and ELT workflows using Databricks, PySpark, and Delta Lake.
  • Data Integration:
    Build APIs and connectors to ingest data from collaboration platforms such as Confluence, OneDrive, and other enterprise systems.
  • Unstructured Data Handling:
    Implement extraction and transformation pipelines for text, PDFs, and scanned documents using Databricks OCR and related tools.
  • Data Modeling:
    Design Delta Lake and Unity Catalog data models for both structured and vectorized (embedding-based) data stores.
  • Data Quality & Observability:
    Apply validation, version control, and quality checks to ensure pipeline reliability and data accuracy.
  • Collaboration:
    Work closely with ML Engineers to prepare datasets for LLM fine-tuning and vector database creation, and with Software Engineers to deliver end-to-end data services.
  • Performance & Automation:
    Optimize workflows for scale and automation, leveraging Databricks Jobs, Workflows, and CI/CD best practices.

What YouBring

  • Experience with
    data engineering, ETL development
    , or
    data pipeline automation
    .
  • Proficiency in
    Python
    ,
    SQL
    , and
    PySpark
    .
  • Hands-on experience with
    Databricks
    ,
    Spark
    , and
    Delta Lake
    .
  • Familiarity with
    data APIs
    ,
    JSON
    , and unstructured data processing (OCR, text extraction).
  • Understanding of
    data versioning
    ,
    schema evolution
    , and
    data lineage
    concepts.
  • Interest in
    AI/ML data pipelines
    ,
    vector databases
    , and
    intelligent data systems
    .

BonusSkills

  • Experience with
    vector databases
    (e.g., Pinecone, Chroma, FAISS) or Databricks'
    Vector Search
    .
  • Exposure to
    LLM-based architectures
    ,
    LangChain
    , or
    Databricks Mosaic AI
    .
  • Knowledge of
    data governance frameworks
    ,
    Unity Catalog
    , or
    access control
    best practices.

Familiarity with
REST API development
or
data synchronization services
(e.g., Airbyte, Fivetran, custom connectors


  • Lead AI Engineer

    7 days ago


    St Louis, Missouri, United States Equifax Full time

    Equifax is seeking a visionary AI engineer to lead our technology transformation initiative. In this role, you will lead a talented team in architecting and deploying cutting-edge, cloud-native solutions for a large enterprise. You will be at the forefront of modern development, employing vibe coding concepts with AI-powered coding assistants like GitHub...


  • St Louis, Missouri, United States Scale AI Full time $162,800 - $203,500 per year

    Our Security team works on operational issues at the leading edge of machine learning technology. You will join a creative and solutions-oriented team collaborating with internal teams at Scale and externally with our customers. Scale is looking for an experienced security and compliance professional to support Assessment and Authorization and agency audit...


  • St Louis, Missouri, United States National Information Solutions Cooperative (NISC) Full time $80,000 - $140,000 per year

    NISC develops and implements enterprise-level and customer-facing software solutions for over 960+ utilities and broadbands across North America. Our mission is to deliver technology solutions and services that are Member-focused, quality driven and valued priced. We exist to serve our Members and help them serve their communities through our innovative...

  • Data Engineer II

    3 days ago


    St Louis, Missouri, United States McCarthy Building Companies, Inc. Full time

    McCarthy Holdings, Inc. (McCarthy), is the holding entity for McCarthy Building Companies, Inc., the oldest privately-held national construction company in America, and Castle Contracting. McCarthy provides the crucial business infrastructure for these entities and connects the day-to-day operations to ensure seamless operations across the business....


  • St Louis, Missouri, United States Intellectix Full time

    Senior Systems Engineer (SAS)Location: St. Louis, MOClearance: TS/SCICitizenship: US Citizenship RequiredWhether running countries, corporations, or courtrooms, our clients bear the responsibility for shaping the lives of millions. At Intellectix, we are on a mission to empower them with innovative technical solutions and drive transformational change. As a...


  • St Louis, Missouri, United States Amazon Full time $143,300 - $247,600

    Application deadline: Applications will be accepted on an ongoing basisAre you excited to help the US Intelligence Community design, build, and implement AI algorithms, including advanced Generative AI solutions, to augment decision making while meeting the highest standards for reliability, transparency, and scalability? The Amazon Web Services (AWS) US...

  • Head of Engineering

    3 days ago


    St Louis, Missouri, United States Rezilient Health Full time

    At Rezilient, we're reimagining how primary and specialty care are delivered, making them more accessible, connected, and patient-centered than ever before. Through our hybrid CloudClinic model, we bring together in-person Medics and Care Teams with virtual Providers to deliver timely, tech-enabled care. By removing barriers to access and creating seamless...

  • Systems Engineer

    1 week ago


    St Louis, Missouri, United States Tulk Llc Full time

    Systems Engineer - ISP IntegrationAbout Us: TULK is a niche boutique consulting firm specializing in technology and management consulting for the US Federal Government. We empower Defense and National Security clients to tackle their most challenging issues by guiding them in acquiring, designing, managing, and developing cutting-edge technology systems and...

  • Systems Engineer

    7 days ago


    St Louis, Missouri, United States Peraton Full time $104,000 - $166,000

    ResponsibilitiesAs a Systems Engineer, you'll bring a multi-disciplinary approach to solving complex technical challenges across the National System for Geospatial-Intelligence (NSG), Allied System for Geospatial-Intelligence (ASG), and partner Federal Agencies. Your work will be instrumental in delivering timely, accurate, and mission-critical GEOINT.In...

  • Systems Engineer

    2 weeks ago


    St Louis, Missouri, United States Peraton Full time $104,000 - $166,000

    ResponsibilitiesAs a Mid-Level Systems Engineer, you'll bring a multi-disciplinary approach to solving complex technical challenges across the National System for Geospatial-Intelligence (NSG), Allied System for Geospatial-Intelligence (ASG), and partner Federal Agencies. Your work will be instrumental in delivering timely, accurate, and mission-critical...