Data Engineer

1 day ago


New York, New York, United States Redrob Full time $100,000 - $120,000 per year

Data Engineer

Redrob | New York, NY | Hybrid

About Redrob

Redrob is a Series A blitzscaling startup building the Android to ChatGPT's iPhone - creating accessible, application-layer LLMs that democratize AI for businesses and professionals worldwide. With $14M in funding from top-tier investors (including early backers of SpaceX and Lyft) and the South Korean government, we're on a mission to bridge the technology gap between developed and emerging markets.

Our global presence spans Seoul, New Delhi, Mumbai, and New York, positioning us at the forefront of making AI economically viable for every individual worldwide.

The Role

We're seeking a
data engineer who bridges software, MLOps, and infrastructure
, passionate about supporting real-world AI products. You should enjoy working hands-on with data pipelines, large-scale AWS services, and machine learning environments.

You'll be working directly with the
AI Engineering Team
driving the Redrob LLM roadmap, and together you'll shape the backbone that feeds our fine-tuned AI systems and deliver a production-ready model.

What You'll Do

  • Design and implement scalable data pipelines
    on AWS to ingest, clean, and transform structured and unstructured HR/Sales datasets (e.g., CRM exports, chat logs, resumes, support tickets).
  • Develop and manage data lake architecture
    using AWS S3, Glue Catalog, Athena, and Redshift to support model training, analytics, and evaluation workflows.
  • Integrate directly with SageMaker
    — building preprocessing pipelines, training data manifests, and calibration datasets for distillation, quantization, and fine-tuning.
  • Automate ETL and data validation
    with Glue, Lambda, or Step Functions to enforce schema integrity and ensure JSONL compliance for instruction-tuning datasets.
  • Implement robust data governance and lineage tracking
    using SageMaker Lineage, Glue Data Catalog, and LakeFS or MLflow for reproducibility and auditability.
  • Optimize data workflows for performance and cost
    through Spot usage, S3 Select, and Glue job tuning.
  • Ensure security and compliance
    with enterprise-grade controls — IAM roles, KMS encryption, private VPC endpoints, and data access monitoring via CloudWatch.
  • Collaborate cross-functionally
    with AI, backend, and DevOps teams to integrate high-quality datasets into LLM training pipelines and downstream APIs.

Requirements

  • 1+ years of experience in
    data engineering
    or
    ML data infrastructure
    .
  • Strong proficiency in
    Python
    (Pandas, PySpark, or Dask).
  • Deep knowledge of
    AWS services
    like S3, Glue, Athena, Redshift, Step Functions, Lambda, IAM, and KMS.
  • Experience designing
    ETL pipelines
    using Airflow, Glue Workflows, or Step Functions.
  • Familiarity with
    SageMaker Processing and Training
    workflows (data ingestion, manifests, and job orchestration).
  • Strong SQL and data modeling skills for analytical and ML workloads.
  • Experience implementing
    data versioning and lineage tracking
    (LakeFS, DVC, MLflow, or similar).
  • Solid understanding of
    data security, encryption, and access management
    within AWS environments.
  • Must be authorized to work in the United States (we cannot sponsor work visas at this time)

What We Offer

  • Base Compensation:
    $100,000 - $120,000 USD
  • Benefits:
    Comprehensive health, dental, and vision insurance
  • Retirement:
    401(k) plan
  • Time Off:
    Unlimited PTO policy
  • Work Style:
    Hybrid flexibility with our NYC office at 1 Penn Plaza
  • Growth:
    Join a rapidly scaling startup with opportunities for accelerated career growth
  • Impact:
    Work on products that democratize AI access for millions globally

Why Join Redrob?

This is a rare opportunity to join a well-funded startup at an inflection point, working on technology that rivals the biggest names in AI. You'll have unprecedented exposure to senior leadership, direct impact on product strategy, and the chance to grow your career alongside a company poised for explosive growth.

If you're ready to build the future of accessible AI and make your mark on the global technology landscape, we want to hear from you.

Apply now and help us build the world's most accessible, end-to-end AI stack.

Redrob is an equal opportunity employer committed to building a diverse and inclusive team.


  • Data Engineer

    4 days ago


    New York, New York, United States SoHo Dragon Full time $120,000 - $200,000 per year

    Position: Data EngineerLocation:New York City(Hybrid 3x in office)Employment Type:ContractContract Duration: 1-2 yearsThe data engineer is responsible for designing, building, and maintaining the systems and infrastructure needed for data storage, processing, and analysis. The data engineer will work with a multidisciplinary Agile team to build high-quality...

  • Data Engineer

    4 days ago


    New York, New York, United States Trinity Technology Solutions LLC Full time $100,000 - $170,000 per year

    Job Title: Data EngineerJob Location: NYC (Onsite)Job Summary:We are seeking a skilledData Engineerto design, develop, and maintain data pipelines on our modern data platform. The ideal candidate will have hands-on experience withAzure Data Lakehouse, Databricks, and Python/SQL, and will collaborate with analytics, engineering, and business teams to deliver...

  • Data Engineer

    5 days ago


    New York, New York, United States JetBlue Full time $100,000 - $128,600 per year

    Position SummaryThe Data Engineer is responsible for integrating and modeling data in JetBlue's modern data stack to support analysts, business intelligence users, data scientists, and decision-makers across the company. The Data Engineer must have a deep understanding of Structured Query Language (SQL) and be familiar with Snowflake, dbt (data build tool),...

  • Data Engineering

    6 days ago


    New York, New York, United States Goldman Sachs Full time $115,000 - $180,000

    Who We Are: Market Data EngineeringMarket Data Engineering is part of the firm's Data Engineering group handling global access to financial market data sourced both internally and externally from the firm. The team is currently undertaking a project to revolutionize data ingestion, curation and distribution and is looking for talented engineers to ensure...

  • Data engineer

    1 day ago


    New York, New York, United States Writer Full time $157,800 - $199,500 per year

    About this roleWe're looking for a Data Engineer to help design, build, and scale the data infrastructure that powers our analytics, reporting, and product insights. As part of a small but high-impact Data team, you'll define the architectural foundation and tooling for our end-to-end data ecosystem.You'll work closely with engineering, product, and business...

  • Data Engineer

    2 days ago


    New York, New York, United States BeaconFire Inc. Full time $80,000 - $120,000 per year

    Hi, Rameez here from Beaconfire. I hope you're doing well We're currently hiring for an exciting Associate Data Engineer role, and I wanted to reach out to see if you or someone in your network might be interested.About the CompanyBeaconFire is based in Central NJ, specializing in Software Development, Web Development, and Business Intelligence; looking for...

  • Data Engineer

    4 days ago


    New York, New York, United States SynapOne Full time $120,000 - $200,000 per year

    Job Title: Data Engineering ContractorDepartment: TechnologyAbout the Role:Join our dynamic team of talented engineers with a proven track record in constructing datawarehouses, lakes, and pipelines. We are on a mission to fuel Unite Us' ongoing expansion andenhance its positive influence on both the healthcare industry and individuals nationwide. As partof...

  • Lead Data Engineer

    2 hours ago


    New York, New York, United States TalentOla Full time

    Lead Data Engineer / Data ArchitectNew York City, NY (Onsite – 5 days/week)Tech skill sets: Data Architecture, Data Modelling, Expert level SQL, Python, AWS Technologies (Glue), Snowflake and data engineering design patternsDomain:Asset management, Alternatives Investments, Financial Services.About the Role:We are seeking an experienced Lead Data Engineer...

  • Lead Data Engineer

    4 days ago


    New York, New York, United States TEK NINJAS Full time $120,000 - $180,000 per year

    Job Title: Lead Data Engineer (Asset management/Alternatives Investments)Location: New York City, NY; (Onsite – 5 days/week)Duration: 6-12+ MonthsTech skill sets: Data Architecture, Data Modelling, Expert level SQL, Python, AWS Technologies (Glue), Snowflake and data engineering design patternsDomain:Asset management, Alternatives Investments, Financial...

  • Data Engineer

    4 days ago


    New York, New York, United States iCapital Full time $120,000 - $160,000 per year

    About The RoleiCapital is looking for an Associate or Assistant Vice President Data Engineer to join our Data and Analytics team. This individual will help build the data pipelines and infrastructure required to make data a central part of iCapital. iCapital's business runs on data, as a result, this role will not only be able to support data movement but...