AI Data Scientist, Evaluation

2 weeks ago


Chicago, United States Ironclad Full time

AI Data Scientist, Evaluation & Insights

Ironclad is the leading AI-powered contract lifecycle management platform, processing billions of contracts every year.

Every business is powered by contracts, but managing them can slow companies down and cost millions of dollars. Global innovators like L’Oréal, OpenAI, and Salesforce trust Ironclad to transform contracting into a strategic advantage - accelerating revenue, reducing risk, and driving efficiency. It’s the only platform that manages every type of contract workflow, whether a sales agreement, an HR agreement or a complex NDA.

We’re building the future of intelligent contracting and writing the narrative for how contracts unlock strategic growth. Forrester Wave and Gartner Magic Quadrant have consistently recognized Ironclad as a leader in our category. We’ve also been named one of Fortune’s Great Places to Work six years running, featured on Glassdoor’s Best Places to Work, and recognized by Forbes’ 50 Most Promising AI Companies.

We’re backed by leading investors like Accel, Sequoia, Y Combinator, and BOND. We’d love for you to join us.

About The Role

Ironclad is accelerating its investment in AI to redefine how legal teams manage and understand contracts. As part of this effort, we are hiring an AI Evaluation Engineer to work within our AI Pillar. This role is focused on unlocking insights from our training data, designing feedback loops, and ensuring the continuous improvement of our agentic and ML or LLM-based systems through data-driven evaluation and iteration.

You’ll partner closely with AI Engineers and Product Managers to drive better model quality through systematic analysis, experimentation, and the curation of high-leverage datasets. Your work will directly impact the effectiveness of features like Smart Import, contract understanding, and agentic workflows.

What You'll Be Doing

  • Analyze training and evaluation datasets to identify distributional gaps, labeling inconsistencies, and long-tail opportunities.
  • Design and execute labeling campaigns, including development of golden datasets and annotation guidelines.
  • Build and maintain dashboards that track model accuracy, regression trends, and product-specific KPIs like success rate or answer helpfulness.
  • Investigate failure modes via prompt clustering, error taxonomy development, and user intent classification.
  • Operationalize feedback loops: mine product telemetry and human-in-the-loop reviews for signal, then translate into data-driven model improvement strategies.
  • Partner with engineers and PMs to run structured A/B tests and human evaluations for new models or features.
  • Support the development of scalable data and evaluation infrastructure for LLMs and agents.
  • Work with product, engineering and legal to create clear & transparent processes for the handling of customer data in AI training, fine-tuning and evaluation.

About You

  • Bachelor's or Master's degree in a quantitative field (e.g., Statistics, Computer Science, Data Science, Applied Math).
  • 1–3 years of experience in applied ML or data science, preferably in NLP or LLM-based applications.
  • Strong SQL and Python skills; experience with Jupyter, Pandas, and experiment tracking tools.
  • Comfortable navigating ambiguity, slicing large datasets, and communicating insights clearly to cross-functional stakeholders.
  • Experience with prompt analysis, clustering, or user behavior modeling is a plus.
  • Bonus: familiarity with LLM eval techniques, Reinforcement Learning from Human Feedback (RLHF), or agentic system design.
  • Experience with program management.

Why This Role Matters

AI is critical to the value Ironclad customers get from their contracts, allowing their business to manage risk, close revenue faster and operate more effectively. None of this is possible without reliable and accurate data. This role will lead these efforts, becoming a key contributor to the development of AI solutions in an industry that is likely to be transformed by the new generation of models.

What We Value

  • Bias for action and data curiosity
  • Ownership mindset and team-first attitude
  • Comfort in fast-paced, iterative environments
  • Passion for building AI products that solve real-world customer problems

Pursuant to the San Francisco Fair Chance Ordinance, we will consider for employment qualified applicants with arrest and conviction records.

Compensation Range: $125K - $170K

Seniority level: Entry level
Employment type: Full-time
Industries: Technology, Information and Internet



  • Chicago, United States Ironclad Inc Full time

    AI Data Scientist, Evaluation & Insights. Ironclad is the leading AI-powered contract lifecycle...


  • Chicago, United States Tempus AI Full time

    Senior Data Scientist, AI RWD. Passionate about precision medicine and advancing the healthcare industry? Recent advancements in underlying technology have finally made it possible for AI to impact clinical care in a meaningful way. Tempus' proprietary...


  • Chicago, United States The Hartford Full time

    Sr Data Scientist - GD07AE / Data Scientist - GD08AE. We’re determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals – and to help others accomplish theirs, too. Join our team as we help shape the future. Step into the future with...


  • Chicago, United States BMO U.S. Full time

    Senior Data Scientist, Responsible AI. This role sits within the AI+ Research & Commercialization (ARC) team reporting to the Head of Responsible AI (RAI) Innovation & Governance. A dynamic opportunity for skilled data scientists who are passionate about shaping the future of responsible AI in financial...


  • Chicago, United States BMO Financial Group Full time

    Job Description This role sits within the AI+ Research & Commercialization (ARC) team reporting to the Head of Responsible AI (RAI) Innovation & Governance. A dynamic opportunity for skilled data scientists who are passionate about shaping the future of responsible AI in financial services, this role will lead a team of data scientists in the development of...


  • Chicago, United States Bank of Montreal Full time

    Application Deadline: 11/29/2025 Address: 320 S Canal Street Job Family Group: Data Analytics & Reporting Job Description This role sits within the AI+ Research & Commercialization (ARC) team reporting to the Head of Responsible AI (RAI) Innovation & Governance. A dynamic opportunity for skilled data scientists who are passionate about shaping the future of...


  • Chicago, United States BMO Financial Full time

    Application Deadline: 11/29/2025 Address: 320 S Canal Street Job Family Group: Data Analytics & Reporting Job Description This role sits within the AI+ Research & Commercialization (ARC) team reporting to the Head of Responsible AI (RAI) Innovation & Governance. A dynamic opportunity for skilled data scientists who are passionate about shaping the future of...


  • Chicago, United States BMO Financial Full time

    Application Deadline: 11/29/2025 Address: 320 S Canal Street Job Family Group: Data Analytics & Reporting Job Description This role sits within the AI+ Research & Commercialization (ARC) team reporting to the Head of Responsible AI (RAI) Innovation & Governance. A dynamic opportunity for skilled data scientists who are passionate about shaping the future of...


  • Chicago, IL, United States BMO Financial Group Full time

    Job Description This role sits within the AI+ Research & Commercialization (ARC) team reporting to the Head of Responsible AI (RAI) Innovation & Governance. A dynamic opportunity for skilled data scientists who are passionate about shaping the future of responsible AI in financial services, this role will lead a team of data scientists in the development of...