LAUNCH Lab

university

https://launch.eecs.umich.edu/

AI & ML interests

Factuality, reasoning, alignment, LLM applications

Recent Activity

jpeper published a dataset about 20 hours ago

launch/LudoBench

jpeper published a Space about 20 hours ago

launch/LudoBench

jpeper updated a dataset about 20 hours ago

launch/LudoBench

View all activity

Papers

Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation

View all Papers

launch 's Spaces 7

LudoBench

Multimodal Game Reasoning Benchmark [ICLR 2026]

Answer Convergence Early Stopping

Demo for EMNLP Paper "Answer Convergence as a Signal..."

FactRBench

View and analyze long-form factuality leaderboard

ExpertLongBench

Leaderboard for ExpertLongBench

ManyICLBench

Leaderboard for ManyICLBench

MLRC-BENCH

Display model performance rankings

Factbench

View and compare language model factuality scores