LAUNCH Lab

university
https://launch.eecs.umich.edu/
launchnlp

AI & ML interests

Factuality, reasoning, alignment, LLM applications

Recent Activity

jpeper published a dataset about 15 hours ago
launch/LudoBench
jpeper published a Space about 15 hours ago
launch/LudoBench
jpeper updated a dataset about 15 hours ago
launch/LudoBench

Papers

Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation


Members: Lu Wang, Yujian Liu, Shuyang Cao, Xinliang Frederick Zhang, xinyu hua, Yunxiang Zhang, Lechen Zhang, Farima Fatahi, sheza munir, KJ, Joe Peper, Lee, Shitanshu Bhushan, Xin Liu, Muhammad Khalifa, Jie Ruan

launch's Spaces 7

Running

LudoBench

🎲

Multimodal Game Reasoning Benchmark [ICLR 2026]

about 15 hours ago
Running

Answer Convergence Early Stopping

🛑

Demo for EMNLP Paper "Answer Convergence as a Signal..."

Jan 4
Sleeping

FactRBench

🏆

View and analyze long-form factuality leaderboard

Nov 3, 2025
Running
3

ExpertLongBench

🚀

Leaderboard for ExpertLongBench

Sep 28, 2025
Sleeping
1

ManyICLBench

🚀

Leaderboard for ManyICLBench

Jun 20, 2025
Running

MLRC-BENCH

📊

Display model performance rankings

Apr 16, 2025
Running
3

Factbench

📈

View and compare language model factuality scores

Oct 30, 2024