2024's picture

2 13 1

2024

tgy2024

·

AI & ML interests

None yet

Organizations

None yet

commented 2 papers 3 months ago

Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs

Paper • 2510.16062 • Published Oct 17, 2025 • 1 •

Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs

Paper • 2510.16062 • Published Oct 17, 2025 • 1 •

commented 3 papers 8 months ago

MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks

Paper • 2505.16459 • Published May 22, 2025 • 45 •

MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks

Paper • 2505.16459 • Published May 22, 2025 • 45 •

MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks

Paper • 2505.16459 • Published May 22, 2025 • 45 •