WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality Paper • 2510.18560 • Published Oct 21 • 1
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published Oct 30 • 115
BBA: Bi-Modal Behavioral Alignment for Reasoning with Large Vision-Language Models Paper • 2402.13577 • Published Feb 21, 2024 • 9
Mitigating Hallucinations of Large Language Models via Knowledge Consistent Alignment Paper • 2401.10768 • Published Jan 19, 2024 • 2
Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration Paper • 2310.09168 • Published Oct 13, 2023 • 2