CausalEmbed: Auto-Regressive Multi-Vector Generation in Latent Space for Visual Document Embedding Paper • 2601.21262 • Published 20 days ago
Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models Paper • 2512.04981 • Published Dec 4, 2025
Explainable and Interpretable Multimodal Large Language Models: A Comprehensive Survey Paper • 2412.02104 • Published Dec 3, 2024
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning Paper • 2502.02871 • Published Feb 5, 2025
EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models Paper • 2502.11916 • Published Feb 17, 2025
MemOS: An Operating System for Memory-Augmented Generation (MAG) in Large Language Models Paper • 2505.22101 • Published May 28, 2025
MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model Paper • 2406.11193 • Published Jun 17, 2024