arxiv:2512.02231
Le Thien Phuc Nguyen
plnguyen2908
AI & ML interests
Computer Vision, NLP, Applied AI
Recent Activity
authored a paper about 4 hours ago: See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models
authored a paper about 4 hours ago: LASER: Lip Landmark Assisted Speaker Detection for Robustness
commented on a paper about 18 hours ago: See, Hear, and Understand: Benchmarking Audiovisual Human Speech Understanding in Multimodal Large Language Models