Factorizing Perception and Policy for Interactive Instruction Following Paper • 2012.03208 • Published Dec 6, 2020
Context-Aware Planning and Environment-Aware Memory for Instruction Following Embodied Agents Paper • 2308.07241 • Published Aug 14, 2023
Story Visualization by Online Text Augmentation with Context Memory Paper • 2308.07575 • Published Aug 15, 2023 • 1
Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback Paper • 2402.03746 • Published Feb 6, 2024
Multi-Level Compositional Reasoning for Interactive Instruction Following Paper • 2308.09387 • Published Aug 18, 2023
Online Continual Learning on Hierarchical Label Expansion Paper • 2308.14374 • Published Aug 28, 2023
Online Continual Learning For Interactive Instruction Following Agents Paper • 2403.07548 • Published Mar 12, 2024
Just Say the Name: Online Continual Learning with Category Names Only via Data Generation Paper • 2403.10853 • Published Mar 16, 2024
i-SRT: Aligning Large Multimodal Models for Videos by Iterative Self-Retrospective Judgment Paper • 2406.11280 • Published Jun 17, 2024
ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments Paper • 2407.18550 • Published Jul 26, 2024
Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples Paper • 2412.17288 • Published Dec 23, 2024 • 1
Society of Mind Meets Real-Time Strategy: A Hierarchical Multi-Agent Framework for Strategic Reasoning Paper • 2508.06042 • Published Aug 8 • 2
Society of Mind Meets Real-Time Strategy: A Hierarchical Multi-Agent Framework for Strategic Reasoning Paper • 2508.06042 • Published Aug 8 • 2