MihailSlutsky's picture

MihailSlutsky

MihailSlutsky

·

AI & ML interests

None yet

Recent Activity

liked a dataset 12 days ago

longvideobench/LongVideoBench

upvoted a paper 14 days ago

Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities

liked a dataset 24 days ago

SpatialVID/SpatialVID

View all activity

Organizations

None yet

liked a dataset 12 days ago

longvideobench/LongVideoBench

Viewer • Updated Oct 14, 2024 • 6.68k • 8.56k • 36

upvoted a paper 14 days ago

Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities

Paper • 2601.21937 • Published 25 days ago • 19

liked 2 datasets 24 days ago

SpatialVID/SpatialVID

Viewer • Updated Dec 15, 2025 • 2.72M • 24.4k • 37

FelixYuan/SpatialVID-HQ

Viewer • Updated Dec 15, 2025 • 365k • 10.3k • 29

liked a model 25 days ago

robbyant/lingbot-world-base-cam

Image-to-Video • Updated 21 days ago • 313

liked a dataset about 1 month ago

theairlabcmu/TartanGround

Updated Oct 14, 2025 • 10.3k • 2

upvoted 14 papers about 1 month ago

Imagine-then-Plan: Agent Learning from Adaptive Lookahead with World Models

Paper • 2601.08955 • Published Jan 13 • 13

Flow Equivariant World Models: Memory for Partially Observed Dynamic Environments

Paper • 2601.01075 • Published Jan 3 • 6

V-DPM: 4D Video Reconstruction with Dynamic Point Maps

Paper • 2601.09499 • Published Jan 14 • 9

LSRIF: Logic-Structured Reinforcement Learning for Instruction Following

Paper • 2601.06431 • Published Jan 10 • 12

ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback

Paper • 2601.10156 • Published Jan 15 • 26

Action100M: A Large-scale Video Action Dataset

Paper • 2601.10592 • Published Jan 15 • 29

Transition Matching Distillation for Fast Video Generation

Paper • 2601.09881 • Published Jan 14 • 33

Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding

Paper • 2601.10611 • Published Jan 15 • 29

CoF-T2I: Video Models as Pure Visual Reasoners for Text-to-Image Generation

Paper • 2601.10061 • Published Jan 15 • 30

DanQing: An Up-to-Date Large-Scale Chinese Vision-Language Pre-training Dataset

Paper • 2601.10305 • Published Jan 15 • 36

Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning

Paper • 2601.07641 • Published Jan 12 • 47

Urban Socio-Semantic Segmentation with Vision-Language Reasoning

Paper • 2601.10477 • Published Jan 15 • 155

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 193

PhyRPR: Training-Free Physics-Constrained Video Generation

Paper • 2601.09255 • Published Jan 14 • 3