Dokyoon

leeloolee

Eruly

Recent Activity

liked a dataset about 23 hours ago

Alibaba-NLP/EcomBench

reacted to ovi054's post with 🔥 1 day ago

Anim Lab AI⚡ Turn any math concept or logic into a clear video explanation instantly using AI. This is my submission for the MCP 1st Birthday Hackathon, and it’s already crossed 1,000 runs. 👉 Try it now: https://huggingface.co/spaces/MCP-1st-Birthday/anim-lab-ai Demo outputs are attached 👇

upvoted an article 4 days ago

Building Deep Research: How we Achieved State of the Art

upvoted an article 4 days ago

Building Deep Research: How we Achieved State of the Art

Article • 15 days ago

upvoted a paper 5 days ago

Qwen3-VL Technical Report

Paper • 2511.21631 • Published 13 days ago • 114

upvoted a paper 20 days ago

Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models

Paper • 2511.08577 • Published 28 days ago • 104

upvoted an article 26 days ago

Optimizing Mixture-of-Experts Training: A Cost-Effective, Two-Sided Approach

Article • Sep 30

upvoted a paper about 1 month ago

Follow the Flow: Fine-grained Flowchart Attribution with Neurosymbolic Agents

Paper • 2506.01344 • Published Jun 2 • 6

upvoted a collection about 2 months ago

LLaDA 2.0

Collection

4 items • Updated 13 days ago • 21

upvoted 8 papers 3 months ago

THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning

Paper • 2509.13761 • Published Sep 17 • 16

Visual-TableQA: Open-Domain Benchmark for Reasoning over Table Images

Paper • 2509.07966 • Published Sep 9 • 4

Virtual Agent Economies

Paper • 2509.10147 • Published Sep 12 • 26

Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents

Paper • 2509.09265 • Published Sep 11 • 46

Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models

Paper • 2508.21365 • Published Aug 29 • 29

upvoted an article 3 months ago

Learn the Hugging Face Kernel Hub in 5 Minutes

Article • Jun 12 • 151

upvoted 3 papers 4 months ago

Fin-PRM: A Domain-Specialized Process Reward Model for Financial Reasoning in Large Language Models

Paper • 2508.15202 • Published Aug 21 • 4

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 88

Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models

Paper • 2508.09138 • Published Aug 12 • 37

upvoted a collection 4 months ago

🔍 Interpretability & Analysis of LMs

Collection

Outstanding research in LM interpretability and evaluation, summarized • 134 items • Updated Oct 20 • 116

upvoted an article 5 months ago

DABStep: Data Agent Benchmark for Multi-step Reasoning

Article • Feb 4 • 121
