Yeongtak's picture

5 24 3

Yeongtak

Yeongtak

·

oyt9306

AI & ML interests

None yet

Recent Activity

updated a model 6 days ago

Yeongtak/PMLLM-Qwen3-VL-8B-GSPO-n-gram-wo-radom-prompt

published a model 6 days ago

Yeongtak/PMLLM-Qwen3-VL-8B-GSPO-n-gram-wo-radom-prompt

updated a model 9 days ago

Yeongtak/PMLLM-Qwen3-VL-8B-GSPO-radom-prompt

View all activity

Organizations

None yet

upvoted a paper 14 days ago

VLA-4D: Embedding 4D Awareness into Vision-Language-Action Models for SpatioTemporally Coherent Robotic Manipulation

Paper • 2511.17199 • Published 17 days ago • 7

upvoted 2 papers 2 months ago

MMPB: It's Time for Multi-Modal Personalization

Paper • 2509.22820 • Published Sep 26 • 14

Democratizing AI scientists using ToolUniverse

Paper • 2509.23426 • Published Sep 27 • 39

upvoted a paper 3 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 193

upvoted 3 papers 4 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 88

Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments

Paper • 2508.08791 • Published Aug 12 • 16

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 263

upvoted a paper 5 months ago

Energy-Based Transformers are Scalable Learners and Thinkers

Paper • 2507.02092 • Published Jul 2 • 69

upvoted 4 papers 6 months ago

RePIC: Reinforced Post-Training for Personalizing Multi-Modal Language Models

Paper • 2506.18369 • Published Jun 23 • 2

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10 • 104

Reinforcement Pre-Training

Paper • 2506.08007 • Published Jun 9 • 263

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 187

upvoted a paper 10 months ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published Feb 18 • 58

upvoted 6 papers about 1 year ago

Large-Scale Text-to-Image Model with Inpainting is a Zero-Shot Subject-Driven Image Generator

Paper • 2411.15466 • Published Nov 23, 2024 • 39

OminiControl: Minimal and Universal Control for Diffusion Transformer

Paper • 2411.15098 • Published Nov 22, 2024 • 61

Style-Friendly SNR Sampler for Style-Driven Generation

Paper • 2411.14793 • Published Nov 22, 2024 • 39

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 130

Simplifying, Stabilizing and Scaling Continuous-Time Consistency Models

Paper • 2410.11081 • Published Oct 14, 2024 • 18

Differential Transformer

Paper • 2410.05258 • Published Oct 7, 2024 • 179

upvoted a paper over 1 year ago

Guiding a Diffusion Model with a Bad Version of Itself

Paper • 2406.02507 • Published Jun 4, 2024 • 17