Mishig Davaadorj

mishig

AI & ML interests

NP-completeness, grammars, universality

Recent Activity

updated a dataset about 10 hours ago

huggingchat/papers-content

liked a model about 14 hours ago

zai-org/GLM-5

commented on a paper about 21 hours ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

commented 2 papers about 21 hours ago

OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration

Paper • 2602.05400 • Published 7 days ago • 289

Towards Scalable Pre-training of Visual Tokenizers for Generation

Paper • 2512.13687 • Published Dec 15, 2025 • 105

commented 2 papers 2 days ago

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Paper • 2602.07026 • Published 10 days ago • 129

Learning a Generative Meta-Model of LLM Activations

Paper • 2602.06964 • Published 6 days ago • 2

commented a paper 4 days ago

Generative Modeling via Drifting

Paper • 2602.04770 • Published 8 days ago • 3

New activity in hf-doc-build/doc-build 7 days ago

Upload v3.8.1.zip

#54 opened 7 days ago by

commented a paper 13 days ago

Shaping capabilities with token-level data filtering

Paper • 2601.21571 • Published 14 days ago • 26

New activity in hf-doc-build/doc-build 27 days ago

Delete jobs docs

#53 opened 27 days ago by

New activity in hf-doc-build/doc-build 28 days ago

reachy_mini

#52 opened 29 days ago by

commented a paper 30 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published Jan 8 • 225

commented 4 papers about 1 month ago

Guiding a Diffusion Transformer with the Internal Dynamics of Itself

Paper • 2512.24176 • Published Dec 30, 2025 • 8

UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision

Paper • 2601.03193 • Published Jan 6 • 47

InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields

Paper • 2601.03252 • Published Jan 6 • 101

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 237

New activity in mishig/test-tips 4 months ago

ask

#1 opened 4 months ago by

New activity in reach-vb/TinyLlama-1.1B-Chat-v1.0-q4_k_m-GGUF 4 months ago

Update metadata (tokenizer.chat_template)

#112 opened 4 months ago by

New activity in reach-vb/TinyLlama-1.1B-Chat-v1.0-Q2_K-GGUF 4 months ago

Update metadata (tokenizer.chat_template)

#3 opened 4 months ago by

Update metadata (tokenizer.chat_template)

#2 opened 4 months ago by

Update metadata (tokenizer.chat_template)

#1 opened 4 months ago by

New activity in mishig/xet-gguf-edit-test 4 months ago

benchmark

#21 opened 4 months ago by