Emergent Misalignment via In-Context Learning: Narrow in-context examples can produce broadly misaligned LLMs Paper • 2510.11288 • Published Oct 13 • 48
Alchemist: Turning Public Text-to-Image Data into Generative Gold Paper • 2505.19297 • Published May 25 • 84
Quartet: Native FP4 Training Can Be Optimal for Large Language Models Paper • 2505.14669 • Published May 20 • 78
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Paper • 2504.06261 • Published Apr 8 • 110
The Ultra-Scale Playbook 🌌 • The ultimate guide to training LLMs on large GPU clusters • 3.55k
MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published Feb 20 • 192