18 6

Brendan Slevin

brend007

brendan-slevin-ab7496a7

AI & ML interests

None yet

Recent Activity

liked a model about 4 hours ago

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

upvoted a paper 1 day ago

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

liked a dataset 1 day ago

WinkingFace/CryptoLM-Solana-SOL-USDT

View all activity

Organizations

None yet

liked a model about 4 hours ago

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Text Generation • 28B • Updated about 20 hours ago • 153 • 11

upvoted a paper 1 day ago

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

Paper • 2602.21534 • Published 4 days ago • 22

liked a dataset 1 day ago

WinkingFace/CryptoLM-Solana-SOL-USDT

Viewer • Updated Mar 19, 2025 • 32.3k • 74 • 10

liked a model 1 day ago

nvidia/Nemotron-Terminal-8B

Text Generation • 8B • Updated about 23 hours ago • 179 • 13

upvoted a paper 2 days ago

PyVision-RL: Forging Open Agentic Vision Models via RL

Paper • 2602.20739 • Published 4 days ago • 28

upvoted a paper 4 days ago

EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots

Paper • 2602.18071 • Published 9 days ago • 22

upvoted a paper 6 days ago

GLM-5: from Vibe Coding to Agentic Engineering

Paper • 2602.15763 • Published 11 days ago • 100

upvoted 2 articles 8 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

605

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

9 days ago

•

469

upvoted 2 papers 9 days ago

DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories

Paper • 2602.10809 • Published 17 days ago • 52

Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models

Paper • 2602.12036 • Published 16 days ago • 98

upvoted an article 11 days ago

Article

Forge: Scalable Agent RL Framework and Algorithm

16 days ago

•

127

upvoted a collection 11 days ago

Qwen3.5

Collection

13 items • Updated about 11 hours ago • 483

upvoted an article 13 days ago

Article

Custom Kernels for All from Codex and Claude

16 days ago

•

upvoted a paper 16 days ago

Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models

Paper • 2602.10224 • Published 18 days ago • 19

upvoted an article 23 days ago

Article

The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+

25 days ago

•

upvoted a paper 28 days ago

The Script is All You Need: An Agentic Framework for Long-Horizon Dialogue-to-Cinematic Video Generation

Paper • 2601.17737 • Published Jan 25 • 55

upvoted an article about 1 month ago

Article

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Jan 27

•

upvoted a paper 2 months ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published Dec 23, 2025 • 62

upvoted an article 3 months ago

Article

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

Dec 8, 2025

•

Brendan Slevin

AI & ML interests

Recent Activity

Organizations

brend007's activity

We Got Claude to Fine-Tune an Open Source LLM

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

Forge: Scalable Agent RL Framework and Algorithm

Custom Kernels for All from Codex and Claude

The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day