Article • The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator • 23 days ago • 44
Article • Tokenization in Transformers v5: Simpler, Clearer, and More Modular • 22 days ago • 108
Article • Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture • 4 days ago • 31
Dynamic Large Concept Models: Latent Reasoning in an Adaptive Semantic Space Paper • 2512.24617 • Published 9 days ago • 54
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 399
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning Paper • 2511.22570 • Published Nov 27, 2025 • 86
Lovelace-1 Collection First Edition of the Lovelace Coding Family • 2 items • Updated 15 days ago • 1
VLAI: A RoBERTa-Based Model for Automated Vulnerability Severity Classification Paper • 2507.03607 • Published Jul 4, 2025 • 7
End-to-End Training for Autoregressive Video Diffusion via Self-Resampling Paper • 2512.15702 • Published 22 days ago • 14