Running • Featured • FineWeb: decanting the web for the finest text data at scale • 1.29k • Explore the FineWeb dataset and its creation process
User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale • Paper • 2601.08225 • Published Jan 13 • 52
Running • FineVision: Open Data is All You Need • 218 • A new open-source dataset for training VLMs
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B • Text Generation • 31B • Updated Oct 10, 2025 • 49.6k • 800
Article • Tiny Agents in Python: a MCP-powered agent in ~70 lines of code • May 23, 2025 • 171