Runtime error Featured 2.95k The Smol Training Playbook š 2.95k The secrets to building world-class LLMs
Artificial Hippocampus Networks for Efficient Long-Context Modeling Paper ⢠2510.07318 ⢠Published Oct 8, 2025 ⢠31
Muon Outperforms Adam in Tail-End Associative Memory Learning Paper ⢠2509.26030 ⢠Published Sep 30, 2025 ⢠20
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper ⢠2509.26507 ⢠Published Sep 30, 2025 ⢠547
iamtarun/python_code_instructions_18k_alpaca Viewer ⢠Updated Jul 27, 2023 ⢠18.6k ⢠4.73k ⢠321