SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training • Paper • 2501.17161 • Published Jan 28, 2025 • 124
Systran/faster-whisper-large-v3 • Automatic Speech Recognition • Updated Nov 23, 2023 • 517k • 514
nomic-ai/nomic-embed-text-v2-moe • Sentence Similarity • 0.5B • Updated Apr 1, 2025 • 1.19M • 450