Inference Optimized Checkpoints (with Model Optimizer) Collection A collection of generative models quantized and optimized for inference with Model Optimizer. • 52 items • Updated about 22 hours ago • 83
FiNERweb: Datasets and Artifacts for Scalable Multilingual Named Entity Recognition Paper • 2512.13884 • Published Dec 15, 2025 • 15
What Layers When: Learning to Skip Compute in LLMs with Residual Gates Paper • 2510.13876 • Published Oct 13, 2025 • 11