Leaderboards
Meaningful leaderboards showcasing LLM evaluation results across various tasks and dimensions
InferBench 🥇 • Running • 17 • A cost/quality/speed Leaderboard for Inference Providers!
MTEB Leaderboard 🥇 • Running on CPU Upgrade • 7.07k • Embedding Leaderboard
Open LLM Leaderboard 🏆 • Running on CPU Upgrade • 13.9k • Track, rank and evaluate open LLMs and chatbots
Arena Leaderboard 🏆 • Running • 4.73k • View the latest LMArena model leaderboard
Instruction Models
Popular instruction models to be run with Inference Endpoints
meta-llama/Llama-3.2-1B-Instruct Text Generation • Updated Oct 24, 2024 • 3.42M • 1.3k
meta-llama/Llama-3.2-3B-Instruct Text Generation • Updated Oct 24, 2024 • 2.14M • 2k
meta-llama/Llama-3.3-70B-Instruct Text Generation • Updated Dec 21, 2024 • 794k • 2.67k
meta-llama/Llama-3.1-8B-Instruct Text Generation • Updated Sep 25, 2024 • 6.51M • 5.5k
sdiazlor/modernbert-embed-base-crossencoder-human-rights Text Ranking • 0.1B • Updated Jun 16, 2025 • 4
sdiazlor/deepseek-r1-distill-qwen-1.5-unsloth-sft-python-60-steps Text Generation • 2B • Updated Feb 9, 2025 • 4
sdiazlor/modernbert-embed-base-crossencoder-human-rights-1-epoch Text Classification • 0.1B • Updated Jan 19, 2025 • 1
sdiazlor/modernbert-embed-base-biencoder-human-rights Sentence Similarity • 0.1B • Updated Jan 19, 2025 • 2