-
nuprl/MultiPL-E
Viewer • Updated • 12.7k • 60.8k • 60 -
openai/openai_humaneval
Viewer • Updated • 164 • 140k • 364 -
Big Code Models Leaderboard
📈1.49kExplore and compare code generation models on a leaderboard
-
Copilot Evaluation Harness: Evaluating LLM-Guided Software Programming
Paper • 2402.14261 • Published • 10
Shaun
drgitt
AI & ML interests
None yet
Recent Activity
liked
a model
43 minutes ago
Lightricks/LTX-2
liked
a model
about 11 hours ago
drgitt/drgitt-flux-lora
liked
a model
5 months ago
Qwen/Qwen3-Embedding-0.6B
Organizations
None yet