Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
alexngai
's Collections
Latent Reasoning
Agent Eval
RL Agents
Autonomous Research
Memory/Search/Retrieval/RAG
Self-Critique
Automated Research
Test-Time Compute/Optimal Scaling
General LLM
Automated SWE
Code LLMs
Multi-Agent
Self-Improving Agents
Automated ML
Codegen Benchmarks
Agent Eval
updated
about 12 hours ago
Upvote
-
Survey on Evaluation of LLM-based Agents
Paper
•
2503.16416
•
Published
Mar 20
•
95
OAgents: An Empirical Study of Building Effective Agents
Paper
•
2506.15741
•
Published
Jun 17
•
35
Upvote
-
Share collection
View history
Collection guide
Browse collections