ABBL: NextGen LLM Benchmark & Leaderboard for evaluating Arabic models May 18, 2025 • 3
SILMA RAGQA V1.0: A Comprehensive Benchmark for Evaluating LLMs on RAG QA Use-Cases Dec 18, 2024 • 1