onejune2018/Awesome-LLM-Eval
Evaluation & TestingAwesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs and models, mainly for Evaluation on LLMs. 一个由工具、基准/数据、演示、排行榜和大模型等组成的精选列表,主要面向基础大模型评测,旨在探求生成式AI的技术边界.
Has docs site (https://arxiv.org/abs/2508.18646). Description: 198 chars. Stars signal: 631. Contributors: 5. Score: 5.7/10
Stars: 631. Contributors: 5. Watchers: 8. Forks: 58. Issue ratio: 2.2%. Score: 3.9/10
Last commit: 143d ago. Weekly commits: 0. No releases published. Maturity bonus: 3.0y old. Score: 3.9/10
Stars/issues ratio: 45. Has documentation site. Permissive license: MIT. Popularity signal: 631 stars. Score: 6.9/10
Battle-tested: 631 stars. Peer review: 5 contributors. No versioned releases. Licensed: MIT. Age: 3.0 years. Maintenance: last commit 143d ago. Score: 3.8/10
Fork interest: 58. Integration-friendly: MIT. Adoption: 631 stars. Has web presence. Score: 5.3/10