{"id":"llm-benchmarks-mmlu-hellaswag-bbh-and-beyond-confident-ai","name":"LLM Benchmarks: MMLU, HellaSwag, BBH, and Beyond - Confident AI","desc":"MMLU/HellaSwag/BBH等主流LLM基准测试详解","url":"https://www.confident-ai.com/blog/llm-benchmarks-mmlu-hellaswag-and-beyond","category":"对话助手","tags":["LLM"],"rating":0,"users":null,"updated":"2026-04-08T00:00:00.000Z","source":"filipecalegario/awesome-generative-ai","addedAt":"2026-04-08T00:00:00.000Z","hot":false,"isNew":false}