AI Bits and Pieces
Community
Classroom
Calendar
Members
Map
Leaderboards
About
LLMs - Benchmark Stats Made Easy
0%
Introduction
5 Key LLM Benchmarks
1. MMLU - Language Understanding
2. MMMU-Pro: — Massive Multimodal Multitask
3. AIME — American Invitational Mathematics Exam
4. GPQA — Graduate-Level Problem Solving
5. SWE-Bench / LiveCodeBench
Benchmark Resources