Block Bench Model Tutorial

Provider-agnostic, open-source evaluation infrastructure for language models

openbench provides standardized, reproducible benchmarking for LLMs across 30+ evaluation suites (and growing) spanning knowledge, math, reasoning, coding, science, reading comprehension, health, long ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Feedback

Provider-agnostic, open-source evaluation infrastructure for language models

Trending now