OpenAI's open-source eval framework — practical, model-agnostic, widely-forked.
framework for evaluating OpenAI models
Read on Wikipedia ↗
Open source ↗