For questions about criteria or to unlock beta features for your organization, reach out to
support@openpipe.ai.
- Defining LLM evaluations
- Improving dataset quality
- Runtime evaluation when generating best of N samples
- Offline testing of previously generated outputs

What is a Criterion?
A criterion is a combination of an LLM model and prompt that can be used to identify a specific issue with a model’s output. Criterion judgements are generated by passing the input and output of a single row along with the criterion prompt to an LLM model, which then returns a binaryPASS
/FAIL
judgement.
To learn how to create your first criterion, read the Quick Start.