Scenarios

A HELM scenario defines a benchmark (e.g. MMLU or MedQA). Scenarios are implemented in the helm.benchmark.scenarios package.
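To make the idea of "a scenario defines a benchmark" concrete, here is a minimal, self-contained sketch. It does not import HELM: ToyReference, ToyInstance, and ToyScenario are hypothetical stand-ins invented for illustration, and the two questions are made-up toy data. The real classes live in helm.benchmark.scenarios and may differ in names, fields, and signatures.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class ToyReference:
    """Hypothetical stand-in: one candidate answer; `correct` marks the gold one."""
    text: str
    correct: bool = False


@dataclass(frozen=True)
class ToyInstance:
    """Hypothetical stand-in: one benchmark example with its candidate answers."""
    input_text: str
    references: List[ToyReference] = field(default_factory=list)
    split: str = "test"


class ToyScenario:
    """Hypothetical stand-in for a scenario: a named source of instances."""

    name = "toy_multiple_choice"
    description = "Two hand-written multiple-choice questions (toy data)."

    def get_instances(self) -> List[ToyInstance]:
        # A real scenario would typically download and parse a dataset here;
        # this sketch returns hard-coded examples instead.
        return [
            ToyInstance(
                input_text="What is 2 + 2?",
                references=[ToyReference("3"), ToyReference("4", correct=True)],
            ),
            ToyInstance(
                input_text="Which planet is known as the Red Planet?",
                references=[ToyReference("Mars", correct=True), ToyReference("Venus")],
            ),
        ]


if __name__ == "__main__":
    scenario = ToyScenario()
    for instance in scenario.get_instances():
        gold = [ref.text for ref in instance.references if ref.correct]
        print(f"[{instance.split}] {instance.input_text} -> {gold}")
```

The point of the sketch is the shape only: a scenario has a name and a description, and produces a list of examples that a model can later be evaluated on. Consult the source for the actual base class and data types.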

For the full API reference, either build the documentation with MkDocs from the repository root or read the source of helm.benchmark.scenarios directly.