Metrics

HELM supports a variety of metrics for evaluating model outputs (e.g. accuracy, efficiency, bias, toxicity). The metrics are implemented in the helm.benchmark.metrics module in the source code.

For the full API reference, build the documentation with MkDocs from the repository root (pip install -e . && mkdocs serve) or see the source code.