Relevance 7/10Safety and PolicyIntermediate6 min read

Harmlessness Score

Harmlessness scoring measures risk reduction in model responses.

Why it matters for annotators

It helps compare safety behavior across model versions.

Output -> risk dimension scoring -> harmlessness estimate.

Scenario: Real annotation scenario involving Harmlessness Score

Bad: Labeling quickly without applying project rubric.

Good: Applying rubric criteria, documenting rationale, and escalating uncertainty.