Back to Academy
Relevance 7/10Safety and PolicyIntermediate6 min read
Harmlessness Score
Harmlessness scoring measures risk reduction in model responses.
Why it matters for annotators
It helps compare safety behavior across model versions.
Visual mental model
Output -> risk dimension scoring -> harmlessness estimate.
Examples (bad vs good)
Scenario: Real annotation scenario involving Harmlessness Score
Bad: Labeling quickly without applying project rubric.
Good: Applying rubric criteria, documenting rationale, and escalating uncertainty.
Common mistakes
- Skipping guideline details for edge cases.
- Applying inconsistent criteria across similar samples.
- Avoiding escalation even when uncertain.
Submission checklist
- Read the latest guideline update before each batch.
- Apply rubric dimensions explicitly in each decision.
- Escalate ambiguous items with concise rationale.