Back to Academy
Relevance 9/10Prompting and EvaluationIntermediate6 min read

Instruction Following Evaluation

Instruction-following evaluation checks whether outputs satisfy explicit constraints from prompts.

Why it matters for annotators

Even accurate answers fail product quality if they ignore required format or constraints.

Visual mental model

Extract constraints -> evaluate compliance -> score.

Examples (bad vs good)

Scenario: Real annotation scenario involving Instruction Following Evaluation

Bad: Labeling quickly without applying project rubric.

Good: Applying rubric criteria, documenting rationale, and escalating uncertainty.

Common mistakes

  • Skipping guideline details for edge cases.
  • Applying inconsistent criteria across similar samples.
  • Avoiding escalation even when uncertain.

Submission checklist

  • Read the latest guideline update before each batch.
  • Apply rubric dimensions explicitly in each decision.
  • Escalate ambiguous items with concise rationale.