Gold Standard
A set of human-verified, ground-truth examples used to calibrate and evaluate AI output. The gold standard IS the verification instrument. Without it, you are measuring with a broken ruler.
Why It Exists
You can't improve what you can't measure, and you can't measure without a reference. The gold standard is that reference.
Rosetta Stone
Four circles, four readings of the same object. Each role reads the artifact through its own lens.
The valuation instrument. You cannot price a financial instrument without a reference price; you cannot price an AI output without a reference example. The gold standard is that reference.
The set of outputs the team agrees are right. Curated by humans, referenced by the agent. Grows with every review.
The test fixtures that never go stale. Labeled, versioned, used in CI. The appreciating side of the dual curve.
A labeled dataset drawn from the ground-truth distribution, used for calibration and evaluation. Its value grows with coverage: each new example tends to add less information than the last, but the marginal gain rarely drops to zero.
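The readings above share one mechanic: compare what the AI produced against what humans agreed was right. A minimal sketch of that comparison follows. The GoldExample schema, the exact-match scoring rule, and the toy_model stand-in are all assumptions made for illustration; a real gold standard might use a rubric or a graded similarity score instead.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class GoldExample:
    """One human-verified input/output pair (hypothetical schema)."""
    example_id: str
    prompt: str
    expected: str


def evaluate(gold: List[GoldExample],
             produce: Callable[[str], str]) -> Tuple[float, List[str]]:
    """Score `produce` against the gold set.

    Returns the fraction of exact matches and the ids of failing examples.
    Exact match after whitespace stripping is an assumed scoring rule.
    """
    if not gold:
        raise ValueError("gold set is empty; there is nothing to measure against")
    failures = [
        ex.example_id
        for ex in gold
        if produce(ex.prompt).strip() != ex.expected.strip()
    ]
    accuracy = 1 - len(failures) / len(gold)
    return accuracy, failures


# A tiny, versioned gold set (illustrative data only).
GOLD_V1 = [
    GoldExample("g1", "2 + 2", "4"),
    GoldExample("g2", "capital of France", "Paris"),
    GoldExample("g3", "3 * 3", "9"),
    GoldExample("g4", "opposite of hot", "cold"),
]


def toy_model(prompt: str) -> str:
    """Stand-in for the AI under test; it gets g3 wrong on purpose."""
    answers = {
        "2 + 2": "4",
        "capital of France": "Paris",
        "3 * 3": "6",
        "opposite of hot": "cold",
    }
    return answers.get(prompt, "")


accuracy, failures = evaluate(GOLD_V1, toy_model)
print(accuracy, failures)
```

Returning the failing ids alongside the score keeps the result reviewable: a human can inspect each miss and decide whether the output or the gold example needs correcting.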
Related Terms
Proof Layer - The verification rubric, asymmetry profile, and verification cost analysis built BEFORE the capability.
Quality Ratchet - A CI-enforced floor that only moves up.
Autonomy State Machine - A graduated trust system for AI deployments with three states: Disabled, HITL (human verifies every output), and Autonomous (spot-check only).
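The Quality Ratchet entry describes a floor that only moves up. Assuming the measured quantity is a score computed against the gold standard, the check could look roughly like the sketch below. The ratchet function and RegressionError are hypothetical names, and how the floor is stored between CI runs is left out.

```python
class RegressionError(Exception):
    """Raised when a score falls below the recorded floor (hypothetical name)."""


def ratchet(floor: float, score: float) -> float:
    """Fail on any score below the floor; otherwise return the new floor.

    The returned floor is never lower than the one passed in, so it can
    only stay the same or move up.
    """
    if score < floor:
        raise RegressionError(
            f"score {score:.3f} is below the floor {floor:.3f}"
        )
    return max(floor, score)


floor = 0.75
floor = ratchet(floor, 0.80)  # improvement: the floor rises to 0.80
floor = ratchet(floor, 0.80)  # equal score: passes, floor unchanged
print(floor)
```

In a CI job, an uncaught RegressionError would fail the build, and the returned floor would be saved for the next run.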