Exercise every contract feature, collect calibration data. V1.1 ship target.
Production-quality V1: error handling, test expansion, doc polish. V1.2 ship target.
Prove the contract works across researcher types. V2 begins. Includes arxiv-rag (M5.1.x) and contract validation (M5.2).
An agent that coordinates multiple researchers. V2 ship target.