Add investigation quality instrumentation #74
Reference: archeious/luminos#74
Add lightweight quality metrics so we can measure whether changes to the investigation pipeline (starting with Phase 3) improve or degrade output.
Three pieces:
Turn utilization logging: track turns used vs turns allocated per directory. Emitted to stderr during the run and recorded in cache metadata.
completeness field on dir-scope submit_report: the agent self-rates how thoroughly it investigated the directory (0.0-1.0). Added to the submit_report tool schema for the dir loop scope.
plan_evaluation.json: emitted at the end of investigation. Compares the plan's predictions (priority/shallow/skip, suggested turns) against what actually happened (turns used, confidence achieved, files examined). This is the planning pass's report card.

These give us a feedback loop for tuning without building a full evaluation framework. Run luminos on known repos before and after Phase 3, compare the metrics.
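
A rough sketch of how the three pieces might fit together, to make the proposal concrete. Everything here is an assumption for illustration: the DirOutcome record, the function names, the stderr line format, and the JSON layout are hypothetical and not taken from the luminos codebase. Only the completeness field, submit_report, and plan_evaluation.json come from the description above.

```python
import json
import sys
from dataclasses import dataclass, asdict

# Hypothetical schema fragment for the dir-scope submit_report tool.
# Only the "completeness" name and its 0.0-1.0 range come from the issue;
# the surrounding JSON Schema layout is assumed.
COMPLETENESS_PROPERTY = {
    "completeness": {
        "type": "number",
        "minimum": 0.0,
        "maximum": 1.0,
        "description": "Self-rated thoroughness of the directory investigation.",
    }
}


@dataclass
class DirOutcome:
    """Hypothetical per-directory record: plan predictions plus actual results."""
    path: str
    plan_category: str      # "priority", "shallow", or "skip"
    suggested_turns: int    # what the planning pass predicted
    turns_allocated: int
    turns_used: int
    confidence: float
    completeness: float     # agent's self-rating from submit_report
    files_examined: int


def turn_utilization(used: int, allocated: int) -> float:
    """Fraction of allocated turns actually used; 0.0 when nothing was allocated."""
    return used / allocated if allocated > 0 else 0.0


def log_turn_utilization(outcome: DirOutcome, stream=sys.stderr) -> float:
    """Emit one utilization line per directory and return the ratio."""
    ratio = turn_utilization(outcome.turns_used, outcome.turns_allocated)
    print(
        f"[turns] {outcome.path}: "
        f"{outcome.turns_used}/{outcome.turns_allocated} ({ratio:.0%})",
        file=stream,
    )
    return ratio


def build_plan_evaluation(outcomes: list[DirOutcome]) -> dict:
    """Pair each directory's plan predictions with what actually happened."""
    directories = []
    for o in outcomes:
        entry = asdict(o)
        entry["turn_utilization"] = round(
            turn_utilization(o.turns_used, o.turns_allocated), 3
        )
        # Negative means the plan suggested more turns than were needed.
        entry["turns_vs_suggested"] = o.turns_used - o.suggested_turns
        directories.append(entry)
    return {"directories": directories}


def write_plan_evaluation(outcomes: list[DirOutcome],
                          path: str = "plan_evaluation.json") -> None:
    """Write the report card at the end of an investigation run."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(build_plan_evaluation(outcomes), fh, indent=2)
```

With records like these, comparing a before and after run on the same repo comes down to diffing two plan_evaluation.json files, for example checking whether turns_vs_suggested moves closer to zero for priority directories.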