2026-01-10T13:30:14Z - 2026-04-10T13:30:14Z

Overview

26 active pull requests
33 active issues
Excluding merges, 2 authors have pushed 7 commits to main and 33 commits to all branches. On main, 9 files have changed and there have been 316 additions and 28 deletions.

26 pull requests merged by 1 user

Merged #59 docs(stress-tests): M3.3 Phase A — calibration data collection 2026-04-09 02:22:08 +00:00

Merged #58 feat(arxiv): ingest pipeline (M5.1.1) 2026-04-09 02:03:59 +00:00

Merged #57 docs(stress-tests): archive M3.2 multi-axis results 2026-04-09 01:35:02 +00:00

Merged #56 fix(observability): persist full ResearchResult and per-item trace events 2026-04-09 01:27:48 +00:00

Merged #55 docs(stress-tests): archive M3.1 results 2026-04-09 01:21:44 +00:00

Merged #36 Record per-step durations in trace and operational logs 2026-04-08 22:58:12 +00:00

Merged #33 depth flag now drives constraint defaults 2026-04-08 22:33:49 +00:00

Merged #34 chore: Makefile with venv-based dev workflow 2026-04-08 22:32:29 +00:00

Merged #32 Mirror trace steps to the operational logger 2026-04-08 22:23:02 +00:00

Merged #31 Display budget as spend status, not exhaustion alarm 2026-04-08 22:20:02 +00:00

Merged #29 M2.5.3: marchwarden costs CLI command 2026-04-08 21:59:08 +00:00

Merged #28 M2.5.2: Cost ledger with price table 2026-04-08 21:54:24 +00:00

Merged #27 M2.5.1: Structured application logger 2026-04-08 21:48:11 +00:00

Merged #23 Propagate parent env to MCP server subprocess 2026-04-08 21:32:11 +00:00

Merged #22 Enforce token_budget before each iteration 2026-04-08 21:30:27 +00:00

Merged #21 Fix invalid default model id 2026-04-08 21:26:06 +00:00

Merged #20 Fix synthesis truncation and trace masking 2026-04-08 21:24:42 +00:00

Merged #14 chore: docker-based test environment 2026-04-08 21:08:28 +00:00

Merged #12 M2.2: marchwarden replay CLI command 2026-04-08 20:59:13 +00:00

Merged #11 M2.1: marchwarden ask CLI command 2026-04-08 20:55:00 +00:00

Merged #7 M1.4: MCP server 2026-04-08 20:41:29 +00:00

Merged #6 Add OpenQuestion to research contract 2026-04-08 20:37:54 +00:00

Merged #5 M1.3: Inner agent loop 2026-04-08 20:29:41 +00:00

Merged #4 M1.2: Trace logger 2026-04-08 20:25:59 +00:00

Merged #3 M1.1: Search and fetch tools 2026-04-08 20:19:22 +00:00

Merged #2 M0.3: Contract v1 Pydantic models 2026-04-08 20:14:46 +00:00

19 issues closed from 1 user

Closed #38 M5.1.1 arxiv-rag: ingest pipeline (marchwarden arxiv add) 2026-04-09 02:04:00 +00:00

Closed #45 M3.2 Multi-axis stress test 2026-04-09 01:35:03 +00:00

Closed #54 Persist full ResearchResult alongside trace (observability gap) 2026-04-09 01:27:49 +00:00

Closed #44 M3.1 Single-axis stress tests 2026-04-09 01:14:15 +00:00

Closed #35 Record per-step duration in trace and operational logs 2026-04-08 23:13:11 +00:00

Closed #30 depth flag should drive iteration / budget / source defaults 2026-04-08 22:34:02 +00:00

Closed #26 M2.5.3: marchwarden costs CLI command 2026-04-08 21:59:38 +00:00

Closed #25 M2.5.2: Cost ledger 2026-04-08 21:54:40 +00:00

Closed #24 M2.5.1: Structured application logger (structlog) 2026-04-08 21:48:38 +00:00

Closed #1 V1: Web-search researcher MCP + CLI shim 2026-04-08 21:37:39 +00:00

Closed #10 M2.3: First end-to-end smoke test (Utah crops) 2026-04-08 21:35:44 +00:00

Closed #18 Bug: MCP stdio client doesn't propagate parent env to server subprocess 2026-04-08 21:33:05 +00:00

Closed #17 Bug: token_budget is not actually enforced 2026-04-08 21:30:58 +00:00

Closed #15 Bug: server default model id is invalid (claude-sonnet-4-5-20250514) 2026-04-08 21:26:22 +00:00

Closed #19 Bug: trace logger truncates long field values 2026-04-08 21:25:02 +00:00

Closed #16 Bug: synthesis output parsing fails on real research runs 2026-04-08 21:25:01 +00:00

Closed #13 Docker-based testing environment 2026-04-08 21:08:57 +00:00

Closed #9 M2.2: marchwarden replay <trace_id> CLI command 2026-04-08 20:59:45 +00:00

Closed #8 M2.1: marchwarden ask CLI command 2026-04-08 20:55:00 +00:00

33 issues created by 1 user

Opened #1 V1: Web-search researcher MCP + CLI shim 2026-04-08 17:59:06 +00:00

Opened #8 M2.1: marchwarden ask CLI command 2026-04-08 20:48:42 +00:00

Opened #9 M2.2: marchwarden replay <trace_id> CLI command 2026-04-08 20:48:45 +00:00

Opened #10 M2.3: First end-to-end smoke test (Utah crops) 2026-04-08 20:48:51 +00:00

Opened #13 Docker-based testing environment 2026-04-08 21:03:56 +00:00

Opened #15 Bug: server default model id is invalid (claude-sonnet-4-5-20250514) 2026-04-08 21:09:32 +00:00

Opened #16 Bug: synthesis output parsing fails on real research runs 2026-04-08 21:12:42 +00:00

Opened #17 Bug: token_budget is not actually enforced 2026-04-08 21:12:47 +00:00

Opened #18 Bug: MCP stdio client doesn't propagate parent env to server subprocess 2026-04-08 21:12:52 +00:00

Opened #19 Bug: trace logger truncates long field values 2026-04-08 21:15:56 +00:00

Opened #24 M2.5.1: Structured application logger (structlog) 2026-04-08 21:43:25 +00:00

Opened #25 M2.5.2: Cost ledger 2026-04-08 21:43:38 +00:00

Opened #26 M2.5.3: marchwarden costs CLI command 2026-04-08 21:43:46 +00:00

Opened #30 depth flag should drive iteration / budget / source defaults 2026-04-08 22:04:49 +00:00

Opened #35 Record per-step duration in trace and operational logs 2026-04-08 22:47:07 +00:00

Opened #37 Researcher #2: arxiv-rag — semantic search over a curated arXiv reading list 2026-04-08 23:07:03 +00:00

Opened #38 M5.1.1 arxiv-rag: ingest pipeline (marchwarden arxiv add) 2026-04-08 23:17:12 +00:00

Opened #39 M5.1.2 arxiv-rag: retrieval primitive 2026-04-08 23:17:20 +00:00

Opened #40 M5.1.3 arxiv-rag: ArxivResearcher agent loop 2026-04-08 23:17:34 +00:00

Opened #41 M5.1.4 arxiv-rag: MCP server 2026-04-08 23:17:41 +00:00

Opened #42 M5.1.5 arxiv-rag: CLI integration (--researcher arxiv) 2026-04-08 23:17:49 +00:00

Opened #43 M5.1.6 arxiv-rag: cost ledger fields (embedding_calls) 2026-04-08 23:17:59 +00:00

Opened #44 M3.1 Single-axis stress tests 2026-04-08 23:23:57 +00:00

Opened #45 M3.2 Multi-axis stress test 2026-04-08 23:24:03 +00:00

Opened #46 M3.3 Confidence calibration (V1.1) 2026-04-08 23:24:11 +00:00

Opened #47 M4.1 Error handling and graceful degradation 2026-04-08 23:24:21 +00:00

Opened #48 M4.2 Test suite expansion and contract compliance 2026-04-08 23:24:29 +00:00

Opened #49 M4.3 Documentation polish (15-minute new-developer test) 2026-04-08 23:24:36 +00:00

Opened #50 M5.2 Contract validation across two researchers 2026-04-08 23:24:44 +00:00

Opened #51 M6.1 PI Agent core 2026-04-08 23:24:56 +00:00

Opened #52 M6.2 PI-driven CLI (replaces V1 ask command) 2026-04-08 23:25:03 +00:00

Opened #53 Budget cap lags one iteration behind tool payload growth 2026-04-09 01:12:50 +00:00

Opened #54 Persist full ResearchResult alongside trace (observability gap) 2026-04-09 01:21:00 +00:00