2025-05-25T17:10:12Z - 2026-05-25T17:10:12Z

Overview

26 active pull requests
33 active issues
Excluding merges, 2 authors have pushed 7 commits to main and 33 commits to all branches. On main, 9 files have changed and there have been 316 additions and 28 deletions.

26 pull requests merged by 1 user

Merged #59 docs(stress-tests): M3.3 Phase A — calibration data collection 2026-04-08 20:22:08 -06:00

Merged #58 feat(arxiv): ingest pipeline (M5.1.1) 2026-04-08 20:03:59 -06:00

Merged #57 docs(stress-tests): archive M3.2 multi-axis results 2026-04-08 19:35:02 -06:00

Merged #56 fix(observability): persist full ResearchResult and per-item trace events 2026-04-08 19:27:48 -06:00

Merged #55 docs(stress-tests): archive M3.1 results 2026-04-08 19:21:44 -06:00

Merged #36 Record per-step durations in trace and operational logs 2026-04-08 16:58:12 -06:00

Merged #33 depth flag now drives constraint defaults 2026-04-08 16:33:49 -06:00

Merged #34 chore: Makefile with venv-based dev workflow 2026-04-08 16:32:29 -06:00

Merged #32 Mirror trace steps to the operational logger 2026-04-08 16:23:02 -06:00

Merged #31 Display budget as spend status, not exhaustion alarm 2026-04-08 16:20:02 -06:00

Merged #29 M2.5.3: marchwarden costs CLI command 2026-04-08 15:59:08 -06:00

Merged #28 M2.5.2: Cost ledger with price table 2026-04-08 15:54:24 -06:00

Merged #27 M2.5.1: Structured application logger 2026-04-08 15:48:11 -06:00

Merged #23 Propagate parent env to MCP server subprocess 2026-04-08 15:32:11 -06:00

Merged #22 Enforce token_budget before each iteration 2026-04-08 15:30:27 -06:00

Merged #21 Fix invalid default model id 2026-04-08 15:26:06 -06:00

Merged #20 Fix synthesis truncation and trace masking 2026-04-08 15:24:42 -06:00

Merged #14 chore: docker-based test environment 2026-04-08 15:08:28 -06:00

Merged #12 M2.2: marchwarden replay CLI command 2026-04-08 14:59:13 -06:00

Merged #11 M2.1: marchwarden ask CLI command 2026-04-08 14:55:00 -06:00

Merged #7 M1.4: MCP server 2026-04-08 14:41:29 -06:00

Merged #6 Add OpenQuestion to research contract 2026-04-08 14:37:54 -06:00

Merged #5 M1.3: Inner agent loop 2026-04-08 14:29:41 -06:00

Merged #4 M1.2: Trace logger 2026-04-08 14:25:59 -06:00

Merged #3 M1.1: Search and fetch tools 2026-04-08 14:19:22 -06:00

Merged #2 M0.3: Contract v1 Pydantic models 2026-04-08 14:14:46 -06:00

19 issues closed from 1 user

Closed #38 M5.1.1 arxiv-rag: ingest pipeline (marchwarden arxiv add) 2026-04-08 20:04:00 -06:00

Closed #45 M3.2 Multi-axis stress test 2026-04-08 19:35:03 -06:00

Closed #54 Persist full ResearchResult alongside trace (observability gap) 2026-04-08 19:27:49 -06:00

Closed #44 M3.1 Single-axis stress tests 2026-04-08 19:14:15 -06:00

Closed #35 Record per-step duration in trace and operational logs 2026-04-08 17:13:11 -06:00

Closed #30 depth flag should drive iteration / budget / source defaults 2026-04-08 16:34:02 -06:00

Closed #26 M2.5.3: marchwarden costs CLI command 2026-04-08 15:59:38 -06:00

Closed #25 M2.5.2: Cost ledger 2026-04-08 15:54:40 -06:00

Closed #24 M2.5.1: Structured application logger (structlog) 2026-04-08 15:48:38 -06:00

Closed #1 V1: Web-search researcher MCP + CLI shim 2026-04-08 15:37:39 -06:00

Closed #10 M2.3: First end-to-end smoke test (Utah crops) 2026-04-08 15:35:44 -06:00

Closed #18 Bug: MCP stdio client doesn't propagate parent env to server subprocess 2026-04-08 15:33:05 -06:00

Closed #17 Bug: token_budget is not actually enforced 2026-04-08 15:30:58 -06:00

Closed #15 Bug: server default model id is invalid (claude-sonnet-4-5-20250514) 2026-04-08 15:26:22 -06:00

Closed #19 Bug: trace logger truncates long field values 2026-04-08 15:25:02 -06:00

Closed #16 Bug: synthesis output parsing fails on real research runs 2026-04-08 15:25:01 -06:00

Closed #13 Docker-based testing environment 2026-04-08 15:08:57 -06:00

Closed #9 M2.2: marchwarden replay <trace_id> CLI command 2026-04-08 14:59:45 -06:00

Closed #8 M2.1: marchwarden ask CLI command 2026-04-08 14:55:00 -06:00

33 issues created by 1 user

Opened #1 V1: Web-search researcher MCP + CLI shim 2026-04-08 11:59:06 -06:00

Opened #8 M2.1: marchwarden ask CLI command 2026-04-08 14:48:42 -06:00

Opened #9 M2.2: marchwarden replay <trace_id> CLI command 2026-04-08 14:48:45 -06:00

Opened #10 M2.3: First end-to-end smoke test (Utah crops) 2026-04-08 14:48:51 -06:00

Opened #13 Docker-based testing environment 2026-04-08 15:03:56 -06:00

Opened #15 Bug: server default model id is invalid (claude-sonnet-4-5-20250514) 2026-04-08 15:09:32 -06:00

Opened #16 Bug: synthesis output parsing fails on real research runs 2026-04-08 15:12:42 -06:00

Opened #17 Bug: token_budget is not actually enforced 2026-04-08 15:12:47 -06:00

Opened #18 Bug: MCP stdio client doesn't propagate parent env to server subprocess 2026-04-08 15:12:52 -06:00

Opened #19 Bug: trace logger truncates long field values 2026-04-08 15:15:56 -06:00

Opened #24 M2.5.1: Structured application logger (structlog) 2026-04-08 15:43:25 -06:00

Opened #25 M2.5.2: Cost ledger 2026-04-08 15:43:38 -06:00

Opened #26 M2.5.3: marchwarden costs CLI command 2026-04-08 15:43:46 -06:00

Opened #30 depth flag should drive iteration / budget / source defaults 2026-04-08 16:04:49 -06:00

Opened #35 Record per-step duration in trace and operational logs 2026-04-08 16:47:07 -06:00

Opened #37 Researcher #2: arxiv-rag — semantic search over a curated arXiv reading list 2026-04-08 17:07:03 -06:00

Opened #38 M5.1.1 arxiv-rag: ingest pipeline (marchwarden arxiv add) 2026-04-08 17:17:12 -06:00

Opened #39 M5.1.2 arxiv-rag: retrieval primitive 2026-04-08 17:17:20 -06:00

Opened #40 M5.1.3 arxiv-rag: ArxivResearcher agent loop 2026-04-08 17:17:34 -06:00

Opened #41 M5.1.4 arxiv-rag: MCP server 2026-04-08 17:17:41 -06:00

Opened #42 M5.1.5 arxiv-rag: CLI integration (--researcher arxiv) 2026-04-08 17:17:49 -06:00

Opened #43 M5.1.6 arxiv-rag: cost ledger fields (embedding_calls) 2026-04-08 17:17:59 -06:00

Opened #44 M3.1 Single-axis stress tests 2026-04-08 17:23:57 -06:00

Opened #45 M3.2 Multi-axis stress test 2026-04-08 17:24:03 -06:00

Opened #46 M3.3 Confidence calibration (V1.1) 2026-04-08 17:24:11 -06:00

Opened #47 M4.1 Error handling and graceful degradation 2026-04-08 17:24:21 -06:00

Opened #48 M4.2 Test suite expansion and contract compliance 2026-04-08 17:24:29 -06:00

Opened #49 M4.3 Documentation polish (15-minute new-developer test) 2026-04-08 17:24:36 -06:00

Opened #50 M5.2 Contract validation across two researchers 2026-04-08 17:24:44 -06:00

Opened #51 M6.1 PI Agent core 2026-04-08 17:24:56 -06:00

Opened #52 M6.2 PI-driven CLI (replaces V1 ask command) 2026-04-08 17:25:03 -06:00

Opened #53 Budget cap lags one iteration behind tool payload growth 2026-04-08 19:12:50 -06:00

Opened #54 Persist full ResearchResult alongside trace (observability gap) 2026-04-08 19:21:00 -06:00