M5.1.3 arxiv-rag: ArxivResearcher agent loop #40
Reference: archeious/marchwarden#40
Third sub-milestone of Issue #37. Design: ArxivRagProposal.
Goal
Wraps the M5.1.2 retrieval primitive in the same plan → tool-use → iterate → synthesize loop the web researcher uses, but with arxiv chunks instead of web fetches. Returns a `ResearchResult` matching the v1 contract.
Scope
- `researchers/arxiv/agent.py`: `ArxivResearcher` class with the same shape as `WebResearcher`
- `async research(question, context, depth, constraints) -> ResearchResult`
- `retrieve_chunks(query, k)`: call into M5.1.2
- `read_full_section(arxiv_id, section)`: fetch the entire section by ID for cases where the chunk excerpt isn't enough context
- (`iteration_start`, `retrieve_chunks`, `retrieve_chunks_complete`, `synthesis_start`, `synthesis_complete`, `complete`) so duration tracking and operational logs work for free
- `[Author et al., Year, arXiv:ID]`, prefer methods sections for "how" questions, results for "what" questions
- `cost_metadata` structure with `model_id` set to whatever Claude model performed the synthesis
- `Citation.locator` is the arxiv abs URL (`https://arxiv.org/abs/<id>`); `raw_excerpt` is the chunk text verbatim
- `source_authority` is always `high` (peer-reviewed), `recency` derived from the paper year
Tests
- `ResearchResult` with the canned content as citations
- `gaps[].category=source_not_found`
- `max_iterations` and `token_budget` the same way `WebResearcher` does
Out of scope
Branch
`feat/arxiv-rag-agent`
Blocked by: M5.1.2. Blocks: M5.1.4.
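
The citation rules in the Scope list (abs-URL locator, verbatim excerpt, fixed `high` authority, recency from the paper year) and the `max_iterations` cap could be sketched roughly as follows. This is only an illustration under assumptions: the `Chunk` dataclass, the trimmed-down `Citation`, the recency buckets (`current`, `recent`, `dated`), and the `collect_citations` helper are all hypothetical, and the fixed query list stands in for the model-driven planning step. None of it is the actual M5.1.2 or v1 contract code.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Chunk:
    """Hypothetical shape of one retrieved chunk (not the M5.1.2 type)."""
    arxiv_id: str
    year: int
    text: str


@dataclass(frozen=True)
class Citation:
    """Only the fields named in the Scope list; the real contract may have more."""
    locator: str
    raw_excerpt: str
    source_authority: str
    recency: str


def recency_from_year(paper_year: int, current_year: int) -> str:
    # Assumed bucketing: the issue only says recency is derived from the year.
    age = current_year - paper_year
    if age <= 2:
        return "current"
    if age <= 5:
        return "recent"
    return "dated"


def chunk_to_citation(chunk: Chunk, current_year: int) -> Citation:
    return Citation(
        locator=f"https://arxiv.org/abs/{chunk.arxiv_id}",
        raw_excerpt=chunk.text,  # verbatim: no trimming or normalisation
        source_authority="high",  # always high per the Scope list
        recency=recency_from_year(chunk.year, current_year),
    )


def collect_citations(retrieve, queries, k, max_iterations, current_year):
    """Run at most max_iterations retrieval rounds and dedupe the citations.

    `retrieve` is any callable (query, k) -> list[Chunk]; here it stands in
    for the retrieve_chunks tool.
    """
    seen = set()
    citations = []
    for iteration, query in enumerate(queries):
        if iteration >= max_iterations:
            break  # budget exhausted: stop before issuing another retrieval
        for chunk in retrieve(query, k):
            key = (chunk.arxiv_id, chunk.text)
            if key not in seen:
                seen.add(key)
                citations.append(chunk_to_citation(chunk, current_year))
    return citations
```

Passing `current_year` explicitly rather than reading the clock keeps the mapping deterministic, which suits tests that feed canned chunks through a stub retriever.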
Title changed from "A.3 arxiv-rag: ArxivResearcher agent loop" to "M5.1.3 arxiv-rag: ArxivResearcher agent loop"