Manuscript Polish TODO

Working list of things needed to lift manuscript.md from second-pass draft to submission-ready. Keep this file honest: if a claim is not ready, park the work here instead of bending the prose around missing evidence.

Highest-Leverage Work Still Ahead

Extraction is delivered and evidenced by three real, paper-worthy analyses: the S. aureus mobile resistome (PRJDB8599), TAL1 differential ChIP, and differential ATAC-seq. The “prove extraction with a worked vignette” and “find a domain contributor” goals are closed, and no required build work remains for the paper.

Optional enhancement: graph confirmation / prune view

The manuscript has been softened so that it no longer depends on this view. The Extraction section now describes a three-step flow (identify → backward walk → extract) and frames a read-only, selectable confirm/prune graph view as a natural human-in-the-loop addition, explicitly noting that the reported results came from page-based extraction. The draft is therefore honest and complete whether or not the view ever ships, so this item is entirely optional.

How it would slightly strengthen the paper:

Where it would slot in (if built):

Already delivered (kept for record):

Evidence and Numbers to Fill In

Figures

The canonical set is the four figures referenced in manuscript.md. The detailed spec and asset inventory live in figures.md; the archived capture report is in old/FIGURE_CAPTURE_REPORT.md.

Still open from the capture pass (all optional / polish):

Citations and Literature

Manuscript Hygiene

Honest Risks in the Current Draft