UC3_PAPER_INTEGRATION

UC3 → paper integration proposal (differential ATAC-seq, issue #14)

What this is: ideas for integrating the UC3 work into the Galaxy Notebooks paper (vault/papers/galaxy-notebooks/manuscript.md), generated by feeding the UC3 debrief pair (UC3_DEBRIEF.md + UC3_DEBRIEF_2.md) and the paper draft to a review subagent. The paper itself was not modified. Line numbers reference manuscript.md as of 2026-06-14.

Why UC3 earns a place

UC3 is the only use case that turns an abstract design claim — “reuse Galaxy’s existing markdown renderer” / “reference artifacts, not just describe them” — into a falsifiable, before/after, code-backed story. It is the sharpest evidence for three things the draft asserts but doesn’t yet demonstrate: (1) display is what seeds extraction, (2) the markdown renderer is an extensible reproducibility surface, (3) a notebook full of plausible figures can still seed nothing.

Calibration (state explicitly so UC3 isn’t over-claimed)

MUST-INCLUDE

1. The display anti-pattern (Extraction section)

Where: after the paragraph ending “…recover the computational closure needed to produce them” (~lines 84–86).

A reference seeds extraction only if it points at an output the history actually produced — easy to violate. In one differential ATAC-seq analysis, the entire pipeline (a five-step DESeq2 → filter → volcano chain) ran as real Galaxy tool jobs, yet the first notebook seeded nothing: it displayed only PNG screenshots that had been exported, re-uploaded, and embedded. Walking backward from those uploads reached the upload boundary immediately — extraction recovered two dead inputs and zero tools. Rewriting the notebook to reference the on-graph tool outputs (the actual volcano and DESeq2 PDFs) seeded the full pipeline: a five-step workflow, zero dangling inputs, three outputs. The analysis never changed; only what the notebook displayed did.

Converts “references seed extraction” from claim to demonstrated mechanism with a sharp negative control (0 tools) and positive result (5 steps).

2. The cross-cutting lesson (Discussion/Limits)

Where: strengthen the “embedded artifact references can become misleading” limits paragraph (~line 176).

A notebook’s reproducibility value is bounded by what it displays. An analysis can be entirely tool-driven yet yield a notebook from which nothing reusable can be recovered, if its figures are pasted-in or re-uploaded copies rather than genuine on-graph outputs. The discipline Galaxy Notebooks reward is specific: every embedded figure/result should be a real history artifact on the provenance path. This is both a limit (the system cannot rescue a notebook showing only off-graph artifacts) and a design pressure in the right direction (the most reproducible way to document a result is also the easiest way to seed its workflow).

The paper’s strongest single sentence of earned insight from the use-case work.

3. Renderer-as-extensible-surface (Implementation → API/Content Pipeline)

Where: new paragraph (~lines 107–112); cross-ref the Abstract’s “reuses Galaxy’s existing… markdown renderer” (~line 5) and Methods “reuse Galaxy markdown rendering utilities” (~line 182). Upgrades “we touched nothing” to “the renderer is a deliberate extension point.”

The markdown rendering pipeline is not only reused but extensible. A large class of R/Bioconductor plotting tools (e.g. volcano-plot, DESeq2) emit figures as PDF, while the image directive expected a raster — forcing users to export, convert, and re-upload PNGs, breaking provenance and extraction. We addressed this at the renderer: the PDF datatype rasterizes a chosen page to PNG server-side (via PyMuPDF) and the renderer delegates format-specific rendering to the datatype, so a referenced PDF output both displays in the baked report and is recorded by the extraction collector (which already tracked image references regardless of format). We also added a history_dataset_as_pdf(history_dataset_id, page=N) directive with page selection, so a multi-page diagnostic PDF (e.g. a five-page DESeq2 report) can surface a single chosen page (PCA on page 1) as a figure. Both changes carry unit and Selenium tests. One renderer change retires the entire “tool emits PDF → cannot display → re-upload screenshot → break provenance” class.

Frame as “reuse and extend,” disclose the PyMuPDF dependency (see Tension C).

4. Figure — before/after extraction (degenerate vs full pipeline)

Where: new figure in the Extraction section or a panel of planned Figure 3. Same analysis, two panels:

Caption seed: “The notebook content determines what can be reused. Top: a notebook displaying re-uploaded screenshots seeds a degenerate two-input, zero-tool workflow. Bottom: the identical analysis, re-documented to reference its on-graph outputs, seeds the complete five-step pipeline.” The most persuasive single visual UC3 offers.

NICE-TO-HAVE

Tensions / honesty (frame proactively)

Priority summary

IdeaPrioritySection
1. Display anti-patternMUSTExtraction
2. Cross-cutting lessonMUSTDiscussion/Limits
3. Renderer extensibilityMUSTImplementation (Content Pipeline)
4. Before/after extraction figureMUSTExtraction / Fig 3
5. Rendered-notebook figureNICEDemo/Fig
6. Evaluation-plan namingNICEEvaluation Plan

Cross-references the edits must stay consistent with: Abstract ~line 5 (renderer reuse), Design Goals “reference artifacts, not just describe them” (~line 29), Extraction “references are not natural-language guesses” (~line 84), Discussion limits (~line 176), Methods “reuse Galaxy markdown rendering utilities” (~line 182).