UC1_NEXT_STEPS

UC1 Next Steps — bring Bakta annotation + JBrowse locus view into the workflow

Issue: https://github.com/jmchilton/galaxy-brain/issues/12 For: the agent that built the clean UC1 history. Handoff to push the MRSA mobile-AMR analysis from “great paper figures” to “complete workflow.”

Why this handoff

You optimized UC1 for the Galaxy Notebooks paper: a clean collection map-over that extracts to a flawless 14-step, sample-agnostic workflow with two on-graph heatmaps. That goal was met (history 48916fac0de9a85d, page eafb646da3b7aac5, extracted workflow 33b43b4e7093c91f). For that goal, Bakta was deliberately left out as “non-extractable enrichment” and JBrowse was parked as a TODO.

But the issue’s plan (#12 steps 4 and 8) names both, and at the workflow level they are load-bearing, not decoration:

Both tools are already installed (mrsa-mobile-amr-tools.yml: iuc/bakta, iuc/jbrowse). The full Bakta DB-setup gotchas are documented in UC1_RECIPE.md §1 (db-light layout, the amrfinder-db move, .loc registration, serial-on-Apple-Silicon). Don’t re-derive them — follow that section.

Step A — Bakta whole-genome annotation (mapped over the collection)

Step B — JBrowse locus view for the headline mobile-AMR region

Verification (what “done” looks like)

CheckExpected
Workflow step countgrows from 14 → ~16+ (Bakta + JBrowse, plus any closest/annotate glue)
Bakta outputGFF3 annotation collection exposed as a workflow output
JBrowse outputHTML locus view(s) exposed; aac(6')-aph(2'') visible beside its IS in both KUN1163 (plasmid/IS6) and KUH180129 (chr/IS256)
Re-extractionnew outputs exposed, zero dangling input_connections, report rewritten clean (hold to the UC1 §7 standard)
Science unchangedthe existing 14-step headline still reproduces byte-identically (don’t perturb the staramr/ISEScan/closest spine; remove_short_is=true still required)

Scope / risks