Foundry Hardening Follow-Ups
Candidate work to strengthen the Foundry’s docs, motivation, and external credibility. Drawn from gaps surfaced while writing FOUNDRY_WHITE_PAPER.md and the three background docs (KNOWLEDGE_BASES_IN_AI.md, SKILLS_IN_AI.md, BRIDGING_KB_AND_SKILLS.md).
Grouped by intent. Each item names the gap, the fix, and an estimate of effort.
Positioning and Motivation
-
Name the pattern. The white paper calls it “casting” internally, but externally the Foundry’s pattern is “KB → compile → skill with provenance.” Pick a one-line public framing and use it consistently across README, docs, and any external posts. Today the README says “Knowledge base + casting pipeline”; that’s good but never reused. Effort: low.
-
Lead with the failure mode, not the solution. README and
GUIDING_PRINCIPLES.mdboth open from the project’s side. A reader who doesn’t already know Galaxy workflow conversion needs the “monolithic skills rot for these specific reasons” frame first. Borrow §1 of the white paper. Effort: low. -
Add a one-page “why now” doc. The white paper is long. A 1–2 page elevator pitch sized for a PR description or grant intro is missing. Effort: low.
-
Compare-and-contrast section. The white paper has §11 comparing Foundry to “just a wiki,” “just skills,” “auto-gen docs,” “monolithic conversion skills.” The repo docs do not. A standalone
docs/COMPARISONS.mdwould help readers who arrive via “how is this different from MCP / Custom GPTs / llms.txt.” Effort: medium. -
Connect to the broader field. The Foundry never cites external prior art. Add a
docs/PRIOR_ART.mdor appendix that situates the project against Corpus2Skill, MCP resources, OpenAPI-to-skill, Voyager, llms.txt — even one paragraph per. Credibility multiplier. Effort: medium.
Doc Structure and Discoverability
-
Add a docs index / reading order.
AGENTS.mdhas a reading order for agents. There’s no equivalent landing for humans. A shortdocs/README.mdmapping which doc to read for which question. Effort: low. -
Glossary surfacing.
content/glossary.mdis canonical but only loaded by agents. A static-site glossary page with anchor links plus inbound links from every doc would make the vocabulary scrutable to outsiders. Check whether the site already exposes it; if not, surface. Effort: low. -
Architecture doc length.
ARCHITECTURE.mdis 559 lines and mixes layout, validation pipeline, and design rationale. Split intoARCHITECTURE.md(structural authority — what files exist, what they mean) andDESIGN_RATIONALE.md(why the choices). Lowers onboarding cost. Effort: medium. -
Diagram inventory. Zero diagrams in
docs/. The pipeline → Mold → reference → cast flow, the artifact-graph contract, the validation layers, and the casting per-kind dispatch all have natural diagrams. Even Mermaid in-markdown would help. Effort: medium. -
Subway-map rendering.
HARNESS_PIPELINES.mddescribes pipelines as subway maps and the site is meant to render them that way — verify that’s actually rendered. If not, ship one canonical pipeline as a real subway map and link from the white paper. Effort: medium to high.
Evidence and Validation
-
Show a real cast end-to-end. The docs describe casting in abstract terms. The most credibility-multiplying thing the project could do is publish one fully cast Mold (
SKILL.md+references/tree +_provenance.json) in the white paper or a “Day in the Life” doc, with annotations explaining what came from where. Effort: low if a cast exists; medium otherwise. -
Cite the prior-art failures concretely. The white paper says monolithic skills fail in specific detectable ways (UUIDs,
+galaxyNrevisions, parameter mismatches). Link or quote actual instances from priornf-to-galaxyruns orfind-shed-toolskill outputs. Concrete failure cases beat enumerated ones. Effort: medium. -
First-Mold case study. When
summarize-paper,implement-galaxy-tool-step,validate-galaxy-step, andvalidate-galaxy-workflowreach a workable state, writedocs/CASE_STUDY_FIRST_PIPELINE.mdshowing what a real PAPER → GALAXY run looks like, with cast diffs, validation hits, and provenance excerpts. This becomes the primary external artifact. Effort: high but high-value. -
Failure-mode catalog (selective). The Foundry deliberately doesn’t maintain prose caveats — schemas are the truth. But for external motivation, a one-time doc enumerating the failure classes
gxwfcatches deterministically (with examples) demonstrates why schema-driven validation is load-bearing. Frame as “these are the bugs we no longer have to write about.” Effort: medium.
Conceptual Hardening
-
Tighten the Mold-vs-pattern boundary.
MOLDS.md“Not Molds” section is good but the line between an action Mold and a recipe Pattern can blur (e.g.,compare-against-iwc-exemplarversus a multi-step pattern). One worked example showing why a candidate landed on one side would help. Effort: low. -
Document the cast-target adapter contract more formally.
casts/<target>/_target.ymlis referenced but not specified in one place. Cleaning this up makes “add a new cast target” a real public extension point rather than a future-work hand-wave. Effort: medium. -
Provenance schema reference page.
_provenance.jsonschema v2 is described inline inCOMPILATION_PIPELINE.md. Surface it as its own renderable schema note so external readers can read the contract without hunting. Effort: low. -
Versioning posture rationale. “No semver, identity is content hash” is unusual and reads as a missing feature to newcomers. Add a short FAQ-style entry justifying the choice and pointing to lockfile-style mitigations. Effort: low.
-
Eval philosophy doc.
MOLD_SPEC.mdhas the eval contract but the why of eval — property checks over prescriptive solutions, hallucination guardrails, handoff fidelity — is buried. A standalonedocs/EVAL_PHILOSOPHY.mdwould generalize the lessons. Effort: medium.
Audience Targeting
-
A “for Galaxy maintainers” doc. Why should a Galaxy contributor invest time here? What does this take off their plate? Different framing from “for AI engineers.” Effort: low.
-
A “for AI/agent engineers” doc. Why is this interesting outside Galaxy? Generalize the pattern (KB-as-compile-source for high-stakes domain skills). Could borrow heavily from the white paper plus
BRIDGING_KB_AND_SKILLS.md. Effort: medium. -
A “for IWC contributors” doc. Explain corpus-first authoring, how Patterns earn pages from corpus uptake, why Foundry doesn’t mirror IWC. Pre-empts the “why aren’t you just contributing to IWC” question. Effort: low.
External Surface
-
Project tagline on README that survives skim reading. Current README opens “Knowledge base + casting pipeline for building Galaxy workflows with
gxwf.” Strong, but the next line is “Site: …”. Consider adding the failure-mode hook between them. Effort: trivial. -
A “see also” cluster. Link out to MCP, Anthropic Agent Skills, Corpus2Skill, llms.txt, Voyager from a single page so readers can place the Foundry in the field’s geography. Effort: low.
-
Blog-post-shaped writeup. When ready externally, an “introducing Galaxy Workflow Foundry” post written for the broader AI-engineering audience — not Galaxy-specific. The white paper is too long; a 1500-word version would land. Effort: medium.
-
Sample cast as a shareable artifact. A standalone gist or repo containing one fully cast skill plus its provenance, downloadable, so people can browse the artifact without cloning the Foundry. Effort: low once a real cast exists.
Process and Continuity
-
CI-publish key invariants. “Molds = union of pipeline phases” is machine-checked. Surface that check’s output as a badge or per-Mold widget. Same for cast staleness — a public dashboard showing “N casts up-to-date, M stale” would make the provenance story tangible. Effort: medium.
-
Refinement-journal digest.
refinements/*.mdper Mold is append-only. A periodic rollup (monthly?) of decisions across all Molds would surface the project’s evolution and make corpus-first authoring legible from outside. Effort: medium. -
Status page or roadmap. The README’s “Status” section is one paragraph. A real
docs/STATUS.mdor live status page enumerating which Molds, Patterns, CLI pages, and casts exist with last-touched dates would communicate momentum. Effort: low — could be generated.
Quick Wins (low effort, high signal)
- One-page elevator pitch alongside the white paper.
- Docs index / reading order in
docs/README.md. - Publish one annotated end-to-end cast as a reference artifact.
- Add
docs/COMPARISONS.mdcross-referencing MCP, Custom GPTs, Corpus2Skill, llms.txt. - Add
docs/PRIOR_ART.md— even one paragraph per related project. - Surface
_provenance.jsonschema as a renderable schema note. - README tagline tweak adding the failure-mode hook.