IDEA_INTEROP_BACKGROUND

IDEA_INTEROP_BACKGROUND.md

Pitch framing

The proposal is two-pronged. Prong A specifies Galaxy’s Tool Shed 2.0 typed-schema HTTP API as ToolEndpoint/1.0 — an open compile-time agentic protocol that complements Anthropic’s Model Context Protocol (MCP). MCP’s Resources primitive is necessary but insufficient: it is runtime-only, lacks provenance linking generated skill instructions to source KB entries, and has no version contract or conformance test suite. Tool Shed 2.0 already serves typed ParsedTool schemas for ~10,000 Galaxy tools over HTTP — there is no comparable artifact in any competing workflow system, and it is the natural substrate for a compile-time KB→skill compilation pattern. Prong B generalizes the same pattern to a second life-sciences domain that hits OS4LS RFA priority #1 head-on: assay metadata standards (ISA-Tab/JSON, RO-Crate, Bioschemas, MIxS, DCAT-AP). The frame: in an era of agents, the UI for everything becomes prompts within a year — so reproducibility and standards-grounded metadata become more important, not less, because they are the substrate that makes scientific work legible to agents. Galaxy already sits at the intersection (Workflow RO-Crate, WorkflowHub two-way integration, ELIXIR node) which gives the proposal a natural unifying narrative across both prongs.


1. MCP landscape

Model Context Protocol was released by Anthropic in November 2024 and donated to the Linux Foundation (under the new Agentic AI Foundation umbrella) in late 2025. The spec is at modelcontextprotocol.io; reference servers and SDKs live at github.com/modelcontextprotocol. MCP defines three server-side primitives:

The 2026 MCP roadmap prioritizes four areas: Transport Evolution and Scalability (streamable HTTP, sessions, load-balancing); Agent Communication (refining the Tasks primitive); Governance Maturation (Working Groups, contributor ladders); and Enterprise Readiness (audit, SSO, gateways). Notably, the roadmap does not prioritize Resources semantics or schema validation; capabilities like “streamed and reference-based result types” sit in the “On the Horizon” tier. There is one adjacent thread relevant to us: a “standard metadata format, served via .well-known, so that server capabilities are discoverable without a live connection” — an explicit acknowledgement that runtime-only discovery is a known gap.

Why MCP Resources is necessary but insufficient for our use:

  1. Runtime-only. A Resource is a URI plus a read operation. Validation is performed by the server at request time (FastMCP and similar enforce URI-template validation at runtime) — there is no ahead-of-time contract a downstream agent or skill compiler can rely on without a live connection. URI provenance is explicitly the client’s responsibility, out-of-band, per current authorization guidance.
  2. No provenance link from skill text to KB entry. Once a Resource is fetched and stuffed into a prompt, the lineage from a specific instruction sentence back to the canonical KB version is lost. There is no mechanism for “this assistant utterance was compiled from KB entry X at version Y” that downstream tooling can check.
  3. No version contract. Resources have no required versioning, semver, or change-log primitive in the protocol. Two clients consuming the same URI at different times have no protocol-level guarantee they saw the same data.
  4. No conformance test suite. Server authors can ship anything that satisfies the JSON-RPC envelope.

The compile-time complement: a typed, versioned, conformance-tested KB endpoint whose entries are referenceable from generated skills, so that a regression in the KB is detectable as a regression in the skill artifact (not as a silent change in retrieval).

2. Tool Shed 2.0 as agentic protocol

Galaxy’s Tool Shed 2.0 serves typed ParsedTool schemas — a normalized, validated representation of every Galaxy tool (inputs, outputs, type coercions, citations, requirements) — for ~10,000 tools over HTTP. The VSCode galaxy-workflows extension is a working consumer: it uses Tool Shed 2.0 schemas at edit time to drive completion and validation of workflow YAML/format2 documents. No competing workflow system (Nextflow, Snakemake, WDL) ships an analogous machine-readable typed-tool registry.

Mapping onto MCP: Tool Shed 2.0 is structurally a Resource server in MCP terms — read-only, URI-addressed (toolshed://{shed}/{repo}/{tool}/{version}), JSON-typed. But the compile-time value is the killer feature MCP Resources doesn’t capture: a downstream agent’s “use Galaxy tool X” skill can be compiled against the schema, with the resulting artifact pinning a specific tool version, parameter contract, and citation back to the shed entry. If the shed entry changes incompatibly, the compile fails — a CI-detectable regression, not a runtime surprise.

ToolEndpoint/1.0 as an open protocol would consist of:

This positioning is complementary to MCP, not competitive — ToolEndpoint/1.0 is the schema-bound source-of-truth; MCP is the runtime delivery mechanism for clients that need it.

3. Assay metadata standards landscape

StandardGovernanceSchema locationVersion stateAdoption
ISA-Tab / ISA-JSONISA Commons (Sansone/Oxford lineage), isa-tools.orgisa-tools/isa-api, isa-tools/isa-rwval (GitHub)ISA-JSON v1; isatools Python libMetaboLights, MetabolomicsWorkbench, EBI BioStudies; FAIRification community
RO-Crateresearchobject.org community (Manchester eScience Lab)researchobject.org/ro-crate, profiles registry1.2 (June 2025); Workflow RO-Crate 1.0; Workflow Run Crate; Process Run CrateWorkflowHub, Galaxy export/import, EOSC, LifeMonitor
Bioschemasbioschemas.org community (ELIXIR-aligned)bioschemas.org/profilesComputationalWorkflow 1.0-RELEASE (Mar 2021); 1.1-DRAFT (Nov 2023); Dataset 1.0-RELEASE (Jul 2022); Sample 0.2-RELEASE; BioSample 0.1-DRAFTbio.tools, WorkflowHub, training portals; embedded as JSON-LD in life-sci sites
MIxSGenomic Standards Consortium (GSC)github.com/GenomicsStandardsConsortium/mixsv6.x (LinkML-generated; 6 main checklists: genomes, marker genes, metagenomes, MAGs, SAGs, UViGs)ENA, NCBI BioSample, JGI/IMG
MIAME / MINSEQEFGED SocietyStable text specsMature, mostly-frozenGEO, ArrayExpress (legacy)
DCAT / DCAT-APW3C / SEMIC (EU)w3.org/TR/vocab-dcat-3, github.com/SEMICeu/DCAT-APDCAT 3 (W3C); DCAT-AP 3.0.1 (2025, SHACL templates included)EU data portals, EOSC catalog

Overlaps and incompatibilities — important because they are both a risk and an opportunity:

4. Existing metadata-authoring tooling — and the gap

The gap: existing tools handle authoring or validation in isolation; none provide a compile-time-grounded agent that (a) selects the right standard from FAIRsharing/registry context, (b) maps free-text experiment descriptions into validated structured submissions, (c) emits provenance linking each generated field to its KB source (ontology term, profile clause, controlled-vocabulary entry), and (d) fails CI when the upstream standard changes incompatibly. CEDAR-plus-LLM proves recall lifts; nobody has packaged this as a compile-time-grounded, version-pinned agentic substrate that spans ISA and RO-Crate and Bioschemas and MIxS uniformly.

5. Why metadata is the right second domain

6. Galaxy connection

The two prongs are not orthogonal — they share Galaxy as the anchor:

So Prong B is not a separate domain bolted on for grant-aesthetic reasons — the same RO-Crate that prong B compiles against is the format Galaxy emits for workflow runs in Prong A. The unifying narrative is real.

7. AI-readiness research landscape for metadata authoring

Published work to cite (do not pitch ML model development per RFA out-of-scope):

These give the LOI cover that the technical direction is grounded in current literature without making us a model-training project.

8. Potential collaborators

The user has indicated confidence about finding collaborators on the metadata prong. The obvious doors to knock on:

The realistic ask in an LOI is a letter from Manchester (Goble or Soiland-Reyes), Oxford (Sansone or Rocca-Serra), and one Bioschemas voice — that triangulates ISA, RO-Crate, and Bioschemas in one collaborator triangle.

9. Suggested LOI landscape-analysis paragraph (≤200 words)

Two complementary substrates make scientific work legible to agents: machine-readable tool contracts and machine-readable data contracts. Anthropic’s Model Context Protocol (MCP, donated to the Linux Foundation in 2025) standardizes runtime delivery of both via its Resources primitive, but the 2026 MCP roadmap explicitly defers schema versioning, provenance, and offline conformance — leaving a compile-time gap. Galaxy’s Tool Shed 2.0 already serves typed ParsedTool schemas for ~10,000 tools over HTTP, a registry artifact no competing workflow system possesses; specifying it as ToolEndpoint/1.0 (OpenAPI surface, JSON Schema profile, conformance suite, MCP bridge) gives the agentic ecosystem a compile-time complement to MCP Resources. The same compile-time KB→skill compilation pattern applies to assay metadata: ISA-Tab/JSON (Sansone, Oxford), RO-Crate (Goble, Manchester; spec 1.2 released June 2025), Bioschemas profiles, MIxS (now LinkML-native), and DCAT-AP 3.0.1 are all schema-bound and conformance-tested, and CEDAR-plus-LLM work has already shown recall on submission metadata can be more than tripled. Agents that author submission-ready metadata grounded in versioned, curated standards directly address OS4LS priority #1, with Galaxy/WorkflowHub/RO-Crate as the unifying anchor.

10. Risks and weaknesses


Open questions for the human

Sources