🚧 Early alpha — building the foundation. See the roadmap →

Ontology-web querying

Updated Jun 1, 2026

What Crosswalker actually is

Crosswalker is a general ontology-web query engine built on top of an Obsidian vault.

Read the previous sentence carefully. Three pieces:

Ontology-web — Crosswalker’s data model is a graph of concepts connected by typed predicates. Concepts can be controls, taxonomies, threats, library subjects, biomedical terms — anything with stable identifiers and structured relationships. Predicates can be SKOS-style (broader, narrower, related), SSSOM-style mapping vocabulary (equivalent_to, narrower_match, closely_matches), STRM (Crosswalker’s 5-relationship vocabulary), or domain-specific.
Query engine — a way to ask questions over that graph and get useful answers. The query engine architecture is three layers: primitives (filter / project / traversal / closure / anti-join / pivot / aggregate), view shapes (table / list / pivot / graph / hierarchy / timeline), and recipes (marketplace instances composing primitives + shape).
Built on top of an Obsidian vault — the storage tier is canonical Markdown with YAML frontmatter and wikilinks. This makes the data portable across tools, diffable in git, syncable via any Obsidian sync mechanism.

Compliance is the launch market — not the project’s identity

GRC compliance (NIST 800-53 ↔ ISO 27001 ↔ CIS ↔ SOC 2 ↔ HIPAA ↔ …) is Crosswalker’s launch market. It’s where the first canonical recipes ship, where the first user community lives, and where the project description on GitHub leads. The “Coverage Matrix” recipe is the canonical compliance instance.

But Crosswalker’s architecture is general-domain. Per feedback_general_ontology_positioning.md memory:

Internal vocabulary stays general — ontology / concept / subject / object / predicate. The Tier 1 schema, the recipe grammar, the query primitives, the view shapes — all are ontology-neutral.
User-facing surfaces stay GRC-first — control / framework / evidence / coverage / audit. The README, the import wizard, the marketplace recipes lead with compliance vocabulary because that’s where launch users start.
The recipe layer is the bridge — recipes compose general primitives + shape into named domain-specific reports. “Coverage Matrix” is a recipe; the underlying view shape is pivot (mechanism-neutral); the underlying primitives are filter + traversal + anti-join + pivot.

This matters because if we treat compliance as the project’s identity, we lock the architecture to compliance semantics. The 13-field junction-note schema, the 5-mechanism recipe grammar, the SSSOM/SKOS/STRM cross-domain vocabulary — all of these are durable specifically because they were designed general-domain from the start.

Cross-domain examples — where ontology-web querying applies

Crosswalker’s launch market is GRC, but the same engine applies anywhere there’s an ontology web. Examples:

Domain	Example ontologies	Why ontology-web querying matters
Compliance / GRC (LAUNCH)	NIST 800-53, ISO 27001, CIS Controls, SOC 2 TSC, HIPAA, PCI-DSS, NIST CSF, FedRAMP	Crosswalking control sets across frameworks; evidence linking; audit deliverables
Threat intelligence	MITRE ATT&CK, MITRE D3FEND, MITRE ENGAGE, CAPEC, CWE, CVE	Mapping threats to defenses; tactic ↔ technique ↔ mitigation traversal
Biomedical	OBO Foundry (Gene Ontology, ChEBI, MONDO disease ontology, UBERON anatomy), UMLS, SNOMED CT, ICD-10	Cross-ontology mapping for clinical informatics; concept reconciliation across vocabularies
Library / information science	LCSH (Library of Congress Subject Headings), MeSH (Medical Subject Headings), Dewey Decimal, SKOS thesauri	Subject taxonomy alignment; hierarchical browsing; cross-vocabulary search
Ontology mapping research	NIST OLIR (Online Informative References), OxO2 (Ontology Cross-Reference Service), BioPortal	The discipline of crosswalking IS the use case
Cyber risk frameworks	FAIR, ISO 31000, NIST RMF, COBIT, OWASP risk-rating	Mapping risk semantics across enterprise frameworks
Privacy frameworks	GDPR, CCPA, NIST Privacy Framework, ISO 29100	Cross-jurisdiction privacy control mapping

For each of these domains, the same query primitives apply: filter controls/concepts by ontology, traverse the mapping edges, closure for transitive reachability, anti-join for gaps, pivot for cross-tabulation. The primitives don’t change; the recipes (and their domain naming) do.

The cross-domain vocabulary — SSSOM, SKOS, STRM, OLIR

Crosswalker uses W3C and standards-track vocabularies to express ontology mappings. This is what makes the architecture general — we didn’t invent the cross-domain semantics; we adopted what the ontology-mapping community has converged on.

Standard	What it is	Where used in Crosswalker
SSSOM (Simple Standard for Sharing Ontology Mappings)	TSV-shaped envelope for ontology mappings: subject/object/predicate/justification/confidence/author/license. Maintained by the Mapping Commons community.	Tier 1 junction-note frontmatter shape; recipe `kind: crosswalk-edge` validation envelope
SKOS (Simple Knowledge Organization System)	W3C standard for thesauri, taxonomies, classification schemes. Defines `broader`, `narrower`, `related`, `exactMatch`, `closeMatch`, `broadMatch`, `narrowMatch`.	Recipe `graph_edges` semantically aligned with SKOS broader/narrower for hierarchical predicates
STRM (Secure Trust Reference Mapping — Crosswalker-specific)	The 5-relationship predicate vocabulary settled in v0.1: `equivalent_to`, `closely_matches`, `narrower_match`, `broader_match`, `related_to`. SSSOM-shaped.	The closed predicate set Crosswalker emits at v0.1; user-facing surfaces show STRM names
OLIR (Online Informative References)	NIST’s submission-based catalog of crosswalks between cybersecurity frameworks. Cross-jurisdiction reuse.	Reference example for cross-framework mapping at scale; pending Ch 35 rerun considers OLIR-scale graph→tabular
OxO2 (EBI Ontology Cross-Reference Service)	EBI’s mapping catalog across biomedical ontologies	Reference architecture for ontology-mapping query engine; pending Ch 36 rerun considers OxO2’s query language
OWL transitive properties	Standard semantic web vocabulary for transitively-closed predicates	Conceptual basis for closure primitive; Crosswalker doesn’t ship OWL reasoner but borrows the semantics

What Crosswalker is NOT

Important non-claims to set expectations:

Not a triple store — we do not implement RDF storage or SPARQL. The ontology-web data model is implemented in Markdown frontmatter + sqlite-wasm sidecar, not in a triple store. (Whether to add SPARQL via Oxigraph-WASM as a Tier 3 option is open per Ch 33.)
Not an OWL reasoner — we borrow OWL semantics (transitive properties, equivalence) but don’t implement DL reasoning. Recipes can declare predicates as transitive; closure follows them; that’s the limit of what we infer.
Not a knowledge-graph database product — Neo4j, Stardog, GraphDB, AllegroGraph, TerminusDB are all valid alternatives for users who want server-side graph databases. Crosswalker’s value is vault-native ontology-web management — your data stays in Obsidian, portable, diffable, syncable.
Not a federated query engine across vaults — explicitly out of scope per Ch 27 + Ch 28 anti-patterns. One ontology web per vault. Cross-vault federation may revisit in v0.3+ if user demand emerges.
Not a replacement for compliance GRC tools — Crosswalker is the substrate on which GRC workflows can be expressed in Obsidian. It does not replace ticketing, evidence-collection automation, audit-readiness scoring engines, or controls-as-code platforms. It complements them by being the canonical structured-knowledge tier.

Why ontology webs in Obsidian

The hard question: why build an ontology-web query engine on top of an Obsidian vault, when the ontology-mapping community has BioPortal / OLS / OxO2 / NIST OLIR as established platforms?

Three reasons:

Vault-native = local-first + portable. Your ontology web lives as Markdown files you own. Diffable. Git-versionable. Sync-friendly. Survives any tool transition. BioPortal et al. are vendor-hosted; if they go away, you re-do the work.
Knowledge graph + working notes in one tool. Obsidian users already have notes, daily journals, project plans, evidence dumps. Putting the ontology web in the same vault lets you wikilink between a control and the evidence note that proves it, between a SKOS taxonomy term and the research note that uses it. Cross-context queries become trivially answerable.
AI-agent-readable substrate. Plain Markdown + YAML frontmatter is the universal substrate AI agents (Claude, ChatGPT, local models) can read and write without any specialized API. The MCP server pattern (kepano/obsidian-skills) makes the vault directly queryable by agents — and that vault includes ontology-web data alongside everything else the user has.

The tradeoff: Crosswalker won’t match a dedicated ontology-management platform on raw scale (BioPortal hosts 700+ ontologies; UMLS has 3.5M concepts). The substrate scaling question is open per Ch 33 and Ch 37. Crosswalker’s sweet spot is medium-scale (tens of ontologies, tens of thousands of concepts) where vault-native portability outweighs server-side scale.

Concept pillars:

Query primitives — Layer A (mechanism-neutral operations)
View shapes — Layer B (visual presentations)
Problem — what Crosswalker addresses
What makes Crosswalker unique — vault-native + general ontology-engine framing
Hierarchy primitives — emission-side primitives
Metadata ecosystem — Bases as the v0.1.6 query mechanism
System architecture — three-tier storage + query model
Terminology — definitions for ontology / concept / subject / object / predicate / STRM
ETL and import — schema-as-primitive philosophy

Decision logs:

Synthesis log (in progress) — settled item #13 references this positioning

Research challenges (pending):

Ch 33 — Multi-modal landscape audit — surveys multi-modal/ontology-aware DBs as substrate alternatives
Ch 35 — Graph→tabular bridging rerun — central question for ontology-web querying
Ch 37 — Tier 2 scale rerun — substrate ceiling at ontology-web scale

External references: