v0.11 — Active development Local web app Python + Flask

Topical intelligence built for
the era of AI Search.

Stratum crawls your site and the top 30 SERP results, builds knowledge graphs of both, then scores your content against six gap types — telling you precisely what to write and why.

3 Knowledge graphs
site, SERP, primary domain

6 Gap types scored
with weighted formula

30 SERP competitors
analysed per query

SERP Competitive Landscape — "AI Search consulting" 30 results

Rank depth →

LLM Optimisation

Entity signals

Citation signals

Predicate gap

0.71

Entity gap

0.44

Depth gap

0.18

The architecture

Three knowledge graphs. One complete picture.

Every other topical gap tool measures your site against what currently ranks. Stratum goes further — measuring against what's actually knowable. That's the difference between chasing competitors and understanding the full opportunity.

G_S — Site Graph

What your site covers

Built by crawling your domain. Maps every entity, predicate, and relationship claim your content currently makes — what you say, how you say it, and how it connects.

Source: your crawled site

G_D — Domain Reference Graph

What competitors cover

Built from the top 30 SERP results for your target query. Establishes the reference standard — what the content that currently ranks actually claims and covers.

Source: top 30 SERP results via Playwright

G_K — Knowledge Saturation

What's knowable

Built from primary knowledge sources — Wikipedia and Wikidata SPARQL. Measures your coverage against what's actually known, independent of what competitors cover.

Source: Wikipedia / Wikidata SPARQL

Phase 14 — planned

The workflow

From URL to prioritised content roadmap.

Stratum runs locally in your browser — start a Python server, open localhost, and your data never leaves your machine.

Start a new audit

Paste your target domain or sitemap URLs. Stratum auto-discovers sitemap structure via robots.txt — Playwright handles JS-rendered or Cloudflare-protected pages automatically.

Review and confirm URL list

Stratum previews the discovered URL set. You review, set max pages, and confirm before the crawl begins. Full control over scope before any processing runs.

Crawl and analysis pipeline

Concurrent Playwright crawl (5 workers default) runs TF-IDF, LSI, spaCy NER, LLM entity extraction, and Neo4j graph writing per page. Real-time SSE progress throughout.

Run SERP analysis

Enter your target query. Stratum scrapes the top 30 results, builds G_D, and projects your site into SERP embedding space — generating the 3D competitive landscape.

Review your worklist

A prioritised, gap-typed URL table tells you which pages need work and why. Six gap types — predicate absence weighted highest because fixing a relationship claim is lower effort than creating a new page.

Generate content from gaps

The 4-step Content Writer uses SERP intelligence — consensus entities (cross-SERP frequency-weighted) and information gain opportunities (what competitors collectively miss) — to generate targeted briefs and replacement content.

What makes Stratum different

Built on ideas no other SEO tool has implemented.

Stratum's architecture borrows from knowledge representation, epistemic logic, and information theory — disciplines that don't usually touch SEO tooling. The result is gap analysis that goes significantly deeper than keyword coverage.

Gap scoring

Six gap types. Weighted by ROI, not just presence.

Predicate absence — missing relationship claims between entities — is weighted highest (β=0.30) because fixing it requires editing, not new page creation. Entity absence is next. Neighbourhood incompleteness, depth, authority, and query-response gaps complete the picture.

Predicate absence β=0.30 Entity absence Depth gap

Knowledge model

Epistemic tier on every extracted triple.

Every Subject-Predicate-Object triple extracted from your content is assigned an epistemic tier — Factual (>0.8 belief), Assumption (0.5–0.8), or Derivative (<0.5). No other SEO tool distinguishes between what content asserts as fact versus what it implies or derives.

Factual >0.8 Assumption 0.5–0.8 Derivative <0.5

Temporal model

Bitemporal knowledge. Every value timestamped.

Every node and edge in the knowledge graph carries valid_from and valid_until timestamps. A product specification claim has a different temporal stability class than market data. Stratum tracks seven stability classes — from permanent metadata to market data — because not all content claims age the same way.

7 stability classes Bitemporal model

Competitive intelligence

3D SERP competitive landscape. See where you stand.

A Three.js interactive map plots 30 SERP results across depth layers and topic clusters. Your page is projected into SERP embedding space so you can see exactly where you are relative to what ranks — and the Content Writer generates a brief from the gap directly.

Three.js 3D map 30 rank layers ForceAtlas2

Content writer

Content briefs built from SERP consensus and information gain.

The Content Writer's intelligence panel shows consensus entities (cross-SERP frequency-weighted, with coverage badges) and information gain opportunities — topics competitors collectively miss. Briefs and generated content reflect both what you must cover and what you alone could own.

4-step workflow SERP intelligence panel

Privacy

Local-first. Client data never leaves your machine.

Stratum runs as a local Flask server — Neo4j and Postgres run on your machine in server mode. No cloud database, no telemetry, no data leaving your environment. The only external calls are LLM API requests using your own keys and Playwright SERP scraping.

Local Neo4j + Postgres BYOK No telemetry

Get early access

Stratum is ~55% complete. Register to follow its development.

The data pipeline, crawl system, and gap scoring framework are built. The knowledge graph intelligence layer — entity resolution, trust scoring, and information gain — is next. Register and we'll keep you updated as each capability ships.

✓

Security, bug fixes, tech debt, performance — Phases 1-4

✓

Flask Blueprint refactor, pause/resume crawl — Phases 5-6

✓

Neumorphic UI, audit pipeline stability — Phase 7

✓

Embedding store, Neo4j + Postgres backend — Phases 8-10

spaCy NER + LLM extraction → Neo4j, licensing — Phase 11

○

Entity resolution, cross-encoder, Wikidata grounding — Phase 12

○

Subjective Logic trust scoring — Phase 13

○

Knowledge saturation from Wikipedia/Wikidata SPARQL — Phase 14

○

Information gain scoring + export — Phase 15

Local-first — client data stays on your machine

BYOK — your API keys for all LLM calls

Early access pricing for registered users

STRATUM interest form

First Name

Role

What do you do?

How do you currently handle topical gap analysis?

Short Description - so we know if we are helping - let us know what is currently missing

What interests you most about Stratum?

Knowledge graph and entity mapping

SERP 3D competitive landscape

Six-dimension gap scoring

AI-assisted content briefs

Local-first data privacy

Are you comfortable setting up a Python environment?