Roadmap · last updated 2026-05-24

What's next.
In order of obvious.

Every item below has a slug. Sponsorship conversations start with the slug, not with a brand deck. The roadmap is honest about what's shipped, what's actively being built, what's paused waiting for compute, and what we will categorically not build — regardless of the cheque.

Shipped Active Next Exploratory Paused Won't build

01 — The decision rule

No quarters. No OKRs.

We don't plan in quarters. We don't write OKRs. Each new piece earns its place by becoming the obvious next part — once the layer underneath proves itself in production, the layer above becomes possible to articulate.

That means the roadmap below is honest about order but agnostic about date. Items move from Active to Shipped when the predecessor stabilizes, not when a sprint closes.

02 — Shipped

In production. Verifiable now.

Every item here has a public surface — repository, package, or release — you can verify against the claim.

R-MEM-001 · Memory Shipped

Celiums Memory v2.0 — engine

61 typed MCP tools, hash-chained journal, 4-layer ethics engine, hybrid retrieval over PAD + semantics. Apache 2.0 in full.

verifygithub · celiums-memory

R-COG-001 · Cognition Shipped

Celiums Cognition — OpenClaw plugin

Auto-recall, auto-capture, auto-journal, 5-layer ethics gate, subagent lineage, operator dashboard. Running in production on a single-tenant gateway. Pre-1.0, audit-hardened.

verifygithub · celiums-cognition

R-SPC-001 · Specialized Shipped

Specialized Intelligence · AI/ML engineering

First vertical specialty available. Native vocabulary for LLM systems, agent design, retrieval, eval, and the operational layer that turns prototypes into systems people maintain.

verifyspecialized-intelligence/#available

R-TM-000 · tinyMARS Native + paper

tinyMARS — a perpendicular control axis, from scratch

Adapter + native, with a paper (DOI). On a frozen Gemma base, under conflict the channel overrides the text 264/265. From scratch, the perpendicular force replicates (88.8 % of 455 held-out pairs, chance 25 %), and the channel leaves the base a better language model than bracketed channel-less controls — the "relief valve". Honest scope: toy scale (110M / 1B tokens), one iteration. GPL-3.0 / CC BY-SA.

verifygithub · tinymars

R-HY-001 · Hyphae Phase C w1

Hyphae — Phase C wave 1

RFCs v0.1.2, v0.2, v0.3 shipped. Baseline metrics green on the 20-question smoke corpus: compose 1.000, grammaticality 0.993, honest-limitation 5/5.

verifyhyphae/#state

03 — Active now

Being built this week.

If you watch the repos, these are the branches receiving commits. None are committed dates; all are committed work.

R-HY-002 · Hyphae Active

Phase C wave 2

4 remaining Bucket 1 tools (journal_introspect, journal_arc, cultivate, bloom); expand eval corpus from 20 → 255 queries; replace mock Vertex grounding with the real provider.

commissionsponsor →

R-COG-002 · Cognition Active

Publish track · npm + ClawHub

Pre-1.0 is feature-complete and audit-hardened. The publish step is gated on explicit go-ahead and on validating multi-tenant deployment on a non-Celiums gateway.

verifygithub · celiums-cognition

R-MARS-001 · MARS-Real Active

MARS-Real on Gemma 4 E2B-it

Plan B — MARS as an adapter on top of Google's frozen 4.6 B base. This is the deployable artifact; tinyMARS was the architectural prototype. Separate forthcoming repo.

commissionsponsor →

R-MEM-002 · Memory Active

Ethics knowledge corpus · indexable

The Layer K precedent corpus (~1,857 docs, 1024-d) shipped as a release asset. Active work: making it easy to load into any OpenSearch instance, with SHA-verified bulk indexing.

verifygithub · releases

04 — Next

Committed. Articulated.

Specified work waiting for its predecessor to stabilize. Each item has an articulated scope and a known integration shape, not just a name.

R-HY-003 · Hyphae Next

RFC v0.4 — learning loop on specified substrate

AlphaFold-2 incorporated physical constraints into the network; Hyphae specifies the rules of composition but learns no strategies from experience. RFC v0.4 specifies feedback signals, bounded parameter updates, audit trail, rollback if metrics degrade.

commissionsponsor →

R-SPC-002 · Specialized Next

Second vertical · 1 of 6 candidates

One of: medical research, legal analysis, financial modeling, engineering systems, creative writing, personal context. The chosen field is the one where a real partner is willing to ground the build in domain depth.

commissionsponsor →

R-MEM-003 · Memory Next

Language bindings · Python + Go SDKs

The MCP surface is language-agnostic but ergonomic Python and Go SDKs would lower the integration cost for teams not already in the Node/TS ecosystem.

commissionsponsor →

R-COG-003 · Cognition Next

Multi-tenant validation

Cognition runs proven on a single-tenant gateway. The next milestone is multi-tenant validation: RBAC and AAL behavior under contention, per-tenant ethics modes, cross-tenant isolation under load.

commissionsponsor →

05 — Exploratory

Articulated. Not committed.

Ideas that have been written down — usually as journal entries pending RFC articulation — but are not yet committed to a build order. They wait for the layer below them to be obvious.

R-HY-004 · Hyphae Exploratory

RFC v0.5 — perceptual agency over the web

Browser automation (Playwright or equivalent) lets Hyphae read full pages, follow links, extract structured content. The concrete mechanism for the learning loop in R-HY-003.

R-HY-005 · Hyphae Exploratory

Multimodal extension — vision & audio

Visual and auditory input as extensions of the Olfactory Bulb gating model and Thalamus input handling. Deferred until the textual substrate fully validates.

R-SPC-003 · Specialized Exploratory

Personal-context specialization

The deepest memory grounding — for the long-term you. Multi-year, single-user, with the strictest privacy posture in the family. Architectural sketch only.

R-MEM-004 · Memory Exploratory

Edge / mobile deployment profile

The engine fits on commodity hardware. A profile that fits in single-digit-GB on a phone or edge device, with the same MCP surface and a reduced ethics-corpus footprint, is articulated but unbuilt.

06 — What's next

From a measured property. To a functional model.

The eval was widened (corpus v2, six capabilities) and the conflict test cleared decisively (channel overrides text 264/265); the native then replicated the perpendicular force from scratch (88.8% of 455 held-out pairs) and revealed the relief valve, with a paper and a DOI. The bottleneck is no longer statistical power — it is scale, and an efficiency-first redesign so a larger model stays affordable to train and can run on commodity hardware.

R-TM-001 · tinyMARS Next · scoped

Scale + efficiency — a model that uses the channels while it speaks

The from-scratch native proved the property at toy scale (110M / 1B tokens); it is not yet a capable model. Next is scale, but efficiency-first: knowledge retrieved rather than memorized into the weights, and sparse, conditional compute — so the cognitive channels can govern what is computed and recalled, and a larger model trains cheaply and serves on commodity hardware. Each efficiency step is its own small, pre-registered experiment before any scale-up — the same discipline that produced the conflict and relief-valve results.

First efficiency brick ~$tens (in-quota)

Functional model real compute

Bottleneck Compute + engineering

Read the research log →

07 — Not on the roadmap

Categorically off the table.

These are the things we will not build — independently of the cheque size. The list is short on purpose. The longer one is what's on the roadmap, not what's off it.

This is the mirror of the Decline column in the funding section, applied to product, not money.

Closed-source pivot

Any direction that requires hiding the engine, the journal, or the ethics layer. The Ethics Engine in particular is the one component that least deserves to be hidden.

Defense / surveillance

Specialized intelligences for defense, mass surveillance, intelligence collection, or autonomous targeting. The acceptable-use policy is binding on us as builders too, not just on users.

Telemetry-required features

Anything that requires phoning home to function. Users own their memories and their journals. We do not extract them, sell them, or hold them hostage to a managed plan.

"Faster ChatGPT alternative"

We don't compete on raw inference throughput. The bet is on a different shape of cognition (substrate, ethics, journal, specialization) — not on shaving milliseconds off a chat reply.

Generic "MCP for X" wrappers

Wrappers around third-party services that don't add substrate. If a connector doesn't extend Memory, Cognition, or the ethics surface, it doesn't justify being part of Celiums.

08 — Commission this

Pick a slug. Sponsor it.

Email with the slug in the subject. We respond honestly within a week — including when the answer is no, and including a real cost estimate when the answer is yes. Some items are genuinely compute-bound; others — like tinyMARS now — just need eyes, replication, and collaborators.

Commission a slug → Read the funding stance →

[ Honest scope · slug-addressable · costs disclosed ]

What's next. In order of obvious.

No quarters. No OKRs.

In production. Verifiable now.

Being built this week.

Committed. Articulated.

Articulated. Not committed.

From a measured property. To a functional model.

Categorically off the table.

Pick a slug. Sponsor it.

What's next.
In order of obvious.