WikiLLM

WikiLLM Technology

From raw documents to compiled corporate memory — not RAG, but knowledge compilation.

The core idea: Compiled knowledge instead of RAG

Traditional RAG

Documents split into chunks
Every question: vector search
Knowledge rebuilt from scratch every time

Nothing accumulates. No learning effect.

WikiLLM (ROZUM)

Source is read and integrated
Persistent wiki: linked, updated
Knowledge compiles, grows, stays current

Every source enriches the total knowledge.

Most AI systems use RAG: with every query, document fragments are searched and assembled. Knowledge is rebuilt from scratch every time — nothing accumulates.

WikiLLM works differently. Instead of re-deriving knowledge with every query, the system builds a persistent, interlinked wiki — a structured network of summaries, entities, concepts, and cross-references. New knowledge is compiled once and then kept current.

The idea is based on Andrej Karpathy's LLM Wiki concept: the LLM becomes a disciplined wiki maintainer. It reads sources, extracts key information, updates existing pages, and maintains cross-references — automatically and traceably.

Four-Layer Architecture

ROZUM extends the WikiLLM idea with enterprise governance.

Layer 1

Source & Provenance

Immutable raw data with SHA-256 hash, versions, ACL snapshots, and provenance tracking. SharePoint, file shares, ERP, PDF archives.

SHA-256 HashACL SnapshotsVersion ControlSource Lineage
Governance at every transition
Layer 2

Compiled Knowledge

Summaries, entities, concepts, comparisons, contradictions — structured, linked, and with source citations. The persistent corporate wiki.

SummariesEntitiesConceptsCross-References
Governance at every transition
Layer 3

Retrieval & Reasoning

Hybrid search: tree reasoning, lexical and vector search, graph traversal, reranking, source verification, calibrated refusal on insufficient evidence.

Hybrid SearchRerankingCitation VerificationCalibrated Refusal
Governance at every transition
Layer 4

Governance & Control

Identity management, policies, ACL synchronization, audit trail, retention, deletion — applied at every transition, not as an outer shell.

IdentityPoliciesAudit TrailDeletion Propagation

Permission-Aware Knowledge

The critical part that WikiLLM alone doesn't solve: when a knowledge page is created from three documents with different permissions, a user only sees it if they have access to all relevant sources. Permissions are inherited, not ignored.

derived_acl = intersection(source_acl_sets)
Permission Inheritance
Doc A
ACL: R1,R2
Doc B
ACL: R2,R3
Doc C
ACL: R1,R2,R3
Compiled Knowledge Page
derived_acl = R2 (intersection)
R2 User: ✓
R1 User: ✗
R3 User: ✗

Cost comparison: Local vs. Cloud

Open-source LLMs on your server eliminate ongoing token costs. The difference grows with every additional use.

ROZUM LocalCloud API
Token-Kosten/Jahr~€200-500 (Strom)€10.000-20.000
Hardware (einmalig)€4.000-8.000
ScalingFreeLinear increase
Vendor lock-inNoneHigh

Supported Models

All models: MIT/Apache license, commercial use allowed, no vendor lock-in.

Qwen 3 70B

Parameters70B
VRAM40GB+
QualityHigh
Use CasePrimary

Llama 4 Scout

Parameters17B
VRAM10-16GB
QualityMedium-High
Use CaseCompact

Mistral Large 2

Parameters123B
VRAM70GB+
QualityHigh
Use CasePowerful

Gemma 4 27B

Parameters27B
VRAM16GB
QualityMedium
Use CaseEdge

Ready for sovereign AI?