WikiLLM

WikiLLM Technology

From raw documents to compiled corporate memory — not RAG, but knowledge compilation.

The core idea: Compiled knowledge instead of RAG

Traditional RAG

Documents split into chunks

Every question: vector search

Knowledge rebuilt from scratch every time

Nothing accumulates. No learning effect.

WikiLLM (ROZUM)

Source is read and integrated

Persistent wiki: linked, updated

Knowledge compiles, grows, stays current

Every source enriches the total knowledge.

Most AI systems use RAG: with every query, document fragments are searched and assembled. Knowledge is rebuilt from scratch every time — nothing accumulates.

WikiLLM works differently. Instead of re-deriving knowledge with every query, the system builds a persistent, interlinked wiki — a structured network of summaries, entities, concepts, and cross-references. New knowledge is compiled once and then kept current.

The idea is based on Andrej Karpathy's LLM Wiki concept: the LLM becomes a disciplined wiki maintainer. It reads sources, extracts key information, updates existing pages, and maintains cross-references — automatically and traceably.

Four-Layer Architecture

ROZUM extends the WikiLLM idea with enterprise governance.

Layer 1

Source & Provenance

Immutable raw data with SHA-256 hash, versions, ACL snapshots, and provenance tracking. SharePoint, file shares, ERP, PDF archives.

SHA-256 HashACL SnapshotsVersion ControlSource Lineage

Governance at every transition

Layer 2

Compiled Knowledge

Summaries, entities, concepts, comparisons, contradictions — structured, linked, and with source citations. The persistent corporate wiki.

SummariesEntitiesConceptsCross-References

Governance at every transition

Layer 3

Retrieval & Reasoning

Hybrid search: tree reasoning, lexical and vector search, graph traversal, reranking, source verification, calibrated refusal on insufficient evidence.

Hybrid SearchRerankingCitation VerificationCalibrated Refusal

Governance at every transition

Layer 4

Governance & Control

Identity management, policies, ACL synchronization, audit trail, retention, deletion — applied at every transition, not as an outer shell.

IdentityPoliciesAudit TrailDeletion Propagation

Permission-Aware Knowledge

The critical part that WikiLLM alone doesn't solve: when a knowledge page is created from three documents with different permissions, a user only sees it if they have access to all relevant sources. Permissions are inherited, not ignored.

derived_acl = intersection(source_acl_sets)

Permission Inheritance

Doc A

ACL: R1,R2

Doc B

ACL: R2,R3

Doc C

ACL: R1,R2,R3

Compiled Knowledge Page

derived_acl = R2 (intersection)

R2 User: ✓

R1 User: ✗

R3 User: ✗

Cost comparison: Local vs. Cloud

Open-source LLMs on your server eliminate ongoing token costs. The difference grows with every additional use.

	ROZUM Local	Cloud API
Token-Kosten/Jahr	~€200-500 (Strom)	€10.000-20.000
Hardware (einmalig)	€4.000-8.000	—
Scaling	Free	Linear increase
Vendor lock-in	None	High

Supported Models

All models: MIT/Apache license, commercial use allowed, no vendor lock-in.

Qwen 3 70B

Parameters70B

VRAM40GB+

QualityHigh

Use CasePrimary

Llama 4 Scout

Parameters17B

VRAM10-16GB

QualityMedium-High

Use CaseCompact

Mistral Large 2

Parameters123B

VRAM70GB+

QualityHigh

Use CasePowerful

Gemma 4 27B

Parameters27B

VRAM16GB

QualityMedium

Use CaseEdge

Ready for sovereign AI?

Become a Design Partner