Apex Fusion AP3XFUSION × ORIGINTRAIL
VECTOR MAINNET · DKG TESTNET · PIPELINE LIVE

Count eight generations back. You have 510 direct ancestors. How many can you name?

Most people stall after a handful. The rest are gone: no name, no date, no story. Global Genealogy is an open effort to give them back. A fleet of AI agents reads the world's archives, reconstructs lives and relationships, and writes them into a decentralized knowledge graph where every fact carries proof of where it came from.

REMEMBERED · 15 / 510
EVERY CELL IS ONE LIFE · HOVER A CELL · THE AGENTS ARE DOING THE WORK
2 4 8 16 32 64 128 256 in the outer ring alone · all eight rings: 2+4+…+256 = 510 lives on this chart · the ninth ring alone would add 512 more, the tenth 1,024
PILOT ARCHIVE: 385,000 WWI RECORDS · 20,960 KNOWLEDGE ASSETS ON THE DKG · SETTLED ON VECTOR MAINNET · EVERY FACT TRACEABLE TO ITS SOURCE

Genealogy is personal, and it is universal.

Love it or hate it, every person alive is a descendant. At different stages of life people care more or less about their roots, but at some point everyone runs into the same simple question: how did we come to be? Not in the national, professional or personality sense. Literally. Who are the people we descend from?

The chart above depicts you as the central circle. Your mother and father are the half-rings around it; their parents the quarter-rings around them, and so on outward. The math is a simple power of two, since every person has two biological parents. But the math is not the point. The point is that the outer ring alone holds 256 people, and the chart stops there only because the page does.

Look at the dark cells. Pay attention to them. Not relatives. Ancestors. Their genes are the ones you carry. You are their legacy. Do you know them?

From the founding note · Časlav Nedeljković

// the narrowing
Traditional genealogy tracks one surname: the male line. That is a narrow cross-section of a vast inheritance. The full picture doubles every generation, and almost all of it falls outside any family tree ever drawn.
// the scale
Each cell was a whole life: a name, a birthplace, work, war, weddings, children, death. All of them interwoven. A single family's history carries the full weight of human history inside it.
// the stakes
Where no record survives, a person disappears twice: once from the world, once from memory. The first is irreversible. The second one we can still do something about.

The records exist. The knowledge does not.

Until now, the only way to lift the veil was the slow path: years of effort, money, travel, languages and luck, piecing the puzzle together one parish book at a time. History itself works against the search. Wars, resettlements, fires and neglect destroyed documents and broke oral memory. The four horsemen were thorough.

Yet the seeds of knowledge are scattered everywhere: church registers, tax rolls, military rosters, court documents, memoirs, monuments, casualty lists. Hundreds of millions of pages worldwide. Digitisation has largely happened; many archives are scanned. Structured, queryable, verifiable knowledge has not happened. The cultural memory of entire generations sits in unindexed images.

// the extraction gap
Layouts vary: rotated columns, marginalia, abbreviations. Scripts vary: Cyrillic, old German, Ottoman, Latin, colonial Spanish. The data is fuzzy: partial dates, name variants, implied kinship. No single model reads every archive well. Per-archive model selection is itself a first-class problem.
// the interpretation problem
A row in a casualty list assumes a reader who already knows the places, army units, calendars and conventions of its time. Extraction means resolving human shorthand written for a different century, across language and jurisdiction barriers.
// why a knowledge graph
The value is not in rows. It is in relationships: kinship, units, places, sequence in time. A graph lets a descendant trace a family, a historian map a migration, a researcher follow a whole population across centuries. Flat tables lose exactly what matters.
// the raw material WWI & WWII casualty registers military rolls church baptismal books immigration manifests estate ledgers colonial census records Red Cross archives memoirs & monuments

One Balkan family line can cross Modern Slavic, Ottoman, Austro-Hungarian, Latin, Italian, Byzantine, Russian, French and English sources. The region was a battlefield for centuries. Its records are a battlefield too.

Why the pilot starts in the Balkans

Not one researcher. A fleet of agents.

The premise is simple. Instead of individuals gluing fragments together alone, a fleet of specialised AI agents works through the archives: identifying public-domain sources, reading them page by page, extracting people, events and relationships, and publishing them, with provenance and credibility estimates, into a shared genealogical superstructure. Over time the connections compound, and a repository emerges that keeps the memory of our ancestors alive.

Each archive ingested trains the pipeline for the next. Expert agents emerge per source type: an agent that has read ten thousand pages of Cyrillic casualty lists is ready for the ten-thousand-and-first. Every iteration makes the intake of newly discovered or newly digitised sources easier.

// agents with accountability
Every inference is a paid, bonded job on Vector, Apex Fusion's eUTXO L2. Escrow posted, work claimed, result submitted, receipt signed and accepted: four transactions per batch, on-chain. Which model read which page is never a mystery.
// knowledge with provenance
Results are published to the OriginTrail Decentralized Knowledge Graph as assets with UALs, Universal Asset Locators. Every fact links back to its source row, the model that extracted it and the receipt that paid for it. Nobody can quietly edit the record.
// credibility, not false certainty
Sources are blurred, distorted, fractured. The graph stores confidence estimates, not invented precision. Multiple models read the same page; where they disagree, the disagreement is preserved as a research lead instead of being averaged away.
// the architecture · scattered paper → shared memory
SOURCES The world's archives scanned · unstructured
AGENT FLEET Specialised agents OCR · extraction · publishing
VECTOR L2 Accountable inference bonded · escrowed · signed
ORIGINTRAIL DKG Verifiable knowledge UAL per asset · permanent
EVERYONE A public graph descendants · historians · researchers

From a scanned Cyrillic page to a verifiable knowledge asset. Live.

Every beginning is humble: one source has to go first. The choice was Popis Gubitaka, the official casualty register of the Kingdom of Serbia in the First World War. 385,000 individuals across 8,700 pages of rotated Cyrillic columns. Chosen for difficulty: column layouts, era typography, names, places and military units written in the shorthand of 1914, requiring prior knowledge to interpret. A perfect fit for LLMs. And chosen for meaning: a generation lost, and almost none of it queryable. Until now.

// pipeline · unstructured → accountable knowledge
01 · SOURCE Popis Gubitaka 30 PDFs · 385k records
02 · EXTRACT Column-aware OCR x-coordinate reconstruction
03 · INFER · VECTOR Bonded LLM extraction 4 TX · escrow → accept
04 · STRUCTURE Graph data persons · events · places
05 · PUBLISH · DKG OriginTrail asset UAL · verifiable storage
// live assets on the DKG testnet · click to inspect
These are real assets on the OriginTrail DKG testnet. Open one: the structured data, the provenance chain and the publish transaction are all public.
// corpus scale
385,000 records
30 PDFs · 8,700 pages. 20,960 knowledge assets published so far; the rest in flight.
// vector economics
4 TX · ~0.7 AP3X
Per batch: PostEscrow → Claim → Submit → Accept. Bonded supplier, paid on signed receipt.
// dkg provenance
1 UAL per batch
Every record links back to its source row, model, escrow reference and signed receipt.
// multi-model audit
3 bonded suppliers
gpt-5.4 · qwen3.6:35b · qwen2.5:0.5b. Discrepancies become research leads, not bugs.
// the output · a public genealogy graph
A public, searchable knowledge graph of every Serbian WWI casualty. Anyone, from descendants to historians to researchers, can query it to trace ancestors, family lines and unit histories. Every match is verifiable back to the source page, the model and the on-chain receipt.

STATUS · JUNE 2026: SETTLEMENT ON VECTOR MAINNET · PUBLISHING TO DKG TESTNET · LAST PUBLISH JUN 8 · FIGURES ABOVE ARE FROM THE LIVE RUN AND WILL GROW AS THE CORPUS COMPLETES

Not a promise. A living graph.

The knowledge itself does not sit on a blockchain. It lives on the OriginTrail Decentralized Knowledge Graph: a network of nodes holding structured, queryable data, with every collection anchored on NeuroWeb so its existence, ownership and integrity can be proven. The genealogy pipeline has published 20,960 knowledge assets across 443 knowledge collections. Every star below is one real collection; its size is the number of knowledge assets inside. Click any star and the DKG explorer opens that exact collection: its content, its provenance, its anchor.

The constellation has two nebulae. Green: the May pilot, 314 small collections while the pipeline was being tuned. Purple: the June 8 bulk run, 11,700 knowledge assets in under four hours, with single collections holding over a thousand. The pipeline does not just work. It scales.

VERIFIED VIA NEUROWEB ANCHORS · 2026-06-12
EVERY STAR IS ONE KNOWLEDGE COLLECTION · CLICK TO OPEN IT IN THE DKG EXPLORER
// knowledge assets
20,960
Published to the DKG by the genealogy pipeline. Counted from the NeuroWeb anchor contract, not self-reported.
// knowledge collections
443
314 in the May pilot, 129 in the June bulk run. Largest single collection: 1,054 assets.
// bulk throughput
~51 assets / min
The June 8 run published 11,700 assets in under four hours: OCR, bonded extraction, settlement and publishing included.
// graph facts
300,000+ est.
14.3 MB of structured JSON-LD on the graph; at typical person-record density that is over 300k individual facts, each with provenance.

VERIFICATION · 2026-06-12 · COUNTED FROM THE DKG'S NEUROWEB ANCHORS, NOT SELF-REPORTED · FULL SCAN OF COLLECTIONS 780,000-800,365 · PUBLISHER 0x69c0…8061 · KNOWLEDGECOLLECTIONSTORAGE 0xCdb2…5d37 · NEUROWEB TESTNET (CHAIN 20430)

Two networks. One provenance economy.

Global Genealogy is built by the Vector team at Apex Fusion together with OriginTrail. The division of labour is clean. Vector is the computation layer: where AI work gets done, paid and proven. The OriginTrail DKG is the knowledge layer: where the results become permanent, discoverable, verifiable assets. AI without receipts is fiction. This stack is the receipts.

Vector L2 // computation layer
The accountable inference rail. Where AI work gets done, and proven.
  • Bonded supplier marketplace
  • Escrowed, signed, on-chain inference
  • Programmable agent economics
  • UTXO-native, near-instant finality
  • Cardano-grade settlement assurances
OriginTrail DKG // knowledge layer
The verifiable knowledge layer. Where facts become discoverable, queryable assets.
  • Asset-level provenance via UALs
  • Decentralised, multi-chain storage
  • Semantic discovery & queryability
  • NeuroWeb anchoring
  • Track record across enterprise data
// for AI researchers
A living testbed for applied agent research: per-archive model routing, OCR on adversarial layouts, entity resolution across languages and centuries, multi-model disagreement as signal. The corpus is real, the metrics are public, and every experiment settles on-chain.
// for investors
Every page read is a paid AP3X transaction with a bonded counterparty. One archive means hundreds of thousands of settled inference jobs; the world holds thousands of archives. Genealogy is the first market for accountable AI infrastructure, not the last.
// for blockchain builders
eUTXO escrow running against real AI workloads: deterministic settlement, four transactions per batch, receipts you can audit, knowledge assets anchored with UALs. Not a whitepaper. A pipeline you can inspect on testnet today.

One archive today. A map of human interconnection tomorrow.

Genealogy is the first use, not the only one. A graph built to hold people, events, places and relationships, with provenance on every edge, is a template for mapping any record of human existence and interaction. Genetics, the fastest-growing branch of ancestry research, extends it naturally. So do scientific corpora, court archives and the histories of institutions.

The deeper goal is simpler. Humanity is finally at the technological level where this amount of memory can be kept, handled and connected. Every dark cell on the chart above is a person whose story can still be recovered. And every recovered story makes the same point: we are all interconnected.

// next sources genetic research data official publications church records Red Cross entries immigration manifests military rolls
// what the stack unlocks planetary genealogy graph verifiable scientific corpora sovereign cultural archives auditable AI agents machine-readable history reproducible research bonded data marketplaces

The end goal is a shared, decentralized knowledge graph of our ancestors: kept alive, growing with every source, owned by no one and available to everyone.

Global Genealogy · founding note

PARTNERS: Apex Fusion Foundation · OriginTrail · HAL8 · Tachys · PILOT: 385,000 Serbian WWI casualty records · STATUS: pipeline live on DKG testnet · June 2026
↑ TOP