JOURDANLABS
← BENCHMARKS / CITADEL · ENTITY RESOLUTION
CITADEL · ENTITY RESOLUTION

CITADEL

Corporate hierarchy reasoning.

CITADEL fortress artifact
0.616
F1
400/660
COVERAGE
E.2
CHECKPOINT (SEALED)
What It Is

CITADEL tests entity resolution and hierarchical reasoning over corporate ownership structures. The system must resolve corporate entities across multiple data sources and reconstruct ownership hierarchies from SEC Exhibit 21 filings and related documents.

The methodology arc follows CP-D → E → E.1 → E.2, with each checkpoint documenting specific fixes and their per-task attribution. The sealed corpus contains 660 entities; the current pipeline covers 400 with scored results.

Methodology
CorpusSealed, SHA-verified before pipeline contact
BaselinesHonest baselines, real implementations
PipelineDeterministic, no LLM inference at runtime
AttributionPer-fix attribution shown in checkpoint arc
Limitations42 systematic zero-TP entities documented
ReproducibilityFull instructions in GitHub repo
Reproducibility
Corpus sourceSEC EDGAR (public)
Corpus sealSHA-256 in CHECKPOINT_RESULTS.md
Repogithub.com/jourdanlabs/benchmarks/citadel
Limitations

Coverage ceiling at F1 ~0.62. Class C structural issues (PDF embeds, non-standard layouts) represent the ceiling without new data sources.

42 systematic zero-TP entities. Root causes documented: PDF-embedded Exhibit 21 documents, abbreviated filings, GLEIF-only fallback coverage.

Same-source ground truth. Ground truth assembled from EDGAR using independent implementation but shares upstream data source.