CITADEL
Corporate hierarchy reasoning.

CITADEL tests entity resolution and hierarchical reasoning over corporate ownership structures. The system must resolve corporate entities across multiple data sources and reconstruct ownership hierarchies from SEC Exhibit 21 filings and related documents.
The methodology arc follows CP-D → E → E.1 → E.2, with each checkpoint documenting specific fixes and their per-task attribution. The sealed corpus contains 660 entities; the current pipeline covers 400 with scored results.
Coverage ceiling at F1 ~0.62. Class C structural issues (PDF embeds, non-standard layouts) represent the ceiling without new data sources.
42 systematic zero-TP entities. Root causes documented: PDF-embedded Exhibit 21 documents, abbreviated filings, GLEIF-only fallback coverage.
Same-source ground truth. Ground truth assembled from EDGAR using independent implementation but shares upstream data source.