JOURDANLABS
CRUCIBLE

Open research and validation infrastructure.

CRUCIBLE is the JourdanLabs research division. It runs the benchmark program, maintains the COSMIC methodology playbook, and operates VANTAGE — the diagnostic suite that validates the other four divisions.

What's in CRUCIBLE
Why Open Research?

CRUCIBLE benchmarks are public because the claim that COSMIC outperforms baselines in regulated domains is only credible if it can be verified. Engine implementations are proprietary — the corpus, scoring harnesses, and baseline code are not. Anyone can run the baselines. Anyone can verify the corpus SHA. Anyone can point their own pipeline at our scoring harness and compare.
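
As an illustration of the "verify the corpus SHA" step, here is a minimal sketch in Python. The page does not say which SHA variant is published or how the corpus is packaged, so the use of SHA-256 and the names `file_sha256` and `verify_corpus` are assumptions made for this example, not part of the CRUCIBLE tooling.

```python
import hashlib


def file_sha256(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks.

    SHA-256 is an assumption; swap in the algorithm the benchmark publishes.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read fixed-size chunks so large corpus archives are not loaded into memory at once.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_corpus(path, published_hex):
    """Compare a local file's digest with a published hex digest (hypothetical helper)."""
    return file_sha256(path) == published_hex.strip().lower()
```

Calling `verify_corpus("corpus.tar.gz", published_hex)` with a locally downloaded archive and the digest copied from the benchmark release returns `True` only if the local copy is byte-identical to the published corpus. The file name here is a placeholder.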

This is how SuperGLUE, HELM, and other credible benchmark programs operate. We follow the same model.