Claim Certificates
Split answers into atomic claims, then certify whether each claim is supported. Certification is robust to evidence ordering, and any shortfall in support is reported as an information budget.
- Claim extraction: Splits answers into atomic, verifiable claims
- Budget gaps (bits/nats): Quantifies how much supporting information is missing
- QMV permutation probing: Tests across evidence orderings for robustness
- Confidence metrics: q_bar (mean) and q_lo (robust) support scores
- Certificate output: Supported / needs more evidence / abstain
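
The flow described by the list above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration, not the tool's actual API: the function names (`probe_permutations`, `certify`, `toy_scorer`, `nats_to_bits`), the choice of the minimum over orderings as the robust score `q_lo`, and the two thresholds are all hypothetical. The source does not define how a budget gap is computed, so only the unit conversion between nats and bits is shown.

```python
import itertools
import math
import statistics
from typing import Callable, Dict, List, Sequence

Scorer = Callable[[str, List[str]], float]


def nats_to_bits(nats: float) -> float:
    """Convert an information quantity from nats to bits (1 bit = ln 2 nats)."""
    return nats / math.log(2)


def probe_permutations(claim: str, evidence: Sequence[str], scorer: Scorer,
                       max_orderings: int = 24) -> List[float]:
    """Score one claim under several evidence orderings.

    `scorer` is a hypothetical stand-in for whatever model call returns a
    support probability in [0, 1] for a claim given ordered evidence.
    """
    orderings = itertools.islice(itertools.permutations(evidence), max_orderings)
    return [scorer(claim, list(order)) for order in orderings]


def certify(scores: Sequence[float], supported_at: float = 0.8,
            abstain_below: float = 0.3) -> Dict[str, object]:
    """Summarize per-ordering scores and pick a certificate label.

    Assumptions: q_lo is taken as the worst case over orderings, and both
    thresholds are illustrative values, not values from the source.
    """
    q_bar = statistics.fmean(scores)  # mean support across orderings
    q_lo = min(scores)                # robust (worst-case) support
    if q_lo >= supported_at:
        label = "supported"
    elif q_bar < abstain_below:
        label = "abstain"
    else:
        label = "needs more evidence"
    return {"q_bar": q_bar, "q_lo": q_lo, "label": label}


def toy_scorer(claim: str, ordered_evidence: List[str]) -> float:
    """Deliberately order-sensitive toy scorer: only the first item counts."""
    return 0.9 if claim.lower() in ordered_evidence[0].lower() else 0.6


evidence = [
    "Paris is the capital of France",
    "France is in Europe",
    "The Seine flows through Paris",
]
scores = probe_permutations("capital", evidence, toy_scorer)
certificate = certify(scores)
print(certificate["label"], nats_to_bits(math.log(2)))
```

Because the toy scorer changes its answer with evidence order, `q_lo` falls below `q_bar`, which is the kind of ordering sensitivity permutation probing is meant to expose. A real scorer and a properly defined budget gap would replace the placeholders here.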