Rodman et al., 2023 · JAMA Network Open
Clinicians and AI diverge most when context requires reframing — not just pattern matching.
Rodman et al. found that the performance gap between expert and AI is largest when new information requires updating the problem frame itself. Experts revised their assessments based on context and case framing. The Confidence Score operationalizes this: it tells you how stable the diagnosis is across varied contexts before you act.
Rodman et al. AI vs Clinician Performance. JAMA Network Open, 6(12):e2347075, 2023.
Berman & Katona, 2024 · Marketing Science
42–65%
Customer journeys now invisible to standard attribution — making diagnostic confidence structurally essential.
Following iOS privacy changes and cookie deprecation, a significant portion of the customer journey cannot be reliably attributed. When measurement systems are structurally incomplete, knowing the confidence level of a recommendation isn't optional — it's the only honest way to act.
Berman, R. & Katona, Z. Privacy changes and attribution model accuracy. Marketing Science, 2024.
Zhang et al., 2024 · Wharton School
23–31%
Average overestimation of marketing performance — the cost of acting on unqualified diagnostic outputs.
When metrics are accepted without cross-validation, performance is systematically overstated by nearly a quarter. The Confidence Score is the cross-validation signal built into every episode — it flags where the diagnostic has been stress-tested across contexts and where it hasn't.
Zhang et al. Channel silos and marketing performance overestimation. Journal of Marketing Analytics. Wharton, 2024.
Kassirer, 2025 · Annals of Internal Medicine
The expert advantage is problem construction — not just pattern selection.
Kassirer identified the structural limit of AI decision support: it can process supplied information quickly but cannot judge context, weight incomplete history, or resolve ambiguity. Under ambiguity, the human diagnostician actively constructs, revises, and tests the problem frame. The Confidence Score tells you when ambiguity is present and what to resolve before acting.
Kassirer, J.P. Artificial Intelligence in Medical Practice: Is It Ready? Annals of Internal Medicine, 178(4):596–597, 2025.