home / rubrics / calibration_analysis

calibration_analysis: 24

This data as json

id skill_code dimension level_pair events_analyzed human_events deepseek_events human_agreement deepseek_agreement weighted_agreement avg_response_time_ms avg_margin_score dominant_feature health_score revision_recommended analyzed_at
24 RI-EVIDENCE performance L3v4 1 0 1   1.0 1.0 19434.0 0.75 evidence_quality 81.0 0 2026-05-26 02:37:26
Powered by Datasette · Queries took 2.558ms