
calibration_analysis: 7

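| id | skill_code | dimension | level_pair | events_analyzed | human_events | deepseek_events | human_agreement | deepseek_agreement | weighted_agreement | avg_response_time_ms | avg_margin_score | dominant_feature | health_score | revision_recommended | analyzed_at |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 7 | RL-EVIDENCE | performance | L2v3 | 1 | 0 | 1 | | 0.0 | 0.0 | 23468.0 | 0.5 | | 17.0 | 0 | 2026-05-26 02:37:26 |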