calibration_analysis: 11

id: 11
skill_code: RL-CHARACTER
dimension: performance
level_pair: L1v2
events_analyzed: 1
human_events: 0
deepseek_events: 1
human_agreement:
deepseek_agreement: 1.0
weighted_agreement: 1.0
avg_response_time_ms: 10939.0
avg_margin_score: 0.75
dominant_feature: depth_of_thinking
health_score: 86.0
revision_recommended: 0
analyzed_at: 2026-05-26 02:37:26