home / rubrics / calibration_analysis

calibration_analysis: 2

This data as json

id skill_code dimension level_pair events_analyzed human_events deepseek_events human_agreement deepseek_agreement weighted_agreement avg_response_time_ms avg_margin_score dominant_feature health_score revision_recommended analyzed_at
2 RL-EVIDENCE autonomy L2v3 2 1 1 0.0 1.0 0.3333333333333333 9332.5 0.625 original_thinking 48.0 0 2026-05-26 02:37:26
Powered by Datasette · Queries took 0.573ms