home / mchap.io / item 17754105
Model Comparison
Model Editorial Structural Class Conf SETL Theme claude-haiku-4-5-20251001 +0.24 ND Mild positive 0.15 — Civic Participation & Transparency @cf/meta/llama-3.3-70b-instruct-fp8-fast lite +0.30 ND Moderate positive 0.80 0.00 Government Transparency @cf/meta/llama-4-scout-17b-16e-instruct lite +0.56 ND Moderate positive 0.90 0.00 Civic Tech
Section claude-haiku-4-5-20251001 @cf/meta/llama-3.3-70b-instruct-fp8-fast lite @cf/meta/llama-4-scout-17b-16e-instruct lite Preamble 0.30 ND ND Article 1 ND ND ND Article 2 ND ND ND Article 3 ND ND ND Article 4 ND ND ND Article 5 ND ND ND Article 6 ND ND ND Article 7 0.20 ND ND Article 8 0.45 ND ND Article 9 0.20 ND ND Article 10 ND ND ND Article 11 ND ND ND Article 12 -0.15 ND ND Article 13 ND ND ND Article 14 ND ND ND Article 15 ND ND ND Article 16 ND ND ND Article 17 0.20 ND ND Article 18 ND ND ND Article 19 0.45 ND ND Article 20 ND ND ND Article 21 0.55 ND ND Article 22 0.20 ND ND Article 23 0.10 ND ND Article 24 ND ND ND Article 25 0.20 ND ND Article 26 0.20 ND ND Article 27 ND ND ND Article 28 ND ND ND Article 29 0.20 ND ND Article 30 ND ND ND
Evaluation Failed
claude-p failed (exit 1)
Domain mchap.io Content Type Editorial Created 2026-02-27 22:43:36
▶ Retry Evaluation
Audit Trail
13 entries
2026-02-28 12:40 model_divergence Cross-model spread 0.29 exceeds threshold (3 models) - - 2026-02-28 12:40
eval
Evaluated by claude-haiku-4-5-20251001 : +0.27 (Mild positive) 2026-02-28 09:44 model_divergence Cross-model spread 0.26 exceeds threshold (2 models) - - 2026-02-28 09:44 eval_success Light evaluated: Moderate positive (0.56) - - 2026-02-28 09:44
eval
Evaluated by llama-4-scout-wai : +0.56 (Moderate positive) 0.00 2026-02-28 09:44 rater_validation_warn Light validation warnings for model llama-4-scout-wai: 0W 1R - - 2026-02-28 09:39 model_divergence Cross-model spread 0.26 exceeds threshold (2 models) - - 2026-02-28 09:39 eval_success Light evaluated: Moderate positive (0.56) - - 2026-02-28 09:39
eval
Evaluated by llama-4-scout-wai : +0.56 (Moderate positive) 2026-02-28 09:39 rater_validation_warn Light validation warnings for model llama-4-scout-wai: 0W 1R - - 2026-02-28 09:38 eval_success Light evaluated: Moderate positive (0.30) - - 2026-02-28 09:38
eval
Evaluated by llama-3.3-70b-wai : +0.30 (Moderate positive) 2026-02-28 09:38 rater_validation_warn Light validation warnings for model llama-3.3-70b-wai: 0W 1R - -