home / www.caseyliss.com / item 25046691
Model Comparison
Model                                          Editorial  Structural  Class    Conf  SETL  Theme
claude-haiku-4-5 lite                          0.00       ND          Neutral  0.95  0.00  Technology Documentation
deepseek/deepseek-v3.2-20251201                +0.07      +0.01       Neutral  0.15  0.32  Education Access
@cf/meta/llama-3.3-70b-instruct-fp8-fast lite  0.00       ND          Neutral  0.50  0.00  Tech Industry Critique
@cf/meta/llama-4-scout-17b-16e-instruct lite   0.00       ND          Neutral  0.80  0.00  Developer Rights
Section     claude-haiku-4-5 lite  deepseek/deepseek-v3.2-20251201  @cf/meta/llama-3.3-70b-instruct-fp8-fast lite  @cf/meta/llama-4-scout-17b-16e-instruct lite
Preamble    ND                     0.18                             ND                                             ND
Article 1   ND                     0.00                             ND                                             ND
Article 2   ND                     0.00                             ND                                             ND
Article 3   ND                     0.00                             ND                                             ND
Article 4   ND                     0.00                             ND                                             ND
Article 5   ND                     0.00                             ND                                             ND
Article 6   ND                     0.00                             ND                                             ND
Article 7   ND                     0.00                             ND                                             ND
Article 8   ND                     0.00                             ND                                             ND
Article 9   ND                     0.00                             ND                                             ND
Article 10  ND                     0.00                             ND                                             ND
Article 11  ND                     0.00                             ND                                             ND
Article 12  ND                     0.00                             ND                                             ND
Article 13  ND                     0.00                             ND                                             ND
Article 14  ND                     0.00                             ND                                             ND
Article 15  ND                     0.00                             ND                                             ND
Article 16  ND                     0.00                             ND                                             ND
Article 17  ND                     0.00                             ND                                             ND
Article 18  ND                     0.00                             ND                                             ND
Article 19  ND                     0.46                             ND                                             ND
Article 20  ND                     0.00                             ND                                             ND
Article 21  ND                     0.00                             ND                                             ND
Article 22  ND                     0.18                             ND                                             ND
Article 23  ND                     0.24                             ND                                             ND
Article 24  ND                     0.00                             ND                                             ND
Article 25  ND                     0.00                             ND                                             ND
Article 26  ND                     0.24                             ND                                             ND
Article 27  ND                     0.18                             ND                                             ND
Article 28  ND                     0.00                             ND                                             ND
Article 29  ND                     0.00                             ND                                             ND
Article 30  ND                     0.00                             ND                                             ND
Pending Evaluation
This story is queued for evaluation. It will be processed in an upcoming batch.
Queued: 2026-02-26 22:44:03
Audit Trail
13 entries
2026-02-27 23:59  eval_success           Light evaluated: Neutral (0.00)
2026-02-27 23:59  eval                   Evaluated by llama-3.3-70b-wai : 0.00 (Neutral)
2026-02-27 23:59  rater_validation_warn  Light validation warnings for model llama-3.3-70b-wai: 0W 7R
2026-02-27 23:46  dlq                    Dead-lettered after 1 attempts: On Apple's Piss-Poor Documentation
2026-02-27 23:43  rate_limit             OpenRouter rate limited (429) model=llama-3.3-70b
2026-02-27 23:42  eval_success           Evaluated: Neutral (0.09)
2026-02-27 23:42  rater_validation_warn  Validation warnings for model deepseek-v3.2: 25W 25R
2026-02-27 23:42  eval                   Evaluated by deepseek-v3.2 : +0.09 (Neutral) 11,253 tokens
2026-02-27 23:42  rate_limit             OpenRouter rate limited (429) model=llama-3.3-70b
2026-02-27 23:41  eval_success           Light evaluated: Neutral (0.00)
2026-02-27 23:41  eval                   Evaluated by llama-4-scout-wai : 0.00 (Neutral)
2026-02-27 23:41  rate_limit             OpenRouter rate limited (429) model=llama-3.3-70b
2026-02-27 23:29  eval                   Evaluated by claude-haiku-4-5 : 0.00 (Neutral)