home / queue.acm.org / item 18179972
Model Comparison
Model Editorial Structural Class Conf SETL Theme claude-haiku-4-5 lite 0.00 ND Neutral 0.10 0.00 Technical computing deepseek/deepseek-v3.2-20251201 +0.30 -0.15 Mild positive 0.10 — Work & Productivity @cf/meta/llama-3.3-70b-instruct-fp8-fast lite 0.00 ND Neutral 0.50 0.00 Productivity Strategies @cf/meta/llama-4-scout-17b-16e-instruct lite 0.00 ND Neutral 1.00 0.00 productivity motivation
Section claude-haiku-4-5 lite deepseek/deepseek-v3.2-20251201 @cf/meta/llama-3.3-70b-instruct-fp8-fast lite @cf/meta/llama-4-scout-17b-16e-instruct lite Preamble ND -0.20 ND ND Article 1 ND ND ND ND Article 2 ND ND ND ND Article 3 ND 0.30 ND ND Article 4 ND ND ND ND Article 5 ND ND ND ND Article 6 ND ND ND ND Article 7 ND ND ND ND Article 8 ND ND ND ND Article 9 ND ND ND ND Article 10 ND ND ND ND Article 11 ND ND ND ND Article 12 ND -0.30 ND ND Article 13 ND ND ND ND Article 14 ND ND ND ND Article 15 ND ND ND ND Article 16 ND ND ND ND Article 17 ND ND ND ND Article 18 ND ND ND ND Article 19 ND 0.20 ND ND Article 20 ND ND ND ND Article 21 ND ND ND ND Article 22 ND ND ND ND Article 23 ND 0.40 ND ND Article 24 ND 0.20 ND ND Article 25 ND ND ND ND Article 26 ND 0.20 ND ND Article 27 ND 0.60 ND ND Article 28 ND ND ND ND Article 29 ND ND ND ND Article 30 ND ND ND ND
Pending Evaluation
This story is queued for evaluation. It will be processed in an upcoming batch.
Queued: 2026-02-27 04:04:06
Audit Trail
8 entries
2026-02-28 00:08 eval_success Light evaluated: Neutral (0.00) - - 2026-02-28 00:07
eval
Evaluated by llama-4-scout-wai : 0.00 (Neutral) 2026-02-27 23:58 eval_success Light evaluated: Neutral (0.00) - - 2026-02-27 23:58 rater_validation_warn Light validation warnings for model llama-3.3-70b-wai: 0W 7R - - 2026-02-27 23:58
eval
Evaluated by llama-3.3-70b-wai : 0.00 (Neutral) 2026-02-27 23:57 eval_success Evaluated: Mild positive (0.16) - - 2026-02-27 23:57
eval
Evaluated by deepseek-v3.2 : +0.16 (Mild positive) 13,363 tokens 2026-02-27 23:51
eval
Evaluated by claude-haiku-4-5 : 0.00 (Neutral)