Model Comparison
Model | Editorial | Structural | Class | Conf | SETL | Theme
@cf/meta/llama-4-scout-17b-16e-instruct lite | 0.00 | -0.20 | Mild negative | 0.90 | 0.20 | AI Ethics
@cf/meta/llama-3.3-70b-instruct-fp8-fast lite | 0.00 | -0.60 | Moderate negative | 0.80 | 0.60 | AI claims analysis
Section | @cf/meta/llama-4-scout-17b-16e-instruct lite | @cf/meta/llama-3.3-70b-instruct-fp8-fast lite | Delta
Preamble | ND | ND |
Articles 1 to 30 (each) | ND | ND |
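The Delta column in the per-section comparison above is empty because both models report ND (no data) for every section. As a rough sketch of how such a column could be filled, the snippet below treats ND as missing and only produces a difference when both cells hold a number. The function names and the sign convention (second model minus first) are assumptions for illustration, not something the page states.

```python
from typing import Optional

ND = "ND"  # marker the table uses for "no data"


def parse_score(cell: str) -> Optional[float]:
    """Return the numeric score in a cell, or None for an ND or empty cell."""
    cell = cell.strip()
    if not cell or cell.upper() == ND:
        return None
    return float(cell)


def section_delta(first: str, second: str) -> Optional[float]:
    """Difference second - first (assumed convention), or None if either is ND."""
    a, b = parse_score(first), parse_score(second)
    if a is None or b is None:
        return None
    return round(b - a, 2)


# Every row in the table above is ND / ND, so every delta is undefined.
print(section_delta("ND", "ND"))
```

Returning None rather than 0.00 keeps "no data" distinct from "the models agree".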
0.00 2,218 Gary Marcus AI claims scored against evidence (dataset) (github.com)
63 points by davegoldblatt 23 hours ago | 52 comments on HN | Mild negative ~lite vlite-1.6
Summary ~lite AI Ethics Neutral
GitHub repository for systematic extraction and analysis of AI claims
EQ 0.50
SO 0.20
TD 0.80
Lite evaluation by llama-4-scout-wai · editorial channel only · no per-section breakdown available
Longitudinal 58 HN snapshots · 17 evals
Audit Trail 37 entries
2026-03-04 17:48 model_divergence Cross-model spread 0.28 exceeds threshold (2 models) - -
2026-03-04 17:48 eval_success Lite evaluated: Mild negative (-0.14) - -
2026-03-04 17:48 eval Evaluated by llama-4-scout-wai: -0.14 (Mild negative) -0.22
reasoning
GitHub repository for AI claims dataset, no explicit human rights discussion
2026-03-04 17:48 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-04 17:33 eval_success Lite evaluated: Moderate negative (-0.42) - -
2026-03-04 17:33 rater_validation_warn Lite validation warnings for model llama-3.3-70b-wai: 1W 0R - -
2026-03-04 17:33 eval Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) 0.00
reasoning
GitHub dataset page, neutral stance
2026-03-04 17:28 eval_success Lite evaluated: Moderate negative (-0.42) - -
2026-03-04 17:28 eval Evaluated by llama-3.3-70b-wai: -0.42 (Moderate negative) -0.50
reasoning
GitHub dataset page, neutral stance
2026-03-04 17:28 rater_validation_warn Lite validation warnings for model llama-3.3-70b-wai: 1W 0R - -
2026-03-04 06:54 eval_success Lite evaluated: Neutral (0.08) - -
2026-03-04 06:54 eval Evaluated by llama-4-scout-wai: +0.08 (Neutral) 0.00
reasoning
GitHub repository for AI claims dataset, no explicit human rights discussion
2026-03-04 06:54 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-04 06:49 eval_success Lite evaluated: Neutral (0.08) - -
2026-03-04 06:49 eval Evaluated by llama-4-scout-wai: +0.08 (Neutral) 0.00
reasoning
GitHub repository for AI claims dataset, no explicit human rights discussion
2026-03-04 06:49 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-04 06:30 eval_success Lite evaluated: Neutral (0.08) - -
2026-03-04 06:30 eval Evaluated by llama-3.3-70b-wai: +0.08 (Neutral) -0.03
reasoning
GitHub dataset page, neutral stance
2026-03-04 06:30 rater_validation_warn Lite validation warnings for model llama-3.3-70b-wai: 1W 0R - -
2026-03-04 04:12 eval_success Lite evaluated: Neutral (0.08) - -
2026-03-04 04:12 eval Evaluated by llama-4-scout-wai: +0.08 (Neutral) 0.00
reasoning
GitHub repository for AI claims dataset, no explicit human rights discussion
2026-03-04 04:12 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-04 04:12 eval_success Lite evaluated: Mild positive (0.11) - -
2026-03-04 04:12 eval Evaluated by llama-3.3-70b-wai: +0.11 (Mild positive) +0.06
reasoning
GitHub dataset page, neutral stance
2026-03-04 04:12 rater_validation_warn Lite validation warnings for model llama-3.3-70b-wai: 1W 0R - -
2026-03-04 03:37 eval_success Lite evaluated: Neutral (0.05) - -
2026-03-04 03:37 eval Evaluated by llama-3.3-70b-wai: +0.05 (Neutral) -0.06
reasoning
GitHub dataset page, neutral stance
2026-03-04 03:37 rater_validation_warn Lite validation warnings for model llama-3.3-70b-wai: 1W 0R - -
2026-03-04 03:36 eval_success Lite evaluated: Neutral (0.08) - -
2026-03-04 03:36 eval Evaluated by llama-4-scout-wai: +0.08 (Neutral) 0.00
reasoning
GitHub repository for AI claims dataset, no explicit human rights discussion
2026-03-04 03:32 eval Evaluated by llama-3.3-70b-wai: +0.11 (Mild positive) +0.03
reasoning
GitHub dataset page, neutral stance
2026-03-04 03:05 eval Evaluated by llama-4-scout-wai: +0.08 (Neutral) 0.00
reasoning
GitHub repository for AI claims dataset, no explicit human rights discussion
2026-03-04 03:02 eval Evaluated by llama-3.3-70b-wai: +0.08 (Neutral) 0.00
reasoning
GitHub dataset page, neutral stance
2026-03-04 02:55 eval Evaluated by llama-3.3-70b-wai: +0.08 (Neutral) +0.03
reasoning
GitHub dataset page, neutral stance
2026-03-04 02:18 eval Evaluated by llama-4-scout-wai: +0.08 (Neutral) 0.00
reasoning
GitHub repository for AI claims dataset, no explicit human rights discussion
2026-03-04 02:14 eval Evaluated by llama-4-scout-wai: +0.08 (Neutral)
reasoning
GitHub repository for AI claims dataset, no explicit human rights discussion
2026-03-04 02:13 eval Evaluated by llama-3.3-70b-wai: +0.05 (Neutral)
reasoning
GitHub dataset page, neutral stance
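Two numbers in the audit trail above can be reproduced from the logged scores: the trailing value on each eval line looks like the change from the same model's previous score, and the 0.28 in the model_divergence entry matches the gap between the two models' latest scores. The sketch below checks both on a five-entry excerpt of the log. The function names are hypothetical, and the divergence threshold itself is not stated on the page, so it is left out.

```python
from typing import Dict, List, Optional, Tuple

Eval = Tuple[str, str, float]  # (timestamp, model, score)

# Excerpt of the eval entries above, oldest first.
EVALS: List[Eval] = [
    ("2026-03-04 06:30", "llama-3.3-70b-wai", 0.08),
    ("2026-03-04 06:54", "llama-4-scout-wai", 0.08),
    ("2026-03-04 17:28", "llama-3.3-70b-wai", -0.42),
    ("2026-03-04 17:33", "llama-3.3-70b-wai", -0.42),
    ("2026-03-04 17:48", "llama-4-scout-wai", -0.14),
]


def per_model_changes(evals: List[Eval]) -> List[Tuple[str, str, Optional[float]]]:
    """For each eval, the change from that model's previous score (None if first)."""
    last: Dict[str, float] = {}
    out = []
    for ts, model, score in evals:
        prev = last.get(model)
        change = None if prev is None else round(score - prev, 2)
        out.append((ts, model, change))
        last[model] = score
    return out


def cross_model_spread(evals: List[Eval]) -> float:
    """Max minus min over each model's most recent score."""
    latest: Dict[str, float] = {}
    for _, model, score in evals:
        latest[model] = score  # later entries overwrite earlier ones
    return round(max(latest.values()) - min(latest.values()), 2)


for row in per_model_changes(EVALS):
    print(row)
print("spread:", cross_model_spread(EVALS))
```

On this excerpt the computed changes for the 17:28, 17:33, and 17:48 entries are -0.5, 0.0, and -0.22, and the spread is 0.28, in line with the logged values. The first entry per model shows None only because the excerpt omits the earlier evals.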