| |
Alpha This system is experimental. Scores and classifications are early-stage research and may be unreliable. Methodology → |
Model Comparison
| Model | Editorial | Structural | Class | Conf | SETL | Theme | | @cf/meta/llama-4-scout-17b-16e-instruct lite | ND | ND | — | 0.53 | — | — | | @cf/meta/llama-4-scout-17b-16e-instruct lite | +0.10 | -0.65 | Mild negative | 0.80 | 0.69 | Human Rights | | @cf/meta/llama-3.3-70b-instruct-fp8-fast lite | ND | ND | — | 0.53 | — | — | | @cf/meta/llama-3.3-70b-instruct-fp8-fast lite | -0.20 | -0.65 | Moderate negative | 0.70 | 0.54 | Financial Regulation | | Section | @cf/meta/llama-4-scout-17b-16e-instruct lite | @cf/meta/llama-4-scout-17b-16e-instruct lite | @cf/meta/llama-3.3-70b-instruct-fp8-fast lite | @cf/meta/llama-3.3-70b-instruct-fp8-fast lite | | Preamble | ND | ND | ND | ND | | Article 1 | ND | ND | ND | ND | | Article 2 | ND | ND | ND | ND | | Article 3 | ND | ND | ND | ND | | Article 4 | ND | ND | ND | ND | | Article 5 | ND | ND | ND | ND | | Article 6 | ND | ND | ND | ND | | Article 7 | ND | ND | ND | ND | | Article 8 | ND | ND | ND | ND | | Article 9 | ND | ND | ND | ND | | Article 10 | ND | ND | ND | ND | | Article 11 | ND | ND | ND | ND | | Article 12 | ND | ND | ND | ND | | Article 13 | ND | ND | ND | ND | | Article 14 | ND | ND | ND | ND | | Article 15 | ND | ND | ND | ND | | Article 16 | ND | ND | ND | ND | | Article 17 | ND | ND | ND | ND | | Article 18 | ND | ND | ND | ND | | Article 19 | ND | ND | ND | ND | | Article 20 | ND | ND | ND | ND | | Article 21 | ND | ND | ND | ND | | Article 22 | ND | ND | ND | ND | | Article 23 | ND | ND | ND | ND | | Article 24 | ND | ND | ND | ND | | Article 25 | ND | ND | ND | ND | | Article 26 | ND | ND | ND | ND | | Article 27 | ND | ND | ND | ND | | Article 28 | ND | ND | ND | ND | | Article 29 | ND | ND | ND | ND | | Article 30 | ND | ND | ND | ND | | Summary ~lite Article discusses SBF's pardon push in Congress, using derogatory language.
Lite evaluation by llama-4-scout-wai-psq · editorial channel only · no per-section breakdown available
| |
Longitudinal
13 HN snapshots · 8 evals | |
Audit Trail
16 entries | 2026-03-17 01:19 | eval_success | PSQ evaluated: g-PSQ=-0.420 (3 dims) | - - | | 2026-03-17 01:19 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.42 (Moderate negative) 0.00 | | | 2026-03-17 01:19 | eval_success | Lite evaluated: Mild negative (-0.20) | - - | | 2026-03-17 01:19 |
eval
|
Evaluated by llama-4-scout-wai: -0.20 (Mild negative) 0.00 | | | reasoning Editorial stance on SBF's pardon push, transparency indicators | | 2026-03-17 00:35 | eval_success | PSQ evaluated: g-PSQ=-0.420 (3 dims) | - - | | 2026-03-17 00:35 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.42 (Moderate negative) | | | 2026-03-17 00:31 | eval_success | Lite evaluated: Moderate negative (-0.38) | - - | | 2026-03-17 00:31 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.38 (Moderate negative) | | | reasoning Politico article on SBF pardon push | | 2026-03-16 23:48 | eval_success | PSQ evaluated: g-PSQ=-0.420 (3 dims) | - - | | 2026-03-16 23:48 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.42 (Moderate negative) 0.00 | | | 2026-03-16 23:48 | eval_success | Lite evaluated: Mild negative (-0.20) | - - | | 2026-03-16 23:48 |
eval
|
Evaluated by llama-4-scout-wai: -0.20 (Mild negative) 0.00 | | | reasoning Editorial stance on SBF's pardon push, transparency indicators | | 2026-03-16 22:49 | eval_success | PSQ evaluated: g-PSQ=-0.420 (3 dims) | - - | | 2026-03-16 22:49 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.42 (Moderate negative) | | | 2026-03-16 22:48 | eval_success | Lite evaluated: Mild negative (-0.20) | - - | | 2026-03-16 22:48 |
eval
|
Evaluated by llama-4-scout-wai: -0.20 (Mild negative) | | | reasoning Editorial stance on SBF's pardon push, transparency indicators | | |
| |