| 2026-03-30 21:05 | eval_success | Lite evaluated: Mild negative (-0.24) | - - |
| 2026-03-30 21:05 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-30 21:05 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-30 20:29 | eval_success | PSQ evaluated: g-PSQ=0.440 (3 dims) | - - |
| 2026-03-30 20:29 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-30 19:40 | eval_success | Lite evaluated: Mild negative (-0.24) | - - |
| 2026-03-30 19:40 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-30 19:40 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-30 18:40 | eval_success | PSQ evaluated: g-PSQ=0.440 (3 dims) | - - |
| 2026-03-30 18:40 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-30 18:16 | eval_success | Lite evaluated: Mild negative (-0.24) | - - |
| 2026-03-30 18:16 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-30 18:16 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-30 17:17 | eval_success | PSQ evaluated: g-PSQ=0.440 (3 dims) | - - |
| 2026-03-30 17:17 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-30 16:59 | eval_success | Lite evaluated: Mild negative (-0.24) | - - |
| 2026-03-30 16:59 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-30 16:59 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-30 16:01 | eval_success | PSQ evaluated: g-PSQ=0.440 (3 dims) | - - |
| 2026-03-30 16:01 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-30 15:38 | eval_success | Lite evaluated: Mild negative (-0.24) | - - |
| 2026-03-30 15:38 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-30 15:38 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-30 14:36 | eval_success | PSQ evaluated: g-PSQ=0.440 (3 dims) | - - |
| 2026-03-30 14:36 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-30 14:17 | eval_success | Lite evaluated: Mild negative (-0.24) | - - |
| 2026-03-30 14:17 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-30 14:17 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-30 04:53 | eval_success | PSQ evaluated: g-PSQ=0.440 (3 dims) | - - |
| 2026-03-30 04:53 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-30 04:52 | eval_success | Lite evaluated: Mild negative (-0.24) | - - |
| 2026-03-30 04:52 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-30 04:52 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-30 03:32 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-30 03:12 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-30 02:50 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-30 02:30 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-30 01:52 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-30 01:15 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-30 01:11 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-30 00:29 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) | |
| 2026-03-30 00:25 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) | |
| reasoning Technical content, zero rights discussion |
| 2026-03-29 23:41 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 23:36 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 22:31 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 22:26 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 21:21 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 21:18 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 20:06 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 20:05 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 19:27 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 19:23 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 18:48 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 18:43 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 18:11 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 18:06 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 17:37 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 17:30 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 17:01 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 16:49 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 16:25 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 16:13 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 15:49 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 15:37 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 15:12 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 15:03 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-29 14:37 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 14:26 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-29 14:02 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 13:50 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16 | |
| 2026-03-29 13:27 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 13:14 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16 | |
| 2026-03-29 12:52 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 12:37 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 12:17 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 12:00 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 11:42 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 11:24 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.32 | |
| 2026-03-29 11:07 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 10:47 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) -0.32 | |
| 2026-03-29 10:31 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 10:10 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 09:51 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 09:33 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 09:15 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 08:56 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 08:39 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 08:20 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 08:04 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 07:44 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 07:30 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 07:09 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 06:57 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 06:34 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 06:21 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 05:59 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 05:50 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 05:37 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 05:32 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 05:16 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 05:12 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 04:51 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00 | |
| 2026-03-29 04:48 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00 | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |
| 2026-03-29 04:14 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) | |
| 2026-03-29 04:13 |
eval
|
Evaluated by llama-4-scout-wai: -0.24 (Mild negative) | |
| reasoning Technical article on spatial reasoning and puzzle difficulty, no human rights discussion |