| 2026-03-12 00:43 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-12 00:43 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-12 00:41 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-12 00:41 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-12 00:41 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-11 23:35 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-11 23:35 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 23:34 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-11 23:34 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 23:34 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-11 22:57 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-11 22:57 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 22:56 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-11 22:56 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-11 22:56 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 22:18 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-11 22:18 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 22:15 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-11 22:15 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-11 22:15 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 10:55 | credit_exhausted | Credit balance too low, pausing provider for 30 min | - - |
| 2026-03-11 09:56 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-11 09:56 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 09:31 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-11 09:31 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-11 09:31 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 09:15 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-11 09:15 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 08:55 | eval_success | Lite evaluated: Neutral (0.00) | - - |
| 2026-03-11 08:55 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 08:55 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-11 08:39 | eval_success | PSQ evaluated: g-PSQ=0.600 (3 dims) | - - |
| 2026-03-11 08:39 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 08:19 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 08:03 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 07:42 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 07:26 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 07:07 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 06:50 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 06:32 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 06:14 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 05:57 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 05:36 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 05:22 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 04:55 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 04:42 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 03:42 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 03:31 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 02:28 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 02:17 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 01:42 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 01:39 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 00:54 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-11 00:53 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-11 00:08 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.28 (Mild positive) | |
| 2026-03-11 00:05 |
eval
|
Evaluated by llama-3.3-70b-wai: 0.00 (Neutral) | |
| reasoning Technical content, zero rights discussion |
| 2026-03-10 23:03 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00 | |
| 2026-03-10 23:01 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) 0.00 | |
| reasoning Technical content, color guessing game, no human rights discussion |
| 2026-03-10 22:33 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) | |
| 2026-03-10 22:33 |
eval
|
Evaluated by llama-4-scout-wai: 0.00 (Neutral) | |
| reasoning Technical content, color guessing game, no human rights discussion |