| 2026-03-07 20:12 | eval_success | PSQ evaluated: g-PSQ=-0.260 (3 dims) | - - |
| 2026-03-07 20:12 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.26 (Mild negative) 0.00 | |
| 2026-03-07 19:56 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-07 19:56 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-07 19:56 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00 | |
| reasoning Article about Elon Musk's Hyperloop failure, no explicit human rights discussion |
| 2026-03-07 19:50 | eval_success | Lite evaluated: Mild negative (-0.10) | - - |
| 2026-03-07 19:50 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative) 0.00 | |
| reasoning News article on Elon Musk's Hyperloop failure |
| 2026-03-07 19:46 | eval_success | Lite evaluated: Mild negative (-0.10) | - - |
| 2026-03-07 19:46 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative) 0.00 | |
| reasoning News article on Elon Musk's Hyperloop failure |
| 2026-03-07 19:22 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-07 19:22 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 18:36 | eval_success | PSQ evaluated: g-PSQ=-0.260 (3 dims) | - - |
| 2026-03-07 18:36 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.26 (Mild negative) 0.00 | |
| 2026-03-07 18:23 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-07 18:23 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 18:18 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-07 18:18 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 18:13 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-07 18:13 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 18:07 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-07 18:07 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 18:02 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-07 18:02 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 17:57 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-07 17:57 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 17:51 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-07 17:51 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 17:47 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-07 17:47 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 17:41 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-07 17:41 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00 | |
| 2026-03-07 17:36 | eval_success | PSQ evaluated: g-PSQ=0.280 (3 dims) | - - |
| 2026-03-07 17:36 |
eval
|
Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) | |
| 2026-03-07 17:36 | eval_success | PSQ evaluated: g-PSQ=-0.260 (3 dims) | - - |
| 2026-03-07 17:36 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.26 (Mild negative) | |
| 2026-03-07 17:36 | eval_success | Lite evaluated: Neutral (-0.08) | - - |
| 2026-03-07 17:36 |
eval
|
Evaluated by llama-4-scout-wai: -0.08 (Neutral) | |
| reasoning Article about Elon Musk's Hyperloop failure, no explicit human rights discussion |
| 2026-03-07 17:36 | rater_validation_warn | Lite validation warnings for model llama-4-scout-wai: 1W 0R | - - |
| 2026-03-07 17:35 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative) | |
| reasoning News article on Elon Musk's Hyperloop failure |