| 2026-03-09 07:28 | eval_success | PSQ evaluated: g-PSQ=-0.030 (3 dims) | - - |
| 2026-03-09 07:28 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.03 (Neutral) 0.00 | |
| 2026-03-09 07:28 | eval_success | PSQ evaluated: g-PSQ=0.290 (3 dims) | - - |
| 2026-03-09 07:28 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.29 (Mild positive) 0.00 | |
| 2026-03-09 07:27 | eval_success | Lite evaluated: Neutral (0.10) | - - |
| 2026-03-09 07:27 | model_divergence | Cross-model spread 0.32 exceeds threshold (2 models) | - - |
| 2026-03-09 07:27 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Neutral) 0.00 | |
| reasoning Investigative journalism on Havana Syndrome, US military tests, and potential human rights abuses |
| 2026-03-09 06:59 | eval_success | Lite evaluated: Moderate positive (0.41) | - - |
| 2026-03-09 06:59 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.41 (Moderate positive) 0.00 | |
| reasoning Investigative journalism on Havana Syndrome |
| 2026-03-09 06:54 | eval_success | Lite evaluated: Moderate positive (0.41) | - - |
| 2026-03-09 06:54 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.41 (Moderate positive) 0.00 | |
| reasoning Investigative journalism on Havana Syndrome |
| 2026-03-09 06:22 | eval_success | Lite evaluated: Neutral (0.10) | - - |
| 2026-03-09 06:22 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Neutral) 0.00 | |
| reasoning Investigative journalism on Havana Syndrome, US military tests, and potential human rights abuses |
| 2026-03-09 06:19 | eval_success | PSQ evaluated: g-PSQ=-0.030 (3 dims) | - - |
| 2026-03-09 06:19 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.03 (Neutral) 0.00 | |
| 2026-03-09 06:18 | eval_success | PSQ evaluated: g-PSQ=0.290 (3 dims) | - - |
| 2026-03-09 06:18 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: +0.29 (Mild positive) +0.35 | |
| 2026-03-09 06:17 | eval_success | Lite evaluated: Neutral (0.10) | - - |
| 2026-03-09 06:17 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Neutral) 0.00 | |
| reasoning Investigative journalism on Havana Syndrome, US military tests, and potential human rights abuses |
| 2026-03-09 05:48 | eval_success | Lite evaluated: Moderate positive (0.41) | - - |
| 2026-03-09 05:48 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.41 (Moderate positive) 0.00 | |
| reasoning Investigative journalism on Havana Syndrome |
| 2026-03-09 04:44 | eval_success | PSQ evaluated: g-PSQ=-0.030 (3 dims) | - - |
| 2026-03-09 04:44 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.03 (Neutral) 0.00 | |
| 2026-03-09 04:43 | eval_success | Lite evaluated: Neutral (0.10) | - - |
| 2026-03-09 04:43 | model_divergence | Cross-model spread 0.32 exceeds threshold (2 models) | - - |
| 2026-03-09 04:43 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Neutral) 0.00 | |
| reasoning Investigative journalism on Havana Syndrome, US military tests, and potential human rights abuses |
| 2026-03-09 04:39 | eval_success | PSQ evaluated: g-PSQ=-0.030 (3 dims) | - - |
| 2026-03-09 04:39 |
eval
|
Evaluated by llama-4-scout-wai-psq: -0.03 (Neutral) | |
| 2026-03-09 04:38 | eval_success | PSQ evaluated: g-PSQ=-0.064 (3 dims) | - - |
| 2026-03-09 04:38 |
eval
|
Evaluated by llama-3.3-70b-wai-psq: -0.06 (Neutral) | |
| 2026-03-09 04:38 | model_divergence | Cross-model spread 0.32 exceeds threshold (2 models) | - - |
| 2026-03-09 04:38 | eval_success | Lite evaluated: Neutral (0.10) | - - |
| 2026-03-09 04:38 |
eval
|
Evaluated by llama-4-scout-wai: +0.10 (Neutral) | |
| reasoning Investigative journalism on Havana Syndrome, US military tests, and potential human rights abuses |
| 2026-03-09 04:37 | eval_success | Lite evaluated: Moderate positive (0.41) | - - |
| 2026-03-09 04:37 |
eval
|
Evaluated by llama-3.3-70b-wai: +0.41 (Moderate positive) | |
| reasoning Investigative journalism on Havana Syndrome |