| 2026-03-04 18:46 | eval_success | Lite evaluated: Strong negative (-0.70) | - - |
| 2026-03-04 18:46 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.70 (Strong negative) 0.00 | |
| reasoning Landing page with no rights discussion |
| 2026-03-04 18:46 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-04 17:46 | eval_success | Lite evaluated: Strong negative (-0.67) | - - |
| 2026-03-04 17:46 |
eval
|
Evaluated by llama-4-scout-wai: -0.67 (Strong negative) 0.00 | |
| reasoning Landing page of Eurosky, a social media platform, with a focus on European laws and regulations. |
| 2026-03-04 17:24 | eval_success | Lite evaluated: Strong negative (-0.70) | - - |
| 2026-03-04 17:24 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.70 (Strong negative) 0.00 | |
| reasoning Landing page with no rights discussion |
| 2026-03-04 17:24 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-04 11:55 | eval_success | Lite evaluated: Strong negative (-0.70) | - - |
| 2026-03-04 11:55 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.70 (Strong negative) 0.00 | |
| reasoning Landing page with no rights discussion |
| 2026-03-04 11:55 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-04 11:50 | eval_success | Lite evaluated: Strong negative (-0.70) | - - |
| 2026-03-04 11:50 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.70 (Strong negative) 0.00 | |
| reasoning Landing page with no rights discussion |
| 2026-03-04 11:50 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-04 11:07 | eval_success | Lite evaluated: Strong negative (-0.67) | - - |
| 2026-03-04 11:07 |
eval
|
Evaluated by llama-4-scout-wai: -0.67 (Strong negative) 0.00 | |
| reasoning Landing page of Eurosky, a social media platform, with a focus on European laws and regulations. |
| 2026-03-04 11:05 | eval_success | Lite evaluated: Strong negative (-0.70) | - - |
| 2026-03-04 11:05 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.70 (Strong negative) 0.00 | |
| reasoning Landing page with no rights discussion |
| 2026-03-04 11:05 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |
| 2026-03-04 11:02 | eval_success | Lite evaluated: Strong negative (-0.67) | - - |
| 2026-03-04 11:02 |
eval
|
Evaluated by llama-4-scout-wai: -0.67 (Strong negative) | |
| reasoning Landing page of Eurosky, a social media platform, with a focus on European laws and regulations. |
| 2026-03-04 11:00 | eval_success | Lite evaluated: Strong negative (-0.70) | - - |
| 2026-03-04 11:00 |
eval
|
Evaluated by llama-3.3-70b-wai: -0.70 (Strong negative) | |
| reasoning Landing page with no rights discussion |
| 2026-03-04 11:00 | rater_validation_warn | Lite validation warnings for model llama-3.3-70b-wai: 1W 0R | - - |