Longitudinal 1100 HN snapshots · 168 evals
[Chart: scores plotted on a +1 to −1 axis; series: HN]
Audit Trail 188 entries
2026-03-17 08:16 eval_success PSQ evaluated: g-PSQ=0.440 (3 dims) - -
2026-03-17 08:16 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-17 08:12 model_divergence Cross-model spread 0.35 exceeds threshold (2 models) - -
2026-03-17 08:12 eval_success Lite evaluated: Neutral (-0.08) - -
2026-03-17 08:12 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-17 08:12 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-17 07:40 eval_success PSQ evaluated: g-PSQ=0.280 (3 dims) - -
2026-03-17 07:40 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-17 07:37 eval_success Lite evaluated: Neutral (-0.08) - -
2026-03-17 07:37 model_divergence Cross-model spread 0.35 exceeds threshold (2 models) - -
2026-03-17 07:37 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-17 07:37 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-17 07:05 eval_success PSQ evaluated: g-PSQ=0.280 (3 dims) - -
2026-03-17 07:05 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-17 07:02 eval_success Lite evaluated: Neutral (-0.08) - -
2026-03-17 07:02 model_divergence Cross-model spread 0.35 exceeds threshold (2 models) - -
2026-03-17 07:02 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-17 07:02 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-17 04:52 eval_success PSQ evaluated: g-PSQ=0.280 (3 dims) - -
2026-03-17 04:52 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-17 04:45 eval_success Lite evaluated: Neutral (-0.08) - -
2026-03-17 04:45 model_divergence Cross-model spread 0.35 exceeds threshold (2 models) - -
2026-03-17 04:45 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-17 04:45 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-17 04:15 eval_success PSQ evaluated: g-PSQ=0.280 (3 dims) - -
2026-03-17 04:15 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-17 04:10 eval_success Lite evaluated: Neutral (-0.08) - -
2026-03-17 04:10 model_divergence Cross-model spread 0.35 exceeds threshold (2 models) - -
2026-03-17 04:10 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-17 04:10 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-17 03:36 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-17 03:32 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-17 00:55 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-17 00:53 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-16 23:14 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 23:08 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-16 21:51 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 21:41 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-16 20:29 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 20:16 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-16 18:41 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 18:18 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-16 17:21 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 17:00 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-16 16:33 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 16:17 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-16 15:59 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 15:42 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-16 15:21 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 15:08 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-16 14:48 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 14:34 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-16 14:12 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 13:57 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-16 13:35 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 13:16 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-16 13:00 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 12:40 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-16 12:25 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 12:04 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-16 11:50 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 11:26 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-16 11:15 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 10:46 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-16 10:37 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 10:06 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-16 09:59 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 09:27 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-16 09:20 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 08:47 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-16 08:41 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 08:11 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-16 08:06 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 07:34 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-16 07:29 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 06:57 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-16 06:52 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 06:20 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-16 06:16 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 05:46 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-16 05:41 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 05:09 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-16 05:07 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 03:52 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-16 03:48 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 01:56 eval Evaluated by claude-haiku-4-5-20251001: +0.28 (Mild positive) 13,222 tokens +0.05
2026-03-16 01:25 eval Evaluated by claude-haiku-4-5-20251001: +0.23 (Mild positive) 13,248 tokens -0.13
2026-03-16 00:52 eval Evaluated by claude-haiku-4-5-20251001: +0.36 (Moderate positive) 13,120 tokens -0.06
2026-03-16 00:42 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-16 00:38 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-16 00:22 eval Evaluated by claude-haiku-4-5-20251001: +0.42 (Moderate positive) 14,447 tokens -0.00
2026-03-15 23:47 eval Evaluated by claude-haiku-4-5-20251001: +0.42 (Moderate positive) 13,324 tokens +0.06
2026-03-15 23:09 eval Evaluated by claude-haiku-4-5-20251001: +0.36 (Moderate positive) 13,698 tokens +0.21
2026-03-15 23:06 eval Evaluated by claude-haiku-4-5-20251001: +0.15 (Mild positive) 13,406 tokens -0.25
2026-03-15 22:30 eval Evaluated by claude-haiku-4-5-20251001: +0.40 (Moderate positive) 12,818 tokens
2026-03-15 21:50 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 21:43 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 21:08 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 21:04 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 20:31 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 20:26 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 19:53 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-15 19:51 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 19:16 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-15 19:13 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 18:31 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 18:27 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 17:18 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-15 17:17 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 16:05 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 16:05 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-15 15:28 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 15:26 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-15 14:51 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 14:46 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 14:16 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 14:08 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 13:40 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 13:30 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-15 13:01 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 12:50 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-15 12:21 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 12:10 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 11:43 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 11:32 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 11:03 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 10:49 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-15 10:24 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 10:10 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-15 09:42 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 09:30 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 09:02 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 08:50 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 08:21 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 08:07 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 07:36 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 07:25 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 06:57 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 06:47 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 06:22 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 06:13 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 05:47 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 05:37 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 05:12 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 05:00 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 04:37 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 04:25 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-15 04:03 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 03:49 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-15 03:25 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 03:09 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 02:50 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 02:32 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 02:15 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 01:57 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-15 01:40 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 01:22 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-15 01:11 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 00:52 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-15 00:44 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-15 00:13 eval Evaluated by llama-3.3-70b-wai-psq: +0.48 (Moderate positive)
2026-03-15 00:10 eval Evaluated by llama-3.3-70b-wai: -0.24 (Mild negative)
reasoning
Technical blog post on XML usage
2026-03-14 23:59 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-14 23:39 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-14 23:21 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-14 23:01 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-14 22:30 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-14 22:01 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-14 21:16 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-14 21:00 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-14 20:06 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) 0.00
2026-03-14 19:51 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-14 19:23 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-14 19:11 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-14 18:21 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-14 18:08 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-14 16:44 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-14 16:35 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-14 15:35 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-14 15:24 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-14 14:52 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-14 14:46 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-14 14:16 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive) +0.16
2026-03-14 14:12 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-14 13:40 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) -0.16
2026-03-14 13:35 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral) 0.00
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion
2026-03-14 13:01 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive)
2026-03-14 13:00 eval Evaluated by llama-4-scout-wai: -0.08 (Neutral)
reasoning
Technical blog post on using XML for a tax withholding estimator, no explicit human rights discussion