Claude broke a ZIP password in a smart way
6 points by jgrahamc 7 days ago | 2 comments on HN
Pending Evaluation
This story is queued for evaluation. It will be processed in an upcoming batch.
Queued: 2026-03-14 18:53:58
Longitudinal 251 HN snapshots · 68 evals
Audit Trail 88 entries
2026-03-18 02:41 eval_success PSQ evaluated: g-PSQ=0.120 (3 dims) - -
2026-03-18 02:41 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-18 02:18 eval_success Lite evaluated: Mild negative (-0.24) - -
2026-03-18 02:18 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-18 02:18 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-18 00:48 eval_success PSQ evaluated: g-PSQ=0.120 (3 dims) - -
2026-03-18 00:48 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-18 00:33 eval_success Lite evaluated: Mild negative (-0.24) - -
2026-03-18 00:33 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-18 00:33 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-17 23:13 eval_success PSQ evaluated: g-PSQ=0.120 (3 dims) - -
2026-03-17 23:13 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-17 22:46 eval_success Lite evaluated: Mild negative (-0.24) - -
2026-03-17 22:46 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-17 22:46 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-17 21:54 eval_success PSQ evaluated: g-PSQ=0.120 (3 dims) - -
2026-03-17 21:54 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-17 21:29 eval_success Lite evaluated: Mild negative (-0.24) - -
2026-03-17 21:29 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-17 21:29 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-17 20:35 eval_success PSQ evaluated: g-PSQ=0.120 (3 dims) - -
2026-03-17 20:35 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-17 20:17 eval_success Lite evaluated: Mild negative (-0.24) - -
2026-03-17 20:17 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-17 20:17 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-17 19:22 eval_success PSQ evaluated: g-PSQ=0.120 (3 dims) - -
2026-03-17 19:22 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) -0.16
2026-03-17 19:05 eval_success Lite evaluated: Mild negative (-0.24) - -
2026-03-17 19:05 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-17 19:04 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-17 01:36 eval_success Lite evaluated: Mild negative (-0.24) - -
2026-03-17 01:36 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-17 01:36 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 1W 0R - -
2026-03-17 00:57 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) +0.16
2026-03-16 23:38 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 23:11 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 22:16 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 21:48 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 21:03 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 20:24 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) -0.16
2026-03-16 19:04 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 18:37 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) +0.16
2026-03-16 17:56 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 17:12 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 16:40 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 16:22 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) -0.16
2026-03-16 16:04 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 15:47 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) +0.16
2026-03-16 15:26 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 15:11 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 14:50 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 14:38 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 14:15 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 14:00 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 13:38 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 13:22 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) -0.16
2026-03-16 13:03 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 12:45 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) +0.16
2026-03-16 12:27 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 12:10 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 11:51 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 11:32 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 11:15 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 10:53 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 10:37 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 10:14 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 10:00 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 09:32 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 09:20 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 08:52 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 08:42 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 08:14 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 08:06 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 07:36 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 07:30 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 06:59 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 06:52 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 06:21 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 06:17 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 05:44 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 05:42 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 05:07 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 05:07 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 03:44 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive) 0.00
2026-03-16 03:43 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative) 0.00
reasoning
Technical content, neutral editorial stance, low transparency
2026-03-16 01:06 eval Evaluated by claude-haiku-4-5-20251001: +0.01 (Neutral) 10,783 tokens
2026-03-15 01:00 eval Evaluated by llama-4-scout-wai-psq: +0.12 (Mild positive)
2026-03-15 00:56 eval Evaluated by llama-4-scout-wai: -0.24 (Mild negative)
reasoning
Technical content, neutral editorial stance, low transparency