ND AI is great at writing code. It's terrible at making decisions (untangle.work)
15 points by kdbgng 5 days ago | 4 comments on HN ~lite vlite-2.0
Summary ~lite
AI-generated code requires human judgment for architecture and decision-making.
Lite evaluation by llama-4-scout-wai-psq · editorial channel only · no per-section breakdown available
Longitudinal 55 HN snapshots · 22 evals
+1 0 −1 HN
Audit Trail 42 entries
2026-03-13 08:57 eval_success PSQ evaluated: g-PSQ=0.280 (3 dims) - -
2026-03-13 08:57 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 08:35 eval_success Lite evaluated: Moderate negative (-0.34) - -
2026-03-13 08:35 eval Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
reasoning
The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 08:35 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-03-13 08:16 eval_success PSQ evaluated: g-PSQ=0.280 (3 dims) - -
2026-03-13 08:16 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 07:56 eval_success Lite evaluated: Moderate negative (-0.34) - -
2026-03-13 07:56 eval Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
reasoning
The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 07:56 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-03-13 07:32 eval_success PSQ evaluated: g-PSQ=0.280 (3 dims) - -
2026-03-13 07:32 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 07:13 eval_success Lite evaluated: Moderate negative (-0.34) - -
2026-03-13 07:13 eval Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
reasoning
The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 07:13 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-03-13 06:53 eval_success PSQ evaluated: g-PSQ=0.280 (3 dims) - -
2026-03-13 06:53 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 06:36 eval_success Lite evaluated: Moderate negative (-0.34) - -
2026-03-13 06:36 eval Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
reasoning
The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 06:36 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-03-13 06:11 eval_success PSQ evaluated: g-PSQ=0.280 (3 dims) - -
2026-03-13 06:11 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 05:59 eval_success Lite evaluated: Moderate negative (-0.34) - -
2026-03-13 05:59 eval Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
reasoning
The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 05:59 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-03-13 05:34 eval_success PSQ evaluated: g-PSQ=0.280 (3 dims) - -
2026-03-13 05:34 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 05:24 eval_success Lite evaluated: Moderate negative (-0.34) - -
2026-03-13 05:24 rater_validation_warn Lite validation warnings for model llama-4-scout-wai: 0W 1R - -
2026-03-13 05:24 eval Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
reasoning
The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 04:57 eval_success PSQ evaluated: g-PSQ=0.280 (3 dims) - -
2026-03-13 04:57 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 04:49 eval_success Lite evaluated: Moderate negative (-0.34) - -
2026-03-13 04:49 eval Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
reasoning
The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 04:19 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 04:14 eval Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
reasoning
The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 03:41 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 03:39 eval Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
reasoning
The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 03:05 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 03:04 eval Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
reasoning
The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 02:30 eval Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive)
2026-03-13 02:30 eval Evaluated by llama-4-scout-wai: -0.34 (Moderate negative)
reasoning
The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,