AI is great at writing code. It's terrible at making decisions

Alpha This system is experimental. Scores and classifications are early-stage research and may be unreliable. Methodology →

Model: @cf/meta/llama-4-scout-17b-16e-instruct lite ND @cf/meta/llama-4-scout-17b-16e-instruct lite +0.10 Compare

ND	AI is great at writing code. It's terrible at making decisions (untangle.work)
	15 points by kdbgng 5 days ago \| 4 comments on HN ~lite vlite-2.0

Summary ~lite

AI-generated code requires human judgment for architecture and decision-making.

Lite evaluation by llama-4-scout-wai-psq · editorial channel only · no per-section breakdown available

Longitudinal 55 HN snapshots · 22 evals

Audit Trail 42 entries

2026-03-13 08:57	eval_success	PSQ evaluated: g-PSQ=0.280 (3 dims)	- -
2026-03-13 08:57	eval	Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 08:35	eval_success	Lite evaluated: Moderate negative (-0.34)	- -
2026-03-13 08:35	eval	Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
	reasoning The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 08:35	rater_validation_warn	Lite validation warnings for model llama-4-scout-wai: 0W 1R	- -
2026-03-13 08:16	eval_success	PSQ evaluated: g-PSQ=0.280 (3 dims)	- -
2026-03-13 08:16	eval	Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 07:56	eval_success	Lite evaluated: Moderate negative (-0.34)	- -
2026-03-13 07:56	eval	Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
	reasoning The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 07:56	rater_validation_warn	Lite validation warnings for model llama-4-scout-wai: 0W 1R	- -
2026-03-13 07:32	eval_success	PSQ evaluated: g-PSQ=0.280 (3 dims)	- -
2026-03-13 07:32	eval	Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 07:13	eval_success	Lite evaluated: Moderate negative (-0.34)	- -
2026-03-13 07:13	eval	Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
	reasoning The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 07:13	rater_validation_warn	Lite validation warnings for model llama-4-scout-wai: 0W 1R	- -
2026-03-13 06:53	eval_success	PSQ evaluated: g-PSQ=0.280 (3 dims)	- -
2026-03-13 06:53	eval	Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 06:36	eval_success	Lite evaluated: Moderate negative (-0.34)	- -
2026-03-13 06:36	eval	Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
	reasoning The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 06:36	rater_validation_warn	Lite validation warnings for model llama-4-scout-wai: 0W 1R	- -
2026-03-13 06:11	eval_success	PSQ evaluated: g-PSQ=0.280 (3 dims)	- -
2026-03-13 06:11	eval	Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 05:59	eval_success	Lite evaluated: Moderate negative (-0.34)	- -
2026-03-13 05:59	eval	Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
	reasoning The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 05:59	rater_validation_warn	Lite validation warnings for model llama-4-scout-wai: 0W 1R	- -
2026-03-13 05:34	eval_success	PSQ evaluated: g-PSQ=0.280 (3 dims)	- -
2026-03-13 05:34	eval	Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 05:24	eval_success	Lite evaluated: Moderate negative (-0.34)	- -
2026-03-13 05:24	rater_validation_warn	Lite validation warnings for model llama-4-scout-wai: 0W 1R	- -
2026-03-13 05:24	eval	Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
	reasoning The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 04:57	eval_success	PSQ evaluated: g-PSQ=0.280 (3 dims)	- -
2026-03-13 04:57	eval	Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 04:49	eval_success	Lite evaluated: Moderate negative (-0.34)	- -
2026-03-13 04:49	eval	Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
	reasoning The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 04:19	eval	Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 04:14	eval	Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
	reasoning The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 03:41	eval	Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 03:39	eval	Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
	reasoning The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 03:05	eval	Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive) 0.00
2026-03-13 03:04	eval	Evaluated by llama-4-scout-wai: -0.34 (Moderate negative) 0.00
	reasoning The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,
2026-03-13 02:30	eval	Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive)
2026-03-13 02:30	eval	Evaluated by llama-4-scout-wai: -0.34 (Moderate negative)
	reasoning The content discusses AI's capabilities in writing code but emphasizes its limitations in making engineering decisions,

build ee2b489+gzrb · deployed 2026-03-10 22:52 UTC · evaluated 2026-03-16 02:03:38 UTC