Human coders are still better than LLMs

Alpha This system is experimental. Scores and classifications are early-stage research and may be unreliable. Methodology →

ND	Human coders are still better than LLMs (antirez.com)
	655 points by longwave 285 days ago \| 735 comments on HN ~lite vlite-2.0

Summary ~lite

Human coders still excel over LLMs in creativity and problem-solving.

Lite evaluation by llama-4-scout-wai-psq · editorial channel only · no per-section breakdown available

Longitudinal · 5 evals

Audit Trail 12 entries

2026-03-05 12:20	eval_success	PSQ evaluated: g-PSQ=0.280 (3 dims)	- -
2026-03-05 12:20	eval	Evaluated by llama-4-scout-wai-psq: +0.28 (Mild positive)
2026-03-05 12:13	eval_success	PSQ evaluated: g-PSQ=0.323 (3 dims)	- -
2026-03-05 12:13	eval	Evaluated by llama-3.3-70b-wai-psq: +0.32 (Moderate positive)
2026-03-02 12:49	model_divergence	Cross-model spread 0.38 exceeds threshold (2 models)	- -
2026-03-02 12:49	eval_success	Lite evaluated: Moderate positive (0.38)	- -
2026-03-02 12:49	eval	Evaluated by llama-4-scout-wai: +0.38 (Moderate positive) 0.00
	reasoning Editorial discussing AI vs human coding capabilities
2026-03-02 12:44	model_divergence	Cross-model spread 0.38 exceeds threshold (2 models)	- -
2026-03-02 12:44	eval_success	Lite evaluated: Moderate positive (0.38)	- -
2026-03-02 12:44	eval	Evaluated by llama-4-scout-wai: +0.38 (Moderate positive)
	reasoning Editorial discussing AI vs human coding capabilities
2026-03-02 12:41	eval_success	Lite evaluated: Neutral (0.00)	- -
2026-03-02 12:41	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral)
	reasoning Technical post with neutral rights stance

build ee2b489+gzrb · deployed 2026-03-10 22:52 UTC · evaluated 2026-03-08 02:36:46 UTC