Claude Opus 4.6

Beta This system is experimental. Scores and classifications are early-stage research and may be unreliable. Methodology →

ND	Claude Opus 4.6 (www.anthropic.com)
	2346 points by HellsMaddy 28 days ago \| 1032 comments on HN ~lite vlite-2.0

Summary ~lite

Anthropic introduces Claude Opus 4.6, a state-of-the-art AI model with improved coding, agentic, and reasoning capabilities.

Lite evaluation by llama-4-scout-wai-psq · editorial channel only · no per-section breakdown available

Longitudinal · 6 evals

Audit Trail 12 entries

2026-03-05 10:29	eval_success	PSQ evaluated: g-PSQ=0.600 (3 dims)	- -
2026-03-05 10:29	eval	Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive) 0.00
2026-03-05 10:24	eval_success	PSQ evaluated: g-PSQ=0.600 (3 dims)	- -
2026-03-05 10:24	eval	Evaluated by llama-4-scout-wai-psq: +0.60 (Strong positive)
2026-03-05 10:22	eval_success	PSQ evaluated: g-PSQ=0.600 (3 dims)	- -
2026-03-05 10:22	eval	Evaluated by llama-3.3-70b-wai-psq: +0.60 (Strong positive) +0.60
2026-03-05 10:18	eval_success	PSQ evaluated: g-PSQ=0.000 (3 dims)	- -
2026-03-05 10:18	eval	Evaluated by llama-3.3-70b-wai-psq: 0.00 (Neutral)
2026-02-28 23:57	eval_success	Lite evaluated: Neutral (0.00)	- -
2026-02-28 23:57	eval	Evaluated by llama-4-scout-wai: 0.00 (Neutral)
	reasoning PR content, neutral tech product info
2026-02-28 23:53	eval_success	Lite evaluated: Neutral (0.00)	- -
2026-02-28 23:53	eval	Evaluated by llama-3.3-70b-wai: 0.00 (Neutral)
	reasoning PR tech update

build bab9649+v7kr · deployed 2026-03-05 19:25 UTC · evaluated 2026-03-03 07:16:53 UTC