AIs Will Increasingly Fake Alignment

Alpha This system is experimental. Scores and classifications are early-stage research and may be unreliable. Methodology →

+0.10	AIs Will Increasingly Fake Alignment (thezvi.substack.com)
	104 points by theptip 441 days ago \| 106 comments on HN \| Mild positive ~lite vlite-1.6

Summary ~lite AI Ethics Acknowledges

Discusses AI alignment research, no explicit human rights mention

EQ 0.50

SO 0.60

TD 0.70

Lite evaluation by llama-4-scout-wai · editorial channel only · no per-section breakdown available

Longitudinal · 5 evals

Audit Trail 10 entries

2026-03-07 21:05	eval_success	Lite evaluated: Mild positive (0.14)	- -
2026-03-07 21:05	eval	Evaluated by llama-4-scout-wai: +0.14 (Mild positive)
	reasoning AI alignment discussion, no explicit human rights mention
2026-03-07 20:58	eval_success	Lite evaluated: Mild negative (-0.10)	- -
2026-03-07 20:58	eval	Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative)
	reasoning AI alignment discussion
2026-03-06 14:48	eval_success	PSQ evaluated: g-PSQ=-0.080 (3 dims)	- -
2026-03-06 14:48	eval	Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) 0.00
2026-03-06 14:48	eval_success	PSQ evaluated: g-PSQ=0.440 (3 dims)	- -
2026-03-06 14:48	eval	Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive)
2026-03-06 14:44	eval_success	PSQ evaluated: g-PSQ=-0.080 (3 dims)	- -
2026-03-06 14:44	eval	Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral)

build ee2b489+gzrb · deployed 2026-03-10 22:52 UTC · evaluated 2026-03-08 02:36:46 UTC