+0.10 AIs Will Increasingly Fake Alignment (thezvi.substack.com)
104 points by theptip 441 days ago | 106 comments on HN | Mild positive ~lite vlite-1.6
Summary ~lite AI Ethics Acknowledges
Discusses AI alignment research, no explicit human rights mention
EQ 0.50
SO 0.60
TD 0.70
Lite evaluation by llama-4-scout-wai · editorial channel only · no per-section breakdown available
Longitudinal · 5 evals
+1 0 −1 HN
Audit Trail 10 entries
2026-03-07 21:05 eval_success Lite evaluated: Mild positive (0.14) - -
2026-03-07 21:05 eval Evaluated by llama-4-scout-wai: +0.14 (Mild positive)
reasoning
AI alignment discussion, no explicit human rights mention
2026-03-07 20:58 eval_success Lite evaluated: Mild negative (-0.10) - -
2026-03-07 20:58 eval Evaluated by llama-3.3-70b-wai: -0.10 (Mild negative)
reasoning
AI alignment discussion
2026-03-06 14:48 eval_success PSQ evaluated: g-PSQ=-0.080 (3 dims) - -
2026-03-06 14:48 eval Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral) 0.00
2026-03-06 14:48 eval_success PSQ evaluated: g-PSQ=0.440 (3 dims) - -
2026-03-06 14:48 eval Evaluated by llama-4-scout-wai-psq: +0.44 (Moderate positive)
2026-03-06 14:44 eval_success PSQ evaluated: g-PSQ=-0.080 (3 dims) - -
2026-03-06 14:44 eval Evaluated by llama-3.3-70b-wai-psq: -0.08 (Neutral)