home / threatroad.substack.com / item 47211970
Model Comparison
Model Editorial Structural Class Conf SETL Theme @cf/meta/llama-4-scout-17b-16e-instruct lite +0.10 ND Mild positive 0.80 0.00 Privacy Security @cf/meta/llama-3.3-70b-instruct-fp8-fast lite +0.40 ND Moderate positive 0.80 0.00 Digital Privacy
Section @cf/meta/llama-4-scout-17b-16e-instruct lite @cf/meta/llama-3.3-70b-instruct-fp8-fast lite Delta Preamble ND ND Article 1 ND ND Article 2 ND ND Article 3 ND ND Article 4 ND ND Article 5 ND ND Article 6 ND ND Article 7 ND ND Article 8 ND ND Article 9 ND ND Article 10 ND ND Article 11 ND ND Article 12 ND ND Article 13 ND ND Article 14 ND ND Article 15 ND ND Article 16 ND ND Article 17 ND ND Article 18 ND ND Article 19 ND ND Article 20 ND ND Article 21 ND ND Article 22 ND ND Article 23 ND ND Article 24 ND ND Article 25 ND ND Article 26 ND ND Article 27 ND ND Article 28 ND ND Article 29 ND ND Article 30 ND ND
Summary ~lite Privacy Security Acknowledges
Article discusses research on deanonymizing users, slight positive lean
Lite evaluation by llama-4-scout-wai · editorial channel only · no per-section breakdown available
Longitudinal
11 HN snapshots · 6 evals
Audit Trail
14 entries all eval pipeline all models llama-4-scout-wai llama-3.3-70b-wai
newest first
2026-03-02 03:14 model_divergence Cross-model spread 0.30 exceeds threshold (2 models) - - 2026-03-02 03:14 eval_success Lite evaluated: Mild positive (0.10) - - 2026-03-02 03:14
eval
Evaluated by llama-4-scout-wai : +0.10 (Mild positive) +0.30 reasoning Editorial discusses deanonymization research, tangential to rights
2026-03-02 03:12 eval_success Lite evaluated: Moderate positive (0.40) - - 2026-03-02 03:12
eval
Evaluated by llama-3.3-70b-wai : +0.40 (Moderate positive) 0.00 reasoning Exposing privacy abuses
2026-03-02 02:37 model_divergence Cross-model spread 0.60 exceeds threshold (2 models) - - 2026-03-02 02:37 eval_success Lite evaluated: Mild negative (-0.20) - - 2026-03-02 02:37
eval
Evaluated by llama-4-scout-wai : -0.20 (Mild negative) -0.30 reasoning Editorial discusses deanonymization research, tangential to rights
2026-03-02 02:35 eval_success Lite evaluated: Moderate positive (0.40) - - 2026-03-02 02:35
eval
Evaluated by llama-3.3-70b-wai : +0.40 (Moderate positive) +0.20 reasoning Exposing privacy abuses
2026-03-02 01:45 eval_success Lite evaluated: Mild positive (0.10) - - 2026-03-02 01:45
eval
Evaluated by llama-4-scout-wai : +0.10 (Mild positive) reasoning Editorial discusses deanonymization research, tangential to rights
2026-03-02 01:43 eval_success Lite evaluated: Mild positive (0.20) - - 2026-03-02 01:43
eval
Evaluated by llama-3.3-70b-wai : +0.20 (Mild positive) reasoning Exposing privacy abuses