+0.50 Experts sound alarm after ChatGPT Health fails to recognise medical emergencies (www.theguardian.com)
195 points by simonebrunozzi 14 hours ago | 143 comments on HN | Moderate positive ~lite vlight-1.4
Summary ~lite Health rights Acknowledges
Exposes medical emergency failures
EQ 0.70
SO 0.60
TD 0.50
Light evaluation by llama-3.3-70b-wai · editorial channel only · no per-section breakdown available
Audit Trail 47 entries
2026-02-28 05:58 eval_success Light evaluated: Moderate positive (0.50) - -
2026-02-28 05:58 rater_validation_warn Light validation warnings for model llama-3.3-70b-wai: 0W 1R - -
2026-02-28 05:58 eval Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 05:28 eval_success Light evaluated: Moderate positive (0.50) - -
2026-02-28 05:28 rater_validation_warn Light validation warnings for model llama-3.3-70b-wai: 0W 1R - -
2026-02-28 05:28 eval Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 05:27 eval_success Light evaluated: Moderate positive (0.50) - -
2026-02-28 05:27 eval Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 05:27 rater_validation_warn Light validation warnings for model llama-3.3-70b-wai: 0W 1R - -
2026-02-28 05:10 eval_success Light evaluated: Moderate positive (0.50) - -
2026-02-28 05:10 eval Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 05:09 rater_validation_warn Light validation warnings for model llama-3.3-70b-wai: 0W 1R - -
2026-02-28 04:57 eval_success Light evaluated: Moderate positive (0.50) - -
2026-02-28 04:57 eval Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 04:42 eval_success Light evaluated: Strong positive (0.80) - -
2026-02-28 04:42 eval Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 04:20 eval_success Light evaluated: Moderate positive (0.50) - -
2026-02-28 04:20 eval Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 03:54 eval_success Light evaluated: Strong positive (0.80) - -
2026-02-28 03:54 eval Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 03:53 eval_success Light evaluated: Strong positive (0.80) - -
2026-02-28 03:53 eval Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 03:49 eval_success Light evaluated: Strong positive (0.80) - -
2026-02-28 03:49 eval Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 03:41 eval_success Light evaluated: Strong positive (0.80) - -
2026-02-28 03:41 eval Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 03:33 eval_success Light evaluated: Strong positive (0.80) - -
2026-02-28 03:33 eval Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 02:37 eval_success Light evaluated: Moderate positive (0.50) - -
2026-02-28 02:37 eval Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 02:31 eval_success Light evaluated: Strong positive (0.80) - -
2026-02-28 02:31 eval Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 02:27 eval_success Light evaluated: Moderate positive (0.50) - -
2026-02-28 02:27 eval Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 02:13 eval_success Light evaluated: Strong positive (0.80) - -
2026-02-28 02:13 eval Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 02:02 eval Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 01:58 eval Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 01:56 eval Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 01:44 eval Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 01:29 eval Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 01:26 eval Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 01:20 eval Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 01:18 eval Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 01:02 eval Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive)
2026-02-28 00:58 eval Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 00:50 eval Evaluated by llama-4-scout-wai: +0.80 (Strong positive)