Experts sound alarm after ChatGPT Health fails to recognise medical emergencies

+0.50	Experts sound alarm after ChatGPT Health fails to recognise medical emergencies (www.theguardian.com)
	195 points by simonebrunozzi 14 hours ago | 143 comments on HN | Moderate positive ~lite vlight-1.4

Summary ~lite Health rights Acknowledges

Exposes medical emergency failures

EQ 0.70

SO 0.60

TD 0.50

Light evaluation by llama-3.3-70b-wai · editorial channel only · no per-section breakdown available

Audit Trail 47 entries

2026-02-28 05:58	eval_success	Light evaluated: Moderate positive (0.50)	- -
2026-02-28 05:58	rater_validation_warn	Light validation warnings for model llama-3.3-70b-wai: 0W 1R	- -
2026-02-28 05:58	eval	Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 05:28	eval_success	Light evaluated: Moderate positive (0.50)	- -
2026-02-28 05:28	rater_validation_warn	Light validation warnings for model llama-3.3-70b-wai: 0W 1R	- -
2026-02-28 05:28	eval	Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 05:27	eval_success	Light evaluated: Moderate positive (0.50)	- -
2026-02-28 05:27	eval	Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 05:27	rater_validation_warn	Light validation warnings for model llama-3.3-70b-wai: 0W 1R	- -
2026-02-28 05:10	eval_success	Light evaluated: Moderate positive (0.50)	- -
2026-02-28 05:10	eval	Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 05:09	rater_validation_warn	Light validation warnings for model llama-3.3-70b-wai: 0W 1R	- -
2026-02-28 04:57	eval_success	Light evaluated: Moderate positive (0.50)	- -
2026-02-28 04:57	eval	Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 04:42	eval_success	Light evaluated: Strong positive (0.80)	- -
2026-02-28 04:42	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 04:20	eval_success	Light evaluated: Moderate positive (0.50)	- -
2026-02-28 04:20	eval	Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 03:54	eval_success	Light evaluated: Strong positive (0.80)	- -
2026-02-28 03:54	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 03:53	eval_success	Light evaluated: Strong positive (0.80)	- -
2026-02-28 03:53	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 03:49	eval_success	Light evaluated: Strong positive (0.80)	- -
2026-02-28 03:49	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 03:41	eval_success	Light evaluated: Strong positive (0.80)	- -
2026-02-28 03:41	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 03:33	eval_success	Light evaluated: Strong positive (0.80)	- -
2026-02-28 03:33	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 02:37	eval_success	Light evaluated: Moderate positive (0.50)	- -
2026-02-28 02:37	eval	Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 02:31	eval_success	Light evaluated: Strong positive (0.80)	- -
2026-02-28 02:31	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 02:27	eval_success	Light evaluated: Moderate positive (0.50)	- -
2026-02-28 02:27	eval	Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 02:13	eval_success	Light evaluated: Strong positive (0.80)	- -
2026-02-28 02:13	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 02:02	eval	Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 01:58	eval	Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 01:56	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 01:44	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 01:29	eval	Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 01:26	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 01:20	eval	Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive) 0.00
2026-02-28 01:18	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 01:02	eval	Evaluated by llama-3.3-70b-wai: +0.50 (Moderate positive)
2026-02-28 00:58	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive) 0.00
2026-02-28 00:50	eval	Evaluated by llama-4-scout-wai: +0.80 (Strong positive)

build f6413b5+ta8p · deployed 2026-02-28 05:57 UTC · evaluated 2026-02-28 06:03:59 UTC