1 story
spicypete 150 karma 2y 3m on HN
Stories: 1 (0 evaluated) | Avg HRCB: ND | Avg SETL: ND | Avg Conf: ND
1. Measuring AI Ability to Complete Long Tasks (metr.org)
HRCB -- L | E 0.00 | S --
247 points by spicypete 69 days ago | 193 comments | hrcb AI research