nicolaib 1 karma 1y 2m on HN
Coverage
We've seen 1 of ~3 submissions
Full eval: 0 Lite-only: 0 Unevaluated: 1
1 story
1. Show HN: Multimodal test cases for LLM evals (what we built and what broke)
Hey HN, Nicolai here, co-founder of Rhesis AI. Most eval frameworks were designed when LLM inputs ...
3 points by nicolaib 8 days ago | 0 comments | skipped