nicolaib — HRO

Alpha This system is experimental. Scores and classifications are early-stage research and may be unreliable. Methodology →

nicolaib 1 karma 1y 2m on HN HN profile →

Coverage

We've seen 1 of ~3 submissions

Full eval: 0 Lite-only: 0 Unevaluated: 1

1 stories

1.		Show HN: Multimodal test cases for LLM evals (what we built and what broke)
		Hey HN, Nicolai here, co-founder of Rhesis AI.<p>Most eval frameworks were designed when LLM inputs ...
		3 points by nicolaib 8 days ago \| 0 comments \| skipped

build ee2b489+gzrb · deployed 2026-03-10 22:52 UTC · evaluated 2026-03-16 02:03:38 UTC