xdotli 15 karma 2y 8m on HN
Founder of BenchFlow.ai, a benchmark company.
Coverage
We've seen 3 of ~36 submissions
Full eval: 0 Lite-only: 0 Unevaluated: 3
3 stories
1. Chaos of Agent (agentsofchaos.baulab.info)
1 point by xdotli 5 days ago | 1 comment | skipped
2. Native CLI scaffolds consistently outperform OpenCode when using the same model (arxiv.org)
1 point by xdotli 5 days ago | 1 comment | skipped
3. We compare model quality in Cursor (cursor.com)
2 points by xdotli 5 days ago | 0 comments | skipped