Hacker News

Their benchmark is full of nonsense like this, and I'm amazed the account hasn't been banned for spam, given that most of its interactions on the site are promoting the benchmark.

They have Gemini 2.5 Flash ahead of Opus 4.6: https://aibenchy.com/compare/anthropic-claude-opus-4-6-mediu...

Absolutely worthless benchmark, but every release has a comment linking to this nonsense.
