Hacker News
maxdo | 3 days ago | on: Show HN: A new benchmark for testing LLMs for dete...
GPT 5.5 seems to be the recent leader overall, so it makes sense to include it, just to see what you give up in exchange for speed and open-source availability compared with the cutting-edge leader.
khurdula | 1 day ago
Hey! We've evaluated GPT 5.5 as well, along with other frontier models. Gemini and Gemma models outperform it across all three modalities.
Open-source models like GLM 4.7 still compete closely with the table toppers.
khurdula | 3 days ago
Yep, we will be adding it soon as well.