gpt 5.5 seems to be the recent leader overall, it make sense to include it , jus... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		maxdo 3 days ago \| parent \| context \| favorite \| on: Show HN: A new benchmark for testing LLMs for dete... gpt 5.5 seems to be the recent leader overall, it make sense to include it , just to see what you trade off for speed/open source nature vs cutting edge leader.

		help

khurdula 1 day ago | [–]

hey! we've evaluated gpt 5.5 as well along with other frontier models. gemini and gemma models outperform it across all three modalities.

Open source models like glm 4.7 still compete closely with table toppers.

khurdula 3 days ago | [–]

Yep, we will be adding it soon as well.

Consider applying for YC's Summer 2026 batch! Applications are open till May 4
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact