How does it compare with MiniCPM-Llama3-V 2.5 [0]? Based on what I see it seems ... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		vikrantrathore on May 29, 2024 \| parent \| context \| favorite \| on: Llama 3-V: Matching GPT4-V with a 100x smaller mod... How does it compare with MiniCPM-Llama3-V 2.5 [0]? Based on what I see it seems much better than Llama 3-V on the benchmarks. Also it can directly be tried on Huggingface Spaces to check the performance [1]. It has the dataset, code and fine-tuning details with screenshots of it running on Xiaomi 14 pro. It has strong OCR performance and supports 30+ languages. [0] https://github.com/OpenBMB/MiniCPM-V [1] https://huggingface.co/spaces/openbmb/MiniCPM-Llama3-V-2_5

wy35 on June 3, 2024 | [–]

This aged well...

https://github.com/OpenBMB/MiniCPM-V/issues/196

cpursley on May 29, 2024 | [–]

Woah, this actually did quite well on table data extraction. I wonder how this could be used for long documents. Maybe paired with some kind of hybrid rag approach.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact