Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

LLMs can't access the training data that's less than the statistically most common token, so they use a random jitter.

With that randomness comes statistically irrelevant results.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: