
nosuchthing 18 days ago, on: I want to wash my car. The car wash is 50 meters a...

LLMs can't reach anything in their training data that is less likely than the statistically most common token, so they add random jitter when picking the next token. With that randomness come statistically irrelevant results.
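For anyone who wants to see the "random jitter" concretely: it is usually implemented as temperature sampling over the model's next-token scores. Below is a rough sketch of that idea, not any particular model's decoding code. The token list, the logit values, and the function names (`softmax`, `greedy_pick`, `sample_pick`) are all made up for illustration.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; lower temperature sharpens them."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def greedy_pick(logits):
    """Always return the index of the single most likely token."""
    return max(range(len(logits)), key=lambda i: logits[i])


def sample_pick(logits, temperature=1.0, rng=None):
    """Draw a token index at random, weighted by the softmax probabilities."""
    rng = rng or random.Random()
    probs = softmax(logits, temperature)
    return rng.choices(range(len(logits)), weights=probs, k=1)[0]


# Hypothetical toy vocabulary and scores (assumed values, not from a real model).
tokens = ["drive", "walk", "fly"]
logits = [2.0, 1.5, -1.0]

print("greedy: ", tokens[greedy_pick(logits)])
print("sampled:", tokens[sample_pick(logits, temperature=1.0)])
```

Under these assumed scores, greedy decoding always returns the top token, while sampling sometimes returns the runner-up. Pushing the temperature toward zero makes sampling behave almost like greedy decoding.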