Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

What do you mean by draft model? And how would one disable it? Cheers


A draft model is something that you would explicitly enable. It uses a smaller model to speculatively generate next tokens, in theory speeding up generation.

Here’s the LM Studio docs on it: https://lmstudio.ai/docs/app/advanced/speculative-decoding




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: