I always wonder how consistent a given model's performance really is. Sometimes I ask for Claude Opus and the responses I get back are worse than other assistants' lowest-end models. Other times it surprises me and is clearly best in class.
Sometimes amid this variability a little survey pops up: "How's Claude doing this session from 1-5? 5 being great." And I suspect I'm in some experiment with extremely low performance. I'm actually at the point where I get the feeling peak-hour weekdays are terrible and odd-hour weekends are great, even when forcing a specific model.
While there is some non-determinism, it really does feel like performance is quite variable. It would make sense that they scale up and down depending on utilization, right? There was a post a week ago from Anthropic acknowledging terrible model performance in parts of August due to an experiment. Perhaps at peak hours GPT also has more datacenter capacity and doesn't get degraded as badly? No idea for sure, but it is frustrating when simple asks fail and complex asks succeed without it being clear to me why that may be.
Well, knowing the state of the tech industry, they probably have a different, legal-team-approved definition of "reducing model quality" than the face-value one.
After all, using a smaller context window, subbing in a differently quantized model, throttling response length, or rate limiting features aren't technically "reducing model quality".