mangolie's comments

mangolie · 2026-05-21T06:53:50 1779346430

You can't really do it this way. They have a parameter they control on the backend that can force how long it thinks.

mangolie · 2026-05-18T09:01:00 1779094860

In that case the author is simply saying the LLM is a p-zombie. Why does he not know what he's talking about?

Sankozi · 2026-05-18T10:34:11 1779100451

They are saying something similar to "LLM has no soul". Depending on context it might be something insightful, or (in a technical/scientific context) they are making a fool of themselves.

mangolie · 2026-05-18T08:55:36 1779094536

That does not explain how it is possible for a simulation to experience itself

orwin · 2026-05-21T13:38:05 1779370685

It doesn't. It has the illusion of experience.

mangolie · 2026-04-23T18:33:11 1776969191

Not for coding because it actually needs to read and write large files

baalimago · 2026-04-23T18:58:06 1776970686

Well, sort of. Imagine the case where it first scans the repo, then "intelligently" creates architecture files describing the project. The level of intelligence will create a varying quality of summary, with varying need of deep-scans on subsequent sessions. Level of intelligence will also increase comprehension of these architecture files.

Same principle applies when designing plans for complex tasks, etc. Token amount to grasp a concept is what matters.

jstummbillig · 2026-04-23T19:21:18 1776972078

Tbf, I have not really kept track of what is actually happening inside the "thinking" portion of recent releases. But last time I checked there was still a lot of verbosity and mistakes that outweighed the actual amount of required, usable code generation by a wide margin.

mangolie · 2026-01-27T06:01:41 1769493701

they cooked

mangolie · 2025-12-01T11:57:46 1764590266

https://x.com/deepseek_ai/status/1995452646459858977

Boom

andy12_ · 2025-12-01T13:31:03 1764595863

Do note that that is a different model. The one we are talking about here, DeepSeekMath-V2, is indeed overcooked with math RL. It's so eager to solve math problems that it even comes up with random ones if you prompt it with "Hello".

https://x.com/AlpinDale/status/1994324943559852326?s=20

yorwba · 2025-12-01T12:11:36 1764591096

That's a different model: https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Speciale

simianwords · 2025-12-01T12:02:07 1764590527

Oh you may be correct. Are these models general purpose or fine tuned for mathematics?