More

canjobear · 2026-05-27T18:47:35 1779907655

As early as last year "AI psychosis" seemed to refer to people going crazy due to talking to ChatGPT too much. That was a useful term for a real phenomenon! Now it seems like it's been taken over to mean "thinking that AI is promising" which is more of a rhetorical bludgeon and less useful as a concept.

canjobear · 2026-05-14T20:42:00 1778791320

PhD students paying tuition would be highly unusual.

CrazyStat · 2026-05-14T20:51:40 1778791900

In STEM fields, yes. In humanities it’s not uncommon.

zipy124 · 2026-05-14T22:20:36 1778797236

No it's typical. It's just that your stipend is usually just x amount of Dollars + whatever tuition is so you never have to care what it is and you don't pay it directly per se, it's just included in your stipend. Someone pays for it at the end of the day though.

canjobear · 2026-05-08T01:14:30 1778202870

Is there a $10 billion "fix everything easily" button you have in mind for homelessness?

scythe · 2026-05-08T02:30:49 1778207449

Homelessness and visible homelessness need to be distinguished here. The large majority of homeless people are not the ones you notice on the streets. Most try to be discreet. Some have jobs. A person who lives in their car is considered homeless.

The best measure to reduce homelessness is to provide timely support for people who are being evicted from their homes before they lose their jobs (which they might still have) and before their mental health deteriorates. This is the point at which assistance is most effective. You have heard the saying, "an ounce of prevention is worth a pound of cure". Such programs have been applied to great effect in e.g. London.

The way to respond to people who have experienced chronic homelessness with complications is different and more difficult.

rgblambda · 2026-05-08T03:49:00 1778212140

The Simon Community where I live went around the city one December and counted how many rough sleepers there were. I forget the exact figure but it was less than 100. Meanwhile there were thousands classified as homeless due to being in temporary accomodation. And this is a part of the UK well known for having a homelessness problem.

jedimastert · 2026-05-08T02:46:36 1778208396

You know oddly enough, if you just put someone up in a real place to live for like a year, that's enough for the majority of people to get back on their feet.

AmazingEveryDay · 2026-05-08T03:32:17 1778211137

I'd like to believe that! Can you link to any research to back up your claim?

jedimastert · 2026-05-09T12:59:43 1778331583

There's a local charity I donate to that does this, they've got I think 10 or 15 apartments they put people up in for 3 months, and offer them various forms of placement assistance while they're there. In the last annual report they had I believe a 95% success rate with people they checked in with a year later.

Unfortunately this is pretty selective evidence, but I know for a fact they don't exclude people on the basis of having mental illness or addiction problems, I've worked with them personally.

pstuart · 2026-05-08T05:11:57 1778217117

https://www.nytimes.com/2022/06/14/headway/houston-homeless-...

kQq9oHeAz6wLLS · 2026-05-08T04:04:07 1778213047

[flagged]

pstuart · 2026-05-08T05:13:56 1778217236

Why are you singling out immigrants? The homeless in our metro area are very much domestically produced.

But you do help illustrate a concern: homelessness is ultimately a federal issue, as some states dump theirs on other more "accepting" states.

fragmede · 2026-05-08T02:25:15 1778207115

$10 billion would build a lot of homes. If you give homeless people homes, they're not homeless anymore.

The problem is also political, unfortunately. $10 billion isn't going to change zoning laws and a NIMBY attitude of freezing things in time.

If they like that point in time so much, they should build a museum, sheesh!

Dylan16807 · 2026-05-08T04:02:30 1778212950

> $10 billion would build a lot of homes. If you give homeless people homes, they're not homeless anymore.

But that doesn't answer the question. It's "a lot" of homes, but less than a tenth of what would be needed.

And you need a bunch of social workers too at minimum.

pstuart · 2026-05-08T05:26:09 1778217969

That wouldn't be enough to do the job but it would be a great start if it was done right. My point was we've flushed $50B (and likely far far more) and what do we get for it? High gas prices. So hurray for the push for renewables and EVs, but there's nicer ways to do it.

> And you need a bunch of social workers too at minimum

Ok, sure. Remember, we're spending the $50B that's been lit on fire so this gives us more jobs and a happier country. And that money circulates in the economy rather than expatriated profits by the defense contractors.

loglog · 2026-05-08T20:07:11 1778270831

The "fix everything" button is abolishing zoning laws, and its aggregate cost is negative. Aggregate cost is not the issue preventing problems from being solved.

pocksuppet · 2026-05-08T06:27:30 1778221650

how many homes can you buy for $10 billion? especially if you don't care too much about size, extra niceties, or location?

Tanoc · 2026-05-08T19:25:27 1778268327

Given a construction budget of $125,000 per residence for a particularly nice two bedroom single bathroom house of about 1000ft², that's 80,000. Estimates are that there are currently between 750,000 to 800,000 people that have no home right now. Taking the high number of 800,000 that $10,000,000,000 is 10% of those people housed. You could reasonably go down to $30,000 for a build for a single floor house of the same footprint if you used mass produced prefabs, and get 330,000 people housed, or over 41%. Do you realize how much that would uplift things if we suddenly had a 41% reduction in homelessness? Considering that companies like Google and OpenAI are throwing around hundreds of billions of dollars which never get circulated back into the wider economy, spending $10,000,000,000 to bring 330,000 people back into economic and societal participation sounds amazing. That's assuming a somewhat low yearly income of $45,000 per person, adding up to $14,850,000,000 in circulation, or a gain of almost $5,000,000,000 right there. Even if we only achieve half of any of this that's $7,000,000,000 for one year. Two years in and the cost has already been paid back and more.

This comes with the giant caveat that we exclude the external costs of such a huge project, like social welfare visits, probation or monitoring if needed, or even just placement programs. Likely those all combined would be a third of the total cost.

canjobear · 2026-05-07T19:00:43 1778180443

China at least has this. The stuff you get at the Ole Supermarket inside a shopping mall is different from the stuff you get from the little store facing the street on the ground floor of your apartment building.

canjobear · 2026-05-07T18:23:14 1778178194

On the contrary, the fact that they exclude free riders is what makes the whole thing possible.

randallsquared · 2026-05-07T18:51:29 1778179889

The price is so low that it makes no real difference at this point. I barely go to my local CostCo (on the edge of being worth the annual fee myself!) because it's so incredibly crowded at all times that the savings are only worth it for more expensive items. In contrast, there are several BJ's nearby and the closest one to me is often blissfully sparse. No idea how they manage to stay in business in that location, but it's really nice.

OutOfHere · 2026-05-07T19:14:35 1778181275

There is no such thing as a free rider in this context, only a fee rider. After all, other supermarkets do fine without a fee. This is not Amazon Prime with free shipping that we're talking about. Charging for just the item still works.

canjobear · 2026-05-07T16:32:35 1778171555

Also there are only 27 variables. Which made it hard for me to write a Snake clone until I figured out how to use Lists.

canjobear · 2026-05-05T20:57:14 1778014634

I think you're vastly underestimating how little of human intent is really encoded in language in a strict sense, and how much nontrivial inference of intents LLMs do every day with simple queries. This used to be an apparently insurmountable barrier in pre-LLM NLP, and now it is just not a problem.

Suppose I'm in a cold room, you're standing next to a heater, and I say "it's cold". Obviously my intent is that I want you to turn on the heater. But the literal semantics is just "the ambient temperature in the room is low" and it has nothing to do with heaters. Yet ChatGPT can easily figure out likely intent in situations like this, just as humans do, often so quickly and effortlessly that we don't notice the complexity of the calculation we did.

Or suppose I say to a bot "tell me how to brew a better cup of coffee". What is encoded in the literal meaning of the language here? Who's to say that "better" means "better tasting" as opposed to "greater quantity per unit input"? Or that by "cup of coffee" I mean the liquid drink, as opposed to a cup full of beans? Or perhaps a cup that is made out of coffee beans? In fact the literal meaning doesn't even make sense, as a "cup" is not something that is brewed, rather it is the coffee that should go into the cup, possibly via an intermediate pot.

If the bot only understands literal language then this kind of query is a complete nonstarter. And yet LLMs can handle these kinds of things easily. If anything they struggle more with understanding language itself than with inferring intent.

applfanboysbgon · 2026-05-06T02:18:58 1778033938

> Yet ChatGPT can easily figure out likely intent in situations like this, just as humans do

No, it is not "figuring out" anything, much less like a human might. Every time "I'm cold" appears in the training data, something else occurs after that. ChatGPT is a statistical model of what is most likely to follow "I'm cold" (and the other tokens preceding it) according to the data it has been trained on. It is not inferring anything, it is repeating the most common or one of the most common textual sequences that comes after another given textual sequence.

frozenseven · 2026-05-06T03:14:49 1778037289

>it is repeating the most common...

This nonsense hasn't been true since GPT-2, and even before that it was a poor description.

For instance, do you think one just solves dozens of Erdős problems with the "most common textual sequence": https://github.com/teorth/erdosproblems/wiki/AI-contribution...

applfanboysbgon · 2026-05-06T04:07:55 1778040475

A slight oversimplification, as LLMs are also capable of generating the most statistically plausible textual sequence, which can be a sequence not found in the dataset but rather a synthesized combination of the likely sequences of multiple preceding sets of tokens, but yes, that is in fact what it is doing. Computer software does what it is programmed to do, and LLMs are not programmed to do logical inference in any capacity but rather operate entirely on probabilities learned from a mind-bogglingly large corpus of text (influenced by things like RLHF, which is still just massaging probabilities).

The claims about solving Erdos problems have been wildly overstated, and notably pushed by people who have a very large financial stake in hyping up LLMs. Nonetheless, I did not say that LLMs are useless. If they are trained on sufficient data, it should not be surprising that correct answers are probabilistically likely to occur. Like any computer software, that makes them a useful tool. It does not make them in any way intelligent, any more than a calculator would be considered intelligent despite being completely superior to human intelligence in accomplishing their given task.

frozenseven · 2026-05-06T04:21:33 1778041293

>not programmed to do logical inference in any capacity

Yet have no problem doing so when solving Erdős problems. This isn't up for debate at this point.

>The claims about solving Erdos problems have been wildly overstated

These are verified solutions. They exist, are not trivial, and are of obvious interest to the math community. Take it up with Terence Tao and co.

>pushed by people who have a very large financial stake in hyping up LLMs

Libel.

>It does not make them in any way intelligent

Word games.

vincston · 2026-05-06T08:33:42 1778056422

Honestly big noobquestion: isn't math just very very nested patternmatching based on a few foundational operators? ive always felt, that im bad at math, cause i forget all the rules, but seeing solutions (and knowing the used pattern) always made "sense".

I always thought the hard math problems are so deeply nested or you have to remember trick xyz that people just didnt think about it yet..

frozenseven · 2026-05-06T12:39:05 1778071145

The amount of mathematical structures and transformations you can apply (the possible rules) is effectively infinite. Simply remembering the rules might work at first, but you'll soon run into the combinatorial explosion: https://en.wikipedia.org/wiki/Combinatorial_explosion

You could go a step further, and simply say "well, ok, then the LLMs are merely doing some form of incremental/heuristic search!". Yes, but at that point you'd also be hard-pressed to claim that humans themselves are doing anything beyond that. You run out of naturalistic explanations.

applfanboysbgon · 2026-05-06T04:33:12 1778041992

> This isn't up for debate at this point.

If by not up for debate, you mean that it is delusional and literally evidence of psychosis to suggest that computer software is doing something it is not programmed to do, you would be correct. Probabilistic analysis can carry you very, very far in doing something that looks like logical inference at the surface level, but it is nonetheless not logical inference. LLM models have been getting increasingly good at factoring in larger and longer contexts and still managing to generate plausibly correct answers, becoming more and more useful all the while, but are still not capable of logical inference. This is why your genius mathematician AGI consciousness stumbles on trivial logic puzzles it has not seen before like the car wash meme.

Mithriil · 2026-05-11T15:37:15 1778513835

> Probabilistic analysis can carry you very, very far in doing something that looks like logical inference at the surface level, but it is nonetheless not logical inference.

A statistical approximation of logical inference (as vague as I state it) could (and will) very well pass for logical inference, at least for the common people, whose logic skills are far from perfect.

Also, humans are certainly not capable of the perfect logical inference you speak of. And I get the irony of what I'm saying with such certitude. Logic is still framed in axioms that are framed in languages, we'll never truly get there. Ah, but absoluteness gets in the way of practicality.

Yet, here we are with a tool, that is maybe not at its prime yet, that equals and beat many human beings at logical inference on some problems that are pragmatically relevant. Should I say symptoms of logical inference at that point?

As to why LLMs capacity for (apparent) logical inference is only limited to specific use cases, I don't have a clue. But I'd like to argue that, humans are like that too.

frozenseven · 2026-05-06T04:47:33 1778042853

>delusional and literally evidence of psychosis to suggest that computer software is doing something it is not programmed to do

These are just insults and outright lies, and you know that. We're done here.

AI progress from here on out will be extra sweet.

card_zero · 2026-05-06T08:07:11 1778054831

You don't have the ability to predict progress, either.

frozenseven · 2026-05-06T11:38:25 1778067505

Well, I'm not clairvoyant, but this is a very easy prediction to make. And we're not talking about decades in the future, this is simply a matter of letting the near-future unfold.

goatlover · 2026-05-05T22:29:09 1778020149

The LLMs are doing this via chat, not by physically standing in a room inferring context. You have to prompt the LLM that you're in a room next to someone saying it's cold, the most likely answer being a desire to have temperature turned up. Of course that won't always be the case. Could be an inside joke, could be a comment with no intent to have the heat adjusted, could be a room where the heat can't be adjusted, could be a reference to someone's personality bringing down the temperature so to speak.

23dsfds · 2026-05-05T23:38:15 1778024295

Precisely.. this is what the bozo AI-accelerants don't understand.

What LLM's are is almost like a hacked-means of intuition. Its very impressive no doubt. But ultimately it isn't even close to what the well-trained human can infer at lightning speed when combined with intuition.

The LLM producers really ought to accept their existing investments are ultimately not going to yield the returns necessary for a viable self-sustaining business when accounting for future reinvestment needs, and instead move their focus towards understanding how to marry the human and LLM technology. Anthropic has been better on this front of course. OAI though? Complete diasaster.

mikestorrent · 2026-05-06T03:23:24 1778037804

> it isn't even close to what the well-trained human can infer at lightning speed when combined with intuition.

It's a lot closer to that than anything was five years ago. Do you really think we're going to be interacting with them the same way five years from now?

tyyyy3 · 2026-05-06T14:42:42 1778078562

This is an empty statement - if you pour in hundreds of billions should you not expect progress?

The question is will these firms be able to continue to spend at that rate. The managers of the firms don’t necessarily have control over that - ultimately punishment in the form of drop in stock price hurts many people involved and will force the management to act in the interest of marginal investors. Even Zuckerberg who has majority control had to concede when meta’s stock cratered to below 100.

quibono · 2026-05-05T21:47:32 1778017652

I know what you're getting at but those examples are reaching

nevertoolate · 2026-05-05T21:43:44 1778017424

it’s cold -> turn on the heater

I’d never just turn on the heater silently if someone said this to me. I think it means something else.

hackable_sand · 2026-05-05T21:46:32 1778017592

If someone just said "it's cold" then yeah that's kinda toxic.

If they said "turn on the heater" then you have no ambiguity

canjobear · 2026-05-03T01:06:47 1777770407

You could simulate your own brain in Minecraft. What do you conclude from this?

search_facility · 2026-05-03T01:28:41 1777771721

I can not simulate my brain, it's a huge stretch to imply this.

But with LLMs - anyone can simulate LLM. LLM can be simulated without any uncertainties in pen and paper and a lot of time. Does it mean that 100 tons of paper plus 100 years of time (numbers are just examples) calculating long formulae makes this pile of paper consiousness? Imho answer is definitive no.

thrownthatway · 2026-05-03T10:23:30 1777803810

I don’t think anyone is arguing the silicon is conscious.

Similarly the paper.

What about the agent doing the calculations.

He may be conscious. Or anyway, we can’t rule it out.

search_facility · 2026-05-03T15:21:43 1777821703

With both cases the agent doing something is human. And human beings are indeed conscious. Outside human needs LLM are useless.

Math, as a tool, is just a proxy for people using LLMs, as well as GPUs spending cycles on calculating the math

canjobear · 2026-05-01T15:39:18 1777649958

The softmax, after the network has been trained, yields an estimate of the probability in the training data, but it is not that probability itself.

jmalicki · 2026-05-01T15:57:11 1777651031

Which models are not trained with the log softmax as the loss function?

canjobear · 2026-05-01T16:09:57 1777651797

Softmax isn't a loss function. It is used to transform model outputs into positive numbers that sum to 1, so that they can be interpreted as probabilities, and then those numbers are passed into (typically) the cross entropy loss function. I think you mean, which models are trained using some function other than softmax to transform the model outputs. There are a number of alternatives to softmax, such as the ones described here https://www.emergentmind.com/topics/sparsemax

jmalicki · 2026-05-01T16:20:19 1777652419

The cross entropy loss function is softmax. They are one and the same.

canjobear · 2026-05-01T16:29:48 1777652988

They’re not. Cross entropy loss is E[-log q] where q is a probability. You could convert the model outputs x into probabilities using some other function like q = 1/Z x^2, and compute cross entropy loss just fine.

jmalicki · 2026-05-01T16:40:23 1777653623

Behold the softmax: https://docs.pytorch.org/docs/2.11/generated/torch.nn.CrossE...

canjobear · 2026-05-01T16:53:50 1777654430

Behold the actual definition of cross entropy: https://en.wikipedia.org/wiki/Cross-entropy

It's true that the PyTorch API conflates cross entropy and softmax, but they are separate concepts.

canjobear · 2026-04-30T15:50:30 1777564230

The mechanics of engines was understood at the beginning of the Industrial Revolution, and they were fully reproducible: all of which is true of LLMs today. An LLM is a bunch of floating point numbers and simple operations on them, all of which are fully known.

But the way that steam engines emergently transformed heat into work was not understood at the beginning of the Industrial Revolution. Figuring this out led to an entire new branch of physics, thermodynamics. Figuring out how big next-token predictors give rise to interesting systems is likely to lead to similarly new ideas.