Hacker Newsnew | past | comments | ask | show | jobs | submit | sam_goody's commentslogin

The AI would confidently give him the wrong answer, since it has no way to provide the correct answer, and doesn't know its own limitations. (Or however you wish to describe "hallucinations", which is about as accurate as my description ;))

And he would think he has the right answer, perhaps write up an essay about his findings, which later AI bots will read and learn from, propgating the mistake...


The AI would confidently give him the wrong answer

There is irony here that does not sleep.


Wait, the author identified the shell as "Sphincterochila candidissima". Which is a living species of air-breathing land snail, a terrestrial pulmonate gastropod mollusk. Completely off.

Except that if you tried one-shotting your ticket twenty times at different hours of the day and different days of the week, you would have enough changes to make benchmarks even if you used the same model every time. Much moreso if you fiddled with the thinking or changed the prompt.

Because non-deterministic, because of constant updates and changes, and because the models are throttled according to number of users, releases, et al.


You never get "the same" Steph Curry, he might be tired, annoyed by a fan, getting older... but if he and I were to throw 100 3-pointers, we could all correctly guess who will perform better.


Good point.

But I use Codex and Claude daily (work and hobby respectively). And there are days where one or the other just seems to have gotten up on the wrong side of the bed. Or is just being lazy. Or is suddenly super-powered do everything including what i asked it not to. (To be fair, the same thing happens with myself. :/)

I am convinced that if I was bench-marking, I would be convinced these are different models on different days.

[This conviction may say more about me then about the model.]


That's also fair, Anthropic lobotomized their services a couple of times already. One week, you are in awe that the tools figure out everything, explain everything, consider everything, produce a clean fix... next week, they are completely useless.


> sn't the US building mass detention camps right now for all the brown people there?

Why would you think that?

> And arresting / detaining / demanding papers from any and everyone?

I have lots of friends from outside the U.S. that come regularly and don't find it onerous. Maybe it depends where you are coming from?

> With federal agents killing civilians?

OK, I agree that there are issues, and even very serious ones. Obviously, not on the level of China, but still serious issues. Nonetheless, what you see on left leaning media is not representative of what is happening on the ground throughout the U.S. Not even close.

IMO, the US is definitely positive wrt human rights. There are issues, but you can go to a No Kings protest, and live your life happily without issues, and it is hard to find another country that is nearly as forgiving. And it at least has people trying to spread concepts of individual liberty, vs most countries in Europe, almost all countries in Asia, and ALL Muslim countries, that are leaning to removing individual rights.


Let's face it - Grok is not nearly as popular among programmers as Claude or Codex, and that means that xAI is not able to vacuum all the data that his competitors have access to.

Cursor is installed on a LOT of computers.

Once Grok becomes the default engine, it will raise adoption.

More importantly, if you have Cursor installed all your data may be sent to their labs whether you use it or not (unfortunately - this is par for the course for all the LLMs, a la Microsoft).

That's worth a lot - especially considering that Cursor might also grow with the shift to more powerful local models and the fact that it has a respectable income stream.


I use the CLI often enough, but still most of my time is in a GUI. It just makes the diffs easier, the flow simpler, etc.

As such, I wanted to break into jj via the GUI, and only adopt the command line after I could visualize the concepts and differences in my head.

Alas, the GUI I tried - a VSCode plugin - did more to confuse me than to help, and it made it very easy to catastrophically rewrite the history by accident. I tried another UI and it kept failing and leaving me with cleanup work. I couldn't find a third UI that looked promising.

So, I gave up. One less jj user on the planet - no biggie. But I really wonder if it would be worth the effort for some of the jj pushers to try to get a proper UI in place - I bet I am not the only one that likes to learn visually.


Very nice. Good Luck!

Did the private equity buy the domain videojs.org (did it take control of the project and you somehow regained control after selling) or was this domain (and the project) always under your control?


TL;DR Modern gear is actually much better than what they wore a few generations ago. More flexible, waterproof, requiring less thought, better and overall lighter. Even with lots more effort and investment, there remains a significant difference between two brothers when one insists on wearing recreated ancient gear.

Well, whadaya know!

But I bet you didn't know that you can find modern pro hiking shoes that are even heavier than the old ones they recreated!


So, basically, a random site from their index of ~30,000 sites.

You can choose similar sites by index.

But what are the criterion to have your site listed here, or how it will prevent this from just becoming a massive gamified advertising index, or anything more about "why these?" is not obvious to me.

Can anyone explain what is special about these sites specifically, or where this project is going?



As someone who works extensively with youth, but is somewhat of a extrovert..

There are many reasons why people are introverts. Some of them are "broken" and could use a fix, some are healthy and should not be touched.

If the source is a lack of confidence, it should be dealt with. Sometimes the fix involves pushing oneself to talk beyond their comfort level, sometimes not.

If the source is a form of ASD or Autism spectrum, then it depends very much on what the "introvertness" is costing them. If the person is either suffering or is living out of touch with reality, that's broken. Sometimes talking is absolutely required to create a fix. (Over here, even more-so, getting guidance is a must. Which is a shame since the more a person on the spectrum needs guidance the less he is likely to accept it)

If the source is an understanding of one's relative position (eg. at a meeting of seniors) then that's fine, but too much of that can be unhealthy.

If it's from depression, than please don't introvert. Get professional help.

If you are so into your phone that you've forgotten the joy of interaction with humans, that's broken. Unfortunately, there are a lot of people in this category, and social media and AI are making things worse. (Nothing new though - 30 years ago I helped a kid who was watching 18+ hours of video, on a CRT! He was mighty introverted, and wanted nothing to do with us humans for weeks after starting treatment.)

If you are just a confident guy/gal that just happens to enjoy your own space, than all the power to you! From what I have heard there are a LOT of people that fall into this category, and I have even meet two or three of you!

But let me stress, even if things are broken, the advice in this article can often make things worse. And if it's not a problem for you, enjoy!


There is a popular [excellent non vibe-coded] web server called FrankenPHP; A port of PHP to Go bundled with Caddy.

Are there any other FrankenProjects out there that have had any success?

Were we so impressed by the concept of the original Frankenstein?

Is this a Freudian slip, that we are expecting these AI projects to turn on their creators?


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: