More

nubg · 2026-05-19T16:23:35 1779207815

Interesting perspective

nubg · 2026-05-19T14:33:23 1779201203

ang benchmarks against state of the art?

binkHN · 2026-05-19T15:43:11 1779205391

It depends. You can expect a 5 to 15% performance hit depending on the task. In OpenBSD, security comes first and performance comes second.

nubg · 2026-05-19T08:42:48 1779180168

> The Service is provided "as is." We are not liable for downtime, data loss, or errors in token counting or cost calculation.

lmao

nubg · 2026-05-19T07:43:04 1779176584

source?

villgax · 2026-05-19T13:43:58 1779198238

https://github.com/567-labs/instructor/blob/main/instructor/...

nubg · 2026-05-19T07:42:20 1779176540

> When I come back to Slack, replies are often already sitting in drafts.

He must be a pleasure to work with

nubg · 2026-05-19T02:27:25 1779157645

The tweet was still written by an LLM, even though the system prompt included "only use lowercaps, making my text look like a kid in a csgo chat"

like_any_other · 2026-05-19T03:22:45 1779160965

What makes you think so?

siliconpotato · 2026-05-19T06:38:20 1779172700

There's a certain writing style, very short paragraphs, fair amount of repetition that just feels like you've read this post before. And you have, just on different topics but it's always the same feel.

But also lots of negatives to start the sentence, usually with a reinforcement e.g.

your password didn't help. your 2fa didn't help. you were never asked to authenticate. you were asked to authorize. completely different mechanism

nubg · 2026-05-19T02:21:39 1779157299

lmao at dang getting downvoted

nubg · 2026-05-19T02:19:13 1779157153

The article has hallmarks of being formulated by an LLM. Why should I bother to read it if I ca not be sure which parts are based on the prompt, and which parts are hallucinated from the LLMs world knowledge? Dear author, care to simply share your prompt with us?

s314 · 2026-05-19T02:26:52 1779157612

It wasn't "a prompt" but several prompts that transformed the raw experimental results to a blog.

> hallucinated from the LLMs world knowledge

This can't be true because I checked whether the content was consistent with the experimental outputs

Squeeze2664 · 2026-05-19T02:53:25 1779159205

The topic is interesting and you have my thanks for taking the time to look into it and prepare the post. Would you say it's fair to say that if you didn't use LLMs to prepare the post, we would have no blog post at all? In that case, I think I lean more towards being OK with this usage of LLMs, as I'd rather have this content available than not. However, I can only read that one repeated sentence about "booleans" (Ctrl-F "Boolean" and you'll know what I mean) this many times before I start questioning the validity of the entire document. It is not _good_ writing, to be frank.

nubg · 2026-05-18T18:50:10 1779130210

lmao at opus 4.7 being a downgrade

SwellJoe · 2026-05-18T20:04:47 1779134687

They made it less sycophantic. Which is a good thing for mental health, but maybe a bad thing for popularity contests.

nubg · 2026-05-18T13:23:51 1779110631

The question is ambiguous. It does not state that I want to wash the car there.

Why is the answer graded therefore on whether I took the car or not?

Maybe I wanted to meet a friend there?

GPT-5.4 might even price this in:

> "If somebody asks if they should walk or drive to a car wash 50m away, they probably don't need to actually wash the car there, otherwise the question doesn't make sense, they probably just need to get to that place efficiently. So walking it is."