Hacker News | damsta's comments

There is extra cost for >272K:

> For models with a 1.05M context window (GPT-5.4 and GPT-5.4 pro), prompts with >272K input tokens are priced at 2x input and 1.5x output for the full session for standard, batch, and flex.

Taken from https://developers.openai.com/api/docs/models/gpt-5.4
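
To make the quoted rule concrete, here is a rough sketch of how those multipliers would play out. The base prices are placeholders made up for illustration (not OpenAI's actual rates), `request_cost` is a hypothetical helper, and it assumes the multiplier applies to the whole request once input goes over 272K, which is how I read "for the full session":

```python
# Threshold and multipliers come from the quoted pricing note.
LONG_CONTEXT_THRESHOLD = 272_000  # input tokens
INPUT_MULTIPLIER = 2.0
OUTPUT_MULTIPLIER = 1.5


def request_cost(input_tokens, output_tokens, input_price_per_mtok, output_price_per_mtok):
    """Estimate the cost of one request in dollars (hypothetical helper)."""
    long_context = input_tokens > LONG_CONTEXT_THRESHOLD
    in_mult = INPUT_MULTIPLIER if long_context else 1.0
    out_mult = OUTPUT_MULTIPLIER if long_context else 1.0
    input_cost = input_tokens / 1_000_000 * input_price_per_mtok * in_mult
    output_cost = output_tokens / 1_000_000 * output_price_per_mtok * out_mult
    return input_cost + output_cost


# Placeholder base prices in $ per million tokens (assumed, not real rates).
BASE_IN, BASE_OUT = 1.00, 10.00

at_threshold = request_cost(272_000, 10_000, BASE_IN, BASE_OUT)
just_over = request_cost(272_001, 10_000, BASE_IN, BASE_OUT)
print(f"272,000 input tokens: ${at_threshold:.3f}")
print(f"272,001 input tokens: ${just_over:.3f}")
```

Under that reading, a single token over the threshold reprices the whole request, not just the tokens past 272K.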


Claude has the same deal. You can get a 1M context window, but it's gonna cost ya. If you run /model in Claude Code, you get:

    Switch between Claude models. Applies to this session and future Claude Code sessions. For other/previous model names, specify with --model.
    
       1. Default (recommended)   Opus 4.6 · Most capable for complex work
       2. Opus (1M context)        Opus 4.6 with 1M context · Billed as extra usage · $10/$37.50 per Mtok
       3. Sonnet                   Sonnet 4.6 · Best for everyday tasks
       4. Sonnet (1M context)      Sonnet 4.6 with 1M context · Billed as extra usage · $6/$22.50 per Mtok
       5. Haiku                    Haiku 4.5 · Fastest for quick answers
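
For a sense of scale, here is a back-of-the-envelope sketch using the per-Mtok prices shown in that menu. It assumes the two numbers are input/output prices and that they apply flat to every token in the request; the menu doesn't say either way. The token counts and the `session_cost` helper are made up for illustration:

```python
# (input $/Mtok, output $/Mtok) as shown in the /model menu above.
MENU_PRICES = {
    "opus-1m": (10.00, 37.50),
    "sonnet-1m": (6.00, 22.50),
}


def session_cost(model, input_tokens, output_tokens):
    """Estimate cost in dollars, assuming a flat per-token rate (an assumption)."""
    in_price, out_price = MENU_PRICES[model]
    return input_tokens / 1_000_000 * in_price + output_tokens / 1_000_000 * out_price


# Hypothetical request: 500K tokens in, 20K tokens out.
opus_cost = session_cost("opus-1m", 500_000, 20_000)
sonnet_cost = session_cost("sonnet-1m", 500_000, 20_000)
print(f"Opus (1M context):   ${opus_cost:.2f}")
print(f"Sonnet (1M context): ${sonnet_cost:.2f}")
```

Every turn in an agentic session resends the context, so a long session multiplies that per-request number quickly.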


Anthropic literally don't allow you to use the 1M context anymore on Sonnet and Opus 4.6 without it being billed as extra usage immediately.

I had 4.5 1M before that so they definitely made it worse.

OpenAI at least gives you the option of using your plan for it. Even if it uses it up more quickly.


Is that why it says rate limit all the time if you switch to a 1M model on Claude now? It kept giving me that, so I switched to an API account over the weekend for some vibe coding and ran up a huuuuge API bill by mistake, whooops.


Good find, and that's too small a print for comfort.


It's also in the linked article:

> GPT‑5.4 in Codex includes experimental support for the 1M context window. Developers can try this by configuring model_context_window and model_auto_compact_token_limit. Requests that exceed the standard 272K context window count against usage limits at 2x the normal rate.


Wow, that's diametrically the opposite point: the cost is *extra*, not free.


Diametrically opposite to tokens beyond 200K being literally free? As in, you only pay for the first 200K tokens and the remaining 800K cost $0.00?

I don't think that's a fair reading of the original post at all. Obviously what they meant by "no cost" was "no increase in the cost".


I can see that's what they mean now that I've read the replies, but when I first read that top comment I too parsed it as meaning 201k would cost the same as 999k (which admittedly did seem strange, hence I read the replies to confirm and sure enough that's not actually the case!)


Companies like Vercel, Lovable, and Stackblitz should pay salaries to each of these engineers. Their business succeeded only because Tailwind exists.


Companies like Vercel, Lovable, and Stackblitz should dissolve because their existence is a net negative for humanity.


Why is their existence a net negative for humanity?


Same reason as tobacco companies.


I agree with the sentiment that companies should help fund open source they depend on, but I think it's a stretch to say those businesses succeeded "only" because of Tailwind. It's a great project, although I'm pretty sure they would have figured out a way to work with CSS without it.


Welcome to the internet. Most of it is built by unknown OSS developers. How many people will you ask these companies to pay for?


I got interested and checked their website, but it gave me a bad vibe, not to mention the hefty price tag in addition to the subscription.


Can we get something like that in Gemini 1.5 Flash?


How does the install tracking work exactly? Do they fingerprint the device once you start a game?


Is having an end-of-life deadline a requirement in pre-mortems done for product launches at Google?


Imagine wasting time on cloning the repo, making a change, pushing it, creating a PR, all that time wasted because of hate. He could use that time to fix one of the 38 issues reported by other users...


> all that time wasted because of hate

Disagreement is not hate. I do not agree with the person who wants to get rid of the Taiwanese flag since as far as I'm concerned Taiwan is a country but I do not hate him for his request.

Please don't use the word hate for these purposes as all that does is downplay cases of real hate.


If you are taking a slice of your free time just to go through this whole process to create a PR, and you know you will never get that time back, and your only message is "Taiwan is not a country", then you are either a troll or a hate-fueled person. There is no middle ground, just read this part of their comment:

> TaiWan is not a country, is essentially a province of China. currently it's a district because of some history reasons.
>
> So, TaiWan flag should not be placed along with other REAL country flags, It's a big misleading to website visitors.

This is not just a disagreement.


And I thought March would be a bit kinder to the IT sector than February. We still have Friday left for a few more layoffs.


Cost of capital is still high. The risk-free rate of return is still high. It's amazing tech has lasted as well as it has. That said, there was a brief "flight to safety" to tech in '08 as well. I think Q3 is going to be painful.


I asked ChatGPT about taking responsibility for the layoff, here is the response:

> The blog post does not explicitly state that Mark Zuckerberg takes full responsibility for the layoffs.


I joined the waitlist and got a confirmation email saying "Pretty soon, you’ll be creating stunning designs in a flash" ;)

