It is called 'jagged intelligence'. A lot progress was made in the last 2 years. Most notably reasoning models, tools use, harness progress. It takes time to build the skill to make those models useful, but they do provide a lot of value.
Ah yeah jagged intelligence is the perfect phrase for it. I do also get some value from them, both in coding and in images. I find it the least usable for information primarily because of the hallucination problem. I still do use it for that purpose but it's kind of annoying when it writes something that's wrong, and I find it out from a Google search later.
You are cooking them wrong. You absolutely must ask the model to do grounding work — search online, search in your files, cross-check with different agents. They are universal reasoning engines, not a fact recalling tool.
Check my 'deep research' skill, you will get the idea
In my experience, current agentic workflows are so slow, that for many cases it only makes sense to run them in parallel. So a lot of context switching. If we could have 10-100× faster token generation, we could have task delivery at the speed of human review.
Ten thousand years ago (around 8000 BCE), the global human population was estimated to have been roughly 5 million people. This is significantly smaller than the current population of just Poland (about 36 million).
In absolute numbers there might be more now, even if the percent is smaller. It is difficult to compare this things without having a specific place in mind.
The fact that humankind grew from 5M to 8.3B, while dramatically improving longevity and quality of life speaks volumes. Multiply life quality × population × life duration, not only "misery and destruction" is not the case, but you could rather see powers of positive technology influence.
Next generation of OS should have constant video and audio recognition by on device LLM. This will provide valuable context for a lot of scenarios. So instead of frequent copy-pasting we are used to, we can let agents access context of our whole workflows from different apps.
But Google is a very ill positioned candidate for such OS. I would rather trust Apple and local-first on-device models.
why not? if context stays on device and operator is in full control, what downsides are there? there is no observer here, only operator and full context of his activity at his fingers
>Sousveillance (/suːˈveɪləns/ soo-VAY-lənss) is the recording of an activity by a member of the public, rather than a person or organisation in authority, typically by way of small wearable or portable personal technologies.[14] The term, coined by Steve Mann,[15] stems from the contrasting French words sur, meaning "above", and sous, meaning "below", i.e. "surveillance" denotes the "eye in the sky" watching from above, whereas "sousveillance" denotes bringing the means of observation down to human level, either physically (by mounting cameras on people rather than on buildings) or hierarchically (with ordinary people observing, rather than by higher authorities or by architectural means).[16][17][23]
The goal is not to record activities of others, it is to have full context of operator activity. More like a personal knowledge base, that camera pointing to a cop.
How many times we have to hear again about Erdös problems? :) It sounds like a great achievement for humanity at first, but after a while they keep coming back!
why can't we simply raise the bar for posting? I remember semi-open platforms, where you were invited, had to earn the right to post comments and posts. and you could easily lose those rights when downvoted. its seems strange in the AI-bot era that we allow any entity the freedom of speech.
That's essentially how most small chatrooms work these days. Join a bigger GC or small Discord/Matrix/IRC and bad behavior gets flagged with impunity. But most of the big web forums like HN, Reddit, etc predate that and moving to a model like that would pretty much kill the sites as we know them.
Start by repeatedly occupying the same room and seeing if the other person doesn't leave. Then you can graduate to very brief eye contact. Then maybe 'hi' to the whole room and seeing if the person responds. Longer eye contact. A nod. Negative response at any point = back to square 1.
1/ Sorted. Some buyers have priority. They can be sorted by price paid, by amount of minutes listened, depends on the sale.
2/ Random with KYC. Everyone has the same chance to purchase.
reply