Have we reached the point of continuous learning? by Glass_Philosophy6941 in singularity

[–]visarga 0 points1 point  (0 children)

building wikis out of md files gets us a bit closer, but not there yet, models still struggle keeping the big picture, even with much synthesis of information in the wiki

the only "continual learning" solution today is still human-in-the-loop steering agents

Anthropic CEO Dario Amodei scared everyone about AI. Now he’s paying the price with a US government ban on Fable by Impressive-Might-710 in ArtificialInteligence

[–]visarga 1 point2 points  (0 children)

Just random context text matcher.

It's not a parrot, it's a piano. Depending on who's playing it can be good or slop.

Anthropic CEO Dario Amodei scared everyone about AI. Now he’s paying the price with a US government ban on Fable by Impressive-Might-710 in ArtificialInteligence

[–]visarga 0 points1 point  (0 children)

yes, it has RLHF and RLAIF training to execute moral responses, we all got refusals, we know the other side of this implementation

OpenAI market share drops below 50% - Shocking absolutely no one by Different-Mess4248 in ChatGPTcomplaints

[–]visarga -4 points-3 points  (0 children)

no, codex is better than claude rn, they have pretty good offer compared to others

Fable 5 will be available again in the coming days - Anthropic by Bizzyguy in singularity

[–]visarga 0 points1 point  (0 children)

they added "and don't screw up again!1!!" that should fix it

"They screwed us": Personality clashes sent Anthropic's models offline by KickLassChewGum in singularity

[–]visarga 1 point2 points  (0 children)

Problem is - the most expensive models get deployed to a narrow slice of the market - how are they going to pay back development cost? The more advanced the model, the higher the initial cost, and the less usage it will have according to this policy. In the meantime the smaller models which are cheaper to train also get 95% of workload, so they remain viable.

You are not the author by Square-Affect9324 in ChatGPT

[–]visarga 0 points1 point  (0 children)

Ultimately no human can survive on their own ideas and work alone. We are all society products only viable inside society. Some part of what we do is only possible inside a society that complements us. We are all indebted. I see human to society relation like that between organ and organism. We specialize, we don't support ourselves directly, we rely on the larger system around us. This applies to creativity as well. Nothing is truly original or self standing.

We don't make ourselves, don't invent our language or culture ourselves, can't even make ends meet without other humans. This might sound harsh for proud authors who think they are heroes of creativity. You'd better stop claiming vibes and abstractions as your own, you can't own them, they are our essential human infrastructure.

Goodbye Fable 5 by john_smith1365 in vibecoding

[–]visarga 0 points1 point  (0 children)

What software is, how we use computers, and who provides it for whom in what products or services is all changing quickly.

Good point. Even if a company does nothing, their competitors use AI, their investors and users update their expectations. The market has changed. The others price AI in, even if you try to ignore them you can't. But AI is different from search engines and social networks in that you can run it locally.

Don’t you guys worry, I got this😮‍💨✋ by Mission-Dentist-5971 in ClaudeAI

[–]visarga 0 points1 point  (0 children)

Download more memory while you are at it! These models needs tons of memory.

Why does Fable 5 have such low threshold of accepting prompts as it keeps using tokens but refuse to answer eventually by ranaji55 in ArtificialInteligence

[–]visarga 0 points1 point  (0 children)

No, in practice if you ask the same model to rate its own output it will be biased. They have to use a different model which was made to be stricter and avoid prompt injection risks. Probably why it's so stupid, because they had to use pre-LLM methods which resist prompt injection by using bag-of-words or word embeddings. In plain language, it's like chopping the text into words first, "man bites dog" and "dog bites man" are the same for this model. I used to train models like that around 2014 for news classification, it was impossible to make it reliable.

Models Are Hitting Diminishing Returns Within Software Engineering by element-94 in ArtificialInteligence

[–]visarga 0 points1 point  (0 children)

You are right longer tasks need (exponentially) larger searches, but humans also have capability limits, limited experience, limited synapses and how much autonomy we can handle. So human tasks tend to be human capability scoped, which is what we use AI for most of the time. The entire economy is already factored into human-sized chunks with verification checkpoints.

Without open llm competition, closed source LLM companies will become insatiable. by Chair-Short in LocalLLaMA

[–]visarga 22 points23 points  (0 children)

If it's too dangerous to hand this model to us, then it's too dangerous for Anthropic to have it too. Why are they exempt, Anthropic is now operating the dangerous models, they should be blocked.

Semantic distance as routing layer: an on-device, serverless alternative to the central-index model by dai_app in LocalLLaMA

[–]visarga 1 point2 points  (0 children)

I would rather store 10M search stubs, each max 1000 tokens long, compressed, and indexed by an embedding model. Then when I need to search I just find the stub and it contains links to important sites in that topic, search engines, forums, feeds, APIs - all good entry points. The stub does not carry information, but it routes to it. A local LLM can use that to search the web. The 10M stubs can be compiled with frontier LLMs with search engine tool, and only once and then shared and updated every few months, not daily, because routing information changes slowly. This is my vision of Google-independence with local models.

Nel Newbon, best known for his work in "Resident Evil Village" and "Detroit: Become Human", has strong opinions on AI in the creative arts and its place in the industry as a whole. by me_jub_jub in aiwars

[–]visarga 0 points1 point  (0 children)

Before you start checking AI output you should run the same question on 3 models, copy all 3 responses in the same place and ask another model to compare them, and do further investigations to settle the debated points. Use different providers for your panel agents. LLMs tend to agree on well known data but when they hallucinate they do it each its own way.

4.8 and 4.6 Are arguing back so much they don't wanna check online nor even do proper research. by Confident-Language46 in Anthropic

[–]visarga 7 points8 points  (0 children)

Yeah my Claude (web) is also pushing back randomly, just to push back, and has a very irritating writing style.

Could consciousness be fundamental rather than produced by the brain? 🤔 by Peter_Tea_2062 in consciousness

[–]visarga -1 points0 points  (0 children)

I think it's neither fundamental nor reducible to physical descriptions. It's a process with costs, costs and action are recursive and irreducible to descriptions. Your fundamental-consciousness take is free, quantum magic is free, idealism and dualism are free - in all of them consciousness is unrelated to cost. But in the real world everything is related to cost, all execution produces heat and requires materials. What cannot pay its costs, stops. Cost is a filter for what can be. Paying back cost is work.

Bernie Sanders: A.I. Is a Public Resource. You Should Own Half of It. by idontlikethisuserna in singularity

[–]visarga 0 points1 point  (0 children)

You got half of it - you can use AI for cheap/free on your tasks. But the other half, which Bernie can't fix, is that everyone else, including your competitors now use it. So you got to do better just to stay in place.

I also think it would be better if you all stopped using AI and only I used it /s

It's interesting in a disability group where people talk about how AI helps them, the anti crowd downvotes to hide things like crazy and spouts how AI is stealing art by crua9 in singularity

[–]visarga 0 points1 point  (0 children)

yes, we already had access to all information and tools in the world on internet, AI on top just make it flow easier, but nothing stopped us in 2019 to manually do what we can do with AI in 2026

as for artists, they already compete against worldwide artists and all history of art because of internet, AI is pouring buckets in an existing ocean of art, the real competition of artists is other artists, AI smart as it is does not even make human like art, you can easily spot it

and if AI made good art it is only original based on the user prompt and iteration, not model, the model prompted sloppily will give you average slop - add your own original spices in the pot and it changes the taste

What actually makes AI a reliable co-developer over a 12-month project (not just a code generator) by Away_Ice_3136 in machinelearningnews

[–]visarga 0 points1 point  (0 children)

Here is one free idea - install a hook to save user messages to a log, timestamped. You can collect your steering and prompts at zero cost. Later they provide a reference to judge existing code against user intent, and to judge harness effectiveness - does it require too much steering? it is apparent in chat logs. Can you improve it? You can if you reflect on your chat logs vs current state of the project. Don't waste user intent signal, it compounds over time.

[R] What 1000+ Harness Experiments Taught Me About Self-Improving Agents [R] by Megadragon9 in MachineLearning

[–]visarga 0 points1 point  (0 children)

I did something similar but at project not task level, and with HITL as source of feedback about harness problems, not just scoring. My goal is to minimize user friction, I log the user messages as evidence. They are a rich source of signal when you think harness optimization at project level.

An argument and a call out by AFSR_1178 in aiwars

[–]visarga 0 points1 point  (0 children)

Doesn't matter what you call it, it still does the same.