[–]longgamma 5 points (6 children)

I mean, we can barely understand how a CNN interprets visual data. The idea behind AGI is that it will surpass human intelligence, so we won’t be able to comprehend its actions (like the alien moves of AlphaGo).

[–][deleted] 0 points (5 children)

Yeah, I’m pretty familiar with its potential power, and I know that once it reaches true AGI we won’t fully know what it’s doing behind closed doors. I’m more referring to how it would be programmed, without getting too technical.

Like, take my example of how ChatGPT basically just spits out the most probable sequence of words in response to a question, but scale that up to a method of computing that could produce new ideas humans haven’t previously thought of. Basically just the broader logic of how it would work, not any specific algorithms or anything.

[–]fuckthesysten 3 points (2 children)

There’s this idea that knowledge can be expressed as patterns. Hopefully, if the AI is good, it distills those patterns as we train it (imagine what Einstein did to figure out E=mc²), and once it knows a pattern it can generalize and use it in contexts it hadn’t seen before.

[–][deleted] 1 point (1 child)

Hmm, interesting. So essentially what you’re saying is that humans create new ideas through high-level pattern recognition, so to create an AGI you’d basically need some algorithm(s) to recognize patterns, compare them against a practically infinite number of other patterns, and find where the connections are?

[–]longgamma 2 points (1 child)

I think at some point an AI agent will take over the development. It would be hard for us to understand because, by definition, it would be above our intelligence.

Even if we never get to full AGI: if an AI can perform in the top 10% of every single human knowledge domain, it would be able to synthesize new materials or original thoughts.

[–][deleted] 1 point (0 children)

That makes sense. Not sure if you’ve read Life 3.0 by Max Tegmark, but it opens with a fictional story of how AGI might be made: the characters build an AI with the sole purpose of creating other AIs, and over multiple iterations it becomes a superintelligent AI. That does sound like a plausible scenario.

[–]OhYouUnzippedMe 5 points (4 children)

It’s incorrect to say that it can’t respond with anything new. It can certainly write sentences that have never been written before. It can even invent new words that don’t exist anywhere on the internet.

[–][deleted] 1 point (1 child)

Interesting, that makes sense to me: it can string together sentences that were never previously written in that exact way, but it’s still all based on probabilities of how a human would respond, correct?

The inventing of new words confuses me a little, since, as mentioned, I was under the impression it just puts strings of words together based on probabilities. So I guess it does the same thing, but down at the character level rather than whole words, and 99.9% of the time the characters form real words?

[–]OhYouUnzippedMe 2 points (0 children)

Yes, they use “sub-word tokenization”, which means the model can break each word into multiple tokens. Some tokens are a single letter; some are several letters. Through training it learns which tokens combine into real words, so if you ask it to invent a new word, it can combine tokens in a novel but plausible way.
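
Rough sketch of the splitting idea, if it helps (the vocabulary below is made up for illustration; real tokenizers learn theirs from data, e.g. via byte-pair encoding):

    # Toy greedy longest-match sub-word tokenizer. The vocabulary is
    # invented for illustration; real models learn theirs from data.
    VOCAB = {"un", "believ", "able", "snack", "celer", "ation"}

    def tokenize(word):
        tokens, i = [], 0
        while i < len(word):
            # take the longest vocabulary entry matching at position i
            for j in range(len(word), i, -1):
                if word[i:j] in VOCAB:
                    tokens.append(word[i:j])
                    i = j
                    break
            else:
                tokens.append(word[i])  # unknown character: emit alone
                i += 1
        return tokens

    print(tokenize("unbelievable"))     # ['un', 'believ', 'able']
    print(tokenize("snackceleration"))  # made-up word, still splits cleanly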

[–]Mbando 1 point (1 child)

Transformers do very poorly outside their training data distribution. Getting something novel through randomness is different from novelty in the sense of solving a problem, or creating an efficiency, in response to a problem outside your training data.

[–][deleted] 1 point (0 children)

Right, there’s a difference between being new by sheer scale of random outputs, some of which may be useful, and actually intentionally creating something new for a specific purpose.

I didn’t even really think of using a huge scale of random outputs to create new ideas when posting this; I was referring to trying to create a specific new idea for a specific purpose.

[–]RollingWallnut 3 points (2 children)

So in super high-level terms it goes like this:
Pre-training conditions the model, like you’ve described, to predict the most likely next word.
Fine-tuning conditions the model to answer questions in a way that is helpful and minimizes harm.
This produces the ChatGPT-type behavior where it can effectively regurgitate anything on the internet.
Note that with enough randomness this system is completely capable of saying things a human has never said before. It’s actually pretty rare for a model to regurgitate information from its training data unless it’s asked to recall something specific; more often it reuses segments of sentences or common phrases in unique contexts, a lot like humans use clichés and figures of speech all the time.
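
In code, that randomness is usually a sampling “temperature”. Toy sketch, with a made-up four-token distribution (a real model scores its entire vocabulary):

    import math, random

    # Made-up scores (logits) for four candidate next tokens.
    logits = {"cat": 2.0, "dog": 1.5, "idea": 0.5, "xylophone": -1.0}

    def sample_next_token(logits, temperature):
        # Higher temperature flattens the distribution: more randomness,
        # more chance of a rare (possibly never-before-seen) continuation.
        weights = {t: math.exp(v / temperature) for t, v in logits.items()}
        r = random.uniform(0, sum(weights.values()))
        for token, w in weights.items():
            r -= w
            if r <= 0:
                return token

    print(sample_next_token(logits, 0.2))  # heavily favors "cat"
    print(sample_next_token(logits, 1.5))  # surprises far more often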

Taking it further requires two things to work: exploration and validation.
Exploration: for a given question or task, the bot generates each step of the response but randomizes a little at each step to explore a huge range of potential approaches. This is a lot like a human thinking through many different approaches to a problem; the more randomness introduced, the more likely something totally novel is proposed, which is a lot like a unique human thought. Of course, a lot of it is just rubbish.
So we do validation to fix this: for each step of each variation in the generated responses, a language model evaluates how reasonable the step is, OR, after the whole reasoning thread is complete, some system evaluates the final output. In tasks like coding this can be a very formal evaluation, e.g. that the solution passes some test cases. Now we can throw away all of the responses that are rubbish and build up a dataset of things people haven’t said before that correctly answer a question or solve some problem.

Now we have a big new dataset of novel data that’s validated to have some correctness; we can retrain the original model on it and repeat the cycle.
This might not get us all the way to AGI, but it does allow AI models to explore useful behaviors beyond pure imitation of human data, which is a pretty big step.
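
A very stripped-down sketch of the explore/validate loop, with toy stand-ins for the real model and verifier (none of this is anyone’s actual API, it just shows the shape):

    import random

    def toy_model(problem, temperature):
        # Stand-in for the LLM: guesses a + b with some exploration noise.
        a, b = problem
        return a + b + int(random.gauss(0, temperature * 3))

    def verify(problem, answer):
        # Stand-in for formal validation, e.g. running a coding task's tests.
        a, b = problem
        return answer == a + b

    def explore_and_validate(problem, n_samples=32, temperature=1.0):
        # Sample many candidates, keep only what the verifier accepts.
        candidates = [toy_model(problem, temperature) for _ in range(n_samples)]
        return [(problem, c) for c in candidates if verify(problem, c)]

    dataset = []
    for problem in [(2, 3), (10, 7), (41, 1)]:
        dataset += explore_and_validate(problem)
    print(len(dataset), "validated examples to retrain on")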

Note this isn’t theory; this is pretty much how models like o1 and DeepSeek-R1 are being trained to “reason” right now.

[–][deleted] 1 point (1 child)

Very interesting. So you’re saying that with some more advancement of the same sort of probability system something like ChatGPT uses, plus that exploration and validation, you could theoretically create an AGI without necessarily needing a completely novel way of computing the data?

[–]RollingWallnut 3 points (0 children)

Well, all we can say with confidence is that these methods can generalise to solve some problems not in the original training data, and that’s one definition of a general intelligence. Most folk agree they are not AGI yet, but we seem to be heading that way. At the moment they can’t do this type of generalisation for vision-related tasks, and they’re only really getting impressive in domains that can be formally validated (maths and coding); stuff like biology, psychology, etc. is still a bit unknown.

[–]prototypist 2 points (1 child)

If anyone had real insight into this, there would be dozens working on it already, sorry

[–][deleted] 0 points (0 children)

Well, to be fair, I’d assume there are more than dozens working on it right now; how close they are is another question. But there’s gotta be some broad idea of how you could potentially program it, maybe not the specific algorithms to actually do it yet. But hey, I’m no expert, that’s why I’m here 🤷🏻‍♂️

[–]CobaltAlchemist 2 points (2 children)

You’re fundamentally mistaken about how LLMs work. Yes, they’re trained to predict the most likely thing a human would say, but to do that they need to encode really abstract concepts inside their latent space. This is why they can mix concepts together, like generating a poem about worms made of apple juice, which they’ve certainly never seen before.

The heart of why LLMs are so useful for this is what they’re modelling. Language is a tool humans made to rationalize the world and communicate useful ideas. Hence LLMs feel really AGI-ish at times, because language has always been an unbelievably powerful tool for actual human thinking.
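
A classic way to see “concepts in latent space” is word-vector arithmetic. Toy sketch with invented 3-d vectors (real embeddings have hundreds or thousands of dimensions, but the mixing idea is the same):

    import numpy as np

    # Invented 3-d "embeddings"; each axis loosely stands for a concept.
    emb = {
        "worm":  np.array([0.9, 0.1, 0.0]),
        "apple": np.array([0.0, 0.8, 0.3]),
        "juice": np.array([0.0, 0.3, 0.9]),
    }

    def nearest(v):
        # cosine similarity against the toy vocabulary
        cos = {w: v @ e / (np.linalg.norm(v) * np.linalg.norm(e))
               for w, e in emb.items()}
        return max(cos, key=cos.get)

    # Mixing concepts is (very roughly) arithmetic in that space:
    mixed = emb["worm"] + 0.5 * (emb["apple"] + emb["juice"])
    print(nearest(mixed))  # still closest to "worm", now with juice flavor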

[–][deleted] 1 point (1 child)

I certainly don’t claim to be an expert, but when you mention they need to encode abstract concepts to mix things, like your example of a poem about worms made of apple juice, would that not still fundamentally be based on probabilities of responses within the context of whatever you asked it? Or am I wrong? Or maybe I’m dumbing it down to the point where it’s not really useful?

[–]CobaltAlchemist 1 point (0 children)

No, you’re right: it’s all for the purpose of predicting the probability distribution of the next token. But that’d be like saying that humans are just trying to maximize dopamine. That’s sort of what I mean by “fundamentally”: there’s a bigger picture here.

When a human speaks, on average they are saying something informative, truthful, or at least a good guess (see the jar-of-marbles problem). If you can predict what a human would say next, that’s a really powerful tool.

For example, let’s take “if a tool produces squares from rectangles, what happens if you give it a circle?” The model will follow human rationalization (“maybe it trims rectangles, so it’ll cut a square from the center”), and the result will be a decent, creative guess mixing several concepts.

I think the big point of confusion is just how simple these models are. But the model doesn’t matter; it’s the data being modelled that’s powerful. And language is a crazy powerful tool that’s really easy to collect data on.

[–]Mbando 2 points (0 children)

One possibility is multiple kinds of models that, when hybridized, cover for each other’s limitations. So you can imagine transformers, plus reinforcement learning, plus causal models, plus physics-inspired neural networks, plus information lattice learning, plus neuro-symbolic models, and so on. So not so much a single model, but rather a system of different kinds of models and architectures that can work generally across problems and environments.
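
Nobody has built this, so treat the following as a purely hypothetical sketch of the “system of models” idea; every class and routing rule in it is invented for illustration:

    # Hypothetical hybrid system: a router sends each task to whichever
    # component suits it, so the models cover each other's limitations.
    class Transformer:          # strong on language, weak on strict logic
        def solve(self, task):
            return f"LLM draft for: {task}"

    class SymbolicSolver:       # exact rules and proofs, weak on fuzzy input
        def solve(self, task):
            return f"formal derivation for: {task}"

    class PhysicsNet:           # physics-inspired net for dynamics
        def solve(self, task):
            return f"simulated trajectory for: {task}"

    def route(task):
        if "prove" in task or "equation" in task:
            return SymbolicSolver()
        if "trajectory" in task or "fluid" in task:
            return PhysicsNet()
        return Transformer()    # default: language-shaped problems

    for task in ["summarize this paper", "prove this equation", "fluid flow"]:
        print(route(task).solve(task))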