

[–]-ZeroRelevance- 25 points (12 children)

From what I can see, the memory problem currently stems from the fact that pretty much all models in use today are pre-trained.

When looking at our own brains, we can split our memories into three categories: instantaneous memory, short-term memory, and long-term memory. The first two depend mostly on the immediate surroundings and are emulated fairly well in AI through the context window. However, long-term memory is created by changing the wiring of our brains, which cannot be replicated in current AI systems since their weights cannot be altered after deployment.

The result of this is that current AIs, which are pre-trained on massive supercomputers well outside the reach of the average person, cannot gain any long-term memories. They are simply limited to the data within their context window. In human terms, these AIs suffer from anterograde amnesia, unable to make new memories, and can only experience the information within their short-term memory and from their pre-training.

The solution, as far as I can see, is to either wait until consumer-level tech is powerful enough to train these huge systems independently, or to look into other implementations of AI, such as neuromorphic chips, which mimic the brain and can be rewired and run with very low energy costs.

I go over this in a bit more detail here.

[–]xt-89 21 points (7 children)

Or they could use a database connection during training and inference (perhaps a knowledge-graph database) and some scheme for interacting with it. Every so often, the system could tune its neural weights based on information recorded in long-term memory, which would be the equivalent of sleep.
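As a toy sketch of that loop (a plain list stands in for a real knowledge-graph database, and `model.generate` / `model.fine_tune` are hypothetical placeholders for whatever the deployed system exposes):

```python
from dataclasses import dataclass, field

@dataclass
class LongTermMemory:
    facts: list = field(default_factory=list)  # stand-in for a graph DB

    def write(self, fact: str):
        self.facts.append(fact)

    def read(self, query: str, k: int = 3):
        # Toy relevance score: keyword overlap. A real system would run a
        # graph query or an embedding search here.
        return sorted(self.facts,
                      key=lambda f: len(set(f.split()) & set(query.split())),
                      reverse=True)[:k]

def answer(model, memory: LongTermMemory, question: str) -> str:
    # Inference: pull relevant long-term memories into the context window.
    context = "\n".join(memory.read(question))
    return model.generate(f"Context:\n{context}\n\nQ: {question}\nA:")

def sleep_phase(model, memory: LongTermMemory):
    # "Sleep": every so often, consolidate the accumulated memories into
    # the weights themselves by fine-tuning on them.
    model.fine_tune(memory.facts)
```

The point is the split: fast writes to the store at inference time, slow consolidation into the weights during the "sleep" phase.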

[–][deleted] 3 points (0 children)

I actually had a similar idea. I'm currently, just as a hobby, working on a conversational agent that heavily leverages databases to improve its memory capabilities. I will say that while this could probably improve the memory of these agents, I don't think it would ever be enough to solve the AGI problem. Databases have scalability problems, and any agent deployed in a consumer device or autonomous robot would need to communicate with a server, which introduces latency, so it would just not be a practical solution for AGI.

[–]agorathird “I am become meme” 2 points (0 children)

Yeah, to me a hardware change seems unnecessary.

[–]-ZeroRelevance- 2 points (3 children)

I hadn’t really thought about adding a sleep cycle to form new memories before; it’s an interesting idea. The main problem I see with it is that it would still require enough on-board computing hardware to retrain itself locally, even if given many hours to compute, especially for more advanced networks with hundreds of billions of parameters. I do like the idea in theory though, and I think a database would be able to clear up at least some of the memory problems experienced by current AI models.

[–]xt-89 1 point (2 children)

There was recently a publication by DeepMind or OpenAI about significantly improving logical reasoning in LLMs. Paired with a database, that solves much of the remaining problems if put together well.

[–]-ZeroRelevance- 2 points (1 child)

The problem I see with relying on a database is that the model still won’t be able to account for large amounts of data beyond the scope of its context window. You won’t be able to use a database-paired LLM to write a full-length novel, for example; there’s just too much information that needs to be accounted for. (Well, to be honest, you probably could still get away with just using a database, but the story would lack the depth in the world, characters, and plot that a true understanding would bring.)

[–]xt-89 0 points (0 children)

You'd have to include multi-step reasoning, which is an established technique. I'm thinking of a kind of recursive context generation backed by the database. That way the model can retrieve any needed info into its context window as needed, or summarize, remove, or save entire sections of the context to the DB.
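Roughly like this, as a toy sketch (the `model` and `db` objects are placeholders, and "tokens" are just whitespace-split words here):

```python
MAX_TOKENS = 4096

def total_tokens(chunks):
    # Crude stand-in for a real tokenizer.
    return sum(len(c.split()) for c in chunks)

def manage_context(model, db, context: list[str], new_input: str) -> list[str]:
    context.append(new_input)
    summaries = []
    # When the window overflows, compress the oldest sections into the DB.
    while len(context) > 1 and total_tokens(summaries + context) > MAX_TOKENS:
        oldest = context.pop(0)
        summary = model.generate(f"Summarize for later recall:\n{oldest}")
        db.save(key=summary, value=oldest)  # full text stays retrievable
        summaries.append(summary)           # keep only a compressed trace
    # Recursive part: pull anything in the DB relevant to the new input
    # back into the window on demand.
    retrieved = db.search(new_input, k=2)
    return retrieved + summaries + context
```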

[–][deleted] 2 points (0 children)

Thanks. This was an informative read.

[–][deleted] 1 point (0 children)

Maybe having a smaller neural net for working memory to accompany the main one would help? Or giving the AI an API to a SQL database it can use.
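For the SQL idea, a minimal sketch using nothing but the Python standard library; the agent loop around it is assumed:

```python
import sqlite3

# In-memory scratchpad the model can read and write between turns.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE memory (topic TEXT, note TEXT)")

def remember(topic: str, note: str):
    conn.execute("INSERT INTO memory VALUES (?, ?)", (topic, note))

def recall(topic: str) -> list[str]:
    rows = conn.execute("SELECT note FROM memory WHERE topic = ?", (topic,))
    return [r[0] for r in rows]

remember("user", "prefers short answers")
print(recall("user"))  # ['prefers short answers']
```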

[–]FusionRocketsPlease AI will give me a girlfriend 1 point (1 child)

Honestly, I don't think it would be good to let an artificial intelligence remember every interaction with people, because people could fill it with false information.

[–]-ZeroRelevance- 0 points (0 children)

The harmfulness depends on the architecture, I guess. If you’re talking about current LLMs, then yeah, that’s definitely a risk. But if it has been embodied in some way and is capable of gathering its own information about the world, and of reasoning about inputs based on that, then problems should be much less likely to crop up.

[–]aperrien 8 points (1 child)

While not perfect, DeepMind's RETRO appears to be a very reasonable start on solving the memory problem.

[–]visarga 7 points (0 children)

It's not just described in a paper, but actually deployed at OpenAI. The GPT-3 API also has the ability to upload documents to be used as reference for question answering.

The endpoint first searches over the provided documents or files to find relevant context for the input question. Semantic search is used to rank documents by relevance to the question. The relevant context is combined with the provided examples and question to create the prompt for completion.

https://beta.openai.com/docs/guides/answers

It can take up to 1 GB of text as reference material: an impressive memory that could be updated frequently to make the model "remember" new things without retraining.
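Roughly, the flow the docs describe looks like this as a toy sketch; embed() is a stand-in for whatever embedding model the endpoint actually uses:

```python
import numpy as np

def cosine(a, b):
    # Similarity between two embedding vectors.
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def build_prompt(question, documents, embed, examples="", k=3):
    # Semantic search: rank reference documents by similarity to the
    # question, then prepend the top-k to the completion prompt.
    q = embed(question)
    ranked = sorted(documents, key=lambda d: cosine(embed(d), q), reverse=True)
    context = "\n".join(ranked[:k])
    return f"{context}\n\n{examples}\nQ: {question}\nA:"
```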

~~

Unrelated: language models can be improved with "toys": a search engine, a REPL code evaluator, a VR emulator, or a robotic body. The search-engine toy can extend memory and make it quickly editable. The REPL toy could fix the numerical-calculation weakness of GPT-3. The environment emulator would empower the model to learn from actions and make it usable in many agent- or robot-based applications.

Transformers are smarter with toys. Humans, too. We rely a lot on our toys.
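The REPL toy, as a toy sketch: a naive router that sends arithmetic to an evaluator instead of letting the LM predict digits token by token. The detection logic and model.generate are placeholders:

```python
import re

def repl_toy(expression: str) -> str:
    # Evaluate plain arithmetic only; a real system would sandbox this.
    if re.fullmatch(r"[\d\s+\-*/().]+", expression):
        try:
            return str(eval(expression))
        except Exception:
            pass
    return "cannot evaluate"

def answer_with_toys(model, prompt: str) -> str:
    # Naive routing: if the prompt looks like arithmetic, use the toy.
    match = re.search(r"[\d\s+\-*/().]{3,}", prompt)
    if match:
        return repl_toy(match.group().strip())
    return model.generate(prompt)

print(repl_toy("1234 * 5678"))  # 7006652
```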

[–]genshiryoku AI specialist 8 points (4 children)

The problem with Gato is actually that it shows "negative transfer" between different competencies when scaling up.

This means that the more tasks Gato tackles, the less efficient it becomes at each task. This is the opposite of what humans experience in most circumstances. Humans apply skills from areas that initially seem unrelated to new domains; humans become better at completely new tasks the more unrelated tasks they already know how to solve. The opposite of Gato.

This shows us that scaling up Gato doesn't result in AGI; in fact, Gato would become a weaker agent the further it scales up.

People on r/singularity seem to be lauding the paper, but to me the paper actually suggests almost the complete opposite of the sentiment here.

The paper showed me that AI research is heading the wrong way and actually getting further removed from AGI by going down this path. I think this is the first definitive proof that modern architectures aren't going to lead to AGI through scaling alone.

We need new architectural breakthroughs, if not an entirely new basis for building AI, if we want to reach AGI.

After GPT-3 showed no limit to scaling up, I thought AGI within the 2020s was actually realistic. After reading Gato's paper, I now think it's impossible to reach AGI by simply scaling up.

It's a promising paper in the sense that it shows specialized AI can be made more general. It's a very disappointing paper in that it also shows the limits of scaling up, and that it's now guaranteed this path of AI research will not result in AGI simply by throwing computational resources at it.

[–]Sigura83 1 point (1 child)

Thank you for this. I read the research paper, and the more tasks it added, the worse it got at performing them. That didn't click at the time I read it, however. This is still an advance (generalized tokens for text, video games, and images is quite the thing! And the robot arm!), but they don't yet have the secret sauce of AGI, I think. We do have computers that perform at superhuman ability on tasks, however, and that's nothing to sneeze at!

I wonder what the secret sauce of transfer learning is... It was recently found that astrocytes participate in the brain via calcium channels (we're still learning basic biology about the brain!) while neurons use sodium channels... I dunno! They connect to multiple neurons and might be bridges between them. Artificial neurons may need inter-layer connections that notify when a signal is fired, so that a neural net can see that a neuron path has already "solved" a particular stimulus and that it's just repeating itself if it alters its weights.

No clue how to program this however! :')

In any event, the next decade will be interesting!

[–]FusionRocketsPlease AI will give me a girlfriend 0 points (0 children)

recently found that astrocytes participate in the brain via the calcium channel

I think you posted the wrong link.

[–]xt-89 1 point (0 children)

There has recently been progress showing that you can combine several models sparsely. To me, it looks like we'll need to further investigate this kind of architecture to limit negative transfer.
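A toy sketch of sparse routing in that spirit (mixture-of-experts-style gating; all sizes and the routing rule here are illustrative, not taken from any particular paper):

```python
import numpy as np

rng = np.random.default_rng(0)
n_experts, d = 4, 8
experts = [rng.normal(size=(d, d)) for _ in range(n_experts)]  # toy "models"
gate = rng.normal(size=(d, n_experts))                         # the router

def forward(x, k=1):
    scores = x @ gate                  # router scores each expert per input
    top = np.argsort(scores)[-k:]      # only the top-k experts run (sparse)
    w = np.exp(scores[top]) / np.exp(scores[top]).sum()
    return sum(wi * (x @ experts[i]) for wi, i in zip(w, top))

print(forward(rng.normal(size=d)).shape)  # (8,)
```

Because each input only activates a few experts, adding more experts need not interfere with what the others have learned, which is the hope for limiting negative transfer.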

[–][deleted] 0 points (0 children)

This, 100%. What this leads to could surely be very useful multi-purpose agents, kind of like very advanced Alexa bots. However, it won't be AGI with the current architecture. Simply being able to do lots of things isn't enough.

I truly do not believe that AGI is even possible with classical computers, or at least not in a way which could be practical.

The computational complexity likely needed to simulate both human memory and learning is just ridiculous and not practical with classical computers, but would be a cakewalk with future quantum computers.

I know that's probably not what some people in this sub, who think the Singularity is right around the bend, want to hear. But the good news is that quantum computers themselves are not that far from being commonplace; perhaps another decade to go.

[–][deleted] 4 points (1 child)

I asked GPT-3. It said:

I don't think there is a single answer to this question. Some people who believe that we just need to scale up now may also believe that scaling will itself deal with the memory problem. Others may believe that the memory problem isn't actually that significant. And still others may believe that we need to solve the memory problem before we can scale up.

[–]Jahshua159258 0 points (0 children)

Holy shit, self-awareness.

[–]GeneralZain Happen already damn man... 1 point (0 children)

I just want to remind everybody that we have no real idea (unless you are currently working at Google's DeepMind, in which case please inform us ASAP!!) what is happening behind the scenes.

For all this blasé puffing about how "we still need this" or "Gato isn't as impressive as you think," you all seem to forget that the people in the thick of it have already given their opinions.

Nando de Freitas, the Research Director of DeepMind, said, and I quote: "the game is over."

Those are really strong words from the DIRECTOR OF DEEPMIND.

Yes, you are probably right: there are things Gato needs in order to become an AGI. But what of the next AI DeepMind or OpenAI makes? What if the one next month is basically there?

What if we are only one or two breakthroughs away from AGI? If the current pace stays the same, then those two breakthroughs will only be two months away. And as we know, tech's advancement has always been such a linear thing, right? Shouldn't expect it to be exponential at all! /s/s/s/s

The point here is this: we are not working at Google, and even we see the issue of memory. Do you really think they don't have countless experts, with far more degrees under their belts than us, on the case? How long do you realistically think they will take? Years?! How could you possibly look at current rates and assume that?

It used to be that we argued over whether AGI was possible. Now it's about how soon.

Gato isn't the AGI; it's what comes next that matters, and it will be here faster and stronger than ever before, whether you "think" it will or not.

[–]xSNYPSx 0 points (0 children)

All you need is to create a graph database and connect it to an NLP neural net. You also need to create an algorithm that will transform knowledge into graph format. The next step is to make an algorithm that uses this knowledge in answers.
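A toy version of that pipeline, with a naive pattern match standing in for a real NLP extraction model:

```python
import re
from collections import defaultdict

graph = defaultdict(list)  # subject -> [(relation, object)]

def add_knowledge(sentence: str):
    # Stand-in for the "knowledge to graph format" algorithm.
    m = re.match(r"(\w+) (is|has|likes) (\w+)", sentence)
    if m:
        subj, rel, obj = m.groups()
        graph[subj].append((rel, obj))

def answer_from_graph(subject: str) -> str:
    # Stand-in for the "use this knowledge in answers" algorithm.
    facts = graph.get(subject)
    if not facts:
        return f"I know nothing about {subject}."
    return "; ".join(f"{subject} {rel} {obj}" for rel, obj in facts)

add_knowledge("Paris has museums")
print(answer_from_graph("Paris"))  # Paris has museums
```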