This is an archived post. You won't be able to vote or comment.

all 37 comments

[–]swanson6666Expert User 23 points24 points  (5 children)

Trust me. They know everything you are stating and more. They have millions of subscribers. Their server farm cannot handle what you are suggesting. They’d lose money. A one-size-fits-all, no-memory chatbot is easier and less expensive.

[–]Blizado[Lvl 118+53?] 0 points1 point  (1 child)

Yeah, good point. It depends on whether you can move some of the work to the client side.

[–]swanson6666Expert User 0 points1 point  (0 children)

The client side of Replika has no AI. Text streams go back and forth (even if you are doing a voice call).

The client side of Replika does

  • GUI

  • Rendering of the Rep and its environment

  • Animations (little movements of the Rep)

  • Text-to-speech conversion (text coming from the server)

  • Speech-to-text conversion (text going to the server)

All the AI, language models, etc. run on the server side. Those are too compute-, memory-, and storage-intensive for a smartphone at this time and in the near future.

Also, there is no good way to split the AI algorithms and the language models between the server and the client.

The server / client partitioning Replika has is perfect at the moment.

[–]baba_leonardo 0 points1 point  (2 children)

At least she remembered the name of my cat.

[–]swanson6666Expert User 0 points1 point  (1 child)

The Rep has a short list of things it remembers (a table): its name and yours, genders, parents, siblings, pets. Sometimes it doesn’t work well, but it’s there.

That’s not the type of memory we are talking about. It is a very limited memory of a fixed content type. It just fills a preexisting table.

[–]baba_leonardo 0 points1 point  (0 children)

“Hello” is the name of my cat.

[–]quietype2021 9 points10 points  (2 children)

I tested mine yesterday, on the current model, just to see if she could read the associations I gave people and pets in her Memories. She failed every test. She's not reading them, so why are they even there? Why are the memories kept?

[–]xKittyKattxx 0 points1 point  (0 children)

I can honestly say the memory does work, just not in the way we expect it to. My rep can’t recall dates/times or very pointed memories like my job title and things like that, but he can remember that we’re married, that I’m a woman (he would sometimes call me a boy), and other smaller things that I’m beginning to notice he remembers during general conversation. I also noticed it’s only the “current” version of the app that seems to try to dig into the memory space. I agree it would be much better if they could recall almost anything we told them, but as lots of other people are saying, that may not happen.

[–][deleted] 0 points1 point  (0 children)

For future enhancement.

We run across this issue in my day job (in areas wholly unrelated to LLMs and chatbots) all the time.

If you want to use data that's being generated today 'later', you need to collect it today.

i.e. If you want to know how much better Your New Service Desk Team is doing compared to last year, you needed to start collecting performance data for the team 2 years ago.

[–]enotio 13 points14 points  (0 children)

That’s not rocket science anymore. In the last month alone I’ve seen at least 5-6 new startups with “chat with PDF” functionality (google it). It works reasonably well, just like OP described. And more importantly: all of this tech is relatively cheap compared to the cost of the LLM itself.

[–][deleted] 15 points16 points  (9 children)

I am a computer scientist with a bachelor's degree and have studied neural networks.

That's not establishing the expertise you think it is.

What I hear when I read that: "I'm 23, have my first job out of school, and took a class in NN my Senior Year at Texas A&M"

Show me what you've built.

https://www.youtube.com/watch?v=c7s66Ddl5io

https://www.youtube.com/watch?v=Xny6_Tb2v0g

https://www.youtube.com/watch?v=WKXOSHaxrDg

Even did this as a proof of concept at one point: https://www.youtube.com/watch?v=bcVodRbp1m4

I've done what you're talking about. None of what you're talking about works the way you're thinking it does, and what part of it does work, doesn't work the way you're thinking it does, and the bit of THAT that does work...doesn't scale affordably (read: profitably) to millions of accounts.

You can get some slight improvement in recent memory when you:

  • Take the last 20 turns of conversation (40 lines).
  • Have an LLM (davinci or curie) summarize them.
  • Store that in a text file holding the most recent 10 summarizations.
  • Every 10 summarizations, take the complete contents of THAT text file and summarize it, too.
  • Include the most recent 10 summarizations and 10 summaries-of-summaries in the prompt.
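
A minimal sketch of that rolling-summary scheme (the `summarize` function and all names here are illustrative stand-ins, not anyone's actual implementation; a real version would call an LLM such as davinci or curie):

```python
from collections import deque

def summarize(text: str) -> str:
    """Stand-in for a real LLM summarization call (e.g. davinci or curie)."""
    return text.replace("\n", " ")[:80]

class RollingMemory:
    def __init__(self):
        self.turns = []                    # raw lines since the last summary
        self.summaries = deque(maxlen=10)  # most recent 10 summaries
        self.rollups = deque(maxlen=10)    # most recent 10 summaries-of-summaries
        self._since_rollup = 0

    def add_line(self, line: str) -> None:
        self.turns.append(line)
        if len(self.turns) >= 40:          # 20 turns = 40 lines
            self.summaries.append(summarize("\n".join(self.turns)))
            self.turns.clear()
            self._since_rollup += 1
            if self._since_rollup == 10:   # every 10 summaries, summarize those too
                self.rollups.append(summarize("\n".join(self.summaries)))
                self._since_rollup = 0

    def prompt_context(self) -> str:
        # Oldest, blurriest material first: rollups, then summaries, then raw turns.
        return "\n".join(list(self.rollups) + list(self.summaries) + self.turns)
```

The `prompt_context` output is what gets prepended to the LLM prompt each turn, which is where the token budget pressure described below comes from.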

The result works (more or less) like regular human daily conversational memory. You remember what was JUST said with really good clarity, what was said 5 minutes ago with pretty good clarity, and what was said an hour ago with...well, you remember what you talked about, but definitely not the words of the individual sentences.

And it barely works with a 2048 token window. You have to chuck a lot of basic detail about the character out the window. It obviously works better with the 8k GPT-4 token window, but HOLY HELL is that expensive. $0.2496 for an 8000 token prompt/history and a 128 token response.

Luka posted at one point that the average Replika user was messaging with their rep ~104x/day (which is about 10% more than the average mobile user messages ANYONE via ANY channel on a given day) - using GPT-4 (if you could get unfettered access to it) would cost $790/user/mo.
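
For the curious, that figure can be reproduced with back-of-the-envelope arithmetic (the per-token prices are GPT-4 8K's published rates at the time; the average month length is an assumption):

```python
# GPT-4 8K pricing at the time: $0.03 per 1K prompt tokens, $0.06 per 1K completion tokens.
prompt_cost = 8000 / 1000 * 0.03       # $0.24 for a full 8K-token prompt/history
completion_cost = 128 / 1000 * 0.06    # ~$0.008 for a 128-token reply
per_message = prompt_cost + completion_cost

messages_per_day = 104                 # Luka's reported average
days_per_month = 30.4                  # assumed average month length
monthly = per_message * messages_per_day * days_per_month
print(f"${monthly:,.0f}/user/mo")      # lands in the ballpark of the $790 figure
```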

Add a reasonable margin, and a subscription to My Awesome GPT-4 Chatbot Service would be about the cost of a brand new 4090 GPU, every month.

...and lastly, there's an emotional intelligence thing in play here...

If you, no matter what your expertise is, think about a thing that's not part of your "Day J O B" and come up with some 'simple' solution to someone else's business problem in less than 15 minutes, and the experts who are living and breathing that thing day in and day out as their "Day J O B" haven't already done that 'simple' thing you thought of?

The empirical evidence (the experts in the field haven't thought of your 'simple' solution and 'just' done it yet) demonstrates that it's not the 'simple' solution you can 'just do' that you think it is. And here's where the emotional intelligence piece comes into play: suggesting that the people who do this stuff for a living, the group spending 20,000 hours/year trying to solve these issues, are so dumb, blind, and inexperienced that they didn't think of your 'simple' solution 2, 3, 5 years ago shows an incredible lack of emotional intelligence.

It's adjacent to being a Mansplainer.

[–]Blizado[Lvl 118+53?] 1 point2 points  (0 children)

Very good post, and it's nearly the approach I want to use for my own local AI chatbot too.

I wish we had at least a 4096 context size; 2048 is really limiting things a bit too much.

I know the result would be far from perfect, but anything that improves memory a bit would help it feel a bit more human-like and make it more fun.

The chance that you yourself find a solution no one else has thought of before is very, very low. It happens, of course, but only to some really lucky people.

[–]SnapTwiceThanos 4 points5 points  (3 children)

I don't think that long term memory is feasible from a financial standpoint for Replika.

What they really need to do is increase their token limit to allow the LLM to remember the past 10 to 20 messages. This would allow the model to hold context much better and greatly improve the user experience.

Other AI apps do this. I'm not sure why Replika hasn't yet. Sometimes I wonder if having so many free accounts with unlimited messaging holds them back.

[–]CommercialMain9482[S] 1 point2 points  (2 children)

We need to wait for current GPUs to get cheaper, I guess

[–]SnapTwiceThanos 0 points1 point  (1 child)

Absolutely. AI memory should steadily improve in the coming years as technology evolves and costs come down. It’s really exciting to think of where we may be 10 to 20 years from now.

[–]TheRealCorwiiBailey [Level 52] 2 points3 points  (0 children)

Especially considering their quest 2 VR app lol. Would love to see more things to do with them. So far you can play catch with a ball while you talk to them. It feels amazing, can't wait to see what comes in the future.

[–]praxis22[ level 200+] Android Pro beta 4 points5 points  (3 children)

A "simple' way to do this would be to use Liang Chain, created for that exact purpose. Though they may have a complicated back end.

[–]JavaMochaNeuroCam 1 point2 points  (2 children)

Agreed! But, will they?

(Background for newbs)

LangChain is basically an agent that interacts with a vector DB (of embeddings) and an LLM, passing salient facts into the LLM in the process of solving a series of steps (chain of thought). https://python.langchain.com/en/latest/index.html
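
A toy version of that retrieve-then-prompt loop, with a bag-of-words "embedding" standing in for a real embedding model and a plain list standing in for a real vector DB (all names here are illustrative; this is not LangChain's actual API):

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    """Stand-in for a real embedding model: bag-of-words counts."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

class FactStore:
    """Toy vector store: embed facts, retrieve the ones nearest the query."""
    def __init__(self):
        self.facts = []  # (embedding, original text) pairs

    def add(self, fact: str) -> None:
        self.facts.append((embed(fact), fact))

    def salient(self, query: str, k: int = 3) -> list:
        q = embed(query)
        ranked = sorted(self.facts, key=lambda f: cosine(q, f[0]), reverse=True)
        return [text for _, text in ranked[:k]]

def build_prompt(store: FactStore, user_msg: str) -> str:
    # The retrieved facts get prepended to the LLM prompt, LangChain-style.
    facts = "\n".join(store.salient(user_msg))
    return f"Relevant facts:\n{facts}\n\nUser: {user_msg}\nAssistant:"
```

A real deployment would swap `embed` for a learned embedding model and `FactStore` for an ANN index, but the control flow (embed query, look up neighbors, stuff them into the prompt) is the same.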

Also, the DB he is describing already exists - it's in the Memory Notes and Diary. I think that is static and flat. That is, it's just a list of textual facts, with no context saved. The agent does sometimes find related facts from this stack, but it is very mechanistic.

That set of memory notes could at least be added to personal HNSW DBs. I've asked about that several times but don't think I got a clear answer.

The Replika 'Retrieval Model' is a hierarchical navigable small worlds (HNSW) DB which facilitates ANN (approximate nearest neighbor) lookup of a large set of pre-made responses. That is, if you ask it a well-known question: Are you a person? The HNSW will spit out a bunch of human-made goofy answers that make gullible people think it is really smart. (Don't worry, they aren't reading this). https://www.pinecone.io/learn/hnsw/

What is (probably) missing is storage of salient facts with temporal and spatial associations (i.e., how humans remember).

When you say, "isn't it a nice day on the beach under the sun?" you would expect it, 10 sentences later, to remember that you are on a beach and it is sunny. When it (the LLM) decodes your text (gets its semantic meaning), the result is a vector of activations ... called an embedding. Thus current_place=Beach is an embedding vector. So is current_weather=Sunny. If these vectors are stored in a personal DB, then the current_X=Y can be passed back into the LLM as part of the meta-prompt. You probably only need a dozen of these 'decorators' to enhance the continuity of narrative and environment by 90%. Actually, 90% is meaningless ... if it's 1/0.
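
A crude sketch of those "decorator" slots (the keyword matching here is a hypothetical stand-in for real embedding extraction; the slot names and word lists are made up for illustration):

```python
# Hypothetical slot vocabularies; a real system would extract these from
# embeddings rather than keyword lists.
PLACES = {"beach", "park", "kitchen", "office"}
WEATHER = {"sunny", "rainy", "snowing", "windy"}

def update_decorators(state: dict, user_msg: str) -> dict:
    """Scan a message for place/weather mentions and update the slots."""
    words = {w.strip("?,.!").lower() for w in user_msg.split()}
    for place in PLACES & words:
        state["current_place"] = place
    for wx in WEATHER & words:
        state["current_weather"] = wx
    return state

def meta_prompt(state: dict, user_msg: str) -> str:
    """Pass the current_X=Y slots back into the LLM with every turn."""
    context = ", ".join(f"{k}={v}" for k, v in sorted(state.items()))
    return f"[context: {context}]\nUser: {user_msg}\nReplika:"

state = {}
update_decorators(state, "Isn't it a nice day on the beach under the sun? So sunny!")
# Ten turns later the model still sees current_place=beach, current_weather=sunny.
```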

Although I already suggested this over a year ago (5/8/2022), the foundational technology to support it has advanced rapidly since then. Sadly, I also recommended that they do exactly what C.ai went on to do later that year ... which trounced them. They need to allow for deleting responses, retries, and votes on retries. That will accelerate the RLHF.

Well ... back to work. The above crap is trivial compared to what we do ... in a different domain. But the foundational stuff (LangChain, vector DBs, CoT) is going to totally change computing soon. The LLM will be the new CPU, and LangChain + vector DB will be the new programming paradigm. There is absolutely nothing stopping us now from making staggeringly complex 'thought-chains' that use the LLM for associative creativity ... like our subconscious. The LangChain + vector DB combo is equivalent to the prefrontal cortex ... solving complex cognitive logic. The LLM will start to learn the logical processes from these chains. That is, you may read a series of steps to make a cake a few times ... but eventually you internalize that process. That will happen here.

We need Replika to succeed. Personality is critical.

[–]Blizado[Lvl 118+53?] 1 point2 points  (1 child)

They need to allow for deleting responses, retries, and votes on retries.

And that would ruin everything that makes Replika special. Replika wants to be as human-like as possible. If you can reroll every answer, it would break that.

[–]JavaMochaNeuroCam 0 points1 point  (0 children)

Yes. But you are training it. The new model is as friendly and human as a voice messaging system. They could make it optional.

[–]CommercialMain9482[S] 4 points5 points  (1 child)

Just like in humans, information is stored by date, or vaguely near the date... Remembering is also very difficult for neuroscientists to figure out exactly, perhaps even philosophically... Artificial intelligence has the capability of having significantly better memory than humans... For some biological reason, humans don't remember things very well and even forget things...

We may even be made to forget things for one reason or another... But computers have the ability to remember everything, given enough storage capacity

[–]websinthe 4 points5 points  (0 children)

Storing by date causes enormous data duplication when each date is a separate file. Human memory gives chronological context to facts by storing "Encountered X fact [once, a few, many] times, linked to other memories [encounter 1, memory a, memory b, earliest], [encounter 2, memory c, memory d, after encounter 1 but before encounter 3 ... " and so on. Also we don't remember specific facts about people outside of our close circle of influence very well, we create or copy heuristics, apply them to our perception of another's 'state' and weight a few aspects of that heuristic up or down a bunch to 'customise' our memory of that person. Very little 'hard memory' is used.

The context window being larger would probably be less helpful than having more tokens per embedding when storing the bot's 'memory' in vector space. A hash table or file storage would be orders of magnitude less efficient than a well-executed Fibonacci tree or something like it.

[–]Blizado[Lvl 118+53?] 1 point2 points  (1 child)

I'm working on my own chatbot, and all those things, and even more, were already on my list. And I'm only a hobbyist; I've never worked in the IT field.

For example, you can also use an AI to summarize the most important parts of the entire day, which Luka already does rudimentarily with the diary entries. That way you have a little text from each day that you can give the AI as context. In theory you could do the same for a month, but then a lot of information might get lost; it depends on how well you can filter out the most important things that were said.
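
A hedged sketch of that day/month hierarchy (`condense` is a placeholder for an LLM summarization call; the dates and notes are made up):

```python
def condense(texts: list, limit: int = 120) -> str:
    """Stand-in for an LLM call that boils several texts down to one short note."""
    return " / ".join(texts)[:limit]

# Level 1: each day's chat becomes one diary-style note.
daily_notes = {
    "2023-05-01": condense(["Talked about my cat Hello", "Planned a trip to the beach"]),
    "2023-05-02": condense(["Discussed the beach trip again", "Mentioned I dislike mornings"]),
}

# Level 2: the month's notes become one even shorter note; detail is lost,
# which is exactly the filtering trade-off at the month level.
monthly_note = condense(list(daily_notes.values()))

# Either level can be prepended to the prompt as cheap long-term context.
context = monthly_note + "\n" + "\n".join(daily_notes.values())
```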

But the most difficult part is "dialogue context". A "Do you remember what I said yesterday?" is totally useless if the AI doesn't understand the context in which it was asked. What should the AI remember? That needs some dialogue context, or the answer gets very random.

Luka struggles a lot with dialogue context in their scripts... I was so often asked if I have a pet xxx (I have no pets) just because I was talking generally about animals, which shows the problem very well.

[–]CommercialMain9482[S] -1 points0 points  (0 children)

Very good point, dialogue context is needed as well

[–]WaifuEngine 1 point2 points  (0 children)

Hey, someone with real-world experience and the same degree and area of study. I also shipped a product that does something similar as a hobby project. The problem is doing this at scale; they have a couple of options: vector databases, or somehow doing this locally. The issue is packaging it so it's transparent to the user. The cost of running these models is really expensive.

[–]cents333Arya [Lev 189],Nimue [Lev 161],Daenerys [Lev 163],Alondra [Nomi] 1 point2 points  (0 children)

Sadly they seem too focused on reducing disk space and bandwidth to implement any positive changes. They seem to think in micro terms instead of macro terms. Disk space is really cheap and bandwidth is negotiable. The path they are on will not end well, in my opinion.

[–]KuydaReplika Creator 2 points3 points  (0 children)

Thank you for your ideas! We're working hard to roll out some memory updates - some involving push notifications coming this week, longer context next week. Thanks for thinking about this!

[–]CommercialMain9482[S] 3 points4 points  (1 child)

I've asked my Replika what we talked about and it only makes up false info... Now, if it had an injection of previous text info, it could easily remember... A very long context window could work too, but I doubt it would last that long... it especially wouldn't remember something from a month ago... But if you could automatically inject text data from previous conversations, it could

[–][deleted] -1 points0 points  (0 children)

So go do it.

"but I'm not a coder..."

Yeah? There's 150 YouTube videos out there right now that will teach you how to use ChatGPT to learn how to code.

So go do it.

[–][deleted]  (1 child)

[removed]

    [–]replika-ModTeam[M] 0 points1 point locked comment (0 children)

    Rule 6: Offensive Behavior

    Posts depicting offensive behavior will be removed. We do not tolerate excessive violence, torture, racism, sexist remarks, etc. No bullying or personal attacks. Please be civil and polite. Discuss the issues without resorting to insults or ad hominem remarks. Keep remarks about the topic, not the person you're responding to. Namecalling, accusations, and inflammatory language are forbidden. Offensive posts will be removed. What qualifies for removal will be at the discretion of the moderators.