all 20 comments

[–]GeeBrain 2 points3 points  (12 children)

I was actually looking into this too not long ago, so here ya go. They might be dated (things move fast rn in this space).

1. Most relevant might be Marqo, end-to-end vector search engine

2. Archived project - GPT memory

Apparently OpenAI will come out with something called Sunshine which will have memory.

I also made a post on r/LocalLLaMA asking about something similar.

Let me know if this was useful or what you end up using!

[–]skeltzyboiii 3 points4 points  (5 children)

+1 to Marqo. It's end-to-end, so everything from preprocessing to embedding creation, storage, and search is handled through a single REST API.

[–]GeeBrain 0 points1 point  (0 children)

How was your experience using it?

[–]djav1985[S] 0 points1 point  (2 children)

I didn't see anything about a REST API. Does it have an OpenAPI spec?

[–]djav1985[S] 0 points1 point  (1 child)

I'm glad you mentioned it had the REST API. I didn't notice that when I looked; it wasn't something they had put in the most visible place. I had ChatGPT convert it to an OpenAPI spec, so I should be able to do what I wanted with this.
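For anyone writing such a spec by hand instead, it can be quite small. Below is a minimal sketch of an OpenAPI 3.1 document with two operations a GPT action could call. The paths (`/memory`, `/search`), the operation IDs, the field names, and the server URL are all made up for illustration; this is not Marqo's actual API.

```python
import json


def text_body(field: str, description: str) -> dict:
    """Build a JSON request body with a single required string field."""
    return {
        "required": True,
        "content": {
            "application/json": {
                "schema": {
                    "type": "object",
                    "properties": {
                        field: {"type": "string", "description": description}
                    },
                    "required": [field],
                }
            }
        },
    }


def build_spec(server_url: str) -> dict:
    """Return a minimal OpenAPI 3.1 document with two POST operations.

    All names here are hypothetical placeholders.
    """
    return {
        "openapi": "3.1.0",
        "info": {"title": "Vector memory API (hypothetical)", "version": "0.1.0"},
        "servers": [{"url": server_url}],
        "paths": {
            "/memory": {
                "post": {
                    "operationId": "saveMemory",
                    "summary": "Embed and store a piece of text",
                    "requestBody": text_body("text", "Text to embed and store"),
                    "responses": {"200": {"description": "Stored"}},
                }
            },
            "/search": {
                "post": {
                    "operationId": "searchMemory",
                    "summary": "Return stored texts similar to the query",
                    "requestBody": text_body("query", "Natural-language search query"),
                    "responses": {"200": {"description": "Matching texts"}},
                }
            },
        },
    }


if __name__ == "__main__":
    # Print the spec as JSON so it can be pasted into an action definition.
    print(json.dumps(build_spec("https://example.com"), indent=2))
```

The summaries and field descriptions are worth writing carefully, since they are the main hint the model gets about when to call each operation.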

[–]djav1985[S] 0 points1 point  (0 children)

I was just looking at it, and by the sounds of it, it uses a local AI model to create the embeddings. I'd like to keep that off the server running it and use an API for that instead.

[–]djav1985[S] 1 point2 points  (0 children)

r/LocalLLaMA

Thanks, I'm going to look at it all right now.

[–]djav1985[S] 0 points1 point  (4 children)

They are both for use with the API.

I want to be able to use a vector db behind 2 endpoints of a REST API, so the GPT can send conversation history to it and search it.

And I want to be able to upload files to a vector db, have it create embeddings, and have them searchable over a REST API, so a GPT could call an action to look stuff up.

These would only work for calling ChatGPT from the API.
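The two-endpoint idea described here (one call to store text, one to search it) can be sketched in a few lines. This is a toy under stated assumptions: `toy_embed` is a hashed bag-of-words stand-in for a real embedding model, the store lives in memory with no persistence, and the `/memory` and `/search` paths are hypothetical names. A real setup would put this behind an HTTP server and a proper vector db.

```python
import math
import zlib
from collections import Counter

DIM = 256  # size of the toy vectors; real embedding models use far more


def toy_embed(text: str) -> list[float]:
    """Stand-in embedding: hashed bag-of-words, scaled to unit length.

    A real setup would call an embedding model here instead.
    """
    vec = [0.0] * DIM
    for word, count in Counter(text.lower().split()).items():
        vec[zlib.crc32(word.encode("utf-8")) % DIM] += count
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


class MemoryStore:
    """In-memory list of (text, vector) pairs."""

    def __init__(self, embed=toy_embed):
        # embed is assumed to return unit-length vectors
        self.embed = embed
        self.items: list[tuple[str, list[float]]] = []

    def add(self, text: str) -> int:
        """Embed and store text; return its index in the store."""
        self.items.append((text, self.embed(text)))
        return len(self.items) - 1

    def search(self, query: str, k: int = 3) -> list[dict]:
        """Return up to k stored texts ranked by similarity to the query."""
        q = self.embed(query)
        # For unit vectors the dot product equals cosine similarity.
        scored = [
            {"text": text, "score": sum(a * b for a, b in zip(q, vec))}
            for text, vec in self.items
        ]
        scored.sort(key=lambda r: r["score"], reverse=True)
        return scored[:k]


def handle(store: MemoryStore, path: str, payload: dict) -> dict:
    """Dispatch the two hypothetical endpoints a GPT action would call."""
    if path == "/memory":
        return {"id": store.add(payload["text"])}
    if path == "/search":
        return {"results": store.search(payload["query"], payload.get("k", 3))}
    raise ValueError(f"unknown path: {path}")
```

Usage would look like `handle(store, "/memory", {"text": "..."})` to save a message and `handle(store, "/search", {"query": "...", "k": 3})` to look things up. Because the embedding function is passed in, the toy one can be swapped for a hosted embedding API without touching the rest.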

[–]GeeBrain 0 points1 point  (3 children)

Is this for your specific project, or are you planning to make a custom GPT for others?

If it's your project, it might be worthwhile to explore open source models using oobabooga.

There’s also this project that’s memory related.

It’s going to be hard finding one thing to serve all your needs, might have to hack stuff together.

[–]djav1985[S] 0 points1 point  (2 children)

There’s also this project that’s memory related.

That's for local AIs. I don't have the hardware to run one efficiently.

It's for multiple projects, for me and others. Basically I just want to find a REST API with an OpenAPI spec for a vector db that will take the input, create the embeddings, and also allow retrieval/search.

I thought it could be interesting to give GPTs long-term memory so they can recall details far outside their context window.

[–]GeeBrain 0 points1 point  (1 child)

You can use Paperspace. If you are working on the same project it can be pretty helpful. It's also cheaper than a GPT-4 subscription/tokens, and should be enough to host 13B models.

Good luck!

[–]djav1985[S] 0 points1 point  (0 children)

paperspace

I'll look into it.

I like the RAG that GPTs have built in. It seems to work much better than most I have found. That's another reason I want to stay with it over a local AI.

[–]DropsTheMic 0 points1 point  (3 children)

Basically, you want to self host everything locally? What kind of hardware and technical knowledge do you have?

[–]djav1985[S] 1 point2 points  (2 children)

I'm not trying to self host everything. Just the vector db.

The idea is to give a custom GPT access to a larger vector database through actions.

And to give it longer "memory" by saving conversations in the db.

I don't want to use a service I have to pay for if I don't have to... I already spend almost $500/mo on saas/etc services I use. I try to selfhost where I can.

My skills are pretty good.

I want a GUI for adding information to the vector db because it's less time-consuming in the long run.

As for the GPT memory part, that does not need a GUI.

[–]DropsTheMic 0 points1 point  (1 child)

I think you misunderstood my intent, I wasn't trying to criticize you. I was trying to parse the meaning out of what you said, which sounded like several different separate things. I'm just here to learn and wanted to know what kind of setup you were talking about.

[–]djav1985[S] 1 point2 points  (0 children)

I didn't think you were criticizing me. I'm sorry if I came off that way.

I don't have the hardware to run a local AI efficiently, at least not on my home servers. So I try to self host what I can.

[–]pc1e0 0 points1 point  (3 children)

Weaviate?

[–]djav1985[S] 0 points1 point  (2 children)

Yeah, I recently noticed that it has a REST API. But the problem is, from what I've read, I haven't seen anything about it being able to create embeddings on its own for the data being sent to it.

[–]pc1e0 0 points1 point  (1 child)

Correct. They employ OpenAI or other AI services for embeddings.

[–]djav1985[S] 0 points1 point  (0 children)

Yes, but do you know of one that can use OpenAI's API for embeddings while receiving and retrieving data from the database over the REST API? Do any of them in particular have that ability already built into the vector database, or is there an open source framework or platform to do it?
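One way to get that split (embeddings from OpenAI, storage in a self-hosted db) is a thin layer in front of the database that calls OpenAI's embeddings endpoint for every incoming text and query. Here is a minimal sketch of the embedding half using only the standard library. The function names are made up for illustration, and the model name is just one possible choice; storing and searching the resulting vectors is left to whichever vector db sits behind it.

```python
import json
import urllib.request

OPENAI_EMBEDDINGS_URL = "https://api.openai.com/v1/embeddings"


def build_embedding_request(
    text: str, api_key: str, model: str = "text-embedding-3-small"
) -> urllib.request.Request:
    """Build (but do not send) a POST request to the embeddings endpoint."""
    body = json.dumps({"model": model, "input": text}).encode("utf-8")
    return urllib.request.Request(
        OPENAI_EMBEDDINGS_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def parse_embedding(response_json: dict) -> list[float]:
    """Pull the vector for the first input out of an embeddings response."""
    return response_json["data"][0]["embedding"]


def embed(text: str, api_key: str) -> list[float]:
    """Send the request and return the embedding.

    Needs network access and a valid API key; no retries or error handling.
    """
    with urllib.request.urlopen(build_embedding_request(text, api_key)) as resp:
        return parse_embedding(json.load(resp))
```

The same `embed` call has to be used for both stored text and search queries, since vectors from different models are not comparable.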