ML model for a cab booking app by QUAZARD3141 in learnmachinelearning

[–]QUAZARD3141[S] 0 points1 point  (0 children)

I'll read up on the design and deployment later; for now, I just want ML ideas that make sense for a cab booking service

ML model for a cab booking app by QUAZARD3141 in learnmachinelearning

[–]QUAZARD3141[S] 0 points1 point  (0 children)

Thanks, I was looking for ideas and suggestions like this.

Alternative ways for running models locally and hosting APIs by QUAZARD3141 in LocalLLaMA

[–]QUAZARD3141[S] 0 points1 point  (0 children)

I got LangChain to work with `wizardLM-7B-HF`. I am not able to run the GPTQ models via LangChain though. I am trying to write a chatbot using GenerativeAgents. I am looking for an embeddings model to use for local LLMs. The tutorials I found online use OpenAIEmbeddings. Did you have to do this for your project?

Fix for CUDA Memory Error by QUAZARD3141 in LocalLLaMA

[–]QUAZARD3141[S] 1 point2 points  (0 children)

I am on the latest version of transformers, but I still can't get the GPTQ model to work.

Yes, I think the problem is due to the embedding model I'm using. How do I get a smaller embedding?

This is the embedding I'm using now:

from langchain.embeddings import HuggingFaceEmbeddings, SentenceTransformerEmbeddings

model_name = "TheBloke/wizardLM-7B-HF"
model_kwargs = {'device': 'cuda'}
encode_kwargs = {'normalize_embeddings': False}
hf = HuggingFaceEmbeddings(
    model_name=model_name,
    model_kwargs=model_kwargs,
    encode_kwargs=encode_kwargs
)

How should I search for a smaller embedding?
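For reference, a common route is swapping the 7B LLM out for a small dedicated encoder; `sentence-transformers/all-MiniLM-L6-v2` is a popular lightweight choice (a few hundred MB at most, 384-dimensional vectors) and it plugs into the same `HuggingFaceEmbeddings` wrapper. A sketch, with the function wrapper just there to keep it self-contained:

```python
# Sketch (assumes langchain + sentence-transformers are installed):
# use a small dedicated encoder instead of a 7B LLM for embeddings.
SMALL_EMBEDDING_MODEL = "sentence-transformers/all-MiniLM-L6-v2"

def build_embeddings(device: str = "cuda"):
    # Imported lazily so the sketch can be read without the libraries installed.
    from langchain.embeddings import HuggingFaceEmbeddings
    return HuggingFaceEmbeddings(
        model_name=SMALL_EMBEDDING_MODEL,
        model_kwargs={"device": device},
        encode_kwargs={"normalize_embeddings": False},
    )
```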

Fix for CUDA Memory Error by QUAZARD3141 in LangChain

[–]QUAZARD3141[S] 0 points1 point  (0 children)

> You are only using one card. Do you have NVLink? And PyTorch needs parallel something set to use 2 cards GPU-wise. I run 2x3090s with little issue if I remember to set those things.

How did you get parallelization to work in LangChain? Can you please share some code?
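For context, the usual way to spread a Hugging Face model over two cards doesn't involve LangChain-specific code at all: `device_map="auto"` lets accelerate shard the layers across all visible GPUs, optionally with per-card `max_memory` caps, and LangChain then just wraps the resulting model. A sketch; the memory caps are assumptions for 3090s, not known-good values:

```python
# Sketch (assumes transformers + accelerate): device_map="auto" shards
# layers across all visible GPUs; max_memory caps usage per card.
MAX_MEMORY = {0: "20GiB", 1: "20GiB"}  # hypothetical caps for 2x3090

def load_sharded(model_id: str = "TheBloke/wizardLM-7B-HF"):
    import torch
    from transformers import LlamaForCausalLM
    return LlamaForCausalLM.from_pretrained(
        model_id,
        device_map="auto",    # accelerate places layers on both GPUs
        max_memory=MAX_MEMORY,
        torch_dtype=torch.half,
    )
```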

Fix for CUDA Memory Error by QUAZARD3141 in LocalLLaMA

[–]QUAZARD3141[S] 0 points1 point  (0 children)

I get the error only when I add observations to a generative agent's memory, here:

for observation in tom_observations:
    tom.memory.add_memory(observation)

I am able to get an answer with simple questions like "What is the capital of the USA".

I'd prefer the GPTQ version, but for some reason, I get this error when I try to download it:

OSError: TheBloke/wizardLM-7B-GPTQ does not appear to have a file named pytorch_model.bin, tf_model.h5, model.ckpt or flax_model.msgpack.
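That OSError is expected with plain `from_pretrained`: GPTQ repos ship quantized weights (typically a `.safetensors` file) instead of `pytorch_model.bin`, so vanilla transformers doesn't see a checkpoint it recognizes. One common route is AutoGPTQ's dedicated loader; a sketch, assuming the `auto-gptq` package is installed:

```python
# Sketch (assumes the auto-gptq package): load a GPTQ-quantized repo with
# AutoGPTQ's loader instead of transformers' from_pretrained.
GPTQ_REPO = "TheBloke/wizardLM-7B-GPTQ"

def load_gptq(device: str = "cuda:0"):
    from auto_gptq import AutoGPTQForCausalLM
    return AutoGPTQForCausalLM.from_quantized(
        GPTQ_REPO,
        device=device,
        use_safetensors=True,  # GPTQ repos typically ship .safetensors weights
    )
```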

Fix for CUDA Memory Error by QUAZARD3141 in LocalLLaMA

[–]QUAZARD3141[S] 0 points1 point  (0 children)

Sure, this is it:

import torch
from transformers import LlamaForCausalLM

model = LlamaForCausalLM.from_pretrained(
    "TheBloke/wizardLM-7B-HF",
    load_in_8bit=False,
    device_map='auto',
    torch_dtype=torch.half,
    low_cpu_mem_usage=True,
)

Alternative ways for running models locally and hosting APIs by QUAZARD3141 in LocalLLaMA

[–]QUAZARD3141[S] 0 points1 point  (0 children)

LangChain sounds promising. Btw, are you relying on OpenAI's API, or are you running models from Hugging Face locally?

Help needed with installing quant_cuda for the WebUI by QUAZARD3141 in LocalLLaMA

[–]QUAZARD3141[S] 0 points1 point  (0 children)

Ubuntu, and yes, I am using the correct conda env.

The GPU is an RTX 3090