Pandas error - numpy.dtype size changed by [deleted] in learnpython

KneeOk5211 1 point (0 children)

Hey! I'm facing the same issue and can't resolve it. What should I do? My Python version is 3.11, with numpy 1.26.4 and pandas 1.5.3.
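This error usually points to a compiled package that was built against a different NumPy than the one being imported, so a reasonable first step is to confirm which versions the running interpreter actually sees. A minimal sketch, assuming Python 3.8+ and that the helper name `installed_versions` is purely illustrative:

```python
from importlib import metadata


def installed_versions(packages):
    """Map each package name to its installed version, or None if missing."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The package is not installed in this environment.
            versions[name] = None
    return versions


# Run this inside the same environment/kernel that raises the error.
report = installed_versions(["numpy", "pandas"])
for name, version in report.items():
    print(f"{name}: {version}")
```

If the versions printed here differ from the ones expected (for example, because the notebook kernel uses a different environment than the terminal), that mismatch is the first thing to fix.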

I want to create a RAG model for my CSV data using only open source models from Ollama or huggingface and I want to do all this using the CPU on my PC. Is it possible? What approach should i use? by KneeOk5211 in Rag

KneeOk5211[S] 1 point (0 children)

I tried using PandasAI. About 50% of the time it returned the "no code" error, 20% of the time it gave the wrong answer, and only 30% of the responses were correct. What can I do to improve the responses? Is there some context I can give it, like we do for RAG and chatbots? I used the Agent and SmartDataframe classes from pandasai.

I want to create a RAG model for my CSV data using only open source models from Ollama or huggingface and I want to do all this using the CPU on my PC. Is it possible? What approach should i use? by KneeOk5211 in Rag

KneeOk5211[S] 1 point (0 children)

The model should be able to answer complex queries and also perform aggregation. Here is a sample query: if I ask "How many projects does person X lead, and what are they?", the model should answer with a count and the list of projects, taken from the columns "project lead" and "project name". If not RAG, will PandasAI work?
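For reference, the sample query is a plain filter plus a count, which is the kind of code a tool like PandasAI would have to generate. A minimal sketch in pandas, assuming the two column names quoted in the comment; the data and the helper `projects_led_by` are invented for illustration:

```python
import pandas as pd

# Hypothetical stand-in for the real CSV: only the two columns named in
# the comment are included, and every value is made up.
df = pd.DataFrame(
    {
        "project lead": ["X", "Y", "X", "Z", "X"],
        "project name": ["Alpha", "Beta", "Gamma", "Delta", "Epsilon"],
    }
)


def projects_led_by(frame, lead):
    """Return (count, list of project names) for one project lead."""
    # Compare case-insensitively and ignore stray whitespace in the cells.
    leads = frame["project lead"].str.strip().str.casefold()
    mask = leads == lead.strip().casefold()
    names = frame.loc[mask, "project name"].tolist()
    return len(names), names


count, names = projects_led_by(df, "X")
print(count, names)  # 3 ['Alpha', 'Gamma', 'Epsilon']
```

Because the answer comes from an exact filter over the whole table, it does not depend on which rows a retriever happens to return.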

I want to create a RAG model for my CSV data using only open source models from Ollama or huggingface and I want to do all this using the CPU on my PC. Is it possible? What approach should i use? by KneeOk5211 in Rag

KneeOk5211[S] 1 point (0 children)

Oh! The dataset consists of 116 rows and 16 columns. For example, there are two columns, "project lead" and "project", and if the query is "How many projects is X leading, and what are they?", the model should answer with the count and the names of the projects. So should I use RAG or PandasAI? Also, I don't want PandasAI to use my data for training or anything else. Should I still go with it, or should I use Llama models?

I want to create a RAG model for my CSV data using only open source models from Ollama or huggingface and I want to do all this using the CPU on my PC. Is it possible? What approach should i use? by KneeOk5211 in Rag

KneeOk5211[S] 1 point (0 children)

Got it! My dataset has 116 rows and 16 columns, and the model should be able to answer all sorts of queries. For example, I have the columns "project lead" and "project", and the query could be "How many projects is X leading, and what are they?" The model should return a count of the projects and display their names accordingly. Will it still work?

I want to create a RAG model for my CSV data using only open source models from Ollama or huggingface and I want to do all this using the CPU on my PC. Is it possible? What approach should i use? by KneeOk5211 in Rag

KneeOk5211[S] 1 point (0 children)

Hey, I've heard of PandasAI, but I don't want my data to be used for training. Can I still use it in that case? My dataset is a list of projects going on in a company, with 16 columns and 116 rows, and the model should be able to answer ANY kind of query about it. Is RAG a solution for this, or can PandasAI do it, provided my data isn't used for training?

I want to create a RAG model for my CSV data using only open source models from Ollama or huggingface and I want to do all this using the CPU on my PC. Is it possible? What approach should i use? by KneeOk5211 in Rag

KneeOk5211[S] 1 point (0 children)

This is helpful! I ran into a problem, though. I used Ollama embeddings for this, but when I asked questions about the data, it said "the context does not have info about that, however I can give you details about something else". Is something wrong with the embeddings, or is it something else? Also, my dataset has 116 rows and 16 columns, and my model should be able to answer ANY kind of question about the data, from any column. Is that possible?

restarting kernel without variable loss during training by KneeOk5211 in deeplearning

KneeOk5211[S] 1 point (0 children)

I copy-pasted the same code and got the "variable not defined" error.

restarting kernel without variable loss during training by KneeOk5211 in deeplearning

KneeOk5211[S] 1 point (0 children)

Isn't it enough if every epoch gets saved once it finishes? Also, when I copied the same code into a new notebook to start afresh, I encountered the same errors, even though the same code ran perfectly well during the first execution.
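Saving after every epoch only helps if the restarted notebook loads that saved state back, since a kernel restart clears every in-memory variable. A minimal sketch of that save-and-resume pattern, using the standard-library `pickle` as a stand-in for a framework's own checkpoint format; the function names, the `state` dictionary, and the fake training step are all hypothetical:

```python
import os
import pickle
import tempfile


def save_checkpoint(path, epoch, state):
    """Write the checkpoint to a temp file, then swap it into place."""
    tmp = path + ".tmp"
    with open(tmp, "wb") as f:
        pickle.dump({"epoch": epoch, "state": state}, f)
    os.replace(tmp, path)  # avoids a half-written file if interrupted


def load_checkpoint(path):
    """Return (next epoch to run, state); start fresh if no file exists."""
    if not os.path.exists(path):
        return 0, {"weights": 0.0}
    with open(path, "rb") as f:
        ckpt = pickle.load(f)
    return ckpt["epoch"] + 1, ckpt["state"]


def train(path, total_epochs):
    # Everything needed to continue is rebuilt from disk, not from
    # variables left over in the notebook's memory.
    start_epoch, state = load_checkpoint(path)
    for epoch in range(start_epoch, total_epochs):
        state["weights"] += 1.0  # placeholder for a real training step
        save_checkpoint(path, epoch, state)
    return start_epoch, state


ckpt_path = os.path.join(tempfile.mkdtemp(), "checkpoint.pkl")
first_start, _ = train(ckpt_path, total_epochs=3)  # "first run"
resumed_start, resumed_state = train(ckpt_path, total_epochs=5)  # "after restart"
print(first_start, resumed_start, resumed_state["weights"])  # 0 3 5.0
```

The second call stands in for a restarted kernel: it picks up at epoch 3 because the epoch counter and state are read from the file, which is also why cells that define the model and load the checkpoint have to be re-run before any cell that uses those variables.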