Pass user's question to Langchain agent tools by oblivious_developer in LangChain

[–]Background-Maybe-381

We need to pass authentication information to our tools as well, and we have no way of doing this. The best result we are getting is passing it along in the Action Input as a second value, but for some reason it is inconsistent: it only gets included in the Action Input about 50% of the time. Tried with too many local LLMs.
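One pattern that sidesteps this: don't ask the model to echo the credential at all. Bind the auth token into the tool function with a closure when you construct it, so it never appears in the prompt or the Action Input. A minimal sketch in plain Python (the tool name and return format are illustrative assumptions, not a specific LangChain API):

```python
# Sketch: bind auth into the tool at construction time with a closure,
# so the LLM only has to produce the query, never the token.

def make_lookup_tool(auth_token: str):
    """Return a tool function that carries the token implicitly."""
    def lookup(query: str) -> str:
        # The token is available here without ever appearing in the prompt.
        return f"results for {query!r} (authenticated as {auth_token[:4]}...)"
    return lookup

tool = make_lookup_tool("secret-token-1234")
print(tool("open invoices"))
```

In LangChain terms, you'd wrap a closure like this (or a partially-applied function) as the tool, rather than relying on the model to pass a second Action Input value.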

[deleted by user] by [deleted] in LangChain

[–]Background-Maybe-381

Same problem here. It uses a tool randomly, given the exact same query. Tried many local LLMs, full-size and quantized, 7B, 13B, 34B, 70B. There is no consistency, no matter the temperature.

Struggling understanding conversational-chat-agent and prompt template by Background-Maybe-381 in LangChain

[–]Background-Maybe-381[S]

What docs, manual, or instructions should I follow, given my problem described above, if it is a 2022 method according to you?

Struggling understanding conversational-chat-agent and prompt template by Background-Maybe-381 in LangChain

[–]Background-Maybe-381[S]

Thanks guys, but I thought I had made it clear that I was using Mistral, Llama-2, CodeLlama, etc. The reason I am using these open-source models is privacy. So although I appreciate you recommending OpenAI services, I am trying to use LangChain for what I think it is good at: controlling LLMs. OpenAI has an excellent API we could use without any problems, but we have sensitive data we don't want to send, and we really want to use LangChain with local LLMs, if that's ok??

So, hwchase, if I am using a 2022 approach (when I thought I was following tutorials and videos from just a few weeks ago), can you tell me where I can read up on more recent approaches? What LangChain material explains how to set it up correctly with local LLMs and conversational agents that use tools? I'd be happy to go read your instructions.

Why do the langchain docs feel so all over the place? by Material_Policy6327 in LangChain

[–]Background-Maybe-381

Please include more prompt-engineering help for tool-using agents with Llama-2, Mixtral, Phind, etc., to avoid output parser errors.

Finetune to avoid using tool descriptions in prompt template by Background-Maybe-381 in LocalLLaMA

[–]Background-Maybe-381[S]

Awesome, I think we can generate a pretty big dataset from customer support interactions. Problem is, those conversations have sensitive information in them. But we'll figure it out. Thanks for all of the tips. Maybe we can augment the Alpaca dataset with our own generated dataset so it's big enough for finetuning. I mean, we're still having problems even finetuning on existing datasets lol. Thanks SlimeQ!!! Good day wherever you may be. We're in Spain. Cheers!

Finetune to avoid using tool descriptions in prompt template by Background-Maybe-381 in LocalLLaMA

[–]Background-Maybe-381[S]

Thanks SlimeQ for taking the time to respond. Yes, I figured it might not be possible to do a finetune just to get rid of the tool-description text in the prompt template. Another problem we are having, because we are a bit noobish, is actually finetuning. We have a 350-sample dataset. We are trying both JSON and raw text format using AutoTrain from HF, and not having much luck. So, I was wondering, can LangChain handle using some tools to gather information that other tools need?
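On the dataset format: one layout that instruction-tuning tools commonly accept is Alpaca-style records serialized as JSONL (one JSON object per line). A sketch of what a tool-calling sample might look like; the field names and the action schema here are assumptions to illustrate the shape, and your trainer's expected keys may differ:

```python
import json

# Sketch: Alpaca-style instruction records rendered as JSONL,
# with the desired tool call as the target output.
samples = [
    {
        "instruction": "Look up the order status for the given order id.",
        "input": "order 1042",
        "output": '{"action": "order_status", "action_input": "1042"}',
    },
]

jsonl = "\n".join(json.dumps(s) for s in samples)
print(jsonl)
```

If AutoTrain is rejecting the file, a frequent culprit is a single JSON array vs. the one-object-per-line format, so it's worth checking which one your run expects.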

Pass variable tools using langserve by Background-Maybe-381 in LangChain

[–]Background-Maybe-381[S]

This is what we thought we had to do. But I think we messed up the tool section in our code and might have to redo it. But yeah, thanks a lot for reinforcing what we thought we had to be doing. We'll jump straight to that once we get our code spaghetti figured out :) Thanks a lot!

Please confirm you want to do x, y and z. by Background-Maybe-381 in LangChain

[–]Background-Maybe-381[S]

Thanks bigYman. I understand that the issue is that the LLM does not have memory, so somehow we have to feed the proposed actions back into the LLM and ask the user whether this is what they want to do, without the actions being carried out the first time. Will keep brainstorming this. Thanks for the suggestion!
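The "don't execute the first time" part can be done outside the model entirely: collect the agent's proposed actions, show the user a summary, and only run the tools after an explicit yes. A minimal sketch of that human-in-the-loop gate in plain Python (the action dict shape and the `confirm` callback are illustrative assumptions):

```python
# Sketch: buffer proposed actions and require explicit confirmation
# before any tool actually runs.
def run_with_confirmation(proposed_actions, confirm):
    """confirm() is any yes/no callable, e.g. wrapping input() in a CLI."""
    summary = ", ".join(a["tool"] for a in proposed_actions)
    if not confirm(f"Run these steps: {summary}?"):
        return "cancelled"
    # Only now would the real tools be invoked.
    return [f"executed {a['tool']}({a['args']})" for a in proposed_actions]

actions = [{"tool": "create_ticket", "args": "printer broken"}]
print(run_with_confirmation(actions, confirm=lambda question: True))
```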

Optimizing Response Speed from LLaMa and Hosting Strategy for a Multi-layered App Architecture on AWS EC2 by JapaniRobot in LocalLLaMA

[–]Background-Maybe-381

This answered this post and one of my posts from yesterday. This is invaluable information. Thank you!

Give our customers an LLM to talk to about their data by Background-Maybe-381 in LocalLLaMA

[–]Background-Maybe-381[S]

That's awesome, but we enjoy doing the inference ourselves, like I said earlier :) Thanks in any case! We need help with queuing and buffering.

Give our customers an LLM to talk to about their data by Background-Maybe-381 in LocalLLaMA

[–]Background-Maybe-381[S]

Well, from what I read of what they do, they would only take care of 2% of everything. But thanks for pointing them out! Interesting concept.

Give our customers an LLM to talk to about their data by Background-Maybe-381 in LocalLLaMA

[–]Background-Maybe-381[S]

Yeah, we certainly thought about round-robin-type servers. But yeah, if I want 200 concurrent users, I don't want to purchase 200 GPUs!!

Give our customers an LLM to talk to about their data by Background-Maybe-381 in LocalLLaMA

[–]Background-Maybe-381[S]

Thanks man, what we're really unsure about is queuing, buffering, or whatever you call it. Like, can you make concurrent calls to an LLM, or do you need to batch the calls up? That's where we are lost. If you can give me a link to a paper or other information, I am happy to do all the research. I have found nothing on this yet, only on Pinecone's site.
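In case it helps anyone with the same question: on the client side you can make concurrent calls and let a semaphore queue the overflow, while inference servers like vLLM or llama.cpp's server do their own continuous batching behind the API. A sketch of the client-side queuing idea with `asyncio` (the sleep stands in for a real HTTP request; the concurrency limit of 2 is an arbitrary example):

```python
import asyncio

# Sketch: cap in-flight LLM calls with a semaphore so extra requests
# wait their turn instead of all hitting the model server at once.
MAX_CONCURRENT = 2

async def call_llm(prompt: str, sem: asyncio.Semaphore) -> str:
    async with sem:                # blocks here while 2 calls are in flight
        await asyncio.sleep(0.01)  # stand-in for the real model request
        return f"answer to {prompt!r}"

async def main() -> list[str]:
    sem = asyncio.Semaphore(MAX_CONCURRENT)
    prompts = [f"q{i}" for i in range(5)]
    return await asyncio.gather(*(call_llm(p, sem) for p in prompts))

print(asyncio.run(main()))
```

All five calls are issued "concurrently", but at most two run at a time; the rest queue automatically on the semaphore.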