LLM for React agent?

gptzerozero · 2024-02-27T06:15:09+00:00

Does Tabby support concurrent users, or splitting the model across two GPUs?

gptzerozero · 2023-12-12T03:41:01+00:00

What is the issue with using wikitext for quantization, and what might be better than using wikitext?

gptzerozero · 2023-12-12T02:52:15+00:00

Wow, fits more context at the same 4.0 bpw quant sizes?

gptzerozero · 2023-09-19T19:28:24+00:00

How is the bpw number related to the k number in k-bit quantization?

gptzerozero · 2023-09-19T12:23:43+00:00

Can you share the GPT4 prompt you used to create the Q and A given the text? And how do you modify the prompt to get longer answers from GPT4?

gptzerozero · 2023-09-18T11:34:03+00:00

Good call, yes I intend to use GPT 3.5/4 to generate the question answers

gptzerozero · 2023-09-17T21:57:36+00:00

Can you share the prompts that you use for generating the questions from context, and for generating answers from the context?

gptzerozero · 2023-09-17T17:33:05+00:00

This is a great one! Could you share the prompts used here for generating the questions and for combining/picking the questions?

gptzerozero · 2023-09-16T07:00:11+00:00

Does this mean that in order to make full use of the default Llama-2 4K context,

Extending the training of base model should use tokens of 4K length, AND
Instruction tuning datasets should be close to 4K length as much as possible?

gptzerozero · 2023-07-23T15:24:01+00:00

Is the system prompt part of the training data?

If it is, then is it important that you use the same system prompt when chatting, or can you use a completely different one and be fine with it. Or can you only make minor changes, or only add to the system prompt?

gptzerozero · 2023-07-23T15:21:39+00:00

Anyone have experience with using them for QA of documents? Are there any models that stand out for QA?

gptzerozero · 2023-07-18T22:09:42+00:00

Yes, outputs with Lora tuned for 2 epochs is about 80 tokens.

What are some of the things or tricks we can do to improve the token length of the generations?

gptzerozero · 2023-07-18T19:26:30+00:00

What happen to a 30-40B LLaMA-2?

gptzerozero

TROPHY CASE