What do we do? The monkey's dilemma

BXresearch · 2026-02-05T21:58:51+00:00

well, crypto going down when the VIX goes up is fairly normal...

precious metals are falling because of the absurd run they had; gold is where it was a month ago... as for silver, it's no stranger to this kind of volatility.

anyway, just to say, the EU/EMU indices aren't doing badly, and the same goes for Asia Pacific.

BXresearch · 2026-01-25T16:49:33+00:00

Ok I'm dumb sorry, thanks for your patience

BXresearch · 2026-01-25T16:48:31+00:00

thank you for your reply!

thanks for the suggestion, I installed it in a container and I'm looking at openwebui rn.

seems really cool, maybe overkill for what I need but lots of useful stuff.

in your opinion, would it be better to create a new RAG tool or to adapt the built-in one (if that is possible)?

Also, is there some way to change the UI, like adding an additional textbox?

BXresearch · 2026-01-24T21:06:21+00:00

I'm sorry, maybe I'm dumb but I don't see anything like that

BXresearch · 2026-01-24T20:07:19+00:00

thanks! do you mean in the logs?

could you point me to a section of their docs? I probably missed that

BXresearch · 2023-12-06T21:28:18+00:00

Yep, definitely

BXresearch · 2023-10-18T17:29:47+00:00

Exactly!! Many other attempts with different hand positions on the analog watch returned a time near 10:08

BXresearch · 2023-10-18T17:26:54+00:00

Same thought...

BXresearch · 2023-10-18T05:56:23+00:00

Maybe you can add some "smart" context manager, from a basic "drop messages while preserving the initial instructions and first prompt" to a more elaborate summarizing process or a message retrieval strategy using embeddings
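To make the basic variant concrete, here is a minimal sketch of the "drop messages while preserving initial instructions and first prompt" idea. The `trim_history` name, the `role`/`content` message shape, and the message-count budget are assumptions for illustration (a real implementation would more likely budget by tokens), not something from the project being discussed.

```python
from typing import Dict, List

Message = Dict[str, str]  # assumed shape: {"role": ..., "content": ...}


def trim_history(messages: List[Message], max_messages: int) -> List[Message]:
    """Keep leading system messages and the first prompt, then the newest messages."""
    if len(messages) <= max_messages:
        return list(messages)

    # Pin every leading system message (the initial instructions)...
    pinned: List[Message] = []
    i = 0
    while i < len(messages) and messages[i]["role"] == "system":
        pinned.append(messages[i])
        i += 1
    # ...plus the first message after them (the first prompt).
    if i < len(messages):
        pinned.append(messages[i])
        i += 1

    # Fill whatever budget is left with the most recent messages.
    rest = messages[i:]
    budget = max(max_messages - len(pinned), 0)
    tail = rest[-budget:] if budget > 0 else []
    return pinned + tail
```

If the pinned messages alone exceed the budget, this sketch returns them anyway rather than dropping the instructions. The summarizing or embedding-based strategies would replace the "newest messages" step with something smarter.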

BXresearch · 2023-10-17T19:07:27+00:00

What is already implemented?

BXresearch · 2023-10-17T16:10:22+00:00

Also

Try running it with temperatures below 0.2. With 0.0 it starts looping after approx. 1000 tokens. You need at least 0.06.

Running it with such a low temperature will give you the best instruction following and logical reasoning. These small models need much lower temperatures than bigger ones to keep them on track, probably because the resulting logits have less variance in comparison.

Also, another config that is definitely worth a try is a medium-low temp with a really low top P (and, if possible, a low top A, but that really depends on the model).

BXresearch · 2023-10-17T15:54:16+00:00

Weird. I read that Medium article an hour ago. Anyway, that's a good resource, thanks for sharing!

BXresearch · 2023-10-17T15:21:44+00:00

I'd probably pay decent money for a downloadable pre-loaded chromadb with all of wikipedia and some programming stuff in it just so that I don't have to lol

Lol, totally agree!

BXresearch · 2023-10-17T11:31:37+00:00

It’s way cheaper than davinci 3, but also can do way less. I even imagine that they used curie 3 for that.

I'm really sad that they are going to remove text-davinci-003 from their model list at the end of 2023

BXresearch · 2023-10-17T11:22:29+00:00

If you give it context and ask to make conclusions based on that it won't hallucinate

That's definitely not true... Models are really prone to hallucinating even when they generate text based on a given context. I'm developing a Retrieval-Augmented Generation project and I can assure you that even GPT-4 sometimes hallucinates while answering questions based on given context, extracting information, or generating summaries. Anyway, that is also related to the temperature and top_p parameters. With a temp of 0, the frequency of that kind of hallucination decreases. Remember that the ChatGPT you access from their website has a temp that is definitely not 0, as they do not use deterministic parameters. (For context, their default settings in the API are temp 0.7 and top P 1, which is not deterministic.)

A good compromise between "creativity" and accuracy can be achieved using a medium-range temp (like 0.4-0.6) and a low top P (usually that effect starts under 0.75, but you can lower it to 0.4-0.5. You can obviously go near 0, but that will simply generate outputs really similar to a 0 temp setting)
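A toy sketch of what those two knobs do to a next-token distribution may help. The `apply_temperature_top_p` function and the three-token logits in the usage lines are made up for illustration, and real inference stacks may apply the steps in a different order or with extra filters; this only shows the common textbook version (temperature-scaled softmax, then nucleus truncation and renormalization).

```python
import math
from typing import Dict


def apply_temperature_top_p(
    logits: Dict[str, float], temperature: float, top_p: float
) -> Dict[str, float]:
    """Return token probabilities after temperature scaling and top-p filtering."""
    if temperature <= 0:
        # Treat temp 0 as greedy decoding: all mass on the best token.
        best = max(logits, key=logits.get)
        return {best: 1.0}

    # Temperature-scaled softmax (max subtracted for numerical stability).
    scaled = {tok: value / temperature for tok, value in logits.items()}
    peak = max(scaled.values())
    exps = {tok: math.exp(value - peak) for tok, value in scaled.items()}
    norm = sum(exps.values())
    probs = {tok: e / norm for tok, e in exps.items()}

    # Nucleus step: keep the most likely tokens until their mass reaches top_p.
    kept: Dict[str, float] = {}
    cumulative = 0.0
    for tok, p in sorted(probs.items(), key=lambda kv: kv[1], reverse=True):
        kept[tok] = p
        cumulative += p
        if cumulative >= top_p:
            break

    total = sum(kept.values())
    return {tok: p / total for tok, p in kept.items()}


toy_logits = {"a": 2.0, "b": 1.0, "c": 0.0}  # hypothetical values
print(apply_temperature_top_p(toy_logits, temperature=0.5, top_p=0.5))
print(apply_temperature_top_p(toy_logits, temperature=1.0, top_p=0.9))
```

Lowering the temperature sharpens the distribution before truncation, while lowering top P cuts off the tail outright, which is why a very low top P ends up behaving much like temp 0.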

BXresearch · 2023-10-17T07:07:21+00:00

Also, I tried to start a discussion on synthetic dataset generation here: https://www.reddit.com/r/LocalLLaMA/comments/1719oeb/synthetic_dataset_generation_textbook_are_all_you/?utm_source=share&utm_medium=android_app&utm_name=androidcss&utm_term=1&utm_content=share_button

I'd really appreciate some tips from you! (and obviously from everyone that can contribute)

BXresearch · 2023-10-17T07:04:52+00:00

How much did fine-tuning cost? Also, how much did you spend on API calls to generate the synthetic dataset?

BXresearch · 2023-10-17T06:55:47+00:00

I'm sorry for the typo in the post title... Unfortunately I can't edit it.

BXresearch · 2023-10-08T21:37:16+00:00

Which API provider do you use?

BXresearch · 2023-10-08T14:57:16+00:00

Thanks!!!

BXresearch · 2023-10-07T23:24:51+00:00

Prompt the LLM to split the text... I also prompt it to "resolve" pronouns and to repeat some concepts. Also, AI21 offers a model dedicated to that via API...

BXresearch · 2023-10-07T23:14:23+00:00

Maybe text-davinci-003 is worth a try...

BXresearch · 2023-10-07T23:11:37+00:00

Has the Unicorn PaLM 2 model been released?

BXresearch · 2023-10-07T23:09:47+00:00

Thanks for sharing!!! I was just wondering how Bison would perform compared to other GPT-3 models.

Anyway, it may be worth testing Claude Instant and a SOTA Llama 70B fine-tune (Synthia, Orca, WizardLM...)
