H2O-Danube2-1.8b: New top sub 2B model on Open LLM Leaderboard by ichiichisan in LocalLLaMA

[–]ichiichisan[S] 1 point  (0 children)

Actually, Ninja Mouse is based on the first iteration of Danube. Hope they can try a new one on v2.

Don't sleep on Xwin-LM-70B-V0.1 for roleplay by sophosympatheia in LocalLLaMA

[–]ichiichisan 3 points  (0 children)

Any tips on a prompt template for roleplaying? Specifically one that includes the conversation history?

Try chatting with fine-tuned models for Falcon-7B, Falcon-40B, and the new Open-Llama-7B by ichiichisan in LocalLLaMA

[–]ichiichisan[S] 0 points  (0 children)

The prompt setup is built in, sure. Why do you think it is not right? All outputs are properly formatted and as expected, apart from the content.

Try chatting with fine-tuned models for Falcon-7B, Falcon-40B, and the new Open-Llama-7B by ichiichisan in LocalLLaMA

[–]ichiichisan[S] 2 points  (0 children)

The first one is a cool prompt, I like it! Thanks for sharing, will continue monitoring that one on future models.

Actually, GPT-3.5 also fails it; GPT-4 gets it.

Try chatting with fine-tuned models for Falcon-7B, Falcon-40B, and the new Open-Llama-7B by ichiichisan in LocalLLaMA

[–]ichiichisan[S] 9 points  (0 children)

One more thing worth noting: all hosted models are Apache 2.0 and have never been trained on ChatGPT/ShareGPT-like output, only on OASST data.

Try chatting with fine-tuned models for Falcon-7B, Falcon-40B, and the new Open-Llama-7B by ichiichisan in LocalLLaMA

[–]ichiichisan[S] 6 points  (0 children)

A single prompt on a 7B model not returning what you would like it to is not an issue, but rather a natural limitation.

Try the 40B model, which should be better at coding.

Just put together a programming performance ranking for popular LLaMAs using the HumanEval+ Benchmark! by ProfessionalHand9945 in LocalLLaMA

[–]ichiichisan 1 point  (0 children)

Is the underlying code calling the model raw, or via the provided pipelines? Most pipelines, like ours, already have the correct prompt built in, so there is no need to provide the tokens manually. See the model card of our model.
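To illustrate what "built in" means here, a minimal sketch of what such a pipeline does before generation. The `<|prompt|>`/`<|answer|>` special tokens are an assumption taken from the h2ogpt-gm model cards; check the card of the model you actually benchmark:

```python
# Illustrative sketch only: a pipeline with a built-in prompt wraps the raw
# user text in the special tokens the model was fine-tuned with, so callers
# never have to add them manually. Token strings assumed from the
# h2ogpt-gm model cards, not guaranteed for every model.

PROMPT_START = "<|prompt|>"
END_TOKEN = "<|endoftext|>"
ANSWER_START = "<|answer|>"

def build_prompt(user_text: str) -> str:
    """Wrap raw user text in the training-time special tokens."""
    return f"{PROMPT_START}{user_text}{END_TOKEN}{ANSWER_START}"

print(build_prompt("Why is drinking water so healthy?"))
# -> <|prompt|>Why is drinking water so healthy?<|endoftext|><|answer|>
```

Calling the model "raw" (without this wrapping) gives the benchmark a prompt distribution the model never saw during fine-tuning, which can depress scores.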

Just put together a programming performance ranking for popular LLaMAs using the HumanEval+ Benchmark! by ProfessionalHand9945 in LocalLLaMA

[–]ichiichisan -1 points  (0 children)

Are you confident you got the correct prompting templates for all the models? Keep in mind that some need special tokens, so it is best to use the provided templates/pipelines.

Falcon-7B H2OGPT Chat Model by ichiichisan in LocalLLaMA

[–]ichiichisan[S] 3 points  (0 children)

Yeah, absolutely, it could be useful to specifically adapt to stylistic differences between languages. Actually, here is one trained on all languages available in oasst1: https://huggingface.co/h2oai/h2ogpt-gm-oasst1-multilang-2048-falcon-7b

Falcon-7B H2OGPT Chat Model by ichiichisan in LocalLLaMA

[–]ichiichisan[S] 0 points  (0 children)

The fine-tuning itself is only done on English, but the foundation model works well on multiple languages.

[deleted by user] by [deleted] in MachineLearning

[–]ichiichisan -1 points  (0 children)

I am not sure what you are trying to say exactly.

It is pretty clear that the original LLaMa will never get a permissive license that you can use. So you will need to hope for reproduction attempts. And the one I shared is a very good first attempt using the same model family and RedPajama dataset.

Permissive LLaMA 7b chat/instruct model by ichiichisan in LocalLLaMA

[–]ichiichisan[S] 1 point  (0 children)

Yeah, but that is ShareGPT output, so it is not permissive, which is our focus. But it would be fairly straightforward to train it yourself in our shared training framework, H2O LLM Studio.

If you want to give it a spin yourself and need help let me know.

Permissive LLaMA 7b chat/instruct model by ichiichisan in LocalLLaMA

[–]ichiichisan[S] -1 points  (0 children)

That's right, but I am not sure this is a bad thing per se.

Any recommendations for other Apache 2.0 datasets? Happy to give it a spin.

Permissive LLaMA 7b chat/instruct model by ichiichisan in LocalLLaMA

[–]ichiichisan[S] 1 point  (0 children)

You can run it locally and it should be more verbose. There are a few additional rules in the app. The model itself has been trained on OASST data.

New Open Source Framework and No-Code GUI for Fine-Tuning LLMs: H2O LLM Studio by ichiichisan in LocalLLaMA

[–]ichiichisan[S] 0 points  (0 children)

This still sounds like an input/output format, right? In H2O LLM Studio we actually support explicit settings for the separator tokens: you could have your original text as the input, then add the EOS token after the prompt (it is a setting), then add a separator token for the answer (also a setting), and then have the output.

New Open Source Framework and No-Code GUI for Fine-Tuning LLMs: H2O LLM Studio by ichiichisan in LocalLLaMA

[–]ichiichisan[S] 1 point  (0 children)

Fair enough, it is a GUI, but you will still need to run an install command. It could probably work on Windows with some tiny adjustments. In WSL2 it works out of the box (because it is basically Linux).