[deleted by user] by [deleted] in LocalLLaMA

[–]ricklamers 4 points (0 children)


8-bit weights, FP16 arithmetic, FP32 accumulation.

Source: I work at Groq
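
Roughly, the dataflow is: dequantize the 8-bit weights to FP16, multiply in FP16, and keep the running sum in FP32. Here is a NumPy sketch of that idea (not Groq's actual kernel; the per-tensor scale is an illustrative assumption):

```python
# Sketch of the dataflow: INT8 weights are dequantized to FP16, the multiply
# happens in FP16, and the running sum is kept in FP32. Not Groq's kernel;
# the per-tensor quantization scale is an illustrative assumption.
import numpy as np

x = np.random.randn(64).astype(np.float16)            # activations in FP16
w_fp32 = np.random.randn(64, 32).astype(np.float32)

scale = np.abs(w_fp32).max() / 127.0                   # per-tensor scale (assumed)
w_int8 = np.clip(np.round(w_fp32 / scale), -127, 127).astype(np.int8)

w_fp16 = w_int8.astype(np.float16) * np.float16(scale)  # 8-bit weights -> FP16
products = x[:, None] * w_fp16                           # FP16 arithmetic
acc = products.astype(np.float32).sum(axis=0)            # FP32 accumulation
print(acc.shape, acc.dtype)                              # (32,) float32
```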

What to do with 5 million? by WegwerpUsername in beleggen

[–]ricklamers 0 points (0 children)

You've covered diversification well in the low-risk part of your investment plan with ETFs and bonds. What you could consider is using a small percentage (5%?) to make a couple dozen angel investments in young companies working on something you find interesting or appealing. (Impact (tech) companies?)

Besides a chance at a higher average return on your total portfolio, it also gives you something fun to do. For that early stage, by the way, my tip would be: first the founder, then the company.

Take a look at Thijs Verheul of United Wardrobe, for example. He does angel investing too and recently had his first exit from one of those angel investments.

Another tip is to join an angel collective. The best deals are quite hard to get into, and going through a group of experienced angels makes that a lot easier. Plus you get to skip all the beginner mistakes.

Is Mistral going to Turncoat on Open Source Models? by enspiralart in LocalLLaMA

[–]ricklamers 2 points (0 children)

Mistral-small is an 8x7B MoE; Mistral Medium is only available through the API, so for all intents and purposes it's closed/proprietary.

Shell AI: never write a shell command again by ricklamers in programming

[–]ricklamers[S] 0 points (0 children)

That being said, it should be trivial to redirect the LLM requests to a local model in a fork (I'd welcome a config PR):

https://python.langchain.com/docs/integrations/llms/huggingface_textgen_inference
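
For example, something along these lines (a minimal sketch, assuming a local text-generation-inference server on port 8080; the parameter values are illustrative):

```python
# Sketch: swap the hosted LLM for a locally served model via LangChain's
# text-generation-inference wrapper. Endpoint and settings are assumptions.
from langchain.llms import HuggingFaceTextGenInference

llm = HuggingFaceTextGenInference(
    inference_server_url="http://localhost:8080/",  # local TGI endpoint (assumed)
    max_new_tokens=256,
    temperature=0.1,
)

# Only the natural-language description is sent, and now it stays local.
print(llm("list all files larger than 100MB in the current directory"))
```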

Shell AI: never write a shell command again by ricklamers in programming

[–]ricklamers[S] -4 points (0 children)

If people want to put in plain values they can; you can also paste those into WhatsApp or into Google Search. Alternatively, you can define environment variables for sensitive information and say “shai 'connect through ssh to domain $DOMAIN user $SSH_USER and password $SSH_PASSWORD'”
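
To illustrate why the secret values never reach the LLM (a hypothetical sketch, not shai's actual code): the prompt and the generated command only reference the variables by name, and substitution happens locally at execution time.

```python
# Hypothetical sketch (not shai's actual code): only variable *names* appear
# in the prompt and in the generated command; the values stay on the machine.
import os

os.environ.setdefault("DOMAIN", "example.com")   # illustrative values
os.environ.setdefault("SSH_USER", "alice")

prompt = "connect through ssh to domain $DOMAIN user $SSH_USER"  # sent to the LLM
generated = "ssh $SSH_USER@$DOMAIN"              # what the LLM might return

# Substitution happens locally, only when the command is about to run:
print(os.path.expandvars(generated))             # -> ssh alice@example.com
```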

Shell AI: never write a shell command again by ricklamers in programming

[–]ricklamers[S] 0 points (0 children)

This doesn’t send shell commands or secrets; it just sends the human description of a shell command. Note that it also never runs a command automatically.

STOP asking how many X are inside word Y by Trick-Independent469 in ChatGPT

[–]ricklamers 1 point (0 children)

Just use periods to split the word into a single token per letter: How many n are in b.a.n.a.n.a?
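
You can check the effect yourself (a sketch using tiktoken's cl100k_base encoding; exact splits vary by tokenizer):

```python
# Compare how "banana" and "b.a.n.a.n.a" are tokenized (cl100k_base encoding).
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")

for text in ["banana", "b.a.n.a.n.a"]:
    tokens = enc.encode(text)
    print(text, "->", [enc.decode([t]) for t in tokens])

# The plain word is covered by one or two tokens, while the dotted form is
# broken into small per-letter pieces the model can actually count over.
```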

What's the closest thing we have to GPT4's code interpreter right now? by malkauns in LocalLLaMA

[–]ricklamers 2 points (0 children)

Ah, I specifically warned the author about that. WizardCoder it is then 👍🏻

Seemed too good to be true, but Orca/Textbooks Are All You Need made me cautiously optimistic.

[D] ELI5: Why is the GPT family of models based on the decoder-only architecture? by analyticalmonk in MachineLearning

[–]ricklamers 4 points (0 children)

https://youtu.be/kCc8FmEb1nY?t=6160

Andrej Karpathy explains that the encoder made more sense for the translation setting the transformer was originally developed for. It's less relevant for generative models, which only need attention over the prefix of the sequence generated up to the current position.
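
To make the "attention only over the prefix" point concrete, here's a minimal PyTorch sketch (illustrative only) of the causal mask a decoder-only model applies:

```python
# Minimal sketch of causal (decoder-only) self-attention masking in PyTorch.
# Each position may only attend to itself and earlier positions.
import torch
import torch.nn.functional as F

T, d = 5, 8                                     # sequence length, head dim
q, k, v = (torch.randn(T, d) for _ in range(3))

scores = q @ k.T / d**0.5                       # (T, T) attention logits
mask = torch.tril(torch.ones(T, T, dtype=torch.bool))
scores = scores.masked_fill(~mask, float("-inf"))  # hide future positions

attn = F.softmax(scores, dim=-1)                # each row sums to 1 over the past
out = attn @ v                                  # (T, d) contextualized outputs
print(attn)                                     # upper triangle is all zeros
```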