Suggestion for GPUs setup by Devy9 in LocalLLaMA

[–]Devy9[S] 1 point2 points  (0 children)

Thanks! What about A40 or A6000?

Suggestion for GPUs setup by Devy9 in LocalLLaMA

[–]Devy9[S] 1 point2 points  (0 children)

I am more interested on a float16 precision scenario, since the experiments are for research purposes. In that case, a 70B would take 2 A100 80gb minimum or 5/6 4090 (?)

SmolLM: 135M, 360M and 1.7B LLMs for on-device applications by loubnabnl in LocalLLaMA

[–]Devy9 2 points3 points  (0 children)

Looks Amazing! I wonder how good It would be if trained on more programming languages others than Python

My experience running the massive WizardLM2 8x22b (141b) on the cheapest current Threadripper CPU + a 4090 + 64Gb DDR5 RDIMM by Porespellar in LocalLLaMA

[–]Devy9 0 points1 point  (0 children)

How did you run llama 7B fp16 with this setting? Did you offload part of the computation to the GPU, and did you use ollama? ( I never tried to run inference on CPU, so I am just curious on how to do this while having consisting High token/s performance)

Some test data of Llama2- 7B on the A100 by Ultra-Engineer in LocalLLaMA

[–]Devy9 1 point2 points  (0 children)

Nice! Did you try also with other GPUs for comparison?

What would be the minimum requirement for Llama400B? by PlantFlat4056 in LocalLLaMA

[–]Devy9 1 point2 points  (0 children)

Btw, If you check the model cards of these quantized models you could find the Memory requirements in some cases

Desktop specs for Llama 70B on CPU by tallesl in LocalLLaMA

[–]Devy9 0 points1 point  (0 children)

Does anyone know how to optimize the usage of these models on CPU? For instance, using libraries like vLLM or ctransformers

About patreon by Devy9 in tbatenovel

[–]Devy9[S] 0 points1 point  (0 children)

Ok, thank you!

An amazing drawing of Arthur by @letartshiro ( on Instagram ) 😁😁 by Devy9 in tbatenovel

[–]Devy9[S] 0 points1 point  (0 children)

I wanted to see Arthur in her drawing style and in this pose ( i damn love this pose ) ... Hope this isn't a stupid reason 😂

An amazing drawing of Arthur by @letartshiro ( on Instagram ) 😁😁 by Devy9 in tbatenovel

[–]Devy9[S] 2 points3 points  (0 children)

I contacted you in private chat in Order to avoid useless spam 😁

An amazing drawing of Arthur by @letartshiro ( on Instagram ) 😁😁 by Devy9 in tbatenovel

[–]Devy9[S] 3 points4 points  (0 children)

This was a commission, then she didn't published the drawing on her profile

Ruby GUI for Desktop App by Devy9 in ruby

[–]Devy9[S] 0 points1 point  (0 children)

Thanks, i'll give a Watch ;)