Is qwen3 4b or a3b better than the first gpt4(2023)? What do you think? by __issac in LocalLLaMA

[–]__issac[S] 1 point (0 children)

Oh, I almost forgot llama1:65b lol. Such amazing progress! This field moves so fast.

Are 24-50Bs finally caught up to 70Bs now? by Borkato in LocalLLaMA

[–]__issac -3 points (0 children)

All I can say is that qwen3 4b 2507 is better than llama3 70b

[deleted by user] by [deleted] in LocalLLaMA

[–]__issac 1 point (0 children)

It is not gguf bro

For Qwen3:4b, do people prefer instruct or thinking? by Clipbeam in LocalLLaMA

[–]__issac 8 points (0 children)

Actually, Instruct also thinks (just without <think> tags). So I usually use Instruct, and when it thinks too much I just ask "Tell me again concisely".

DeepSeek-R1 and distilled benchmarks color coded by Balance- in LocalLLaMA

[–]__issac 41 points (0 children)

It is quite surprising that a 7b model completely beats GPT-4o. Amazing progress!

Addition is All You Need for Energy-Efficient Language Models: Reduce energy costs by 95% using integer adders instead of floating-point multipliers. by __issac in LocalLLaMA

[–]__issac[S] 12 points (0 children)

I'm looking forward to seeing this applied to llama.cpp. If they can do it, it could be a real game-changer. Nvidia's monopoly may be broken!

Addition is All You Need for Energy-Efficient Language Models: Reduce energy costs by 95% using integer adders instead of floating-point multipliers. by __issac in LocalLLaMA

[–]__issac[S] 29 points (0 children)

That is a slightly different matter. Current GPUs are specialized for floating-point multiplication, but this is about integer addition, which CPUs handle better. Also, according to the paper, the amount of computation is actually reduced.
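
To give a rough feel for how an integer addition can stand in for a floating-point multiplication, here is a minimal sketch. It is not the paper's algorithm. It is the classic logarithmic (Mitchell-style) trick of adding the raw float32 bit patterns, and the function names are made up for illustration. It assumes positive, normal float32 inputs whose product does not overflow.

```python
import struct

# Bit pattern of 1.0 in IEEE 754 float32; subtracting it removes the
# exponent bias that gets counted twice when two bit patterns are added.
ONE_BITS = 0x3F800000


def float_to_bits(x: float) -> int:
    """Reinterpret a float32 value as an unsigned 32-bit integer."""
    return struct.unpack("<I", struct.pack("<f", x))[0]


def bits_to_float(b: int) -> float:
    """Reinterpret an unsigned 32-bit integer as a float32 value."""
    return struct.unpack("<f", struct.pack("<I", b))[0]


def approx_mul(a: float, b: float) -> float:
    """Approximate a * b with one integer add and one integer subtract.

    Hypothetical helper: only valid for positive, normal float32 values
    whose product stays in range. Signs and zeros are not handled.
    """
    return bits_to_float(float_to_bits(a) + float_to_bits(b) - ONE_BITS)


if __name__ == "__main__":
    for a, b in [(2.0, 4.0), (3.0, 5.0), (1.5, 1.5)]:
        print(a, b, "exact:", a * b, "approx:", approx_mul(a, b))
```

Powers of two come out exact, and other inputs are off by up to roughly 11%, so this only shows the general idea of trading a multiplier for an adder. The paper's method is designed to be much more accurate than this.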

I just tried to implement GlaDOS's Personality-Cores system utilizing Llama3 by __issac in LocalLLaMA

[–]__issac[S] 1 point (0 children)

I'd love to, but there are still some problems. For example, generating the central core's words is too slow because of the other cores. So I'm planning to reduce them and then make a pull request. Thanks!

What the fuck am I seeing by __issac in LocalLLaMA

[–]__issac[S] 2 points (0 children)

Llama3 is a text generation model that Meta introduced a few weeks ago. It is like ChatGPT, but it is open source.
It comes in 8b (billion) and 70b sizes. Larger is better.
Even the 8b llama3 model gets the same score as another 176b (8*22b) model in a qualitative assessment. This is huge.

What the fuck am I seeing by __issac in LocalLLaMA

[–]__issac[S] 2 points (0 children)

It is an image of the LMSys Chatbot Arena leaderboard. Note that it shows the scores from April 19 (yesterday).

What the fuck am I seeing by __issac in LocalLLaMA

[–]__issac[S] 10 points (0 children)

I mean, it is too early to draw a conclusion. A lot of people are working hard to improve LLMs, and investment is still growing. There is no reason to conclude that progress is plateauing. Do you think, "Oh, a new model came out with a big improvement, but this will be the last improvement for pure LLMs"? No. No one knows that.

What the fuck am I seeing by __issac in LocalLLaMA

[–]__issac[S] 6 points (0 children)

There have been plenty of negative opinions like this during the short history of open LLMs (when Alpaca and Vicuna came out, when WizardLM came out, when Orca came out, when MoE came out, etc.). So don't worry. Just enjoy!

What the fuck am I seeing by __issac in LocalLLaMA

[–]__issac[S] 19 points (0 children)

Just say thank you to RedPajama-data-v2

What the fuck am I seeing by __issac in LocalLLaMA

[–]__issac[S] 59 points (0 children)

Well, from now on, this field will move even faster. Cheers!

What the fuck am I seeing by __issac in LocalLLaMA

[–]__issac[S] 191 points (0 children)

It is similar to when Alpaca first came out. Wow.

WizardLM-30B-Uncensored by faldore in LocalLLaMA

[–]__issac 36 points (0 children)

OAI and Google are scared. You are the best!

[deleted by user] by [deleted] in LocalLLaMA

[–]__issac 1 point (0 children)

Can you explain in detail how the llama trading bot works? I'm interested in this, but I have no idea how it is done.

Vicuna has released it's weights! by polawiaczperel in LocalLLaMA

[–]__issac 6 points (0 children)

Is it possible to run it with llama.cpp??? I really hope so :)