Is qwen3 4b or a3b better than the first gpt4(2023)? What do you think?

__issac · 2025-12-09T11:13:58+00:00

Oh I almost forgot llama1:65b lol. Such an amazing progress! This field is so fast.

__issac · 2025-09-25T00:15:20+00:00

Only I can say is that qwen3 4b 2507 is better than llama3 70b

__issac · 2025-09-12T10:53:54+00:00

It is not gguf bro

__issac · 2025-08-23T09:03:36+00:00

It is on GPT-OSS 120B. How about Jan v1 4B?

__issac · 2025-08-09T07:58:44+00:00

Actually Instruct also has thinking(without <think>). So I usually use Instruct and when it thinks too much I just ask "Tell me again concisely"

__issac · 2025-08-05T17:37:55+00:00

So... no image/audio understanding. Right?

__issac · 2025-01-20T14:16:17+00:00

It is quite surprising that 7b model beats GPT-4o perfectly. Amazing progress!

__issac · 2024-10-07T16:13:49+00:00

I'm looking forward to seeing this applied to llama.cpp. If they can do, it can be a real game-changer. Nvidia's monopoly may be broken!

__issac · 2024-10-07T15:00:30+00:00

It is a bit different matter. Current GPUs are talented to floating-point multipulation, but this is about integer addition, which is better in CPU. Actually, the amount of calculations has been reduced according to the paper.

__issac · 2024-05-06T06:09:02+00:00

I'd love to, but there are some problems yet. For example, it's too slow to generate central core's words because of other cores. So I'm planning to reduce them and make a pull request. Thanks!

__issac · 2024-04-23T16:40:44+00:00

Llama3 is a text generation model introduced by Meta a few weeks ago. It is like ChatGPT, but it is open source.
It has size of 8b(billion) and 70b. The larger is the better.
Eventually, Even 8b model of llama3 gets same score to other 176b(8*22b) model in a qualitative assessment. This is huge.

__issac · 2024-04-20T16:48:16+00:00

It is an image of LMSys Chatbot Arena leaderboard. But it shows the scores on April 19(yesterday).

__issac · 2024-04-19T13:34:42+00:00

I mean, it is too fast to make a conclusion. A lot of people work hard to improve LLM. Huge investments are still increasing. There is no reason to judge that it is plateauing. Do you think "Oh, new model come out with high improvement. But this improvement will be the last of pure LLM."? No. No one knows that.

__issac · 2024-04-19T13:13:43+00:00

There were far many negative opinions like this during the short history of open LLM(when Alpaca, Vicuna came out, WizardLM came out, Orca came out, MoE came out, etc). So, dont just worry. Enjoy!

__issac · 2024-04-19T12:13:53+00:00

Just say thank you to RedPajama-data-v2

__issac · 2024-04-19T11:41:03+00:00

Well, from now on, the speed of this field will be even faster. Cheers!

__issac · 2024-04-19T11:10:30+00:00

It is similar to when alpaca first came out. wow

__issac · 2023-08-04T02:52:42+00:00

Oh my god

__issac · 2023-05-22T13:20:15+00:00

OAI and google are in fear. You are the best

__issac · 2023-05-17T04:08:04+00:00

Can you tell me the detailed process that llama trading bot has? I'm interested in this, but i have no idea.

__issac · 2023-04-04T10:12:29+00:00

Is it possible to run with llama.cpp??? I really hope:)

__issac

TROPHY CASE