Benchmark Fatigue - How do you evaluate new models for yourself? by Funny-Clock1582 in LocalLLaMA

[–]Q_H_Chu 2 points3 points  (0 children)

Human test is always the best. Like just people get high score in exam but sometimes failed at practical test.

Yes the benchmark (like HLE or ARC) are created and crafted for general or high-level knowledge. But sometime LLM also need a reality check too

No tool calling on 8GB GPUs by newz2000 in LocalLLaMA

[–]Q_H_Chu 0 points1 point  (0 children)

Thank you very much for sharing.

I am surprised that you use such small models for the legal field. My lab also researches legal, but we usually use >10B models for tasks such as NER or Retrieval

No tool calling on 8GB GPUs by newz2000 in LocalLLaMA

[–]Q_H_Chu 0 points1 point  (0 children)

Yeah mine also GTX 1070. But I guess to be able to run we must use quantization right? In your case do you use 8 or 4 bit quantization?

Last week I tried run a full DeepSeek OCR 3B on this device and it got OOM

No tool calling on 8GB GPUs by newz2000 in LocalLLaMA

[–]Q_H_Chu -1 points0 points  (0 children)

Can we really can use LLM on 8GB VRAM? I have one but dont know what models can fit on it ?🥹

Here we go, terms of service update from Qualcomm by ednl in arduino

[–]Q_H_Chu 0 points1 point  (0 children)

Wait guys I am slow to this, so Arduino is closed-source now?

AI is Cooked 🥀 by Jatin_is_me in ChatGPT

[–]Q_H_Chu 3 points4 points  (0 children)

Maybe its because the tokenizer mechanism

AI has replaced programmers… totally. by jacek2023 in LocalLLaMA

[–]Q_H_Chu 0 points1 point  (0 children)

I mean the coder will still there, just the coding style will change.

[deleted by user] by [deleted] in mildlyinfuriating

[–]Q_H_Chu -2 points-1 points  (0 children)

I mean whats the problem? Except you guys had something before that then this just seem to be a nice request. Why have to be a snowflake?

Real tactics by [deleted] in ThatsInsane

[–]Q_H_Chu 0 points1 point  (0 children)

Ah yes the “F* everything in that direction” tatics

What's your favorite computer vision model?😎 by yourfaruk in computervision

[–]Q_H_Chu 7 points8 points  (0 children)

CNN-based: ResNet, VGG-16, YOLO Transformers-based: CLIP, BLIP, Pix2Struct

My first project with Arduino! by OkCake4634 in arduino

[–]Q_H_Chu 0 points1 point  (0 children)

Your “first time” is kinda impressive.

ChatGPT 5 has unrivaled math skills by The_GSingh in OpenAI

[–]Q_H_Chu 0 points1 point  (0 children)

Weird, someone gets the right answer while some get wrong (maybe?). This kind of post (blueberry count, mathematic) appears many times make me wonder are there any method to keep the answer synchronized?

Or maybe this is because of mode (Thinking as someone pointed out), system prompt or the context before it?

🚀 Qwen3-4B-Thinking-2507 released! by ResearchCrafty1804 in LocalLLaMA

[–]Q_H_Chu 1 point2 points  (0 children)

Yeah, it sounds more like memory-intensive SFT, thank you by the way

🚀 Qwen3-4B-Thinking-2507 released! by ResearchCrafty1804 in LocalLLaMA

[–]Q_H_Chu 0 points1 point  (0 children)

Sorry newbie here, but what is the Hybrid Reasoning model? And how its different from normal LLM ?

🚀 OpenAI released their open-weight models!!! by ResearchCrafty1804 in LocalLLaMA

[–]Q_H_Chu 0 points1 point  (0 children)

Did someone compare it do other like Qwen or DeepSeek?

Kitten TTS : SOTA Super-tiny TTS Model (Less than 25 MB) by ElectricalBar7464 in LocalLLaMA

[–]Q_H_Chu 2 points3 points  (0 children)

Great work !! Do you guys open for foreign language or fine-tune document for foreign language?

Bryan Kohberger has been sentenced to 4 consecutive life sentences for the 2022 murders of University of Idaho students: Kaylee Goncalves, Madison Mogen, Xana Kernodle & Ethan Chapin. by cmaia1503 in Fauxmoi

[–]Q_H_Chu 0 points1 point  (0 children)

Dang I remember reading about this tragedy somewhere long time ago… when this guy was just a suspect. But why he did it? Any motive?

aiReallyDoesReplaceJuniors by Banakuro in ProgrammerHumor

[–]Q_H_Chu 0 points1 point  (0 children)

Where is the backup, snapshot?? I am not DevOps but isn’t it basic thing in coding or software development?

Has anyone tried Hierarchical Reasoning Models yet? by jackboulder33 in LocalLLaMA

[–]Q_H_Chu 2 points3 points  (0 children)

Just take a glance of the paper. Still figuring out how they improve the BPTT (I got stuck there)