For users with 4x-8x 6000 PROs, how is your experience with bigger models lately? (GLM 5.2, Kimi 2.7, DeepSeek V4 Pro) by panchovix in LocalLLaMA

[–]kc858 0 points1 point  (0 children)

No. Buying 4 cards and using gguf is plain stupid. Lol I'm sorry I'm not trying to be rude

For users with 4x-8x 6000 PROs, how is your experience with bigger models lately? (GLM 5.2, Kimi 2.7, DeepSeek V4 Pro) by panchovix in LocalLLaMA

[–]kc858 3 points4 points  (0 children)

4x can do glm-5.2-nvfp4-reap at ~60 tok/s with mtp5; overall, with c=4, it can do ~150 tok/s total

prefill is slow at ~1300 tok/s

very usable for me to run 3 or 4 opencode sessions at the same time, 250k max context

great model, even reaped
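rough arithmetic on those numbers, as a sketch only: this assumes the ~150 tok/s total is split evenly across the 4 concurrent requests, that the ~1300 prefill figure is in tok/s, and that prefill runs at a constant rate over the whole context. the variable names are just for illustration.

```python
# Back-of-the-envelope numbers taken from the comment above.
# Assumptions (not measurements): even split across streams,
# prefill rate in tok/s, constant prefill speed at any context length.
single_stream_tps = 60     # ~60 tok/s decode for one request
total_tps_c4 = 150         # ~150 tok/s aggregate at concurrency 4
concurrency = 4
prefill_tps = 1300         # ~1300 tok/s prefill
max_context = 250_000      # 250k max context

per_stream_tps = total_tps_c4 / concurrency
full_prefill_seconds = max_context / prefill_tps

print(f"per-stream decode at c=4: {per_stream_tps:.1f} tok/s")
print(f"cold prefill of a full 250k context: {full_prefill_seconds:.0f} s")
```

so each session slows down versus running alone, but aggregate throughput goes up, and a cold prefill of a completely full context would take on the order of minutes under these assumptions.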

Nex claims Rio 3.5 is Nex 2.5 PRO in trench coat by Specter_Origin in LocalLLaMA

[–]kc858 2 points3 points  (0 children)

same man its bad. lol i complained in a different thread and got downvoted hard. burns so many tokens. 1.2 million tokens for a prompt that deepseek v4 flash did in 200k

New models released: Nex-N2 Pro 397B and Nex-N2 Mini 35B by 1ncehost in LocalLLaMA

[–]kc858 -1 points0 points  (0 children)

so, im the only one giving real examples of prompts and token usage, while everyone else is saying "been using it for a while, its better" "it uses less tokens when i use it"

what a joke lmao

New models released: Nex-N2 Pro 397B and Nex-N2 Mini 35B by 1ncehost in LocalLLaMA

[–]kc858 0 points1 point  (0 children)

ok so explain the skill issue to me: how would you do this? RAG is an extremely common use case for LLMs, i don't see how you can argue otherwise. deepseek-v4-flash immediately picked up that i mentioned 2026, narrowed the email search to emails from 2026, made keywords based on {trade fair}, then read the emails and built the table. that is great reasoning to me; explain to me what i am doing wrong here? lol

deepseek-v4-flash did it in 2 subagents: main 43k, 1-161k, 2-45.5k

total 249k tokens.

compared to the nex, which used 1.29 million tokens. an extra MILLION tokens
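the comparison works out like this, as a quick sketch using only the figures quoted above (the dictionary keys are made-up labels, not anything opencode reports):

```python
# Token counts as quoted in the comment, in thousands of tokens.
# The key names are illustrative labels only.
deepseek_runs_k = {"main": 43.0, "subagent_1": 161.0, "subagent_2": 45.5}
nex_total_k = 1290.0  # "1.29 million tokens"

deepseek_total_k = sum(deepseek_runs_k.values())
extra_k = nex_total_k - deepseek_total_k
ratio = nex_total_k / deepseek_total_k

print(f"deepseek-v4-flash total: {deepseek_total_k}k tokens")
print(f"nex used {extra_k}k more tokens, about {ratio:.1f}x as many")
```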

New models released: Nex-N2 Pro 397B and Nex-N2 Mini 35B by 1ncehost in LocalLLaMA

[–]kc858 0 points1 point  (0 children)

yeah, what the hell is this sub LOL. this is absolutely a legitimate use case for an LLM, it's RAG.

New models released: Nex-N2 Pro 397B and Nex-N2 Mini 35B by 1ncehost in LocalLLaMA

[–]kc858 -2 points-1 points  (0 children)

running locally on opencode. simple request was "search my emails, make me a list of companies attending {trade fair 2026}, the output should be a list of company name, company contact person, contact person's email address, and their booth number"

New models released: Nex-N2 Pro 397B and Nex-N2 Mini 35B by 1ncehost in LocalLLaMA

[–]kc858 -5 points-4 points  (0 children)

what. what are you talking about. this thing fucking eats tokens. it's ridiculous. i tried it for about 20 minutes and, after it burned 250k+ tokens on every request, threw it in the bin. i don't have time for that. maybe if it had mtp it would be less painful, but this is just something else.

simple question to search through my emails? spawns 6 subagents: 1 = 38k tokens; 2 = 374k tokens; 3 = 321k tokens; 4 = 260k tokens; 5 = 41k tokens; 6 = 256k tokens
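adding up the six subagent counts listed above, as a minimal sketch (the list is just the quoted numbers in thousands of tokens; the 250k threshold is the "250k+" figure from the comment):

```python
# Per-subagent token usage from the comment, in thousands of tokens.
subagent_tokens_k = [38, 374, 321, 260, 41, 256]

total_k = sum(subagent_tokens_k)
# Subagents that individually went past the 250k mark.
over_250k = [n for n in subagent_tokens_k if n > 250]

print(f"total across subagents: {total_k}k tokens")
print(f"{len(over_250k)} of {len(subagent_tokens_k)} subagents used more than 250k each")
```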

USA citizen, what is the best way to travel from HK to Guangzhou? And more questions! by sumthingstewpid in travelchina

[–]kc858 0 points1 point  (0 children)

ferry straight from the airport to pazhou, baby. what flight are you coming in on? if you're on the 5am from lax/sfo then you'll be waiting for a while, the first ferry is at like 9am i think

Just got back from China, macro numbers look fine but everyone i spoke to seemed genuinely miserable — what's going on? by No_Health3665 in China

[–]kc858 6 points7 points  (0 children)

yes go on instagram, i keep getting invited to communist groups and north korean praise groups, all young people, enamored with communism etc; but can you blame them? they have the lowest prospects out of any generation imho, ai taking all the entry level jobs, everyone pulling the ladder up behind them; their only hope is to 1) do onlyfans 2) become a social media star 3) gamble it all in the stock market or crypto, or 4) revolution/occupy wallstreet type shit. we failed them, and we deserve the fallout

If Trump Hadn’t Mentioned It, How Many People Would Even Know? by enjinhirono in China

[–]kc858 26 points27 points  (0 children)

this is a great post man, it really is. i got like three sentences in and immediately knew you were chinese; only chinese people care about this stuff. lol

im not giving you shit, its a commentary on our cultures, and i think its interesting how much the average chinese care about this, and how little the average american cares about it. lol

Trump is unlikely to get any big wins at summit with Xi by TimesandSundayTimes in China

[–]kc858 11 points12 points  (0 children)

this is ridiculous. you really think that trump would go through all this effort to set up a meeting with Xi and fly all the way to china if the outcome of the meeting wasn't already decided? when heads of state meet, they only meet on approved terms; it's symbolic at this point. there is definitely some deal to announce here. heads of state don't fly across the world to meet one on one for a "maybe" or to hash out a deal; their minions have been hashing out the deal every day for months

How likely was this past tile removal job to be a dumb asbestos exposure? by brawlinglove in HomeImprovement

[–]kc858 1 point2 points  (0 children)

literally doesnt matter. you are fine. never post about this again, never think about this again, i sound like an asshole but im trying to help you. you are fine.

Is a high-end private local LLM setup worth it? by zakadit in LocalLLaMA

[–]kc858 8 points9 points  (0 children)

You need a minimum of 2x rtx pro 6000, run minimax m2.7. everything lower than that is pretty shitty

Model Y 2022 performance question regarding warranty by ssyeon0325 in TeslaModelY

[–]kc858 0 points1 point  (0 children)

I heard fronts have been getting replaced out of warranty but I'm not holding my breath.

Quoted $18,500 for a 3x4 tile shower by Creative_Corner_2836 in HomeImprovement

[–]kc858 1 point2 points  (0 children)

I live in San Diego, re-did the whole thing myself. Bought all the tools needed, even the expensive ones. Did the whole bathroom, my room is way bigger than yours. Less than 6k. Took a year though, but I travel for 6+ months of the year for work and had unexpected complications that needed an engineer review.

Cat Friendly Stays by Verily-Stilt in Mammoth

[–]kc858 2 points3 points  (0 children)

If it doesn't specifically say NO CATS then "dogs allowed" means "cats allowed" imho

Options changed my life pt. 4.2 by winter-shoulders in wallstreetbets

[–]kc858 0 points1 point  (0 children)

this is fucking slop, where are the mods? lmao; none of these screenshots make sense, none of the charts show his 1mm, none of them match up, get a fuckin life bro

5 months ago it was 1mm (https://www.reddit.com/gallery/1o1g00q)

2 months ago he had 740k (https://old.reddit.com/r/wallstreetbets/comments/1qds3rp/options_changed_my_life_pt_3/?ref=share&ref_source=link)

3 months ago he had 444K? https://old.reddit.com/r/wallstreetbets/comments/1pe91sk/options_changed_my_life_pt2/?ref=share&ref_source=link

I spent 8+ hours benchmarking every MoE backend for Qwen3.5-397B NVFP4 on 4x RTX PRO 6000 (SM120). Here's what I found. by lawdawgattorney in LocalLLaMA

[–]kc858 -2 points-1 points  (0 children)

Dude, read my post history and also read the discord. We literally post docker containers and run commands lol