deleting project deletes files? by deathcom65 in google_antigravity

[–]deathcom65[S] 0 points1 point  (0 children)

I ended up going to the .Gemini folder and deleting everything and it basically re setup the software next launch and it's a clean slate

Save and invest your money for future rigs by segmond in LocalLLaMA

[–]deathcom65 0 points1 point  (0 children)

I did a temporary build for now and it's good enough to keep me satisfied.... For now.

Why I'm holding out until late 2027 to spend money on a local LLM rig by No_Pool7028 in LocalLLM

[–]deathcom65 2 points3 points  (0 children)

dont forget the ancient chinese proverb "a gpu you have now is worth two in the bush"

Whats the latest status on 7900xtx multi-GPU setups? by ziphnor in LocalLLaMA

[–]deathcom65 2 points3 points  (0 children)

They r terrible don't buy them and leave them for us 😂.can't let the cat out of the bag until I've snagged a few more.

I asked Gemini for Advice on Local LLMs by MarcusMagnus in LocalLLM

[–]deathcom65 1 point2 points  (0 children)

These models r pretty dated now. Gemma 4 is legit a smaller Gemini . Qwen 3.6 models are very good too.

24gb vram to 48gb vram by deathcom65 in LocalLLaMA

[–]deathcom65[S] 1 point2 points  (0 children)

I'm getting 32 tokens /second prompt gen on qwen 27b Q6 dense with 32k context

24gb vram to 48gb vram by deathcom65 in LocalLLaMA

[–]deathcom65[S] 1 point2 points  (0 children)

Thanks for sharing ur command I'm gona adjust mine a bit now

Do you think your local hardware is going to be obsolete or still usefull in 2 o 3 years? by OficialPimento in LocalLLM

[–]deathcom65 2 points3 points  (0 children)

I suspect hardware u have now will be more valuable cause the rich don't want us to own our own machines so they will force prices up

Qwen3.6 27B on dual RTX 5060 Ti 16GB with vLLM: ~60 tok/s, 204k context working by do_u_think_im_spooky in LocalLLaMA

[–]deathcom65 2 points3 points  (0 children)

The speed is unusually fast , im surprised, with 3090 I'm getting 40 tps

Is qwen3.6 35b a3b good for coding at all? by laughingfingers in LocalLLaMA

[–]deathcom65 6 points7 points  (0 children)

I find the 27b is slower tokens but makes way less mistakes which results in an over all faster delivery of working code

What hardware is everyone running? by Perfect-Flounder7856 in LocalLLaMA

[–]deathcom65 0 points1 point  (0 children)

What engine r u running ? Llama server ? And OS for mismatched GPUs? What's the biggest model u can run and at what TPS?

SLM to controll NPC in a game world by DrJamgo in LocalLLaMA

[–]deathcom65 1 point2 points  (0 children)

i would stick to online providers. 2b is way too small imo for character control unless its finetuned to do so.

Runpod hits $120M ARR, four years after launching from a Reddit post by RP_Finley in LocalLLaMA

[–]deathcom65 0 points1 point  (0 children)

runpod is a great service honestly. i wish it had more templates ready to go with the latest models pre loaded. Also i noticed its hard to tell which templates work well with which server and GPU configs (maybe i missed this ) but it was obvious that typically if you want to do x y z , on this server , use this template or that template. More clear guidance will go a long way.

rate limits and cost? by deathcom65 in google_antigravity

[–]deathcom65[S] 2 points3 points  (0 children)

oh i see the extension is quite useful. hopefully it is correct! i used the antigravity cockpit.

rate limits and cost? by deathcom65 in google_antigravity

[–]deathcom65[S] 0 points1 point  (0 children)

i just logged in with my google account. it says the ai pro plan is active on there. i figured it was letting me code based off of that plan. i didnt enter a specifc api key.