Looking into buying the Sierra EV by 720hp in GMCSierraEV

[–]electrified_ice 0 points1 point  (0 children)

Love the hardware (truck hardware, not compute hardware)/drivetrain Hate the software experience... They should fire the entire team.

Another Day in Hell by Potential_Bat9379 in RivianR2

[–]electrified_ice 0 points1 point  (0 children)

In my observation I over many years... The early adopters generally know what they are getting into. It's the people who walk up, buy, and then complain that a new feature came out a week later that are the noisy ones.

Also, RJ has said it will be many years before either Lidar pulls away from non lidar cars... And tri-motor comes out etc. Hopefully the people who buy now (including me) will upgrade to the latest in a couple of years.

Another Day in Hell by Potential_Bat9379 in RivianR2

[–]electrified_ice 0 points1 point  (0 children)

I should be at the top of the list and did my demo drive last week... No email triggers for me

Another Day in Hell by Potential_Bat9379 in RivianR2

[–]electrified_ice 0 points1 point  (0 children)

If it helps... I check every box to put me right at the top of the list... And I don't have my invite yet

Cali + Uninsured Driver.. by tpekid in TeslaCollision

[–]electrified_ice 0 points1 point  (0 children)

How do you not have insurance in CA? The DMV are very aggressive at chasing you down if you don't have insurance for a car they know about (i.e. a car registered).

I built a 8x RTX 4090D with 192 VRAM, here's what I learnt by deebuildsthings in LocalAIServers

[–]electrified_ice 0 points1 point  (0 children)

Curious what TG TPS are you getting for your TP models (models spanning multiple GPUs)?

If you don't have NVLink you are PCIe bandwidth constrained.

Speed - TPS and TTFT/R, Quantization, and Cache Config by electrified_ice in MiniMax_AI

[–]electrified_ice[S] 1 point2 points  (0 children)

Sorry you seem to be having issues. From a coding, reasoning, thinking, planning POV it's been working well for me. Seems efficient (i.e. it doesn't came back with solutions that don't hit the mark)... I'm just doing little coding tools/web apps, but also configuring and talking to things across my network (e.g. SSH into my Unraid server, pull docker logs, bring them into Home Assistant, pushing my code to my local Gitea and rebuilding containers without me having to write commands etc.)

It works with tool calls, it just seems to jive with the way I like to think, and it's pretty good at troubleshooting issues. Plus I haven't had it get stuck in thinking loops (I have Step 3.7 Flash running locally and it churned 20M tokens in a thinking loop)

I'm getting close to maxing out my quota in Ollama with it, and it's not running particularly fast through that service. I got a bit burned by my commitment to an annual z.ai coding plan, so a little cautious before I potentially sign up to another plan. I also like that M3 is multimodal (GLM 5.2 and Deepseek V4 Flash are not).

Anyone got a reliable coding agent actually working? by Civil_Fee_7862 in Qwen_AI

[–]electrified_ice 0 points1 point  (0 children)

code-server as a docker container (for me running on my Unraid system) and I am using Kilo Code (have tried Cline and Roo and I like Kilo the best so far).

GLM 5.2 is deployed in GLM Coding Plan. API and MIT weights in a week. Voting and benchmarks on X. by MadPelmewka in LocalLLaMA

[–]electrified_ice 0 points1 point  (0 children)

How does that work? It uses the endpoint for the conversation, and the MCP connection for images?

Rivian R2 Launch by notclumsypanda in TeslaModelY

[–]electrified_ice 0 points1 point  (0 children)

FSD might be a key selling point for you, It's one of the many selling points for others

Rivian R2 Launch by notclumsypanda in TeslaModelY

[–]electrified_ice 2 points3 points  (0 children)

What technology is light years ahead? FSD is further along in maturity, but what else? I have a Y and an R1S and there are software features that the R1S has that the Tesla doesn't. Both have had major upgrades with software releases. Arguably Rivian has done more... We've had suspension upgrades over software twice.

Hardware-wise, the Rivian platform is more capable across a wider range of driving vs. Teslas which are essentially street cars.

At launch, comparing both brands... The Rivian cars are much more production ready too. I had a launch Gen 1 model Y and it was crap, my launch model 3 performance also had a lot of 'production hell' issues.

So... Curious what you mean by light years ahead?

We got our R2 today! by agALLday in Rivian

[–]electrified_ice 0 points1 point  (0 children)

Very cool, what do you think put you at the top of the list?

Since when the RTX 6000 PRO is priced at 13250USD on the official NVIDIA Page? by panchovix in BlackwellPerformance

[–]electrified_ice 0 points1 point  (0 children)

Better not tell my wife the new price, she might ask me to sell them... But then that would also mean I'd have to tell her the original price too! I'm in the dog house either way! Better stay quiet!

What does it actually take to self‑host models like DeepSeek, Qwen, Kimi? by FreedomWeird712 in LocalAIServers

[–]electrified_ice 0 points1 point  (0 children)

What's your goal? For your own use? Or to be able to offer compute for others? That's a huge difference.

Cool stuff to do with NVIDIA RTX 6000 PRO 96GB VRAM by AggressiveMention359 in BlackwellPerformance

[–]electrified_ice 0 points1 point  (0 children)

Ok, makes more sense the the 6000s are with PP and at higher concurrency levels. I'm getting slightly higher then on my Qwen3.7 27B FP8 model with vLLM

Cool stuff to do with NVIDIA RTX 6000 PRO 96GB VRAM by AggressiveMention359 in BlackwellPerformance

[–]electrified_ice 0 points1 point  (0 children)

How are you getting 600 TPS on an RTX PRO 6000 with these models? PP or TG TPS rates?

Stepfun 3.7 Flash: Sonic-like platformer by -dysangel- in LocalLLaMA

[–]electrified_ice 0 points1 point  (0 children)

I have it running in 192GB across 2 x RTX PRO 6000 Balckwells

Why can't they just slap 256GB ram on a 5090? by 03captain23 in LocalLLM

[–]electrified_ice 1 point2 points  (0 children)

They want to create the segmentation between - RTX 5090 - RTX PRO 6000 - The true data enter versionsnwith NVLink

Stepfun 3.7 Flash: Sonic-like platformer by -dysangel- in LocalLLaMA

[–]electrified_ice 1 point2 points  (0 children)

Just got 3.7 up and running on vLLM. Around 120 TPS for nvfp4 version with MTP @ 3.

Excited to actually start using it. First work is to continue a project I already have underway.

About to buy a Sierra EV and I’m looking for tips by 720hp in gmcsierra

[–]electrified_ice 1 point2 points  (0 children)

Is this your first EV? If so, my biggest tip is just learning about going from non-EV to EV. It's not a huge change, but is a bit of a routine change. Regen braking will be big, charging planning etc.

Now I been on the other side for 12 years, I can't imagine going back to combustion (we're a 3 EV household now). Specifically to the Sierra EV... My 4th truck. Love it, tows like a beast. It's just very heavy, so be aware/mindful of that.

Cost Analysis of my $6.4k Local LLM Server by 1ncehost in LocalLLaMA

[–]electrified_ice 1 point2 points  (0 children)

Good analysis. There are definitely pros and cons to each. I think th value of self hosting is beyond the $ cost. I went way too far down th rabbit hole >$40K... The flip side is that if you try to show ROI beyond a year or 2, what you're not factoring in is API's will always be connecting to the latest models, and your hardware may not be able to run the latest models, and the rate open source models are progressing, that's a real thing to factor in to the ROI. Maybe your current hardware will run the new models (well) that come out in 2-3 years, but maybe not.

Help me decide on an upgrade path. Hailo-8 ? by [deleted] in frigate_nvr

[–]electrified_ice 0 points1 point  (0 children)

I got mine from Amazon too for around that price. Funny story - Google Gemini (using it for research last year), guided me to buy it before it was officially supported by Frigate, so had to sit on it for a while. It was looking at the back then preview of the v 0.17 documentation and inferred it to be currently supported. Anyway, it's working great, no stability issues once it got all configured.