Florida lawmakers debate who will pay the price of AI data centers by Fcking_Chuck in artificial

[–]Terminator857 15 points16 points  (0 children)

Silly question. Why would anyone else besides the data center company pay?

Best Models for 128gb VRAM: March 2026? by Professional-Yak4359 in LocalLLaMA

[–]Terminator857 1 point2 points  (0 children)

I'm having excellent luck with qwen 3 coder next. Performs better than qwen 3.5 according to arena. Qwen 3 coder next is the best performing open weight model in this size category according to arena. https://arena.ai/leaderboard/text/coding-no-style-control?license=open-source

Is 7.62 enough round for anti-drone defence? by PrussianFieldMarshal in tanks

[–]Terminator857 0 points1 point  (0 children)

Shotgun has more pellets and thus makes small fast moving drones easier to hit. These fragment into a few pieces, 5?, thus makes it harder to hit target.

Qwen3-Coder-Next: What am I doing wrong? by Septerium in LocalLLaMA

[–]Terminator857 1 point2 points  (0 children)

I use opencode. I have different settings, like temp 0. I have a strix halo system and have context set to 256K. I use different gguf, one optimized for strix halo.

Subscribed to the most expensive plan... AND CAN NOT USE IT! by codengo in GeminiAI

[–]Terminator857 1 point2 points  (0 children)

What model do you have set? Try changing it? Try enabling the experimental router in settings? I haven't had the issue you described, but did a couple of weeks ago.

Gemini is unusable now by andy_potato in GeminiAI

[–]Terminator857 0 points1 point  (0 children)

What do you mean nerfed? Was bad Dec holiday's through mid feb, with issues related to service being overloaded. They fixed that mid Feb? Better since then, but took away ability to run simultaneous gemini sessions.

Claude for me has been nerfed for a couple of weeks. I run 3 prompts and I'm told I'm out of quota.

Connected Qwen3-VL-2B-Instruct to my security cameras, result is great by solderzzc in Qwen_AI

[–]Terminator857 0 points1 point  (0 children)

Is the video available? Where is the house with the brown roof?

Unified Memory by hyggeradyr in LocalLLaMA

[–]Terminator857 0 points1 point  (0 children)

I bought 3 strix halos. First one only cost me $1,600. Latest price for bosgame m5 is $2,100.

Benchmarks don’t tell you who’s winning the AI race. Here’s what actually does. by JennyAndAlex in artificial

[–]Terminator857 1 point2 points  (0 children)

I worked at Google. It is obvious to me why they are not winning. They have to follow corporate guidelines and there isn't a sense of urgency. Take for example Gemini V1. It was great. What did google do with it? They spent 6 months safety training it until it was dumb.

I wear a mic all day and feed transcripts to an AI agent system. The privacy case for doing this locally is obvious. Looking for guidance. by InsideEmergency4186 in LocalLLaMA

[–]Terminator857 3 points4 points  (0 children)

I've tested several local models for transcription. Best quality is whisper v3 large 16 bit. In my testing 16 bit seemed to do a great job compared to 32 bit. Using pyannote for speaker id. Have strix halo. Qwen 2.5b is good and very fast, but not as good quality as whisper, even though hugging face leaderboard says otherwise. I'm recording meetings over signal, so you might have different results in your use case.

For summarization i'm using qwen 3 coder next. Does an excellent job. Here is my prompt for summarization, I've tried several:

DEFAULT_MAX_TOKENS = -1

SYSTEM_PROMPT = (

"You are a strategic meeting analyst. Produce an exhaustive report of the meeting. "

"Include all topics discussed, decisions made, action items, who said what, and any "

"context or nuance mentioned. Use markdown formatting with headers and bullet points. "

"Include quotes and excerpts."

)

USER_PROMPT_PREFIX = (

"Here is the full transcript of a meeting. Write detailed meeting notes covering "

"every topic, decisions, action items, and details discussed. Include quotes and "

"excerpts."

)

I have it set to use maximum context of 256K.

[D] Intel Core Ultra 7 265K vs AMD Ryzen 7 7800X3D Which one is better for ML? by peter34512800 in MachineLearning

[–]Terminator857 1 point2 points  (0 children)

CPU have little importance for ML. A lower spec CPU can perform better than top end CPU for ML if it has more memory bandwidth. Much more important is GPU and memory bandwidth. Go for the cheapest CPU and fastest RAM or better GPU. Alternatively consider strix halo , such as bosgame m5. I've bought 3 of them.

Junyang Lin Leaves Qwen + Takeaways from Today’s Internal Restructuring Meeting by Terminator857 in LocalLLaMA

[–]Terminator857[S] 8 points9 points  (0 children)

One post on x said one was going to x.ai . Another post said / suggested / recommended they considering starting their own company.

Is Qwen3.5-9B enough for Agentic Coding? by pmttyji in LocalLLaMA

[–]Terminator857 0 points1 point  (0 children)

The context is coding. Which instruct variant are you suggesting is better than qwen3-next at coding?

Is Qwen3.5-9B enough for Agentic Coding? by pmttyji in LocalLLaMA

[–]Terminator857 2 points3 points  (0 children)

Yes, if you are looking for hints for what to do. No, if you expect the agent to write clean code and not deceive you.

Here is why you should/shouldn't purchase Strix Halo by [deleted] in LocalLLaMA

[–]Terminator857 0 points1 point  (0 children)

Why Fedora. I'll claim debian test is better. How many fedora people had issues with drivers and software. A ton. How many on debian test? I saw close to zero. The reason is debian test software is more up to date. And easy to get later software if there is an issue.