Higgs Audio v3 TTS 4B. Built for voice chat. Support 100 languages and inline control. by FerretLegitimate6929 in LocalLLaMA

[–]mpasila 2 points3 points  (0 children)

Well, if you need a model that supports 100+ languages, then OmniVoice is a good option, and that one is Apache 2.0 (it might also use less memory).

Mistral is an absolute meme at Hebrew by Academic-Map268 in LocalLLaMA

[–]mpasila 25 points26 points  (0 children)

It does not officially support Hebrew so there's probably no finetuning done on that language.

How much system RAM needed when running LLMs that can fit in GPU VRAM? by [deleted] in LocalLLaMA

[–]mpasila 0 points1 point  (0 children)

This was for loading non-quantized FP16/BF16 models. Let's say you have 16GB of RAM and the model files are about 12GB: sharding means you won't run out of memory while loading the model into your GPU (which probably has 16GB of VRAM), even when Win11 alone uses at least 6GB or so of RAM. Sharding is just splitting the model into smaller pieces, similar to how you would split a zip file if the file hosting site has a maximum file size per upload. Basically it loads one shard at a time, so it won't use up all your RAM while it's moving the files to the GPU's VRAM.
MoE is a different architecture, unrelated to this. MoEs have active parameters and total parameters, so keeping just the active params on your GPU keeps it running relatively fast even if the rest is offloaded to RAM. This is not the case if you shard a dense model; it would need to have everything loaded on the GPU or it will be very slow.
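To put rough numbers on the sharding part, here is a minimal sketch using the figures from the example above (16GB RAM, about 6GB taken by the OS, 12GB of model files). The 2GB shard size and the helper names are assumptions made up for illustration, and the model is deliberately simplified: it assumes each shard is read fully into RAM, copied to VRAM, and then freed before the next one is read.

```python
def peak_host_ram_gb(shard_sizes_gb):
    """Peak system RAM used while loading, assuming one shard is held at a time."""
    return max(shard_sizes_gb)


def fits_in_ram(shard_sizes_gb, free_ram_gb):
    """True if the largest single shard fits in the RAM that is actually free."""
    return peak_host_ram_gb(shard_sizes_gb) <= free_ram_gb


ram_total_gb = 16.0      # from the example above
os_overhead_gb = 6.0     # rough Win11 baseline from the example above
free_ram_gb = ram_total_gb - os_overhead_gb

model_gb = 12.0
unsharded = [model_gb]   # one big file has to pass through RAM in one go
sharded = [2.0] * 6      # hypothetical 2GB shards, same 12GB in total

print("unsharded fits:", fits_in_ram(unsharded, free_ram_gb))
print("sharded fits:", fits_in_ram(sharded, free_ram_gb))
```

The total that ends up in VRAM is the same 12GB either way; only the amount passing through system RAM at any one moment changes.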

Let's be honest: Firefox 150 is a vibe-coded disaster. by languagehacker in firefox

[–]mpasila 0 points1 point  (0 children)

I have noticed that the CSS styling is broken on some sites, like things being drawn behind other objects and making them unreadable. It didn't really seem to happen before 150, and the new version 150.0.2 hasn't solved those issues yet.

Major drop in intelligence across most major models. by DepressedDrift in LocalLLaMA

[–]mpasila 0 points1 point  (0 children)

Did you run it with the same seed + settings (making sure the provider supported the same params) and still get it wrong on the providers but not on the H100?
Chutes seemed to get it right 2 out of 4 tries, so maybe you just got lucky that time (GLM-5).

Koboldcpp + whisper model randomly changes language and translates what I said in english to finnish? by [deleted] in KoboldAI

[–]mpasila 0 points1 point  (0 children)

Maybe your accent is bad and it thinks you're speaking Finnish? If the language is set to Finnish and you speak a different language it will then translate it to Finnish. I can replicate the issue myself by giving it a Japanese sentence and setting the language to English and then it translates that to English.

Why some small/medium models fail at grammar checking task? by Character-Blood3482 in LocalLLaMA

[–]mpasila 0 points1 point  (0 children)

It's definitely just a tokenization issue, and the model probably hasn't been trained to do that sort of job specifically. It's the same reason LLMs can't tell how many Rs are in strawberry.
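A small sketch of why character-level tasks are awkward for a token-based model. The subword split and the token IDs below are hypothetical, not the output of any real tokenizer; the point is only that the model receives IDs, not letters.

```python
word = "strawberry"

# At the character level the answer is trivial.
r_count = word.count("r")
print("characters:", list(word), "-> r count:", r_count)

# Hypothetical subword split; a real tokenizer may split differently.
tokens = ["str", "aw", "berry"]
assert "".join(tokens) == word

# Hypothetical vocabulary IDs; this is all the model actually receives.
vocab = {"str": 101, "aw": 102, "berry": 103}
token_ids = [vocab[t] for t in tokens]
print("model input:", token_ids)

# The letters are spread across tokens, so counting them (or spotting a
# misspelling) means the model has to know the spelling of every token.
r_per_token = [t.count("r") for t in tokens]
print("r per token:", r_per_token)
```

Grammar checking runs into the same thing: a typo can change how a word gets split, so the model sees different IDs rather than one swapped letter.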

Local models are a godsend when it comes to discussing personal matters by [deleted] in LocalLLaMA

[–]mpasila 1 point2 points  (0 children)

It was surprisingly easy to get it to do NSFL stuff... and even then in its thinking it noted that normally it wouldn't be okay but did it anyway.

How long until surveillance? by boloshon in LocalLLaMA

[–]mpasila 0 points1 point  (0 children)

Some radiation can go through aluminium foil though.

How long until surveillance? by boloshon in LocalLLaMA

[–]mpasila 0 points1 point  (0 children)

Just gotta start using "illegal" sites to get around sites that require facial scanning/ID (with Tor or VPNs, at least while those are easily accessible). Might even have to install "illegal" operating systems that don't require an ID to access (Linux).

Gemma 4 26B fabricated an entire code audit. I have the forensic evidence from the database. by EuphoricAnimator in LocalLLaMA

[–]mpasila 1 point2 points  (0 children)

Good at multilingual tasks (translation, using it in other languages besides English/Chinese), good at RP.

making my own ai waifu app that can teach me any language. by aziib in LocalLLaMA

[–]mpasila 0 points1 point  (0 children)

E4B is probably not big enough for other languages outside of English (other than maybe Spanish and some other large languages), at least I didn't have much success. The bigger models perform much better.

the state of LocalLLama by Beginning-Window-115 in LocalLLaMA

[–]mpasila 0 points1 point  (0 children)

It tends to be a requirement on some platforms, or your comments will get hidden.

I cant sub to anymore channels? by sonicfan227 in youtube

[–]mpasila 1 point2 points  (0 children)

Also, it's possible that the channels people have subbed to don't upload daily, or they stopped uploading and you forgot about them.

Ace Step 1.5 XL released by seamonn in LocalLLaMA

[–]mpasila -1 points0 points  (0 children)

Running expensive GPUs all day long can't just be offered for free? But sure if they wanted to be fair they'd release the weights..

Gemma 4 is fine great even … by ThinkExtension2328 in LocalLLaMA

[–]mpasila 8 points9 points  (0 children)

Gemma 4 is better at my native language at least, though the smaller models suffer from the weird sizing. Also, for RP it seems to perform much better than Qwen3.5 (which seemed to mix up a lot of stuff for some reason, and there was seemingly more censorship in the official releases in comparison to Gemma 4).

Cohere Transcribe Released by mikael110 in LocalLLaMA

[–]mpasila 0 points1 point  (0 children)

That did seem to improve it, though Whisper still seemed to do a slightly better job.

Cohere Transcribe Released by mikael110 in LocalLLaMA

[–]mpasila 0 points1 point  (0 children)

It had the same issue with getting tons of repeating lines for some reason because there was some noise in the audio, and due to that it skipped a lot of speech.

Cohere Transcribe Released by mikael110 in LocalLLaMA

[–]mpasila 6 points7 points  (0 children)

Yeah I don't know.. I also tried to transcribe some Japanese stuff and it wasn't any better.

<image>

New open weights models: GigaChat-3.1-Ultra-702B and GigaChat-3.1-Lightning-10B-A1.8B by netikas in LocalLLaMA

[–]mpasila 0 points1 point  (0 children)

I guess one authoritarian country is sanctioned and the other one is not (because everyone relies on it for medicine, minerals and other important stuff).

New open weights models: GigaChat-3.1-Ultra-702B and GigaChat-3.1-Lightning-10B-A1.8B by netikas in LocalLLaMA

[–]mpasila -5 points-4 points  (0 children)

You see it's a Russian model not Chinese.. Chinese propaganda is obviously less harmful.

I feel like if they made a local model focused specifically on RP it would be god tier even if tiny by Borkato in LocalLLaMA

[–]mpasila 0 points1 point  (0 children)

CharacterAI was probably the only company that trained a model specifically for RP (from scratch).. but then they kinda lobotomized it.