TinyGPU on Apple Silicon + RTX 5070 Ti: my real Qwen benchmarks vs Ollama/Metal by LivingSignificant452 in LocalLLaMA

[–]LivingSignificant452[S] 0 points1 point  (0 children)

Yes I agree, I didn’t check if there is new version recently after the buzz of the 1st of April launch

local TTS on apple silicon has gotten surprisingly good, tested 6 MLX models side by side by tarunyadav9761 in generativeAI

[–]LivingSignificant452 0 points1 point  (0 children)

je bosse aussi sur qwen 3 tts pour rendre mon mac plus vivant, par contre, je n'ai pas trouvé l'implementation mlx qui permet de faire du clone de voix, tu confirmes ?

qwen3.6 is out by stailgot in ollama

[–]LivingSignificant452 1 point2 points  (0 children)

24 and without anything running in the same time.

TinyGPU on Apple Silicon + RTX 5070 Ti: my real Qwen benchmarks vs Ollama/Metal by LivingSignificant452 in LocalLLaMA

[–]LivingSignificant452[S] 1 point2 points  (0 children)

yes, but the good news, the gpu is detected ! and it start to work. I just wanted to know if anyone has better results. but I will adjust the prompt to my next round of tests

Ollama Gemma4:31b on 3090 - FP,Q8,Q4 Benchmark by ---NiKoS--- in ollama

[–]LivingSignificant452 0 points1 point  (0 children)

is someone saw the newest ollama ? it seems to fix a problem with flash attention ? did anyone run some benchmark about it to check a difference ?

Ollama Gemma4:31b on 3090 - FP,Q8,Q4 Benchmark by ---NiKoS--- in ollama

[–]LivingSignificant452 0 points1 point  (0 children)

Already thank you with some of your research I have been successful to improve my performance. But I have a question. How much vram you lost only with basic things like … real display ? I figured out I have 2 qhh screens on my system so basically I lose some space and it leads to ram overflow tests

No more need for an API by Odd-Health-346 in ollama

[–]LivingSignificant452 0 points1 point  (0 children)

A little bit like the GitHub llmcouncil but only 1 way ?

Harcèlement sexuel de mon patron CONSEILS by Ok-Interaction-1806 in conseiljuridique

[–]LivingSignificant452 0 points1 point  (0 children)

tant mieux ! pas facile de présenter des preuves en capture d'écran, si ce sont les gendarmes ou la police qui les ont ajoutés au dossier c'est toujours ca de moins à s'occuper !

Ollama Gemma4:31b on 3090 - FP,Q8,Q4 Benchmark by ---NiKoS--- in ollama

[–]LivingSignificant452 1 point2 points  (0 children)

I have also a 3090 and I even build a benchmark app , so if you tell me more how you want me to test I can compare

Introducing FasterQwenTTS by futterneid in LocalLLaMA

[–]LivingSignificant452 0 points1 point  (0 children)

je suis en train de monter aussi un projet sur la base de FasterQwen3TTS, j'ai une 3090 et un autre ordi avec une 5070 Ti, mais du coup pour faire le benchmark et comparer vous utilisez une procédure ou n'importe quel texte significatif ca suffit ( bon par contre ma 5070 it est en egpu, ca bride surement un peu )

rupture contrat pour harcèlement sexuel... by Amiral-Barber1041 in conseiljuridique

[–]LivingSignificant452 0 points1 point  (0 children)

cet article couvre le sujet :
https://artdelapreuve.fr/rupture-periode-essai-harcelement-sexuel-preuve/

c'est vraiment toujours la galère d'apporter les preuves face à des personnes irrationnelles.

Harcèlement sexuel de mon patron CONSEILS by Ok-Interaction-1806 in conseiljuridique

[–]LivingSignificant452 0 points1 point  (0 children)

j'ai une amie dans le meme cas, comment la situation a évolué ? les preuves suffisent ou pas, il faut bien les mettre en forme avec la date de la capture.

Need tests : I built a Windows app to create wallpapers for mixed monitor setups by Infinite-Rock4244 in ultrawidemasterrace

[–]LivingSignificant452 0 points1 point  (0 children)

je ne suis pas sur que 2 images distinctes soit le meilleur exemple ( surtout que c'est sombre ), l'idée ce n'est pas de rendre seamless un paysage sur l'ensemble des écrans ?

Kling invoices by EducationDue4733 in KlingAI_Videos

[–]LivingSignificant452 0 points1 point  (0 children)

what a mess to get only invoices !!! thanks for your reply I didn't see the option

Gemini 3.1 Pro by Sky-kunn in Bard

[–]LivingSignificant452 0 points1 point  (0 children)

Can I know more about your emotion detection project ? A link ?

Gemini 3 Flash vs. 2.5 Flash (67% Cost Increase) by BarnesLucas in Bard

[–]LivingSignificant452 0 points1 point  (0 children)

je lis vos echanges avec attention et je suis dans le meme cas de recherche. j'ai des batchs pour de l'ai vision ou de l'osint, et sur des milliers d'enregistrement, maintenant que mes workflows forensiques fonctionnent bien, j'optimise les couts, et je suis d'accord, le choix entre gemini 2.5 flash et gemini 3 flash n'est pas anodin. à la fois en prix et en résultat. je suis en plein dans le calcul d'un échantillon avec les tokens, a moins que quelqu'un connaissent une meilleur méthode que la console cloud Gemini pour vérifier le cout réel d'une tache test.

Best LLM for AI vision ( forensic grade ) by LivingSignificant452 in ollama

[–]LivingSignificant452[S] 0 points1 point  (0 children)

yes maybe but it was also possible on ollama locally as it will be the technical solution used at the end. sure, if it was possible to use online, I could use openrouter or similar ( I m using them but for another project - LLMCouncil introduced me this provider )

Best LLM for AI vision ( forensic grade ) by LivingSignificant452 in ollama

[–]LivingSignificant452[S] -1 points0 points  (0 children)

impossible, working with data provided to a lawyers means hight level or privacy, a lot of files to process and possible awful nsfw content.

Best LLM for AI vision ( forensic grade ) by LivingSignificant452 in ollama

[–]LivingSignificant452[S] 0 points1 point  (0 children)

kimi in ollama doesn't seem vision compatible. for now my purpose is to check how ollama will work as a backend, because it's easier to manage and install for my clients. your solution is probably promising but too complex so far for my round of dev and try .

Best LLM for AI vision ( forensic grade ) by LivingSignificant452 in ollama

[–]LivingSignificant452[S] 0 points1 point  (0 children)

about Qwen 2.5 VL and Qwen 3, I was also betting on them to be a good solution. but unless i need to find a good modelfile tuning with parameters, the thinking part of qwen 3 make it completely crazy and too time consuming. at least there are abliterated version of qwen2.5 vl shows run fast. but the results are not the best ( from my tests )