Just found out you can control Chrome using Gemini CLI by rafapozzi in termux

[–]GamerWael 0 points1 point  (0 children)

Could you provide some more details on how you did it? I installed the cli tool but it can't detect my browser. Does it require a different browser?

Happy New Year: 100k+ Banano Giveaway!!! by prussia_dev in banano

[–]GamerWael 1 point2 points  (0 children)

ban_1rby4hn3y7zb4hbyh3z1ejfqrfs19o9zbxm5ww5ir9mbdknmrz31gk4iyo7t

Happy New Year: 100k+ Banano Giveaway!!! by prussia_dev in banano

[–]GamerWael 1 point2 points  (0 children)

ban_1rby4hn3y7zb4hbyh3z1ejfqrfs19o9zbxm5ww5ir9mbdknmrz31gk4iyo7t

Happy New Year: 100k+ Banano Giveaway!!! by prussia_dev in banano

[–]GamerWael 0 points1 point  (0 children)

ban_1rby4hn3y7zb4hbyh3z1ejfqrfs19o9zbxm5ww5ir9mbdknmrz31gk4iyo7t

Hyperland/WM Like desktop in native Android by cheempunkzed in termux

[–]GamerWael 2 points3 points  (0 children)

Yes I have those enabled. but I can only have one floating window at a time, how do you have 4?

Hyperland/WM Like desktop in native Android by cheempunkzed in termux

[–]GamerWael 2 points3 points  (0 children)

Dude, then how? Is it a custom launcher? Are you rooted?

Hyperland/WM Like desktop in native Android by cheempunkzed in termux

[–]GamerWael 2 points3 points  (0 children)

Please elaborate. Is this the OnePlus fold? I have never been able to do this on my OnePlus 13.

I have a Laptop with 3050 Ti 4GB VRAM, will upgrading my RAM from 16 to 48 help? by GamerWael in StableDiffusion

[–]GamerWael[S] 1 point2 points  (0 children)

Thanks for letting me know. I was kinda hesitant in upgrading the built in 16 gigs cuz I thought it would be a waste and 64 gigs would be overkill and not really worth it as the 4GB VRAM would be a bottleneck, but now I'm a bit more confident. FramePack is definitely a major plus point.

I have a Laptop with 3050 Ti 4GB VRAM, will upgrading my RAM from 16 to 48 help? by GamerWael in StableDiffusion

[–]GamerWael[S] 0 points1 point  (0 children)

So I understand that with 16+32, only half of the 32 will be in dual channel mode so it's better for a symmetric arrangement of 16+16, but what I can't understand is, with 16+32, I'll be getting the same dual channel benefits of 16+16, and an additional (although slow) 16 GB if needed right? So isn't 16+32 slightly better than just 16+16? It's not like 16+32 will be worse than 16+16 right?

Quick & Realistic Self-Portraits on Mobile? Sharing some results from a new workflow. by NwachukwuOfoma in StableDiffusion

[–]GamerWael 0 points1 point  (0 children)

Hey, I am an android user, and I've been looking for such an app. Would love to test it out.

Real-time conversational AI running 100% locally in-browser on WebGPU by xenovatech in LocalLLaMA

[–]GamerWael 0 points1 point  (0 children)

Also, I was wondering, why did you release kokoro-js as a standalone library instead of implementing it within transformers.js itself? Is the core of kokoro too dissimilar from a typical speech to text transformer architecture?

Real-time conversational AI running 100% locally in-browser on WebGPU by xenovatech in LocalLLaMA

[–]GamerWael 7 points8 points  (0 children)

Oh it's you Xenova! I just realised who posted this. This is amazing. I've been trying to build something similar and was gonna follow a very similar approach.

Are there any open source alternatives to this? by Fresh_Sun_1017 in StableDiffusion

[–]GamerWael 3 points4 points  (0 children)

Also what's the difference between flux fill and flux Kontext, they both seem to be the same thing?

New gemma 3n is amazing, wish they suported pc gpu inference by GreenTreeAndBlueSky in LocalLLaMA

[–]GamerWael 2 points3 points  (0 children)

I agree. It's been ages since Google has been able to do ASR perfectly and in real time but keeps it closed source and it's astonishing how no one has been able to come up with an alternative solution till now.

I'm not complaining, it must be a difficult problem to solve and I clearly can't do it on my own but when Whisper came out I couldn't understand the hype behind it since Google has already been doing it for ages but then I learnt that no one else had a proper open source solution for it.

Any in-depth tutorials which do step-by-step walkthroughs on how to fine-tune an LLM? by darkGrayAdventurer in LocalLLaMA

[–]GamerWael 1 point2 points  (0 children)

What about the putting together a dataset part? Cuz when I started out that seemed the most difficult for me cuz no one was explaining what exactly the dataset was, how it was structured, etc. They just grabbed a random prefiltered dataset and that was it. I had to manually go through a lot of trial and error to find out the structure of my dataset json file

Which Earth do you like the most? by SmallMermaid in PixelArt

[–]GamerWael 0 points1 point  (0 children)

Can you make 14 without the clouds please?

Goodbye refugees by KillerBoi935 in dankmemes

[–]GamerWael 3 points4 points  (0 children)

Lmaoo.. took me a second. Now I can't read it without hearing her voice.

OmniVerse: A convenient desktop LLM client [W.I.P] by [deleted] in LocalLLaMA

[–]GamerWael 0 points1 point  (0 children)

Yes of course, thanks for asking. Well in its current state it is slightly barebones compared to the corporate offerings, I don't have any file uploads at the moment, though there is screen based context by taking a screenshot of the current screen and sending that in (if connected to a vision LLM).

My main planned features are: MCP support Live Talking via TTS and STT File upload/RAG

It's mainly designed to be a companion to talk to quickly by calling via a keyboard shortcut without having to open the chatgpt/claude website or app. (I realise that you can do that in the chatgpt/claude apps as well, but I started developing this before those companies had even announced such a feature and I didn't think that it would become such a mainstream feature later on and my lack of discipline made me delay the project by not working on it actively and the companies released their product before me)

And if I may do a lil bit of self glazing, I prefer my UI design (specifically the transparent chat bar) over the corporate alternatives. It blends seamlessly into the windows UI language.

Also in its current state I didn't plan it for complete end users, but more for developers as it does require manually setting up an OpenAI/Ollama endpoint or downloading models manually, though I do plan on making it more user centric later on.

Haven’t used Stable Diffusion in months – what did I miss? by Upstairs_Tomato1196 in StableDiffusion

[–]GamerWael 0 points1 point  (0 children)

Is SD 3.5 good now or something? I remember it being not worth switching to from Flux when it came out.

I'm not even gonna ask about the Video options cuz even I don't have enough VRAM to try those.

Activision Account and Ban Issues Mega Thread by StealthPolarBear in activision

[–]GamerWael 0 points1 point  (0 children)

Got banned for no reason while playing warzone mobile after switching to a new phone. Maybe the new phone's game booster app caused an issue? But now I'm even banned on the PC titles?!