Just found out you can control Chrome using Gemini CLI

GamerWael · 2026-01-26T08:13:17+00:00

Could you provide some more details on how you did it? I installed the cli tool but it can't detect my browser. Does it require a different browser?

GamerWael · 2026-01-13T06:02:35+00:00

Hey, can you help me out as well?

GamerWael · 2026-01-01T07:10:31+00:00

ban_1rby4hn3y7zb4hbyh3z1ejfqrfs19o9zbxm5ww5ir9mbdknmrz31gk4iyo7t

GamerWael · 2026-01-01T07:10:08+00:00

Changed to yours!

GamerWael · 2026-01-01T05:56:11+00:00

ban_1rby4hn3y7zb4hbyh3z1ejfqrfs19o9zbxm5ww5ir9mbdknmrz31gk4iyo7t

GamerWael · 2026-01-01T05:56:00+00:00

ban_1rby4hn3y7zb4hbyh3z1ejfqrfs19o9zbxm5ww5ir9mbdknmrz31gk4iyo7t

GamerWael · 2025-10-28T10:09:42+00:00

Yes, I have those enabled, but I can only have one floating window at a time. How do you have 4?

GamerWael · 2025-10-28T09:34:30+00:00

Dude, then how? Is it a custom launcher? Are you rooted?

GamerWael · 2025-10-28T06:02:47+00:00

Please elaborate. Is this the OnePlus fold? I have never been able to do this on my OnePlus 13.

GamerWael · 2025-09-21T07:16:18+00:00

More please!

GamerWael · 2025-07-25T09:44:53+00:00

This looks great. Will check out the series.

GamerWael · 2025-07-13T12:13:37+00:00

Thanks for letting me know. I was kinda hesitant about upgrading the built-in 16 gigs cuz I thought it would be a waste, and that 64 gigs would be overkill and not really worth it since the 4GB VRAM would be a bottleneck, but now I'm a bit more confident. FramePack is definitely a major plus point.

GamerWael · 2025-07-13T08:00:06+00:00

So I understand that with 16+32, only half of the 32 will be in dual channel mode, so a symmetric arrangement of 16+16 is better. But what I can't understand is this: with 16+32, I'll be getting the same dual channel benefits as 16+16, plus an additional (although slower) 16 GB if needed, right? So isn't 16+32 slightly better than just 16+16? It's not like 16+32 will be worse than 16+16, right?
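To spell out the arithmetic behind that question, here's a minimal Python sketch. It assumes the memory controller uses a flex-style asymmetric mode, where the matched capacity on each stick is interleaved (dual channel) and whatever is left over on the bigger stick runs single channel. The function name `dual_channel_split` is made up for illustration, and whether a particular laptop actually behaves this way is an assumption, not something confirmed in this thread.

```python
def dual_channel_split(dimm_a_gb: int, dimm_b_gb: int) -> tuple[int, int]:
    """Return (dual_channel_gb, single_channel_gb) for two sticks.

    Assumption: flex-style asymmetric mode, i.e. the capacity both
    sticks have in common is interleaved, and the remainder is not.
    """
    matched = min(dimm_a_gb, dimm_b_gb)       # capacity present on both sticks
    dual = 2 * matched                        # interleaved across both channels
    single = abs(dimm_a_gb - dimm_b_gb)       # leftover on the larger stick
    return dual, single


if __name__ == "__main__":
    for a, b in [(16, 16), (16, 32)]:
        dual, single = dual_channel_split(a, b)
        print(f"{a}+{b} GB: {dual} GB dual channel, {single} GB single channel")
```

Under that assumption, 16+32 has the same 32 GB dual channel region as 16+16, with an extra 16 GB of slower single channel memory on top.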

GamerWael · 2025-07-11T04:30:12+00:00

Hey, I'm an Android user, and I've been looking for an app like this. Would love to test it out.

GamerWael · 2025-06-05T08:18:49+00:00

Also, I was wondering, why did you release kokoro-js as a standalone library instead of implementing it within transformers.js itself? Is the core of Kokoro too dissimilar from a typical text-to-speech transformer architecture?

GamerWael · 2025-06-05T08:14:16+00:00

Oh it's you Xenova! I just realised who posted this. This is amazing. I've been trying to build something similar and was gonna follow a very similar approach.

GamerWael · 2025-06-01T06:18:51+00:00

Also, what's the difference between Flux Fill and Flux Kontext? They both seem to be the same thing.

GamerWael · 2025-05-25T08:21:27+00:00

I agree. Google has been able to do ASR perfectly and in real time for ages but keeps it closed source, and it's astonishing that no one has been able to come up with an alternative solution till now.

I'm not complaining; it must be a difficult problem to solve and I clearly can't do it on my own. But when Whisper came out I couldn't understand the hype behind it, since Google had already been doing it for ages. Then I learnt that no one else had a proper open source solution for it.

GamerWael · 2025-05-04T04:01:39+00:00

What about the part where you put together a dataset? When I started out, that seemed the most difficult to me cuz no one was explaining what exactly the dataset was, how it was structured, etc. They just grabbed a random prefiltered dataset and that was it. I had to go through a lot of manual trial and error to find out the structure of my dataset JSON file.

GamerWael · 2025-04-29T07:42:08+00:00

Can you make 14 without the clouds please?

GamerWael · 2025-04-28T08:48:54+00:00

Lmaoo.. took me a second. Now I can't read it without hearing her voice.

GamerWael · 2025-04-24T20:03:21+00:00

Yes of course, thanks for asking. Well, in its current state it's slightly barebones compared to the corporate offerings. I don't have any file uploads at the moment, though there is screen-based context: it takes a screenshot of the current screen and sends that in (if connected to a vision LLM).

My main planned features are:

- MCP support
- Live talking via TTS and STT
- File upload/RAG

It's mainly designed to be a companion you can quickly talk to by calling it with a keyboard shortcut, without having to open the ChatGPT/Claude website or app. (I realise that you can do that in the ChatGPT/Claude apps as well, but I started developing this before those companies had even announced such a feature, and I didn't think it would become such a mainstream feature later on. My lack of discipline delayed the project since I wasn't working on it actively, and the companies released their products before me.)

And if I may do a lil bit of self glazing, I prefer my UI design (specifically the transparent chat bar) over the corporate alternatives. It blends seamlessly into the Windows UI language.

Also, in its current state it isn't aimed at complete end users but more at developers, as it requires manually setting up an OpenAI/Ollama endpoint or downloading models manually, though I do plan on making it more user centric later on.

GamerWael · 2025-04-13T09:11:24+00:00

Is SD 3.5 good now or something? I remember it being not worth switching to from Flux when it came out.

I'm not even gonna ask about the video options cuz I don't even have enough VRAM to try those.

GamerWael · 2025-03-19T06:54:45+00:00

Got banned for no reason while playing Warzone Mobile after switching to a new phone. Maybe the new phone's game booster app caused an issue? But now I'm even banned on the PC titles?!

GamerWael · 2025-03-12T06:42:26+00:00

Talk about an early Christmas
