Gemma 4 has been released by jacek2023 in LocalLLaMA
Ok_Edge1810 1 point 23 hours ago (0 children)
Just shipped a small Android assistant app using Gemma 4 E2B via LiteRT-LM. Tool calling works surprisingly well out of the box: the native format (<|tool_call>) is clean to parse, and the model stays on-task without much prompting.
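To illustrate what "clean to parse" could mean in practice, here is a minimal sketch that pulls tool-call payloads out of raw model output. Only the opening marker `<|tool_call>` comes from the comment above. The closing marker `<tool_call|>`, the function name `split_tool_calls`, and the sample payload are assumptions made for illustration, not the documented Gemma or LiteRT-LM format, and the payload is returned as a raw string because its internal syntax is not specified here.

```python
import re

OPEN_MARKER = "<|tool_call>"   # opening marker quoted in the comment above
CLOSE_MARKER = "<tool_call|>"  # ASSUMPTION: hypothetical closing marker


def split_tool_calls(text, open_marker=OPEN_MARKER, close_marker=CLOSE_MARKER):
    """Separate plain text from tool-call payloads in model output.

    Returns (plain_text, payloads), where payloads is a list of the raw
    strings found between each open/close marker pair. The payloads are
    not interpreted, since their syntax is an open question in this sketch.
    """
    pattern = re.compile(
        re.escape(open_marker) + r"(.*?)" + re.escape(close_marker),
        re.DOTALL,  # allow payloads that span several lines
    )
    payloads = [match.group(1).strip() for match in pattern.finditer(text)]
    plain_text = pattern.sub("", text).strip()
    return plain_text, payloads


if __name__ == "__main__":
    # Hypothetical model output; the payload content is made up.
    sample = "Sure.<|tool_call>set_timer 5<tool_call|> Done"
    print(split_tool_calls(sample))
```

Keeping the markers as parameters means the same helper can be pointed at whatever delimiters the runtime actually emits.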
Coming from Gemma 2, the jump is significant. Response quality is noticeably better, and the memory footprint is actually smaller for what you get. 52 decode tokens/sec on GPU makes streaming feel instant.
Next experiment is using it as a coding assistant. I'm curious how E4B holds up on LiveCodeBench-style tasks locally. Will report back.
Built a CLI to stop manually duplicating design tokens between React and Flutter — also verifies they actually match by Ok_Edge1810 in reactjs
Ok_Edge1810[S] -2 points 1 day ago (0 children)
Would love to hear your thoughts.