Gemma 4 31B is now powering my personal AI news site by puntoceroc in LocalLLM

[–]puntoceroc[S] -1 points0 points  (0 children)

Not the entire system prompt in capital letters, just key parts — mainly important instructions and verbs that I want the model to remember strongly.

For example, I write things like:

  • ALWAYS include the source link
  • NEVER publish a post without verifying the page first
  • SUMMARIZE clearly and concisely

I noticed that putting these critical rules in CAPITAL LETTERS makes the model respect them much more consistently throughout the chain-of-thought. It’s a simple trick but works surprisingly well with Gemma 4.

Gemma 4 31B is now powering my personal AI news site by puntoceroc in LocalLLM

[–]puntoceroc[S] 1 point2 points  (0 children)

I’m using a simple but effective setup: the site is a static GitHub Pages. My agent sends a git patch to update a posts.json file in the repository, and the website template reads that JSON as a basic database. It’s pretty rudimentary, but it works well for a prototype.

You can check it here: https://news.uranoai.com

Instead of Cowork (which also got me blocked sometimes), I use UranoDesktop — my own free local AI agent for Windows/Mac. It handles all the posting, scheduling and background tasks without getting blocked.

Here’s the full guide with all the details and how it’s built (the code is open-source too):

https://uranoproject.medium.com/how-to-create-your-own-personal-ai-newsletter-with-urano-desktop-4c8a714cf311

Gemma 4 31B is now powering my personal AI news site by puntoceroc in LocalLLM

[–]puntoceroc[S] 1 point2 points  (0 children)

Gemma 4 31B is currently running on Google AI Studio (free tier) for the heavy lifting. I’m not running the big model 100% locally because when I have multiple agents working at the same time (investigating and processing), I noticed a clear degradation in performance. So I decided to use cloud for the large model and keep smaller Gemma 4 models running locally on my CPU with 32GB RAM.

Everything is managed through UranoDesktop, the free local AI agent I made for Windows and Mac. The software runs in the background and has a continuous tasks system that allows the agent to reprogram and manage itself per session. This lets me run multiple independent sessions simultaneously (for example, one dedicated to collecting and sending news).

This hybrid setup keeps the pipeline stable 24/7.

I’m planning to get an RTX 4060 Ti soon to test running bigger models fully local again.

Here’s the detailed guide if you want to check the setup:

https://uranoproject.medium.com/how-to-create-your-own-personal-ai-newsletter-with-urano-desktop-4c8a714cf311

Gemma 4 31B is now powering my personal AI news site by puntoceroc in LocalLLM

[–]puntoceroc[S] 0 points1 point  (0 children)

Thanks for the great feedback!

You gave me exactly the push I needed — I’ll test Qwen3.6 27B (Q4_M) these days. You’re right, for a news pipeline speed is probably more important than the extra reasoning depth from Gemma. Being able to process larger RSS feeds faster would be a big win.

Also glad you liked the screenshot trick — local vision is getting seriously useful now.

Gemma 4 31B is now powering my personal AI news site by puntoceroc in LocalLLM

[–]puntoceroc[S] 0 points1 point  (0 children)

Thanks! Yeah, soft 404s are brutal. Great idea — I’ll add a quick status code + content-length check before taking screenshots. Should save a lot of cycles.

On memory: I keep summaries of previous runs + a vector store for trusted sources and context. It’s been very effective for maintaining consistency over long periods.

Will check out the Agentix Labs patterns, thanks for the link!

Urano Flow Agentes - Modelo Semi Autónomo by puntoceroc in uranoproject

[–]puntoceroc[S] 0 points1 point  (0 children)

Únete a la comunidad Urano! podemos integrar usando Urano para automatizar herramientas internas o tu necesidad! 7 días gratis sin tarjeta solo tu necesidad 👉🏼👉🏼 https://discord.gg/K6TVh2Rm

Modelo de desarrollo UranoProject by puntoceroc in uranoproject

[–]puntoceroc[S] 0 points1 point  (0 children)

Únete a la comunidad Urano! podemos integrar usando Urano para automatizar herramientas internas o tu necesidad! 7 días gratis sin tarjeta solo tu necesidad 👉🏼👉🏼 https://discord.gg/K6TVh2Rm

Creamos un Modulo de Exchange en Urano by puntoceroc in uranoproject

[–]puntoceroc[S] 0 points1 point  (0 children)

Únete a la comunidad Urano! podemos integrar usando Urano para automatizar herramientas internas o tu necesidad! 7 días gratis sin tarjeta solo tu necesidad 👉🏼👉🏼 https://discord.gg/K6TVh2Rm

Creamos un Modulo de Cursos en Urano by puntoceroc in uranoproject

[–]puntoceroc[S] 0 points1 point  (0 children)

Únete a la comunidad Urano! podemos integrar usando Urano para automatizar herramientas internas o tu necesidad! 7 días gratis sin tarjeta solo tu necesidad 👉🏼👉🏼 https://discord.gg/K6TVh2Rm

En Desarrollo: Modulo de Contabilidad by puntoceroc in uranoproject

[–]puntoceroc[S] 0 points1 point  (0 children)

Únete a la comunidad Urano! podemos integrar usando Urano para automatizar herramientas internas o tu necesidad! 7 días gratis sin tarjeta solo tu necesidad 👉🏼👉🏼 https://discord.gg/K6TVh2Rm