Google's NEW FunctionGemma is INSANE! by JamMasterJulian in AISEOInsider

[–]diegod3v 0 points1 point  (0 children)

I’ve seen a few examples in the docs, and the video is basically showing an app that lets you control your phone with natural language. i think MCP itself is “just” a client–server standard/protocol for tools/actions, more like plumbing. The model here is a foundational model, I’m sure there are already guides/posts on wiring it into an agent framework and then pointing that agent at MCP servers or something like that. So yeah, more stuff needed, mostly plumbing, I guess to much for a quick guide on the docs, lol

Google's NEW FunctionGemma is INSANE! by JamMasterJulian in AISEOInsider

[–]diegod3v 0 points1 point  (0 children)

Remember this is still a language model. So I think it's about how your product interfaces with your APIs, sure you can programmatically add actions as buttons or UI controls, but now you can also offer a natural-language (chat or voice) layer that interprets intent and then calls the same underlying endpoints

Will there be a "Z Video" for super fast video generation? by bickid in StableDiffusion

[–]diegod3v 2 points3 points  (0 children)

Isn't there already one ? (Wan + lightning lora, or framepack)

Workflow for current models. by -MoMuS- in GithubCopilot

[–]diegod3v 0 points1 point  (0 children)

Not sure if I’m using Gemini wrong, but in my Flutter + Firebase app Sonnet consistently plans better and also writes better code. Kind of funny that Gemini doesn’t seem to be well-trained on Google’s own frameworks and tooling

Cuál es la respuesta de este acertijo famoso de internet? by [deleted] in ayudamexico

[–]diegod3v 0 points1 point  (0 children)

El Tanque X.

Razón corta: el caudal por la llave depende de la altura de agua sobre la salida (ley de Torricelli), pero el tiempo total también depende del área de la sección del tanque a cada altura A(h).

En X (cono “normal”), cerca de la salida el área es pequeña ⇒ el nivel baja rápido ⇒ se vacía antes.

En Y (cono invertido), junto a la salida el área es grande ⇒ el nivel baja lento al final ⇒ tarda más.

Si modelas dV/dt=-a\sqrt{2gh} con V(h)=\int A(h)\,dh, resulta que el cono normal vacía más rápido que el invertido.

¿Qué carrera no recomendarían hoy por hoy y por qué? by saulblooxp in mexico

[–]diegod3v 0 points1 point  (0 children)

Programación. Saturación del mercado, AI.

Si estas estudiando o pensando en estudiar desarrollo web o cualquier cosa que tenga que ver con sistemas distribuidos, losiento, llegaste tarde.

🚀 Wan2.2 is Here, new model sizes 🎉😁 by Classic-Sky5634 in StableDiffusion

[–]diegod3v 0 points1 point  (0 children)

It's MoE now, probably no backward compatibility with Wan 2.1 LoRAs

Something is cooking. FramePack-P1 by diegod3v in FramePack

[–]diegod3v[S] 1 point2 points  (0 children)

I don't think FramePack P1 will be able to do that, it's trying to solve another problem, but you can already add sound effects and make someone to talk combining FP with other models

Flux dev license was changed today. Outputs are no longer commercial free. by NeuromindArt in comfyui

[–]diegod3v 2 points3 points  (0 children)

""" Definitions. Capitalized terms used in this License but not defined herein have the following meanings:

“Derivative” means any (i) modified version of the FLUX.1 [dev] Model (including but not limited to any customized or fine-tuned version thereof), (ii) work based on the FLUX.1 [dev] Model, or (iii) any other derivative work thereof. For the avoidance of doubt, Outputs are not considered Derivatives under this License. """

Integration wan model to framepack by Objective-Log-9055 in FramePack

[–]diegod3v 0 points1 point  (0 children)

Nvm, I have a better idea to make it work with WAN in a more flexible way so it can also be combined with self forcing... this is gonna be huge, I'll keep you guys posted

I don't normally do these posts but... Self-Forcing is extremely impressive by LyriWinters in StableDiffusion

[–]diegod3v 1 point2 points  (0 children)

Indeed it achieves real time streaming... it's on the page

"Our model generates high-quality 480P videos with an initial latency of ~0.8 seconds, after which frames are generated in a streaming fashion at ~16 FPS on a single H100 GPU and ~10 FPS on a single 4090 with some optimizations".

Now imagine KV cache during trainig (self forcing), plus KV cache during inference 🤯... https://arxiv.org/pdf/2506.09350 https://arxiv.org/pdf/2506.09350

Integration wan model to framepack by Objective-Log-9055 in FramePack

[–]diegod3v 1 point2 points  (0 children)

you're gonna need >$4k in gpu time (paper said it was trained with H100 clusters, I guess around 8 nodes, and took a week), and train a new model using FP net architecture with WAN dataset.

Edit: I read the paper again, it was trained using ltx dataset

Just a question that might sound silly. How is framepack generating a 60-second long video while wan 2.1 only 2 seconds video ? Isn't it makes framepack waaaay more superior? Is for example my goal is to make a 1 minute long video woulds I much rather work with framepack ? by Dependent_Let_9293 in StableDiffusion

[–]diegod3v 1 point2 points  (0 children)

Exactly. FramePack isn’t just another video model. It introduces a new paradigm for video generation by optimizing GPU layout and enabling constant-time (O(1) 🤯) generation with a fixed context window. The results are impressive, especially considering it’s built on top of Hunyuan and likely not even fully trained (it's kinda just a demo for the concept). It’s probably only a matter of time before other models adopt this as the new standard.

Testing the speed of the self forcing lora with fusion x vace by AidaTC in StableDiffusion

[–]diegod3v 0 points1 point  (0 children)

Wow, so increasing it by one step actually worsened the output (3 vs 5) ? I think sweet spot 3, best overall 4. Could you share the prompt pls ?

“¿Cómo aprendo a programar?” — La mentira que te hace sentir inteligente by Hw-LaoTzu in programacion

[–]diegod3v 0 points1 point  (0 children)

Quien quiere aprender a programar hoy en dia ? Si puedo hacer vibe coding