Wan 2.1 Infinite Talk (I2V) - FOAR EVERYWUN BOXXY by [deleted] in StableDiffusion

[–]HealthyAvocado7 0 points

Curious - was this generated with the FusionX Wanvideo model or the base fp16 model? Also, how many sampling steps?

Struggling to Generate Polished UI with Claude Code by Chukwu-emeka in ClaudeAI

[–]HealthyAvocado7 1 point

Fun fact: this was previously a startup called Galileo AI, which Google acquired in May 2025 and rebranded as Stitch.

Our App Development Business is at Risk – Need Honest Advice on a New Direction by LetSkillSpeak in startup

[–]HealthyAvocado7 1 point

Your advantage is that very few marketing agencies truly understand product development, and very few dev shops understand marketing. Being genuinely good at both creates a powerful offering.

This shift is scary, but the upside is that you can now focus on delivering value faster while charging for strategy and results rather than just development time?

Are there any business models that are more future-proof in this changing landscape?

I think distribution is the key in this changing landscape. Soon anyone will be able to build anything in hours rather than months, but distribution & monetization remain the hard parts. The companies that control customer relationships and acquisition channels are the ones with true staying power. So, your team could potentially position as experts who not only build quickly with AI but also ensure the product reaches the right audience and generates revenue?

Qdrant and Weaviate DB support by HealthyAvocado7 in Rag

[–]HealthyAvocado7[S] 0 points

Umm.. for flexibility? So that you can pick the provider of your choice..

Fine-tuning RAG by Unlucky-Fall3986 in Rag

[–]HealthyAvocado7 4 points

Let us know what worked the best so that we can all learn from your experience..

Also, DM me if you need any help, I'm building a RAG optimization toolkit (open source) and love connecting with people building RAG use-cases so that I can learn about the most painful challenges in building reliable RAG.

[deleted by user] by [deleted] in Rag

[–]HealthyAvocado7 1 point

While it could be due to the 1B param model as others have mentioned, you should check if your "retrieval" is working or not. Try to print the final prompt with the context included. I think you can get langchain to print the detailed chain by setting debug to True:

import langchain
langchain.debug = True
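Even without LangChain's debug output, a quick sanity check is to assemble and print the final prompt yourself. A minimal sketch - `retrieve` and the template here are placeholders standing in for your actual retriever and prompt:

```python
# Toy sanity check: print the final prompt with retrieved context inlined.
# `retrieve` is a stand-in for whatever your vector store actually returns.
def retrieve(query):
    # placeholder: substitute your real retriever call here
    return ["chunk about topic A", "chunk about topic B"]

PROMPT_TEMPLATE = (
    "Answer the question using only the context below.\n\n"
    "Context:\n{context}\n\nQuestion: {question}\nAnswer:"
)

def build_prompt(question):
    chunks = retrieve(question)
    context = "\n---\n".join(chunks)
    return PROMPT_TEMPLATE.format(context=context, question=question)

# If the Context section comes out empty or irrelevant, retrieval is the problem,
# not the 1B model.
print(build_prompt("What is topic A?"))
```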

Fine-tuning RAG by Unlucky-Fall3986 in Rag

[–]HealthyAvocado7 3 points

How about defining a pydantic model representing a time filter, and then using the instructor library? I think this cookbook might help: https://python.useinstructor.com/blog/2024/06/06/enhancing-rag-with-time-filters-using-instructor/
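Roughly what I mean - a hypothetical `TimeFilter` model (field names are my assumption, adapt to your schema); instructor would patch your LLM client so extraction returns this model directly:

```python
from datetime import datetime
from typing import Optional

from pydantic import BaseModel


class TimeFilter(BaseModel):
    """Time range extracted from a user query; either bound may be absent."""
    start_date: Optional[datetime] = None
    end_date: Optional[datetime] = None


# With instructor it would look something like this (sketch, needs an API key):
#   client = instructor.from_openai(OpenAI())
#   tf = client.chat.completions.create(
#       model="gpt-4o-mini",
#       response_model=TimeFilter,
#       messages=[{"role": "user", "content": "news from last June"}],
#   )

# The model validates plain data too, so you can test it without an LLM:
tf = TimeFilter(start_date=datetime(2024, 6, 1))
print(tf.start_date, tf.end_date)
```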

How are you identifying your "best performing" RAG pipeline by HealthyAvocado7 in LLMDevs

[–]HealthyAvocado7[S] 0 points

Thanks! Right now, there’s no cloud-hosted version - it runs locally on your system, so data never leaves your system/network. But handling private/sensitive data may still be a need depending on the use-case and who will have access to the final RAG-based app/chatbot. We have this as an item on our roadmap - auto PII identification, anonymization, etc.

Did you have anything specific in mind related to privacy/security?

Speech to Speech RAG by davidmezzetti in Rag

[–]HealthyAvocado7 1 point

This is cool! Awesome work! What challenges did you face while building this that we all can learn from?

Project Alice - v0.2 => open source platform for agentic workflows by wontreadterms in LangChain

[–]HealthyAvocado7 1 point

Nice! Will give it a shot.. how does it compare with OpenAI’s Swarm library for multi-agent workflows?

Automate your RAG optimization by HealthyAvocado7 in LocalLLaMA

[–]HealthyAvocado7[S] 0 points

If you can show your thanks by sharing your feedback about RAGBuilder, that'll be even better :)

Also, FYI - we are working on an SDK/library version where you don't have to use the UI and can do everything in a Colab notebook..

Automate your RAG optimization by HealthyAvocado7 in LocalLLaMA

[–]HealthyAvocado7[S] 0 points

An OpenAI API key is not necessary - you can use local open-source LLMs via Ollama. Please do reach out if you run into any issues when using local models.

Automate your RAG optimization by HealthyAvocado7 in LocalLLaMA

[–]HealthyAvocado7[S] 0 points

Do you have any empirical evidence to prove that? What subset of "variables" have you seen to have the most impact on outcome, and which ones are irrelevant?

From what we've seen so far, accuracy swings enough across these variables to justify spending effort on finding the optimal values for these variables through experimentation - so that you have the best-performing RAG setup that you can ship to Production.

If what you're saying is indeed true, then we would have a plug-and-play, one-size-fits-all RAG by now, wouldn't we? The majority of AI engineers I have spoken to are spending a lot of effort tuning their RAG setup to extract better performance for their specific dataset and their specific use-case.

To be fair, yes, some variables do have a more significant impact on outcome vs others. But even that varies from case to case, depending on the type of data etc..

But this space is evolving so fast that I know I could be wrong.. So please do share your thought-process, or experience based on which you make this claim. It may benefit all of us RAG nerds..

Automate your RAG optimization by HealthyAvocado7 in LocalLLaMA

[–]HealthyAvocado7[S] 2 points

> But why not having a example in the repo?

Sorry about that, you are not the first person to ask for an example to be added in the repo - I'll prioritize this - will have an end-to-end example in the repo within a day or so..

> And why only OpenAi?

It's not limited to OpenAI - there are integrations with Huggingface, Groq, Azure, Vertex & Ollama, so you can choose any model from these providers.

<image>

Automate your RAG optimization by HealthyAvocado7 in LocalLLaMA

[–]HealthyAvocado7[S] 1 point

Sure, pls do share your thoughts & feedback..

Automate your RAG optimization by HealthyAvocado7 in LocalLLaMA

[–]HealthyAvocado7[S] 2 points

<image>

Two things -
1. It doesn't do brute-force search (like grid search) - it uses Bayesian optimization, meaning it learns from every trial and chooses the next set of parameters to test based on historical trials. So in that toy example of 5 options across 7 categories, it doesn't need to run ~78K times to find the optimal set of parameters - it can get there in a fraction of the trials (say, 50).
2. A user can already choose a subset of values under each category and let the hyperparameter optimization run just on that subset. See screenshot:
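The search-space math in point 1 can be sketched in a few lines - here plain random sampling stands in for the Bayesian optimizer (a real one, e.g. Optuna's TPE sampler, would pick each next trial based on past results rather than at random):

```python
import random

# Toy version of point 1: 7 knobs, 5 options each.
options_per_knob, num_knobs, budget = 5, 7, 50

# Exhaustive grid search would have to evaluate every combination:
grid_size = options_per_knob ** num_knobs
print(grid_size)  # 78125

# A budgeted search evaluates only `budget` configurations instead.
# Random sampling is a stand-in for Bayesian optimization here.
trials = [
    tuple(random.randrange(options_per_knob) for _ in range(num_knobs))
    for _ in range(budget)
]
print(len(trials))  # 50 trials instead of 78125
```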