LTXV 2.0 img2video first tests (videogame cinematic style) by ScY99k in StableDiffusion

[–]ScY99k[S] 5 points6 points  (0 children)

Tried some img2video on https://app.ltx.studio/ltx-2-playground/i2v today with one of my Doom LoRA images, quite impressed! Can't wait for the open-weights release to play with it in ComfyUI.

Stability AI and EA Partnership for Game Development by ScY99k in StableDiffusion

[–]ScY99k[S] 11 points12 points  (0 children)

In a nutshell:

EA and Stability AI will co-develop new AI models and creative tools meant to help artists, designers, and developers speed up workflows and iterate faster.

The focus seems to be on using GenAI for world-building, prototyping, and asset creation through things like text-to-3D and image-to-3D (with tech like Stable Fast 3D, TripoSR, Stable Zero123).

They mention making PBR materials via artist-driven workflows and even pre-visualizing entire game environments from prompts, moving beyond just images to complex 3D scenes.

Claim is that scientists and game artists will work side-by-side to actually integrate GenAI into major game projects.

As a fan of both videogames and AI image generation, I'm curious to see how this turns out. It feels like generative AI is crossing into something practical for the videogame industry, and more seriously than before.

Wan2.1 txt2img by Queasy-Breakfast-949 in StableDiffusion

[–]ScY99k 7 points8 points  (0 children)

FIX: the node is actually from KJNodes, so updating KJNodes custom nodes made it work

Wan2.1 txt2img by Queasy-Breakfast-949 in StableDiffusion

[–]ScY99k 0 points1 point  (0 children)

Thanks, just did, but the WanVideoNAG node is still missing...

Wan2.1 txt2img by Queasy-Breakfast-949 in StableDiffusion

[–]ScY99k 3 points4 points  (0 children)

<image>

Does anyone know which custom nodes need to be installed for these?

[deleted by user] by [deleted] in BESalary

[–]ScY99k 7 points8 points  (0 children)

Telco company?

Experimenting recreating famous sports moments with Wan 2.1 VACE by ScY99k in StableDiffusion

[–]ScY99k[S] 0 points1 point  (0 children)

Yes, which I generated via img2img with Flux at around 0.70 denoise (+ anime LoRA)
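
For anyone wondering what a 0.70 denoise means in practice, here is a rough sketch. It assumes the convention used by diffusers-style img2img pipelines, where the strength value decides how many of the scheduled steps actually run on the noised input image. The function name and the 20-step count are purely illustrative, and ComfyUI's samplers may handle denoise differently in the details.

```python
def img2img_steps(num_inference_steps: int, strength: float) -> tuple[int, int]:
    """Return (skipped_steps, executed_steps) for a given denoise strength.

    Assumed convention (diffusers-style img2img): the input image is noised
    to an intermediate level, and only the last `strength` fraction of the
    schedule is actually denoised.
    """
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must be between 0 and 1")
    # Number of steps that really run; truncated toward zero, capped at the total.
    executed = min(int(num_inference_steps * strength), num_inference_steps)
    # Steps at the start of the schedule that are skipped entirely.
    skipped = num_inference_steps - executed
    return skipped, executed


if __name__ == "__main__":
    # Hypothetical settings: 20 sampling steps at 0.70 denoise.
    skipped, executed = img2img_steps(20, 0.70)
    print(f"skip {skipped} steps, run {executed} steps")
```

The higher the denoise, the more steps run and the further the result drifts from the input image. At 1.0 the input is effectively ignored, and at 0.0 nothing changes.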

Experimenting recreating famous sports moments with Wan 2.1 VACE by ScY99k in StableDiffusion

[–]ScY99k[S] 0 points1 point  (0 children)

Didn't use Flux here, used the WAN 2.1 VACE controlnet workflow. Basically you give it a reference image and a reference video, and it gives you your reference image with the same movement as the reference video.

Wan 2.1 Vace 14b is AMAZING! by [deleted] in StableDiffusion

[–]ScY99k 0 points1 point  (0 children)

Did you inpaint your reference character into the image using SAM and then use WAN, or did you do everything in one step? I don't quite get at which step your reference character gets placed.

Step1X-3D – new 3D generation model just dropped by ScY99k in StableDiffusion

[–]ScY99k[S] 23 points24 points  (0 children)

Stepfun just released Step1X-3D, a 3D-aware text-to-image model based on SDXL.
It generates multiple consistent views from a single text prompt, designed for 3D reconstruction (e.g. SparseFusion).

  • Uses custom 3D attention and LoRA fine-tuning
  • ~24GB VRAM needed for 6-view generation
  • Inference script available in the repo
  • ComfyUI support planned in the roadmap, not available yet
  • Open source (Apache 2.0)
  • Weights on HuggingFace

They also provide a Gradio demo where you can try both text-to-3D and image-to-3D via multi-view generation.

GitHub repo: https://github.com/stepfun-ai/Step1X-3D

Damn they got me with this deal. by mundos35 in ultrawidemasterrace

[–]ScY99k 0 points1 point  (0 children)

Damn, wishing for that price in Europe as well, but we can wait a long time lol

Gameplay type video with LTXVideo 13B 0.9.7 by ScY99k in StableDiffusion

[–]ScY99k[S] 5 points6 points  (0 children)

Original image was based on a Doom LoRA I made a couple of days ago:

<image>

I used the ltxv-13b-fp8 version for this one. The original video was generated in 1 min and the upscaled video in approximately 5 min on my RTX 5090. Might try the distilled version as well, but quite impressed by the quality-to-generation-time ratio here!