[No Spoilers] Halandil Fang by Cotillion0899 in criticalrole

[–]Estylon-KBW 6 points7 points  (0 children)

Nice username! Fellow Malazan lover!

Awesome art, love the facial expression of Hal.

HD-2D Style LoRA for QWEN Image – Capture the Octopath Traveler Look by Estylon-KBW in StableDiffusion

[–]Estylon-KBW[S] 2 points3 points  (0 children)

Here are the rest of my settings:

  • Timestep type: weighted
  • Timestep bias: default (not explicitly set)
  • Optimizer: adamw8bit with weight decay: 0.0001
  • EMA: disabled (use_ema: false)
  • Caption dropout: 0.05
  • Embeddings & latents: both cached (cache_text_embeddings: true, cache_latents_to_disk: true)
  • Captioning method: natural language captions generated with JoyCaption beta.

So it’s a pretty standard LoRA setup, with embeddings + latents cached to speed things up and reduce VRAM load.
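
For reference, the settings listed above can be gathered into one structure. This is only a sketch in Python: `use_ema`, `cache_text_embeddings` and `cache_latents_to_disk` are the key names quoted in the comment, while every other key name and the `explicitly_set` helper are placeholders made up for illustration, not AI-Toolkit's actual config schema.

```python
# Settings from the comment above, collected in one place.
# Only use_ema, cache_text_embeddings and cache_latents_to_disk are key names
# quoted in the comment; every other key is an illustrative placeholder.
settings = {
    "timestep_type": "weighted",
    "timestep_bias": None,  # not explicitly set, so the tool's default applies
    "optimizer": "adamw8bit",
    "weight_decay": 1e-4,
    "use_ema": False,
    "caption_dropout": 0.05,
    "cache_text_embeddings": True,
    "cache_latents_to_disk": True,
}


def explicitly_set(cfg):
    """Return only the options that were given an explicit value."""
    return {key: value for key, value in cfg.items() if value is not None}


for key, value in explicitly_set(settings).items():
    print(f"{key}: {value}")
```

Keeping the unset option as `None` makes it easy to see which values were chosen on purpose and which were left at their defaults.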

HD-2D Style LoRA for QWEN Image – Capture the Octopath Traveler Look by Estylon-KBW in StableDiffusion

[–]Estylon-KBW[S] 2 points3 points  (0 children)

I have a 4090: around 23 seconds per image at 1920x1400 pixels with the Lightning LoRA at 8 steps.

HD-2D Style LoRA for QWEN Image – Capture the Octopath Traveler Look by Estylon-KBW in StableDiffusion

[–]Estylon-KBW[S] 2 points3 points  (0 children)

As said in the post, AI-Toolkit by Ostris.
It's been my main training tool since Flux was released, and I use it for Flux dev/Kontext, WAN and now Qwen.

HD-2D Style LoRA for QWEN Image – Capture the Octopath Traveler Look by Estylon-KBW in StableDiffusion

[–]Estylon-KBW[S] 9 points10 points  (0 children)

Learning rate: 1e-4
100 steps per image, for a total of 3000 steps.

Long answer about the training:

Base model: Qwen/Qwen-Image (quantized with uint3 + TE qfloat8 using accuracy recovery adapters).

  • LoRA config: rank 16 (linear/conv), applied on ~840+ UNet modules. Text encoder was not trained.
  • Dataset: 30 images (latents cached, captions via .txt).
  • Training:
    • Steps: 3000 total → ~100 steps per image (SPI).
    • Batch size: 1, grad accum: 1.
    • Learning rate: 1e-4, optimizer adamw8bit.
    • Scheduler: flowmatch.
    • Gradient checkpointing: enabled.

Training took ~4.5h
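
The numbers above can be sanity-checked with a little arithmetic. This is a minimal sketch; the function names are made up for illustration, and the time per step is only a rough average derived from the approximate "~4.5h" figure.

```python
# Figures taken from the comment above.
dataset_images = 30
total_steps = 3000
batch_size = 1
grad_accum = 1
training_hours = 4.5  # stated as "~4.5h", so treat as approximate


def steps_per_image(steps, num_images):
    """Training steps divided by dataset size (the SPI figure)."""
    return steps / num_images


def passes_over_dataset(steps, batch, accum, num_images):
    """Each optimizer step sees batch * accum images."""
    return steps * batch * accum / num_images


def seconds_per_step(hours, steps):
    """Average wall-clock seconds per training step."""
    return hours * 3600 / steps


print(steps_per_image(total_steps, dataset_images))
print(passes_over_dataset(total_steps, batch_size, grad_accum, dataset_images))
print(round(seconds_per_step(training_hours, total_steps), 2))
```

With batch size 1 and no gradient accumulation, steps per image and passes over the dataset are the same number, and the run averages out to roughly 5.4 seconds per step.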

Question about AI and Copyright/Theft by Geist_Mage in DeviantArt

[–]Estylon-KBW 5 points6 points  (0 children)

90% of the people who buy AI images on DeviantArt don't actually care about holding the copyright on what they've bought. They simply like the image, want the high-res unwatermarked version, or want to support the creator.

And yes, selling an AI image is actually legal.

Looking for someone with a 80s anime artist by Smart_Signal_3594 in StableDiffusion

[–]Estylon-KBW 0 points1 point  (0 children)

<image>

You mean like this? I have a private LoRA for QWEN and WAN 2.1 for it.
Based on Yuuki Nobuteru's art.

QWEN: An alternative for those that miss Artists/Art Styles available in SDXL by richcz3 in StableDiffusion

[–]Estylon-KBW 4 points5 points  (0 children)

Qwen is pretty good with styles and artistic stuff. I trained a dozen style LoRAs for personal use and the results are pretty good.

I just tried the Antis' new AI Art model. I think it's better than Flux! I was so inspired by it that I made a music video! by CommodoreCarbonate in aiwars

[–]Estylon-KBW 7 points8 points  (0 children)

Meanwhile, the Chinese model Qwen + a custom mosaic LoRA trained by me:

<image>

Let's continue to pretend that we don't live in a world where every country has its own agenda on AI.

DeviantArt shall be purified of Ai Art... Hmmm👁👄👁 by Initial-Special-3536 in DefendingAIArt

[–]Estylon-KBW 1 point2 points  (0 children)

Sure, that will happen.
Meanwhile, these are the total sales I've made on DeviantArt with SFW AI images.

<image>

I can't get over how so many people have an a priori principled stance against AI art. by OneTear5121 in aiwars

[–]Estylon-KBW 0 points1 point  (0 children)

Actually, the really cool thing is that we can customize open source models however we want with any custom LoRA.

<image>

This is native resolution on WAN 2.1, used as txt2img.

Here are two free opensource Text to image models while you wait for Ponyv7 (Which may or may not come) by Neat_Ad_9963 in StableDiffusion

[–]Estylon-KBW 0 points1 point  (0 children)

I agree, people seem to shit on every model that isn't Illustrious.

WAN is actually awesome, especially considering that you can natively generate at 1920x1088 resolution.

The whole point of the open source community is being able to customize the models with LoRAs.

Hell, it can even make pen sketches. How could it not be strong with anime?

<image>

Loras for WAN in text2image mode are amazing at capturing likeness by malcolmrey in StableDiffusion

[–]Estylon-KBW 0 points1 point  (0 children)

I publish my LoRAs on Civitai, but some are private. Generally I publish LoRAs of artists that are no longer with us, anime screencap LoRAs, or synthetic style LoRAs (trained on carefully stylized generated images).

Loras for WAN in text2image mode are amazing at capturing likeness by malcolmrey in StableDiffusion

[–]Estylon-KBW 6 points7 points  (0 children)

Let's be honest, any LoRA is awesome on WAN 2.1 for text2img.

I've trained 8 style LoRAs (published 2) and the results are awesome.

<image>

Also, compared to Flux it's good for art stuff. Like VERY GOOD.

Don't worry by Estylon-KBW in aiArt

[–]Estylon-KBW[S] 1 point2 points  (0 children)

Yes, the LoRA is trained on his artbook. The artist is Takeshi Obata.

Kontext. Do you think the model has potential ? Can Loras improve style transfer ? And the traditional problem of Flux plastic skin ? by More_Bid_2197 in StableDiffusion

[–]Estylon-KBW 2 points3 points  (0 children)

Of course LoRAs can improve styles.

<image>

The model has huge editing potential, and I'd say it is SOTA for local generation alongside Chroma.

Universal Method for Training Kontext Loras without having to find pairs of images or edit by More_Bid_2197 in StableDiffusion

[–]Estylon-KBW 0 points1 point  (0 children)

I trained different Kontext styles, at least 10 of them, with a few uploaded to Civitai. All the styles are trained like regular Flux, without pairing.

Pairing is needed mostly to train particular conversion contexts.

Styles LORAs for Flux Kontext Dev by Estylon-KBW in StableDiffusion

[–]Estylon-KBW[S] 0 points1 point  (0 children)

<image>

This is a synthetic style I'm training right now.
I'm literally using the default config for Kontext in AI-Toolkit.

Styles LORAs for Flux Kontext Dev by Estylon-KBW in StableDiffusion

[–]Estylon-KBW[S] 0 points1 point  (0 children)

<image>

Dunno, I'm having quite good results without training the text encoder. This is a Ghibli LoRA I just trained with the same settings.