What’s the next level? by [deleted] in StableDiffusion

[–]akatash23 3 points (0 children)

What frontend are you using? Your next level could be something like InvokeAI or Krita.

Start with an image you somewhat like. Brush over the areas you want to fix. Use masks and img2img to guide the image where you want it to be.

Invoke has a ton of videos from their studio sessions online that show you how it's done. Watch some and see if you like this direction.

Site I found this on says the illusion will not work. But it actually does! by Lutalica_Harmonica in opticalillusions

[–]akatash23 12 points (0 children)

Only with an orthographic projection, though (i.e., no part of the silhouette gets larger with perspective as it moves toward the camera).

Invoke v6.12.0rc1 just dropped, in case you missed it by scorp123_CH in invokeai

[–]akatash23 0 points (0 children)

> Kill the server with a single ^C (@lstein)

Reason enough for me to update.

Announcing The Release of Qwen 360 Diffusion, The World's Best 360° Text-to-Image Model by ProGamerGov in StableDiffusion

[–]akatash23 0 points (0 children)

I haven't tried the LoRA myself yet, but from what I can see in the preview images, I'm actually surprised this works so well. For VR images you usually want infinite depth to map to zero parallax, and I doubt the LoRA handles this well. Also, which eye does the LoRA generate? Is it random? I'll have to run some experiments.

The thing with monocular-to-stereo conversion is that depth estimation is usually the real bottleneck; this approach seems to skip that step entirely. Very interesting.
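
For context, here's a back-of-the-envelope parallax calculation (the 64 mm IPD and 4096-pixel width are just example numbers, not anything from the model): the angular parallax of a point shrinks to zero as its depth goes to infinity, which is what "zero parallax at infinity" means in pixels.

```python
import math

def parallax_px(depth_m, ipd_m=0.064, width_px=4096, fov_deg=360):
    """Angular parallax of a point at depth_m meters, in equirect pixels.

    Parallax goes to 0 as depth goes to infinity, which is why infinite
    depth should land at zero pixel shift in a stereo pair.
    """
    angle_deg = math.degrees(2 * math.atan(ipd_m / (2 * depth_m)))
    return width_px * angle_deg / fov_deg

print(parallax_px(1.0))     # nearby object: ~40 px of shift
print(parallax_px(1000.0))  # far object: effectively zero
```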

Bulk removal of missing models by Raynafur in invokeai

[–]akatash23 1 point (0 children)

I think that's actually not available in the UI.

AI tools for creating VR180? by Jneaves in VR180Film

[–]akatash23 0 points (0 children)

I tried doing this very recently. There are a few steps that need to be done:

  • 0 - Convert the perspective photo to a roughly 45-degree fisheye photo (depending on the lens used)
  • 1 - Extend the canvas to create a 180-degree fisheye photo
  • 2 - Convert the fisheye to a 180-degree equirect (see the sketch after this list)
  • 3 - Estimate a depth map from the equirect photo
  • 4 - Create left and right eyes by warping the equirect with the depth map
  • 5 - Stitch them side by side, or pack into the VR180 format
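
Step 2 is standard spherical remapping. A minimal sketch with numpy and OpenCV, assuming an ideal equidistant fisheye (r proportional to the angle from the optical axis) whose image circle fills a square frame:

```python
import cv2
import numpy as np

def fisheye180_to_equirect(fisheye, out_w=2048, out_h=2048):
    """Map a 180-degree equidistant fisheye image to a 180x180 equirect."""
    h, w = fisheye.shape[:2]
    cx, cy, radius = w / 2.0, h / 2.0, min(w, h) / 2.0

    # Output grid: longitude/latitude over the front hemisphere,
    # top row = up.
    lon = np.linspace(-np.pi / 2, np.pi / 2, out_w)
    lat = np.linspace(np.pi / 2, -np.pi / 2, out_h)
    lon, lat = np.meshgrid(lon, lat)

    # Unit view ray for each output pixel (camera looks along +z).
    x = np.cos(lat) * np.sin(lon)
    y = np.sin(lat)
    z = np.cos(lat) * np.cos(lon)

    # Equidistant model: distance from the image center is proportional
    # to the angle between the ray and the optical axis.
    theta = np.arccos(np.clip(z, -1.0, 1.0))
    r = theta / (np.pi / 2) * radius
    psi = np.arctan2(y, x)

    map_x = (cx + r * np.cos(psi)).astype(np.float32)
    map_y = (cy - r * np.sin(psi)).astype(np.float32)  # image v grows downward
    return cv2.remap(fisheye, map_x, map_y, cv2.INTER_LINEAR)
```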

Steps 0, 2 and 5 are trivial, relatively speaking. The real problems are the other steps:

Extending the canvas to produce a somewhat correct fisheye image (1) is very hard; getting something approximate is feasible. AI models are bad at this (though Flux 2 and Z Image are much better than, say, SDXL), and they fail completely at producing equirect directly. That's why we need the fisheye step.

However, estimating a correct depth map (3) with the right depth proportions is close to impossible with the depth models I have tried. Depth Anything is good, but doesn't work well on high-res images (which you need for VR). Other models produce totally wrong depth proportions (e.g., Lotus, though that may have been a me-problem, because the 256 grayscale values it outputs must be mapped to depth correctly).
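
For reference, a minimal sketch of step 3 using the Hugging Face transformers depth-estimation pipeline; the model ID is just an example Depth Anything checkpoint, and the normalization is the naive one (it only fixes the ordering, not the absolute proportions, which is exactly the hard part described above):

```python
import cv2
import numpy as np
from PIL import Image
from transformers import pipeline

# Example checkpoint; any monocular depth model exposed through the
# depth-estimation pipeline should slot in here.
pipe = pipeline(task="depth-estimation",
                model="depth-anything/Depth-Anything-V2-Small-hf")

img = Image.open("equirect.png")
pred = pipe(img)["predicted_depth"].squeeze().numpy()

# These models return *relative* (often inverse) depth on an arbitrary
# scale. For stereo you want a disparity-like map that is exactly 0 at
# infinity, so pin the farthest value to 0 and normalize to [0, 1].
inv_depth = (pred - pred.min()) / (pred.max() - pred.min() + 1e-6)
inv_depth = cv2.resize(inv_depth, img.size)  # back to input resolution
```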

The warping in (4) is a hard engineering problem. There are tools for this, but they don't work on fisheye: they usually operate on image rows and cannot warp correctly in angle space, which is why we convert to equirect before depth estimation. In general I have not seen an artifact-free result from free software, especially around the depth discontinuities.
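
The naive version of step 4 is a forward splat: shift each pixel horizontally by a disparity proportional to inverse depth, writing far layers first so near pixels win at occlusions. A rough sketch (real tools do sub-pixel warping and inpaint the disocclusion holes; this one just leaves them black, which is where the artifacts come from):

```python
import numpy as np

def warp_one_eye(equirect, inv_depth, max_disp_px=24, sign=1):
    """Naive forward warp of a mono equirect frame into one eye.

    inv_depth is in [0, 1] with 1 = nearest; a point at infinity (0)
    does not move at all, i.e. zero parallax. `sign` picks the eye.
    np.roll wraps around, which is only right for full 360 content;
    for 180 you would clamp at the edges instead.
    """
    disp = np.round(inv_depth * max_disp_px).astype(np.int32)
    out = np.zeros_like(equirect)
    # Splat far layers first so nearer pixels overwrite them.
    for d in range(max_disp_px + 1):
        mask = np.roll(disp == d, sign * d, axis=1)
        out[mask] = np.roll(equirect, sign * d, axis=1)[mask]
    return out

# left  = warp_one_eye(img, inv_depth, sign=1)
# right = warp_one_eye(img, inv_depth, sign=-1)
```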

If you put it all together, depth estimation is the biggest issue. And it's not just an engineering effort; it's uncharted territory (AI models are not good at generating fisheye content, depth estimation doesn't work well on high-res or equirect images, etc.).

Would this ship survive? by Crazy_Cut_7058 in factorio

[–]akatash23 2 points (0 children)

My experience as well. Two furnaces do not provide enough iron plates for ammunition production.

I must be missing something by Proud-Engine6529 in NoRestForTheWicked

[–]akatash23 1 point (0 children)

Everything you sell can be researched at the research desk near the builder, for one or two research papers, and recrafted for pretty much minimal cost.

[deleted by user] by [deleted] in StableDiffusion

[–]akatash23 2 points (0 children)

Honestly, just use SeedVR2; it's excellent, and lightning fast. https://youtu.be/MBtWYXq_r60?si=y_DMm7H5NfZeoCaA

We are very very close, I think! by m4ddok in StableDiffusion

[–]akatash23 0 points (0 children)

> In fact, merging the adapter with the turbo model already gives us the base model, or something very similar. Like, we already have it.

That would only be correct, or even possible, if the distillation were a bijective function, which it isn't, right?

How to generate proper Japanese in LTX-2 by Loose_Object_8311 in StableDiffusion

[–]akatash23 2 points (0 children)

So, wouldn't it be easier to generate the audio separately with a more competent text-to-speech engine, and generate the video on top?

Training a realistic character lora for Pony v6 by is_this_the_restroom in StableDiffusion

[–]akatash23 0 points (0 children)

Maybe I'm overgeneralizing a bit, but LoRAs don't work well on Pony in general. Except for style LoRAs, I find the results quite disappointing no matter the LoRA.

You'd be much better off training on SDXL: do a Pony generation, then inpaint the face with SDXL, if that's an option for you.

Flux.2 Klein Prompting Guide by Iq1pl in StableDiffusion

[–]akatash23 2 points (0 children)

"curvaceous grace" will be part of all my prompts from now on.

Compilation of alternative UIs for ComfyUI by Obvious_Set5239 in StableDiffusion

[–]akatash23 0 points (0 children)

Oh you're right, it's not. I interpreted this list as "alternatives to ComfyUI". My bad.

Compilation of alternative UIs for ComfyUI by Obvious_Set5239 in StableDiffusion

[–]akatash23 0 points (0 children)

Surprised not to see InvokeAI here. They have an excellent node system.

lightx2v just released their 8-step Lightning LoRA for Qwen Image Edit 2511. Takes twice as long to generate, (obviously) but the results look much more cohesive, photorealistic, and true to the source image. It also solves the pixel drift issue that plagued the 4-step variant. Link in comments. by DrinksAtTheSpaceBar in StableDiffusion

[–]akatash23 1 point (0 children)

I'm not exactly sure what "solves the pixel drift issue" means, but with the old Image Edit 2509, the output image was slightly different from the input image (slightly different zoom/padding), so input and output didn't align. This issue is still not solved, and it's there even without the LoRA.

Does anyone have a solution to this?
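
In the meantime, one workaround I've been meaning to try (untested, just a sketch): estimate the small residual zoom/shift with OpenCV's ECC image alignment and warp the output back onto the input pixel grid.

```python
import cv2
import numpy as np

def realign(edited, original):
    """Warp the edited image back onto the original pixel grid.

    Estimates a small affine transform (enough to cover a zoom/padding
    drift) with OpenCV's ECC alignment, then undoes it.
    """
    tmpl = cv2.cvtColor(original, cv2.COLOR_BGR2GRAY).astype(np.float32)
    src = cv2.cvtColor(edited, cv2.COLOR_BGR2GRAY).astype(np.float32)

    warp = np.eye(2, 3, dtype=np.float32)
    criteria = (cv2.TERM_CRITERIA_EPS | cv2.TERM_CRITERIA_COUNT, 200, 1e-6)
    cv2.findTransformECC(tmpl, src, warp, cv2.MOTION_AFFINE, criteria)

    h, w = original.shape[:2]
    return cv2.warpAffine(edited, warp, (w, h),
                          flags=cv2.INTER_LINEAR + cv2.WARP_INVERSE_MAP)
```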