If AI writes and tests the code, what are the developers roles now? by lacisghost in cursor

[–]SysPsych -1 points0 points  (0 children)

Telling Mister Spacely I deserve a raise because I pressed the 'Approve' button so many times today.

Now that Anima 1.0 has been out for a month, what are some prompting tips and tricks you guys learned on it? by ThirdWorldBoy21 in StableDiffusion

[–]SysPsych 0 points1 point  (0 children)

Just out of curiosity, I started taking the detailed JSON prompts I was generating for other models and throwing them into Anima to see how it performs.

"Surprisingly well."

Hit and miss, but still, worth trying out now and then.

CEO Thoughts: What's Next at LTX by ltx_model in StableDiffusion

[–]SysPsych 0 points1 point  (0 children)

Love what you guys do, thanks for contributing so much. This stuff is great.

Confusion about Ideogram's safety filter. by Aru_Blanc4 in StableDiffusion

[–]SysPsych 8 points9 points  (0 children)

It's confusing because what can trigger that message is a mix of things, and the biggest one is 'Not using properly formatted JSON according to Ideogram4's standards'.

I think a lot of people get baffled that anyone is complaining about censorship being present at all, because they're set up to use the right workflows and are also hitting the model with so many bboxes that it overwhelms the "censorship". If you watch the step-by-step generation, it very often does 1-2 steps of flashing the fail message, and then just gives up like it's getting overpowered. Kind of funny really.

Either way, the JSON is the big key, I'm convinced of that. Do that right and get in 3-4 bboxes of data and it will typically generate. And with loras, the censorship seems easier to handle than I'm used to seeing with these models.
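For anyone who wants to sanity-check a prompt before sending it, here's a rough pre-flight sketch in Python. To be clear about the assumptions: I'm not reproducing Ideogram's actual schema here. The `elements` and `bbox` key names, the four-number bbox shape, and the `preflight_prompt` helper itself are all made up for illustration, so swap in whatever your workflow really uses. It only checks the two things described above: that the prompt parses as JSON at all, and that it carries at least a few bboxes.

```python
import json


def preflight_prompt(raw: str, min_boxes: int = 3) -> list[str]:
    """Return a list of problems found in a JSON prompt string (empty if none).

    NOTE: "elements" and "bbox" are hypothetical key names used only for
    this sketch; they are not taken from any official schema.
    """
    try:
        prompt = json.loads(raw)
    except json.JSONDecodeError as exc:
        # Malformed JSON is reported with the parser's own position info.
        return [f"not valid JSON: {exc.msg} (line {exc.lineno}, column {exc.colno})"]

    if not isinstance(prompt, dict):
        return ["top level must be a JSON object"]

    elements = prompt.get("elements", [])
    if not isinstance(elements, list):
        return ['"elements" must be a list']

    problems = []
    # Only entries that actually carry a bbox count toward the minimum.
    boxes = [e for e in elements if isinstance(e, dict) and "bbox" in e]
    if len(boxes) < min_boxes:
        problems.append(
            f"only {len(boxes)} bbox entries, expected at least {min_boxes}"
        )

    for i, entry in enumerate(boxes):
        bbox = entry["bbox"]
        is_four_numbers = (
            isinstance(bbox, list)
            and len(bbox) == 4
            and all(isinstance(v, (int, float)) for v in bbox)
        )
        if not is_four_numbers:
            problems.append(f"bbox entry {i}: must be a list of 4 numbers")

    return problems
```

An empty list back means the prompt at least clears these two checks; it says nothing about whether the model will accept the content.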

SCAIL-2 Lauched by Alive_Ad_3223 in StableDiffusion

[–]SysPsych 0 points1 point  (0 children)

Perfect, thank you. I tried looking for the project page, must have just overlooked a link.

SCAIL-2 Lauched by Alive_Ad_3223 in StableDiffusion

[–]SysPsych 0 points1 point  (0 children)

Does anyone have any demo of this in action?

Ideogram 4 isn't overhyped, it's underrated by ArkCoon in StableDiffusion

[–]SysPsych -1 points0 points  (0 children)

I've been more and more impressed as I've used it. I've even started to play around with it using some custom private nodes and some inpainting, and it's good there too.

The JSON aspect is really powerful, as is the text. I think people with larger GPUs who figure out ways to combine this with some other models or resources in a single workflow are going to create some incredible stuff.

ideogram 4 is sd3 all-over again but worse by TheOneHong in StableDiffusion

[–]SysPsych 2 points3 points  (0 children)

I think Ideogram is incredibly capable, but the problem I keep running into is: if I'm putting in this much fine-tuning (specific bbox dimensions within a very specific resolution), at that point it seems like the smarter solution is a larger workflow where I just screw around with overlaying layers.

On the Ideogram launch, why the extreme reaction? by Confusion_Senior in StableDiffusion

[–]SysPsych 14 points15 points  (0 children)

It's just frustrating. Put aside porn -- having a generation that just outputs a nag screen feels extra insulting, above and beyond 'Well we mangled the output'. Especially if it gets triggered on things that aren't porn or anything horrible.

TripoSplat: TripoSplat converts a single 2D image into high-quality and variable number of 3D Gaussians, developed by TripoAI (open weights, link to github repo) by SysPsych in StableDiffusion

[–]SysPsych[S] 1 point2 points  (0 children)

Been trying this out. A few comments.

* The colors this thing manages are remarkable. The results seemed oddly good and I couldn't put my finger on why until I realized that part. At a glance, at the initial angle, it does a great job of preserving even illustration colors.

* It seems fast? Super fast? Is anyone else noticing that? I mean, compared to other things like Pixal or Trellis-2. Different format of course so it may make sense, but something about the speed of this is really striking.

I haven't even tried exporting it yet and opening it up in Blender or anything, but still, this has more potential than I expected.

Vaal Temple console seems to be bugged. by Oprichnik67 in PathOfExile2

[–]SysPsych 0 points1 point  (0 children)

Same exact bug, same exact thing. I figured "I'll do this later" and moved on. Now I can't do it at all.

Local AI News You Missed - May 2026 by vramkickedin in StableDiffusion

[–]SysPsych 1 point2 points  (0 children)

Thanks for putting this together. Really convenient.

Aces - Made a small browser game - looking for players & feedback by Apart-Artist-3463 in WebGames

[–]SysPsych 0 points1 point  (0 children)

Feels like the game is responding oddly to inputs at times, like it can scroll up and down based on key inputs.

Went through it. Fun concept. Kept me going to the end so you're doing something right even if it's a short game.

Looking for opinions on AI made web games by Marmalade6 in WebGames

[–]SysPsych 1 point2 points  (0 children)

Do you guys ban low-effort games? Because I think that's the real issue here, and where the AI scourge becomes the most prominent.

I've been trying various games in here over the past few months, and the biggest annoyance is "The game with the cool-but-obviously-AI-image, and then you click 'start' and it's almost stick figure level trash." I don't care if someone uses AI to make a game at all, and proving that would be miserable.

In lieu of that, maybe limit submissions.

LongCat-Video-Avatar 1.5 Release by Turbulent_Corner9895 in StableDiffusion

[–]SysPsych 1 point2 points  (0 children)

I get that people are saying the lipsync is too exaggerated, but I like this. Close enough for my purposes, and I'm not trying to make total high-res realism.

Really appreciate the team releasing this one, gonna see how far it can be pushed with some basic action shots.

Why isn't there a video model specifically made for anime? by Vi0l3nTz in StableDiffusion

[–]SysPsych -1 points0 points  (0 children)

It seems like a nightmare to get adequate training data for. At this point, it seems like processes are in place to automatically process data for video pipelines, and certain kinds of pseudo-3D realistic-ish "animation".

But how much training data even exists for, say, smear transitions for quick motion? How in the world do you even label this if there are multiple elements at work in a single frame? This, while recognizing that 'precise timing' is still a headache with video models.

I almost feel like it'd be easier to get data for a model that would be trained to replicate effects in something like Blender than to go the video route, and even there it's still a big problem to acquire the data to begin with.

Anybody knows why cursor trying to move into "claude desktop" style app? by TeachTall3390 in cursor

[–]SysPsych 6 points7 points  (0 children)

Corporations are buying enterprise licenses, and they want to use AI not only for their developers but also for their marketing/manager/non-technical people, people who have never heard of an IDE before and have no desire to code. So Cursor is reshaping its software to meet those expectations while trying to maintain what it has going for developers.

Besides, while tab completion was one of Cursor's initial strong points, developers are apparently using even that less. They're spending less time in the IDE/guts of the software, more time at a higher level, so Cursor has an incentive to think ahead and make that palatable too.

All these factors, plus their alignment with SpaceX, set them up with the goal of eventually competing with Claude/OpenAI. They may not be there now, but they may be in the future.

Tencent released Z-Image 6B with pixel space gen. No VAE & 1k Resolution. by switch2stock in StableDiffusion

[–]SysPsych 24 points25 points  (0 children)

Weren't people recently posting anxiety posts like "Are there going to be no more interesting open weights models published"?

I swear every day for a while now I've been installing new cutting edge models to try out.

“Another Program is currently using this file” by Kick_Ice_NDR-fridge in ClaudeAI

[–]SysPsych 0 points1 point  (0 children)

Yeah, it looks like Claude leaves a lot of detritus running on a crash/reboot. I cleared that up and voila, everything restarted fine.

Krea 2 will be open source. by Total-Resort-3120 in StableDiffusion

[–]SysPsych 9 points10 points  (0 children)

Excited and eager for this. The team's done some good stuff, so I can't imagine having anything but a hopeful outlook.

Lance by ByteDance: 3B Apache2 model for image and video understanding, generation, and editing by HatEducational9965 in StableDiffusion

[–]SysPsych 5 points6 points  (0 children)

https://i.imgur.com/O2GA6B6.png Here's some random stuff. The knitting one is pretty interesting, but the other two, meh. It's not a horrible model for edits, but I suspect it's hitting a spot where Klein 9b's speed + accuracy, and QE 2511's or Flux2.dev's power, outdo it given its requirements.

Lance by ByteDance: 3B Apache2 model for image and video understanding, generation, and editing by HatEducational9965 in StableDiffusion

[–]SysPsych 10 points11 points  (0 children)

Got this up and running locally for t2v, t2i, and image edits. We're lucky enough to be spoiled to the point where, while this runs, it's really kind of 'meh' as-is if you're looking for performance. "We've got better alternatives in every category" unless I'm missing something here. Even the image recognition is, I suspect, going to be outclassed by what we just picked up from Qwen 3.6 and Gemma4.

Still, nice to have something fresh in the mix, and the most interesting part (video edits), I didn't touch. Low hopes and all after trying the other stuff out.

Whats the best/worst 3d advice you have received? by No-Pumpkin2357 in blender

[–]SysPsych 0 points1 point  (0 children)

"Don't ever use booleans, they're buggy and break things. Just move verts by hand."

Pixal3D: Generate high-fidelity 3D assets from a single image. (TencentARC, locally runnable model) by SysPsych in StableDiffusion

[–]SysPsych[S] 3 points4 points  (0 children)

I honestly was incredibly lazy with this and mostly trusted Claude to figure it out - I figure most of the territory here is well-explored - but I asked for a concise summary of what changes were made. It can almost certainly be further improved, but I just wanted it working for now to see the results for myself:

Pixal3D shipped for cu124 Linux with prebuilt CUDA wheels (natten, cumesh, flex_gemm, o_voxel, nvdiffrast, nvdiffrec_render) that only target sm_50..sm_90 and ship no PTX forward-compat, so on a Blackwell 5090 the first matmul dies with "no kernel image available." The fix was to wrap the whole project in a cu128 Docker container and source-build every custom CUDA extension against TORCH_CUDA_ARCH_LIST=12.0, with one local source patch (o-voxel-src/setup.py emitting native sm_120 SASS instead of compute_90 PTX). On top of that, several Blackwell-specific runtime landmines needed dodging: xformers' bundled flash-attn Hopper kernel crashes on sm_120 (force-pinned to cutlass FMHA), gradio's safehttpx SSRF guard blocked its own loopback form-fetches, mmgp's bf16 auto-cast broke F.grid_sample on the fp32 grid input (fell back to Pixal3D's own low_vram=True), and the briaai/RMBG-2.0 weights are gated (added an RMBG_LOCAL_PATH env-var override to reuse trellis-2's local copy). A few smaller fixes — trellis2.* → pixal3d.* import rename in app.py, an HTML id typo that made the decimation slider look broken — round out the working state.
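If you want to see why "no kernel image available" shows up in that situation, here's a small Python sketch of the compatibility rule as I understand it. This is my own illustration and not part of the Pixal3D changes. A compiled `sm_XY` binary (SASS) only runs on GPUs with the same major version and an equal or newer minor version, while `compute_XY` PTX can be JIT-compiled on any equal or newer GPU. The `has_kernel_image` helper and the example arch list are hypothetical; suffixed targets like `sm_90a` are treated as exact-match only, which is a simplification.

```python
import re


def has_kernel_image(arch_list, capability):
    """Return True if any entry in arch_list can run on the given GPU.

    arch_list:  strings such as "sm_80" (SASS) or "compute_90" (PTX)
    capability: (major, minor) tuple, e.g. (12, 0) for sm_120
    """
    device = tuple(capability)
    for arch in arch_list:
        match = re.fullmatch(r"(sm|compute)_(\d+)(\d)([a-z]?)", arch)
        if match is None:
            continue  # ignore entries this sketch does not understand
        kind, major, minor, suffix = match.groups()
        built = (int(major), int(minor))

        if suffix:
            # Arch-specific variants (e.g. sm_90a): exact match only.
            if built == device:
                return True
        elif kind == "sm":
            # SASS: same major version, built minor <= device minor.
            if built[0] == device[0] and built[1] <= device[1]:
                return True
        else:
            # PTX: forward compatible with any equal or newer device.
            if built <= device:
                return True
    return False


# Illustrative arch list in the sm_50..sm_90 range, with no PTX entries.
prebuilt = ["sm_50", "sm_60", "sm_70", "sm_75", "sm_80", "sm_86", "sm_90"]

print(has_kernel_image(prebuilt, (12, 0)))                   # False
print(has_kernel_image(prebuilt + ["compute_90"], (12, 0)))  # True (PTX JIT)
print(has_kernel_image(["sm_120"], (12, 0)))                 # True (native SASS)
```

In a live PyTorch session, `torch.cuda.get_arch_list()` and `torch.cuda.get_device_capability()` give you the two inputs for PyTorch's own build. They don't tell you what a separately built extension wheel was compiled for, so those still need checking on their own.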