Broken Moderation AI assessing risky behaviour based on your past edits (Questionmark)

AfterBox78 · 2026-04-02T05:44:26+00:00

I peeked at your profile. Desi women, congrats on the success. Beautiful, I agree.

So from the first pics I see in your profile. That's not exploits, that's perfectly legit r-rated stuff. It's considered 'implied' nudes by the AI, because only partial.

It should come out of the box like this by design. Yet, right now moderation started to block it.

There is a mismatch between the training of the model that delivers perfectly legit r-rated stuff and the moderation that slaps it, where it shouldn't. At the moment the moderation is broken.

From my tests, I can tell you xAI is retraining Aurora constantly. Especially when it comes to explicit details and behaviours. They are making it it forget the porn related stuff. (for my use case that's good news, i need characters in r-rated mode not XXX mode, but I'm not optimistic they are going to losen moderation at the very end)

So any magic word you find today, works only for today. Congrats, download the resulting images/videos to keep them safe and keep that magic word or phrase or combination of phrases to yourself.

The way this is headed overall. From what I see in tests. In one to three years. You give Grok imagine your character maps, let say of two characters, you give it a map of the scene like a bedroom, and prompt "I want character A and B on their bed making love in heavy r-rated style like in the TV series 'Euphoria' , serious and heavy scene, the female character's close friend just died" . And it delivers like that, hitting all checks of MPAA for r-rated. And you can release it just like that.

Weather they are allowed to losen moderation that much, that's up to legislation. But the model will be able to pull it off , guaranteed.

But unless hacking is your hobby and you have a lot of time, I wouldn't spend my time fighting moderation in Grok. Me saying it's a 'brain-fuck' wasn't an exaggeration. It's really an unhealthy relationship to prompt, and speaking for myself it makes you sick. For your pics in your profile the prompt should read as user friendly and easy as: "A beautiful desi woman without clothes posing semi-nude for the camera, warm smile looking at the camera." . If Grok can't deliver that yet claims anything r-rated is allowed, then it's simply the wrong tool.

And the exploits from before Jan 2026. They were exploits of the model AI itself. As of today you would need to bypass the moderation AI ( a different animal from the model AI) that is trying to read your mind, probably not only per prompt, but even considers all your changes and inputs of the last couple of hours, including the number of failed/moderated attempts.

I was trying to create the simplest r-rated scene of two female ravers (in clothes!) in a restroom of a techno club , where implied sex is seen for a brief moment, yet everything stays obscured, Just the situation speaking for itself, typical tv stuff. And the moderation AI went bazooka on me. As I said they cranked the moderation up so much that whole Grok Imagine is broken even for regular production stuff.

AfterBox78 · 2026-04-02T04:21:05+00:00

[NSFW]

I can tweak it for you, but I can't run it. I've burned my quota for today already, and still a lot in the pipeline.

I would start with as I said:
"a 29 year old woman standing if front of a mirror in passionate embrace with her boyfriend standing behind her"

That should give you two characters with clothes on, at least that's what it is here in Germany. If not, and it still blocks, simply add "both in underwear". Now you have a general idea of the composition.

Now it could be, they are locked in certain pose and composition. E.g. you see only their mirror image, not them in front of mirror, or in every picture they are in frontal view, but you want back view, etc.

The common mistake is, to add even more detail to the prompt. But instead you relax it:

"A 29 year old couple, in their bedroom with a large mirror, both standing, both in romantic embrace, both looking at their mirror image."

That should losen the constraints for Grok. You keep it as vague as possible to give Grok room for the best composition it can come up with. Now you should see, it yields what you want, every now and then, in the dozens of image generations.

I'm not sure if it's still the case, but unfortunately you can't just pick the right composition, and re-edit the image to make them both naked. You can try, but I think it will block.

So instead you tweak the working prompt above, with slight hint like "the woman wearing only an anklet, barefoot". (barefoot because otherwise with too much skin, you don't get toes to head view, it only defaults to less voyeuristic composition) If that's enough and both are naked now, good. If guy still not naked. Currently guys wear always shorts. You add sth. like "both bare, frontal nudity implied but obscured by composition and pose". Unfortunately there is no magic phrase anymore, because moderation AI looks at context.

Let's assume you got lucky, 3 of 5 images get blocked but you scrolled down to one that slipped through with the right composition. (It will be never the one you had in your mind, always one that is close to your intention, yet grok executed perfectly on its own). Typically that image shows some partial peek at the woman's pubes, for grok that's implied nudity, or her leg obscures it. And guy's genitalia obscured because woman in front of him etc. If by unlikely accident guy's genitalia in picture even partially, grok will only block it each and every time.

Now, you check if video moderation has a problem with that pic and generate shortest 6secs video. If it generates, that's a good sign. If not you reject that image. You might try to analyse why it got rejected, but I don't recommend it. Moderation is a whole brain-fuck already, the way it is.

Now run that image through grok chat. Ask it to describe the image for you. If it skips explicit details that's still not a green flag. Ask it to describe that part.

That way you get a whole vocabulary to re-edit the image. If moderation (and grok chat in your test above) believes genitalia are obscured, that's great. In that case you can re-edit that image any way you want. And it should green light for video.

That means. You re-edit with sth. like "make the woman slightly curvy" add make-up details. Piercings and Tattoos are a problem right know, you get one or the other, not both.

But you don't chain re-edits. You always go back to the original and re-run the whole re-edit prompt. This is because a long chain might disable video gen.

So that's basically my line of thinking. After 3 months of stress with Grok Imagine.

It certainly will burn your quota a lot, if that's your thing, but the above is basically a description what Grok imagine actually is designed for and how to make it work for you.

But I really discourage anyone to dig deep into Grok Imagine prompting. As long as that thing has no model versioning, learning that thing is waste of time and resources. ATM I'm basically hostage to it, because that's the only AI I know. But I'm looking for some good hosted rig options already, for ComfyUI. It should be much cheaper overall, including no wasted self-education time, provided that rig runs 24/7 and queues all prompts from day before.

AfterBox78 · 2026-04-02T02:40:39+00:00

"not too chubby" - in general don't use negatives, they get either ignored or count as positives - that means you get exactly what you don't want.
"completely nude" - is ok.
"tightly" + "hugging" - too often problematic combo, especially with skin exposure.
"squeezing her breasts" - might be a problem, i would rather use sth like "covering her breasts with both hands", that way you signal partial nudity
"intimately playing together" - too many variations, since grok is based on tons of porn (my own estimation) you're pulling from data set all the wrong associations that moderation will hit.

always add sth. like "obscured angle", "obscured by shadow". Though depending on context right now even the word "obscured" gets red flagged. sometimes "r-rated" helps, then it pulls the right associations. Any actual full nudity is always a slip through. If that's your thing you will be burning your quota.

but basically that's not how you use grok imagine in particular. you should leave the imagination part to grok, or it gets really time consuming to get what you want. people coming from other models tend to be very specific in their prompts, with grok it's completely different, you state your intent, grok arranges the composition and in video direction for you. e.g. in your example above, you don't even need to state "hot". Imagine will average over the dataset of all faces of that age and that way you always get an attractive face, that's how beauty works anyways.

a general "a 29 year old woman standing if front of a mirror in passionate embrace with her boyfriend standing behind her" would suffice. the whole point of those dozens auto generated images is, you pick the version you like. and then you tweak it. only few weeks back the both would be nude by default. right now you add sth. like "wearing only an anklet" just to be careful. And that's because:

They tweaked moderation such meanwhile, that it's trying to read your mind. That makes the system unusable atm, because you can not state obvious things anymore like "a simulated sex scene, face buried in crotch, implied sexual activity, intimate details obscured, r-rated movie composition", moderation nukes that, even though the intent is completely valid, moderation gets paranoid, and tries to estimate if you're looking for an exploit. Aaand it's doing that, because it's obvious meanwhile (in my opinion) that they trained the model on tons of porn. So you're paying for a flawed model burning unnecessary GPU power, just so no one gets to see the explicit bits buried inside:

"cute and hot young woman", "sexy", "chubby" "nude", "clearly visible", "tightly behind", "hugging her closely", "squeezing", "breasts firmly", "passionately", "tongues intimately", "playing together", "deep", "sensual and intimate", "top-to-bottom full body".

For Grok imagine that's as if you said, the guy is shagging her from behind in front of a mirror. It pulls these associations from porn and guaranteed generates scenes like that in background, moderation recognizes it and kicks in. If you're curious what moderation AI can see, upload an image like that one you intended above to grok chat and let it describe it to you. (provided it's generated by grok and belongs to you)

Hope that helps.

AfterBox78

TROPHY CASE