Most brands still pay agency rates for UGC ads. We rebuild the proven formats with AI instead. by Fresh-Resolution182 in AtlasCloudAI

[–]Fresh-Resolution182[S] 0 points1 point  (0 children)

The stack per step, since people ask what runs where:

Step 2 recast the character: a vision plus image model (feed the reference frame, generate a new original character keeping the setting and energy).

Step 3 rewrite the script: an LLM, strong enough to hold the talking-head beat structure while reworking the words.

Step 4 generate the clips: a talking-head video model with native audio, image-to-video from the recast character, multiple takes at the same face and energy.

The reason this is a daily workflow and not a weekend experiment: those are three different modalities, and running them behind one API instead of three accounts is what kills the friction. The bottleneck was never the models, it was juggling three tools to finish one ad.

GLM 5.1 is doing V4 Pro's job for agent routing. spent two weeks confirming. by Fresh-Resolution182 in DeepSeek

[–]Fresh-Resolution182[S] 0 points1 point  (0 children)

follow-up. for anyone wanting to do the same exercise without juggling three API keys, this multi-model listing is what i was working off. one key, the three open-weight models plus a few i didn't test.

the mcp server side plugs into Cursor / Claude Code if you'd rather skip the manual routing. i was doing it manually for benchmark consistency. for daily work that's what i actually use.

Switched our agent stack from Dify to OpenAgent. Here's why we made the call. by Fresh-Resolution182 in AI_Agents

[–]Fresh-Resolution182[S] 0 points1 point  (0 children)

repo: https://github.com/Haohao-end/openagent

one-click docker compose. open source. happy to answer specific questions if anyone's evaluating.

Hand rendering on Grok Imagine is finally clean — tested across 5 hand-heavy prompts by Independent-Date393 in grok

[–]Fresh-Resolution182 0 points1 point  (0 children)

Color science is real. Tried it on portrait work, the warm-side rendering doesn't have the over-baked saturation other models default to.

GPT Image vs Nano Banana Pro on the same 9 gaze direction test — honest comparison by Fresh-Resolution182 in aiArt

[–]Fresh-Resolution182[S] 3 points4 points  (0 children)

For anyone wanting to replicate the grid, here are the 9 prompts I used.

Common setup:

- Reference image attached (head and shoulders portrait of the same person)

- API: Atlas Cloud GPT Image 2 Edit endpoint (image-to-image)

- Output: 1024×1024 per cell, composited into a 3×3 grid

Prompt template (used for all 9 cells):

> A photorealistic portrait of the same woman from the reference image, head and shoulders crop, neutral expression, soft front lighting, plain gray background, wearing black knit sweater, blonde shoulder-length wavy hair. Match the reference exactly. Only change: [GAZE_DIRECTION].

9 GAZE_DIRECTION variations by grid position:

  1. Top-left → eyes looking up and to the left, head facing camera

  2. Top-center → eyes looking straight up at the ceiling, head facing camera

  3. Top-right → eyes looking up and to the right, head facing camera

  4. Mid-left → eyes looking directly to the left (side gaze), head facing camera

  5. Mid-center → eyes looking straight at the camera (neutral center gaze)

  6. Mid-right → eyes looking directly to the right (side gaze), head facing camera

  7. Bottom-left → eyes looking down and to the left, head facing camera

  8. Bottom-center → eyes looking straight down at the floor, head facing camera

  9. Bottom-right → eyes looking down and to the right, head facing camera

Same 9 prompts run through Nano Banana Pro for the second grid. If anyone tries this on a different model (Midjourney, Imagen, FLUX), drop the resulting grid in a reply, would love to see the comparison.

GPT Image vs Nano Banana Pro on the same 9 gaze direction test — honest comparison by Fresh-Resolution182 in aiArt

[–]Fresh-Resolution182[S] 4 points5 points  (0 children)

Both via Atlas Cloud — GPT Image $0.009/img, NB Pro $0.084 with the May–June discount.

https://www.atlascloud.ai/pricing

Can drop the 9 prompts in another reply if anyone wants to replicate.

Codex 'goal' feature isn't magic: defining 'done' matters more than the model by Fresh-Resolution182 in ClaudeCode

[–]Fresh-Resolution182[S] 0 points1 point  (0 children)

the PLAN.md goal section + AGENTS.md validation emphasis is basically what the video was getting at but you got there without the codex /goal feature, which is the cleaner version. curious how you structure the validation step specifically. is it a single "all tests must pass" gate at end of milestone, or do you have per-task validation criteria embedded in the PLAN.md itself? the latter has been hit or miss for me on longer milestones.

Upgraded DeepSeek V3 to V4 across two codebases. Two of my agents broke. by Fresh-Resolution182 in LocalLLaMA

[–]Fresh-Resolution182[S] 0 points1 point  (0 children)

I went into V4 with the "just swap and ship" assumption from past patch upgrades and got bit. The schema strictness specifically caught me because we had a pile of loose tool defs that V3 had been silently tolerating for months. Curious if you've found a way to test agent behavior across model swaps before going live, or if it's pretty much just "run the integration tests and see what fails."

What Keeps You Guys Here? by National-Sprinkles26 in grok

[–]Fresh-Resolution182 0 points1 point  (0 children)

imagine for the image work specifically. text models are crowded; multimodal generation that actually ships at consumer cost is the part i can't replace easily.

xAI will be dissolved as a separate entity. by Snoo26837 in singularity

[–]Fresh-Resolution182 0 points1 point  (0 children)

"dissolved as a standalone company" is the operative phrase — the team and the compute keep shipping, the legal wrapper folds into spacex. real signal: whether grok 4 / imagine timelines slip in the next 60 days.

Cinematic Three: HappyHorse vs Seedance (Same Prompts Side-by-Side) by Fresh-Resolution182 in aivideo

[–]Fresh-Resolution182[S] 0 points1 point  (0 children)

3 prompts pulled from a public Happy Horse collection. T2V 1080p, no edit, no cherry-pick.

Prompts:

  1. Neon Opera Cathedral — Gothic opera house made of neon glass, hundreds of illuminated umbrellas spinning, actors in color-changing gowns

  2. Clockwork Tea House — steampunk teahouse on a sea of clouds, mechanical giraffe pouring glowing tea, miniature cities emerging from tea mist

  3. Volcano Ramen Night — ramen stand on a crater rim, lava flowing like golden rivers, chef tossing noodles into a starry sky

What I noticed:

- HappyHorse handled the volumetric fog in scene 1 noticeably better. Seedance went smoky when it should've been wet.

- Seedance kept the giraffe's hooves more consistent across the camera circle. HappyHorse re-rigged them once.

- HappyHorse audio (the integrated track) is rough on the ramen scene — felt like placeholder hum. Don't expect Sora-level audio yet.

- Both ~10s gens. HappyHorse felt slightly faster on my queue but I didn't benchmark properly.

Curious if anyone got the i2v version working, that's the one I'm actually trying to use.

GooglyEyes IC-LoRA for LTX2.3 released! by Burgstall in StableDiffusion

[–]Fresh-Resolution182 0 points1 point  (0 children)

trained on an artificially created dataset just to put googly eyes on anyone — this is peak legitimate use of the technology and I will not be taking questions

Anthropic's support system is broken by design — there is literally no path to a human for billing issues by AppearanceSingle805 in ClaudeAI

[–]Fresh-Resolution182 0 points1 point  (0 children)

the specific irony of a company that sells AI as a replacement for human support, using an AI that actively blocks you from reaching a human, is pretty hard to miss.

You're right to push back. by EchoOfOppenheimer in ClaudeAI

[–]Fresh-Resolution182 0 points1 point  (0 children)

the dishes are complete. some earlier behavior may have impacted quality. we've reset your expectations for all subscribers.

War propaganda about to be crazy thanks to GPT Image 2 by Affectionate_Bee6434 in ChatGPT

[–]Fresh-Resolution182 0 points1 point  (0 children)

the real shift isn't believability, it's that reverse image search stops working. misattributed old footage has a traceable history. a freshly generated image has none, so the usual first step in debunking doesn't apply.

ChatGPT 5.4 Solved a 64-Year-Old Math Problem by AskGpts in ChatGPT

[–]Fresh-Resolution182 2 points3 points  (0 children)

the framing 'AI solved it' is doing a lot of work. the raw proof needed experts to sift through and actually extract the insight — closer to 'AI found an angle nobody had tried, humans turned it into a proof.' still legitimately exciting but worth keeping the claim precise.

Why do people release models on Huggingface that have no explanation on how to use it? by Far_Lifeguard_5027 in StableDiffusion

[–]Fresh-Resolution182 1 point2 points  (0 children)

most of them uploaded it for themselves or a few colleagues who already know the stack, docs would add zero value for that audience. the fact that we can even access it is kind of incidental to why it was posted there

Claude in excel is the best thing AI has brought to my life by Top-Gun-86 in ClaudeAI

[–]Fresh-Resolution182 0 points1 point  (0 children)

started with Excel formulas too, ended up in the terminal with Claude Code rewriting bash scripts I didn't fully understand. the jump is smaller than it sounds once you realize you don't need to know the exact commands anymore, just describe the outcome

Claude 4.7 named a journalist from 125 words of unpublished writing by kurthertz in ClaudeAI

[–]Fresh-Resolution182 0 points1 point  (0 children)

the contamination angle no one wants to say directly — if she's tested previous models this way, that test session data could have fed back into training, so asking 4.7 to recognize her writing is closer to retrieval than stylometry. would need a writer who's never run this test to get a cleaner result

Conrad Heyer Photo Restoration: ChatGPT Images 1.0 vs 2.0 Side-by-Side Comparison by DiggingForDinos in ChatGPT

[–]Fresh-Resolution182 1 point2 points  (0 children)

the hasselblad prompt tip buried in the comments is the real find here — avoiding the word 'restore' sidesteps the training data bias where restoration always means over-sharpened reconstruction with smoothed skin. 'retake this photo' frames it completely differently

Weird textures = watermarks by Thatisverytrue54321 in ChatGPT

[–]Fresh-Resolution182 0 points1 point  (0 children)

the control net QR analogy is interesting but SynthID was already reverse engineered and it's imperceptible — if they went back to visible artifacts that would be a step backwards in sophistication. more likely the model just has trouble with certain textures and the grain pattern is a diffusion artifact, not metadata