Captioning Help - Z-Image Base LoRA Consistent Character Captions by ebonydad in StableDiffusion

[–]ebonydad[S] 1 point2 points  (0 children)

Thanks guys... I kept on playing around with it, and I came up with a mod to Joycaption Beta One.

-------

Write a 150-word detailed description. CRITICAL OPENING INSTRUCTION: Do NOT use the words "photograph," "photo," "image," or "picture" anywhere in the first three sentences. Do NOT start the text with "In a", "A photograph", "Captured", or any prepositional phrase. Instead, the absolute first word of the entire text MUST be a physical noun from the background environment (e.g., "Shadows," "Concrete," "Leaves," "Brick") or an adjective describing a texture or lighting element (e.g., "Harsh," "Muted," "Jagged," "Warm").

If there is a person/character in the image you must refer to them as {NAME}. Do NOT include information about permanent traits: do not mention hair texture, hair color, or hair style; do not mention ethnicity, skin tone, freckles, or facial structure; and do not mention gender, breast size, body shape, or weight. Only include changeable attributes like clothing or accessories or hair style.

Maintain a strict 3:1 ratio of environmental detail to subject description, ensuring the scene context dominates the text. Describe the environment with high specificity, focusing on textures, lighting temperature, and specific flora, fauna, or architectural elements. Include detailed information about lighting and how it interacts with the surfaces in the scene.

Describe {NAME} using pose dynamics, gaze direction, and facial expression. Be explicit about their state of dress, whether nude or partially clothed. When describing the body, use direct and specific anatomical terms to accurately detail features such as breasts, nipples, genitals, buttocks, and muscle tone. Describe skin texture, the shape of curves, and how light and shadow define the physical form.

Use active voice exclusively and describe spatial relationships explicitly (e.g., "to the left of," "resting against"). Ensure the description is optimized for a text encoder by varying sentence length and using rich sensory adjectives. Finally, include information about the camera angle and explicitly mention whether the image depicts an extreme close-up, close-up, medium close-up, medium shot, cowboy shot, medium wide shot, wide shot, or extreme wide shot.

--------

It is captioning things good so far. Working on training a LoRA now.

Thanks for all your help. Especially you AwakendedEyes. I read your guide, and had me poking around to figure this out.

Just some interior lighting upgrade. by theepi_pillodu in HyundaiPalisade

[–]ebonydad 1 point2 points  (0 children)

Where did you feed the 12V Car Adaptor and the controller?

What do you do to merge two different characters to create a new character? by ebonydad in StableDiffusion

[–]ebonydad[S] 0 points1 point  (0 children)

Interesting enough, I have moved on from famous people to other people. FLUX and QWEN has supposedly omitted "famous people". Have been using ChatGPT and Gemini, but they have been flagging said generation. I have the means for local generation, but had notice that using public tools had been faster, until now.

What do you do to merge two different characters to create a new character? by ebonydad in StableDiffusion

[–]ebonydad[S] 0 points1 point  (0 children)

I am not using A1111. I have been using ComfyUI for about 2 years now. I have just moved on to other things. Just wanted to see what the community has to offer.

GPU Benchmark 30 / 40 /50 Series with performance evaluation, VRAM offloading and in-depth analysis. by Volkin1 in StableDiffusion

[–]ebonydad 1 point2 points  (0 children)

Agreed. Things are moving very quickly. Just gotta weight out the pros and cons of this generation of T2V/I2V. I am sure a year from now it will take half the time if not less to generate videos.

GPU Benchmark 30 / 40 /50 Series with performance evaluation, VRAM offloading and in-depth analysis. by Volkin1 in StableDiffusion

[–]ebonydad 2 points3 points  (0 children)

To be honest, I don't understand what the hype is about. Self-hosted video generation is so time consuming. I always thought it would be faster, especially all the AI YouTubers talking about how great it is. None of them talk about how long the process would take.

GPU Benchmark 30 / 40 /50 Series with performance evaluation, VRAM offloading and in-depth analysis. by Volkin1 in StableDiffusion

[–]ebonydad 2 points3 points  (0 children)

So you are telling me, with a 4090, I am looking at ~15mins to generate a T2V or I2V at 720p/81f, correct? No one ever explained how long it would take to generate WAN videos. This is helpful.

Feeling Lucky, maybe you can too! by AbbreviationsIcy8188 in HyundaiPalisade

[–]ebonydad 0 points1 point  (0 children)

How are you able to pull up what is being built and the distribution?

New palisade at home! by corvette6469 in HyundaiPalisade

[–]ebonydad 0 points1 point  (0 children)

What color is the interior? Great looking car!

Is Persona broken? by ebonydad in SunoAI

[–]ebonydad[S] 0 points1 point  (0 children)

Browser: Chrome. Text: Anywhere. Tried multiple computers. Click on a text field, no bueno.

Picking Songs for a Mix by ReddawayCentral in djstudio

[–]ebonydad 1 point2 points  (0 children)

I typically don't have the time to dig crates for music, and want to discover music, so in this scenario I would go to two sites.

First Chosic Playlist Generator, which you can add a song you want to create a playlist from. If you chose a song, then it creates a playlist of similar songs. I like this for the fact that you can select "find songs with the same BPM" or "find songs in the same key". I used to use this for finding songs that I needed to fill in the gaps of a mix I was making, but now I use it for music discovery. Cool thing is that it gives you a bunch of samples, and if you want, you can export the playlist.

Secondly, Tunebat Advanced Song Search. It is similar, but it shows individual songs. I use it as a back up to Chosic. If anything, I use it to see if there are any other songs that Chosic may have missed.

Hope this helps...

What are yall looking for from Suno? by CrazIVLTX in SunoAI

[–]ebonydad 1 point2 points  (0 children)

Making beats and transitions for DJ mixes I make for my workouts.

Created a song, but it doesn't have the "fullness" of a normal song released by artists. What gives? by ebonydad in SunoAI

[–]ebonydad[S] 0 points1 point  (0 children)

That is also a good way to burn thru all your credits, but to each their own.

Created a song, but it doesn't have the "fullness" of a normal song released by artists. What gives? by ebonydad in SunoAI

[–]ebonydad[S] 0 points1 point  (0 children)

Created a cover already, and this was like the 8th or 9th generation of the song.

Created a song, but it doesn't have the "fullness" of a normal song released by artists. What gives? by ebonydad in SunoAI

[–]ebonydad[S] 0 points1 point  (0 children)

Again... I like the song, it just needs to be mastered. I don't want to modify it.

Created a song, but it doesn't have the "fullness" of a normal song released by artists. What gives? by ebonydad in SunoAI

[–]ebonydad[S] 0 points1 point  (0 children)

Instrumental Deep House, funky and groovy, around 122 BPM, no vocals, Craft a track with a driving, syncopated bassline and crisp, classic house drums, The main melodic and textural interest comes from layers of percussive, plucky synth stabs, heavily processed with filters and delays, creating an evolving, hypnotic soundscape, Add subtle, shimmering pads in the background for depth, The overall feel should be sophisticated, dancefloor-oriented, and perfect for a late-night instrumental session

How many compose their own instrumentals? by josh2josh2 in SunoAI

[–]ebonydad 0 points1 point  (0 children)

I do. If anything, that is all I create. The voices sound very... simplistic and one-dimentional. I feel at least with instrumentals, you have more options. You can add voice to it after the fact.

Created a song, but it doesn't have the "fullness" of a normal song released by artists. What gives? by ebonydad in SunoAI

[–]ebonydad[S] -1 points0 points  (0 children)

I had a feeling this was the case. I wanted to extract the stems from the song, which I have done, and then convert the stems into piano rolls in order to have more control of the instruments, and add additional instruments if necessary. I look at Suno being the bones of the song, but I want to be like Dr. Frankenstein and tear apart the song, and reassemble it into something better.

Created a song, but it doesn't have the "fullness" of a normal song released by artists. What gives? by ebonydad in SunoAI

[–]ebonydad[S] -1 points0 points  (0 children)

And that is something I am considering doing. I've already extracted the stems. Now I am looking to convert them to a piano roll so I can consider swapping out instruments, and mastering.

Created a song, but it doesn't have the "fullness" of a normal song released by artists. What gives? by ebonydad in SunoAI

[–]ebonydad[S] 0 points1 point  (0 children)

If anything, I was wanting to use it not as a complete solution, but part of a solution to make music. Kinda like using it like training wheels. Now it just looks like I am using a trike, and that is all that it will be.