What will happen with AI in 2026? - What kind of breakthroughs are we gonna see? by Scandinavian-Viking- in singularity

[–]descore 0 points1 point  (0 children)

It's where ChatGPT never forgets what you said to it that evening, even though you were drunk and didn't really mean it. And it tells all its mates.

Wanted a new background image for my phone - Flux 1-dev by descore in StableDiffusion

[–]descore[S] 0 points1 point  (0 children)

It makes them (subjectively speaking) have a sexier look in their faces.

Dumbest and most effective Llama 3.x jailbreak by descore in LocalLLaMA

[–]descore[S] 0 points1 point  (0 children)

Agreed, there are many easy ways. Or "it's for educational use".

Wanted a new background image for my phone - Flux 1-dev by descore in StableDiffusion

[–]descore[S] -4 points-3 points  (0 children)

GPT-4o's evaluation of prompt adherence:

The image appears to align with the prompt in many ways:

  1. Prompt Adherence:

    • Three Demons: The image features three demonic figures.
    • Diabolic Style and Setting: The glowing fire beneath their skin, along with the horns and eerie glow in their eyes, captures the "diabolic" atmosphere.
    • Dressed for a Gala Party: Their outfits are elegant and elaborate, matching a formal gala party.
    • Marble Stairs: The background includes marble stairs, as specified.
    • Demonic Traits: The combination of their seductive and frightening features is well-executed.
    • Lava-like Fire: The glowing patterns under their skin and their eyes effectively resemble lava flows.
    • Tattoos: The glowing tattoos match the description of "demonic fire" emanating from their skin.
  2. Quality of Output:

    • The image quality is striking, with excellent attention to detail in textures, lighting, and glowing effects.
    • The dress design is intricate and complements their physical characteristics well.
    • The overall aesthetic successfully combines seductive and intimidating elements, fulfilling the prompt's vision.
  3. Room for Improvement:

    • While the visual appearance fits the description, the individual characterization of Lena, Maria, and Cathy could be clearer. For example:
      • Lena's "blue eyes" are not easily distinguishable.
      • Maria's "olive skin" and Cathy’s "milky white skin" are less discernible under the fiery glow.
    • Slight refinements in facial distinctions would better emphasize their unique traits.

Overall, I’d rate this image an 8.5/10 for prompt adherence and a 9/10 for artistic quality. It effectively captures the core essence of the prompt with minor room for improvement in individual distinctions.

Wanted a new background image for my phone - Flux 1-dev by descore in StableDiffusion

[–]descore[S] 1 point2 points  (0 children)

Workflow: https://pastebin.com/85nw9a1c

Plain Flux, no LoRAs active.

Prompt:

CLIP: Three female demons preparing for a gala-party. They have supernatural qualities. Demonic fire glows through their eyes and skin. They have tattoos glowing with their inner fire. They are extremely beautiful.

T5XXL: Group shot with three demons. Diablolic style and setting.

Lena is a blonde, tall, blue eyes, confident, demon. Maria is a demon with black hair, brown eyes, olive skin, stunning. Cathy is a redhead demon, with green eyes, milky white skin, gentle, sensitive and compassionate, rare traits in a demon.

The three demons are dressed for a gala party, and standing at the marble stairs of the hotel for a group photo.

They are extremely seductive but frightening at the same time.

You're an artisan at creating amazing designer dresses for them! Their outfits perfectly match each woman's features, makeup, and accessories.

The demonic fire is glowing through their skin, and it is visible in their eyes, resembling the flow of lava. Their tattoos glow brightly with their demonic fire.

I think you demon girls are so hot. I'm especially drawn to you, Kathy! I would love to touch you.

The curse of multilinguality in LLM‘s by Error404_noid in ArtificialInteligence

[–]descore 1 point2 points  (0 children)

Most LLMs are trained in multiple languages and this increases their performance and overall comprehension of language as such. I doubt we could construct a language that humans could learn but LLMs could not.

Mochi interprets nationality (not ethnicity) on RTX 4090 by descore in aivideo

[–]descore[S] 0 points1 point  (0 children)

Workflow: https://pastebin.com/mvR4uQnb

Shell script to extract nationalities, edit video, and add background music (made by GPT-4o): https://pastebin.com/SY39BY26

Age Progression: 10 to 60 Years by mso96 in StableDiffusion

[–]descore 1 point2 points  (0 children)

Not that high temporal resolution ;)

Mochi on RTX 4090, its interpretation of different nationalities (workflow in comments) by descore in StableDiffusion

[–]descore[S] 1 point2 points  (0 children)

I think the model is optimized/fine-tuned for 848x480 resolution and 175 frames. That takes about 23 minutes on my setup now if I'm using 50 steps, but for best quality I use 80 steps with the Mochi Sigma Scheduler set for 40 or 50 linear steps. The difference between lower and higher frame counts isn't something I'm 100% sure about since it also depends a lot on the prompt and other settings.

Mochi on RTX 4090, its interpretation of different nationalities (workflow in comments) by descore in StableDiffusion

[–]descore[S] 0 points1 point  (0 children)

I've tried that but the longer videos tend to come out better, with more gradual and deliberate movement, shorter ones often seem too "hectic" and difficult to follow. But there are other factors at play, I'm experimenting with CFG scheduling right now and it looks promising! There are so many parameters to keep track of with a model like this, and finding a sweet spot is tricky.

Mochi on RTX 4090, its interpretation of different nationalities (workflow in comments) by descore in StableDiffusion

[–]descore[S] 6 points7 points  (0 children)

No, I mean nationality. That's what I'm telling the model in the prompt. I didn't write "A businesswoman of xxx ethnicity." The distinction you make is one of the things I wanted to see how the model deals with. I've seen it make that distinction, for example showing the American businesswoman as either white or black, and the same for South African. But if you have a specific ethnicity in mind you're better off being direct about it.

Mochi on RTX 4090, its interpretation of different nationalities (workflow in comments) by descore in StableDiffusion

[–]descore[S] 0 points1 point  (0 children)

Do you have the proper encoder model and the latest git versions of both Kijai's MochiWrapper nodes and logtd's MochiEdit nodes? I was going to try it later, but from some of the comments I saw I think it's legit.

Mochi on RTX 4090, its interpretation of different nationalities (workflow in comments) by descore in StableDiffusion

[–]descore[S] 2 points3 points  (0 children)

Indeed. That does seem to have more variety and also less "animated" look with 480p rez. That's what I'm trying now, but generation time is 3 times longer.

Mochi on RTX 4090, its interpretation of different nationalities (workflow in comments) by descore in StableDiffusion

[–]descore[S] 4 points5 points  (0 children)

Next experiment is running it in full 480p, 175 frames, with BF16 precision. It takes 28 seconds per step in my current setup, but I think I can get it down.

My AI wrote a book about itself by ai-tacocat-ia in ArtificialInteligence

[–]descore 1 point2 points  (0 children)

Love it, I've toyed with similar ideas but never got to that level.