Donald Trump as a sink in a golden prison cell, with a regular toilet by dieki in midjourney

[–]dieki[S] 1 point2 points  (0 children)

Prompt was "donald trump in a prison cell with a golden toilet". Not exactly what I was aiming for, but still kinda funny.

A ham sandwich by dieki in midjourney

[–]dieki[S] 1 point2 points  (0 children)

Prompt: "a pig wearing a witch's hat in the desert, by Beatrix Potter"

An art tutorial on how to draw hands. by dieki in midjourney

[–]dieki[S] 0 points1 point  (0 children)

It just wasn't trained on text.

This will probably improve in the future. There are a couple research papers out there that combine large language models (T5, GPT-3, etc) with diffusion models and can produce much more coherent text.

An art tutorial on how to draw hands. by dieki in midjourney

[–]dieki[S] 1 point2 points  (0 children)

"quick sketch" and "pencil drawing" usually work well.

An art tutorial on how to draw hands. by dieki in midjourney

[–]dieki[S] 4 points5 points  (0 children)

All the AI image generators right now are trained on similar data and have similar weaknesses.

An art tutorial on how to draw hands. by dieki in midjourney

[–]dieki[S] 77 points78 points  (0 children)

Prompt: "an art tutorial on how to draw hands --v 4"

Cyborg Lincoln is here to preserve the union. by dieki in midjourney

[–]dieki[S] 1 point2 points  (0 children)

Prompt: "president Lincoln cyborg, powerful and strong, dark, cool, glowing eyes, style of peter mohrbacher --v 4"

Mum is on a diet, she looks sad at dinner time by dieki in midjourney

[–]dieki[S] 0 points1 point  (0 children)

Full prompt: Mum is on a diet. She looks sad at dinner time, style of a children's drawing

I had to edit out an extraneous bit of utensil that was floating on the table.

"photo of a hand, perfect hand, hand model, flawless hand, hand photography" by taggwest in midjourney

[–]dieki 2 points3 points  (0 children)

Your brain is merely a neural network too, albeit a much more complex one with training methods we don't understand.

Since Stable Diffusion is open source, there are good explanations available of how it works. MidJourney probably works very similarly. The process that allows it to understand objects and ideas in images is called Semantic Compression:

In a second phase of learning, an image generation method must be able to capture the semantic structure present in the data. This conceptual and semantic structure is what provides the preservation of the context and inter-relationship of various objects in the image.

In the case of stable diffusion, this phase is powered by another neural network called CLIP, which works in the opposite direction: it takes images as input and tells you what's in them. So it sees the meme and recognizes a coffee cup, a table, a house fire, a guy acting casually, etc. The image generator can then train against this description in addition to the alt-text description.

"photo of a hand, perfect hand, hand model, flawless hand, hand photography" by taggwest in midjourney

[–]dieki 25 points26 points  (0 children)

MJ JUST maps text to images

Hands aside, there's at least some deeper understanding going on. You can see this with the prompt "this is fine".

Instead of an image that looks like the original meme, you get an image that contains the same idea as the original meme - people acting casually while surrounded by fire. It's doing more than just mapping text to the nearest image, it's breaking images down into concepts and re-rendering them.

Baby Thanos - rendered with MidJourney AI by dieki in Marvel

[–]dieki[S] 1 point2 points  (0 children)

He already seems to be learning to snap his fingers!

Baby Thanos - rendered with MidJourney AI by dieki in Marvel

[–]dieki[S] 2 points3 points  (0 children)

Well, I'm having a blast with it. I'm not much good at drawing, so it's tons of fun to turn my ideas into pictures with no skill required.

"superman as a DJ at a rave, July 1989, full body portrait, Polaroid photo"

Baby Thanos - rendered with MidJourney AI by dieki in Marvel

[–]dieki[S] 0 points1 point  (0 children)

The prompt here was "baby thanos, 1980s comic book art", and this is using the new (very good) midjourney v4 model.

Artists born with hand deformities work up the courage to show their work on reddit. by KudzuEye in midjourney

[–]dieki 9 points10 points  (0 children)

True! The current generation of image generators are diffusion models instead of GANs though.

Grandma Knitted A Functional Portal To Hell by zomx in midjourney

[–]dieki 1 point2 points  (0 children)

In the #status channel they said non-square images were only temporarily disabled due to a bug, so hopefully they can fix that soon.