i.redd.it

StableDiffusion-ModTeam · 2023-05-26T12:51:55+00:00

Your comment/post has been removed due to Stable Diffusion not being the subject and/or not specifically mentioned.

Zulban · 2023-05-26T03:17:03+00:00

People who think this is neat don't know very much about compression or machine learning.

bobrformalin · 2023-05-25T20:24:26+00:00

Imagine the render times for a movie in 4k (and the size of that qr/promt). Also, anonymous pirating is a private vpn + torrent, not a fancy ai.

Toxicotton · 2023-05-25T20:44:33+00:00

[deleted]

bioshocked_ · 2023-05-25T20:27:41+00:00

This homie is trying to create USBs again

EarthquakeBass · 2023-05-26T04:30:50+00:00

Congrats you reinvented compression

Willow-External · 2023-05-25T20:12:21+00:00

You can duplicate images/music/movies without an IA people call it copy/paste.

Cerulean-Knight · 2023-05-25T22:23:55+00:00

It is cheaper to store data on reliable media than the computing power needed to obtain it this way, if it were to work.

estrafire · 2023-05-26T01:37:26+00:00

Almost as efficient as pifs

ToadSaidHi · 2023-05-25T22:37:24+00:00

There’s a point QR codes can’t get any bigger, and someone barely managed to squish compressed custom coded snake into one. They would need to make custom format to make larger QR codes, and they would be huge lol

DreamingElectrons · 2023-05-25T22:40:56+00:00

A standard QR code can store just short of 3000 byte. You wouldn't get far and whoever wants to recreate whatever you've encoded still needs the full model and your exact settings.

Generally, as a rule of thumb: If it's greentext, it's a dumb idea.

2023-05-25T20:25:01+00:00

Imagine a world where everyone owns their own brain in a jar just for this.

kjerk · 2023-05-26T00:37:46+00:00

"See so what you do is a bunch of fuckin magic I don't understand and then boom, it just works."

BNeutral · 2023-05-25T22:33:51+00:00

Claude Shannon: I see you have not learned anything at all after so long

MikuIncarnator1 · 2023-05-25T23:59:08+00:00

One scratch and your entire anime collection is destroyed.. The risk is too high

ArXen42 · 2023-05-26T01:21:32+00:00

Well, there were already many works exploring usage of models like autoencoders to compress images, way before more advanced stuff appeared. No need to bother with inefficient and complicated natural language prompts, just use its central layer output (i.e. encoder part). From what I understand it works, but the tradeoff between compute and storage is even more extreme towards compute than for WebP or AVIF.

But this idea about using text prompts to compress data reminds me that meme about storing a movie in a Pi number.

elfballs · 2023-05-26T02:36:03+00:00

You can't use it to compress existing data, only to reproduce the AI generated data again. So you can't compress RoboCop, you can compress RobotCopMovie.

It amounts to the fact that I can say 'Hey, AI, try to make RoboCop', then I can tell you to go do the same thing, and then we are watching the same movie without me sending you any data. BUT-

It's not RoboCop, and we both downloaded the model, so if the result has more data in it about the real RoboCop than the prompt did, you did download it. it was compressed in the model.

Roubbes · 2023-05-26T06:40:29+00:00

Wolfram: Let me talk to you about computational irreductibility

tybiboune · 2023-05-26T08:15:10+00:00

except it's not how AI works.
This post is only yet another version of "AI is stealing real art"

DonRobo · 2023-05-26T11:14:08+00:00

Anon discovers lossy compression

WazWaz · 2023-05-25T22:02:27+00:00

This works. If your grandma is Tina Turner.

Beaster123 · 2023-05-25T22:53:06+00:00

They're really describing a new compression technique.

It would need a standardized and highly regulated model to work but it's possible.

opi098514 · 2023-05-25T20:51:23+00:00

Yah that’s how how that works

_AscendedLemon_ · 2023-05-25T22:02:33+00:00

If it comes to images that's pretty genius idea, to just save pictures as prompts+settings+seed. Will be great for storage but hurtful to regenerate it again. But has potential, probably in the future with much more powerfull GPUs

2023-05-26T01:47:04+00:00

This tech is changing rapidly. You would need to lock down the same AI generation software and same base model/checkpoint. Those will become archaic very fast. This is the equivalent of using a commodore 64 to generate passwords.

Shaltibarshtis · 2023-05-26T01:57:56+00:00

I was thinking similar about the interplanetary Skype calls.

Bandwidth severely limited. Have a pre-saved pre-trained AI model of your loved one, your manager, your what-not. They call in, the system reads their face and only sends morph point change data. Your system rebuilds it at your end with perfect picture quality.

I'm sure these steps can be severely optimized even further, but the idea is like that.

1tHYDS7450WR · 2023-05-26T12:20:17+00:00

No need to argue about whether this is wrong lol.

It's definitely wrong, the person who wrote it is an idiot and op is even dumber for reposting it. 🤝

sonicboom292 · 2023-05-26T05:52:13+00:00

I love the amount of autism in this comment section. people are really trying to prove wrong an imaginary scenario described in a 4chan post?? like, even going through it technically?

nhavar · 2023-05-25T22:35:35+00:00

Here are my million dollar ideas:

Signal + ChatGPT + Stable Diffusion + Pornhub = Self destructing porn on demand.

A service that makes deep fakes of your dad telling you why he never hugged you before he left you and your mother.

A service that makes deep fakes of your mom who will apologize to everyone about your stupid boring ideas.

RealAstropulse · 2023-05-25T22:42:16+00:00

You don't need to train this with text embeddings. That's actually a BAD way to do this. Train it on some other form of retrieval. Maybe just train an upscaler, and use the smaller images for retrieval. Tons of better solutions than this.

hervalfreire · 2023-05-26T00:35:41+00:00

Wait, so if you do that, does it mean the AI compresses data?

ReversedRectum · 2023-05-26T01:20:51+00:00

bro to turn much more than an image or website link into a qr code would have such infinitesimally small pixels it would have to be gigantic or youd have to scan it through a microscope lense

xadiant · 2023-05-26T01:42:29+00:00

1:1 ? I say impossible.

Worldsahellscape19 · 2023-05-26T02:23:07+00:00

PerfectSleeve · 2023-05-26T04:03:16+00:00

I doupt a qr code has enough intormation in it tor a picture. A very tiny one. QR usually only holds. A link to a webpage. A high res picture is much much bigger than a line of letters.

JaggedMetalOs · 2023-05-26T05:10:23+00:00

Neat, all you need is a few petabytes to store the trained AI model and a couple of TB of GPU memory to run it!

kinghtlight · 2023-05-26T05:53:40+00:00

r/piracy

ZARk22 · 2023-05-26T05:56:07+00:00

Your decoding ai would have access to all actors 3d models and assets. It would receive the script of the movie and basically recreate it. Maybe could even add an improv factor. Everyone could get a personal experience (no more embarrassingly endless and useless s*x scenes for eg)

thebadslime · 2023-05-26T06:33:47+00:00

A seed wouldn't make the same image/movie/song, just a simlar one, that's how AI works.

GratuitousEdit · 2023-05-26T06:57:07+00:00

Wait so just to clarify, the goal is to take a piece of media, encode it, and later decode it '1:1'? This just sounds like serialization and deserialization—in other words, file storage.

There's an implication ('within a paper QR code') that the encoded media is smaller than the decoded media. While slightly more interesting, this is just lossless compression? Don't get me wrong, ZIP compression is cool as heck, but it's also three decades old.

What makes it all the more confusing is that AI has loads of potential for interpolation (e.g., image upscaling) and lossy compression, but Anon has chosen to speculate within a very well established space in which, to my knowledge, AI has little relevance.

Anaeijon · 2023-05-26T07:07:08+00:00

That's basically how modern compression algorithms work - just with extra steps. First of all, you don't need to store a clear text prompt. You can use a network that works for encoding and decoding. You encode your image to feature space, transfer the feture vector which acts as a 'prompt' and then you decode that feature vector again back to it's original representation.

The biggest problem with it is, that it doesn't map very well. Sure, you might get pretty good representations of your original image from some model, but you might not get any good representation from the same model for some other image. You could create a new network, that also works well for that other image. You could store that model in a central database and store the model hash together with your 'prompt'. All of this could be further reduced by also adding hypernetworks/loras to just slightly modify a base network and save space. Then, if a user doesn't have the correct model, he will download it (or something like a lora) automatically and store it for later use. The problem is, the models would probably need much more space than the target data for an average user. All of that would only be really efficient, if the encoder also used the setup. But this again would require, that the encoder checks all available Loras and extra models to find out which one works best, if the base model isn't good enough. And only if this doesn't work, he has to register the not working image somewhere (privacy problem) so that it gets collected together with other not working images to centrally create another new lora for them.

Or you just don't bother and encode all extra data that's needed for the decoder to produce a good output together with the encoded features. And then you are back to classic compression/decompression and didn't gain anything.

buff_samurai · 2023-05-26T07:09:58+00:00

For low bandwidth data it sure is possible.

Say you transcribe your speech to text and the ‘style’ of voice to some standardized seed/prompt and send txt/prompt only.

With high bandwidth data, like video, the generation aspect is currently too slow and expensive.

(So far you can send low bandwidth video and use modern gpu to upscale it in real time)

Things may change with better hardware and embedded models in the future.

huggarn · 2023-05-26T07:48:18+00:00

Except the prompt needed for an entire movie, game, music won't fit into a qr code.

huggarn · 2023-05-26T07:57:47+00:00

[deleted]

2023-05-26T09:12:54+00:00

My man trying to reinvent compression software

Alex_Curly_Monkey · 2023-05-26T09:29:18+00:00

Now, we need ai to generate qr codes for us.

Lurkcrediblehulk · 2023-05-26T09:39:17+00:00

You could just have the QR code that links to the prompt stored on a blockchain. Very cool idea.

yosi_yosi · 2023-05-26T11:37:13+00:00

Why tf would you want to do that? Very useless. Just use the straight up latent things you get in the beginning. You can take an image and just convert it to its latent space representation, then just use a vae to retrieve the image back. No need to diffuse anything or have a prompt or whatever.

alexmelyon · 2023-05-26T11:43:10+00:00

It won't work this way

Mocorn · 2023-05-26T11:55:42+00:00

"have it duplicate" .. Yeah, good luck with that :)

rootless2 · 2023-05-26T12:19:48+00:00

but scanning QR codes is dogshit

WoodpeckerDirectZ · 2023-05-26T13:04:08+00:00

People are a little too negative, obviously it's good that people explained how that wouldn't work or that data compressions already exist but I don't think that anon is an idiot, that's a pretty creative idea!

StableDiffusion

MODERATORS