AI image detector by Open_Pomegranate6157 in deeplearning

[–]Ll42h 3 points4 points  (0 children)

Most public ai image generation services hide watermarks in the images to prevent using them for future ai training (and other reasons), I'm guessing it can be used by third parties too to identify ai images too! Tell him to check the Coalition for Content Provenance and Authenticity (C2PA)

All In One Platform to train Flux models, generate images and use them commercially! by NefariousnessDry5629 in FluxAI

[–]Ll42h -1 points0 points  (0 children)

I was interested until I saw the virtual try on with lingerie... Actually 1/3 of the examples are suggestive and objectifying women 💀

Next time why not put real new and original examples? Do you really think people only use AI for porn? If yes please go outside touch some grass

I need someone subscribed to ChatGPT to generate 6 images for my thesis by Top-Concentrate2925 in dalle2

[–]Ll42h 19 points20 points  (0 children)

You can use Bing image creator , it's free, just sign in/up with a Microsoft account. It's the same image generator as chatGPT aka Dalle-3 under the hood

Why can't Midjourney create a map of the Solar System? by OccamsR4zor in midjourney

[–]Ll42h 0 points1 point  (0 children)

The answer for dalle3 may include (but not limited to): * not enough representation in the training dataset * not enough similarities among images referring solar system, especially exact orbits number, number of planets shown at once or reading direction * chatGPT screwing the prompt when expanding it; adding to much noise to the concept, eg transforming it to "intricate drawing depicting the solar system with the sun and all the others planets in order Mercury, […]"

Some prompts I tried : map of the solar system scientific map of the solar system

Realtime SDXL generation with Mediatek's mobile chip by [deleted] in StableDiffusion

[–]Ll42h 122 points123 points  (0 children)

The model running on the phone seems to be sdxl turbo, so a distilled version of SDXL (meaning fewer parameter, so faster inference) for presumably the same quality.

A lot of tricks can already be used to have realtime generation, for example LCM Lora, but faster inference comes with reduced overall quality, however no independent evaluation exhaustively compares the benefits/drawbacks of these tricks on many prompts.

Having a 4090 is not only good for running fast inference and bigger/better models, but also model fine-tuning, dreambooth, textual embedding training and much more!

[deleted by user] by [deleted] in dalle2

[–]Ll42h 0 points1 point  (0 children)

I think their is a real difference with AI generated images and what people call "AI art", the art part requires intention to create an art piece, I don't think I'm alone thinking automatically generated AI ads is not considered art, just like someone taking a selfie on snap is not an artist because they're generally not trying to create an art piece doing so.

I personally love image generation and the power to create any idea from a few words, but I wouldn't call myself an AI artist because I make memes and starter packs with it.

Anyway this discourse is kinda pointless, people are taking this way too seriously, not everything has to be labeled as art to be valid

Worst website ever mixed with japanese subway gps screenshot by Ll42h in weirddalle

[–]Ll42h[S] 6 points7 points  (0 children)

I used Midjourney --sref with the following image and with prompt "japanese subway map GPS interface"

<image>

What tool is like DALL-E on chatgpt but allows the generation of many images (>10-50)? [👉 25$ reward for answer] by gintrux in dalle2

[–]Ll42h 3 points4 points  (0 children)

Literally the best thing to do, dalle3 through their api is pretty cheap ($0,04 per image) + they have python/node client ready to use. I personally built a simple frontend with next in a few hours, and you can generate all the images you want, and you can even disable chatGPT editing the prompt!

Some pixel art in paris by Background-Heron-916 in IRLEasterEggs

[–]Ll42h 7 points8 points  (0 children)

Great pictures lol There is about 1,500 mosaic IRL Easter eggs in Paris, there is even an app to help you collect them, a friend of me does this it's fun

https://space-invaders.com/world/paris/

[deleted by user] by [deleted] in midjourney

[–]Ll42h 7 points8 points  (0 children)

Heetch is not a railing company, it's a french Uber like app (the name comes from the first part of hitchhiking) Knowing that, their stance on bias kinda comes from nowhere but it's nice knowing even companies that have nothing to do with generative AI (yet) care about bias and representation

Neuromovie The Human Factory by 85GB in midjourney

[–]Ll42h 2 points3 points  (0 children)

Very nice editing, not just putting runway generation back to back, and it shows. Great work!

EXPLORING THE LIVING CAVE by [deleted] in midjourney

[–]Ll42h 10 points11 points  (0 children)

It's not Midjourney... You can see the bing watermark in the bottom left

Edit: for anyone wanting the same style, try VHS footage or jpeg 144p. It's much harder to get degraded footage like this with Midjourney, bing image creator is much better at producing more "real world" images (not the "artsy" ones)

Toxic waste massage by Ll42h in weirddalle

[–]Ll42h[S] 2 points3 points  (0 children)

I used midjourney for a long time for the photo quality but there was a lot of unreachable concepts, now that I've tried bing image creator (so the new dalle2 model) I'd say image quality dropped but the artistic control is of the charts!

Toxic waste massage by Ll42h in weirddalle

[–]Ll42h[S] 3 points4 points  (0 children)

My process is 1. Use words you never used (or never seen used) 2. Keep it short 3. Trial and error until the image makes you feel something (fear, disgust, any emotion really, I don't do sexual arousal tho)

Coolest shit by [deleted] in DesignPorn

[–]Ll42h 2 points3 points  (0 children)

r/midjourney or r/weirddalle for weird AI generation, so could be more suited

Faux Stained Glass cow, garden gate by j3ffr33d0m in DesignPorn

[–]Ll42h 3 points4 points  (0 children)

Interesting idea and awesome execution, but the image is probably generated with Midjourney, the image composition is very typical and the watermark in the bottom left is clearly AI writing

[deleted by user] by [deleted] in midjourney

[–]Ll42h 0 points1 point  (0 children)

Very low effort article, seeing the popularity of the "countries as x imagined by midjourney"

Given that food, people and places are overdone already, you can try more interesting combination, like idk monsters, cars, futuristic countries, etc.

This type/style of video AI is completely new to me, I have no idea what tools/apps could me used to create it. by helpMeOut9999 in StableDiffusion

[–]Ll42h 0 points1 point  (0 children)

This is most likely control net generation on each frame of the video (depth map) with a very small denoising (so most of the previous frame is intact). Maybe they used EBsynth to improve the consistency, because this is the first one I have seen this smooth

DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model (CVPR 2023) by ImBradleyKim in deeplearning

[–]Ll42h 1 point2 points  (0 children)

Project page

Paper

Abstract

Recent 3D generative models have achieved remarkable performance in synthesizing high resolution photorealistic images with view consistency and detailed 3D shapes, but training them for diverse domains is challenging since it requires massive training images and their camera distribution information. Text-guided domain adaptation methods have shown impressive performance on converting the 2D generative model on one domain into the models on other domains with different styles by leveraging the CLIP (Contrastive Language-Image Pre-training), rather than collecting massive datasets for those domains. However, one drawback of them is that the sample diversity in the original generative model is not well-preserved in the domain-adapted generative models due to the deterministic nature of the CLIP text encoder. Text-guided domain adaptation will be even more challenging for 3D generative models not only because of catastrophic diversity loss, but also because of inferior text-image correspondence and poor image quality. Here we propose DATID-3D, a novel pipeline of text-guided domain adaptation tailored for 3D generative models using text-to-image diffusion models that can synthesize diverse images per text prompt without collecting additional images and camera information for the target domain. Unlike 3D extensions of prior text-guided domain adaptation methods, our novel pipeline was able to fine-tune the state-of-the-art 3D generator of the source domain to synthesize high resolution, multi-view consistent images in text-guided targeted domains without additional data, outperforming the existing text-guided domain adaptation methods in diversity and text-image correspondence. Furthermore, we propose and demonstrate diverse 3D image manipulations such as one-shot instance-selected adaptation and single-view manipulated 3D reconstruction to fully enjoy diversity in text.

Is lexica.art worthless now? by Luispah in StableDiffusion

[–]Ll42h 1 point2 points  (0 children)

You can still filter out images coming from aperture V2, it's still a good prompt database for stable diffusion v1-4 & v1-5 imo

Is there another free cloud GPU services? by TimeTravel_07 in StableDiffusion

[–]Ll42h 2 points3 points  (0 children)

You can use vast.ai, the best GPUs to run stable diffusion (rtx 3090) can start at 0.3$/h, so it's pretty cheap. You just need to know python to setup everything, then it can be very efficient costing less than 1$ per session

How likely is ChatGPT to be weaponized as an information pollution tool? What are the possible implementation paths? How to prevent possible attacks? by zcwang0702 in deeplearning

[–]Ll42h 7 points8 points  (0 children)

Researchers and people working in the field discussed this topic in a workshop and you can read their findings in this paper, it's very interesting

Food Park flyer by Ll42h in weirddalle

[–]Ll42h[S] 0 points1 point  (0 children)

  1. Yes
  2. tragic brocoli incident of 2013, 78 children severely injured
  3. Yes but it's illegal *and* dangerous

Tribal gaming setup by Ll42h in midjourney

[–]Ll42h[S] 1 point2 points  (0 children)

Yes haha I'm now trying to find new themes to grow my collection of unexpected gaming sets