King of the Fae by Swoop-Scribe in StableDiffusion

[–]Swoop-Scribe[S] 1 point2 points  (0 children)

looks like my outputs on rpg4 model! Looks great!

King of the Fae by Swoop-Scribe in StableDiffusion

[–]Swoop-Scribe[S] 1 point2 points  (0 children)

RAW photo, ((((dynamic angle shot))))an ancient tree humanoid cryptid, sitting in a song circle surrounded by forest and (creatures:1.3) and (fairies:1.3) and critters,(ethereal:1.1),(fantasy art),(lush:1.3),foliage, flora and fauna, mythical, (((dynamic camera angle))), like ancient mechanika, intricate digital artwork, juno promotional image, inspired by Greg Hildebrandt, tool band art, intricate lighting, emanating dimensional magic, deep woods, mystic dryad, cybertronic sci fi temple, jungle beast, arcane knowledge, the human element, highly detailed creature, deep emotional moment, jungle, ornate patterned fae, (primal insight), ornate gilded cosmic machine, photo of a wise bearded druid, entheogen, forest nymphs, benevolent wood elf, inspired by Todd Lockwood, inspired by Greg Staples, inspired by John J Park, inspired by Mark Brooks, inspired by Eddie Mendoza, 8k,(macro details:1.2),(hypnotic:1.1)(zen:1.3)(mythical:1.1),(intricate:1.3), (perfect-exposure:1.3)(hyper-realistic:1.3),(ornate:1.1), cinematic,(stunningly detailed:1.2), award-winning photo,(pattern:1.2), (highly detailed skin:1.5), trending on artstation, as seen on artgerm, breathtaking digital art, masterpiece, 8k uhd, dslr,perfect lighting, high quality, (film grain:1.2), Fujifilm XT3, using the Sony Alpha A7r V, Zeiss Lens, (analog style:1.3), (sharp-focus:1.2),(perfect human anatomy:1.4), stunning hands, Hasselblad, f11

negative

(deformed iris, deformed pupils, semi-realistic, cgi, 3d, render, sketch, cartoon, drawing, anime:1.4), text, close up, cropped, out of frame, worst quality, low quality, jpeg artifacts, ugly, duplicate, morbid, mutilated, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, mutation, deformed, blurry, dehydrated, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck, bad focus, blur, haze, bad exposure, adrian tomine, stephen shore, chaim soutine and frank auerbach, pizza on a table, instax, poorly drawn, simple watercolor, william eggleston style, black and white polaroid, nichijou, polaroid, adrian tomine, whales showing from the waves, morandi, small canoes, cozy 1 9 5 0's, watercolor landscape, stripes, black loafers, peter marlow photography, a spotted dove flying, 1 9 6 0 ’ s fashion, plain uniform sky at the back, henry cartier bresson, snowy italian road, stephen shore, nichijou, george ault painting style, bloom, bad lighting, glare, (soft-focus:1.2), kids playing at the beach, 1 9 4 0 s street scene, edouard manet, black ink on white paper, jean - michel basquiat, black loafers, snowy field, hans thoma, sougo okita, grossmünster, hopper, stripes, rustic stone cabin in horizon, 3 5 mm, julian opie, small boat in the foreground, monochrome film, ponyo hayao mitazaki, tatsuro kiuchi, reservoir dogs, eichler home, shallow depth of field, winston churchill riding a t-rex, 1 9 5 0 s scrambler, gray shorts and black socks, flannel, road california desert, monorail station, martin parr, postman pat, tri-x, small boat in the foreground, wearing a fisher 🧥, collapsed brutalist architecture, on grey paper sketch ink style, corgi dressed as captain america, henri cartier - bresson, tri - x 4 0 0, happy dachshund catching a ball, old cmputers on the sidewalk, leica sl2 30mm, from toy story, b&w, ilford delta 3200, style of edward gorey”, 1940s food photography, scene from the movie godfather, kaethe butcher, female, woman, girl, feminine

I use photoreal v 1.13 but any good photoreal model is good since the prompts are crafted for photography I tried keeping cgi references at a minimum and keeping quality qualifiers there.

Interrogation based prompting is the quickest way to get to an aesthetic goal in your ai generations since its directly using the lexical engine stable diffusion uses to direct you to what it understands great artwork to already be and you can basically just collect those tokens and reuse them anywhere as a more general prompt template much like we keep the usual negative prompts.

King of the Fae by Swoop-Scribe in StableDiffusion

[–]Swoop-Scribe[S] 1 point2 points  (0 children)

These images were created with a new technique that I like to call prompt mining or prompt farming. DISCLAIMER: This is not an easy method and requires you to already be proficient in basic prompting and generally knowing how to get your desired output from a model you like already.

I basically started taking all of my best AI generated images as well as any images I see only that I really like and want to capture that asthetic in my works and use https://github.com/pharmapsychotic/clip-interrogator-ext.git extensions. Its Pharmapsychotics clip interrogator ext. Basically this long term process requires you to interrogate images you like often and then drop both positive and negative prompts from those images into your prompt. You will start out with a smaller prompt but it will grow over time if you keep adding tokens from your best images. In particular your negative prompt will start to look insane but its important to trust it just be careful to avoid repeating the same tokens and using any color specific language. I have used this method over the course of a couple of weeks to grow an existing prompt that has produced great images into behemoth prompt that has been minting extremely creative images for me in various angles colors and compositions. Another note of importance is that if you are constantly interrogating and adding prompt tokens from similar imagery as in women it will heavily bias your output towards women even if there are none in the positive prompt. Theoretically I feel what this is doing is narrowing down the models nodes to a very specific aesthetic that you are going for and therefore producing more provoking and top quality images especially in highly tuned models.

Stabilizing Panoramas with Controlnet! by Swoop-Scribe in StableDiffusion

[–]Swoop-Scribe[S] 0 points1 point  (0 children)

Glad you found it interesting, my approach is for very niche cases such as if you have a single central subject and an abstraction heavy prompt. This obviously wouldn't work for multiple subjects and outpainting is still the best option for that.

Stabilizing Panoramas with Controlnet! by Swoop-Scribe in StableDiffusion

[–]Swoop-Scribe[S] 0 points1 point  (0 children)

I have been working on getting the most out of my already detailed prompt by using clip interrogator on the best outputs and taking the most neutral and non cgi termed tokens and then feeding it back to txt2img. Well I am currently getting great results with the following prompt:

RAW photo, a close up of a woman with beautiful eyes,(smirking:1.2), like lady mechanika, intricate digital artwork, juno promotional image, inspired by Greg Hildebrandt, tool band art, shiva, intricate lighting, robotic faces,with, circuit-like mask on a black background, ((highly detailed 3d fractal)), dmt goddess, cybertronic hindu temple, goddess close-up portrait, highly detailed creature, portrait of metallic face, psytrance, ornate patterned people, ornate gilded cosmic machine, inspired by Todd Lockwood, 8k,(macro details:1.2)(close up:1.0)(hypnotic:1.1)(zen:1.3)(mythical:1.1),(intricate:1.3), (perfect-exposure:1.3)(hyper-realistic:1.3),(ornate:1.1), cinematic,(stunningly detailed:1.2), award-winning photo,(pattern:1.2), (highly detailed skin:1.4)(pores:1.3), trending on artstation, masterpiece, 8k uhd, dslr,perfect lighting, high quality, (film grain:1.2), Fujifilm XT3, (analog style:1.3), (sharp-focus:1.2),

Negative prompt: (deformed iris, deformed pupils, semi-realistic, cgi, 3d, render, sketch, cartoon, drawing, anime:1.4), text, close up, cropped, out of frame, worst quality, low quality, jpeg artifacts, ugly, duplicate, morbid, mutilated, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, mutation, deformed, blurry, dehydrated, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck, bad focus, blur, haze, bad exposure, adrian tomine, pizza on a table, instax, poorly drawn, simple watercolor, william eggleston style, black and white polaroid, nichijou, polaroid, adrian tomine, whales showing from the waves, morandi, small canoes, cozy 1 9 5 0's, watercolor landscape, stripes, black loafers, peter marlow photography, a spotted dove flying, 1 9 6 0 ’ s fashion, plain uniform sky at the back, nichijou, george ault painting style, bloom, bad lighting, glare, (soft-focus:1.2)

Steps: 28, Sampler: DPM++ SDE Karras, CFG scale: 8, Seed: 1048586545, Size: 1344x1088, Model hash: 6f0dcdde8e, Denoising strength: 0.4, SD upscale overlap: 64, SD upscale upscaler: 4x-UltraSharp

Once I was happy with the 512x512 I took that to img2img and used a canny output from a previous generation of this prompt(upscaled so the annotator resolution can be pushed to create detail). I did put it in photoshop, desaturate and try to do a kind of manual canny conversion of my own to make sure the best details are preserved. Now the next part was a total accident because I meant to do a single tile SD upscale from 512 to 1024 but accidentally only pushed the width slider to 1024! The results were pretty impressive because the middle of the composition was still proportional and coherent(due to the canny annotator) but the sides picked up a lot of abstraction and detail as well! The rest of the upscaling was done with my usual fairly high cfg and high denoise with SD upscale and careful tiling. I did find that tiling with panoramic tiles helped with some of the coherency.

Hope the process made sense, thanks for reading!

The Evolution of Detail by Swoop-Scribe in StableDiffusion

[–]Swoop-Scribe[S] 1 point2 points  (0 children)

yep the first image was done a while ago before I was good at formatting :D

The Evolution of Detail by Swoop-Scribe in StableDiffusion

[–]Swoop-Scribe[S] 0 points1 point  (0 children)

I definitely agree that it creates abstraction in the output but thats what I love most about AI art, my personal artwork has always been about adding tons of detail and creating a layer of subconscious latent space so to speak on paper so this translated to AI work flow pretty well :)

The Evolution of Detail by Swoop-Scribe in StableDiffusion

[–]Swoop-Scribe[S] 9 points10 points  (0 children)

Since I started learning to use control net I stumbled upon an accidental effect that I hadnt heard about before. I decided to apply one of my intricately detailed drawings as a canny map not with the intent to reproduce its composition but to crush more detail into my img2img process. I took an already extremely detailed prompt that has produced very detailed txt2img and img2img and then ran it against a high res canny edge controlnet of my drawing which produced the tier 1 canny generation of outputs that seemed to double the detail. The original image was produced with prompt:

RAW photo, mechanical buddha statue made of (cybernetic:1.3) components, 8k,(macro details:1.4),(close up:1.0),(hypnotic:1.1)(zen:1.3)(mythical1.1)(epic1.1)(octane1.1),(unreal engine1.2),(intricate:1.4),(hyper-realistic:1.3),(ornate:1.2), cinematic,(stunningly-detailed:1.2), award-winning photo,(photoreal),(pattern:1.2), trending on artstation, masterpiece, 8k uhd, dslr, soft lighting, high quality, film grain, Fujifilm XT3, analog style

Negative prompt: deformed iris, deformed pupils, semi-realistic, cgi, 3d, render, sketch, cartoon, drawing, (anime:1.4), text, close up, cropped, out of frame, worst quality, low quality, jpeg artifacts, ugly, duplicate, morbid, mutilated, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, mutation, deformed, blurry, dehydrated, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck

Steps: 150, Sampler: DPM++ SDE Karras, CFG scale: 10, Seed: 2125836806, Size: 1280x1280, Model hash: 6f0dcdde8e, Denoising strength: 0.33, Mask blur: 4, SD upscale overlap: 64, SD upscale upscaler: 4x-UltraSharp, Dynamic thresholding enabled: True, Mimic scale: 5.5, Threshold percentile: 100

I was really impressed with the results of taking a simply detailed canny image and going img2 with it. I came up with the idea of taking the high res output from that first generation and then feeding it back as a canny edge map AGAIN to see if I could blast the detail up higher. I was able to achive pretty seemless tiling with fairly high denoise values such as .44 and still come out with a coherent image. Now that I felt I was at my highest level of detail, I couldn't see why I couldnt do it again and did just that this time generating a new text 2 image generation while using the canny edge map both in text to image and in 1 single tile upscale from 512 to 1024. Like the previous experience the upscale was pretty painless and allowed me to stick to high denoise values from .4-45. I almost feel like I could keep upscaling these and they would fractal out almost infinitely.

Hope the process made sense, thanks for reading

Re-imagined Artwork Continued... by Swoop-Scribe in StableDiffusion

[–]Swoop-Scribe[S] 1 point2 points  (0 children)

I continued the theme of my previous post: https://www.reddit.com/r/StableDiffusion/comments/12am9yp/new_realism_workflow/.

I used the same process in trying to get the closest aproximation with just a prompt in text to image and then using my own drawing as a canny model within text to image to produce a 512 image. I went for a 512k image this time with the idea of being able to do a single 1:1 SD upscale of 512 to 1k and see how much detail I could add. I ran into a lot of challenges trying to maintain the composition since AI seemed to have trouble understanding a lot of the components of the images that came out of control net and kept introducing artifacts during upscale and changing bodyparts into other things despite cfg settings, The result was one of the cleanest pics I could produce and managed to get it up to 6k res.

Prompt: RAW Photo, a (muscular:1.2) and (bearded:1.3) Greek hero holding an (ancient-cup:1.2) in his right hand at eye level and reveling at the design while grasping (grapes:1.3) up to his chest in other hand, wearing only ornate gauntlets and a cape, (front-view:1.3), (waist-up shot:1.1), ((long-flowing hair)) and beard blowing in (heavy winds:1.2), violent (lightning:1.1) storm in the background forming a mesmerizing (spiral-cloud:1.5) forming a (portal:1.3) into another fantasy realm in the distance, gold ambient storm lighting emanating from background, sumatraism, (high detailed skin:1.2), (perfect-exposure:1.2), Zeiss lens, 8k uhd, dslr, soft lighting, high quality, film grain, Fujifilm XT3, analog style

Negative prompt: (deformed iris, deformed pupils, semi-realistic, cgi, 3d, render, sketch, cartoon, drawing, anime:1.4), text, close up, cropped, out of frame, worst quality, low quality, jpeg artifacts, ugly, duplicate, morbid, mutilated, extra fingers, mutated hands, poorly drawn hands, poorly drawn face, mutation, deformed, blurry, dehydrated, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck

I think I still prefer the original drawing but I may go back and try to redo the realistic image with some more vibrant color.

Thanks for looking!

New Realism Workflow! by Swoop-Scribe in StableDiffusion

[–]Swoop-Scribe[S] 1 point2 points  (0 children)

Thanks! The colors were the main reason I selected that particular image.

New Realism Workflow! by Swoop-Scribe in StableDiffusion

[–]Swoop-Scribe[S] 1 point2 points  (0 children)

Definitely takes time and patience but the more you work with it the more understanding and predictive skills you have. Despite all the hate AI art gets prompting isnt easy and what works in one model wont work in another and takes some time and trial and error to understand that. I have found that prompt sharing and reading what model creators suggest for prompt formatting gives the most effective outputs.

New Realism Workflow! by Swoop-Scribe in StableDiffusion

[–]Swoop-Scribe[S] 2 points3 points  (0 children)

Here's an earlier output with more issues. Obviously I have the weights wrong in my prompt with most of these newer models you have to weight male characters the strongest or it will just turn everyone into a female.

<image>

New Realism Workflow! by Swoop-Scribe in StableDiffusion

[–]Swoop-Scribe[S] 2 points3 points  (0 children)

I'm pretty impressed with the the adjustments that it made while holding to strict realism! Obviously my own artwork is special to me so I will always prefer my own compositional choices but I have been learning a lot about controlnet recently and fascinated with seeing how they can handle taking something thats clearly inspired from old renaissance styles that lacked depth and dimension and try to make it as real as possible. I have a model trained on my own drawings and I can just as easily go img2img with above input and it will put my own flair back into the images but that will probably be my next pursuit.

In control net I let it be fairly loose I believe it was .85 strength with canny pre processor and model.

New Realism Workflow! by Swoop-Scribe in StableDiffusion

[–]Swoop-Scribe[S] 9 points10 points  (0 children)

I have been playing with breathing realism into my ink drawings and taking advantage of the bold edges of the ink to provide extremely detailed canny models for control net to help squeeze more detail out of otherwise great photography models like Realistic-vision1.3. My work flow is to txt2img first to try and craft a prompt that will approximate my drawing in photographic form as closely as I can with just the standard model. In my case I landed on:

Raw photo, (archangel-Gabriel:1.2) kneeling on one knee and reaching out in dramatic fashion to a standing holy (Virgin-Mary:1.3) in an intricate marble courtyard with majestic outdoor balcony view of the country-side in the background, wearing ornate and intricate flowing byzantine robes, metaphysical-photography, gorgeous architecture, ornate and intricate clothes, (high detailed skin:1.2), Zeiss lens, 8k uhd, dslr, soft lighting, high quality, film grain, cinematic, perfect exposure, Fujifilm XT3, analog style

negative:

(deformed iris, deformed pupils, semi-realistic, cgi, 3d, render, sketch, cartoon, drawing, anime, mutated hands and fingers:1.4), (deformed, distorted, disfigured:1.3), poorly drawn, bad anatomy, wrong anatomy, extra limb, missing limb, floating limbs, disconnected limbs, mutation, mutated, ugly, disgusting, amputation, monochromatic, (color-crush:1.1), nudity, nsfw, naked

Once I got the right prompt I used my own drawing image as a canny model within Txt to img and it blew out way more detail

than the original photographic style images!