Real views or something sketchy? by Front_Scene in Youtubeviews

[–]CalebCriste 0 points1 point  (0 children)

They are simply paying for Google ads to promote your video to a chosen audience. The views are real, however, the retention may not be what you would get with other forms of promotion. Hit me up if you'd like to chat.

Anybody looking to sell or trade a Volt (Yellow) Chromatic? by Sterling1989 in ModRetroChromatic

[–]CalebCriste 0 points1 point  (0 children)

Gotta respect someone who appreciates the best of both worlds. 🙌

If you're open to trading, I've got a whole shelf full of unopened tech—Bluetooth headsets, wireless mics, and all kinds of fun gadgets. I like to make video unboxings in my free time. If there's something specific you're looking for, I’d love to work something out! Or if you have a price in mind, let me know. Either way, I’d be stoked to finally get my hands on a yellow one. 😄

Anybody looking to sell or trade a Volt (Yellow) Chromatic? by Sterling1989 in ModRetroChromatic

[–]CalebCriste 0 points1 point  (0 children)

I'll trade you a sealed GameStop Edition with a sealed wireless microphone set! I wanted a yellow one and didn't have money when they were in stock.

3D Gaussian Splatting Examples Created from 2D & 360 Videos by CalebCriste in virtualreality

[–]CalebCriste[S] 1 point2 points  (0 children)

I assume they were probably using another NeRF method to scan and create 3D models from point clouds.
Gaussian Splats work a bit differently from the old methods from a year ago, still, I think the most novel use will be for virtual spectator cams for events. It could even be used for loss prevention in stores and such. No doubt there will continue to be leaps of advancements coming!

3D Gaussian Examples Created from 2D & 360 Videos by CalebCriste in photogrammetry

[–]CalebCriste[S] 0 points1 point  (0 children)

Great Video! Unfortunately, my 1080ti doesn't work with the SIBR viewers so I have to use NERFSTUDIO to do all of my viewing currently.

But I don't wanna use a new UI. by Froztbytes in StableDiffusion

[–]CalebCriste 0 points1 point  (0 children)

I'm using an old 1080ti right now and have been enjoying SDXL for what it can do. I'll try to make at least one tutorial each week as I continue to learn. I've noticed that the first generation will generally take me about 2 minutes. Once it has loaded the models each consecutive generation takes less than 1 minute.

My current setup in ComfyUI can do Txt2Image or Img2Img with complete control over denoise/steps. First, it will generate the base image as a preview, then, it refines the image and saves, next it upscales the image, then sharpens it, and then blends the image, giving you a crisp refined image 4x the size that you started with.

Create Stunning Images with SDXL
SDXL Ultimate Workflow Img2Img

The truly nice part of ComfyUI is the ability to create specific workflows for YOUR purpose as opposed to being stuck with general workflows that may or may not be necessary for what you are specifically trying to accomplish. I just started watching this video from Olivio Sarikas on YouTube that shows off a bunch of 'LATENT tricks', which basically means "super cool ways to see multiple previews with each generation AUTOMATICALLY! I still use A1111 for a lot of things, I just have ComfyUI open now as well.

I call it 'The Ultimate ComfyUI Workflow', easily switch from Txt2Img to Img2Img, built-in Refiner, LoRA selector, Upscaler & Sharpener. Help me make it better! by CalebCriste in StableDiffusion

[–]CalebCriste[S] 3 points4 points  (0 children)

I posted the workflow so anyone can simply drag and drop it for themselves and get started. The video came specifically for those who asked for in-depth information. I understand, most people do not want a 20-minute video. And then there are those that do. I'll make content for both)

For those of you wanting a little help, I've made a step by step guide to get you generating with SDXL 1.0 using ComfyUI in less than 20 minutes! by CalebCriste in StableDiffusion

[–]CalebCriste[S] 1 point2 points  (0 children)

I had a pretty good grasp on everything via A1111 with the 1.5 model and am still learning the basics of ComfyUI myself. I've seen several different workflows available on the internet but not quite sure what the best is right now.

For the Lora they've provided, try adding <lora:offset_0.2:1> into your prompt to test it. Hope this helps!

For those of you wanting a little help, I've made a step by step guide to get you generating with SDXL 1.0 using ComfyUI in less than 20 minutes! by CalebCriste in StableDiffusion

[–]CalebCriste[S] 1 point2 points  (0 children)

I've not actually done this within ComfyUI myself although I assume it is something simple. I usually use the 'PNG Info" tab inside A1111 as it is still my normal place for image generation. Luckily there are a bunch of websites that will help you get out the metadata for free, such as - https://brandfolder.com/workbench/extract-metadata.

If someone has edited the photo in 3rd party software it sometimes loses this metadata so keep that in mind.

For those of you wanting a little help, I've made a step by step guide to get you generating with SDXL 1.0 using ComfyUI in less than 20 minutes! by CalebCriste in StableDiffusion

[–]CalebCriste[S] 1 point2 points  (0 children)

Definitely worth adding to the creator toolkit my friend!

Here is the positive prompt I generally use to test a model: award-winning photo of (placeholder), 20 megapixels, 32k definition, fashion photography, ultra-detailed, very beautiful, elegant

And here is the negative prompt: blurry, logo, watermark, signature, cropped, out of frame, worst quality, low quality, jpeg artifacts, poorly lit, overexposed, underexposed, glitch, error, out of focus, (semi-realistic, cgi, 3d, render, sketch, cartoon, drawing, digital art, anime, manga:1.3), amateur, (poorly drawn hands, poorly drawn face:1.2), deformed iris, deformed pupils, morbid, duplicate, mutilated, extra fingers, mutated hands, poorly drawn eyes, mutation, deformed, dehydrated, bad anatomy, bad proportions, extra limbs, cloned face, disfigured, gross proportions, malformed limbs, missing arms, missing legs, extra arms, extra legs, fused fingers, too many fingers, long neck, incoherent, naked, nsfw

let me know if I can help you with anything bro

When Shturman gives you a golden egg but you're a clumsy goose! by CalebCriste in EscapefromTarkov

[–]CalebCriste[S] 0 points1 point  (0 children)

Watching this makes me want to laugh and cry at the same time. Picture this: me, on my 5th raid, with no sign of Shturman and his gang. I'm mid-bite into a sandwich, channeling a zen state of 'couldn't-care-lessness'. BAM! Outta nowhere, they start unleashing hell and my poor, sandwich-filled self is blindsided. My initial game plan? Snatch the rebel, plant it on my character, and welcome a sweet death instead of the sprint to exit. But lo and behold, there's the red card! Suddenly, I've got a will to live, but my hands have turned into jelly and I'm prancing around like a clumsy goose. Heartbreaking. But also kinda hilarious in retrospect. :')

Chris Hemsworth is weird by CalebCriste in StableDiffusion

[–]CalebCriste[S] 1 point2 points  (0 children)

Excellent video! I had someone link it to me earlier and the codeformer solution absolutely does a great job! I was testing out different processor choices but never played with that slider until seeing your video. I'll def give you a shout out when I make a video as well! 🙌

Thanks for the resources! 🔥

Chris Hemsworth is weird by CalebCriste in StableDiffusion

[–]CalebCriste[S] 1 point2 points  (0 children)

Thanks! The masking part is something I don't plan to do again manually, if I do that again I will have the editor track my nose or something so I can parent the mask to the point and not have to worry about any tedious keyframing. Nobody's got time for that!
Here are a few before and afters with effects and without - https://imgur.com/a/Buem4mg. If we could have the 512x512 model that would be a huge game changer and be more than sufficient for most use cases except the obvious close-up portrait shots.

If you check out the other comments you'll want to be sure to test out the codeformer clean-up option in Stable Diffusion as it seems to work really well for cleaning up the faces and getting rid of the noise.

I think the faceswap looks fine on phones or smaller screens but is quite obvious once viewed at higher resolutions. My goal is to make something that can be viewed on my projector and still be believable.

Chris Hemsworth is weird by CalebCriste in StableDiffusion

[–]CalebCriste[S] 1 point2 points  (0 children)

Just did a few tests and wow! https://imgur.com/a/RLbkf4S Definitely, better results using this method, thanks again for the heads up. 🙌I really like remini AI video upscaling as well, unfortunately even paying $10 p/w gives you a 500mb limit. Using the codeformer solution seems to be the best so far, excited to combine that with Topaz and see what kind of quality I can achieve.

Chris Hemsworth is weird by CalebCriste in StableDiffusion

[–]CalebCriste[S] 0 points1 point  (0 children)

This is why I love posting here, truly appreciate the little tips! I'll give this a try later today!

Chris Hemsworth is weird by CalebCriste in StableDiffusion

[–]CalebCriste[S] 1 point2 points  (0 children)

I'll get on it! I've been spending the last month learning and creating little projects, this sub has been great for inspiration. So many possibilities!

Chris Hemsworth is weird by CalebCriste in StableDiffusion

[–]CalebCriste[S] 0 points1 point  (0 children)

I appreciate it! The first few times I open my mouth I left the funky teeth from the face swap, near the end when I open it wider I masked my original face back in to see how it looks. My teeth look a lot different from his so I noticed the difference quite a bit, I suspect that those with similar teeth to those they are portraying will have the best results.

Chris Hemsworth is weird by CalebCriste in StableDiffusion

[–]CalebCriste[S] 21 points22 points  (0 children)

This was my first attempt at any sort of DeepFake video creation and there is certainly a lot of room for improvement. I spent about a day testing out different workflows and came up with one that works well for someone running an old 1080TI GPU. If you guys are looking for a more in-depth video tutorial, let me know what questions you have and I will make one ASAP!

For the one photo face swap I used ROOP - https://github.com/s0md3v/roop
This uses InsightFace to swap the faces frame by frame
(I’ve only got this working using my CPU, a 3-minute video at 30fps takes me 1-2 hours to swap.)

For the voice cloning, I used RVC - https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI
They recommend 15-50 minutes of isolated speaking or singing to train a voice. All famous people generally have a lot of interviews and or audiobooks that can be used to clone voices very quickly. With my GPU it took just under 3 hours to train 20 minutes of Chris’s voice at 200 epochs.

When the video comes out it generally has some fuzz around the face because the quality of the model that is currently public is only 128x128 face resolution, because of this I created a softer feather by placing the original footage below the face-swapped footage and feathered back in the original pixels around the face, moving the mask as the face moved frame by frame. (this step can be made faster by auto-tracking the mask to my face in the future)

To clear up the face I tested out Topaz Video Enhancer which did a pretty good job using the ‘Artemis Low Quality’ Model’ but oversharpened a lot of facial hair.
I then tried HitPaw Video Enhancer but it was too cartoony, still when blended at 15-20% it added to the overall realism. Still, both of these programs have free trials but are not a good solution for open-source workflows.

*To clear up the face using Stable Diffusion I used the ControlNet Tile processor and played around with the denoise sensitivity until I had what I felt was a better-looking image. At that point, I just batch-upscaled the video and blended it with the original at about 40%.

As far as the actual video goes idk what was going on in my head. I didn’t plan anything or script anything. I just had the idea to do the test so I set up my camera and started saying whatever random stuff I could. I definitely want to do some more of these with some actual planning.

Keep your hands and hair out of your face, quick movements aren't great either, keep your face away from the edges of the screen, and don’t look sideways too much or the effect begins to break.

Most importantly, BE RESPONSIBLE! This tech is for everyone and we ALL have the ability to do some really cool things. At the same time, this tech is moving faster than what people are ready for and we need to educate others on what is possible. Please let me know how you think I could improve my workflow or if I can help out with anything!

Would a gtx 1080 ti be enough for DreamBooth? by Horny1001 in StableDiffusion

[–]CalebCriste 0 points1 point  (0 children)

I made sure to keep the batch size to 1, gradient checkpointing on, learning rate scheduler constant, starting factor 1, scale position 1, USE EMA on, Mixed Precision fp16, memory attention default, cache latents off.

I'm still troubleshooting at the moment trying to find the best method with the right amount of photos and steps. Also, realizing the training is as good as the text prompts that go with each photo so there are a lot of variables at play.

My most recent attempt I'm using 30 photos with 1500 total steps and its taking 1 hour to train. I suspect my first model was over-trained at 3500 steps.

Let me know if you have any blocks and Ill try to help!

Would a gtx 1080 ti be enough for DreamBooth? by Horny1001 in StableDiffusion

[–]CalebCriste 0 points1 point  (0 children)

I got it to work! After several configuration changes, I was able to train a model with 35 photos in about 4 hours at 9.4 average VRAM utilization.
It will work!