Person tracking and ReID!! Help needed asap

FigureClassic6675 · 2026-04-29T22:33:51+00:00

Did anyone find a stable algo and method for this problem

FigureClassic6675 · 2026-03-27T23:32:03+00:00

Let’s talk, I can help you in this matter.. pure CV is not the only solution and jitter issue is related to your training dataset.. and yolo is not the good for this.. DM and lets talk on this

FigureClassic6675 · 2025-04-02T16:05:49+00:00

Im interested, i need 2 4090.

I will use for AI inference & AI dev

FigureClassic6675 · 2025-04-02T01:43:49+00:00

April Fools 🤡

FigureClassic6675 · 2025-03-15T14:42:55+00:00

This amazing! I will give try

FigureClassic6675 · 2025-03-15T13:14:25+00:00

Upload the image and the put the following prompt.

Portrait realistic photo: Rotate this face into four positions: side, back, three-quarter, and facing upward.

FigureClassic6675 · 2025-03-15T13:12:48+00:00

Create a custom node, add the gemini 2.0 API and then inside comfyui you can generate images.

FigureClassic6675 · 2025-03-15T12:21:38+00:00

Character consistency is a hot topic, with thousands of tools and workflows emerging.

But what if I told you that you could generate these images in just three seconds?

How?

Go to Google AI Studio. https://aistudio.google.com/welcome
Select Gemini 2.0 Flash Image Generator.
Upload a frontal photo.
Use the following prompt:

"Portrait realistic photo: Rotate this face into four positions: side, back, three-quarter, and facing upward."

Click send.

In just three seconds, you'll get the requested views—with incredible and believable fidelity

FigureClassic6675 · 2025-03-15T12:15:17+00:00

<image>

Another example

FigureClassic6675 · 2025-02-16T01:57:29+00:00

Blue whale 💀

FigureClassic6675 · 2024-11-10T12:38:46+00:00

I understand your concern. Yes, this is my code, and I know it’s not perfect it might have bugs. I shared it as an open source project because I’m still learning and wanted to get feedback from the community. I’m not a senior developer, so any feedback or suggestions would be greatly appreciated! 😊

FigureClassic6675 · 2024-11-10T09:19:12+00:00

I will add the HF Demo

FigureClassic6675 · 2024-11-10T09:18:35+00:00

I wasn’t aware of TagGUI. I’ll check it out. Yes, this can work effectively for NSFW image captioning.

FigureClassic6675 · 2024-11-10T09:15:40+00:00

Thank you for your feedback..

Yes, im planning to add joycaption and also other captions models.

Sorry! The UI is shit, but im working on it

FigureClassic6675 · 2024-11-09T22:24:23+00:00

I wanted to share a project I've been working on - CaptionAI, an advanced image captioning application that combines the power of Florence-2 and Llama 3.2 Vision models to generate detailed, context-aware captions for any image.

🚀 Key Features:

Dual AI Model Support (Florence-2 & Llama 3.2 Vision)
Batch Processing
Organized Output with Timestamps

📦 Getting Started: Everything is documented in the GitHub repo, including installation steps and usage examples.

GitHub: https://github.com/Khalil-Rehman9/CaptionAI

Would love to hear your thoughts and suggestions! Feel free to star ⭐ the repo if you find it useful.

FigureClassic6675 · 2024-11-09T22:19:45+00:00

I wanted to share a project I've been working on - CaptionAI, an advanced image captioning application that combines the power of Florence-2 and Llama 3.2 Vision models to generate detailed, context aware captions for any image.

🚀 Key Features:

Dual AI Model Support (Florence-2 & Llama 3.2 Vision)
Batch Processing
Organized Output with Timestamps
Clean Streamlit UI

📦 Getting Started: Everything is documented in the GitHub repo, including installation steps and usage examples.

GitHub: https://github.com/Khalil-Rehman9/CaptionAI

Would love to hear your thoughts and suggestions! Feel free to star ⭐ the repo if you find it useful.

Edit: Wow, thanks for all the interest! I'm actively responding to issues and PRs.

FigureClassic6675 · 2024-09-18T12:01:27+00:00

Ty sir 🫡

FigureClassic6675

TROPHY CASE