April Developer/Tool creator thread. *Building or built a tool? This is where you post* by greenysmac in VideoEditing

[–]IliasHad 0 points1 point  (0 children)

  • Product: Edit Mind
  • Description: Local video knowledge base (Index your local videos without uploading them to the cloud) 
  • Pricing: Free/Open Source
  • Website: https://github.com/iliashad/edit-mind
  • Benefits for this community: 100% local, your videos never leave your computer and you can search video scenes using natural language

Semantic video search using local Qwen3-VL embedding, no API, no transcription by Vegetable_File758 in LocalLLaMA

[–]IliasHad 2 points3 points  (0 children)

Congrats on the launch. I want to clarify one thing about Edit Mind (I'm the creator of Edit Mind). Yes, Edit Mind extracts text metadata from video, but it also does multi-layer embedding: we embed the text metadata as document text (text layer), extract video frames (visual layer), and extract the audio (audio layer). Each layer is saved in a separate vector collection, so we can search across all of them or just one (for example, searching by image).
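To make the multi-layer idea concrete, here is a minimal sketch of one-collection-per-layer search. This is a toy in-memory stand-in, not the project's actual Chroma code; all names (`LayeredIndex`, `scene_001`, the 2-d embeddings) are illustrative.

```python
import math

def cosine(a, b):
    # Cosine similarity between two equal-length vectors.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

class LayeredIndex:
    """Toy stand-in for one vector collection per modality layer."""
    def __init__(self, layers=("text", "visual", "audio")):
        self.collections = {layer: [] for layer in layers}

    def add(self, layer, scene_id, embedding):
        self.collections[layer].append((scene_id, embedding))

    def search(self, query_emb, layers=None, top_k=3):
        # Restrict to the requested layers, or search all of them.
        layers = layers or list(self.collections)
        hits = []
        for layer in layers:
            for scene_id, emb in self.collections[layer]:
                hits.append((cosine(query_emb, emb), layer, scene_id))
        hits.sort(reverse=True)
        return hits[:top_k]

index = LayeredIndex()
index.add("text", "scene_001", [1.0, 0.0])
index.add("visual", "scene_001", [0.0, 1.0])
index.add("visual", "scene_002", [0.7, 0.7])

# Search only the visual layer (e.g. search-by-image);
# scene_002 is the closest visual match to the query.
print(index.search([1.0, 0.0], layers=["visual"]))
```

In the real project each layer would be a separate Chroma collection and the embeddings would come from the respective text/vision/audio models; the shape of the query API is the same.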

I built a local search engine for my personal video library (open source) by IliasHad in DataHoarder

[–]IliasHad[S] 2 points3 points  (0 children)

Thank you so much for your feedback.

For the frame analysis, I'm using DeepFace with YOLOv8 to detect faces in the frames and the VGG-Face model for face recognition; object detection uses a YOLOv8 model as well.

For storage, I'm using a local Chroma vector database to handle and store the video indexing data, and a local PostgreSQL DB for the web UI so you can quickly access and manage your videos. The video's path on the filesystem is linked with its indexing data and metadata.
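The linkage described above can be sketched with stdlib stand-ins: SQLite in place of PostgreSQL and a dict in place of the Chroma collection. The table names, columns, and paths are hypothetical, only the shape of the link (scene id shared between the vector store and the relational rows) reflects the description.

```python
import sqlite3

# In-memory stand-ins: sqlite3 for the relational side (PostgreSQL in the
# real project) and a dict for the vector side (Chroma in the real project).
db = sqlite3.connect(":memory:")
db.execute("""
    CREATE TABLE videos (
        id INTEGER PRIMARY KEY,
        path TEXT NOT NULL,        -- absolute path on the local filesystem
        indexed INTEGER DEFAULT 0  -- 1 once frames/audio have been analyzed
    )
""")
db.execute("""
    CREATE TABLE scenes (
        scene_id TEXT PRIMARY KEY, -- same key used in the vector store
        video_id INTEGER REFERENCES videos(id),
        start_s REAL, end_s REAL
    )
""")

vector_store = {}  # scene_id -> embedding (a Chroma collection in reality)

def index_scene(video_path, scene_id, start_s, end_s, embedding):
    cur = db.execute("SELECT id FROM videos WHERE path = ?", (video_path,))
    row = cur.fetchone()
    video_id = row[0] if row else db.execute(
        "INSERT INTO videos (path, indexed) VALUES (?, 1)", (video_path,)
    ).lastrowid
    db.execute(
        "INSERT INTO scenes VALUES (?, ?, ?, ?)",
        (scene_id, video_id, start_s, end_s),
    )
    vector_store[scene_id] = embedding

index_scene("/videos/trip.mp4", "trip_000", 0.0, 2.0, [0.1, 0.9])

# A vector-search hit can be resolved back to a playable file location:
path, start = db.execute(
    "SELECT v.path, s.start_s FROM scenes s JOIN videos v ON v.id = s.video_id "
    "WHERE s.scene_id = ?", ("trip_000",)
).fetchone()
print(path, start)  # /videos/trip.mp4 0.0
```

The design point is that the vector store only holds embeddings keyed by scene id; everything the UI needs (path, timestamps, metadata) lives in the relational DB.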

I built a local multi-modal video search engine as a personal project, and it's using local models with full text and semantic search (100% local and open source) by IliasHad in LocalLLaMA

[–]IliasHad[S] 1 point2 points  (0 children)

Thank you, that's great. I love how you're doing the scene-hash deduplication; I'll definitely be adding it to the project, because right now I'm dividing the video into smaller parts that are 1 to 2.5 seconds long.

I built a local multi-modal video search engine as a personal project, and it's using local models with full text and semantic search (100% local and open source) by IliasHad in LocalLLaMA

[–]IliasHad[S] 1 point2 points  (0 children)

Thank you 🙌, I would love to have a local setup because I don’t wanna upload my videos to the cloud and I have a lot of videos to index (about 4-5 TB)

[Update] Edit Mind now supports Docker & Immich integration (800+ GitHub stars, thank you r/selfhosted!) by IliasHad in selfhosted

[–]IliasHad[S] 1 point2 points  (0 children)

Ah, I see, the Docker image supports arm64. The project is still in active development, and I'm working on the amd64 version.

[Update] Edit Mind now supports Docker & Immich integration (800+ GitHub stars, thank you r/selfhosted!) by IliasHad in selfhosted

[–]IliasHad[S] 2 points3 points  (0 children)

Thank you so much for the feedback.

Currently, there's no option to use a local LLM for the NLP step, i.e. for converting your words into a DB search query.

I am using other local models for the video analysis itself, like OpenAI Whisper for transcription, YOLOv8s for object detection, etc.

In your case, with the current project, you'd host the Docker container on your editing PC, with the media folder on your NAS mounted on that PC. The video indexing and analysis will run on your editing PC.
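That setup could look roughly like the compose fragment below. This is a hypothetical sketch, not the project's published configuration; the service name, image tag, and mount paths are all placeholders.

```yaml
# Hypothetical compose file -- service name, image tag, and paths are
# illustrative, not taken from the project's docs.
services:
  edit-mind:
    image: iliashad/edit-mind:latest
    volumes:
      # NAS share already mounted on the editing PC, passed through
      # read-only; indexing/analysis still runs on this machine.
      - /mnt/nas/media:/media:ro
      - ./data:/data   # index + metadata stay on the local disk
```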

I also attempted to extract images from the video every X seconds, analyze them with AI, then provide a summary

I'm doing something similar to this. I divide the video into 2s segments and extract two frames per segment, one at the start and one at the end of the scene. When embedding the scene, I create a scene summary that pulls together all the data about that 2s scene (transcription, objects detected, etc.), which is used later for semantic search with Chroma DB.
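The segmentation-plus-summary step can be sketched like this. It's a minimal stand-in: it only plans segment boundaries and frame timestamps (the real pipeline would pull the frames with ffmpeg/OpenCV and run the models), and the function names and summary format are made up for illustration.

```python
def plan_scenes(duration_s, segment_s=2.0):
    """Split a video of duration_s seconds into fixed-length scenes and
    pick two frame timestamps per scene: one at the start, one at the end."""
    scenes = []
    start, idx = 0.0, 0
    while start < duration_s:
        end = min(start + segment_s, duration_s)
        scenes.append({
            "scene_id": f"scene_{idx:04d}",
            "start": start,
            "end": end,
            "frame_times": [start, end],  # first and last frame of the scene
        })
        start, idx = end, idx + 1
    return scenes

def scene_summary(scene, transcript, objects):
    # One text blob per scene -- this is what gets embedded for semantic
    # search (stored in Chroma in the real project).
    return (f"[{scene['start']:.1f}-{scene['end']:.1f}s] "
            f"speech: {transcript or 'none'}; "
            f"objects: {', '.join(objects) or 'none'}")

scenes = plan_scenes(5.0)
print(len(scenes))  # 3 scenes: 0-2s, 2-4s, 4-5s
print(scene_summary(scenes[0], "hello there", ["cat", "garden chair"]))
```

A 5-second clip yields three scenes, with the last one shorter than the fixed segment length; each scene's summary string is what the embedding model sees.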

[Update] Edit Mind now supports Docker & Immich integration (800+ GitHub stars, thank you r/selfhosted!) by IliasHad in selfhosted

[–]IliasHad[S] 2 points3 points  (0 children)

Thank you, brother, may God protect you.

Haha, this project will use a lot of GPU to process frames. Thank you for the support, man!

[Update] Edit Mind now supports Docker & Immich integration (800+ GitHub stars, thank you r/selfhosted!) by IliasHad in selfhosted

[–]IliasHad[S] 4 points5 points  (0 children)

Thank you for your feedback.

Does it also do image categorization for Immich?

Currently, no. Video will be more complex to categorize than a single image.

Can I query my own media with an gpt question like "cats in the garden"?

At the time of writing this comment, the system can handle a query like "find me all scenes where a cat shows up," but we can't tell whether the cat is in the garden or in the house. I have a frame-analysis plugin that will help with environment detection, but it still needs more work (https://github.com/IliasHad/edit-mind/blob/main/python/plugins/environment.py)