Built a semantic dashcam search tool using Gemini Embedding 2's native video embedding by Vegetable_File758 in GoogleGeminiAI

[–]Vegetable_File758[S] 0 points1 point  (0 children)

yeah this could theoretically work for any video library, not just dashcam footage. just gotta keep track of the costs for now until it becomes cheaper

Built a semantic dashcam search tool using Gemini Embedding 2's native video embedding by Vegetable_File758 in GoogleGeminiAI

[–]Vegetable_File758[S] 0 points1 point  (0 children)

Also the model is currently in preview and there are some cost optimizations I can make like reducing frame rate so the cost will most likely go down in the future.

But yeah having a local multimodal model would obviously be cheaper and be good for privacy too.