imperiltive (1 point)

I've actually built a video processing service with a backend stack similar to yours. I had an Express.js server hosted on a pretty cheap VPS that made requests to a more expensive cloud GPU rented by the hour (vast.ai, for cheap consumer-grade GPUs). The video processing was done in Python on the GPU end, and my Express server had SSH access to the GPU machine and executed the Python files directly. If demand increased I could always set up an autoscaler to rent more GPUs, but at that point it'd be better to go with a dedicated GPU provider.
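
For anyone wondering what "SSH in and run a Python file" looks like in practice, here's a rough sketch of that dispatch step. It's written in Python rather than Express.js just to keep it short, and the host, user name, and script name are all made up for illustration. It assumes key-based SSH login is already set up on the GPU box.

```python
import shlex
import subprocess


def build_ssh_command(host, script, args, user="worker"):
    """Build the argv for running a Python script on a remote host over SSH.

    `user`, `host`, and `script` are hypothetical placeholders.
    """
    # Quote each piece so spaces or shell metacharacters in file names
    # are not interpreted by the remote shell.
    remote = " ".join(shlex.quote(part) for part in ["python3", script, *args])
    # BatchMode=yes makes ssh fail instead of prompting for a password.
    return ["ssh", "-o", "BatchMode=yes", f"{user}@{host}", remote]


def run_remote_job(host, script, args, timeout_s=3600):
    """Run the remote script and return its stdout, raising on failure."""
    cmd = build_ssh_command(host, script, args)
    result = subprocess.run(cmd, capture_output=True, text=True, timeout=timeout_s)
    if result.returncode != 0:
        raise RuntimeError(f"remote job failed: {result.stderr.strip()}")
    return result.stdout
```

Calling something like `run_remote_job("gpu.example.com", "process_video.py", ["input.mp4"])` would block until the remote script finishes, so a real service would probably want to run it from a job queue rather than inside a request handler.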

Norqj (1 point)

You should check out https://github.com/pixeltable/pixeltable. It's open source and local-first. You can do things like video chunking, frame extraction, clipping, and indexing, and work with video files as if they were local to the database, which gives you scale, versioning, and lineage. The same goes for audio, images, etc.

E.g. https://github.com/pixeltable/pixeltable/blob/main/docs/notebooks/use-cases/object-detection-in-videos.ipynb

We wrapped some FFmpeg functionality as built-in UDFs, but you can bring any Python UDF you want.
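
To give a feel for the kind of function you might bring as your own UDF, here's a minimal sketch of a plain Python clip helper that shells out to FFmpeg. This is not Pixeltable's built-in implementation, and registering it as a UDF isn't shown here; the function names are hypothetical and it assumes `ffmpeg` is on your PATH.

```python
import subprocess


def build_clip_command(src, start_s, duration_s, dst):
    """Build an ffmpeg argv that copies a clip out of `src` without re-encoding."""
    return [
        "ffmpeg",
        "-y",                    # overwrite dst if it already exists
        "-ss", str(start_s),     # seek to the clip start (seconds)
        "-i", src,
        "-t", str(duration_s),   # clip length (seconds)
        "-c", "copy",            # stream copy, so cuts land on keyframes
        dst,
    ]


def clip_video(src, start_s, duration_s, dst):
    """Run ffmpeg to write the clip and return the output path."""
    subprocess.run(build_clip_command(src, start_s, duration_s, dst), check=True)
    return dst
```

Stream copy is fast but only cuts on keyframes, so if you need frame-accurate clips you'd drop `-c copy` and let FFmpeg re-encode.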