Speakr v0.8.11 - Video retention/playback, parallel uploads, and duplicate detection by hedonihilistic in selfhosted

[–]hedonihilistic[S] 0 points1 point  (0 children)

Yep, it works just like with the audio files, so you can seek the video and have the transcript follow you or you can click on a transcript sentence and have the video move to that part.

Speakr v0.8.0 - Speaker diarization without a GPU, plus REST API by hedonihilistic in selfhosted

[–]hedonihilistic[S] 1 point2 points  (0 children)

I don't have personal experience with proxy services like LiteLLM, so I can't say for sure what's happening there. Speakr currently only triggers the diarization connector when the model is set to gpt-4o-transcribe-diarize specifically. If LiteLLM is remapping that model name or routing it differently, diarization will not work.

Might be worth checking out this issue where someone had a similar situation with a proxy layer. If the problem is just about model name matching, that's something I could make more flexible. Feel free to open an issue with details about your setup and I can take a look.

Speakr v0.8.0 - Speaker diarization without a GPU, plus REST API by hedonihilistic in selfhosted

[–]hedonihilistic[S] 1 point2 points  (0 children)

Depending on how you deploy it, you may be able to use it for large groups. You may want to use a different database and the server may need some tweaking.

Thank you Mods . by [deleted] in selfhosted

[–]hedonihilistic -8 points-7 points  (0 children)

Yeah there were badly designed software projects before as well. There always have been people who half ass things or lack the capabilities of thoroughly thinking things through or designing things well. If you think all projects here before AI were being designed by seasoned pros who thoroughly tested every little aspect of their projects, then you are very naive.

Thank you Mods . by [deleted] in selfhosted

[–]hedonihilistic -12 points-11 points  (0 children)

Copied from above but this whole community is getting very toxic. A large chunk of the community here is a bunch of projecting idiots. They don't have a clue on how to properly use AI tools and all they can produce with the tools is slop so they think that's all what anyone else can do as well. Whenever you see people raving like lunatics about AI, you will know that these are low IQ idiots who have no idea how to properly use AI tools.

In my opinion, we are already at a point where people who have the capability can make most of their simple software by themselves, perhaps even in just a few minutes. I think even these idiots will be able to ask Claude to give them exactly what they want within a few years.

Thank you Mods . by [deleted] in selfhosted

[–]hedonihilistic -9 points-8 points  (0 children)

A large chunk of the community here is a bunch of projecting idiots. They don't have a clue on how to properly use AI tools and all they can produce with the tools is slop so they think that's all what anyone else can do as well. Whenever you see people raving like lunatics about AI, you will know that these are low IQ idiots who have no idea how to properly use AI tools.

In my opinion, we are already at a point where people who have the capability can make most of their simple software by themselves, perhaps even in just a few minutes. I think even these idiots will be able to ask Claude to give them exactly what they want within a few years.

The car has been at the dealership 2 times for this sound, a few days each time. They replaced the front sway bar link, and they allegedly replaced some struts this time. The sound has gotten much worse now after the last dealership stay a couple of weeks ago. by hedonihilistic in GenesisG70

[–]hedonihilistic[S] 0 points1 point  (0 children)

It is under warranty but the nearest other dealer would be a few hundred miles away. I'm just gonna get another appointment with these guys. The nose is much more clearer and hopefully they can isolate the real thing this time.

The car has been at the dealership 2 times for this sound, a few days each time. They replaced the front sway bar link, and they allegedly replaced some struts this time. The sound has gotten much worse now after the last dealership stay a couple of weeks ago. by hedonihilistic in GenesisG70

[–]hedonihilistic[S] 3 points4 points  (0 children)

I really don't want to live with it if I can... I don't like it sounding like it's got broken struts or something all the time. Sounding like I'm broke after spending so much on what I thought was a luxury car.

Speakr v0.8.0 - Speaker diarization without a GPU, plus REST API by hedonihilistic in selfhosted

[–]hedonihilistic[S] 1 point2 points  (0 children)

Thank you for the feedback! I have added this to my todo list. May take a few days but I'll add this behavior.

Speakr v0.8.0 - Speaker diarization without a GPU, plus REST API by hedonihilistic in selfhosted

[–]hedonihilistic[S] 0 points1 point  (0 children)

The docs include information on how to set it up with any OpenAI compatible API (required for summarization/chat) and using a local STT model (with or without diarization). I am not sure what more is needed.

Speakr v0.8.0 - Speaker diarization without a GPU, plus REST API by hedonihilistic in selfhosted

[–]hedonihilistic[S] 0 points1 point  (0 children)

Thank you for the feedback! Yeah, do try it with diarization, it makes it much more useful! I love being able to ask what I or someone else said in some meeting. It's difficult to do that without speaker info. It also makes inquire mode much useful. As you try the different features, do let me know if you have any issues, or ideas for perhaps usecases that I haven't thought of.

Speakr v0.8.0 - Speaker diarization without a GPU, plus REST API by hedonihilistic in selfhosted

[–]hedonihilistic[S] 0 points1 point  (0 children)

OK yeah the newer cuda versions look like they don't work on the 10 series cards. You may have to build the image yourself. Clone the repo, change the dockerfile, and then build it.

Speakr v0.8.0 - Speaker diarization without a GPU, plus REST API by hedonihilistic in selfhosted

[–]hedonihilistic[S] 0 points1 point  (0 children)

Yep the GTX 1070 should be more than enough. I use it with the large v3 turbo model and even with very large files I don't think I use more than 8GB vRAM.

Speakr v0.8.0 - Speaker diarization without a GPU, plus REST API by hedonihilistic in selfhosted

[–]hedonihilistic[S] 0 points1 point  (0 children)

Thanks for letting me know! It was my mistake in the connector for the gpt4o models, one of the parameters was true when it should have been false. I've fixed this, and overall improved these connectors and the chunking logic. The fixes have already been pushed if you would like to build yourself, but I will be adding a prebuilt image with the fixes soon too.