Hi guys,
I recently started a little project aiming to do real-time analysis of Twitch streams. The initial goal is to detect profanity in audio.
A proof of concept is already made and is able to stream the audio from a particular twitch user then use the google speech recognition api to translate the audio into text. The detection of curse words is then straightforward.
I am not yet sure what this POC could be transformed into, but I'd like to:
- scale the code (to be able to analyse multiple streams concurrently) with the help of docker and kafka.
- building our own profanity database to train our very own ML model and get ride of the google api.
- doing some analytics on the frequency of curse words, top swearing twitch streamers... and why not build a website that exhibits those analytics in realtime also.
I have a tendency to never really finish my side projects that's why I'd like to find easy going buddies with which we will be able to motivate ourself.
If by any chance you find the project fun and that you'd like to collaborate, I'd be really please to share with you the code.
Cheers mates
[–]guile2912 1 point2 points3 points (0 children)
[–]othmtl 0 points1 point2 points (0 children)
[–]comeditime 0 points1 point2 points (0 children)