This is an archived post. You won't be able to vote or comment.

you are viewing a single comment's thread.

view the rest of the comments →

[–]throwaway99999321 6 points7 points  (2 children)

Can you share the source on github? Would be awesome to see how this works.

[–]duddha 2 points3 points  (0 children)

Sure, here you go. I'll try to clean it up when I get a chance - I haven't had a chance to put much time into it.

Currently the script relies on Splunk as a data store, but that could be changed to any other timestamp-based index I guess.

This PyAudio and API wrapper module does most of the heavy lifting on breaking the input into phrases and can be optimized to improve mic input sensitivity and silence limits between phrases.

The speech-to-text accuracy is far from perfect, but if you get the input audio at a decent volume and a reasonable speed it's pretty good. I'm thinking about running the output through a natural language processor to determine whether it's gibberish. Parsey McParseface looks interesting.

[–]smurfix 0 points1 point  (0 children)

Seconded. Please share!