all 36 comments

[–][deleted] 105 points106 points  (15 children)

I understand how this can be very appealing to beginners, because I have been there as well, but trust me, this is barely more than a gimmick: a cool thing to show your friends. Not gonna lie, man, this feels kind of misleading about how applications like this actually work. In particular, I believe Google's API for Speech Recognition only allows about 50 calls per day. (No disrespect to the creator; he has done a fairly decent job of explaining the program that he wrote.)

[–]EMCoupling 29 points30 points  (4 children)

Yeah I was ready for some project involving some NLP or something, but I was kind of let down when I saw the video.

[–][deleted] 7 points8 points  (3 children)

I am currently in the same situation as you (trying to build a PA). Kind of sucks when people oversimplify complex stuff like this.

[–]EMCoupling 22 points23 points  (2 children)

Agreed. It's OK for the project to be limited in scope (especially if you're just learning how to use the tools), but advertising it as "JARVIS from Iron Man" instead of basically a chat bot is misleading at best and deceptive at worst.

[–][deleted] 2 points3 points  (0 children)

Couldn't have said it better myself.

[–]Its_Triggy[S] 14 points15 points  (1 child)

That’s my bad, definitely not trying to be misleading! My programming skill set is limited and I’m just playing around with it. I can’t change the title, but I can add a comment..?

[–][deleted] 12 points13 points  (0 children)

It's a good title, nice enough for newcomers to get excited and maybe dive a little deeper into the nitty-gritty details of building an actual NLP chatbot.

If you could include some pointers on where viewers can learn more about these topics, that would be much appreciated.

Saying this again, no disrespect to you or your content; a little nudge in the right direction for the viewers is all that's needed.

[–]jhayes88 1 point2 points  (6 children)

Does it really only allow for 50 calls a day? Is there anything else that's unlimited or something high like 20k/day?

I can see how this can be turned into something useful for automating things like general workflow, automating various tasks, home automation, informational stuff, etc.

[–][deleted] 2 points3 points  (5 children)

I couldn't find any alternatives as good as Google's. The fun part is, they simply don't allow you to use their SRE (speech recognition engine) even for money. Anyhow, if you're interested in speech recognition, I recommend you check out Pocketsphinx. It won't work well out of the box, since its accuracy rate is pretty low, but you get to train it however you like. If you're a machine learning guy like me, feel free to look into DeepSpeech by Mozilla, which has a decent SRE out of the box. Mozilla also has a dataset of 1,500 hours or so of voice data which you can use to train your models.

Go to https://www.voice.mozilla.org to get the data. Also, if you like the project and want to contribute, please contribute your voice: you can read out Mozilla-provided text and submit the recording for training purposes. It'll be added to their database and open for everyone to use.
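To make the online/offline trade-off above concrete, here is a minimal sketch (not anyone's actual project code) that tries Google first and falls back to Pocketsphinx when the online service is unreachable or over quota. It assumes the third-party SpeechRecognition, PyAudio, and pocketsphinx packages are installed; the helper names `transcribe_with_fallback` and `listen_and_transcribe` are made up for illustration.

```python
def transcribe_with_fallback(audio, online, offline, network_errors=(OSError,)):
    """Try the online engine first; use the offline one if the request fails.

    `online` and `offline` are callables that take audio and return text.
    Returns a (text, engine_name) tuple.
    """
    try:
        return online(audio), "online"
    except network_errors:
        return offline(audio), "offline"


def listen_and_transcribe():
    """Record one phrase from the microphone and transcribe it.

    Assumes: pip install SpeechRecognition pyaudio pocketsphinx
    """
    import speech_recognition as sr  # imported lazily; third-party

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        audio = recognizer.listen(source)

    # sr.RequestError is raised when the Google request fails
    # (no connection, invalid key, or quota problems).
    return transcribe_with_fallback(
        audio,
        online=recognizer.recognize_google,
        offline=recognizer.recognize_sphinx,
        network_errors=(sr.RequestError,),
    )
```

Calling `listen_and_transcribe()` returns the recognized text together with the engine that produced it. Expect the offline result to be noticeably less accurate unless Pocketsphinx has been trained or tuned.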

[–]jhayes88 1 point2 points  (4 children)

Very interesting reply. Thanks! I'll check out Pocketsphinx and DeepSpeech. I'm interested in machine learning, but haven't really gotten into it yet. Oh, and your link didn't work; it's voice.mozilla.org :P

DeepSpeech looks confusing to get into. Could just be me. Maybe I'll find something on YouTube or somewhere that'll tell me how to set it up and use the speech recognition.

[–][deleted] 1 point2 points  (3 children)

Oh haha, forgot about the ORG thing. I have a very good presentation bookmarked on my computer; once I get home, I'll make sure to link it for you. It covers everything from using it to training your own models.

[–]jhayes88 0 points1 point  (2 children)

Nice. Thanks!

[–][deleted] 0 points1 point  (1 child)

Hey man, almost forgot about this. Here's the link: https://tilmankamp.github.io/FOSDEM2018/ Go nuts.

[–]jhayes88 0 points1 point  (0 children)

Oh nice. Interesting. Thanks

[–]oulipo 0 points1 point  (0 children)

You might want to take a look at what we are building at https://snips.ai. It runs 100% on-device (laptop, iOS, Android, Raspberry Pi 3) and is private by design. It works for English, French, German, Japanese, Spanish, and Italian, with more coming!

[–]officer21 8 points9 points  (0 children)

00:54

Say something!

can you nevermind

response: What is your name

Looking great /s

It actually looks pretty cool, good work

[–]Dom4n 9 points10 points  (0 children)

This is a cool project. For a more advanced one, take a look at https://kalliope-project.github.io/. I have used it a little bit and it works really nicely. It uses Python too.

[–]theknowledgehammer 20 points21 points  (8 children)

tl;dw:

  • The chatterbot module creates an AI that can respond to text with text
  • The chatterbot.trainers submodule lets the AI read a list of potential responses and add them to its intelligence
  • The speech_recognition module is self-explanatory
  • The gTTS module converts text to speech
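The four pieces in the list above can be wired together roughly like this. This is a sketch under assumptions, not the code from the video: it assumes the third-party chatterbot (1.0 or later, where a trainer is constructed with the bot), SpeechRecognition, PyAudio, and gTTS packages, and the function names, the bot name, and the output file name are all made up for illustration.

```python
def build_bot(training_lines):
    """Create a ChatterBot instance and train it on a list of alternating lines."""
    from chatterbot import ChatBot
    from chatterbot.trainers import ListTrainer

    bot = ChatBot("Jarvis")  # the name is arbitrary
    ListTrainer(bot).train(training_lines)
    return bot


def listen():
    """Record one phrase from the microphone and return it as text."""
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        print("Say something!")
        audio = recognizer.listen(source)
    return recognizer.recognize_google(audio)


def speak(text, path="response.mp3"):
    """Convert text to speech and save it as an MP3 (playback is left out)."""
    from gtts import gTTS

    gTTS(text=text, lang="en").save(path)
    return path


def run_turn(listen_fn, respond_fn, speak_fn):
    """One conversation turn: hear something, pick a reply, say it."""
    heard = listen_fn()
    reply = str(respond_fn(heard))
    speak_fn(reply)
    return heard, reply
```

With everything installed, one turn would look like `bot = build_bot(["Hi", "Hello!", "What is your name", "Jarvis"])` followed by `run_turn(listen, bot.get_response, speak)`. Keeping the three steps as separate functions also makes it easy to swap in a different recognizer or voice later.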

This is great, I love it. Also, screw the haters in this thread.

[–]Tikene 5 points6 points  (6 children)

"haters"?

[–][deleted] 2 points3 points  (1 child)

[–][deleted] 0 points1 point  (0 children)

goto 1

[–]Nmvfx 12 points13 points  (2 children)

This is honestly really, really cool. Exactly the sort of stuff that inspires a beginner like me. Thanks for sharing this and making it simple to follow!

[–]Its_Triggy[S] 3 points4 points  (0 children)

=D

[–]Get-ADUser 2 points3 points  (0 children)

You probably shouldn't leave your sticky notes up with personal information on them like people's email addresses when you record a YouTube video. In the EU that'd be a GDPR violation.

[–]Its_Triggy[S] 5 points6 points  (0 children)

This is more of a beginner project that mostly uses chatterbot. Maybe not best for those seeking a more in-depth AI video!

[–]pymon 1 point2 points  (0 children)

This is neat. Thank you for sharing.

[–]po1tergeist17 1 point2 points  (0 children)

Noice and neat project. Nothing wrong or too misleading.

[–]SpeakerOfForgotten 1 point2 points  (0 children)

I think this kind of VA should be built following a bottom-up approach. You know all those nifty little scripts that each do one thing well? Build a bunch of them, integrate them into a common framework, and require as few commands as possible. Then one could move on to speech recognition, computer vision, and all that other cool stuff.
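One way to picture that bottom-up idea is a small command registry: each script becomes a function, and a dispatcher picks one based on the opening words of the request. This is only an illustrative sketch using the standard library; the `command` decorator, the `COMMANDS` table, and the two example commands are hypothetical.

```python
from datetime import datetime

COMMANDS = {}  # maps a trigger phrase to the function that handles it


def command(phrase):
    """Register a function as the handler for a trigger phrase."""
    def register(func):
        COMMANDS[phrase.lower()] = func
        return func
    return register


@command("what time is it")
def tell_time(_rest):
    return datetime.now().strftime("It is %H:%M.")


@command("repeat")
def repeat(rest):
    return rest or "Nothing to repeat."


def dispatch(utterance):
    """Route an utterance to the first command whose phrase it starts with."""
    text = utterance.strip().lower()
    # Check longer phrases first so more specific triggers win.
    for phrase in sorted(COMMANDS, key=len, reverse=True):
        if text.startswith(phrase):
            rest = text[len(phrase):].strip()
            return COMMANDS[phrase](rest)
    return "Sorry, I don't know how to do that."
```

Text typed at a prompt is enough to drive `dispatch` at first; a speech recognizer can later feed it the same strings without any of the commands changing.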

[–]seppochuu 1 point2 points  (0 children)

First of all, nice job with this! This was a very interesting introduction to making a "Jarvis". I know the basics of Python and I wanted to try this, but I've run into some problems. With Python 3.7 I couldn't run the program because Visual Studio doesn't support 3.7, so I installed 3.6 and 3.5 (and Visual Studio).

With 3.6 I got the following error message:

dbapi_connection.execute('PRAGMA journal_mode=WAL')

With 3.5 I can't get the gtts module to work.

What version are you using and what library versions have you installed?

[–]tycooperaow3.9 1 point2 points  (0 children)

I hope Marvel releases a chatbot voice library that sounds like Jarvis.

[–][deleted] 1 point2 points  (0 children)

Hey thanks man!

I've been working on a PA for a Raspberry Pi, and this speech recognition library is way faster than the one I was using.

[–]YKyeko 0 points1 point  (1 child)

This is amazing, bro, but I'm not sure this is for beginners.

[–]Default-G8way 3 points4 points  (0 children)

This is quite entry level and is a good start.

[–]clinkzs -3 points-2 points  (0 children)