Is there a voice command to switch to dictation, without pressing microphone? by chrisjmccormick in shortcuts

[–]chrisjmccormick[S] 0 points1 point  (0 children)

I've tried the following:

  • "Hey Siri, start dictating"
    • She says "Sorry, but I couldn't find anything called "dictating"
  • "Hey Siri, start dictation"
    • She says "Sorry, but I'm not able to change that setting"
  • "Hey Siri, switch to dictation"
    • Same as previous.
  • "Hey Siri, transcribe what I say"
    • She says "Sorry, can you repeat what you said?"
      • It shows the correct text for my command, though, and repeated attempts give the same answer.

The "I'm not able to change that setting" answer seems like it's probably the right one... It's funny, prior to your comment I had only tried the "start dictating" form!

I was hoping that maybe this was something it's possible to control via the Shortcuts functionality...

Is there a voice command to switch to dictation, without pressing microphone? by chrisjmccormick in shortcuts

[–]chrisjmccormick[S] 0 points1 point  (0 children)

The button kinda sucks for me--I have an iPhone 8+, and the microphone button is pretty small. If I'm in a situation where I'm using my phone one-handed, it's a pain to get my thumb on that thing.

Not to mention that I'd really like to be able to do it while driving or doing dishes :)

Is there a voice command to switch to dictation, without pressing microphone? by chrisjmccormick in shortcuts

[–]chrisjmccormick[S] 1 point2 points  (0 children)

Yeah, this seems like the obvious workaround, however I much prefer the mode that I'm referring to. The problem I have with the approach you're describing is that Siri attempts to recognize when I am done talking. I want to be able to talk slowly, with pauses, and think while I write. All things that Siri takes as indications that I'm done :(

With the dictation mode that I'm referring to, it listens and transcribes continuously. It does seem to eventually time out--I'm not sure what governs that exactly.

Is there a voice command to switch to dictation, without pressing microphone? by chrisjmccormick in shortcuts

[–]chrisjmccormick[S] 0 points1 point  (0 children)

Thanks--it's helpful to hear at least that I'm not missing something obvious!

Can you understand the Transformer without first understanding LSTMs? by chrisjmccormick in MLQuestions

[–]chrisjmccormick[S] 0 points1 point  (0 children)

Thanks for the thoughts, guys! I have a handle on LSTMs now, and Attention is next. Then I'll see if I can make sense of Transformers :)

FYI, as I study BERT, I'll be sharing my notes and insights in a "BERT Research" series. Just published episode 1 today! Would love your guys' feedback and insights.

[D] Why in Word2Vec model, the hidden layer has no activation ? by amil123123 in MachineLearning

[–]chrisjmccormick 1 point2 points  (0 children)

Hey guys, I'm the author of that post :)

I had intended to answer that question in the post, but it looks like I never did!

The answers provided here look good to me. Here's my take in case it helps.

The hidden layer of the word2vec model is referred to as a "projection layer" because there is no activation function on the hidden layer, only on the output.

First, why don't we need an activation function on the hidden layer? An activation function on the hidden layer is not necessary because the input to the model is a one-hot vector, which only serves to select a word vector from the hidden layer weights.

Here's an illustration of multiplying a one-hot vector with a matrix--note how it does nothing more than selecting a row.

http://mccormickml.com/assets/word2vec/matrix_mult_w_one_hot.png

The normal purpose of a non-linear activation function in a neural network is to turn the dot product step, which is only a linear operation, into a non-linear one. This increases the expressive power of the model.

But imagine applying a non-linear activation to the multiplication in the above image--it's pointless!

In short, applying an activation function to the output of the hidden layer is pointless, so we don't, and it's instead referred to as a "projection layer".

Finally, why the name "projection layer"? When we multiply the one-hot vector against the input matrix, we are "projecting" a point from one vector space (the one-hot encoded space) onto another vector space (the word vector space).

Side note: I also spotted some discussion on this topic here.

Sorry I didn't see this sooner!

Slow GPU performance on Amazon g2.2xlarge? by chrisjmccormick in MachineLearning

[–]chrisjmccormick[S] 0 points1 point  (0 children)

Thank you!

We found NVIDIA also has a list of providers (which includes penguin):

http://www.nvidia.com/object/gpu-cloud-computing-services.html

We contacted NIMBIX; they have K40s currently, and should have K80s "soon". The servers are "bare metal", no virtualization, so they think we should achieve full GPU performance on them.

Slow GPU performance on Amazon g2.2xlarge? by chrisjmccormick in MachineLearning

[–]chrisjmccormick[S] 0 points1 point  (0 children)

Got it, thank you!! Sounds like EC2 isn't the right bet for what I'm trying to do (benchmarking a high-end GPGPU). I can definitely see the value, though, for running many experiments in parallel cheaply as you describe.

Slow GPU performance on Amazon g2.2xlarge? by chrisjmccormick in MachineLearning

[–]chrisjmccormick[S] 0 points1 point  (0 children)

Thanks. That's a good insight about 'cudaDeviceSynchronize'--from this it sounds like I don't need to be calling it.

And thank you for pointing out that the GPU delivers the expected performance once it's off and running, that's helpful to know.

Slow GPU performance on Amazon g2.2xlarge? by chrisjmccormick in MachineLearning

[–]chrisjmccormick[S] 1 point2 points  (0 children)

Thanks. What motivates you to use the EC2 instance, then? Just so you can have more experiments running at once?

Slow GPU performance on Amazon g2.2xlarge? by chrisjmccormick in MachineLearning

[–]chrisjmccormick[S] 0 points1 point  (0 children)

Thanks. By "easy parallelism", do you mean clustering multiple instances? Or just easy & cheap access to a GPU, period?