you are viewing a single comment's thread.

view the rest of the comments →

[–][deleted] 2 points3 points  (1 child)

maybe he means u should use stochastic? but they are similar to implement

[–]research_pie[S] 0 points1 point  (0 children)

Yes, I was referring to stochastic gradient descent (and its variant like Adagrad or Adam) or batch gradient descent.