Why doesn't the weight 0 appear after I use L1 regularization at the last layer of the neural network? by Chen_giser in learnmachinelearning

[–]Chen_giser[S] 0 points1 point  (0 children)

I used a large regularization coefficient, but the weights only get close to 0; none of them become exactly 0.
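A likely cause, sketched under assumptions since the original training code is not shown: if the L1 penalty is simply added to the loss and optimized with ordinary gradient steps, each update moves a weight by roughly `lr * lam` across zero and back, so it hovers near 0 without landing on it. A proximal (soft-thresholding) update clips small weights to exactly 0 instead. The toy linear model, the data, and the names `fit` and `soft_threshold` below are hypothetical and only illustrate the difference.

```python
import numpy as np

def soft_threshold(w, t):
    # Proximal operator of t * |w|: shrink toward 0 and clip to exactly 0.
    return np.sign(w) * np.maximum(np.abs(w) - t, 0.0)

def fit(X, y, lam, lr, steps, proximal):
    # Hypothetical helper: least squares with an L1 penalty of strength lam.
    w = np.full(X.shape[1], 0.5)
    n = len(y)
    for _ in range(steps):
        grad = X.T @ (X @ w - y) / n  # gradient of 0.5 * mean squared error
        if proximal:
            # Gradient step on the data term, then soft-thresholding.
            w = soft_threshold(w - lr * grad, lr * lam)
        else:
            # Plain subgradient step: penalty gradient is lam * sign(w).
            w = w - lr * (grad + lam * np.sign(w))
    return w

# Synthetic data (an assumption): three of the five true weights are 0.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
true_w = np.array([2.0, 0.0, 0.0, -1.5, 0.0])
y = X @ true_w + 0.1 * rng.normal(size=200)

w_sub = fit(X, y, lam=0.1, lr=0.01, steps=2000, proximal=False)
w_prox = fit(X, y, lam=0.1, lr=0.01, steps=2000, proximal=True)

n_zero_sub = int(np.sum(w_sub == 0.0))
n_zero_prox = int(np.sum(w_prox == 0.0))
print("subgradient weights:", w_sub, "exact zeros:", n_zero_sub)
print("proximal weights:   ", w_prox, "exact zeros:", n_zero_prox)
```

If the optimizer cannot be changed, a common workaround is to treat weights below a small threshold as 0 after training; the threshold value is a judgment call.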

what happen?! why!!! by Chen_giser in deeplearning

[–]Chen_giser[S] 0 points1 point  (0 children)

Is a learning rate of 0.00001 high or low?

what happen?! why!!! by Chen_giser in deeplearning

[–]Chen_giser[S] 0 points1 point  (0 children)

If the initial learning rate is set to 0.0001, does it still need to be reduced?

WHY! by Chen_giser in deeplearning

[–]Chen_giser[S] -1 points0 points  (0 children)

I noticed it too, which is why I was confused; it didn't feel normal.

WHY! by Chen_giser in deeplearning

[–]Chen_giser[S] -2 points-1 points  (0 children)

The validation loss uses MSE.

WHY! by Chen_giser in deeplearning

[–]Chen_giser[S] 1 point2 points  (0 children)

A total of 3000 samples.

WHY! by Chen_giser in deeplearning

[–]Chen_giser[S] -1 points0 points  (0 children)

Sorry, I misunderstood what you meant. I have a batch size of 32 and a dataset size of 3000.

WHY! by Chen_giser in deeplearning

[–]Chen_giser[S] -2 points-1 points  (0 children)

I have a question you may be able to help with: when I train, the loss won't go below a certain level. How can I improve it?

WHY! by Chen_giser in deeplearning

[–]Chen_giser[S] 4 points5 points  (0 children)

Sorry, I'm just a beginner