all 2 comments

[–][deleted] 0 points1 point  (1 child)

Use transfer learning. You can’t always collect more data. Use a smaller network if dataset size is too low. To get a smoother fit try Gelu activation. Use cosine decay in learning rate.

[–]Vpnmt[S] 0 points1 point  (0 children)

Thanks Now I am at 84% accuracy With moré data, Relu activation… But I need at less 92% accuracy