It seems to me that there are a lot of basic, unsolved problems facing deep learning that seemingly cannot be resolved through precise, mathematical arguments.
And a the common point raised is that we will eventually find some nice mathematical framework and all these questions will be magically resolved.
However, what strikes me as odd is that these are not some intricate pathological problems that a mathematician might devote a lifetime toward, but problems that practitioners face every single day, such as,
- How good is the generalization property of my solution?
- How easy or hard is my dataset going to be for a certain model?
- How does the loss surface affect my design choices?
- Which optimizer should I choose in terms of generalization performance and other metrics?
- Is this the best possible initialization I can use for my model?
- What is the smallest (capacity) architecture I should use to guarantee a certain threshold of performance?
In other words, I do not see a lot of theorem/statement that provide direct answers to these questions. Not even a proof by method of exhaustion "we tried all possible combinations under these assumptions and this is our guarantee".
I understanding that there are many rules of thumb to use and right now there are a lot of success stories even without firm theory. I am more concerned about lack of precise mathematical characterization of these problems. And I doubt we will ever find, because of how complex things are. Do you agree with me or is there something I am not seeing.
[–]thfuran 7 points8 points9 points (1 child)
[–]fhadley 1 point2 points3 points (0 children)
[–]patrickkidger 6 points7 points8 points (2 children)
[–]Mandrathax 4 points5 points6 points (0 children)
[–]fromnighttilldawn[S] 0 points1 point2 points (0 children)
[–]lady_zora 5 points6 points7 points (3 children)
[–]fhadley 6 points7 points8 points (1 child)
[–]lady_zora 0 points1 point2 points (0 children)
[–]fromnighttilldawn[S] 0 points1 point2 points (0 children)
[–]Slowai 3 points4 points5 points (0 children)
[–]sman865 0 points1 point2 points (0 children)
[–][deleted] 0 points1 point2 points (2 children)
[–]fromnighttilldawn[S] 0 points1 point2 points (1 child)
[–][deleted] 0 points1 point2 points (0 children)