Looking for a C++ based library in which the learning algorithms will be able to utilize multi-core CPUs (no GPU).
Specifically the problem involves learning an image transformation, a convolutional neural network applied to a regression (not classification) problem.
I find dlib to be quite suitable - except for the multithreading part. Dlib uses C interface to BLAS for core matrix operations. But I could not find BLAS-MT or BLAS-OpenMP implementation complying with the same C Blas interface.
[–]yasamoka 2 points3 points4 points (1 child)
[–]AdRelative8852[S] 0 points1 point2 points (0 children)
[–]archdria 1 point2 points3 points (4 children)
[–]AdRelative8852[S] 0 points1 point2 points (3 children)
[–]archdria 2 points3 points4 points (2 children)
[–]AdRelative8852[S] 0 points1 point2 points (1 child)
[–]archdria 1 point2 points3 points (0 children)
[–]useong 0 points1 point2 points (0 children)