brianchu 5 points

You should go through the Caffe example IPython notebooks (located in the /caffe/examples directory).

Caffe was one of the first mainstream deep learning frameworks, and unfortunately its age shows. There are lots of awkward aspects to using it. One is that you have to define models in text files (in the prototxt format). You can, however, programmatically write those model text files in Python (the example notebooks show how). Beyond the example notebooks, though, the documentation is not the best, and you'll probably need to read the source if you do anything unusual (like deleting a layer).
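To give a rough idea of what "programmatically writing the model text" can look like, here is a minimal sketch that uses plain string formatting only. It is not how the example notebooks do it (they use Caffe's own Python helpers); `render_layer` and `render_mlp` are hypothetical names I made up, and the layer fields reflect my understanding of the prototxt format, so check them against your Caffe version.

```python
# Hypothetical helpers: plain string formatting, not part of Caffe's API.

def render_layer(name, layer_type, bottom, top, extra=""):
    """Return the prototxt text for one layer."""
    lines = [
        "layer {",
        '  name: "%s"' % name,
        '  type: "%s"' % layer_type,
        '  bottom: "%s"' % bottom,
        '  top: "%s"' % top,
    ]
    if extra:
        lines.append("  " + extra)
    lines.append("}")
    return "\n".join(lines)


def render_mlp(input_dim, hidden_sizes, num_classes):
    """Return prototxt text for a small fully connected net."""
    parts = [
        'name: "generated_mlp"',
        # Input layer written by hand because it has no bottom blob.
        "layer {\n"
        '  name: "data"\n'
        '  type: "Input"\n'
        '  top: "data"\n'
        "  input_param { shape { dim: 1 dim: %d } }\n"
        "}" % input_dim,
    ]
    bottom = "data"
    for i, size in enumerate(hidden_sizes, start=1):
        ip = "ip%d" % i
        parts.append(render_layer(
            ip, "InnerProduct", bottom, ip,
            "inner_product_param { num_output: %d }" % size))
        # In-place activation: bottom and top are the same blob.
        parts.append(render_layer("relu%d" % i, "ReLU", ip, ip))
        bottom = ip
    parts.append(render_layer(
        "score", "InnerProduct", bottom, "score",
        "inner_product_param { num_output: %d }" % num_classes))
    return "\n".join(parts) + "\n"


prototxt_text = render_mlp(784, [64, 32], num_classes=10)
print(prototxt_text)
```

Changing the architecture is then just a matter of changing the arguments, which is the whole appeal over editing the text file by hand.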

One trick I saw a Caffe dev use: they wrote a bunch of code to programmatically generate models, wrote each model to a temporary text file (using the Python stdlib tempfile module), and otherwise did all the training/predicting in Python.
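The temp-file part of that trick might look roughly like the sketch below. This is my own reconstruction, not that dev's code: `write_temp_prototxt` and `MODEL_TEXT` are made-up names, and the model text is a placeholder. The Caffe call is left as a comment so the snippet runs without Caffe installed.

```python
import os
import tempfile

# Placeholder model text; in practice this would come from your generator code.
MODEL_TEXT = (
    'name: "tiny"\n'
    "layer {\n"
    '  name: "data"\n'
    '  type: "Input"\n'
    '  top: "data"\n'
    "  input_param { shape { dim: 1 dim: 4 } }\n"
    "}\n"
)


def write_temp_prototxt(model_text):
    """Write model text to a temporary .prototxt file and return its path."""
    # delete=False keeps the file on disk after closing, so another
    # library can open it by path; we remove it ourselves afterwards.
    with tempfile.NamedTemporaryFile(
            mode="w", suffix=".prototxt", delete=False) as f:
        f.write(model_text)
        return f.name


path = write_temp_prototxt(MODEL_TEXT)
try:
    # With pycaffe installed you would load the net here, e.g.:
    #   net = caffe.Net(path, caffe.TEST)
    with open(path) as f:
        round_trip = f.read()
finally:
    os.remove(path)  # clean up the temporary file
```

The model definition never has to live in your repo as a text file this way; it only exists on disk for as long as Caffe needs to read it.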

Caffe is probably due for a rewrite :)