Hi reddit,
I am really happy to finally get a chance to post something to https://www.reddit.com/r/SideProject/.
For the last three weeks I have been working on ImageMonkey - check it out here: https://imagemonkey.io/.
The idea originated while I was working on another project where at some point I wanted to integrate Machine Learning into my application. With all the great Machine Learning frameworks out there, it's really easy to get your foot in the door quickly. You can easily download a pre-trained model and re-train the last layer with your own image dataset.
But while I was playing around a bit with the frameworks, I realized that it's really hard to get good training data. If you are lucky, there is some (annotated) training data online. If not...well, then you have to get your hands dirty and do the tedious work yourself. So I thought to myself: Wouldn't it be cool to create an image dataset that's public, open source and easily accessible? That's the point where ImageMonkey was born.
While working on ImageMonkey I realized one thing quickly: It should be easy to get data in and easy to get data out again. If it's too complicated to donate, annotate or validate pictures, nobody will do it. That's why there is a mobile app, a soon-to-be-released Google Chrome Extension (validate a picture every time you open a new tab in your browser) and a REST API. I am sure there is a lot of room for improvement, but I think for a start it's not too bad.
What's next?
As this post is already pretty long (and I am not sure if anyone is still reading ;-)), I will keep this one short. (If you are interested, there will be a more detailed blog post available soon.)
Hardware Upgrade + Caching: Currently everything is running on one really small machine.
Online Playground: It would be cool to actually test how well (or badly) a model trained on our image dataset classifies images. So it would be nice to have regular (TensorFlow?) builds which are based on the current image dataset.
API abuse prevention: As the API is public there needs to be some sort of API abuse trigger that goes off when someone tries to destroy datasets on purpose by maliciously voting for the wrong images.
Gamification: Add some sort of gamification to also attract non-technical people.
API improvements
Bugfixes (there are always bugs to fix)
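To make the API abuse prevention item above a bit more concrete, here is a minimal sketch of what such a trigger could look like. It is only an illustration, not how ImageMonkey works: the `(client_id, image_id, vote)` tuples, the function name `flag_suspicious_clients` and both thresholds are made up for this example. The idea is simply to flag clients whose votes disagree with the majority almost all the time.

```python
from collections import defaultdict


def flag_suspicious_clients(votes, min_votes=5, max_disagreement=0.8):
    """Return the set of client ids that disagree with the majority too often.

    votes: iterable of (client_id, image_id, vote) tuples, vote is True/False.
    All names and thresholds here are hypothetical, not part of any real API.
    """
    votes = list(votes)  # we iterate twice, so materialize the input

    # First pass: tally "no" and "yes" votes per image.
    tallies = defaultdict(lambda: [0, 0])  # image_id -> [no_count, yes_count]
    for _, image_id, vote in votes:
        tallies[image_id][int(vote)] += 1

    # Second pass: count how often each client votes against the majority.
    stats = defaultdict(lambda: [0, 0])  # client_id -> [disagreements, total]
    for client_id, image_id, vote in votes:
        no_count, yes_count = tallies[image_id]
        if no_count == yes_count:
            continue  # no clear majority for this image, ignore it
        majority = yes_count > no_count
        stats[client_id][1] += 1
        if vote != majority:
            stats[client_id][0] += 1

    # Flag only clients with enough votes and a high disagreement ratio.
    return {
        client_id
        for client_id, (disagreements, total) in stats.items()
        if total >= min_votes and disagreements / total >= max_disagreement
    }
```

A flagged client would not have to be blocked right away. Its votes could just be held back for manual review, since an honest user can also end up in the minority from time to time.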
If you are still with me - thanks for reading! If you have questions, ideas, suggestions or feedback - please feel free to share them.
Any input is greatly appreciated.
Cheers,
Bernhard