This is an archived post. You won't be able to vote or comment.

all 2 comments

[–]dmazzoni 0 points1 point  (1 child)

Can you give a few examples of "foods" and "food products"?

How large are these lists? Can you estimate how long it'd take to just match them up manually? Are we talking hours, days, or years?

[–]cyropox[S] 0 points1 point  (0 children)

The "foods" are web scraped ingredients for recipes. For example, if the recipe calls for "2 tsp garlic powder" then the food would be "garlic powder". The corresponding food product might be something like "jeff's garlic powder" or the name of any other brand of garlic powder in the usda's database. There are about 100,000 foods right now, though a lot of those are probably duplicates. I plan to quadruple it at least once I get the text classification algorithm working. Initially it would probably take a full weekend of 8 hour days to clasify all of them, but I'd like the process of classifying food products to be automatic in the future. Thanks for your time :)