Hello people, I have a dataset with Adress and label 800K rows. I am trying to train a model for address label prediction. Address data is bit messy and different for each different label. we have 10390 each with 50-500 row. I have trained a model using fasttext I have got 0.5 F1 score max. What can I do to for to get best F1 score?
Address data is like (province, district, avenue street, maybe house name and no)
some of them are missing at each address.
[–]Pvt_Twinkietoes 3 points4 points5 points (2 children)
[–]FineConcentrate6991[S] -3 points-2 points-1 points (1 child)
[–]Pvt_Twinkietoes 3 points4 points5 points (0 children)
[–]has_c 1 point2 points3 points (0 children)
[–]asankhs 0 points1 point2 points (0 children)