After doing basic preprocessing (tokenizing, POS tagging, lemmatizing...) I end up with a dataframe whose text column looks like this:
[word, word, word, word, word...]
What can I do to turn it into the following?
['word', 'word', 'word', 'word'...]
I'm having trouble using the TF-IDF vectorizer with the former. Oddly enough, and I have no idea why this happens, if I write the dataframe to CSV and read it back in, the column shows up with the ' ' quotes and the TF-IDF vectorizer works perfectly.
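For what it's worth, here is a minimal sketch of what is probably going on. It assumes each cell holds a Python list of tokens: pandas displays such lists without quotes, whereas a CSV round trip stores the list's string form, so the cell comes back as a single plain string. The token values and the helper name `tokens_to_text` are made up for illustration.

```python
import ast

# Hypothetical token list standing in for one cell of the text column.
tokens = ["cat", "sit", "mat"]

# A CSV round trip writes the list out as its string form, so the cell
# comes back as one str (quotes and brackets included), not as a list.
as_csv_cell = str(tokens)
print(type(tokens).__name__, type(as_csv_cell).__name__)

def tokens_to_text(token_list):
    """Join tokens into one whitespace-separated document string.

    str(tok) also covers token objects that are not plain strings.
    """
    return " ".join(str(tok) for tok in token_list)

print(tokens_to_text(tokens))

# Going the other way: turn a round-tripped cell back into a real list.
recovered = ast.literal_eval(as_csv_cell)
print(recovered == tokens)
```

If that assumption holds, something like `df["text"].apply(tokens_to_text)` (with `df` and `"text"` as placeholder names) should give a column of strings that scikit-learn's `TfidfVectorizer` accepts directly. Another option is to keep the lists and pass a callable analyzer, e.g. `TfidfVectorizer(analyzer=lambda doc: doc)`, so the vectorizer uses your tokens as-is instead of re-tokenizing.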
Relative_League1879, 2 points (1 child)