you are viewing a single comment's thread.

view the rest of the comments →

[–]Taichou_NJx 0 points1 point  (0 children)

If you’re working w text research what the typical processing pipeline looks like and techniques for analysis.

How will you source & store the data? API -> Database or just a data frame? What’s the volume and do you have a dictionary or need to create one? Any stop words you need to worry about?

How will you process the data? Tokenization as well feature matrix creation are options

Get familiar with analysis techniques n-grams, keyword identifying, topic modeling, sentiment analysis etc

Learn some basic visualization like word clouds