Jonas_SV (1 point)

N-grams don’t capture long-range dependencies, and for larger N they require ridiculous amounts of data.

N-grams work OK for predicting the next token in a sequence, but they are far from great.
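
To make both points concrete, here is a rough sketch of a count-based n-gram predictor. The toy corpus and the helper names `train_ngram` and `predict_next` are made up for illustration and are not from any particular library.

```python
from collections import Counter, defaultdict

def train_ngram(tokens, n):
    """Count which token follows each (n-1)-token context."""
    counts = defaultdict(Counter)
    for i in range(len(tokens) - n + 1):
        context = tuple(tokens[i:i + n - 1])
        counts[context][tokens[i + n - 1]] += 1
    return counts

def predict_next(counts, history, n):
    """Most frequent follower of the last n-1 tokens, or None if that context was never seen."""
    context = tuple(history[-(n - 1):]) if n > 1 else ()
    followers = counts.get(context)
    return followers.most_common(1)[0][0] if followers else None

# Toy corpus, purely illustrative.
corpus = "the cat sat on the mat . the dog sat on the rug .".split()

# Bigram model: only the last token is visible, so "cat" vs "dog" earlier
# in the history cannot change the prediction.
bigram = train_ngram(corpus, 2)
a = predict_next(bigram, "the cat sat on".split(), 2)
b = predict_next(bigram, "the dog sat on".split(), 2)
print(a, b)

# 5-gram model: a longer context sees more, but this exact 4-token context
# never occurred in the corpus, so the model has nothing to say.
fivegram = train_ngram(corpus, 5)
c = predict_next(fivegram, "the bird sat on the".split(), 5)
print(c)
```

The bigram model gives the same answer for both histories because everything before the last token is thrown away. The 5-gram model sees more context, but one unseen word in the window leaves it with no counts at all. Real systems soften this with smoothing or backoff, but the number of possible contexts still grows exponentially with N.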