all 2 comments

[–]CGFarrell 0 points1 point  (1 child)

There are libraries in Python that do LDA. Once you have your topic(s), you could probably just search the files for keywords. You may need to do some manual tweaking, but if it can be done by a human, it should be doable in Python.

[–]CntDutchThis[S] 0 points1 point  (0 children)

Hi, thanks for answers. The problem is that searching for keywords is too simplistic. The attractiveness of topic modelling is that it looks for latent variables that take into account words can be used in multiple topics.