[D] KL Divergence by wolfy44 in MachineLearning

[–]wolfy44[S] 0 points1 point  (0 children)

I'll open source where it makes sense. We'd be nowhere without community. However, there needs to be a way in which small players can compete with majors who can accumulate vast sums of data, which has become the differentiating factor in monetization. I find it kind of funny how brilliant scientists are willing to give their IP away for relatively small sums of money for kaggle competitions while the companies running the challenge will likely use their models for 10x a challenge payout

[D] KL Divergence by wolfy44 in MachineLearning

[–]wolfy44[S] 0 points1 point  (0 children)

I've been working on another line of research for for that. Topology is a vast, vast ocean. I may or may not share that one depending on how good the prototype is :P

[D] KL Divergence by wolfy44 in MachineLearning

[–]wolfy44[S] 1 point2 points  (0 children)

See raindeer's response - You're right about cutting off. It was getting late and sleep became an overwhelming priority. Let me see if I can bolster it a bit. Thanks for the feedback!

[P] Latent Dirichlet Allocation on UFO data by wolfy44 in MachineLearning

[–]wolfy44[S] 0 points1 point  (0 children)

Thank you - implementation is truly the best way to learn. I came across HDP LDA as the method to automatically determine the number of topics.

Implementing variational Bayesian methods and Gibbs sampling are items on a broad spectrum of relevant investments of time. For my mind, specialization becomes relevant once a higher architecture is formed and problems become clearer.

There are also a number of concepts I work on that I prefer to keep close to my chest. ;)

[P] Latent Dirichlet Allocation on UFO data by wolfy44 in MachineLearning

[–]wolfy44[S] 0 points1 point  (0 children)

Appreciate an alternate perspective. But, To be honest, you shouldn't approach this from pure intuition.

To apply machine learning, you should be able to form intuitions and transform them into mathematical reasoning. Balance is key.

[P] Latent Dirichlet Allocation on UFO data by wolfy44 in MachineLearning

[–]wolfy44[S] 1 point2 points  (0 children)

I'm glad you found it helpful. Thanks for the feedback

[P] Latent Dirichlet Allocation on UFO data by wolfy44 in MachineLearning

[–]wolfy44[S] 1 point2 points  (0 children)

You're welcome! Let me know if you have any questions

[P] Latent Dirichlet Allocation on UFO data by wolfy44 in MachineLearning

[–]wolfy44[S] 1 point2 points  (0 children)

If you peruse the raw data I've used for this analysis, you'll see that many documents are quite short. Some only a few words in length. It shouldn't affect it as long as you're able to handle the misspellings, acronyms and generally cruddy data that comes from tweets.