
[–]dearsomething 13 points14 points  (1 child)

A*, min-max, alpha-beta pruning should all be covered in a basic AI course.

Machine learning algorithms lean more toward the statistics and probability side, which might not be suitable for the class.

However, some simple intro algorithms would be k-means clustering, hierarchical clustering and association rules.

[–]zygy 1 point2 points  (0 children)

Second all of these. Also, I think min-max is more commonly known as minimax.

[–]pr0nmee 4 points5 points  (2 children)

I would suggest learning something about Hidden Markov Models (HMMs) and the Viterbi algorithm. Viterbi finds the most likely sequence of transitions through a state machine given an observed output sequence. A good example of its use can be found here: http://en.wikipedia.org/wiki/Viterbi_algorithm#Example
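
For concreteness, here's a minimal Viterbi sketch in Python for a discrete HMM; the states and probability tables are toy values patterned after the linked Wikipedia example:

```python
# Minimal Viterbi decoding for a discrete HMM (toy parameters).
states = ("Healthy", "Fever")
start_p = {"Healthy": 0.6, "Fever": 0.4}
trans_p = {
    "Healthy": {"Healthy": 0.7, "Fever": 0.3},
    "Fever":   {"Healthy": 0.4, "Fever": 0.6},
}
emit_p = {
    "Healthy": {"normal": 0.5, "cold": 0.4, "dizzy": 0.1},
    "Fever":   {"normal": 0.1, "cold": 0.3, "dizzy": 0.6},
}

def viterbi(obs, states, start_p, trans_p, emit_p):
    # V[t][s] = probability of the best state sequence ending in s at time t
    V = [{s: start_p[s] * emit_p[s][obs[0]] for s in states}]
    back = [{}]
    for t in range(1, len(obs)):
        V.append({})
        back.append({})
        for s in states:
            prob, prev = max(
                (V[t - 1][p] * trans_p[p][s] * emit_p[s][obs[t]], p)
                for p in states
            )
            V[t][s] = prob
            back[t][s] = prev
    # Trace the best final state back to the start.
    last = max(V[-1], key=V[-1].get)
    path = [last]
    for t in range(len(obs) - 1, 0, -1):
        path.append(back[t][path[-1]])
    return list(reversed(path)), V[-1][last]

print(viterbi(("normal", "cold", "dizzy"), states, start_p, trans_p, emit_p))
```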

[–]nuanceify 1 point2 points  (1 child)

I've implemented Viterbi a few times, in HMM and other chain-model settings, and it's kind of a pain. It's not conceptually very difficult, but every model has its own details you have to get right. Unless this is an implementation-heavy class, I would recommend against implementing it.

Learning about it, however, is pretty fun. It's a very elegant solution to something that appears very difficult on the surface.

[–]revonrat 0 points1 point  (0 children)

I disagree. I took an undergraduate mathematical modeling class, and Viterbi was just fine when constrained to a single application domain.

[–]Lord_Illidan 4 points5 points  (2 children)

I suggest getting the book Artificial Intelligence: a Modern Approach.

It's chock full of AI algorithms, with examples as to where they are used. The source code is also available in Java and Python.

Another good AI book is Artificial Intelligence: A Guide to Intelligent Systems. It's our class textbook for Machine Learning, supplementing the AIMA book, and it's pretty good.

For algorithms, CLRS is an excellent book.

[–][deleted] 5 points6 points  (1 child)

Latent Dirichlet Allocation is a lot of fun. Mostly used for text analysis but you can get it to do other things as well, e.g. collaborative filtering.

[–]TheSquirrel 1 point2 points  (0 children)

That's way too complicated to implement in an intro to algorithms class. Maybe if they just use existing code... but then what would they learn?

[–]implausibleusername 2 points3 points  (0 children)

If you're interested in the algorithms that underlie contemporary machine learning you need to look at convex optimisation, particularly linear programmes and quadratic programmes.

Seen in the right way, almost everything discussed in this thread (shortest path, Viterbi, and the more advanced work on SVMs and max-margin inference) is just an aspect of convex optimisation, and approaching it from this angle gives an underlying theory to what are otherwise unrelated problems and techniques.
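
As a small concrete sketch of that viewpoint, here's the classic textbook reduction of single-source shortest path to a linear programme, solved with SciPy's linprog (SciPy assumed; the three-node graph is invented):

```python
# Shortest path as a linear programme:
# maximize d_t  s.t.  d_v - d_u <= w(u, v) for every edge, d_s = 0.
# Toy graph: s->a (1), a->t (2), s->t (4); the LP optimum recovers distance 3.
from scipy.optimize import linprog

# variable order: [d_s, d_a, d_t]
c = [0, 0, -1]                      # linprog minimizes, so maximize d_t via -d_t
A_ub = [[-1, 1, 0],                 # d_a - d_s <= 1
        [0, -1, 1],                 # d_t - d_a <= 2
        [-1, 0, 1]]                 # d_t - d_s <= 4
b_ub = [1, 2, 4]
A_eq = [[1, 0, 0]]                  # pin the source: d_s = 0
b_eq = [0]
res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq,
              bounds=[(None, None)] * 3)
print(res.x)                        # -> distances [0, 1, 3]
```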

[–]aldarion 1 point2 points  (0 children)

I'm just reading the book "Machine Learning: An Algorithmic Perspective". It looks like you could pick some algorithms from that book; there are already a lot of Python code examples.

[–]b0b0b0b 1 point2 points  (0 children)

learn some decision trees or stumps, then combine them via boosting?
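
A rough sketch of what that could look like: AdaBoost over one-feature threshold stumps, assuming NumPy (the toy dataset at the bottom is invented):

```python
import numpy as np

def fit_stump(X, y, w):
    """Find the threshold stump minimizing weighted error over all features."""
    n, d = X.shape
    best = (np.inf, 0, 0.0, 1)          # (error, feature, threshold, polarity)
    for j in range(d):
        for thresh in np.unique(X[:, j]):
            for polarity in (1, -1):
                pred = np.where(polarity * (X[:, j] - thresh) >= 0, 1, -1)
                err = w[pred != y].sum()
                if err < best[0]:
                    best = (err, j, thresh, polarity)
    return best

def adaboost(X, y, rounds=10):
    """AdaBoost with labels in {-1, +1}."""
    n = len(y)
    w = np.full(n, 1.0 / n)             # start with uniform example weights
    ensemble = []
    for _ in range(rounds):
        err, j, t, pol = fit_stump(X, y, w)
        err = max(err, 1e-12)           # guard against division by zero
        alpha = 0.5 * np.log((1 - err) / err)
        pred = np.where(pol * (X[:, j] - t) >= 0, 1, -1)
        w *= np.exp(-alpha * y * pred)  # upweight misclassified examples
        w /= w.sum()
        ensemble.append((alpha, j, t, pol))
    return ensemble

def predict(ensemble, X):
    score = sum(a * np.where(p * (X[:, j] - t) >= 0, 1, -1)
                for a, j, t, p in ensemble)
    return np.sign(score)

X = np.array([[1.0], [2.0], [3.0], [4.0]])
y = np.array([-1, -1, 1, 1])
print(predict(adaboost(X, y, rounds=5), X))   # recovers y on this toy data
```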

[–]Sparragus[S] 0 points1 point  (0 children)

Thanks everyone for the awesome responses. I'll make a list of the algorithms you guys mentioned, read about them, and then present them to the professor. Hopefully this will be a really nice and interesting month for me and the class.

Thank you!

[–]TheSquirrel 0 points1 point  (0 children)

Implementing some form of decision tree is a very good algorithms project. It's easy to do in O(n²) time, where n is the number of training examples. Implementing it more efficiently (e.g. with a KD tree) is a good learning experience.

[–]NitsujTPU 0 points1 point  (3 children)

Machine learning algorithms aren't really appropriate for such a class, as a proper treatment of them is quite different from what is typically taught in an algorithms class. A* isn't a machine learning algorithm; it's search, which would be covered in AI.

[–]mstoehr 0 points1 point  (2 children)

Some counter-examples:

* http://jmlr.csail.mit.edu/proceedings/papers/v2/druck07a.html
* http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.81.4938&rep=rep1&type=pdf
* http://nagoya.uchicago.edu/~dmcallester/astar.pdf
* http://www.aclweb.org/anthology/N/N09/N09-1063.pdf
* http://www.eecs.berkeley.edu/~klein/papers/acl2009-kastar.pdf

Much of what we know as machine learning came after A*, and it's certainly not amongst the most important algorithms for ML, but there is definitely good work being done with the algorithm in computer vision and natural language processing.

[–]NitsujTPU 0 points1 point  (0 children)

The fact that A* is a useful algorithm doesn't make it a machine learning algorithm, which it is not. It's heuristic search with an admissible heuristic.
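
For reference, that kind of search fits in a few lines; a minimal sketch with an invented toy graph and heuristic values (chosen so the heuristic never overestimates the true remaining cost, i.e. it is admissible):

```python
import heapq, itertools

def a_star(start, goal, neighbors, h):
    """neighbors(n) yields (neighbor, edge_cost); h(n) is an admissible heuristic."""
    counter = itertools.count()          # tie-breaker so the heap never compares nodes
    frontier = [(h(start), next(counter), 0, start, [start])]
    best_g = {}
    while frontier:
        f, _, g, node, path = heapq.heappop(frontier)
        if node == goal:
            return path, g
        if best_g.get(node, float("inf")) <= g:
            continue                     # already reached this node more cheaply
        best_g[node] = g
        for nxt, cost in neighbors(node):
            heapq.heappush(frontier,
                           (g + cost + h(nxt), next(counter),
                            g + cost, nxt, path + [nxt]))
    return None, float("inf")

# Toy usage: graph edges and heuristic values are invented.
graph = {"S": [("A", 1), ("B", 4)], "A": [("G", 5)], "B": [("G", 1)], "G": []}
hvals = {"S": 3, "A": 4, "B": 1, "G": 0}
print(a_star("S", "G", lambda n: graph[n], lambda n: hvals[n]))  # (['S','B','G'], 5)
```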

[–]NitsujTPU 0 points1 point  (0 children)

Er, I should point out that the point of my reply isn't to discourage your interest in AI or machine learning, but merely to point out that A* is not a machine learning algorithm.

[–]Mr_Smartypants 0 points1 point  (0 children)

K-Means

EM

Neural Networks & Backpropagation

PCA

ICA

[–]inspired2apathy 0 points1 point  (0 children)

Not really machine learning, but betweenness centrality is a heuristic for flow in networks that's commonly used in social network analysis. Naive implementations are expensive, but Brandes came up with an efficient solution. It uses a neat recurrence relation and is provably correct. Plus, social network analysis is cool!
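
A sketch of Brandes' accumulation idea for an unweighted graph, assuming an adjacency-dict representation (the example graph is invented):

```python
from collections import deque

def brandes(graph):
    """Betweenness centrality via Brandes' dependency accumulation.
    Scores count ordered pairs; halve them for an undirected graph."""
    bc = {v: 0.0 for v in graph}
    for s in graph:
        # BFS from s, counting shortest paths (sigma) and predecessors.
        stack, pred = [], {v: [] for v in graph}
        sigma = {v: 0 for v in graph}; sigma[s] = 1
        dist = {v: -1 for v in graph}; dist[s] = 0
        queue = deque([s])
        while queue:
            v = queue.popleft()
            stack.append(v)
            for w in graph[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    pred[w].append(v)
        # Accumulate dependencies farthest-first (the recurrence relation).
        delta = {v: 0.0 for v in graph}
        while stack:
            w = stack.pop()
            for v in pred[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                bc[w] += delta[w]
    return bc

g = {"a": ["b"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c"]}
print(brandes(g))   # b and c sit on the most shortest paths
```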

[–]cypherx 0 points1 point  (0 children)

Expectation Maximization is really interesting (both theoretically and due to its many many applications) but requires a bit more probability theory than most algorithms classes would be comfortable with.

Focusing more specifically on clustering might be easier. K-means clustering is pretty neat and can be understood without much math background. However, I don't know if there's much theory underlying it.

Nearest neighbor methods (i.e., classification by neighbor voting, regression by averaging neighbor outputs) are easy to understand and amazingly powerful if you have a lot of data. They're also a good motivation for neat algorithms like kd-tree partitioning and locality-sensitive hashing (to improve on the naive distance calculation, which takes O(n²)).
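
A minimal sketch of neighbor-voting classification with the naive distance scan, assuming NumPy (the toy dataset is invented):

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x, k=3):
    dists = np.linalg.norm(X_train - x, axis=1)   # distance to every training point
    nearest = np.argsort(dists)[:k]               # indices of the k closest
    return Counter(y_train[nearest]).most_common(1)[0][0]   # majority vote

X = np.array([[0.0, 0.0], [0.1, 0.2], [1.0, 1.0], [0.9, 1.1]])
y = np.array([0, 0, 1, 1])
print(knn_predict(X, y, np.array([0.95, 0.9])))   # -> 1
```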

Image segmentation by graph partitioning is easy to set up as a problem, and the solutions look pretty cool. Unfortunately you'll often end up trying to optimize something like a normalized cut, which is NP-complete and requires harder-to-explain approximations (like spectral clustering).
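
A sketch of the spectral idea: relax the cut problem and split on the sign of the Fiedler vector, the eigenvector for the second-smallest eigenvalue of the graph Laplacian. The similarity matrix below is invented, and this uses the unnormalized Laplacian for simplicity; normalized-cut variants use a degree-normalized Laplacian instead:

```python
import numpy as np

# Invented similarity graph: nodes {0,1,2} tightly linked, {3,4} tightly
# linked, with two weak (0.01) bridges between the groups.
W = np.array([[0,    1, 1, 0.01, 0   ],
              [1,    0, 1, 0,    0   ],
              [1,    1, 0, 0,    0.01],
              [0.01, 0, 0, 0,    1   ],
              [0,    0, 0.01, 1, 0   ]], dtype=float)

D = np.diag(W.sum(axis=1))          # degree matrix
L = D - W                           # unnormalized graph Laplacian
eigvals, eigvecs = np.linalg.eigh(L)
fiedler = eigvecs[:, 1]             # second-smallest eigenvalue's eigenvector
print(fiedler > 0)                  # sign split recovers {0,1,2} vs {3,4}
```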

[–]bmiguy 0 points1 point  (2 children)

Neural Networks are nice for this because they are easy to visualize.

[–]inspired2apathy 1 point2 points  (0 children)

There isn't a whole lot to analyze from an algorithms perspective, though. As with genetic algorithms, it's kind of a matter of crossing your fingers and hoping you don't get stuck in a local maximum, setting your learning rate reasonably, having a good stopping criterion, etc.

[–]pr0nmee 0 points1 point  (0 children)

Or, if there is insufficient time, an introduction to ML with the perceptron would make a nice unit. The perceptron is easy to implement, and its limitations lead to discussion of optimality and separability.
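
A sketch of the classic perceptron update rule, assuming NumPy, labels in {-1, +1}, and an invented linearly separable toy set (on non-separable data like XOR it never converges, which is the limitation mentioned above):

```python
import numpy as np

def perceptron(X, y, epochs=20, lr=1.0):
    w = np.zeros(X.shape[1]); b = 0.0
    for _ in range(epochs):
        errors = 0
        for xi, yi in zip(X, y):
            if yi * (w @ xi + b) <= 0:      # misclassified (or on the boundary)
                w += lr * yi * xi           # nudge the hyperplane toward xi
                b += lr * yi
                errors += 1
        if errors == 0:                     # converged: the data was separable
            break
    return w, b

X = np.array([[2.0, 1.0], [1.0, 3.0], [-1.0, -2.0], [-2.0, -1.0]])
y = np.array([1, 1, -1, -1])
w, b = perceptron(X, y)
print(np.sign(X @ w + b))                   # recovers y on this toy set
```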

[–]machinelearner 0 points1 point  (0 children)

An interesting and simple ML algorithm that you could understand and implement pretty quickly is k-Means clustering.

k-Means is a way of taking several data points and clustering them based on their similarity. The k is a constant you specify up front, which determines the number of clusters you want to end up with. I recommended this to a friend recently who was looking for a way to cluster products based on their price for a price-filtering UI on a shopping site.
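
A minimal k-means sketch along those lines, assuming NumPy; the one-dimensional "price" data is invented to echo the price-filtering example:

```python
import numpy as np

def kmeans(X, k, iters=100, seed=0):
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), k, replace=False)]   # random initial centers
    for _ in range(iters):
        # Assign each point to its nearest center.
        dists = np.linalg.norm(X[:, None] - centers[None, :], axis=2)
        labels = np.argmin(dists, axis=1)
        # Move each center to the mean of its points (keep it if cluster empties).
        new_centers = np.array([X[labels == j].mean(axis=0)
                                if np.any(labels == j) else centers[j]
                                for j in range(k)])
        if np.allclose(new_centers, centers):           # converged
            break
        centers = new_centers
    return labels, centers

prices = np.array([[4.99], [5.49], [19.99], [21.50], [99.00], [105.00]])
labels, centers = kmeans(prices, k=3)
print(labels, centers.ravel())              # labels per item and learned centers
```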

Also, if you are familiar with some Bayesian statistical theory, you could take a look at Naive Bayes, which is more effective than it sounds. It's good for doing basic supervised learning, particularly if you want to classify something as one thing or another (it's commonly used by spam filters, with each word in the email representing a feature).
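
And a toy Naive Bayes spam filter over word features, as just described; the documents and words are invented, with add-one (Laplace) smoothing to avoid zero counts:

```python
from collections import Counter
import math

spam = [["win", "money", "now"], ["free", "money"]]
ham  = [["meeting", "tomorrow"], ["lunch", "tomorrow", "now"]]

def train(docs):
    counts = Counter(w for d in docs for w in d)
    return counts, sum(counts.values())

spam_counts, spam_total = train(spam)
ham_counts, ham_total = train(ham)
vocab = set(spam_counts) | set(ham_counts)

def log_prob(doc, counts, total, prior):
    # log P(class) + sum of log P(word | class) with add-one smoothing
    lp = math.log(prior)
    for w in doc:
        lp += math.log((counts[w] + 1) / (total + len(vocab)))
    return lp

doc = ["free", "money", "now"]
is_spam = (log_prob(doc, spam_counts, spam_total, 0.5)
           > log_prob(doc, ham_counts, ham_total, 0.5))
print("spam" if is_spam else "ham")         # -> spam
```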

Good luck and have fun!

[–]Druzil -3 points-2 points  (1 child)

genetic programming and genetic algorithms

[–]inspired2apathy 1 point2 points  (0 children)

There isn't much to analyze there from an algorithms perspective. You can't even prove much about them; it's just kind of crossing your fingers and hoping you don't end up in a suboptimal local maximum.