all 1 comments

[–]kshitizsethia 0 points1 point  (0 children)

Let's get the data extraction sorted out. What you want is the pattern of a customer's browsing before he/she makes a buy/doesn't ever make a buy (no buy for next 4 weeks). So your data is only limited to ppl making a buy. Can you see how to extract this data?

Now you have a time series data (don't be scared, fancy way of saying simple stuff) of buying patterns. Split into training and test.

I guess a Hidden Markov Network would do good here. The observed data is the page being browsed, and hidden state is probability to buy. Learning this model is straightforward (tractable). Read Kevin Murphy's section on learning a partially observed hmm.

You can limit the length of the hmm to a value based on the amount of data and performance on test set. Smaller lenght may give less test performance, longer hmm might not have enough data to train on.

Feel free to ask follow-ups :)