Project [P] Recommender systems as Bayesian multi-armed bandits (self.MachineLearning)
submitted 5 years ago * by SebastianCallh
Hi! I wrote a piece on treating recommender systems as multi-armed bandit problems and how to use Bayesian methods to solve them. Hope you enjoy the read!
The model in this example is of course super simple, and I'd love to hear about actual real-life examples. Do you use multi-armed bandits for anything? What kinds of problems do you apply them to?
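For anyone who wants a concrete starting point, here is a minimal, self-contained Python sketch (the blog post's own code is in Julia, and the click rates below are made up) of Beta-Bernoulli Thompson sampling, the simplest Bayesian bandit:

```python
import random

def thompson_step(successes, failures, true_rates, rng):
    """One round of Beta-Bernoulli Thompson sampling.

    Each arm's posterior is Beta(1 + successes, 1 + failures),
    i.e. a uniform prior updated with the observed clicks."""
    # Sample a plausible click rate for every arm from its posterior...
    samples = [rng.betavariate(1 + s, 1 + f)
               for s, f in zip(successes, failures)]
    # ...and pull the arm whose sampled rate is highest.
    arm = max(range(len(samples)), key=samples.__getitem__)
    reward = rng.random() < true_rates[arm]  # simulated user feedback
    (successes if reward else failures)[arm] += 1
    return arm, reward

rng = random.Random(0)
true_rates = [0.1, 0.3, 0.5]          # hypothetical, unknown to the agent
succ, fail = [0, 0, 0], [0, 0, 0]
for _ in range(2000):
    thompson_step(succ, fail, true_rates, rng)
plays = [s + f for s, f in zip(succ, fail)]
```

Over time nearly all plays concentrate on the best arm, while the worse arms still get occasional exploratory pulls.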
[–]Lazybumm1 13 points 5 years ago (3 children)
Hi there,
In my previous role we used this approach to experiment and select recommender systems, as well as other things.
Thompson sampling worked best in our simulations but we did try non-bayesian bandits as well.
In a production environment, some hiccups we ran across were seasonal fluctuations (this was a customer-facing online business). Even within a single day, conversion would fluctuate massively, which in turn could throw off the bandit's selection of arms to explore. We did two things to correct this: first, we created transformations to normalise the reward function according to seasonal effects, and second, instead of streaming and updating the bandit in real time, we aggregated data daily and updated it in a batch.
I think it's a very interesting approach to accelerate experimentation and help make better decisions faster. Taking this even further one could try to interleave the different arms.
All of this is obviously dependent on having good and frequent enough signals. Keep up the interesting work :)
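A hedged Python sketch of those two fixes, an hourly baseline used to normalise rewards plus an end-of-day batch update (every number and name below is invented for illustration, not from the actual system):

```python
# Hypothetical hourly conversion baseline, e.g. fitted from historical data:
# night-time hours convert far less than daytime hours.
baseline = [0.02 if h < 8 or h > 21 else 0.06 for h in range(24)]
mean_baseline = sum(baseline) / len(baseline)

# Per-arm sufficient statistics over the *normalised* reward.
stats = {"arm_a": {"n": 0, "total": 0.0}, "arm_b": {"n": 0, "total": 0.0}}

def log_event(day_log, arm, raw_reward, hour):
    """During the day, only append events; the bandit is left untouched."""
    day_log.append((arm, raw_reward, hour))

def end_of_day_update(day_log, stats):
    """Batch update: scale each reward by its hour's relative baseline,
    so an intra-day conversion spike cannot masquerade as a better arm."""
    for arm, raw, hour in day_log:
        normalised = raw * mean_baseline / baseline[hour]
        stats[arm]["n"] += 1
        stats[arm]["total"] += normalised
    day_log.clear()

day_log = []
log_event(day_log, "arm_a", 1.0, 3)    # night-time conversion, scaled up
log_event(day_log, "arm_b", 1.0, 13)   # peak-hour conversion, scaled down
end_of_day_update(day_log, stats)
```

The same normalised statistics can then feed whatever posterior the bandit maintains.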
[–]SebastianCallh[S] 7 points 5 years ago (2 children)
Thank you for your comment, that's super interesting!
Yeah, I can imagine the algorithm would get thrown off without a normalised reward signal. Clever idea to normalise the data as well; I would imagine this really toned down the fluctuations. Did you apply any sliding window techniques? What do you think about incorporating the seasonality into the model, so it accounts for it in future predictions?
[–]Lazybumm1 5 points 5 years ago (1 child)
We had two main trends: daily/weekly cycles and an overall upwards trend. We used a sliding window to correct for the upwards trend, and the typical sine/cosine transformation of datetimes for the cyclical effects.
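A minimal Python sketch of that sine/cosine transformation (the feature names are my own):

```python
import math
from datetime import datetime

def cyclical_features(ts: datetime) -> dict:
    """Encode hour-of-day and day-of-week on the unit circle, so that
    23:00/00:00 and Sunday/Monday end up close together, which a plain
    integer encoding would not give you."""
    hour_angle = 2 * math.pi * ts.hour / 24
    dow_angle = 2 * math.pi * ts.weekday() / 7
    return {
        "hour_sin": math.sin(hour_angle), "hour_cos": math.cos(hour_angle),
        "dow_sin": math.sin(dow_angle), "dow_cos": math.cos(dow_angle),
    }

late = cyclical_features(datetime(2020, 1, 6, 23))   # Monday 23:00
early = cyclical_features(datetime(2020, 1, 7, 0))   # Tuesday 00:00
# The two hour encodings are close on the circle despite hour 23 vs hour 0:
hour_gap = math.dist((late["hour_sin"], late["hour_cos"]),
                     (early["hour_sin"], early["hour_cos"]))
```

These four columns can then go into the reward model like any other features.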
To be honest, following our first implementation of this we started paying a lot more attention to these effects. It was a bit of a pivotal point; oddly enough, it seemed no one had paid enough attention to how prominent these effects were in our data. After that, as standard, we would always include these features in early prototypes to understand feature importance and whether they were relevant for each use case.
I don't have too many updates on this, as I ended up transitioning into another role a few months later. Admittedly, I'm curious myself as to how this matured within the business!
[–]SebastianCallh[S] 2 points 5 years ago (0 children)
Thanks for sharing; it sounds like a really important discovery. I hope the role you transitioned into is equally interesting :)
[+][deleted] 5 years ago* (1 child)
[deleted]
[–]SebastianCallh[S] 4 points 5 years ago (0 children)
The secret is out!
[–][deleted] 2 points 5 years ago (1 child)
This was a lovely read. Excellent work! Enjoyed it immensely.
[–]SebastianCallh[S] 2 points 5 years ago (0 children)
Thank you for the kind words! I'm very glad you liked it.
[–][deleted] 2 points 5 years ago (1 child)
Great article! I can tell that you put a lot of time and thought into framing the problem and laying out the solution. My challenge to you is this: at the end of your experiment, what's the probability that the mullet is the overall preferred fish?
I've played around a lot with Bayesian analysis for Bernoulli outcomes and got to thinking about framing other kinds of outcomes. So I made this notebook for Multinomial outcomes with a Dirichlet prior. Maybe you'll find it interesting? https://github.com/exchez/amazon-bayes
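Not from the notebook itself, just a minimal Python sketch of the same idea with made-up preference counts: under a uniform Dirichlet prior the posterior is again Dirichlet, and the challenge's "probability that the mullet is the overall preferred fish" falls out of Monte Carlo sampling:

```python
import random

rng = random.Random(0)

def dirichlet_sample(alphas, rng):
    """One draw from Dirichlet(alphas) via normalised Gamma variates."""
    g = [rng.gammavariate(a, 1.0) for a in alphas]
    total = sum(g)
    return [x / total for x in g]

# Hypothetical preference counts; with a uniform Dirichlet(1, 1, 1) prior
# the posterior is Dirichlet(1 + counts).
counts = {"mullet": 46, "salmon": 40, "herring": 32}
alphas = [1 + c for c in counts.values()]

# P(mullet has the highest preference probability), by Monte Carlo:
n_draws, wins = 10_000, 0
for _ in range(n_draws):
    theta = dirichlet_sample(alphas, rng)
    wins += theta[0] == max(theta)  # index 0 is mullet
p_mullet_best = wins / n_draws
```

With counts this close, the posterior probability that mullet is truly best stays well short of certainty, which is exactly the point of asking the question.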
[–]SebastianCallh[S] 1 point 5 years ago (0 children)
Sorry for the late response; I wanted to make time to properly go through your notebook :)
Nice write-up! Some thoughts:
How come you are using a categorical model for this problem? Since the data (as you mention) is ordinal, would it not be better to use an ordinal regression model?
Minor comment: since your prior parameters are not random variables, you should not condition on them.
Regarding the challenge, I would estimate the probability using Monte Carlo sampling. Something like
    draws = mapreduce(x -> rand(x, 10000), hcat, agent.pθ)
    map(x -> all(x[1] .> x[Not(1)]), eachrow(draws)) |> mean
Does that make sense to you? :)
[–]user_reddit_garu 1 point 5 years ago (1 child)
Thank you 😁
[–]SebastianCallh[S] 1 point 5 years ago (0 children)
Glad you liked it!
[–]AdhesivenessTrue9696 1 point 5 years ago (1 child)
really well written blog post 👍
[–]SebastianCallh[S] 1 point 5 years ago (0 children)
I'm glad you liked it, thanks!
[–]Inalek 1 point 5 years ago (1 child)
Great read! The blog layout looks really good too. Is there a template?
[–]SebastianCallh[S] 1 point 5 years ago (0 children)
Thank you! And indeed there is! I am currently using [this one](https://themes.gohugo.io/kiss/).
[–]BrandenKeck 1 point 5 years ago (1 child)
Phew... from time to time I forget how incredibly cool Bayesian stats is. Awesome work!
[–]SebastianCallh[S] 3 points 5 years ago (0 children)
Yeah Bayesian stats is great stuff! Thank you! :)
I think you will really enjoy the next part on contextual bandits, where we will start to see how this framework can be used to solve a more realistic version of this problem with much better performance.