Toggle navigation
contextual
0.9.8.4
Reference
Articles
Demo: Basic Synthetic cMAB Policies
Demo: Offline cMAB LinUCB evaluation
Demo: MAB Replication Eckles & Kaptein (Bootstrap Thompson Sampling)
Demo: Basic Epsilon Greedy
Getting started: running simulations
Demo: MAB Policies Comparison
Demo: MovieLens 10M Dataset
Demo: Offline cMAB: CarsKit DePaul Movie Dataset
Offline evaluation: Replication of Li et al 2010
Demo: Bandits, Propensity Weighting & Simpson's Paradox in R
Demo: Replication Sutton & Barto, Reinforcement Learning: An Introduction, Chapter 2
Demo: Replication of John Myles White, Bandit Algorithms for Website Optimization
Changelog
FAQ
Authors
Robin van Emden
. Author, maintainer.
Maurits Kaptein
. Contributor.