Toggle navigation
contextual
0.9.8.4
Reference
Articles
Demo: Basic Synthetic cMAB Policies
Demo: Offline cMAB LinUCB evaluation
Demo: MAB Replication Eckles & Kaptein (Bootstrap Thompson Sampling)
Demo: Basic Epsilon Greedy
Getting started: running simulations
Demo: MAB Policies Comparison
Demo: MovieLens 10M Dataset
Demo: Offline cMAB: CarsKit DePaul Movie Dataset
Offline evaluation: Replication of Li et al 2010
Demo: Bandits, Propensity Weighting & Simpson's Paradox in R
Demo: Replication Sutton & Barto, Reinforcement Learning: An Introduction, Chapter 2
Demo: Replication of John Myles White, Bandit Algorithms for Website Optimization
Changelog
FAQ
Changelog
Source:
NEWS.md
contextual 0.9.8.4
Unreleased
Minor documentation updates.
Fix for Exp3 bug (thanks,
@leferrad
)
Cleanup of propensity score related code (thanks again,
@leferrad
)
Updated tests.
contextual 0.9.8.3
2020-03-04
Tested and confirmed to be R 4.0.0 proof.
Minor documentation updates.
Now correctly restores global seed on completing a simulation (thanks,
@pstansell
)
contextual 0.9.8.2
2019-07-08
Minor documentation update
Minor refactoring: Private utility functions moved from the History to the Plot class.
contextual 0.9.8.1
2019-03-17
Specified previous version of set.seed sampler with RNGversion() calls
contextual 0.9.8
2019-02-10
Major update
API change for offline Bandits
Fixes inverse propensity score weighting
Documentation updates
Additional demo scripts
contextual 0.9.1
2018-10-30
First CRAN release
contextual 0.9.0
Unreleased
CRAN Submission
Contents