Safe Policy Improvement with Baseline Bootstrapping

Microsoft Research blog