Q-learning for estimating optimal dynamic treatment rules from observational data.

Q-learning for estimating optimal dynamic treatment rules from observational data.