RT DF
A1 Zhan, Ruohan.
T1 Policy Evaluation and Learning in Adaptive Experiments