RT DF A1 Uehara, Masatoshi. Computer Science. T1 Statistically Efficient Reinforcement Learning- [electronic resource]