Statistically Efficient Reinforcement Learning - [electronic resource]
Material type
Dissertation file (international)
Last processed
20240214101531
ISBN  
9798380318662
DDC  
004
Author
Uehara, Masatoshi.
Title / Author
Statistically Efficient Reinforcement Learning - [electronic resource]
Publication
[S.l.] : Cornell University, 2023
Publication
Ann Arbor : ProQuest Dissertations & Theses, 2023
Physical description
1 online resource (296 p.)
Note
Source: Dissertations Abstracts International, Volume: 85-03, Section: B.
Note
Advisor: Kallus, Nathan.
Dissertation note
Thesis (Ph.D.)--Cornell University, 2023.
Use restriction
This item must not be sold to any third party vendors.
Use restriction
This item must not be added to any third party search indexes.
Abstract
My research focuses on developing algorithms and statistical theory for sequential decision making at the intersection of reinforcement learning (RL) and causal inference. RL is concerned with how agents learn to make sequential decisions in unknown environments. It has been one of the most vibrant research frontiers in machine learning over the last few years, with empirical success in a variety of applications, especially games, as demonstrated by AlphaGo (Silver et al., 2016). Despite its popularity, real-world application of RL in fields such as biomedicine and social science remains limited. Unlike games, these applications lack good simulators, and experimentation is often expensive and risky (e.g., running clinical trials or deploying new marketing strategies in companies). Although running new experiments can be difficult, in an era of big data we often have access to massive historical datasets such as web-logged data and large electronic health records. This motivated me to find ways to use offline data in a statistically efficient manner, which is a central topic in the subfields of offline RL and causal machine learning. However, offline RL is limited when the quality of the offline data is poor. In that scenario, we want to find the best policy by collecting data adaptively. This motivated me to find ways to collect data while searching for the best policy, which is a central topic in online RL. Since experiments are often costly, data collection must also be performed in a statistically efficient way. Hence, building statistically efficient RL algorithms in both offline and online settings is key to bringing RL to a variety of real-world applications.
Subject
Computer science.
Subject
Information technology.
Keyword
Causal inference
Keyword
Reinforcement learning
Keyword
Decision making
Keyword
Offline RL
Keyword
Online RL
Added author
Cornell University Computer Science
Host item entry
Dissertations Abstracts International. 85-03B.
Host item entry
Dissertation Abstract International
Holdings
Reg No. Call No. Location Status Lend Info
TF08416 E-book