본문

Statistically Efficient Reinforcement Learning- [electronic resource]
Statistically Efficient Reinforcement Learning - [electronic resource]
내용보기
Statistically Efficient Reinforcement Learning- [electronic resource]
자료유형  
 학위논문파일 국외
최종처리일시  
20240214101531
ISBN  
9798380318662
DDC  
004
저자명  
Uehara, Masatoshi.
서명/저자  
Statistically Efficient Reinforcement Learning - [electronic resource]
발행사항  
[S.l.]: : Cornell University., 2023
발행사항  
Ann Arbor : : ProQuest Dissertations & Theses,, 2023
형태사항  
1 online resource(296 p.)
주기사항  
Source: Dissertations Abstracts International, Volume: 85-03, Section: B.
주기사항  
Advisor: Kallus, Nathan.
학위논문주기  
Thesis (Ph.D.)--Cornell University, 2023.
사용제한주기  
This item must not be sold to any third party vendors.
사용제한주기  
This item must not be added to any third party search indexes.
초록/해제  
요약My research focus is on developing algorithms and statistical theories of sequential decision making on the intersection of reinforcement learning (RL) and causal inference. RL is concerned with the ways agents learn to make sequential decisions in unknown environments. It has been one of the most vibrant research frontiers in machine learning over the last few years. We have empirical success in a variety of applications, especially for games such as AlphaGo (Silver et al., 2016). Despite its popularity, the real-world application of RL in fields such as biomedicine and social science is still limited. This is because these real-world applications do not have good simulators, and experimentation is often expensive and risky (e.g., running clinical trials, deploying new marketing strategies in companies) unlike for games. Although running new experiments can be difficult, fortunately, in an era of big data, we often have access to massive historical datasets such as web-logged data and large electronic health records. This motivated me to find ways to use offline data in a statistically efficient manner, which is a central topic in the subfield of offline RL and causal machine learning. However, there is a certain limitation in offline RL when the quality of the offline data is poor. In this scenario, we want to find the best policy by adaptively collecting data. This motivated me to find ways to collect the data and search for the best policy, which is a central topic in online RL. Since experiments are often costly, it again needs to be performed in a statistically efficient way. Hence, building statistically efficient RL algorithms in both offline and online settings is the key to bringing RL to a variety of real-world applications.
일반주제명  
Computer science.
일반주제명  
Information technology.
키워드  
Causal inference
키워드  
Reinforcement learning
키워드  
Decision making
키워드  
Offline RL
키워드  
online RL
기타저자  
Cornell University Computer Science
기본자료저록  
Dissertations Abstracts International. 85-03B.
기본자료저록  
Dissertation Abstract International
전자적 위치 및 접속  
로그인 후 원문을 볼 수 있습니다.
신착도서 더보기
최근 3년간 통계입니다.

소장정보

  • 예약
  • 소재불명신고
  • 나의폴더
  • 우선정리요청
  • 비도서대출신청
  • 야간 도서대출신청
소장자료
등록번호 청구기호 소장처 대출가능여부 대출정보
TF07066 전자도서
마이폴더 부재도서신고 비도서대출신청

* 대출중인 자료에 한하여 예약이 가능합니다. 예약을 원하시면 예약버튼을 클릭하십시오.

해당 도서를 다른 이용자가 함께 대출한 도서

관련 인기도서

로그인 후 이용 가능합니다.