Adaptivity, Structure, and Objectives in Sequential Decision-Making- [electronic resource]
Adaptivity, Structure, and Objectives in Sequential Decision-Making- [electronic resource]
- 자료유형
- 학위논문파일 국외
- 최종처리일시
- 20240214100120
- ISBN
- 9798379711702
- DDC
- 004
- 서명/저자
- Adaptivity, Structure, and Objectives in Sequential Decision-Making - [electronic resource]
- 발행사항
- [S.l.]: : Cornell University., 2023
- 발행사항
- Ann Arbor : : ProQuest Dissertations & Theses,, 2023
- 형태사항
- 1 online resource(368 p.)
- 주기사항
- Source: Dissertations Abstracts International, Volume: 84-12, Section: B.
- 주기사항
- Advisor: Yu, Christina Lee.
- 학위논문주기
- Thesis (Ph.D.)--Cornell University, 2023.
- 사용제한주기
- This item must not be sold to any third party vendors.
- 초록/해제
- 요약Sequential decision-making algorithms are ubiquitous in the design and optimization of large-scale systems due to their practical impact, leading to a renaissance of incorporating machine learning for decision-making. This widespread societal adoption includes improving data centers with machine-learned advice and managing supply chain optimization for mobile food pantry services. The typical algorithmic paradigm ignores the sequential notion of these problems: use a historical dataset to predict future uncertainty and solve the resulting offline planning problem. Reinforcement learning (RL) provides a more natural highfidelity model for these systems, giving theoretical tools for the design and analysis of an algorithm's performance. These algorithms have seen historical success, but mainly in the context of large-scale game playing and robotics with tabula rasa algorithms. The fundamental gap in their adoption and performance in operations management domains is theoretically understanding how algorithms adapt to additional structure observed in these problems by improving over min-max bounds, incorporating domain-specific constraints, and adjusting to multi-criteria objectives.In this thesis, we will develop machine learning algorithms for data-driven sequential decision making in the framework of RL, with applications to social good, societal systems, and operations management. We will consider designing methods for sequential decision-making (bandits, reinforcement learning) that leverage auxiliary data sources (imitation learning, exogenous datasets, geometric assumptions). We will specialize this framework to areas including nonparametric RL algorithms for memory management and metrical task systems, fair resource allocation, and data-driven algorithm design for bin packing with applications in cloud computing. Central to this, we will additionally discuss our open-source code instrumentation and methodology to analyze the multi-criteria performance of algorithms on these problems.To summarize, we will outline an approach toProvide techniques to scale reinforcement learning algorithms to societal systems through three lenses: adaptivity, structure, and objectives.In more detail, this thesis will be separated into three distinct parts each focused on considering the following questions: (1) Adaptivity: How can we design algorithms which optimally exploit geometry in the data to provide enhanced performance and reduce run-time and storage complexity? (2) Structure: What additional structure and constraints, either on the operational behavior of the algorithm or on the system, lead to provably improved domain-specific algorithms?(3) Objectives: How can we characterize and attain the Pareto frontier of tradeoffs between the multi-criteria objectives in sequential decision-making problems?
- 일반주제명
- Computer science.
- 일반주제명
- Mathematics.
- 키워드
- Machine learning
- 키워드
- Market design
- 기타저자
- Cornell University Operations Research and Information Engineering
- 기본자료저록
- Dissertations Abstracts International. 84-12B.
- 기본자료저록
- Dissertation Abstract International
- 전자적 위치 및 접속
- 로그인 후 원문을 볼 수 있습니다.