백석예술대학교 도서관

본문 바로가기
탑 메뉴 바로가기
주 메뉴 바로가기
하단 바로가기

내용보기

Recent Developments in Robust Statistics- [electronic resource]

자료유형: 학위논문파일 국외

최종처리일시: 20240214101643

ISBN: 9798380366298

DDC: 004

저자명: Cherapanamjeri, Yeshwanth.

서명/저자: Recent Developments in Robust Statistics - [electronic resource]

발행사항: [S.l.]: : University of California, Berkeley., 2023

발행사항: Ann Arbor : : ProQuest Dissertations & Theses,, 2023

형태사항: 1 online resource(84 p.)

주기사항: Source: Dissertations Abstracts International, Volume: 85-03, Section: B.

주기사항: Advisor: Bartlett, Peter.

학위논문주기: Thesis (Ph.D.)--University of California, Berkeley, 2023.

사용제한주기: This item must not be sold to any third party vendors.

초록/해제: 요약The design of statistical estimators robust to outliers has been a mainstay of statistical research through the past six decades. These techniques are even more prescient in the contemporary landscape where large-scale machine learning systems are deployed in increasingly noisy and adaptive environments. In this thesis, we consider the task of building such an estimator for arguably the simplest possible statistical estimation problem - that of mean estimation. There is surprisingly little understanding of the computational and statistical limits of estimation and the trade-offs incurred even for this relatively simple setting. We make progress on this problem along three complementary axes.Our first contribution is a simple algorithmic framework for constructing robust estimators. Our framework allows for a significant speed-up over prior approaches for mean estimation while also allowing for easy extensibility to other statistical estimation tasks where it achieves state-of-the-art performance.Secondly, we investigate the statistical boundaries of mean estimation where we demonstrate the necessary statistical degradation incurred in extremely heavy-tailed scenarios. While prior work showed that estimation could be performed as well as if one had access to Guassian data, we establish that this is no longer true when the data possesses heavier tails. We provide lower bounds which exhibit this degradation and an (efficient) algorithm matching them.Lastly, we consider the stability of these estimators to natural transformations of the data. Inspired by the empirical mean, classical work constructed estimators equivariant to affine transformations. These works, however, lacked the strong quantitative performance of more recent approaches. We demonstrate that such trade-offs are in fact necessary by constructing novel lower bounds for affine-equivariant estimators. We then show that classical estimators are quantitatively deficient even in this restricted class and devise an estimator based on a novel notion of a high-dimensional median which matches the lower bound.