Back to Search
Start Over
Oracle inequalities and upper bounds for kernel conditional U-statistics estimators on manifolds and more general metric spaces associated with operators.
- Source :
-
Stochastics: An International Journal of Probability & Stochastic Processes . Dec2024, Vol. 96 Issue 8, p2135-2198. 64p. - Publication Year :
- 2024
-
Abstract
- U-statistics represent a fundamental class of statistics that emerge from modelling quantities of interest defined by multi-subject responses. These statistics generalize the empirical mean of a random variable $ \bf {X} $ X to summations encompassing all distinct k-tuples of observations drawn from $ \bf {X} $ X . A significant advancement was made by Stute [ConditionalU-statistics. Ann. Probab. 19 (1991), pp. 812–825], who introduced conditional U-statistics as a generalization of the Nadaraya–Watson estimates for regression functions. Stute demonstrated their robust pointwise consistency towards the conditional function \[ r^{(k)}(\varphi,\tilde{\bf{t}})=\mathbb{E}\left(\varphi(\bf{Y}_{1},\ldots,\bf{Y}_{k})\,\mid\, (\bf{X}_{1},\ldots,\bf{X}_{k})=\tilde{\bf{t}}\right), \quad\mbox{for}\ \tilde{\bf{t}} \in \mathbb{R}^{pk}, \] r (k) (φ , t ~) = E (φ (Y 1 , ... , Y k) ∣ (X 1 , ... , X k) = t ~) , for t ~ ∈ R pk , where φ is a measurable function. In this investigation, we develop oracle inequalities and upper bounds for kernel-based estimators of conditional U-statistics of general order, applicable across a wide range of metric spaces associated with operators. Our analysis specifically targets doubling measure metric spaces, incorporating a non-negative self-adjoint operator characterized by Gaussian regularity in its heat kernel. Remarkably, our study achieves an optimal convergence rate in certain cases. To derive these results, we explore the regression function within a general framework, introducing several novel insights. These findings are established under sufficiently broad conditions on the underlying distributions. The theoretical results serve as essential tools for advancing the field of general-valued data, with potential applications including the examination of conditional distribution functions, relative-error prediction, the Kendall rank correlation coefficient, and discrimination problems – areas of significant independent interest. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 17442508
- Volume :
- 96
- Issue :
- 8
- Database :
- Academic Search Index
- Journal :
- Stochastics: An International Journal of Probability & Stochastic Processes
- Publication Type :
- Academic Journal
- Accession number :
- 181802967
- Full Text :
- https://doi.org/10.1080/17442508.2024.2391898