Back to Search Start Over

Oracle inequalities and upper bounds for kernel conditional U-statistics estimators on manifolds and more general metric spaces associated with operators.

Authors :
Bouzebda, Salim
Taachouche, Nourelhouda
Source :
Stochastics: An International Journal of Probability & Stochastic Processes. Dec2024, Vol. 96 Issue 8, p2135-2198. 64p.
Publication Year :
2024

Abstract

U-statistics represent a fundamental class of statistics that emerge from modelling quantities of interest defined by multi-subject responses. These statistics generalize the empirical mean of a random variable $ \bf {X} $ X to summations encompassing all distinct k-tuples of observations drawn from $ \bf {X} $ X . A significant advancement was made by Stute [ConditionalU-statistics. Ann. Probab. 19 (1991), pp. 812–825], who introduced conditional U-statistics as a generalization of the Nadaraya–Watson estimates for regression functions. Stute demonstrated their robust pointwise consistency towards the conditional function \[ r^{(k)}(\varphi,\tilde{\bf{t}})=\mathbb{E}\left(\varphi(\bf{Y}_{1},\ldots,\bf{Y}_{k})\,\mid\, (\bf{X}_{1},\ldots,\bf{X}_{k})=\tilde{\bf{t}}\right), \quad\mbox{for}\ \tilde{\bf{t}} \in \mathbb{R}^{pk}, \] r (k) (φ , t ~) = E (φ (Y 1 , ... , Y k) ∣ (X 1 , ... , X k) = t ~) , for t ~ ∈ R pk , where φ is a measurable function. In this investigation, we develop oracle inequalities and upper bounds for kernel-based estimators of conditional U-statistics of general order, applicable across a wide range of metric spaces associated with operators. Our analysis specifically targets doubling measure metric spaces, incorporating a non-negative self-adjoint operator characterized by Gaussian regularity in its heat kernel. Remarkably, our study achieves an optimal convergence rate in certain cases. To derive these results, we explore the regression function within a general framework, introducing several novel insights. These findings are established under sufficiently broad conditions on the underlying distributions. The theoretical results serve as essential tools for advancing the field of general-valued data, with potential applications including the examination of conditional distribution functions, relative-error prediction, the Kendall rank correlation coefficient, and discrimination problems – areas of significant independent interest. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
17442508
Volume :
96
Issue :
8
Database :
Academic Search Index
Journal :
Stochastics: An International Journal of Probability & Stochastic Processes
Publication Type :
Academic Journal
Accession number :
181802967
Full Text :
https://doi.org/10.1080/17442508.2024.2391898