Back to Search Start Over

General adapted‐threshold monitoring in discrete environments and rules for imbalanced classes.

Authors :
Steland, Ansgar
Rafajłowicz, Ewaryst
Rafajłowicz, Wojciech
Source :
Statistica Neerlandica. Aug2024, p1. 18p.
Publication Year :
2024

Abstract

Having in mind applications in statistics and machine learning such as individualized care monitoring, or watermark detection in large language models, we consider the following general setting: When monitoring a sequence of observations, Xt$$ {X}_t $$, there may be additional information, Zt$$ {Z}_t $$, on the environment which should be used to design the monitoring procedure. This additional information can be incorporated by applying threshold functions c(Zt)$$ c\left({Z}_t\right) $$ to the standardized measurements to adapt the detector to the environment. For the case of categorical data encoding of discrete‐valued environmental information we study several classes of level α$$ \alpha $$ threshold functions including a proportional one which favors rare events among imbalanced classes. For the latter rule asymptotic theory is developed for independent and identically distributed and dependent learning samples including data from new discrete autoregressive moving average model (NDARMA) series and Hidden Markov Models. Further, we propose two‐stage designs which allow to distribute in a controlled way the α$$ \alpha $$ budget over an a priori partition of the sample space of Zt$$ {Z}_t $$. The approach is illustrated by a real medical data set. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00390402
Database :
Academic Search Index
Journal :
Statistica Neerlandica
Publication Type :
Academic Journal
Accession number :
179102217
Full Text :
https://doi.org/10.1111/stan.12352