Back to Search Start Over

Efficient algorithms for calculating the probability distribution of the sum of hypergeometric-distributed random variables

Authors :
Nataliya Chukhrova
Philippe Castagliola
Arne Johannssen
University of Hamburg
Systèmes Logistiques et de Production (SLP )
Laboratoire des Sciences du Numérique de Nantes (LS2N)
Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST)
Université de Nantes (UN)-Université de Nantes (UN)-École Centrale de Nantes (ECN)-Centre National de la Recherche Scientifique (CNRS)-IMT Atlantique Bretagne-Pays de la Loire (IMT Atlantique)
Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)-Université de Nantes - UFR des Sciences et des Techniques (UN UFR ST)
Institut Mines-Télécom [Paris] (IMT)-Institut Mines-Télécom [Paris] (IMT)
IMT Atlantique Bretagne-Pays de la Loire (IMT Atlantique)
Université de Nantes (UN)-Université de Nantes (UN)-École Centrale de Nantes (ECN)-Centre National de la Recherche Scientifique (CNRS)
Source :
MethodsX, MethodsX, Elsevier, 2021, 8, pp.101507. ⟨10.1016/j.mex.2021.101507⟩, MethodsX, Vol 8, Iss, Pp 101507-(2021)
Publication Year :
2021
Publisher :
HAL CCSD, 2021.

Abstract

Highlights • This article presents theoretical bases of three methods for determining the probability distribution of the sum of i.i.d. hypergeometric random variables: (1) direct convolution, (2) recursive algorithm by De Pril, (3) approximation. • We provide associated MATLAB codes (including context-specific customizations) for direct implementation of these methods and discuss technical aspects and essential details of the tweaks we have made. • A representative application example in SPM shows that the proposed approximation is considerably simpler in application than both other methods and it ensures a remarkable high accuracy of the results while reducing computational time considerably.<br />Graphical abstract<br />In probability theory and statistics, the probability distribution of the sum of two or more independent and identically distributed (i.i.d.) random variables is the convolution of their individual distributions. While convoluting random variables following a binomial, geometric or Poisson distribution is a straightforward procedure, convoluting hypergeometric-distributed random variables is not. The problem is that there is no closed form solution for the probability mass function (p.m.f.) and cumulative distribution function (c.d.f.) of the sum of i.i.d. hypergeometric random variables. To overcome this problem, we propose an approximation for the distribution of the sum of i.i.d. hypergeometric random variables. In addition, we compare this approximation with two classical numerical methods, i.e., convolution and the recursive algorithm by De Pril, by means of an application in Statistical Process Monitoring (SPM). We provide MATLAB codes to implement these three methods for computing the probability distribution of the sum of i.i.d. hypergeometric random variables in an efficient way. The obtained results show that the proposed approximation has remarkable properties and may be helpful in all fields, where the problem of convoluting hypergeometric-distributed random variables occurs. Therefore, the approximation considered in this paper is well suited to make a change over established practices.•This article presents theoretical bases of three methods for determining the probability distribution of the sum of i.i.d. hypergeometric random variables: (1) direct convolution, (2) recursive algorithm by De Pril, (3) approximation.•We provide associated MATLAB codes (including context-specific customizations) for direct implementation of these methods and discuss technical aspects and essential details of the tweaks we have made.•A representative application example in SPM shows that the proposed approximation is considerably simpler in application than both other methods and it ensures a remarkable high accuracy of the results while reducing computational time considerably.

Details

Language :
English
ISSN :
22150161
Database :
OpenAIRE
Journal :
MethodsX, MethodsX, Elsevier, 2021, 8, pp.101507. ⟨10.1016/j.mex.2021.101507⟩, MethodsX, Vol 8, Iss, Pp 101507-(2021)
Accession number :
edsair.doi.dedup.....b2233ead17f6d8774d321a67b7b2e2d8