229 results on '"Leisen, Fabrizio"'
Search Results
2. Asymptotics of predictive distributions driven by sample means and variances
- Author
-
Garelli, Samuele, Leisen, Fabrizio, Pratelli, Luca, and Rigo, Pietro
- Subjects
Mathematics - Statistics Theory ,Statistics - Methodology - Abstract
Let $\alpha_n(\cdot)=P\bigl(X_{n+1}\in\cdot\mid X_1,\ldots,X_n\bigr)$ be the predictive distributions of a sequence $(X_1,X_2,\ldots)$ of $p$-dimensional random vectors. Suppose $$\alpha_n= \mathcal{N} _p (M_n,Q_n)$$ where $M_n=\frac{1}{n}\sum_{i=1}^nX_i$ and $Q_n=\frac{1}{n}\sum_{i=1}^n(X_i-M_n)(X_i-M_n)^t$. Then, there is a random probability measure $\alpha$ on the Borel subsets of $\mathbb{R}^p$ such that $\lVert\alpha_n-\alpha\rVert\overset{a.s.}\longrightarrow 0$ where $\lVert\cdot\rVert$ is total variation distance. An explicit expression for $\alpha$ is provided and the convergence rate of $\lVert\alpha_n-\alpha\rVert$ is shown to be arbitrarily close to $n^{-1/2}$. Moreover, it is still true that $\lVert\alpha_n-\alpha\rVert\overset{a.s.}\longrightarrow 0$ even if $\alpha_n=\mathcal{L}(M_n,Q_n)$ where $\mathcal{L}$ belongs to a class of distributions much larger than the normal. The predictives $\alpha_n$ are useful in various frameworks, including Bayesian predictive inference and predictive resampling. Finally, the asymptotic behavior of copula-based predictive distributions (introduced in [13]) is investigated and a numerical experiment is performed.
- Published
- 2024
3. Generating knockoffs via conditional independence
- Author
-
Dreassi, Emanuela, Leisen, Fabrizio, Pratelli, Luca, and Rigo, Pietro
- Subjects
Mathematics - Statistics Theory ,62E10, 62H05, 60E05, 62J02 - Abstract
Let $X$ be a $p$-variate random vector and $\widetilde{X}$ a knockoff copy of $X$ (in the sense of \cite{CFJL18}). A new approach for constructing $\widetilde{X}$ (henceforth, NA) has been introduced in \cite{JSPI}. NA has essentially three advantages: (i) To build $\widetilde{X}$ is straightforward; (ii) The joint distribution of $(X,\widetilde{X})$ can be written in closed form; (iii) $\widetilde{X}$ is often optimal under various criteria. However, for NA to apply, $X_1,\ldots, X_p$ should be conditionally independent given some random element $Z$. Our first result is that any probability measure $\mu$ on $\mathbb{R}^p$ can be approximated by a probability measure $\mu_0$ of the form $$\mu_0\bigl(A_1\times\ldots\times A_p\bigr)=E\Bigl\{\prod_{i=1}^p P(X_i\in A_i\mid Z)\Bigr\}.$$ The approximation is in total variation distance when $\mu$ is absolutely continuous, and an explicit formula for $\mu_0$ is provided. If $X\sim\mu_0$, then $X_1,\ldots,X_p$ are conditionally independent. Hence, with a negligible error, one can assume $X\sim\mu_0$ and build $\widetilde{X}$ through NA. Our second result is a characterization of the knockoffs $\widetilde{X}$ obtained via NA. It is shown that $\widetilde{X}$ is of this type if and only if the pair $(X,\widetilde{X})$ can be extended to an infinite sequence so as to satisfy certain invariance conditions. The basic tool for proving this fact is de Finetti's theorem for partially exchangeable sequences. In addition to the quoted results, an explicit formula for the conditional distribution of $\widetilde{X}$ given $X$ is obtained in a few cases. In one of such cases, it is assumed $X_i\in\{0,1\}$ for all $i$., Comment: 26 pages
- Published
- 2022
4. A probabilistic view on predictive constructions for Bayesian learning
- Author
-
Berti, Patrizia, Dreassi, Emanuela, Leisen, Fabrizio, Rigo, Pietro, and Pratelli, Luca
- Subjects
Statistics - Methodology ,Mathematics - Probability ,Mathematics - Statistics Theory - Abstract
Given a sequence $X=(X_1,X_2,\ldots)$ of random observations, a Bayesian forecaster aims to predict $X_{n+1}$ based on $(X_1,\ldots,X_n)$ for each $n\ge 0$. To this end, in principle, she only needs to select a collection $\sigma=(\sigma_0,\sigma_1,\ldots)$, called ``strategy" in what follows, where $\sigma_0(\cdot)=P(X_1\in\cdot)$ is the marginal distribution of $X_1$ and $\sigma_n(\cdot)=P(X_{n+1}\in\cdot\mid X_1,\ldots,X_n)$ the $n$-th predictive distribution. Because of the Ionescu-Tulcea theorem, $\sigma$ can be assigned directly, without passing through the usual prior/posterior scheme. One main advantage is that no prior probability is to be selected. In a nutshell, this is the predictive approach to Bayesian learning. A concise review of the latter is provided in this paper. We try to put such an approach in the right framework, to make clear a few misunderstandings, and to provide a unifying view. Some recent results are discussed as well. In addition, some new strategies are introduced and the corresponding distribution of the data sequence $X$ is determined. The strategies concern generalized P\'olya urns, random change points, covariates and stationary sequences.
- Published
- 2022
5. BAYESIAN PREDICTIVE INFERENCE WITHOUT A PRIOR
- Author
-
Berti, Patrizia, Dreassi, Emanuela, Leisen, Fabrizio, Pratelli, Luca, and Rigo, Pietro
- Published
- 2023
6. Kernel based Dirichlet sequences
- Author
-
Berti, Patrizia, Dreassi, Emanuela, Leisen, Fabrizio, Pratelli, Luca, and Rigo, Pietro
- Subjects
Mathematics - Probability - Abstract
Let $X=(X_1,X_2,\ldots)$ be a sequence of random variables with values in a standard space $(S,\mathcal{B})$. Suppose \begin{gather*} X_1\sim\nu\quad\text{and}\quad P\bigl(X_{n+1}\in\cdot\mid X_1,\ldots,X_n\bigr)=\frac{\theta\nu(\cdot)+\sum_{i=1}^nK(X_i)(\cdot)}{n+\theta}\quad\quad\text{a.s.} \end{gather*} where $\theta>0$ is a constant, $\nu$ a probability measure on $\mathcal{B}$, and $K$ a random probability measure on $\mathcal{B}$. Then, $X$ is exchangeable whenever $K$ is a regular conditional distribution for $\nu$ given any sub-$\sigma$-field of $\mathcal{B}$. Under this assumption, $X$ enjoys all the main properties of classical Dirichlet sequences, including Sethuraman's representation, conjugacy property, and convergence in total variation of predictive distributions. If $\mu$ is the weak limit of the empirical measures, conditions for $\mu$ to be a.s. discrete, or a.s. non-atomic, or $\mu\ll\nu$ a.s., are provided. Two CLT's are proved as well. The first deals with stable convergence while the second concerns total variation distance.
- Published
- 2021
7. Bayesian predictive inference without a prior
- Author
-
Berti, Patrizia, Dreassi, Emanuela, Leisen, Fabrizio, Rigo, Pietro, and Pratelli, Luca
- Subjects
Mathematics - Statistics Theory ,Statistics - Methodology - Abstract
Let $(X_n:n\ge 1)$ be a sequence of random observations. Let $\sigma_n(\cdot)=P\bigl(X_{n+1}\in\cdot\mid X_1,\ldots,X_n\bigr)$ be the $n$-th predictive distribution and $\sigma_0(\cdot)=P(X_1\in\cdot)$ the marginal distribution of $X_1$. In a Bayesian framework, to make predictions on $(X_n)$, one only needs the collection $\sigma=(\sigma_n:n\ge 0)$. Because of the Ionescu-Tulcea theorem, $\sigma$ can be assigned directly, without passing through the usual prior/posterior scheme. One main advantage is that no prior probability has to be selected. In this paper, $\sigma$ is subjected to two requirements: (i) The resulting sequence $(X_n)$ is conditionally identically distributed, in the sense of Berti, Pratelli and Rigo (2004); (ii) Each $\sigma_{n+1}$ is a simple recursive update of $\sigma_n$. Various new $\sigma$ satisfying (i)-(ii) are introduced and investigated. For such $\sigma$, the asymptotics of $\sigma_n$, as $n\rightarrow\infty$, is determined. In some cases, the probability distribution of $(X_n)$ is also evaluated.
- Published
- 2021
8. New perspectives on knockoffs construction
- Author
-
Berti, Patrizia, Dreassi, Emanuela, Leisen, Fabrizio, Pratelli, Luca, and Rigo, Pietro
- Subjects
Mathematics - Statistics Theory ,Statistics - Methodology - Abstract
Let $\Lambda$ be the collection of all probability distributions for $(X,\widetilde{X})$, where $X$ is a fixed random vector and $\widetilde{X}$ ranges over all possible knockoff copies of $X$ (in the sense of \cite{CFJL18}). Three topics are developed in this paper: (i) A new characterization of $\Lambda$ is proved; (ii) A certain subclass of $\Lambda$, defined in terms of copulas, is introduced; (iii) The (meaningful) special case where the components of $X$ are conditionally independent is treated in depth. In real problems, after observing $X=x$, each of points (i)-(ii)-(iii) may be useful to generate a value $\widetilde{x}$ for $\widetilde{X}$ conditionally on $X=x$.
- Published
- 2021
9. A Copula-based Fully Bayesian Nonparametric Evaluation of Cardiovascular Risk Markers in the Mexico City Diabetes Study
- Author
-
Wehrhahn, Claudia, Fuentes-García, Ruth, Mena, Ramsés H., Leisen, Fabrizio, González-Villalpando, Maria Elena, and González-Villalpando, Clicerio
- Subjects
Statistics - Applications ,Statistics - Methodology - Abstract
Cardiovascular disease lead the cause of death world wide and several studies have been carried out to understand and explore cardiovascular risk markers in normoglycemic and diabetic populations. In this work, we explore the association structure between hyperglycemic markers and cardiovascular risk markers controlled by triglycerides, body mass index, age and gender, for the normoglycemic population in The Mexico City Diabetes Study. Understanding the association structure could contribute to the assessment of additional cardiovascular risk markers in this low income urban population with a high prevalence of classic cardiovascular risk biomarkers. The association structure is measured by conditional Kendall's tau, defined through conditional copula functions. The latter are in turn modeled under a fully Bayesian nonparametric approach, which allows the complete shape of the copula function to vary for different values of the controlled covariates.
- Published
- 2020
10. Completely Random Measures and L\'evy Bases in Free probability
- Author
-
Collet, Francesca, Leisen, Fabrizio, and Thorbjørnsen, Steen
- Subjects
Mathematics - Probability ,Mathematics - Operator Algebras - Abstract
This paper develops a theory for completely random measures in the framework of free probability. A general existence result for free completely random measures is established, and in analogy to the classical work of Kingman it is proved that such random measures can be decomposed into the sum of a purely atomic part and a (freely) infinitely divisible part. The latter part (termed a free L\'evy basis) is studied in detail in terms of the free L\'evy-Khintchine representation and a theory parallel to the classical work of Rajput and Rosinski is developed. Finally a L\'evy-It\^o type decomposition for general free L\'evy bases is established.
- Published
- 2020
11. Compound vectors of subordinators and their associated positive L\'evy copulas
- Author
-
Palacio, Alan Riva and Leisen, Fabrizio
- Subjects
Statistics - Methodology ,Mathematics - Probability ,Mathematics - Statistics Theory - Abstract
L\'evy copulas are an important tool which can be used to build dependent L\'evy processes. In a classical setting, they have been used to model financial applications. In a Bayesian framework they have been employed to introduce dependent nonparametric priors which allow to model heterogeneous data. This paper focuses on introducing a new class of L\'evy copulas based on a class of subordinators recently appeared in the literature, called \textit{Compound Random Measures}. The well-known Clayton L\'evy copula is a special case of this new class. Furthermore, we provide some novel results about the underlying vector of subordinators such as a series representation and relevant moments. The article concludes with an application to a Danish fire dataset.
- Published
- 2019
12. A P\'olya-Gamma Sampler for a Generalized Logistic Regression
- Author
-
Valle, Luciana Dalla, Leisen, Fabrizio, Rossini, Luca, and Zhu, Weixuan
- Subjects
Statistics - Methodology ,Statistics - Computation ,Statistics - Other Statistics - Abstract
In this paper we introduce a novel Bayesian data augmentation approach for estimating the parameters of the generalised logistic regression model. We propose a P\'olya-Gamma sampler algorithm that allows us to sample from the exact posterior distribution, rather than relying on approximations. A simulation study illustrates the flexibility and accuracy of the proposed approach to capture heavy and light tails in binary response data of different dimensions. The methodology is applied to two different real datasets, where we demonstrate that the P\'olya-Gamma sampler provides more precise estimates than the empirical likelihood method, outperforming approximate approaches., Comment: Revised Version of the paper
- Published
- 2019
13. On a flexible construction of a negative binomial model
- Author
-
Leisen, Fabrizio, Mena, Ramsés H., Mancilla, Freddy Palma, and Rossini, Luca
- Subjects
Statistics - Methodology ,Statistics - Other Statistics - Abstract
This work presents a construction of stationary Markov models with negative-binomial marginal distributions. A simple closed form expression for the corresponding transition probabilities is given, linking the proposal to well-known classes of birth and death processes and thus revealing interesting characterizations. The advantage of having such closed form expressions is tested on simulated and real data., Comment: Forthcoming in "Statistics & Probability Letters"
- Published
- 2018
14. A Loss-Based Prior for Gaussian Graphical Models
- Author
-
Hinoveanu, Laurentiu Catalin, Leisen, Fabrizio, and Villa, Cristiano
- Subjects
Statistics - Methodology - Abstract
Gaussian graphical models play an important role in various areas such as genetics, finance, statistical physics and others. They are a powerful modelling tool which allows one to describe the relationships among the variables of interest. From the Bayesian perspective, there are two sources of randomness: one is related to the multivariate distribution and the quantities that may parametrise the model, the other has to do with the underlying graph, $G$, equivalent to describing the conditional independence structure of the model under consideration. In this paper, we propose a prior on G based on two loss components. One considers the loss in information one would incur in selecting the wrong graph, while the second penalises for large number of edges, favouring sparsity. We illustrate the prior on simulated data and on real datasets, and compare the results with other priors on $G$ used in the literature. Moreover, we present a default choice of the prior as well as discuss how it can be calibrated so as to reflect available prior information.
- Published
- 2018
15. Loss-based approach to two-piece location-scale distributions with applications to dependent data
- Author
-
Leisen, Fabrizio, Rossini, Luca, and Villa, Cristiano
- Subjects
Statistics - Methodology ,Statistics - Other Statistics - Abstract
Two-piece location-scale models are used for modeling data presenting departures from symmetry. In this paper, we propose an objective Bayesian methodology for the tail parameter of two particular distributions of the above family: the skewed exponential power distribution and the skewed generalised logistic distribution. We apply the proposed objective approach to time series models and linear regression models where the error terms follow the distributions object of study. The performance of the proposed approach is illustrated through simulation experiments and real data analysis. The methodology yields improvements in density forecasts, as shown by the analysis we carry out on the electricity prices in Nordpool markets., Comment: 26 pages, 6 Figures
- Published
- 2018
16. Bayes Calculations from Quantile Implied Likelihood
- Author
-
Karabatsos, George and Leisen, Fabrizio
- Subjects
Statistics - Methodology - Abstract
In statistical practice, a realistic Bayesian model for a given data set can be defined by a likelihood function that is analytically or computationally intractable, due to large data sample size, high parameter dimensionality, or complex likelihood functional form. This in turn poses challenges to the computation and inference of the posterior distribution of the model parameters. For such a model, a tractable likelihood function is introduced which approximates the exact likelihood through its quantile function. It is defined by an asymptotic chi-square confidence distribution for a pivotal quantity, which is generated by the asymptotic normal distribution of the sample quantiles given model parameters. This Quantile Implied Likelihood (QIL) gives rise to an approximate posterior distribution which can be estimated by using penalized log-likelihood maximization or any suitable Monte Carlo algorithm. The QIL approach to Bayesian Computation is illustrated through the Bayesian analysis of simulated and real data sets having sample sizes that reach the millions. The analyses involve various models for univariate or multivariate iid or non-iid data, with low or high parameter dimensionality, many of which are defined by intractable likelihoods. The probability models include the Student's t, g-and-h, and g-and-k distributions; the Bayesian logit regression model with many covariates; exponential random graph model, a doubly-intractable model for networks; the multivariate skew normal model, for robust inference of the inverse-covariance matrix when it is large relative to the sample size; and the Wallenius distribution model.
- Published
- 2018
17. Limiting behaviour of the stationary search cost distribution driven by a generalized gamma process
- Author
-
Kume, Alfred, Leisen, Fabrizio, and Lijoi, Antonio
- Subjects
Statistics - Methodology ,Mathematics - Probability - Abstract
Consider a list of labeled objects that are organized in a heap. At each time, object $j$ is selected with probability $p_j$ and moved to the top of the heap. This procedure defines a Markov chain on the set of permutations which is referred to in the literature as Move-to-Front rule. The present contribution focuses on the stationary search cost, namely the position of the requested item in the heap when the Markov chain is in equilibrium. We consider the scenario where the number of objects is infinite and the probabilities $p_j$'s are defined as the normalization of the increments of a subordinator. In this setting, we provide an exact formula for the moments of any order of the stationary search cost distribution. We illustrate the new findings in the case of a generalized gamma subordinator and deal with an extension to the two--parameter Poisson--Dirichlet process, also known as Pitman--Yor process.
- Published
- 2018
18. An Approximate Likelihood Perspective on ABC Methods
- Author
-
Karabatsos, George and Leisen, Fabrizio
- Subjects
Statistics - Methodology - Abstract
We are living in the big data era, as current technologies and networks allow for the easy and routine collection of data sets in different disciplines. Bayesian Statistics offers a flexible modeling approach which is attractive for describing the complexity of these datasets. These models often exhibit a likelihood function which is intractable due to the large sample size, high number of parameters, or functional complexity. Approximate Bayesian Computational (ABC) methods provides likelihood-free methods for performing statistical inferences with Bayesian models defined by intractable likelihood functions. The vastity of the literature on ABC methods created a need to review and relate all ABC approaches so that scientists can more readily understand and apply them for their own work. This article provides a unifying review, general representation, and classification of all ABC methods from the view of approximate likelihood theory. This clarifies how ABC methods can be characterized, related, combined, improved, and applied for future research. Possible future research in ABC is then suggested.
- Published
- 2017
19. Integrability conditions for Compound Random Measures
- Author
-
Palacio, Alan Riva and Leisen, Fabrizio
- Subjects
Statistics - Methodology ,Mathematics - Probability ,Mathematics - Statistics Theory - Abstract
Compound random measures (CoRM's) are a flexible and tractable framework for vectors of completely random measure. In this paper, we provide conditions to guarantee the existence of a CoRM. Furthermore, we prove some interesting properties of CoRM's when exponential scores and regularly varying L\'evy intensities are considered.
- Published
- 2017
20. On a Class of Objective Priors from Scoring Rules
- Author
-
Leisen, Fabrizio, Villa, Cristiano, and Walker, Stephen G.
- Subjects
Statistics - Methodology ,Statistics - Applications ,Statistics - Other Statistics - Abstract
Objective prior distributions represent an important tool that allows one to have the advantages of using the Bayesian framework even when information about the parameters of a model is not available. The usual objective approaches work off the chosen statistical model and in the majority of cases the resulting prior is improper, which can pose limitations to a practical implementation, even when the complexity of the model is moderate. In this paper we propose to take a novel look at the construction of objective prior distributions, where the connection with a chosen sampling distribution model is removed. We explore the notion of defining objective prior distributions which allow one to have some degree of flexibility, in particular in exhibiting some desirable features, such as being proper, or centered on specific values which would be of interest in nested model comparisons. The basic tool we use are proper scoring rules and the main result is a class of objective prior distributions that can be employed in scenarios where the usual model based priors fail, such as mixture models and model selection via Bayes factors. In addition, we show that the proposed class of priors is the result of minimising the information it contains, providing solid interpretation to the method.
- Published
- 2017
21. Bayesian nonparametric estimation of survival functions with multiple-samples information
- Author
-
Palacio, Alan Riva and Leisen, Fabrizio
- Subjects
Statistics - Methodology ,Statistics - Applications ,Statistics - Computation - Abstract
In many real problems, dependence structures more general than exchangeability are required. For instance, in some settings partial exchangeability is a more reasonable assumption. For this reason, vectors of dependent Bayesian nonparametric priors have recently gained popularity. They provide flexible models which are tractable from a computational and theoretical point of view. In this paper, we focus on their use for estimating survival functions with multiple-samples information. Our methodology allows to model the dependence among survival times of different groups of observations and extend previous work to an arbitrary dimension . Theoretical results about the posterior behaviour of the underlying dependent vector of completely random measures are provided. The performance of the model is tested on a simulated dataset arising from a distributional Clayton copula.
- Published
- 2017
22. Objective Bayesian Analysis for Change Point Problems
- Author
-
Hinoveanu, Laurentiu, Leisen, Fabrizio, and Villa, Cristiano
- Subjects
Statistics - Methodology ,Mathematics - Statistics Theory ,Statistics - Applications ,Statistics - Computation ,Statistics - Machine Learning - Abstract
In this paper we present a loss-based approach to change point analysis. In particular, we look at the problem from two perspectives. The first focuses on the definition of a prior when the number of change points is known a priori. The second contribution aims to estimate the number of change points by using a loss-based approach recently introduced in the literature. The latter considers change point estimation as a model selection exercise. We show the performance of the proposed approach on simulated data and real data sets.
- Published
- 2017
23. Modelling Preference Data with the Wallenius Distribution
- Author
-
Grazian, Clara, Leisen, Fabrizio, and Liseo, Brunero
- Subjects
Statistics - Methodology ,Statistics - Applications ,Statistics - Computation ,Statistics - Machine Learning - Abstract
The Wallenius distribution is a generalisation of the Hypergeometric distribution where weights are assigned to balls of different colours. This naturally defines a model for ranking categories which can be used for classification purposes. Since, in general, the resulting likelihood is not analytically available, we adopt an approximate Bayesian computational (ABC) approach for estimating the importance of the categories. We illustrate the performance of the estimation procedure on simulated datasets. Finally, we use the new model for analysing two datasets about movies ratings and Italian academic statisticians' journal preferences. The latter is a novel dataset collected by the authors., Comment: 3 figures
- Published
- 2017
24. Modelling and computation using NCoRM mixtures for density regression
- Author
-
Griffin, Jim and Leisen, Fabrizio
- Subjects
Statistics - Methodology ,Statistics - Applications ,Statistics - Computation ,Statistics - Machine Learning - Abstract
Normalized compound random measures are flexible nonparametric priors for related distributions. We consider building general nonparametric regression models using normalized compound random measure mixture models. Posterior inference is made using a novel pseudo-marginal Metropolis-Hastings sampler for normalized compound random measure mixture models. The algorithm makes use of a new general approach to the unbiased estimation of Laplace functionals of compound random measures (which includes completely random measures as a special case). The approach is illustrated on problems of density regression.
- Published
- 2016
25. Objective Bayesian modelling of insurance risks with the skewed Student-t distribution
- Author
-
Leisen, Fabrizio, Marin, Juan Miguel, and Villa, Cristiano
- Subjects
Statistics - Methodology - Abstract
Insurance risks data typically exhibit skewed behaviour. In this paper, we propose a Bayesian approach to capture the main features of these datasets. This work extends the methodology introduced in Villa and Walker (2014a) by considering an extra parameter which captures the skewness of the data. In particular, a skewed Student-t distribution is considered. Two datasets are analysed: the Danish fire losses and the US indemnity loss. The analysis is carried with an objective Bayesian approach. For the discrete parameter representing the number of the degrees of freedom, we adopt a novel prior recently introduced in Villa and Walker (2014b).
- Published
- 2016
26. A Note on the Posterior Inference for the Yule-Simon Distribution
- Author
-
Leisen, Fabrizio, Rossini, Luca, and Villa, Cristiano
- Subjects
Statistics - Methodology ,Statistics - Applications - Abstract
The Yule--Simon distribution has been out of the radar of the Bayesian community, so far. In this note, we propose an explicit Gibbs sampling scheme when a Gamma prior is chosen for the shape parameter. The performance of the algorithm is illustrated with simulation studies, including count data regression, and a real data application to text analysis. We compare our proposal to the frequentist counterparts showing better performance of our algorithm when a small sample size is considered., Comment: Forthcoming in the "Journal of Statistical Computation and Simulation" - 12 pages, 4 Figures, 3 Tables
- Published
- 2016
- Full Text
- View/download PDF
27. Objective Bayesian Analysis of the Yule-Simon Distribution with Applications
- Author
-
Leisen, Fabrizio, Rossini, Luca, and Villa, Cristiano
- Subjects
Statistics - Methodology ,Statistics - Computation - Abstract
The Yule-Simon distribution is usually employed in the analysis of frequency data. As the Bayesian literature, so far, ignored this distribution, here we show the derivation of two objective priors for the parameter of the Yule-Simon distribution. In particular, we discuss the Jeffreys prior and a loss-based prior, which has recently appeared in the literature. We illustrate the performance of the derived priors through a simulation study and the analysis of real datasets., Comment: 24 pages, 11 Figures, 7 Tables
- Published
- 2016
- Full Text
- View/download PDF
28. Bayesian Nonparametric Conditional Copula Estimation of Twin Data
- Author
-
Valle, Luciana Dalla, Leisen, Fabrizio, and Rossini, Luca
- Subjects
Statistics - Methodology ,Statistics - Applications - Abstract
Several studies on heritability in twins aim at understanding the different contribution of environmental and genetic factors to specific traits. Considering the National Merit Twin Study, our purpose is to correctly analyse the influence of the socioeconomic status on the relationship between twins' cognitive abilities. Our methodology is based on conditional copulas, which allow us to model the effect of a covariate driving the strength of dependence between the main variables. We propose a flexible Bayesian nonparametric approach for the estimation of conditional copulas, which can model any conditional copula density. Our methodology extends the work of Wu et al (2015) by introducing dependence from a covariate in an infinite mixture model. Our results suggest that environmental factors are more influential in families with lower socio-economic position., Comment: Forthcoming in Journal of the Royal Statistical Society (Series C)
- Published
- 2016
29. Compound vectors of subordinators and their associated positive Lévy copulas
- Author
-
Riva-Palacio, Alan and Leisen, Fabrizio
- Published
- 2021
- Full Text
- View/download PDF
30. Embarrassingly Parallel Sequential Markov-chain Monte Carlo for Large Sets of Time Series
- Author
-
Casarin, Roberto, Craiu, Radu V., and Leisen, Fabrizio
- Subjects
Statistics - Computation ,Statistics - Applications - Abstract
Bayesian computation crucially relies on Markov chain Monte Carlo (MCMC) algorithms. In the case of massive data sets, running the Metropolis-Hastings sampler to draw from the posterior distribution becomes prohibitive due to the large number of likelihood terms that need to be calculated at each iteration. In order to perform Bayesian inference for a large set of time series, we consider an algorithm that combines 'divide and conquer" ideas previously used to design MCMC algorithms for big data with a sequential MCMC strategy. The performance of the method is illustrated using a large set of financial data.
- Published
- 2015
31. A Bootstrap Likelihood approach to Bayesian Computation
- Author
-
Zhu, Weixuan, Marin, Juan Miguel, and Leisen, Fabrizio
- Subjects
Statistics - Methodology - Abstract
There is an increasing amount of literature focused on Bayesian computational methods to address problems with intractable likelihood. One approach is a set of algorithms known as Approximate Bayesian Computational (ABC) methods. One of the problems of these algorithms is that the performance depends on the tuning of some parameters, such as the summary statistics, distance and tolerance level. To bypass this problem, Mengersen, Pudlo and Robert (2013) introduced an alternative method based on empirical likelihood, which can be easily implemented when a set of constraints, related to the moments of the distribution, is known. However, the choice of the constraints is sometimes challenging. To overcome this problem, we propose an alternative method based on a bootstrap likelihood approach. The method is easy to implement and in some cases it is faster than the other approaches. The performance of the algorithm is illustrated with examples in Population Genetics, Time Series and Stochastic Differential Equations. Finally, we test the method on a real dataset.
- Published
- 2015
32. Merging exchangeable occupancy models: $\mathcal{M}^{(a)}$- models and relation with the maximum entropy principle
- Author
-
Collet, Francesca, Leisen, Fabrizio, and Spizzichino, Fabio
- Subjects
Mathematics - Probability - Abstract
In this paper a new transformation of occupancy models, called merging, is introduced. In particular, it will be studied the effect of merging on a class of occupancy models that was recently introduced in Collet et al (2013). These results have an interesting interpretation in the so-called entropy maximization inference. The last part of the paper is devoted to highlight the impact of our findings in this research area., Comment: 16 pages, 1 figure
- Published
- 2014
33. Compound random measures and their use in Bayesian nonparametrics
- Author
-
Griffin, Jim E. and Leisen, Fabrizio
- Subjects
Statistics - Methodology - Abstract
A new class of dependent random measures which we call {\it compound random measures} are proposed and the use of normalized versions of these random measures as priors in Bayesian nonparametric mixture models is considered. Their tractability allows the properties of both compound random measures and normalized compound random measures to be derived. In particular, we show how compound random measures can be constructed with gamma, $\sigma$-stable and generalized gamma process marginals. We also derive several forms of the Laplace exponent and characterize dependence through both the L\'evy copula and correlation function. A slice sampler and an augmented P\'olya urn scheme sampler are described for posterior inference when a normalized compound random measure is used as the mixing measure in a nonparametric mixture model and a data example is discussed.
- Published
- 2014
34. A Bayesian Beta Markov Random Field Calibration of the Term Structure of Implied Risk Neutral Densities
- Author
-
Casarin, Roberto, Leisen, Fabrizio, Molina, German, and ter Horst, Enrique
- Subjects
Statistics - Applications ,Quantitative Finance - Computational Finance - Abstract
We build on the work in Fackler and King 1990, and propose a more general calibration model for implied risk neutral densities. Our model allows for the joint calibration of a set of densities at different maturities and dates through a Bayesian dynamic Beta Markov Random Field. Our approach allows for possible time dependence between densities with the same maturity, and for dependence across maturities at the same point in time. This approach to the problem encompasses model flexibility, parameter parsimony and, more importantly, information pooling across densities., Comment: 27 pages, 4 figures
- Published
- 2014
35. Loss-based approach to two-piece location-scale distributions with applications to dependent data
- Author
-
Leisen, Fabrizio, Rossini, Luca, and Villa, Cristiano
- Published
- 2020
- Full Text
- View/download PDF
36. New isometry of Krall-Laguerre orthogonal polynomials in martingale spaces
- Author
-
Huertas, Edmundo J., Torrado, Nuria, and Leisen, Fabrizio
- Subjects
Mathematics - Probability ,60G46, 42C05, 60G51, 33C47 - Abstract
Sets of orthogonal martingales are importants because they can be used as stochastic integrators in a kind of chaotic representation property, see [20]. In this paper, we revisited the problem studied by W. Schoutens in [21], investigating how an inner product derived from an Uvarov transformation of the Laguerre weight function is used in the orthogonalization procedure of a sequence of martingales related to a certain L\'evy process, called Teugels Martingales. Since the Uvarov transformation depends by a c<0, we are able to provide infinite sets of strongly orthogonal martingales, each one for every c in (-infty,0). In a similar fashion of [21], we introduce a suitable isometry between the space of polynomials and the space of linear combinations of Teugels martingales as well as the general orthogonalization procedure. Finally, the new construction is applied to the Gamma process., Comment: 13 pages, no figures. This paper has been withdrawn by the authors due to one error in the proof of the isometry
- Published
- 2013
37. Exchangeable Occupancy Models and Discrete Processes with the Generalized Uniform Order Statistics Property
- Author
-
Collet, Francesca, Leisen, Fabrizio, Spizzichino, Fabio, and Suter, Florentina
- Subjects
Mathematics - Probability ,Mathematics - Statistics Theory ,62G30, 60G09, 60G55 - Abstract
This work focuses on Exchangeable Occupancy Models (EOM) and their relations with the Uniform Order Statistics Property (UOSP) for point processes in discrete time. As our main purpose, we show how definitions and results presented in Shaked, Spizzichino and Suter (2004) can be unified and generalized in the frame of occupancy models. We first show some general facts about EOM's. Then we introduce a class of EOM's, called $\mathcal{M}^{(a)}$-models, and a concept of generalized Uniform Order Statistics Property in discrete time. For processes with this property, we prove a general characterization result in terms of $\mathcal{M}^{(a)}$-models. Our interest is also focused on properties of closure w.r.t. some natural transformations of EOM's., Comment: 27 pages
- Published
- 2011
38. Beta-Product Poisson-Dirichlet Processes
- Author
-
Bassetti, Federico, Casarin, Roberto, and Leisen, Fabrizio
- Subjects
Mathematics - Statistics Theory ,Mathematics - Probability ,Statistics - Computation - Abstract
Time series data may exhibit clustering over time and, in a multiple time series context, the clustering behavior may differ across the series. This paper is motivated by the Bayesian non--parametric modeling of the dependence between the clustering structures and the distributions of different time series. We follow a Dirichlet process mixture approach and introduce a new class of multivariate dependent Dirichlet processes (DDP). The proposed DDP are represented in terms of vector of stick-breaking processes with dependent weights. The weights are beta random vectors that determine different and dependent clustering effects along the dimension of the DDP vector. We discuss some theoretical properties and provide an efficient Monte Carlo Markov Chain algorithm for posterior computation. The effectiveness of the method is illustrated with a simulation study and an application to the United States and the European Union industrial production indexes.
- Published
- 2011
39. Free Completely Random Measures
- Author
-
Collet, Francesca and Leisen, Fabrizio
- Subjects
Mathematics - Probability ,60G57, 60E07, 46L54 - Abstract
In this paper a free analogous of completely random measure is introduced. Furthermore, a representation theorem is proved for free completely random measures that are free infinitely divisible., Comment: The manuscript has been superseded by arXiv article 2007.05336
- Published
- 2011
40. Generalized Species Sampling Priors with Latent Beta reinforcements
- Author
-
Airoldi, Edoardo M., Costa, Thiago, Bassetti, Federico, Leisen, Fabrizio, and Guindani, Michele
- Subjects
Mathematics - Statistics Theory ,Computer Science - Learning ,Statistics - Methodology - Abstract
Many popular Bayesian nonparametric priors can be characterized in terms of exchangeable species sampling sequences. However, in some applications, exchangeability may not be appropriate. We introduce a {novel and probabilistically coherent family of non-exchangeable species sampling sequences characterized by a tractable predictive probability function with weights driven by a sequence of independent Beta random variables. We compare their theoretical clustering properties with those of the Dirichlet Process and the two parameters Poisson-Dirichlet process. The proposed construction provides a complete characterization of the joint process, differently from existing work. We then propose the use of such process as prior distribution in a hierarchical Bayes modeling framework, and we describe a Markov Chain Monte Carlo sampler for posterior inference. We evaluate the performance of the prior and the robustness of the resulting inference in a simulation study, providing a comparison with popular Dirichlet Processes mixtures and Hidden Markov Models. Finally, we develop an application to the detection of chromosomal aberrations in breast cancer by leveraging array CGH data., Comment: For correspondence purposes, Edoardo M. Airoldi's email is airoldi@fas.harvard.edu; Federico Bassetti's email is federico.bassetti@unipv.it; Michele Guindani's email is mguindani@mdanderson.org ; Fabrizo Leisen's email is fabrizio.leisen@gmail.com. To appear in the Journal of the American Statistical Association
- Published
- 2010
41. Interacting Multiple Try Algorithms with Different Proposal Distributions
- Author
-
Casarin, Roberto, Craiu, Radu V., and Leisen, Fabrizio
- Subjects
Statistics - Computation - Abstract
We propose a new class of interacting Markov chain Monte Carlo (MCMC) algorithms designed for increasing the efficiency of a modified multiple-try Metropolis (MTM) algorithm. The extension with respect to the existing MCMC literature is twofold. The sampler proposed extends the basic MTM algorithm by allowing different proposal distributions in the multiple-try generation step. We exploit the structure of the MTM algorithm with different proposal distributions to naturally introduce an interacting MTM mechanism (IMTM) that expands the class of population Monte Carlo methods. We show the validity of the algorithm and discuss the choice of the selection weights and of the different proposals. We provide numerical studies which show that the new algorithm can perform better than the basic MTM algorithm and that the interaction mechanism allows the IMTM to efficiently explore the state space.
- Published
- 2010
- Full Text
- View/download PDF
42. Limiting behavior of the search cost distribution for the move-to-front rule in the stable case
- Author
-
Leisen, Fabrizio, Lijoi, Antonio, and Paroissin, Christian
- Subjects
Mathematics - Probability - Abstract
Move-to-front rule is a heuristic updating a list of n items according to requests. Items are required with unknown probabilities (or popularities). The induced Markov chain is known to be ergodic. One main problem is the study of the distribution of the search cost dened as the position of the required item. Here we first establish the link between two recent papers that both extend results proved by Kingman on the expected stationary search cost. Combining results contained in these papers, we obtain the limiting behavior for any moments of the stationary seach cost as n tends to innity.
- Published
- 2010
43. Bayesian Predictive Inference Without a Prior
- Author
-
Berti, Patrizia, primary, Dreassi, Emanuela, additional, Leisen, Fabrizio, additional, Pratelli, Luca, additional, and Rigo, Pietro, additional
- Published
- 2024
- Full Text
- View/download PDF
44. Modelling preference data with the Wallenius distribution
- Author
-
Grazian, Clara, Leisen, Fabrizio, and Liseo, Brunero
- Published
- 2019
45. Generalized species sampling priors with latent Beta reinforcements.
- Author
-
Airoldi, Edoardo M, Costa, Thiago, Bassetti, Federico, Leisen, Fabrizio, and Guindani, Michele
- Subjects
Bayesian non-parametrics ,Cancer ,Genomics ,MCMC ,Predictive Probability Functions ,Random Partitions ,Species Sampling Priors ,Bayesian nonparametrics ,Predictive probability functions ,Random partitions ,math.ST ,cs.LG ,stat.ME ,stat.TH ,Statistics & Probability ,Statistics ,Econometrics ,Demography - Abstract
Many popular Bayesian nonparametric priors can be characterized in terms of exchangeable species sampling sequences. However, in some applications, exchangeability may not be appropriate. We introduce a novel and probabilistically coherent family of non-exchangeable species sampling sequences characterized by a tractable predictive probability function with weights driven by a sequence of independent Beta random variables. We compare their theoretical clustering properties with those of the Dirichlet Process and the two parameters Poisson-Dirichlet process. The proposed construction provides a complete characterization of the joint process, differently from existing work. We then propose the use of such process as prior distribution in a hierarchical Bayes modeling framework, and we describe a Markov Chain Monte Carlo sampler for posterior inference. We evaluate the performance of the prior and the robustness of the resulting inference in a simulation study, providing a comparison with popular Dirichlet Processes mixtures and Hidden Markov Models. Finally, we develop an application to the detection of chromosomal aberrations in breast cancer by leveraging array CGH data.
- Published
- 2014
46. Bayesian loss-based approach to change point analysis
- Author
-
Hinoveanu, Laurentiu C., Leisen, Fabrizio, and Villa, Cristiano
- Published
- 2019
- Full Text
- View/download PDF
47. Conditionally identically distributed species sampling sequences
- Author
-
Bassetti, Federico, Crimaldi, Irene, and Leisen, Fabrizio
- Subjects
Mathematics - Probability - Abstract
Conditional identity in distribution (Berti et al. (2004)) is a new type of dependence for random variables, which generalizes the well-known notion of exchangeability. In this paper, a class of random sequences, called Generalized Species Sampling Sequences, is defined and a condition to have conditional identity in distribution is given. Moreover, a class of generalized species sampling sequences that are conditionally identically distributed is introduced and studied: the Generalized Ottawa sequences (GOS). This class contains a '`randomly reinforced'' version of the P\'olya urn and of the Blackwell-MacQueen urn scheme. For the empirical means and the predictive means of a GOS, we prove two convergence results toward suitable mixtures of Gaussian distributions. The first one is in the sense of stable convergence and the second one in the sense of almost sure conditional convergence. In the last part of the paper we study the length of the partition induced by a GOS at time $n$, i.e. the random number of distinct values of a GOS until time $n$. Under suitable conditions, we prove a strong law of large numbers and a central limit theorem in the sense of stable convergence. All the given results in the paper are accompanied by some examples.
- Published
- 2008
48. Bayesian non-parametric conditional copula estimation of twin data
- Author
-
Valle, Luciana Dalla, Leisen, Fabrizio, and Rossini, Luca
- Published
- 2018
49. Compound random measures and their use in Bayesian non-parametrics
- Author
-
Griffin, Jim E. and Leisen, Fabrizio
- Published
- 2017
50. Species Sampling Priors for Modeling Dependence: An Application to the Detection of Chromosomal Aberrations
- Author
-
Bassetti, Federico, Leisen, Fabrizio, Airoldi, Edoardo, Guindani, Michele, Datta, Somnath, Editor-in-chief, Viens, Frederi G., Series editor, Politis, Dimitris N., Series editor, Oja, Hannu, Series editor, Daniels, Michael, Series editor, Mitra, Riten, editor, and Müller, Peter, editor
- Published
- 2015
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.