Author: "Roel Verbelen" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Roel Verbelen"' showing total 16 results

Start Over Author "Roel Verbelen"

16 results on '"Roel Verbelen"'

1. Sparse regression with Multi-type Regularized Feature modeling

Author: Katrien Antonio, Roel Verbelen, Sander Devriendt, and Tom Reynkens
Subjects: Statistics and Probability, Generalized linear model, FOS: Computer and information sciences, Economics and Econometrics, Mathematical optimization, Level fusion, Optimization problem, Estimation theory, business.industry, Computer science, Regular polygon, 010103 numerical & computational mathematics, 01 natural sciences, Regularization (mathematics), Statistics - Computation, Methodology (stat.ME), 010104 statistics & probability, Analytics, 0101 mathematics, Statistics, Probability and Uncertainty, business, Statistics - Methodology, Computation (stat.CO), Sparse regression
Abstract: Within the statistical and machine learning literature, regularization techniques are often used to construct sparse (predictive) models. Most regularization strategies only work for data where all predictors are treated identically, such as Lasso regression for (continuous) predictors treated as linear effects. However, many predictive problems involve different types of predictors and require a tailored regularization term. We propose a multi-type Lasso penalty that acts on the objective function as a sum of subpenalties, one for each type of predictor. As such, we allow for predictor selection and level fusion within a predictor in a data-driven way, simultaneous with the parameter estimation process. We develop a new estimation strategy for convex predictive models with this multi-type penalty. Using the theory of proximal operators, our estimation procedure is computationally efficient, partitioning the overall optimization problem into easier to solve subproblems, specific for each predictor type and its associated penalty. Earlier research applies approximations to non-differentiable penalties to solve the optimization problem. The proposed SMuRF algorithm removes the need for approximations and achieves a higher accuracy and computational efficiency. This is demonstrated with an extensive simulation study and the analysis of a case-study on insurance pricing analytics.
Published: 2021

2. Unravelling the predictive power of telematics data in car insurance pricing

Author: Roel Verbelen, Gerda Claeskens, Katrien Antonio, Actuarial Science & Mathematical Finance (ASE, FEB), and Faculteit Economie en Bedrijfskunde
Subjects: Statistics and Probability, Computer science, Pay-as-you-drive insurance, Risk classification, 01 natural sciences, Generalized additive models, Postal Code, 010104 statistics & probability, 0502 economics and business, Telematics, 0101 mathematics, Usage-based insurance, Black box (phreaking), 050210 logistics & transportation, Actuarial science, business.industry, 05 social sciences, Generalized additive model, Statistical model, Product (business), Variable (computer science), Predictive power, Structural zeros, Statistics, Probability and Uncertainty, business, Compositional predictors
Abstract: A data set from a Belgian telematics product aimed at young drivers is used to identify how car insurance premiums can be designed based on the telematics data collected by a black box installed in the vehicle. In traditional pricing models for car insurance, the premium depends on self-reported rating variables (e.g. age, postal code) which capture characteristics of the policy(holder) and the insured vehicle and are often only indirectly related to the accident risk. Using telematics technology enables tailor-made car insurance pricing based on the driving behavior of the policyholder. We develop a statistical modeling approach using generalized additive models and compositional predictors to quantify and interpret the effect of telematics variables on the expected claim frequency. We find that such variables increase the predictive power and render the use of gender as a rating variable redundant. ispartof: Journal of the Royal Statistical Society: Series C (Applied Statistics) vol:67, part 5 issue:5 pages:1275-1304 status: published
Published: 2018

3. Modeling the number of hidden events subject to observation delay

Author: Katrien Antonio, Roel Verbelen, Jonas Crevecoeur, Actuarial Science & Mathematical Finance (ASE, FEB), ASE Other Research, and Faculteit Economie en Bedrijfskunde
Subjects: Information Systems and Management, General Computer Science, Computer science, 0211 other engineering and technologies, 02 engineering and technology, Management Science and Operations Research, Industrial and Manufacturing Engineering, FOS: Economics and business, 0502 economics and business, Subject (grammar), Econometrics, Occurrence of events, Observation delay, Risk management, health care economics and organizations, Event (probability theory), 050210 logistics & transportation, 021103 operations research, business.industry, 05 social sciences, Liability insurance, Calendar day effects, Data analytics, Modeling and Simulation, Risk Management (q-fin.RM), business, Quantitative Finance - Risk Management
Abstract: This paper considers the problem of predicting the number of events that have occurred in the past, but which are not yet observed due to a delay. Such delayed events are relevant in predicting the future cost of warranties, pricing maintenance contracts, determining the number of unreported claims in insurance and in modeling the outbreak of diseases. Disregarding these unobserved events results in a systematic underestimation of the event occurrence process. Our approach puts emphasis on modeling the time between the occurrence and observation of the event, the so-called observation delay. We propose a granular model for the heterogeneity in this observation delay based on the occurrence day of the event and on calendar day effects in the observation process, such as weekday and holiday effects. We illustrate this approach on a European general liability insurance data set where the occurrence of an accident is reported to the insurer with delay. ispartof: EUROPEAN JOURNAL OF OPERATIONAL RESEARCH vol:277 issue:3 pages:930-944 status: published
Published: 2019

4. Boosting insights in insurance tariff plans with tree-based machine learning methods

Author: Katrien Antonio, Marie-Pier Côté, Roel Verbelen, and Roel Henckaerts
Subjects: Statistics and Probability, Generalized linear model, FOS: Computer and information sciences, Computer Science - Machine Learning, Economics and Econometrics, Boosting (machine learning), business.industry, Computer science, Tariff, Machine learning, computer.software_genre, Statistics - Applications, Machine Learning (cs.LG), Data analysis, Applications (stat.AP), Tree based, Artificial intelligence, Statistics, Probability and Uncertainty, business, computer
Abstract: Pricing actuaries typically operate within the framework of generalized linear models (GLMs). With the upswing of data analytics, our study puts focus on machine learning methods to develop full tariff plans built from both the frequency and severity of claims. We adapt the loss functions used in the algorithms such that the specific characteristics of insurance data are carefully incorporated: highly unbalanced count data with excess zeros and varying exposure on the frequency side combined with scarce but potentially long-tailed data on the severity side. A key requirement is the need for transparent and interpretable pricing models that are easily explainable to all stakeholders. We therefore focus on machine learning with decision trees: Starting from simple regression trees, we work toward more advanced ensembles such as random forests and boosted trees. We show how to choose the optimal tuning parameters for these models in an elaborate cross-validation scheme. In addition, we present visualization tools to obtain insights from the resulting models, and the economic value of these new modeling approaches is evaluated. Boosted trees outperform the classical GLMs, allowing the insurer to form profitable portfolios and to guard against potential adverse risk selection.
Published: 2019
Full Text: View/download PDF

5. Modeling the occurrence of events subject to a reporting delay via an EM algorithm

Author: Roel Verbelen, Katrien Antonio, Gerda Claeskens, and Jonas Crevecoeur
Subjects: Statistics and Probability, Methodology (stat.ME), FOS: Computer and information sciences, General Mathematics, Applications (stat.AP), Statistics, Probability and Uncertainty, Statistics - Applications, Statistics - Methodology
Abstract: A delay between the occurrence and the reporting of events often has practical implications such as for the amount of capital to hold for insurance companies, or for taking preventive actions in case of infectious diseases. The accurate estimation of the number of incurred but not (yet) reported events forms an essential part of properly dealing with this phenomenon. We review the current practice for analysing such data and we present a flexible regression framework to jointly estimate the occurrence and reporting of events. By linking this setting to an incomplete data problem, estimation is performed via an expectation-maximization algorithm. The resulting method is elegant, easy to understand and implement, and provides refined insights in the nowcasts. The proposed methodology is applied to a European general liability portfolio in insurance.
Published: 2019
Full Text: View/download PDF

6. A data driven binning strategy for the construction of insurance tariff classes

Author: Maxime Clijsters, Roel Henckaerts, Katrien Antonio, Roel Verbelen, Faculteit Economie en Bedrijfskunde, and Actuarial Science & Mathematical Finance (ASE, FEB)
Subjects: Statistics and Probability, Generalized linear model, Economics and Econometrics, 050208 finance, business.industry, 05 social sciences, Generalized additive model, Tariff, Statistical model, Risk factor (finance), 01 natural sciences, 010104 statistics & probability, 0502 economics and business, Covariate, Statistics, Econometrics, Economics, 0101 mathematics, Statistics, Probability and Uncertainty, business, Categorical variable, Risk management
Abstract: We present a fully data driven strategy to incorporate continuous risk factors and geographical information in an insurance tariff. A framework is developed that aligns flexibility with the practical requirements of an insurance company, the policyholder and the regulator. Our strategy is illustrated with an example from property and casualty (P&C) insurance, namely a motor insurance case study. We start by fitting generalized additive models (GAMs) to the number of reported claims and their corresponding severity. These models allow for flexible statistical modeling in the presence of different types of risk factors: categorical, continuous, and spatial risk factors. The goal is to bin the continuous and spatial risk factors such that categorical risk factors result which captures the effect of the covariate on the response in an accurate way, while being easy to use in a generalized linear model (GLM). This is in line with the requirement of an insurance company to construct a practical and interpretable tariff that can be explained easily to stakeholders. We propose to bin the spatial risk factor using Fisher’s natural breaks algorithm and the continuous risk factors using evolutionary trees. GLMs are fitted to the claims data with the resulting categorical risk factors. We find that the resulting GLMs approximate the original GAMs closely, and lead to a very similar premium structure.
Published: 2018

7. Multivariate mixtures of Erlangs for density estimation under censoring

Author: Gerda Claeskens, Roel Verbelen, Katrien Antonio, and Actuarial Science & Mathematical Finance (ASE, FEB)
Subjects: Multivariate statistics, Multivariate analysis, Computer science, Initialization, computer.software_genre, 01 natural sciences, Shape parameter, Expectation-maximization algorithm, 010104 statistics & probability, 0502 economics and business, Expectation–maximization algorithm, Humans, Multivariate mixtures of Erlangs with a common scale parameter, 0101 mathematics, Likelihood Functions, 050208 finance, Applied Mathematics, 05 social sciences, General Medicine, Density estimation, Erlang (unit), Censoring (statistics), Multivariate Analysis, Censored data, Data mining, computer, Algorithm, Algorithms, Maximum likelihood
Abstract: Multivariate mixtures of Erlang distributions form a versatile, yet analytically tractable, class of distributions making them suitable for multivariate density estimation. We present a flexible and effective fitting procedure for multivariate mixtures of Erlangs, which iteratively uses the EM algorithm, by introducing a computationally efficient initialization and adjustment strategy for the shape parameter vectors. We furthermore extend the EM algorithm for multivariate mixtures of Erlangs to be able to deal with randomly censored and fixed truncated data. The effectiveness of the proposed algorithm is demonstrated on simulated as well as real data sets. ispartof: Lifetime Data Analysis vol:22 issue:3 pages:429-455 ispartof: location:United States status: published
Published: 2015
Full Text: View/download PDF

8. FITTING MIXTURES OF ERLANGS TO CENSORED AND TRUNCATED DATA USING THE EM ALGORITHM

Author: Andrei L. Badescu, Katrien Antonio, Roel Verbelen, Lan Gong, Sheldon Lin, and Actuarial Science & Mathematical Finance (ASE, FEB)
Subjects: Insurance claims, Economics and Econometrics, Computer science, Accounting, Maximum likelihood, Simulated data, Expectation–maximization algorithm, Truncation (statistics), Algorithm, Censoring (statistics), Finance
Abstract: Modeling data on claim sizes is crucial when pricing insurance products. Such loss models require on the one hand the flexibility of nonparametric density estimation techniques to describe the insurance losses and on the other hand the feasibility to analytically quantify the risk. Mixtures of Erlang distributions with a common scale are very versatile as they are dense in the space of distributions on ℝ+ (Tijms (1994, p. 163)). At the same time, it is possible to work analytically with this kind of distributions. Closed-form expressions of quantities of interest, such as the Value-at-Risk (VaR) and the Tail-Value-at-Risk (TVaR), can be derived as well as appealing closure properties (Lee and Lin (2010), Willmot and Lin (2011) and Klugman et al. (2012)). In particular, using these distributions in aggregate loss models leads to an analytical form of the corresponding aggregate loss distribution which avoids the need for simulations to evaluate the model. In actuarial science, claim severity data is often censored and/or truncated due to policy modifications such as deductibles and policy limits. Lee and Lin (2010) formulate a calibration technique based on the EM algorithm for fitting mixtures of Erlangs with a common scale parameter to complete data. Here, we construct an adjusted EM algorithm which is able to deal with censored and truncated data, inspired by McLachlan and Peel (2001) and Lee and Scott (2012). Using the developed R program, we demonstrate the strength of mixtures of Erlangs to approximate skew, bimodal, moderately heavy tailed or even flat densities based on simulated censored and truncated samples. ispartof: EUROPEAN JOURNAL OF PHARMACEUTICAL SCIENCES vol:45 issue:3 pages:729-758 ispartof: location:Leuven (Belgium) status: published
Published: 2015
Full Text: View/download PDF

9. Modelling censored losses using splicing: a global fit strategy with mixed Erlang and extreme value distributions

Author: Katrien Antonio, Jan Beirlant, Roel Verbelen, Tom Reynkens, Faculteit Economie en Bedrijfskunde, and Actuarial Science & Mathematical Finance (ASE, FEB)
Subjects: FOS: Computer and information sciences, Statistics and Probability, Economics and Econometrics, Mathematical optimization, 050208 finance, Computer science, Financial risk, 05 social sciences, Statistics - Applications, 01 natural sciences, Erlang (unit), Methodology (stat.ME), 010104 statistics & probability, symbols.namesake, Heavy-tailed distribution, 0502 economics and business, RNA splicing, Expectation–maximization algorithm, symbols, Applications (stat.AP), Pareto distribution, 0101 mathematics, Statistics, Probability and Uncertainty, Extreme value theory, Statistics - Methodology
Abstract: In risk analysis, a global fit that appropriately captures the body and the tail of the distribution of losses is essential. Modelling the whole range of the losses using a standard distribution is usually very hard and often impossible due to the specific characteristics of the body and the tail of the loss distribution. A possible solution is to combine two distributions in a splicing model: a light-tailed distribution for the body which covers light and moderate losses, and a heavy-tailed distribution for the tail to capture large losses. We propose a splicing model with a mixed Erlang (ME) distribution for the body and a Pareto distribution for the tail. This combines the flexibility of the ME distribution with the ability of the Pareto distribution to model extreme values. We extend our splicing approach for censored and/or truncated data. Relevant examples of such data can be found in financial risk analysis. We illustrate the flexibility of this splicing model using practical examples from risk measurement.
Published: 2017

10. A Data Driven Binning Strategy for the Construction of Insurance Tariff Classes

Author: Henckaerts, Roel, primary, Antonio, Katrien, additional, Clijsters, Maxime, additional, and Roel, Verbelen, additional
Published: 2017
Full Text: View/download PDF

11. Multivariate Mixtures of Erlangs for Density Estimation Under Censoring and Truncation: Additional Examples

Author: Roel Verbelen, Gerda Claeskens, and Katrien Antonio
Subjects: Multivariate statistics, Censoring (clinical trials), Statistics, Addendum, Truncation (statistics), Density estimation, Mathematics
Abstract: In this addendum to Verbelen et al. (2015), we present several additional examples of the calibration procedure for fitting multivariate mixtures of Erlangs to censored and truncated data.
Published: 2015
Full Text: View/download PDF

12. Unraveling the Predictive Power of Telematics Data in Car Insurance Pricing

Author: Roel, Verbelen, primary and Antonio, Katrien, additional
Published: 2016
Full Text: View/download PDF

13. Modeling Censored Losses Using Splicing: A Global Fit Strategy with Mixed Erlang and Extreme Value Distributions

Author: Reynkens, Tom, primary, Roel, Verbelen, additional, Beirlant, Jan, additional, and Antonio, Katrien, additional
Published: 2016
Full Text: View/download PDF

14. Multivariate Mixtures of Erlangs for Density Estimation Under Censoring and Truncation

Author: Roel, Verbelen, primary, Antonio, Katrien, additional, and Claeskens, Gerda, additional
Published: 2014
Full Text: View/download PDF

15. Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm

Author: Antonio, Katrien, primary, Badescu, Andrei L, additional, Lan, Gong, additional, Lin, X. Sheldon, additional, and Roel, Verbelen, additional
Published: 2014
Full Text: View/download PDF

16. Fitting Mixtures of Erlangs to Uncensored and Untruncated Data Using the EM Algorithm - Addendum

Author: Roel, Verbelen, primary
Published: 2013
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

16 results on '"Roel Verbelen"'

1. Sparse regression with Multi-type Regularized Feature modeling

2. Unravelling the predictive power of telematics data in car insurance pricing

3. Modeling the number of hidden events subject to observation delay

4. Boosting insights in insurance tariff plans with tree-based machine learning methods

5. Modeling the occurrence of events subject to a reporting delay via an EM algorithm

6. A data driven binning strategy for the construction of insurance tariff classes

7. Multivariate mixtures of Erlangs for density estimation under censoring

8. FITTING MIXTURES OF ERLANGS TO CENSORED AND TRUNCATED DATA USING THE EM ALGORITHM

9. Modelling censored losses using splicing: a global fit strategy with mixed Erlang and extreme value distributions

10. A Data Driven Binning Strategy for the Construction of Insurance Tariff Classes

11. Multivariate Mixtures of Erlangs for Density Estimation Under Censoring and Truncation: Additional Examples

12. Unraveling the Predictive Power of Telematics Data in Car Insurance Pricing

13. Modeling Censored Losses Using Splicing: A Global Fit Strategy with Mixed Erlang and Extreme Value Distributions

14. Multivariate Mixtures of Erlangs for Density Estimation Under Censoring and Truncation

15. Fitting Mixtures of Erlangs to Censored and Truncated Data Using the EM Algorithm

16. Fitting Mixtures of Erlangs to Uncensored and Untruncated Data Using the EM Algorithm - Addendum

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Journal

Database

Publisher

16 results on '"Roel Verbelen"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources