100 results on '"Parametric bootstrapping"'
Search Results
2. Bayesian parameter estimation using truncated normal distributions as priors for parameters in fundamental models of chemical processes.
- Author
-
Gibson, Lauren A. and McAuley, Kimberley B.
- Abstract
Modellers of chemical processes with knowledge about plausible parameter values use Bayesian parameter estimation methods to account for their prior beliefs. Some modellers specify prior distributions with finite parameter ranges, such as uniform distributions and truncated normal distributions, because they better account for knowledge about realistic parameter ranges than normal prior distributions with parameter values ranging between −∞ and +∞. We derive closed‐form objective functions for Bayesian parameter estimation with truncated normal priors and uniform priors, for the first time, so that parameter estimation can be performed by solving simple optimization problems rather than using complex sampling‐based techniques. A parametric bootstrapping method that considers truncated normal priors and model nonlinearity is proposed to determine 95% confidence intervals and joint confidence regions. A pharmaceutical case study is used to show the effectiveness of the proposed objective functions and bootstrapping methodology. Confidence regions from bootstrapping are similar to linearization‐based confidence regions that do not account for truncation when truncated areas in normal prior distributions are relatively small. More truncation, which corresponds to more‐precise prior knowledge about the parameters, results in smaller joint confidence regions. The proposed methods will be attractive for parameter estimation in complex process models because they can be less computationally intensive than Markov chain Monte Carlo methods that provide similar results. [ABSTRACT FROM AUTHOR]
- Published
- 2025
- Full Text
- View/download PDF
3. Confidence interval estimation for the difference and ratio of the means of two gamma distributions.
- Author
-
Gao, Yi and Tian, Lili
- Subjects
- *
GAMMA distributions , *ENVIRONMENTAL engineering , *ENVIRONMENTAL sciences , *PROBABILITY theory , *STORAGE & moving industry , *CONFIDENCE intervals - Abstract
Gamma distribution is widely used in applied fields due to its flexibility of accommodating right-skewed data. This article explores statistical methods for constructing confidence intervals for both the difference and ratio of two gamma means. We propose several methods based on the concepts of generalized inference, variance estimates recovery (MOVER) and parametric bootstrapping. The performances of proposed methods are evaluated and compared via a comprehensive simulation study. Simulation results show that the hybrid method that combines MOVER with parametric bootstrapping can provide confidence intervals with reasonable coverage probabilities and interval lengths even for parameter settings with small shape parameters. Finally, two real data examples from environmental and engineering studies are analyzed using the proposed methods. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
4. Bootstrap control charts for quantiles based on log‐symmetric distributions with applications to the monitoring of reliability data.
- Author
-
Leiva, Víctor, Santos, Rafael A. dos, Saulo, Helton, Marchant, Carolina, and Lio, Yuhlong
- Subjects
- *
MONTE Carlo method , *QUALITY control charts , *QUANTILES , *LOGNORMAL distribution , *DATA distribution , *WEIBULL distribution - Abstract
In this work, a methodology to monitor a shift in the quantile of a distribution that is a member of the log‐symmetric family is proposed. Because the sampling distribution of a quantile estimator is often not available, the parametric bootstrap method is used to determine this sampling distribution and to establish the control limits when the process measurements follow a log‐symmetric distribution. The mentioned family is helpful for describing the behavior of data following a distribution with positive support and that is skewed to the right. Monte Carlo simulations are carried out to investigate the performance of the proposed bootstrap control charts for quantiles. An application regarding failure data due to stress on carbon fibers is presented for illustration when monitoring reliability data. This illustration shows that non‐conventional models, other than the Birnbaum‐Saunders, log‐normal and Weibull distributions, have potential to be used in practice. Two model selection procedures are considered to assess adequacy to the data. To facilitate the public use of the proposed methodology, we have created an R package named chartslogsym whose main functions are detailed in this paper. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
5. Prediction of rail defect development using parametric bootstrapping modified Weibull equations.
- Author
-
Cronin, John J, Zarembski, Allan M, and Palese, Joseph W
- Abstract
The railroad industry has historically used the 2-Parameter Weibull equation to determine the rate of rail fatigue defect occurrences and to forecast the fatigue life of railroad rail. However, the 2-Parameter Weibull equation has significant limitations to include inability to analyze segments of track with limited number of rail defects. These limitations are addressed through modification of the traditional 2-Parameter Weibull equation with a novel approach developed from Parametric Bootstrapping. The result is a Parametric Bootstrapping modified Weibull (PBW) forecasting approach. This methodology is applied to rail segments with insufficient numbers of defects to allow for appropriate defect forecasting analysis. Thus, the PBW method provides reasonable estimates of the rate of defects for track segments that have little or no prior defect history. This approach allows for more track to be analyzed and forecasts the probability of rail defect occurrence as a function of key parameters such as cumulative traffic over the rail. A validation of the proposed methodology was performed. Comparison of the output results of over 300,000 track segments with over 200,000 rail defects showed a major improvement in percentage of segments with reasonable Weibull parameters (alpha and beta). This percentage increased from 11% of segments using traditional Weibull analysis to 77% of segments using Parametric Bootstrap modified Weibull approach. These results show that the PBW Analysis approach introduced here offers a more accurate and effective approach to determining the probability of developing future rail defects. This provides a benefit to railroads in planning maintenance of their expensive rail assets. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
6. Ensemble bootstrap methodology for forecasting dynamic growth processes using differential equations: application to epidemic outbreaks.
- Author
-
Chowell, Gerardo and Luo, Ruiyan
- Subjects
- *
FORECASTING methodology , *DIFFERENTIAL equations , *NONLINEAR differential equations , *NONLINEAR equations , *EPIDEMICS - Abstract
Background: Ensemble modeling aims to boost the forecasting performance by systematically integrating the predictive accuracy across individual models. Here we introduce a simple-yet-powerful ensemble methodology for forecasting the trajectory of dynamic growth processes that are defined by a system of non-linear differential equations with applications to infectious disease spread.Methods: We propose and assess the performance of two ensemble modeling schemes with different parametric bootstrapping procedures for trajectory forecasting and uncertainty quantification. Specifically, we conduct sequential probabilistic forecasts to evaluate their forecasting performance using simple dynamical growth models with good track records including the Richards model, the generalized-logistic growth model, and the Gompertz model. We first test and verify the functionality of the method using simulated data from phenomenological models and a mechanistic transmission model. Next, the performance of the method is demonstrated using a diversity of epidemic datasets including scenario outbreak data of the Ebola Forecasting Challenge and real-world epidemic data outbreaks of including influenza, plague, Zika, and COVID-19.Results: We found that the ensemble method that randomly selects a model from the set of individual models for each time point of the trajectory of the epidemic frequently outcompeted the individual models as well as an alternative ensemble method based on the weighted combination of the individual models and yields broader and more realistic uncertainty bounds for the trajectory envelope, achieving not only better coverage rate of the 95% prediction interval but also improved mean interval scores across a diversity of epidemic datasets.Conclusion: Our new methodology for ensemble forecasting outcompete component models and an alternative ensemble model that differ in how the variance is evaluated for the generation of the prediction intervals of the forecasts. [ABSTRACT FROM AUTHOR]- Published
- 2021
- Full Text
- View/download PDF
7. partR2: partitioning R2 in generalized linear mixed models
- Author
-
Martin A. Stoffel, Shinichi Nakagawa, and Holger Schielzeth
- Subjects
Semi-partial coefficient of determination ,Generalized linear mixed-effects models ,Variance component analysis ,Structure coefficients ,R2 ,Parametric bootstrapping ,Medicine ,Biology (General) ,QH301-705.5 - Abstract
The coefficient of determination R2 quantifies the amount of variance explained by regression coefficients in a linear model. It can be seen as the fixed-effects complement to the repeatability R (intra-class correlation) for the variance explained by random effects and thus as a tool for variance decomposition. The R2 of a model can be further partitioned into the variance explained by a particular predictor or a combination of predictors using semi-partial (part) R2 and structure coefficients, but this is rarely done due to a lack of software implementing these statistics. Here, we introduce partR2, an R package that quantifies part R2 for fixed effect predictors based on (generalized) linear mixed-effect model fits. The package iteratively removes predictors of interest from the model and monitors the change in the variance of the linear predictor. The difference to the full model gives a measure of the amount of variance explained uniquely by a particular predictor or a set of predictors. partR2 also estimates structure coefficients as the correlation between a predictor and fitted values, which provide an estimate of the total contribution of a fixed effect to the overall prediction, independent of other predictors. Structure coefficients can be converted to the total variance explained by a predictor, here called ‘inclusive’ R2, as the square of the structure coefficients times total R2. Furthermore, the package reports beta weights (standardized regression coefficients). Finally, partR2 implements parametric bootstrapping to quantify confidence intervals for each estimate. We illustrate the use of partR2 with real example datasets for Gaussian and binomial GLMMs and discuss interactions, which pose a specific challenge for partitioning the explained variance among predictors.
- Published
- 2021
- Full Text
- View/download PDF
8. partR2: partitioning R² in generalized linear mixed models.
- Author
-
Stoffel, Martin A., Nakagawa, Shinichi, and Schielzeth, Holger
- Subjects
FIXED effects model ,FORECASTING ,INTRACLASS correlation ,STATISTICAL correlation ,STATISTICAL software ,CONFIDENCE intervals - Abstract
The coefficient of determination R² quantifies the amount of variance explained by regression coefficients in a linear model. It can be seen as the fixed-effects complement to the repeatability R (intra-class correlation) for the variance explained by random effects and thus as a tool for variance decomposition. The R² of a model can be further partitioned into the variance explained by a particular predictor or a combination of predictors using semi-partial (part) R² and structure coefficients, but this is rarely done due to a lack of software implementing these statistics. Here, we introduce partR², an R package that quantifies part R² for fixed effect predictors based on (generalized) linear mixed-effect model fits. The package iteratively removes predictors of interest from the model and monitors the change in the variance of the linear predictor. The difference to the full model gives a measure of the amount of variance explained uniquely by a particular predictor or a set of predictors. partR² also estimates structure coefficients as the correlation between a predictor and fitted values, which provide an estimate of the total contribution of a fixed effect to the overall prediction, independent of other predictors. Structure coefficients can be converted to the total variance explained by a predictor, here called 'inclusive' R², as the square of the structure coefficients times total R². Furthermore, the package reports beta weights (standardized regression coefficients). Finally, partR² implements parametric bootstrapping to quantify confidence intervals for each estimate. We illustrate the use of partR² with real example datasets for Gaussian and binomial GLMMs and discuss interactions, which pose a specific challenge for partitioning the explained variance among predictors. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
9. Is there a low-pay no-pay cycle in Australia? A note on Fok, Scutella and Wilkins (2015).
- Author
-
Cai, Lixin
- Subjects
LABOR market - Abstract
A recent study by Fok et al. (Oxf Bull Econ Stat 77:872–896, 2015) concludes that there is a low-pay no-pay cycle for males and females in the Australian labour market. This note re-estimates the model of that study using the same data. It is found that Fok et al. (2015) conclusion is based on a model specification that assumes zero correlation of unobserved heterogeneity between the different labour force states modelled. The results of this note show that when the zero correlation restriction is relaxed, there is no evidence of a low-pay no-pay cycle for either males or females. It is also found that the marginal effect estimates used in Fok et al. (2015) to draw the low-pay no-pay cycle conclusion for males and females have been estimated imprecisely. Furthermore, contrary to what Fok et al. (2015) have concluded, the results of this note show that there is no evidence on heterogeneity in the low-pay no-pay cycle across the demographic subgroups examined by Fok et al. (2015). [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
10. Statistical comparison of final weight scores in quality function deployment (QFD) studies
- Author
-
Iqbal, Zafar, P. Grigg, Nigel, Govinderaju, K., and Campbell-Allen, Nicola
- Published
- 2014
- Full Text
- View/download PDF
11. Prediction of rail defect development using parametric bootstrapping modified Weibull equations
- Author
-
Joseph W. Palese, Allan M. Zarembski, and John J Cronin
- Subjects
business.industry ,Railroad industry ,Mechanical Engineering ,020207 software engineering ,02 engineering and technology ,Structural engineering ,020303 mechanical engineering & transports ,0203 mechanical engineering ,0202 electrical engineering, electronic engineering, information engineering ,Weibull equation ,Parametric bootstrapping ,business ,Weibull distribution ,Mathematics - Abstract
The railroad industry has historically used the 2-Parameter Weibull equation to determine the rate of rail fatigue defect occurrences and to forecast the fatigue life of railroad rail. However, the 2-Parameter Weibull equation has significant limitations to include inability to analyze segments of track with limited number of rail defects. These limitations are addressed through modification of the traditional 2-Parameter Weibull equation with a novel approach developed from Parametric Bootstrapping. The result is a Parametric Bootstrapping modified Weibull (PBW) forecasting approach. This methodology is applied to rail segments with insufficient numbers of defects to allow for appropriate defect forecasting analysis. Thus, the PBW method provides reasonable estimates of the rate of defects for track segments that have little or no prior defect history. This approach allows for more track to be analyzed and forecasts the probability of rail defect occurrence as a function of key parameters such as cumulative traffic over the rail. A validation of the proposed methodology was performed. Comparison of the output results of over 300,000 track segments with over 200,000 rail defects showed a major improvement in percentage of segments with reasonable Weibull parameters (alpha and beta). This percentage increased from 11% of segments using traditional Weibull analysis to 77% of segments using Parametric Bootstrap modified Weibull approach. These results show that the PBW Analysis approach introduced here offers a more accurate and effective approach to determining the probability of developing future rail defects. This provides a benefit to railroads in planning maintenance of their expensive rail assets.
- Published
- 2021
- Full Text
- View/download PDF
12. Forecasting COVID-19 Epidemic in Spain and Italy Using A Generalized Richards Model with Quantified Uncertainty
- Author
-
Agus Suryanto, Hasan S. Panigoro, Isnani Darti, and Hadi Susanto
- Subjects
Estimation ,Polymers and Plastics ,Coronavirus disease 2019 (COVID-19) ,Estimation theory ,Calibration (statistics) ,Poisson distribution ,Confidence interval ,symbols.namesake ,Statistics ,symbols ,Time series ,Parametric bootstrapping ,General Environmental Science ,Mathematics - Abstract
The Richards model and its generalized version are deterministic models that are often implemented to fit and forecast the cumulative number of infective cases in an epidemic outbreak. In this paper we employ a generalized Richards model to predict the cumulative number of COVID-19 cases in Spain and Italy, based on available epidemiological data. To quantify uncertainty in the parameter estimation, we use a parametric bootstrapping approach to construct a 95% confidence interval estimation for the parameter model. Here we assume that the time series data follow a Poisson distribution. It is found that the 95% confidence interval of each parameter becomes narrow with the increasing number of data. All in all, the model predicts daily new cases of COVID-19 reasonably well during calibration periods. However, the model fails to produce good forecasts when the amount of data used for parameter estimations is not sufficient. Based on our parameter estimates, it is found that the early stages of COVID-19 epidemic, both in Spain and in Italy, followed an almost exponentially growth. The epidemic peak in Spain and Italy is respectively on 2 April 2020 and 28 March 2020. The final sizes of cumulative number of COVID-19 cases in Spain and Italy are forecasted to be at 293220 and 237010, respectively.
- Published
- 2021
- Full Text
- View/download PDF
13. Post-stratified change estimation for large-area forest biomass using repeated ALS strip sampling.
- Author
-
Strîmbu, Victor Felix, Ene, Liviu Theodor, Gobakken, Terje, Gregoire, Timothy G., Astrup, Rasmus, and Næsset, Erik
- Subjects
- *
AIRBORNE lasers , *FOREST biomass , *FOREST surveys , *STATISTICAL bootstrapping , *STANDARD deviations - Abstract
Post-stratified model-assisted (MA) and hybrid (HY) estimators are used with repeated airborne laser scanning (ALS) strip sampling and national forest inventory field data for stratum-wise and overall estimation of aboveground biomass (AGB) stock and change. The study area covered the southern portion of the Hedmark County in Norway. Both MA and HY estimation substantially reduced the uncertainty in AGB change when compared with estimation using the field survey only. Relative efficiencies (relative variance) of 4.15 (MA) and 3.36 (HY) for overall estimates were found. The results suggest the MA estimator for single-time estimation and the HY as more appropriate for change estimation by cover class. With the HY estimator, a nested post-stratification scheme is demonstrated, combining cover classes with change classes, which enables detailed reporting for change according to cause within each cover class, and has the potential to improve the estimation precision. Finally, parametric bootstrapping is demonstrated as an empirical alternative to estimate the model-error component in the HY estimator. The model error estimated with parametric bootstrapping converged to the analytically determined value of the HY estimator within 1000 bootstrap samples. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
14. Detecting Signatures of Positive Selection against a Backdrop of Compensatory Processes
- Author
-
Westin M. Kosater, Peter B. Chi, and David A. Liberles
- Subjects
Leptin ,Primates ,0106 biological sciences ,Nonsynonymous substitution ,0303 health sciences ,Models, Statistical ,Protein Conformation ,Positive selection ,Statistical model ,Computational biology ,Biology ,010603 evolutionary biology ,01 natural sciences ,Evolution, Molecular ,03 medical and health sciences ,Genetics ,Animals ,Selection, Genetic ,Parametric bootstrapping ,Synonymous substitution ,Molecular Biology ,Silent Mutation ,Software ,Ecology, Evolution, Behavior and Systematics ,030304 developmental biology - Abstract
There are known limitations in methods of detecting positive selection. Common methods do not enable differentiation between positive selection and compensatory covariation, a major limitation. Further, the traditional method of calculating the ratio of nonsynonymous to synonymous substitutions (dN/dS) does not take into account the 3D structure of biomacromolecules nor differences between amino acids. It also does not account for saturation of synonymous mutations (dS) over long evolutionary time that renders codon-based methods ineffective for older divergences. This work aims to address these shortcomings for detecting positive selection through the development of a statistical model that examines clusters of substitutions in clusters of variable radii. Additionally, it uses a parametric bootstrapping approach to differentiate positive selection from compensatory processes. A previously reported case of positive selection in the leptin protein of primates was reexamined using this methodology.
- Published
- 2020
- Full Text
- View/download PDF
15. Thermal Performance Curves Are Shaped by Prior Thermal Environment in Early Life
- Author
-
Carla M. Sgrò, Keyne Monro, and Adriana P. Rebolledo
- Subjects
Physiology ,fungi ,complex life cycles ,One stage ,Climate change ,thermal sensitivity ,Life stage ,Early life ,carryover effects ,larval development ,climate change ,fertilization ,Ectotherm ,developmental plasticity ,Physiology (medical) ,Thermal ,QP1-981 ,Environmental science ,Performance curves ,embryogenesis ,Parametric bootstrapping ,Biological system ,Original Research - Abstract
Understanding links between thermal performance and environmental variation is necessary to predict organismal responses to climate change, and remains an ongoing challenge for ectotherms with complex life cycles. Distinct life stages can differ in thermal sensitivity, experience different environmental conditions as development unfolds, and, because stages are by nature interdependent, environmental effects can carry over from one stage to affect performance at others. Thermal performance may therefore respond to carryover effects of prior thermal environments, yet detailed insights into the nature, strength, and direction of those responses are still lacking. Here, in an aquatic ectotherm whose early planktonic stages (gametes, embryos, and larvae) govern adult abundances and dynamics, we explore the effects of prior thermal environments at fertilization and embryogenesis on thermal performance curves at the end of planktonic development. We factorially manipulate temperatures at fertilization and embryogenesis, then, for each combination of prior temperatures, measure thermal performance curves for survival of planktonic development (end of the larval stage) throughout the performance range. By combining generalized linear mixed modeling with parametric bootstrapping, we formally estimate and compare curve descriptors (thermal optima, limits, and breadth) among prior environments, and reveal carryover effects of temperature at embryogenesis, but not fertilization, on thermal optima at completion of development. Specifically, thermal optima shifted to track temperature during embryogenesis, while thermal limits and breadth remained unchanged. Our results argue that key aspects of thermal performance are shaped by prior thermal environment in early life, warranting further investigation of the possible mechanisms underpinning that response, and closer consideration of thermal carryover effects when predicting organismal responses to climate change.
- Published
- 2021
- Full Text
- View/download PDF
16. Parametric methods for confidence interval estimation of overlap coefficients.
- Author
-
Wang, Dan and Tian, Lili
- Subjects
- *
DISTRIBUTION (Probability theory) , *PARAMETER estimation , *GENERALIZABILITY theory , *STATISTICAL bootstrapping , *CONFIDENCE intervals - Abstract
Overlap coefficient ( O V L ), the proportion of overlap area between two probability distributions, is a direct measure of similarity between two distributions. It is useful in microarray analysis for the purpose of identifying differentially expressed biomarkers, especially when data follow multimodal distribution which cannot be transformed to normal. However, the inference methods about O V L are quite sparse. This article proposes two methods, a generalized inference ( G I ) approach and a parametric bootstrapping ( P B ) method, to construct confidence intervals of O V L under the assumption of normality. In conjunction with the E M algorithms, these methods are extended to mixture Gaussian ( M G ) distributions. The performances of these methods are evaluated empirically under a variety of distributions including normal, gamma and mixture Gaussian. At last, the proposed approaches are applied to a published microarray dataset from a gene expression study of three most prevalent adult lymphoid malignancies. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
17. Parametric Bootstrapping Predictive Estimator for Logistic Regression
- Author
-
Kunio Takezawa
- Subjects
Economics and Econometrics ,Maximum likelihood ,Statistics ,Materials Chemistry ,Media Technology ,Statistics::Methodology ,Log likelihood ,Estimator ,Forestry ,Parametric bootstrapping ,Logistic regression ,Statistics::Computation ,Mathematics - Abstract
This paper proposes a method for constructing a predictive estimator for logistic regression. We make a provisional assumption that the predictive estimator is given by multiplying the maximum likelihood estimators by constants, which are estimated using a parametric bootstrap method. The relative merits of the maximum likelihood estimator and the predictive estimator produced by this method are determined by cross-validation. The results show that the predictiveestimators derived by this method lead to a smaller deviance than that obtained by the maximum likelihood estimator in many instances.
- Published
- 2019
- Full Text
- View/download PDF
18. Is there a low-pay no-pay cycle in Australia? A note on Fok, Scutella and Wilkins (2015)
- Author
-
Lixin Cai
- Subjects
Statistics and Probability ,Economics and Econometrics ,food.ingredient ,Zero correlation ,05 social sciences ,Random effects model ,Mathematics (miscellaneous) ,Specification ,food ,Scutella ,0502 economics and business ,Econometrics ,050207 economics ,Parametric bootstrapping ,Social Sciences (miscellaneous) ,050205 econometrics ,Mathematics - Abstract
A recent study by Fok et al. (Oxf Bull Econ Stat 77:872–896, 2015) concludes that there is a low-pay no-pay cycle for males and females in the Australian labour market. This note re-estimates the model of that study using the same data. It is found that Fok et al. (2015) conclusion is based on a model specification that assumes zero correlation of unobserved heterogeneity between the different labour force states modelled. The results of this note show that when the zero correlation restriction is relaxed, there is no evidence of a low-pay no-pay cycle for either males or females. It is also found that the marginal effect estimates used in Fok et al. (2015) to draw the low-pay no-pay cycle conclusion for males and females have been estimated imprecisely. Furthermore, contrary to what Fok et al. (2015) have concluded, the results of this note show that there is no evidence on heterogeneity in the low-pay no-pay cycle across the demographic subgroups examined by Fok et al. (2015).
- Published
- 2019
- Full Text
- View/download PDF
19. Enhancing prioritisation of technical attributes in quality function deployment.
- Author
-
Iqbal, Zafar, Grigg, Nigel P., Govindaraju, K., and Campbell-Allen, Nicola Marie
- Subjects
QUALITY function deployment ,STATISTICAL hypothesis testing ,CONFIDENCE intervals ,CENTRAL limit theorem ,STATISTICAL sampling ,STATISTICAL bootstrapping - Abstract
Purpose ? Quality function deployment (QFD) is a planning methodology to improve products, services and their associated processes by ensuring that the voice of the customer has been effectively deployed through specified and prioritised technical attributes (TAs). The purpose of this paper is two ways: to enhance the prioritisation of TAs: computer simulation significance test; and computer simulation confidence interval. Both are based on permutation sampling, bootstrap sampling and parametric bootstrap sampling of given empirical data. Design/methodology/approach ? The authors present a theoretical case for the use permutation sampling, bootstrap sampling and parametric bootstrap sampling. Using a published case study the authors demonstrate how these can be applied on given empirical data to generate a theoretical population. From this the authors describe a procedure to decide upon which TAs have significantly different priority, and also estimate confidence intervals from the theoretical simulated populations. Findings ? First, the authors demonstrate not only parametric bootstrap is useful to simulate theoretical populations. The authors can also employ permutation sampling and bootstrap sampling to generate theoretical populations. Then the authors obtain the results from these three approaches. qThe authors describe why there is a difference in results of permutation sampling, bootstrap and parametric bootstrap sampling. Practitioners can employ any approach, it depends how much variation in FWs is required by quality assurance division. Originality/value ? Using these methods provides QFD practitioners with a robust and reliable method for determining which TAs should be selected for attention in product and service design. The explicit selection of TAs will help to achieve maximum customer satisfaction, and save time and money, which are the ultimate objectives of QFD. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
20. Ensemble bootstrap methodology for forecasting dynamic growth processes using differential equations: application to epidemic outbreaks
- Author
-
Gerardo Chowell and Ruiyan Luo
- Subjects
Differential equations ,Epidemiology ,Computer science ,Gompertz function ,Interval score ,Health Informatics ,Interval (mathematics) ,Machine learning ,computer.software_genre ,Disease Outbreaks ,03 medical and health sciences ,0302 clinical medicine ,Influenza, Human ,Humans ,030212 general & internal medicine ,Uncertainty quantification ,Generalized logistic growth model ,030304 developmental biology ,Parametric bootstrapping ,lcsh:R5-920 ,0303 health sciences ,Models, Statistical ,Ensemble forecasting ,business.industry ,SARS-CoV-2 ,Zika Virus Infection ,Probabilistic logic ,Prediction interval ,COVID-19 ,Gompertz model ,Variance (accounting) ,Hemorrhagic Fever, Ebola ,Model ensemble, parameter estimation, uncertainty quantification, phenomenological growth ,Trajectory ,Artificial intelligence ,Richards model ,lcsh:Medicine (General) ,business ,computer ,Research Article ,Forecasting - Abstract
BackgroundEnsemble modeling aims to boost the forecasting performance by systematically integrating the predictive accuracy across individual models. Here we introduce a simple-yet-powerful ensemble methodology for forecasting the trajectory of dynamic growth processes that are defined by a system of non-linear differential equations with applications to infectious disease spread.MethodsWe propose and assess the performance of two ensemble modeling schemes with different parametric bootstrapping procedures for trajectory forecasting and uncertainty quantification. Specifically, we conduct sequential probabilistic forecasts to evaluate their forecasting performance using simple dynamical growth models with good track records including the Richards model, the generalized-logistic growth model, and the Gompertz model. We first test and verify the functionality of the method using simulated data from phenomenological models and a mechanistic transmission model. Next, the performance of the method is demonstrated using a diversity of epidemic datasets including scenario outbreak data of theEbola Forecasting Challengeand real-world epidemic data outbreaks of including influenza, plague, Zika, and COVID-19.ResultsWe found that the ensemble method that randomly selects a model from the set of individual models for each time point of the trajectory of the epidemic frequently outcompeted the individual models as well as an alternative ensemble method based on the weighted combination of the individual models and yields broader and more realistic uncertainty bounds for the trajectory envelope, achieving not only better coverage rate of the 95% prediction interval but also improved mean interval scores across a diversity of epidemic datasets.ConclusionOur new methodology for ensemble forecasting outcompete component models and an alternative ensemble model that differ in how the variance is evaluated for the generation of the prediction intervals of the forecasts.
- Published
- 2021
21. Threshold-dependent sample sizes for selenium assessment with stream fish tissue.
- Author
-
Hitt, Nathaniel P and Smith, David R
- Subjects
WATER pollution measurement ,SELENIUM analysis ,FISH anatomy ,SAMPLE size (Statistics) ,NATURAL resources management - Abstract
ABSTRACT Natural resource managers are developing assessments of selenium (Se) contamination in freshwater ecosystems based on fish tissue concentrations. We evaluated the effects of sample size (i.e., number of fish per site) on the probability of correctly detecting mean whole-body Se values above a range of potential management thresholds. We modeled Se concentrations as gamma distributions with shape and scale parameters fitting an empirical mean-to-variance relationship in data from southwestern West Virginia, USA (63 collections, 382 individuals). We used parametric bootstrapping techniques to calculate statistical power as the probability of detecting true mean concentrations up to 3 mg Se/kg above management thresholds ranging from 4 to 8 mg Se/kg. Sample sizes required to achieve 80% power varied as a function of management thresholds and Type I error tolerance (α). Higher thresholds required more samples than lower thresholds because populations were more heterogeneous at higher mean Se levels. For instance, to assess a management threshold of 4 mg Se/kg, a sample of eight fish could detect an increase of approximately 1 mg Se/kg with 80% power (given α = 0.05), but this sample size would be unable to detect such an increase from a management threshold of 8 mg Se/kg with more than a coin-flip probability. Increasing α decreased sample size requirements to detect above-threshold mean Se concentrations with 80% power. For instance, at an α-level of 0.05, an 8-fish sample could detect an increase of approximately 2 units above a threshold of 8 mg Se/kg with 80% power, but when α was relaxed to 0.2, this sample size was more sensitive to increasing mean Se concentrations, allowing detection of an increase of approximately 1.2 units with equivalent power. Combining individuals into 2- and 4-fish composite samples for laboratory analysis did not decrease power because the reduced number of laboratory samples was compensated for by increased precision of composites for estimating mean conditions. However, low sample sizes (<5 fish) did not achieve 80% power to detect near-threshold values (i.e., <1 mg Se/kg) under any scenario we evaluated. This analysis can assist the sampling design and interpretation of Se assessments from fish tissue by accounting for natural variation in stream fish populations. Integr Environ Assess Manag 2015;11:143-149. Published 2014 SETAC [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF
22. Parametric Bootstrapping of Array Data with A Generative Adversarial Network
- Author
-
Herbert Groll, Peter Gerstoft, and Christoph F. Mecklenbrauker
- Subjects
Physics::Computational Physics ,Acoustic array ,Computer science ,020206 networking & telecommunications ,02 engineering and technology ,Condensed Matter::Materials Science ,020204 information systems ,Resampling ,Norm (mathematics) ,0202 electrical engineering, electronic engineering, information engineering ,Hellinger distance ,Parametric bootstrapping ,Generative adversarial network ,Algorithm ,Generative grammar ,Parametric statistics - Abstract
Since the number of independent array data snapshots is limited by the availability of real-world data, we propose a parametric bootstrap for resampling. The proposed parametric bootstrap is based on a generative adversarial network (GAN) following the generative approach to machine learning. For the GAN model we chose the Wasserstein GAN with penalized norm of gradient of the critic with respect to its input (wGAN gp). The approach is demonstrated with synthetic and real-world ocean acoustic array data.
- Published
- 2020
- Full Text
- View/download PDF
23. Comparison of Multivendor Single-Voxel MR Spectroscopy Data Acquired in Healthy Brain at 26 Sites
- Author
-
Hongmin Xu, Thomas Lange, Michal Považan, Peter B. Barker, Muhammad G. Saleh, Scott O. Murray, Koen Cuypers, Chencheng Zhang, Fuhua Yan, Lars Ersland, Ian Greenhouse, Martin Tegenthoff, Alayar Kangarlu, Kim M. Cecil, Yan Li, Pallab K. Bhattacharyya, Nigel Hoggard, Adam J. Woods, Feng Liu, Nicolaas A.J. Puts, Chien Yuan E. Lin, Helge J. Zöllner, Niall W. Duncan, Ruoyun Ma, Hans Jörg Wittsack, Vadim Zipunnikov, Michael Dacko, Guangbin Wang, Eric C. Porges, Michael-Paul Schallmo, R. Marc Lebel, Marta Moreno-Ortega, David Yen Ting Chen, Joanna R. Long, Megan A. Forbes, Kimberly L. Chan, Georg Oeltzschner, Richard A.E. Edden, Adam Berrington, Sean Noah, Maiken K. Brix, Napapon Sailasuta, Mark Mikkelsen, Stefanie Heba, Stephan P. Swinnen, David A. Edmondson, Diederick Stoffers, Naying He, Ralph Noeske, Jacobus F.A. Jansen, Fei Gao, Peter Truong, Michael D. Noseworthy, Pieter F. Buur, Alexander R. Craven, Jy Kang Liou, Tun Wei Hsu, Celine Maes, Gabriele Ende, James J. Prisciandaro, Nicholas Simard, Markus Sack, Ashley D. Harris, Timothy P.L. Roberts, Ulrike Dydak, Jiing Feng Lirng, Iain D. Wilkinson, Spinoza Centre for Neuroimaging, RS: MHeNs - R1 - Cognitive Neuropsychiatry and Clinical Neuroscience, Beeldvorming, and MUMC+: DA BV Klinisch Fysicus (9)
- Subjects
In vivo magnetic resonance spectroscopy ,Adult ,Male ,Magnetic Resonance Spectroscopy ,Single voxel ,SHORT-ECHO ,030218 nuclear medicine & medical imaging ,03 medical and health sciences ,Young Adult ,0302 clinical medicine ,Medicine ,Humans ,Radiology, Nuclear Medicine and imaging ,3 T ,Prospective Studies ,COMBINATION ,Original Research ,Science & Technology ,Extramural ,business.industry ,METABOLITE QUANTIFICATION ,Radiology, Nuclear Medicine & Medical Imaging ,Commerce ,Brain ,Mean age ,Data set ,Multicenter study ,030220 oncology & carcinogenesis ,RESONANCE SPECTROSCOPY ,Female ,Parametric bootstrapping ,Nuclear medicine ,business ,Life Sciences & Biomedicine - Abstract
Background The hardware and software differences between MR vendors and individual sites influence the quantification of MR spectroscopy data. An analysis of a large data set may help to better understand sources of the total variance in quantified metabolite levels. Purpose To compare multisite quantitative brain MR spectroscopy data acquired in healthy participants at 26 sites by using the vendor-supplied single-voxel point-resolved spectroscopy (PRESS) sequence. Materials and Methods An MR spectroscopy protocol to acquire short-echo-time PRESS data from the midparietal region of the brain was disseminated to 26 research sites operating 3.0-T MR scanners from three different vendors. In this prospective study, healthy participants were scanned between July 2016 and December 2017. Data were analyzed by using software with simulated basis sets customized for each vendor implementation. The proportion of total variance attributed to vendor-, site-, and participant-related effects was estimated by using a linear mixed-effects model. P values were derived through parametric bootstrapping of the linear mixed-effects models (denoted Pboot). Results In total, 296 participants (mean age, 26 years ± 4.6; 155 women and 141 men) were scanned. Good-quality data were recorded from all sites, as evidenced by a consistent linewidth of N-acetylaspartate (range, 4.4-5.0 Hz), signal-to-noise ratio (range, 174-289), and low Cramér-Rao lower bounds (≤5%) for all of the major metabolites. Among the major metabolites, no vendor effects were found for levels of myo-inositol (Pboot > .90), N-acetylaspartate and N-acetylaspartylglutamate (Pboot = .13), or glutamate and glutamine (Pboot = .11). Among the smaller resonances, no vendor effects were found for ascorbate (Pboot = .08), aspartate (Pboot > .90), glutathione (Pboot > .90), or lactate (Pboot = .28). Conclusion Multisite multivendor single-voxel MR spectroscopy studies performed at 3.0 T can yield results that are coherent across vendors, provided that vendor differences in pulse sequence implementation are accounted for in data analysis. However, the site-related effects on variability were more profound and suggest the need for further standardization of spectroscopic protocols. © RSNA, 2020 Online supplemental material is available for this article. ispartof: RADIOLOGY vol:295 issue:1 pages:171-180 ispartof: location:United States status: published
- Published
- 2020
24. An outlier-resistant indicator of anomalies among inter-laboratory comparison data with associated uncertainty
- Author
-
Stephen L. R. Ellison
- Subjects
FOS: Computer and information sciences ,Statistics::Theory ,Heteroscedasticity ,General Engineering ,62 Statiatics, 62P99, 62F35 ,Statistics - Applications ,01 natural sciences ,010309 optics ,0103 physical sciences ,Outlier ,Statistics ,Applications (stat.AP) ,Pairwise comparison ,Parametric bootstrapping ,Inter-laboratory ,010306 general physics ,Statistic ,Mathematics ,Quantile - Abstract
A new robust pairwise statistic, the pairwise median scaled difference (MSD), is proposed for the detection of anomalous location/uncertainty pairs in heteroscedastic interlaboratory study data with associated uncertainties. The distribution for the IID case is presented and approximate critical values for routine use are provided. The determination of observation-specific quantiles and p-values for heteroscedastic data, using parametric bootstrapping, is demonstrated by example. It is shown that the statistic has good power for detecting anomalies compared to a previous pairwise statistic, and offers much greater resistance to multiple outlying values.
- Published
- 2018
- Full Text
- View/download PDF
25. Assessing dynamic postural control during exergaming in older adults
- Author
-
V. Soancatl Aguilar, Jos B. T. M. Roerdink, Natasha M. Maurits, Claudine J. C. Lamoth, Scientific Visualization and Computer Graphics, SMART Movements (SMART), Movement Disorder (MD), Basic and Translational Research and Imaging Methodology Development in Groningen (BRIDGE), and Personalized Healthcare Technology (PHT)
- Subjects
Dynamic postural control ,Male ,medicine.medical_specialty ,Movement ,MODELS ,Biophysics ,Intervention effect ,Pilot Projects ,Assessment of dynamic postural control ,GAMES ,Unsupervised home exergaming ,Postural control ,03 medical and health sciences ,0302 clinical medicine ,Every other week ,Physical medicine and rehabilitation ,Curvature and speed of body movements ,medicine ,Humans ,Orthopedics and Sports Medicine ,Probabilistic models ,030212 general & internal medicine ,Postural Balance ,Balance (ability) ,Aged ,High probability ,Rehabilitation ,Probabilistic logic ,Healthy Volunteers ,Video Games ,Skating ,Female ,Parametric bootstrapping ,Psychology ,030217 neurology & neurosurgery - Abstract
Digital games controlled by body movements (exergames) have been proposed as a way to improve postural control among older adults. Exergames are meant to be played at home in an unsupervised way. However, only few studies have investigated the effect of unsupervised home-exergaming on postural control. Moreover, suitable methods to dynamically assess postural control during exergaming are still scarce. Dynamic postural control (DPC) assessment could be used to provide both meaningful feedback and automatic adjustment of exergame difficulty. These features could potentially foster unsupervised exergaming at home and improve the effectiveness of exergames as tools to improve balance control. The main aim of this study is to investigate the effect of six weeks of unsupervised home-exergaming on DPC as assessed by a recently developed probabilistic model. High probability values suggest ‘deteriorated’ postural control, whereas low probability values suggest ‘good’ postural control. In a pilot study, ten healthy older adults (average 77.9, SD 7.2 years) played an ice-skating exergame at home half an hour per day, three times a week during six weeks. The intervention effect on DPC was assessed using exergaming trials recorded by Kinect at baseline and every other week. Visualization of the results suggests that the probabilistic model is suitable for real-time DPC assessment. Moreover, linear mixed model analysis and parametric bootstrapping suggest a significant intervention effect on DPC. In conclusion, these results suggest that unsupervised exergaming for improving DPC among older adults is indeed feasible and that probabilistic models could be a new approach to assess DPC.
- Published
- 2018
- Full Text
- View/download PDF
26. Multilevel statistical models and the analysis of experimental data.
- Author
-
Behm, Jocelyn E., Edmonds, Devin A., Harmon, Jason P., and Ives, Anthony R.
- Subjects
- *
ECOLOGICAL experiments , *HETEROSCEDASTICITY , *RANIDAE , *STATISTICAL bootstrapping , *STATISTICAL models , *REGRESSION analysis - Abstract
Data sets from ecological experiments can be difficult to analyze, due to lack of independence of experimental units and complex variance structures. In addition, information of interest may lie in complicated contrasts among treatments, rather than direct output from statistical tests. Here, we present a statistical framework for analyzing data sets containing non-independent experimental units and differences in variance among treatments (hetero-scedasticity) and apply this framework to experimental data on interspecific competition among three tadpole species. Our framework involves three steps: (1) use a multilevel regression model to calculate coefficients of treatment effects on response variables; (2) combine coefficients to quantify the strength of competition (the target information of our experiment); and (3) use parametric bootstrapping to calculate significance of competition strengths. We repeated this framework using three multilevel regression models to analyze data at the level of individual tadpoles, at the replicate level, and at the replicate level accounting for heteroscedasticity. Comparing results shows the need to correctly specify the statistical model, with the model that accurately accounts for heteroscedasticity leading to different conclusions from the other two models. This approach gives a single, comprehensive analysis of experimental data that can be used to extract informative biological parameters in a statistically rigorous way. [ABSTRACT FROM AUTHOR]
- Published
- 2013
- Full Text
- View/download PDF
27. Species boundaries and cryptic lineage diversity in a Philippine forest skink complex (Reptilia; Squamata; Scincidae: Lygosominae)
- Author
-
Linkem, Charles W., Hesed, Kyle Miller, Diesmos, Arvin C., and Brown, Rafe M.
- Subjects
- *
SPECIES diversity , *BIOGEOGRAPHY , *SKINKS , *SPHENOMORPHUS , *MITOCHONDRIAL DNA , *MOLECULAR phylogeny , *ANIMAL classification , *BIOLOGICAL divergence , *ARCHIPELAGOES - Abstract
Abstract: In the megadiverse conservation hotspot of the Philippines, biodiversity is not uniformly distributed throughout the archipelago, but hierarchically partitioned into islands and island groups that were conjoined during the mid- to late-Pleistocene. Few species groups are widely distributed throughout the archipelago, but some exceptions exist, such as the common scincid lizards of the Sphenomorphus jagori complex (including S. jagori, S. coxi, and S. abdictus). Using mtDNA haplotype data we test biogeographic and taxonomic predictions in these abundant, large-bodied, forest floor lizards and arrive at conclusions that differ significantly from both past, and current, appraisals of species diversity. In contrast to expectations based on existing taxonomy (three species, each with two subspecies), we find evidence of at least eleven highly divergent species lineages diagnosed by haplotypic variation. Each lineage corresponds to a biogeographically circumscribed distribution (i.e., isolated islands or geological components of islands), suggesting lineage cohesion and allopatric differentiation. Parametric bootstrapping tests reject taxonomic and biogeographic hypotheses and suggest a complex pattern of unpredicted relationships. Only one of the former species (S. jagori) appears as a monophyletic entity (including four allopatric, highly divergent lineages that we suspect may represent evolutionary species), and the remaining species are paraphyletic, necessitating a comprehensive future taxonomic revision. The pattern of biogeographic provincialism and hidden cryptic species diversity detected here leads us to suspect that even the most common, presumably well-studied, and widespread species complexes in the Philippines are in need of thorough analysis with modern genetic and phylogenetic techniques. Such studies of speciation genetics in these common, widely distributed groups may lead to a better understanding of the genetic underpinnings of biodiversity, allow for an enhanced appreciation of the evolutionary history of this model island archipelago, and enable more informed conservation planning in a global biodiversity hotspot. [Copyright &y& Elsevier]
- Published
- 2010
- Full Text
- View/download PDF
28. The multi-clump finite mixture distribution and model selection.
- Author
-
Paul, Sudhir R., Banerjee, Tathagata, and Balasoorya, Uditha
- Subjects
STATISTICAL bootstrapping ,DIRICHLET forms ,FINITE model theory ,DISTRIBUTION (Probability theory) ,STATISTICAL sampling - Abstract
The article proposes a solution to determine the number of clumps through model selection of a multi-clump finite mixture model using the bootstrap likelihood ratio test compared with likelihood ratio test and score test. Based on the analysis, the likelihood ratio tests have shown the shortcomings of the traditional large sample procedures. In these situations, the score test does not exist, while the bootstrap ratio test produces an approximate correct values.
- Published
- 2010
- Full Text
- View/download PDF
29. A Model Fit Statistic for Generalized Partial Credit Model.
- Author
-
Tie Liang and Wells, Craig S.
- Subjects
- *
ITEM response theory , *ERROR rates , *LOGISTIC model (Demography) , *TEST design , *STATISTICAL sampling , *PSYCHOLOGICAL tests , *PSYCHOMETRICS - Abstract
Investigating the fit of a parametric model is an important part of the measurement process when implementing item response theory (IRT), but research examining it is limited. A general nonparametric approach for detecting model misfit, introduced by J. Douglas and A. S. Cohen (2001), has exhibited promising results for the twoparameter logistic model and Samejima s graded response model. This study extends this approach to test the fit of generalized partial credit model (GPCM). The empirical Type I error rate and power of the proposed method are assessed for various test lengths, sample sizes, and type of assessment. Overall, the proposed fit statistic performed well under the studied conditions in that the Type I error rate was not inflated and the power was acceptable, especially for moderate to large sample sizes. A further advantage of the nonparametric approach is that it provides a convenient graphical display of possible misfit. [ABSTRACT FROM AUTHOR]
- Published
- 2009
- Full Text
- View/download PDF
30. SAMPLING DISTRIBUTIONS OF CRITICAL ILLNESS INSURANCE PREMIUM RATES: BREAST AND OVARIAN CANCER.
- Author
-
Li Lu, MACDONALD, ANGUS, and WATERS, HOWARD
- Subjects
LONG-term care insurance ,HEALTH risk assessment ,BREAST cancer patients ,CATASTROPHIC illness ,GENETICS - Abstract
Evaluating the risk of disorders in long-term insurance often relies on rates of onset estimated from quite small epidemiological studies. These estimates can carry considerable uncertainty, hence so may functions of them, such as a premium rate. In the case of genetic disorders, where it may be required to demonstrate the reliability of genetic information as a risk factor, such uncertainty may be material. Epidemiological studies publish their results in a variety of forms and it is rarely easy to estimate the sampling distribution of a premium rate without access to the original data. We found a large study of breast and ovarian cancer that cited relative risks of breast and ovarian cancer onset, with confidence intervals, in 10-year age groups. We obtained critical illness premium rates and their sampling distributions by parametric bootstrapping, and investigated the effect of possible patterns of sampling correlations. We found that this study provides ample statistical evidence that known BRCA 1 or BRCA2 mutations, or a typical family history of breast or ovarian cancer, are reliable risk factors, but the sampling covariances of the relative risks could be important at some ages and terms. Studies that cite only standard errors of parameter estimates erect a small but awkward barrier between the models they describe, and some important actuarial questions. [ABSTRACT FROM AUTHOR]
- Published
- 2008
- Full Text
- View/download PDF
31. A goodness-of-fit test for overdispersed binomial (or multinomial) models
- Author
-
Sutradhar, Santosh C., Neerchal, Nagaraj K., and Morel, Jorge G.
- Subjects
- *
BINOMIAL distribution , *VARIANCES , *GOODNESS-of-fit tests , *CHI-squared test , *STATISTICS - Abstract
Abstract: Overdispersion or extra variation is a common phenomenon that occurs when binomial (multinomial) data exhibit larger variances than that permitted by the binomial (multinomial) model. This arises when the data are clustered or when the assumption of independence is violated. Goodness-of-fit (GOF) tests available in the overdispersion literature have focused on testing for the presence of overdispersion in the data and hence they are not applicable for choosing between the several competing overdispersion models. In this paper, we consider a GOF test proposed by Neerchal and Morel [1998. Large cluster results for two parametric multinomial extra variation models. J. Amer. Statist. Assoc. 93(443), 1078–1087], and study its distributional properties and performance characteristics. This statistic is a direct analogue of the usual Pearson chi-squared statistic, but is also applicable when the clusters are not necessarily of the same size. As this test statistic is for testing model adequacy against the alternative that the model is not adequate, it is applicable in testing two competing overdispersion models. [Copyright &y& Elsevier]
- Published
- 2008
- Full Text
- View/download PDF
32. Ribosomal RNA genes and deuterostome phylogeny revisited: More cyclostomes, elasmobranchs, reptiles, and a brittle star
- Author
-
Mallatt, Jon and Winchell, Christopher J.
- Subjects
- *
NUCLEIC acids , *HAGFISHES , *BIOLOGICAL evolution , *HYPOTHESIS - Abstract
Abstract: This is an expanded study of the relationships among the deuterostome animals based on combined, nearly complete 28S and 18S rRNA genes (>3925 nt.). It adds sequences from 20 more taxa to the ∼45 sequences used in past studies. Seven of the new taxa were sequenced here (brittle star Ophiomyxa, lizard Anolis, turtle Chrysemys, sixgill shark Hexanchus, electric ray Narcine, Southern Hemisphere lamprey Geotria, and Atlantic hagfish Myxine for 28S), and the other 13 were from GenBank and the literature (from a chicken, dog, rat, human, three lungfishes, and several ray-finned fishes, or Actinopterygii). As before, our alignments were based on secondary structure but did not account for base pairing in the stems of rRNA. The new findings, derived from likelihood-based tree-reconstruction methods and by testing hypotheses with parametric bootstrapping, include: (1) brittle star joins with sea star in the echinoderm clade, Asterozoa; (2) with two hagfishes and two lampreys now available, the cyclostome (jawless) fishes remain monophyletic; (3) Hexanchiform sharks are monophyletic, as Hexanchus groups with the frilled shark, Chlamydoselachus; (4) turtle is the sister taxon of all other amniotes; (5) bird is closer to the lizard than to the mammals; (6) the bichir Polypterus is in a monophyletic Actinopterygii; (7) Zebrafish Danio is the sister taxon of the other two teleosts we examined (trout and perch); (8) the South American and African lungfishes group together to the exclusion of the Australian lungfish. Other findings either upheld those of the previous rRNA-based studies (e.g., echinoderms and hemichordates group as Ambulacraria; orbitostylic sharks; batoids are not derived from any living lineage of sharks) or were obvious (monophyly of mammals, gnathostomes, vertebrates, echinoderms, etc.). Despite all these findings, the rRNA data still fail to resolve the relations among the major groups of deuterostomes (tunicates, Ambulacraria, cephalochordates and vertebrates) and of gnathostomes (chondrichthyans, lungfishes, coelacanth, actinopterygians, amphibians, and amniotes), partly because tunicates and lungfishes are rogue taxa that disrupt the tree. Nonetheless, parametric bootstrapping showed our RNA-gene data are only consistent with these dominant hypotheses: (1) deuterostomes consist of Ambulacraria plus Chordata, with Chordata consisting of tunicates and ‘vertebrates plus cephalochordates’; and (2) lungfishes are the closest living relatives of tetrapods. [Copyright &y& Elsevier]
- Published
- 2007
- Full Text
- View/download PDF
33. ASSESSING SYSTEMATIC ERROR IN THE INFERENCE OF SEED PLANT PHYLOGENY.
- Author
-
Burleigh, J. Gordon and Mathews, Sarah
- Subjects
- *
PLANT phylogeny , *PHANEROGAMS , *PLANT evolution , *BIOLOGICAL variation , *BOTANY , *DNA , *GNETALES , *GYMNOSPERMS , *HETEROGENEITY - Abstract
We used parametric bootstrapping to assess the performance of maximum parsimony and maximum likelihood phylogenetic analyses of a 12-locus seed plant data set. Evidence of biases in maximum parsimony analyses of single-locus data sets may explain some of the locus-specific variation among DNA-based hypotheses of seed plant phylogeny. In particular, there is strong evidence of bias in maximum parsimony analyses, especially of plastid loci, that favors placing Gnetales sister to other seed plants. We concatenated simulated single-locus data sets to examine biases in analyses of a 12-locus data set in which each locus is simulated with different substitution parameters and branch lengths. Maximum parsimony analyses of the simulated 12-locus data set also show evidence of biases in favor of recovering trees with Gnetales sister to other seed plants and against recovering anthophyte, gnepine, and gnetifer trees. These biases are most evident in analyses that include the fastest-evolving characters. In the maximum likelihood analyses of the simulated 12-locus data sets, there is evidence of a bias against recovering the anthophyte hypothesis. Otherwise, there is little evidence that the heterogeneous branch lengths and substitution processes among loci influence the results from maximum likelihood phylogenetic analyses. [ABSTRACT FROM AUTHOR]
- Published
- 2007
- Full Text
- View/download PDF
34. Further use of nearly complete 28S and 18S rRNA genes to classify Ecdysozoa: 37 more arthropods and a kinorhynch
- Author
-
Mallatt, Jon and Giribet, Gonzalo
- Subjects
- *
ARTHROPODA , *HEREDITY , *CRUSTACEA , *BIOLOGICAL evolution - Abstract
Abstract: This work expands on a study from 2004 by Mallatt, Garey, and Shultz [Mallatt, J.M., Garey, J.R., Shultz, J.W., 2004. Ecdysozoan phylogeny and Bayesian inference: first use of nearly complete 28S and 18S rRNA gene sequences to classify the arthropods and their kin. Mol. Phylogenet. Evol. 31, 178–191] that evaluated the phylogenetic relationships in Ecdysozoa (molting animals), especially arthropods. Here, the number of rRNA gene-sequences was effectively doubled for each major group of arthropods, and sequences from the phylum Kinorhyncha (mud dragons) were also included, bringing the number of ecdysozoan taxa to over 80. The methods emphasized maximum likelihood, Bayesian inference and statistical testing with parametric bootstrapping, but also included parsimony and minimum evolution. Prominent findings from our combined analysis of both genes are as follows. The fundamental subdivisions of Hexapoda (insects and relatives) are Insecta and Entognatha, with the latter consisting of collembolans (springtails) and a clade of proturans plus diplurans. Our rRNA-gene data provide the strongest evidence to date that the sister group of Hexapoda is Branchiopoda (fairy shrimps, tadpole shrimps, etc.), not Malacostraca. The large, Pancrustacea clade (hexapods within a paraphyletic Crustacea) divided into a few basic subclades: hexapods plus branchiopods; cirripedes (barnacles) plus malacostracans (lobsters, crabs, true shrimps, isopods, etc.); and the basally located clades of (a) ostracods (seed shrimps) and (b) branchiurans (fish lice) plus the bizarre pentastomids (tongue worms). These findings about Pancrustacea agree with a recent study by Regier, Shultz, and Kambic that used entirely different genes [Regier, J.C., Shultz, J.W., Kambic, R.E., 2005a. Pancrustacean phylogeny: hexapods are terrestrial crustaceans and maxillopods are not monophyletic. Proc. R. Soc. B 272, 395–401]. In Malacostraca, the stomatopod (mantis shrimp) was not at the base of the eumalacostracans, as is widely claimed, but grouped instead with an euphausiacean (krill). Within centipedes, Craterostigmus was the sister to all other pleurostigmophorans, contrary to the consensus view. Our trees also united myriapods (millipedes and centipedes) with chelicerates (horseshoe crabs, spiders, scorpions, and relatives) and united pycnogonids (sea spiders) with chelicerates, but with much less support than in the previous rRNA-gene study. Finally, kinorhynchs joined priapulans (penis worms) at the base of Ecdysozoa. [Copyright &y& Elsevier]
- Published
- 2006
- Full Text
- View/download PDF
35. Post-stratified change estimation for large-area forest biomass using repeated ALS strip sampling
- Author
-
Terje Gobakken, Erik Næsset, Victor Felix Strimbu, Rasmus Astrup, Liviu Theodor Ene, and Timothy G. Gregoire
- Subjects
Global and Planetary Change ,010504 meteorology & atmospheric sciences ,Ecology ,Field data ,National forest inventory ,Relative standard deviation ,0211 other engineering and technologies ,Estimator ,Forestry ,02 engineering and technology ,Field survey ,01 natural sciences ,Statistics ,Environmental science ,Parametric bootstrapping ,Aboveground biomass ,021101 geological & geomatics engineering ,0105 earth and related environmental sciences - Abstract
Post-stratified model-assisted (MA) and hybrid (HY) estimators are used with repeated airborne laser scanning (ALS) strip sampling and national forest inventory field data for stratum-wise and overall estimation of aboveground biomass (AGB) stock and change. The study area covered the southern portion of the Hedmark County in Norway. Both MA and HY estimation substantially reduced the uncertainty in AGB change when compared with estimation using the field survey only. Relative efficiencies (relative variance) of 4.15 (MA) and 3.36 (HY) for overall estimates were found. The results suggest the MA estimator for single-time estimation and the HY as more appropriate for change estimation by cover class. With the HY estimator, a nested post-stratification scheme is demonstrated, combining cover classes with change classes, which enables detailed reporting for change according to cause within each cover class, and has the potential to improve the estimation precision. Finally, parametric bootstrapping is demonstrated as an empirical alternative to estimate the model-error component in the HY estimator. The model error estimated with parametric bootstrapping converged to the analytically determined value of the HY estimator within 1000 bootstrap samples.
- Published
- 2017
- Full Text
- View/download PDF
36. THE HISTORY OF A NEARCTIC COLONIZATION: MOLECULAR PHYLOGENETICS AND BIOGEOGRAPHY OF THE NEARCTIC TOADS (BUFO).
- Author
-
Pauly, Gregory B., Hillis, David M., and Cannatella, David C.
- Subjects
- *
PHYLOGENY , *BUFONIDAE , *TOADS , *COLONIES (Biology) , *BIOGEOGRAPHY , *POPULATION biology - Abstract
Previous hypotheses of phylogenetic relationships among Nearctic toads (Bufonidae) and their congeners suggest contradictory biogeographic histories. These hypotheses argue that the Nearctic Bufo are: (1) a polyphyletic assemblage resulting from multiple colonizations from Africa; (2) a paraphyletic assemblage resulting from a single colonization event from South America with subsequent dispersal into Eurasia; or (3) a monophyletic group derived from the Neotropics. We obtained approximately 2.5 kb of mitochondrial DNA sequence data for the 12S, 16S, and intervening valine tRNA gene from 82 individuals representing 56 species and used parametric bootstrapping to test hypotheses of the biogeographic history of the Nearctic Bufo. We find that the Nearctic species of Bufo are monophyletic and nested within a large clade of New World Bufo to the exclusion of Eurasian and African taxa. This suggests that Nearctic Bufo result from a single colonization from the Neotropics. More generally, we demonstrate the utility of parametric bootstrapping for testing alternative biogeographic hypotheses. Through parametric bootstrapping, we refute several previously published biogeographic hypotheses regarding Bufo. These previous studies may have been influenced by homoplasy in osteological characters. Given the Neotropical origin for Nearctic Bufo, we examine current distributional patterns to assess whether the Nearctic-Neotropical boundary is a broad transition zone or a narrow boundary. We also survey fossil and paleogeographic evidence to examine potential Tertiary and Cretaceous dispersal routes, including the Paleocene Isthmian Link, the Antillean and Aves Ridges, and the current Central American Land Bridge, that may have allowed colonization of the Nearctic. [ABSTRACT FROM AUTHOR]
- Published
- 2004
- Full Text
- View/download PDF
37. Novel relationships among hyloid frogs inferred from 12S and 16S mitochondrial DNA sequences
- Author
-
Darst, Catherine R. and Cannatella, David C.
- Subjects
- *
MITOCHONDRIAL DNA , *GLASS frogs (Amphibians) , *PSEUDIDAE , *HYLIDAE - Abstract
Advanced frogs (Neobatrachia) are usually divided into two taxa, Ranoidea (the firmisternal frogs) and Hyloidea (all other neobatrachians). We investigated phylogenetic relationships among several groups of Hyloidea using 12S and 16S rRNA mitochondrial gene sequences and tested explicit relationships of certain problematic hyloid taxa using a sample of 93 neobatrachians. Parsimony, maximum likelihood, and Bayesian inference methods suggest that both the Ranoidea and Hyloidea are well-supported monophyletic groups. We reject three hypotheses using parametric bootstrap simulation: (1) Dendrobatidae lies within the Ranoidea; (2) The group containing Hylidae, Pseudidae, and Centrolenidae is monophyletic; and (3) Brachycephalus is part of Bufonidae. [Copyright &y& Elsevier]
- Published
- 2004
- Full Text
- View/download PDF
38. FLORAL DEVELOPMENT AND MOLECULAR PHYLOGENY SUPPORT THE GENERIC STATUS OF TASMANNIA (WINTERACEAE).
- Author
-
Doust, Andrew N. and Drinna, Andrew N.
- Subjects
- *
WINTERACEAE , *MAGNOLIALES , *DICOTYLEDONS , *DRIMYS winteri , *MOLECULAR phylogeny , *MOLECULAR biology - Abstract
The taxonomic status of and evolutionary relationship between Tasmannia and Drimys (Winteraceae) have been subjects of controversy for many years. In this paper, a molecular phylogenetic analysis of the family with sequences of previously unpublished Tasmannia and Drimys species confirms earlier conclusions that Tasmannia and Drimys do not form a monophyletic group, despite the fact that they appear to share distinctive inflorescence and floral morphological attributes. Examination of alternative hypotheses of relationships with likelihood-ratio tests and parametric bootstrapping supports the separation of Tasmannia and Drimys. A detailed analysis of floral development in Tasmannia lanceolata and T. xerophila indicates that timing and position of sepal initiation differs between them, but that the position of subsequent organ initiation predictably follows from sepal position. This is in contrast to Drimys winteri, where a prolonged delay between sepal and petal initiation leads to the production of many phyllotactic patterns. The prolonged period of calyx tube growth leading to the formation of a calyptra in Tasmannia and Drimys probably evolved in parallel in the two lineages. [ABSTRACT FROM AUTHOR]
- Published
- 2004
- Full Text
- View/download PDF
39. Thermal performance curves reveal shifts in optima, limits, and breadth in early life
- Author
-
Keyne Monro, Carla M. Sgrò, and Adriana P. Rebolledo
- Subjects
0106 biological sciences ,Physiology ,Climate Change ,Climate change ,Aquatic Science ,Biology ,010603 evolutionary biology ,01 natural sciences ,03 medical and health sciences ,Statistics ,Thermal ,Animals ,External fertilization ,Molecular Biology ,Ecology, Evolution, Behavior and Systematics ,030304 developmental biology ,0303 health sciences ,Temperature ,Life stage ,Early life ,Larva ,Insect Science ,Ectotherm ,Animal Science and Zoology ,Performance curves ,Parametric bootstrapping - Abstract
Understanding thermal performance at life stages that limit persistence is necessary to predict responses to climate change, especially for ectotherms whose fitness (survival and reproduction) depends on environmental temperature. Ectotherms often undergo stage-specific changes in size, complexity, and duration that are predicted to modify thermal performance. Yet performance is mostly explored for adults, while performance at earlier stages that typically limit persistence remains poorly understood. Here, we experimentally isolate thermal performance curves at fertilization, embryo development, and larval development in an aquatic ectotherm whose early planktonic stages (gametes, embryos, and larvae) govern adult abundances and dynamics. Unlike previous studies based on short-term exposures, responses with unclear links to fitness, or proxies in lieu of explicit curve descriptors (thermal optima, limits, and breadth), we measure performance as successful completion of each stage after exposure throughout, and at temperatures that explicitly capture curve descriptors at all stages. Formal comparisons of descriptors using a combination of generalized linear mixed modelling and parametric bootstrapping reveal important differences among life stages. Thermal performance differs significantly from fertilization to embryo development (with thermal optimum declining by ∼2 °C, thermal limits shifting inwards by ∼8–10 °C, and thermal breadth narrowing by ∼10 °C), while performance declines independently of temperature thereafter. Our comparisons show that thermal performance at one life stage can misrepresent performance at others, and point to gains in complexity during embryogenesis, rather than subsequent gains in size or duration of exposure, as a key driver of thermal sensitivity in early life.
- Published
- 2020
- Full Text
- View/download PDF
40. Tutorial overview of simple, stratified, and parametric bootstrapping
- Author
-
P. M. Shankar
- Subjects
chi square tests ,Computer science ,business.industry ,Bootstrapping (linguistics) ,Machine learning ,computer.software_genre ,lcsh:QA75.5-76.95 ,receiver operating characteristics ,tutorial ,resampling ,Simple (abstract algebra) ,lcsh:TA1-2040 ,Resampling ,ComputingMilieux_COMPUTERSANDEDUCATION ,bootstrapping ,Artificial intelligence ,lcsh:Electronic computers. Computer science ,Parametric bootstrapping ,business ,lcsh:Engineering (General). Civil engineering (General) ,computer - Abstract
Students pursuing baccalaureate degrees in electrical engineering and computer engineering are required to take a course in probability and statistics. While the course continues to be mostly conceptual, author started initiatives to introduce data analytics in this course with special emphasis on machine vision applications. Topics such as receiver operating characteristics curves and hypothesis testing are covered through examples and exercises with students having individual datasets. Continuing with this theme, bootstrapping and associated methodologies have now been introduced to facilitate interpretation of machine vision experiments. A demo created that illustrates simple, stratified, and parametric bootstrapping as a means to understand the statistics of a machine vision sensor is presented. It encompasses a number of conceptual topics such as random variables, densities, parameter estimation, chi square testing, etc. alongside data analytics offering a holistic picture of machine learning and machine vision to the undergraduate students.
- Published
- 2020
41. Meta-Analysis of Vaccine Effectiveness Studies
- Author
-
Jozef Nauta
- Subjects
Simple (abstract algebra) ,Meta-analysis ,Statistics ,Log-normal distribution ,Interval (mathematics) ,Parametric bootstrapping ,Regression ,Confidence interval ,Statistical hypothesis testing ,Mathematics - Abstract
This chapter explores the meta-analysis of vaccine effectiveness studies. These meta-analyses tend to be non-comparative, although a simple statistical test to compare vaccine effectiveness estimates exists, as well as a powerful regression method known as meta-regression. It is demonstrated how these comparative analyses, as well as non-comparative analyses according to, for example, the DerSimonian–Laird method, can be conducted using simple SAS codes. It is shown that analytical confidence intervals for the difference of two vaccine effectiveness estimates do not exist, and that the problem of finding such a confidence interval comes down to finding a confidence interval for the difference of the medians of two lognormal distributions, and that such an interval can be obtained by means of parametric bootstrapping.
- Published
- 2020
- Full Text
- View/download PDF
42. Systematics of the Lizard Family Pygopodidae with Implications for the Diversification of Australian Temperate Biotas.
- Author
-
Jennings, W. Bryan, Pianka, Eric R., and Donnellan, Stephen
- Subjects
- *
PYGOPODIDAE , *LIZARDS - Abstract
We conducted a phylogenetic study of pygopodid lizards, a group of 38 species endemic to Australia and New Guinea, with two major goals: to reconstruct a taxonomically complete and robustly supported phylogeny for the group and to use this information to gain insights into the tempo, mode, and timing of the pygopodid radiation. Phylogenetic analyses of mitochondrial DNA (mtDNA), nuclear DNA (nDNA), and previously published morphological data using parsimony, maximum likelihood, and Bayesian methods on the independent and combined three data sets yielded trees with similar and largely stable ingroup topologies. However, relationships among the six most inclusive and unambiguously supported clades ( Aprasia , Delma , Lialis , Ophidiocephalus , Pletholax , and Pygopus ) varied depending on data set analyzed. We used parametric bootstrapping to help us understand which of the three-branch schemes linking these six taxa was most plausible given our data. We conclude based on our results that the arrangement (((( Delma , Lialis ) Pygopus ) Pletholax )( Aprasia , Ophidiocephalus )) represents the best hypothesis of intergeneric relationships. A second major problem to arise in our study concerned the inability of our two outgroup taxa ( Diplodactylus ) to root trees properly; three different rooting locations were suggested depending upon analysis. This long-branch attraction problem was so severe that the outgroup branch also interfered with estimation of ingroup relationships. We therefore used the molecular clock method to root the pygopodid tree. Results of two independent molecular clock analyses (mtDNA and nDNA) converged upon the same root location (branch leading to Delma ). We are confident that we have found the correct root because the possibility of our clock estimates agreeing by chance alone is remote given that there are 65 possible root locations (branches) on the pygopodid tree (∼1 in 65 odds). Our analysis also indicated that Delma fraseri is not monophyletic, a result supported by a parametric bootstrapping test. We elevated the Western Australian race, Delma f. petersoni , to species status (i.e., Delma petersoni ) because hybridization and incomplete lineage sorting could be ruled out as potential causes of this paraphyletic gene tree and because D. grayii is broadly sympatric with its sister species D. fraseri . Climate changes over the past 23 million years, which transformed Australia from a wet, green continent to one that is largely dry and brown, have been suspected as playing a major role in the diversification of Australia's temperate biotas. Our phylogenetic analyses of pygopodid speciation and biogeography revealed four important findings consistent with this climate change diversification model: (1) our fossil-calibrated phylogeny shows that although some extant pygopodid lineages predate the onset of aridification, 28 of 33 pygopodid species included in our study seem to have originated in the last 23 million years; (2) relative cladogenesis tests suggest that several major clades underwent higher than expected rates of speciation; (3) our findings support earlier studies showing that speciation of mesic-adapted biotas in the southeastern and southwestern corners of Australia largely occurred within each of these regions between 12 and 23 million years ago as opposed to repeated dispersal between these regions; and (4) we have identified for the first time the existence of several pairs of sympatric sister species of lizards living in arid and semiarid ecosystems. These sympatric sister species seem to be younger than allopatric or parapatric sister-species pairs, which is not consistent with previous beliefs. [ABSTRACT FROM AUTHOR]
- Published
- 2003
- Full Text
- View/download PDF
43. Invasion Genetics of New World Medflies: Testing Alternative Colonization Scenarios.
- Author
-
Bohonak, Andrew, Davies, Neil, Villablanca, Francis, and Roderick, George
- Abstract
The Mediterranean fruit fly ( Ceratitis capitata) is an invasive agricultural pest with a wide host range and a nearly global distribution. Efforts to forgo the medfly's spread into the United States are dependent on an understanding of population dynamics in newly established populations elsewhere. To explore the potential influence of demographic and historical parameters in six medfly populations distributed from Mexico to Peru, we created population genetic null models using Monte Carlo simulations. Null expectations for genetic differentiation ( F
ST ) were compared with actual sequence variation from four highly polymorphic nuclear loci. Four colonization scenarios that were modeled led to unique genetic signatures that could be used to interpret empirical data. Unless current gene flow across Latin America was assumed to be very high, we could reject colonizations consisting of multiple introductions, each of low genetic diversity. Further, if simulated populations were small ( Ne = 5 × 102 individuals per population), small invasions from a single source consistently produced FST values comparable to those currently observed in Latin America. In contrast, only large invasions from diverse sources were compatible with the observed data for large populations ( Ne ≥ 5 × 103 ). This study demonstrates that alternative population genetic hypotheses can be tested empirically even when departures from equilibrium are extreme, and that population genetic theory can be used to explore the processes that underlie biological invasions. [ABSTRACT FROM AUTHOR]- Published
- 2001
- Full Text
- View/download PDF
44. A novel relative entropy--posterior predictive model checking approach with limited information statistics for latent trait models in sparse 2k contingency tables.
- Author
-
Huiping Wu, Ka-Veng Yuen, and Shing-On Leung
- Subjects
- *
ENTROPY (Information theory) , *PREDICTION models , *GOODNESS-of-fit tests , *STATISTICAL bootstrapping , *INFORMATION theory , *PARAMETER estimation , *DATA analysis - Abstract
Limited information statistics have been recommended as the goodness-of-fit measures in sparse 2k contingency tables, but the p-values of these test statistics are computationally difficult to obtain. A Bayesian model diagnostic tool, Relative Entropy--Posterior Predictive Model Checking (RE--PPMC), is proposed to assess the global fit for latent trait models in this paper. This approach utilizes the relative entropy (RE) to resolve possible problems in the original PPMC procedure based on the posterior predictive p-value (PPP-value). Compared with the typical conservatism of PPP-value, the RE value measures the discrepancy effectively. Simulated and real data sets with different item numbers, degree of sparseness, sample sizes, and factor dimensions are studied to investigate the performance of the proposed method. The estimates of univariate information and difficulty parameters are found to be robust with dual characteristics, which produce practical implications for educational testing. Compared with parametric bootstrapping, RE--PPMC is much more capable of evaluating the model adequacy. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
45. Estimates of regional annual abundance and population growth rates of white sharks off central California
- Author
-
Barbara A. Block, Taylor K. Chapple, Timothy D. White, Salvador J. Jorgensen, Paul E. Kanive, Scot D. Anderson, and Jay J. Rotella
- Subjects
0106 biological sciences ,education.field_of_study ,Empirical data ,010604 marine biology & hydrobiology ,Population ,Biology ,010603 evolutionary biology ,01 natural sciences ,White (mutation) ,Abundance (ecology) ,Population growth ,Point estimation ,West coast ,Parametric bootstrapping ,education ,Ecology, Evolution, Behavior and Systematics ,Nature and Landscape Conservation ,Demography - Abstract
Determining population trends is critical for evaluating management actions and prioritizing species protections. In this study, we used empirical data to produce an estimate of the population trend for sub-adult and adult white sharks in central California. We used the unique dorsal fin morphology to build a mark-recapture data set in a modified Jolly-Seber model (POPAN formulation) to estimate annual abundance and then investigate population growth rates using parametric bootstrapping methods for sub-adult and adult sharks (males and females). For all demographic groups combined, we found equivocal evidence for a positive regional population growth (λ = 1.07 (95% CI = 0.91 to 1.23)). However, sex- and size-specific population growth rate estimates provided some evidence of population increases for reproductively mature males (λ = 1.06 (95% CI = 0.99 to 1.13)) and females (λ = 1.06 (95% CI = 0.95 to 1.17)). For sub-adult male and female white sharks, point estimates of λ were positive but uncertainty prevents strong inference (λ = 1.07 (95% CI = 0.85 to 1.29)) and (λ = 1.08 (95% CI = 0.88 to 1.28)), respectively. Our findings of a potential increase in reproductive-aged white sharks in central California may be a result of regional fluxes in density or attributed in part to current protection efforts and subsequent increase in abundance of pinnipeds as well as reduced gill-net fisheries mortality of juveniles. A trend estimate for the entire northeastern Pacific will require obtaining similar data across known aggregation areas along the west coast of North America.
- Published
- 2021
- Full Text
- View/download PDF
46. Density Forecasts With Midas Models
- Author
-
Knut Are Aastveit, Francesco Ravazzolo, and Claudia Foroni
- Subjects
Economics and Econometrics ,Series (mathematics) ,Nowcasting ,05 social sciences ,Monte Carlo method ,Sampling (statistics) ,Variable (computer science) ,Autoregressive model ,0502 economics and business ,Econometrics ,050207 economics ,Parametric bootstrapping ,Social Sciences (miscellaneous) ,050205 econometrics ,Mathematics ,Mixed-data sampling - Abstract
In this paper we derive a general parametric bootstrapping approach to compute density forecasts for various types of mixed-data sampling (MIDAS) regressions. We consider both classical and unrestricted MIDAS regressions with and without an autoregressive component. First, we compare the forecasting performance of the dierent MIDAS models in Monte Carlo simulation experiments. We nd that the results in terms of point and density forecasts are coherent. Moreover, the results do not clearly indicate a superior performance of one of the models under scrutiny when the persistence of the low frequency variable is low. Some dierences are instead more evident when the persistence is high, for which the ARMIDAS and the AR-U-MIDAS produce better forecasts. Second, in an empirical exercise we evaluate density forecasts for quarterly US output growth, exploiting information from typical monthly series. We nd that MIDAS models provide accurate and timely density forecasts.
- Published
- 2016
- Full Text
- View/download PDF
47. Evaluating significance in linear mixed-effects models in R
- Author
-
Steven G. Luke
- Subjects
Restricted maximum likelihood ,Statistics as Topic ,05 social sciences ,Degrees of freedom (statistics) ,Experimental data ,Experimental and Cognitive Psychology ,050105 experimental psychology ,03 medical and health sciences ,0302 clinical medicine ,Distribution (mathematics) ,Arts and Humanities (miscellaneous) ,Sample size determination ,Sample Size ,Statistics ,Linear Models ,Developmental and Educational Psychology ,Mixed effects ,Humans ,0501 psychology and cognitive sciences ,Psychology (miscellaneous) ,Parametric bootstrapping ,030217 neurology & neurosurgery ,General Psychology ,Type I and type II errors ,Mathematics - Abstract
Mixed-effects models are being used ever more frequently in the analysis of experimental data. However, in the lme4 package in R the standards for evaluating significance of fixed effects in these models (i.e., obtaining p-values) are somewhat vague. There are good reasons for this, but as researchers who are using these models are required in many cases to report p-values, some method for evaluating the significance of the model output is needed. This paper reports the results of simulations showing that the two most common methods for evaluating significance, using likelihood ratio tests and applying the z distribution to the Wald t values from the model output (t-as-z), are somewhat anti-conservative, especially for smaller sample sizes. Other methods for evaluating significance, including parametric bootstrapping and the Kenward-Roger and Satterthwaite approximations for degrees of freedom, were also evaluated. The results of these simulations suggest that Type 1 error rates are closest to .05 when models are fitted using REML and p-values are derived using the Kenward-Roger or Satterthwaite approximations, as these approximations both produced acceptable Type 1 error rates even for smaller samples.
- Published
- 2016
- Full Text
- View/download PDF
48. How reliably can we infer diversity‐dependent diversification from phylogenies?
- Author
-
Albert B. Phillimore, Alex L. Pigot, Rampal S. Etienne, and Etienne group
- Subjects
0106 biological sciences ,0301 basic medicine ,Future studies ,RECONSTRUCTED PHYLOGENIES ,Biology ,010603 evolutionary biology ,01 natural sciences ,03 medical and health sciences ,Paleontology ,conditioning ,parametric bootstrap ,Statistics ,Carrying capacity ,diversity-dependence ,SPECIATION ,MOLECULAR PHYLOGENIES ,RADIATIONS ,Ecology, Evolution, Behavior and Systematics ,Parametric statistics ,Phylogenetic tree ,extinction ,Ecological Modeling ,Model selection ,SPECIES-DIVERSITY ,NICHE ,respiratory system ,simulation ,EVOLUTIONARY MODELS ,Birth-death model ,030104 developmental biology ,Likelihood-ratio test ,diversity dependence ,TREES ,simulations ,EXPLAIN ,Parametric bootstrapping ,human activities ,Type I and type II errors - Abstract
Slow-downs in lineage accumulation in phylogenies suggest that speciation rates decline as diversity increases. Likelihood methods have been developed to detect such diversity-dependence. However, a thorough test of whether such approaches correctly infer diversity-dependence is lacking.Here we simulate phylogenetic branching under linear negative diversity-dependent and diversity-independent models and estimate from the simulated phylogenies the maximum likelihood parameters for three different conditionings – on survival of the birth-death process given the crown age, on tree size (N), and on tree size given the crown age. We report the accuracy of recovering the simulation parameters and the reliability of the model selection based on the χ2 likelihood ratio test.Parameter estimate accuracyConditioning on survival given the crown age yields a severe bias of the carrying capacity K toward N, and an upward bias of the speciation rate, particularly in clades where diversity-dependent feedbacks are still weak (N « K). Conditioning on N yields an overestimate of K and an underestimate of speciation rate, particularly when saturation has been reached. Dual conditioning yields relatively unbiased parameter estimates on average, but the deviation from the true value for any single estimate may be large.Model selection reliabilityThe frequency of incorrectly rejecting a diversity-independent model when the simulation was diversity-independent (Type I error) differs substantially from the significance level α used in the likelihood ratio test, rendering the likelihood ratio test inappropriate. The frequency of correctly rejecting the diversity-independent model when the simulation was diversity-dependent (power) is larger when the clade is closer to equilibrium, and for conditioning on crown age.We conclude that conditioning on crown age has the best statistical properties overall, but caution that parameter estimates may be biased. To assess parameter uncertainty in future studies of diversity-dependence on real data, we recommend parametric bootstrapping, examination of the likelihood surface and comparison of estimates across the types of conditioning. To assess model selection reliability we discourage the use of the χ2 likelihood ratio test or AIC (which are equivalent in this case), but recommend a likelihood ratio test based on parametric bootstrap. We illustrate this method for the diversification of Dendroica warblers.
- Published
- 2016
- Full Text
- View/download PDF
49. The Estimation of The Productivities of Institutions under Ministry of Oceans and Fisheries
- Author
-
Jong-Cheon Kim, Cheol-Hyung Park, and Tae-Hyun Kim
- Subjects
0106 biological sciences ,Estimation ,030506 rehabilitation ,010604 marine biology & hydrobiology ,Public institution ,Production efficiency ,01 natural sciences ,Fishery ,03 medical and health sciences ,Economics ,Production (economics) ,Christian ministry ,Parametric bootstrapping ,0305 other medical science ,Productivity ,Total factor productivity - Abstract
This study applied the parametric bootstrapping method to analyze whether there was a change in the production efficiency of institutions under Ministry of Oceans and Fisheries. This study used input and output oriented productivity simultaneously. In particular, the productivity was estimated through 95% confidence interval derived by 2000 times re-sampling process. The results of the study showed us a reduction in overall total factor productivity by 24% between 2009 and 2013, and 7% of decreases in productivity annually. A recent conditions of an external economic shocks brought a 28% downward shift of production function. In this study, public institutions were divided into three types, which were public, quasi-government, and other public institutions. There were approximately 13%, 1%, and 5% decreases in total factor productivity per each.In analyzing the productivity each of 14 institutions, approximately DMU4 and DMU6 had 4%, and 5% increases in productivity per each. While DMU14 showed us no changes in productivity, all of the other 10 DMUs were estimated the decreases in productivities.Key words : Institution, Bootstraping, Malmquist, Total factor productivity, TypesCorresponding author : 051-629-5319, chpark@pknu.ac.kr
- Published
- 2016
- Full Text
- View/download PDF
50. partR2: partitioning R 2 in generalized linear mixed models.
- Author
-
Stoffel MA, Nakagawa S, and Schielzeth H
- Abstract
The coefficient of determination R
2 quantifies the amount of variance explained by regression coefficients in a linear model. It can be seen as the fixed-effects complement to the repeatability R (intra-class correlation) for the variance explained by random effects and thus as a tool for variance decomposition. The R2 of a model can be further partitioned into the variance explained by a particular predictor or a combination of predictors using semi-partial (part) R2 . Furthermore, the package reports beta weights (standardized regression coefficients). Finally, partR2 implements parametric bootstrapping to quantify confidence intervals for each estimate. We illustrate the use of partR2 with real example datasets for Gaussian and binomial GLMMs and discuss interactions, which pose a specific challenge for partitioning the explained variance among predictors.R2 for fixed effect predictors based on (generalized) linear mixed-effect model fits. The package iteratively removes predictors of interest from the model and monitors the change in the variance of the linear predictor. The difference to the full model gives a measure of the amount of variance explained uniquely by a particular predictor or a set of predictors. partR2 also estimates structure coefficients as the correlation between a predictor and fitted values, which provide an estimate of the total contribution of a fixed effect to the overall prediction, independent of other predictors. Structure coefficients can be converted to the total variance explained by a predictor, here called 'inclusive' R2 , as the square of the structure coefficients times total R2 . Furthermore, the package reports beta weights (standardized regression coefficients). Finally, partR2 implements parametric bootstrapping to quantify confidence intervals for each estimate. We illustrate the use of partR2 with real example datasets for Gaussian and binomial GLMMs and discuss interactions, which pose a specific challenge for partitioning the explained variance among predictors., Competing Interests: The authors declare there are no competing interests., (©2021 Stoffel et al.)- Published
- 2021
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.