17 results on '"Xingwei Tong"'
Search Results
2. Learning Causal Effect Using Machine Learning with Application to China’s Typhoon
- Author
-
Peng Wu, Xingwei Tong, Min Wu, and Qi-rui Hu
- Subjects
Matching (statistics) ,business.industry ,Applied Mathematics ,05 social sciences ,Machine learning ,computer.software_genre ,01 natural sciences ,Ensemble learning ,Random forest ,010104 statistics & probability ,Causal inference ,0502 economics and business ,Parametric model ,Propensity score matching ,Covariate ,Observational study ,Artificial intelligence ,0101 mathematics ,business ,computer ,050205 econometrics ,Mathematics - Abstract
Matching is a routinely used technique to balance covariates and thereby alleviate confounding bias in causal inference with observational data. Most of the matching literatures involve the estimating of propensity score with parametric model, which heavily depends on the model specification. In this paper, we employ machine learning and matching techniques to learn the average causal effect. By comparing a variety of machine learning methods in terms of propensity score under extensive scenarios, we find that the ensemble methods, especially generalized random forests, perform favorably with others. We apply all the methods to the data of tropical storms that occurred on the mainland of China since 1949.
- Published
- 2020
- Full Text
- View/download PDF
3. Regression and subgroup detection for heterogeneous samples
- Author
-
Yanping Qiu, Baosheng Liang, Xingwei Tong, and Peng Wu
- Subjects
Statistics and Probability ,business.industry ,Computer science ,Feature vector ,05 social sciences ,k-means clustering ,Pattern recognition ,Regression analysis ,Sample (statistics) ,01 natural sciences ,Regression ,Task (project management) ,010104 statistics & probability ,Computational Mathematics ,0502 economics and business ,Pairwise comparison ,Artificial intelligence ,0101 mathematics ,Statistics, Probability and Uncertainty ,business ,Cluster analysis ,050205 econometrics - Abstract
Regression analysis of heterogeneous samples with subgroup structure is essential to the development of precision medicine. In practice, this task is often challenging owing to the lack of prior knowledge of subgroup labels. Therefore, detecting the subgroups with similar characteristics becomes critical, which often controls the accuracy of regression analysis. In this article, we investigate a new framework for detecting the subgroups that have similar characters in feature space and similar treatment effects. The key idea is that we incorporate K-means clustering into the regression framework of concave pairwise fusion, so that the regression and subgroup detection tasks can be performed simultaneously. Our method is specifically tailored for handling the situations where the sample is not homogeneous in the sense that the response variables in different domains of feature space are generated through different mechanisms.
- Published
- 2020
- Full Text
- View/download PDF
4. Hypothesis Testing with Paired Partly Interval Censored Data
- Author
-
Bo Lu, Xingwei Tong, and Ding-jiao Cai
- Subjects
Paired Data ,Wilcoxon signed-rank test ,Applied Mathematics ,Confounding ,Interval (mathematics) ,01 natural sciences ,Test (assessment) ,010104 statistics & probability ,03 medical and health sciences ,0302 clinical medicine ,Statistics ,Observational study ,030212 general & internal medicine ,0101 mathematics ,Parametric statistics ,Mathematics ,Statistical hypothesis testing - Abstract
Partly interval censored data frequently occur in many areas including clinical trials, epidemiology research, and medical follow-up studies. When data come from observational studies, we need to carefully adjust for the confounding bias in order to estimate the true treatment effect. Pair matching designs are popular for removing confounding bias without parametric assumptions. With time-to-event outcomes, there are some literature for hypothesis testing with paired right censored data, but not for interval censored data. O’Brien and Fleming extended the Prentice Wilcoxon test to right censored paired data by making use of the Prentice-Wilcoxon scores. Akritas proposed the Akritas test and established its asymptotic properties. We extend Akritas test to partly interval censored data. We estimate the survival distribution function by nonparametric maximum likelihood estimation (NPMLE), and prove the asymptotic validity of the new test. To improve our test under small sample size or extreme distributions, we also propose a modified version using the rank of the score difference. Simulation results indicate that our proposed methods have very good performance.
- Published
- 2019
- Full Text
- View/download PDF
5. Learning dynamic causal relationships among sugar prices
- Author
-
Jian-ping Chen, Jing Xu, Fang Wang, and Xingwei Tong
- Subjects
Causal graph ,Applied Mathematics ,media_common.quotation_subject ,Closing (real estate) ,Bayesian network ,02 engineering and technology ,Bayesian inference ,Quarter (United States coin) ,01 natural sciences ,010104 statistics & probability ,Statistics ,0202 electrical engineering, electronic engineering, information engineering ,Econometrics ,020201 artificial intelligence & image processing ,0101 mathematics ,Sugar ,Futures contract ,Mathematics ,media_common - Abstract
In this paper, we are interested in exploring the dynamic causal relationships among two sets of three variables in different quarters. One set is futures sugar closing price in Zhengzhou futures exchange market (ZC), spot sugar price in Zhengzhou (ZS) and futures sugar closing price in New York futures exchange market(NC) and the other includes futures sugar opening price in Zhengzhou (ZO), ZS and NC. For each quarter, we first use Bayesian model selection to obtain the optimal causal graph with the highest BD scores and then use Bayesian model averaging approach to explore the causal relationship between every two variables. From the real data analysis, the two conclusions almost coincide, which shows that the two methods are practical.
- Published
- 2017
- Full Text
- View/download PDF
6. Semiparametric partially linear varying coefficient models with panel count data
- Author
-
Xingwei Tong, Xuenan Feng, Xingqiu Zhao, and Xin He
- Subjects
0301 basic medicine ,Likelihood Functions ,Models, Statistical ,Counting process ,Applied Mathematics ,Monte Carlo method ,Linear model ,Reproducibility of Results ,Estimator ,Asymptotic distribution ,General Medicine ,01 natural sciences ,010104 statistics & probability ,03 medical and health sciences ,030104 developmental biology ,Statistics ,Covariate ,Linear Models ,Humans ,Regression Analysis ,Semiparametric regression ,0101 mathematics ,Count data ,Mathematics - Abstract
This paper studies semiparametric regression analysis of panel count data, which arise naturally when recurrent events are considered. Such data frequently occur in medical follow-up studies and reliability experiments, for example. To explore the nonlinear interactions between covariates, we propose a class of partially linear models with possibly varying coefficients for the mean function of the counting processes with panel count data. The functional coefficients are estimated by B-spline function approximations. The estimation procedures are based on maximum pseudo-likelihood and likelihood approaches and they are easy to implement. The asymptotic properties of the resulting estimators are established, and their finite-sample performance is assessed by Monte Carlo simulation studies. We also demonstrate the value of the proposed method by the analysis of a cancer data set, where the new modeling approach provides more comprehensive information than the usual proportional mean model.
- Published
- 2016
- Full Text
- View/download PDF
7. Causality analysis of futures sugar prices in Zhengzhou based on graphical models for multivariate time series
- Author
-
Jing Xu and Xingwei Tong
- Subjects
Discrete mathematics ,0209 industrial biotechnology ,Multivariate statistics ,Applied Mathematics ,Mixed graph ,02 engineering and technology ,01 natural sciences ,010104 statistics & probability ,020901 industrial engineering & automation ,Econometrics ,Graph (abstract data type) ,Graphical model ,0101 mathematics ,Undirected graph ,Futures contract ,Partial correlation ,Mathematics - Abstract
This paper presents a method of constructing a mixed graph which can be used to analyze the causality for multivariate time series. We construct a partial correlation graph at first which is an undirected graph. For every undirected edge in the partial correlation graph, the measures of linear feedback between two time series can help us decide its direction, then we obtain the mixed graph. Using this method, we construct a mixed graph for futures sugar prices in Zhengzhou (ZF), spot sugar prices in Zhengzhou (ZS) and futures sugar prices in New York (NF). The result shows that there is a bi-directional causality between ZF and ZS, an unidirectional causality from NF to ZF, but no causality between NF and ZS.
- Published
- 2016
- Full Text
- View/download PDF
8. Corrected-loss estimation for Error-in-Variable partially linear model
- Author
-
Jiao Jin and Xingwei Tong
- Subjects
General linear model ,Normal distribution ,Hierarchical generalized linear model ,Data set ,Covariance matrix ,General Mathematics ,Statistics ,Covariate ,Linear model ,Estimator ,Applied mathematics ,Mathematics - Abstract
We consider an Error-in-Variable partially linear model where the covariates of linear part are measured with error which follows a normal distribution with a known covariance matrix. We propose a corrected-loss estimation of the covariate effect. The proposed estimator is asymptotically normal. Simulation studies are presented to show that the proposed method performs well with finite samples, and the proposed method is applied to a real data set.
- Published
- 2015
- Full Text
- View/download PDF
9. Sieve Estimation for the Cox Model with Clustered Interval-Censored Failure Time Data
- Author
-
Jianguo Sun, Junlong Li, and Xingwei Tong
- Subjects
Statistics and Probability ,Statistics::Theory ,Proportional hazards model ,Estimation theory ,Estimator ,Regression analysis ,Sample (statistics) ,Interval (mathematics) ,Biochemistry, Genetics and Molecular Biology (miscellaneous) ,law.invention ,Sieve ,law ,Statistics ,Expectation–maximization algorithm ,Mathematics - Abstract
Clustered interval-censored failure time data occur when the failure times of interest are clustered into small groups and known only to lie in certain intervals. A number of methods have been proposed for regression analysis of clustered failure time data, but most of them apply only to clustered right-censored data. In this paper, a sieve estimation procedure is proposed for fitting a Cox frailty model to clustered interval-censored failure time data. In particular, a two-step algorithm for parameter estimation is developed and the asymptotic properties of the resulting sieve maximum likelihood estimators are established. The finite sample properties of the proposed estimators are investigated through a simulation study and the method is illustrated by the data arising from a lymphatic filariasis study.
- Published
- 2012
- Full Text
- View/download PDF
10. Efficient estimation for additive hazards regression with bivariate current status data
- Author
-
Jianguo Sun, Tao Hu, and Xingwei Tong
- Subjects
Correlation ,Bivariate data ,General Mathematics ,Statistics ,Univariate ,Econometrics ,Inference ,Regression analysis ,Bivariate analysis ,Regression ,Mathematics ,Copula (probability theory) - Abstract
This paper discusses efficient estimation for the additive hazards regression model when only bivariate current status data are available. Current status data occur in many fields including demographical studies and tumorigenicity experiments (Keiding, 1991; Sun, 2006) and several approaches have been proposed for the additive hazards model with univariate current status data (Lin et al., 1998; Martinussen and Scheike, 2002). For bivariate data, in addition to facing the same problems as those with univariate data, one needs to deal with the association or correlation between two related failure time variables of interest. For this, we employ the copula model and an efficient estimation procedure is developed for inference. Simulation studies are performed to evaluate the proposed estimates and suggest that the approach works well in practical situations. An illustrative example is provided.
- Published
- 2012
- Full Text
- View/download PDF
11. Causal analysis of futures sugar prices in Zhengzhou
- Author
-
Fang Wang, Jing Xu, Xingwei Tong, and Jian-ping Chen
- Subjects
symbols.namesake ,Bayesian information criterion ,Applied Mathematics ,Metric (mathematics) ,Bayesian probability ,Statistics ,symbols ,Bayesian network ,Akaike information criterion ,Directed acyclic graph ,Futures contract ,Dirichlet distribution ,Mathematics - Abstract
In this paper, we are interested in investigating the causal relationships among futures sugar prices in the Zhengzhou futures exchange market (ZF), the spot sugar prices in Zhengzhou (ZS) and the futures sugar prices in New York futures exchange market (NF). A useful tool called Bayesian network is introduced to analyze the problem. Since there are only three variables in our Bayesian network, the algorithm is straightforward: we display all the 25 possible network structures and adopt certain scoring metrics to evaluate them. We applied five different scoring metrics in total. Firstly, for each metric, we obtained 24 scores, each calculated from one of the 24 possible structures i.e. a Directed Acyclic Graph (DAG). Then we eliminated the network structure which represents the independence of the three variables according to our prior knowledge concerning the futures sugar market. After that, the optimal network structure which implies the causal relationships was selected according to the corresponding scoring metric. Finally, after comparing the results from different scoring metrics, we obtained the relatively affirmative conclusion that ZS causes ZF from both the Bayesian Dirichlet (BD) metric, Bayesian Dirichlet-Akaike Information Criterion (BD-AIC) metric, Bayesian Dirichlet-Bayesian Information Criterion (BD-BIC) metric and Bayesian Information Criterion (BIC) metric. The conclusions that NF causes ZF and ZF causes ZS from the Akaike Information Criterion (AIC) metric and ZF causes ZS from the BIC metric were useful and significant to our investigation.
- Published
- 2011
- Full Text
- View/download PDF
12. Comments on: Nonparametric inference based on panel count data
- Author
-
Xingwei Tong
- Subjects
Statistics and Probability ,Nonparametric inference ,Computer science ,Statistics ,Econometrics ,Statistics, Probability and Uncertainty ,Count data - Published
- 2010
- Full Text
- View/download PDF
13. Semiparametric analysis of longitudinal data with informative observation times
- Author
-
Zhihua Sun, Xiaoyun Mu, Xingwei Tong, and Liuquan Sun
- Subjects
Applied Mathematics ,Statistics ,Econometrics ,Inference ,Estimator ,Sample (statistics) ,Estimating equations ,Latent variable ,Random effects model ,Censoring (statistics) ,Regression ,Mathematics - Abstract
In many longitudinal studies, observation times as well as censoring times may be correlated with longitudinal responses. This paper considers a multiplicative random effects model for the longitudinal response where these correlations may exist and a joint modeling approach is proposed via a shared latent variable. For inference about regression parameters, estimating equation approaches are developed and asymptotic properties of the proposed estimators are established. The finite sample behavior of the methods is examined through simulation studies and an application to a data set from a bladder cancer study is provided for illustration.
- Published
- 2010
- Full Text
- View/download PDF
14. Regression analysis of multivariate recurrent event data with a dependent terminal event
- Author
-
Jianguo Sun, Liang Zhu, Xingwei Tong, and Deo Kumar Srivastava
- Subjects
Male ,Multivariate statistics ,Models, Statistical ,Multivariate analysis ,Biometrics ,Computer science ,Applied Mathematics ,Regression analysis ,Bacterial Infections ,General Medicine ,Bivariate analysis ,Leukemia, Myeloid, Acute ,Mycoses ,Terminal (electronics) ,Recurrence ,Virus Diseases ,Data Interpretation, Statistical ,Multivariate Analysis ,Statistics ,Humans ,Regression Analysis ,Female ,Survival analysis ,Event (probability theory) - Abstract
Recurrent event data occur in many clinical and observational studies (Cook and Lawless, Analysis of recurrent event data, 2007) and in these situations, there may exist a terminal event such as death that is related to the recurrent event of interest (Ghosh and Lin, Biometrics 56:554-562, 2000; Wang et al., J Am Stat Assoc 96:1057-1065, 2001; Huang and Wang, J Am Stat Assoc 99:1153-1165, 2004; Ye et al., Biometrics 63:78-87, 2007). In addition, sometimes there may exist more than one type of recurrent events, that is, one faces multivariate recurrent event data with some dependent terminal event (Chen and Cook, Biostatistics 5:129-143, 2004). It is apparent that for the analysis of such data, one has to take into account the dependence both among different types of recurrent events and between the recurrent and terminal events. In this paper, we propose a joint modeling approach for regression analysis of the data and both finite and asymptotic properties of the resulting estimates of unknown parameters are established. The methodology is applied to a set of bivariate recurrent event data arising from a study of leukemia patients.
- Published
- 2010
- Full Text
- View/download PDF
15. Efficient estimation for semiparametric varying- coefficient partially linear regression models with current status data
- Author
-
Xingwei Tong, Tao Hu, and Hengjian Cui
- Subjects
Polynomial regression ,Statistics::Theory ,Proper linear model ,Applied Mathematics ,Statistics ,Linear model ,Statistics::Methodology ,Principal component regression ,Log-linear model ,Semiparametric regression ,Mathematics ,Semiparametric model ,Variance function - Abstract
This article considers a semiparametric varying-coefficient partially linear regression model with current status data. The semiparametric varying-coefficient partially linear regression model which is a generalization of the partially linear regression model and varying-coefficient regression model that allows one to explore the possibly nonlinear effect of a certain covariate on the response variable. A Sieve maximum likelihood estimation method is proposed and the asymptotic properties of the proposed estimators are discussed. Under some mild conditions, the estimators are shown to be strongly consistent. The convergence rate of the estimator for the unknown smooth function is obtained and the estimator for the unknown parameter is shown to be asymptotically efficient and normally distributed. Simulation studies are conducted to examine the small-sample properties of the proposed estimates and a real dataset is used to illustrate our approach.
- Published
- 2009
- Full Text
- View/download PDF
16. Consistency and normality of Huber-Dutter estimators for partial linear model
- Author
-
Hengjian Cui, Peng Yu, and Xingwei Tong
- Subjects
Mathematical optimization ,Efficient estimator ,Rate of convergence ,General Mathematics ,Ordinary least squares ,Estimator ,Applied mathematics ,Minimax estimator ,Scale parameter ,Invariant estimator ,Mathematics ,Nonparametric regression - Abstract
For partial linear model Y = Xτβ0 + g0(T) + ∈ with unknown β0 ∈ ȑd and an unknown smooth function g0, this paper considers the Huber-Dutter estimators of β0, scale σ for the errors and the function g0 approximated by the smoothing B-spline functions, respectively. Under some regularity conditions, the Huber-Dutter estimators of β0 and σ are shown to be asymptotically normal with the rate of convergence n−1/2 and the B-spline Huber-Dutter estimator of g0 achieves the optimal rate of convergence in nonparametric regression. A simulation study and two examples demonstrate that the Huber-Dutter estimator of β0 is competitive with its M-estimator without scale parameter and the ordinary least square estimator.
- Published
- 2008
- Full Text
- View/download PDF
17. Asymptotics of Huber-Dutter Estimators for Partial Linear Model with Nonstochastic Designs
- Author
-
Hengjian Cui, Xingwei Tong, and Hui Zhao
- Subjects
Delta method ,Mathematical optimization ,Efficient estimator ,Bias of an estimator ,Applied Mathematics ,Consistent estimator ,Estimator ,Applied mathematics ,Trimmed estimator ,M-estimator ,Invariant estimator ,Mathematics - Abstract
For partial linear model Y = X τ β 0 + g 0(T) + e with unknown β 0 ∈¸ R d and an unknown smooth function g 0, this paper considers the Huber-Dutter estimators of β 0, scale σ for the errors and the function g 0 respectively, in which the smoothing B-spline function is used. Under some regular conditions, it is shown that the Huber-Dutter estimators of β 0 and σ are asymptotically normal with convergence rate n -1/2 and the B-spline Huber-Dutter estimator of g 0 achieves the optimal convergence rate in nonparametric regression. A simulation study demonstrates that the Huber-Dutter estimator of β 0 is competitive with its M-estimator without scale parameter and the ordinary least square estimator. An example is presented after the simulation study.
- Published
- 2005
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.