11 results on '"Yuehao Bai"'
Search Results
2. Why randomize? Minimax optimality under permutation invariance
- Author
-
Yuehao Bai
- Subjects
Combinatorics ,Economics and Econometrics ,Uniform distribution (continuous) ,Distribution (number theory) ,Optimal estimation ,Group (mathematics) ,Joint probability distribution ,Applied Mathematics ,Estimator ,Invariant (physics) ,Minimax ,Mathematics - Abstract
This paper studies finite sample minimax optimal randomization schemes and estimation schemes in estimating parameters including the average treatment effect, when treatment effects are heterogeneous. A randomization scheme is a distribution over a group of permutations of a given treatment assignment vector. An estimation scheme is a joint distribution over assignment vectors, linear estimators, and permutations of assignment vectors. The key element in the minimax problem is that the worst case is over a class of distributions of the data which is invariant to a group of permutations. First, I show that given any assignment vector and any estimator, the uniform distribution over the same group of permutations, namely the complete randomization scheme, is minimax optimal. Second, under further assumptions on the class of distributions and the objective function, I show the minimax optimal estimation scheme involves completely randomizing an assignment vector, while the optimal estimator is the difference-in-means under complete invariance and a weighted average of within-block differences under a block structure, and the number of treated units is determined by the Neyman allocation.
- Published
- 2023
3. Optimality of Matched-Pair Designs in Randomized Controlled Trials
- Author
-
Yuehao Bai
- Subjects
FOS: Computer and information sciences ,History ,Economics and Econometrics ,Polymers and Plastics ,Average treatment effect ,Econometrics (econ.EM) ,Mathematics - Statistics Theory ,Statistics Theory (math.ST) ,Industrial and Manufacturing Engineering ,law.invention ,Methodology (stat.ME) ,FOS: Economics and business ,Randomized controlled trial ,law ,Statistics ,FOS: Mathematics ,Fraction (mathematics) ,Business and International Management ,Statistics - Methodology ,Economics - Econometrics ,Mathematics ,Estimator ,Nominal level ,Standard error ,Sample size determination ,Null hypothesis - Abstract
This paper studies the optimality of matched-pair designs in randomized controlled trials (RCTs). Matched-pair designs are examples of stratified randomization, in which the researcher partitions a set of units into strata based on their observed covariates and assign a fraction of units in each stratum to treatment. A matched-pair design is such a procedure with two units per stratum. Despite the prevalence of stratified randomization in RCTs, implementations differ vastly. We provide an econometric framework in which, among all stratified randomization procedures, the optimal one in terms of the mean-squared error of the difference-in-means estimator is a matched-pair design that orders units according to a scalar function of their covariates and matches adjacent units. Our framework captures a leading motivation for stratifying in the sense that it shows that the proposed matched-pair design additionally minimizes the magnitude of the ex-post bias, i.e., the bias of the estimator conditional on realized treatment status. We then consider empirical counterparts to the optimal stratification using data from pilot experiments and provide two different procedures depending on whether the sample size of the pilot is large or small. For each procedure, we develop methods for testing the null hypothesis that the average treatment effect equals a prespecified value. Each test we provide is asymptotically exact in the sense that the limiting rejection probability under the null equals the nominal level. We run an experiment on the Amazon Mechanical Turk using one of the proposed procedures, replicating one of the treatment arms in Dellavigna and Pope (2018), and find the standard error decreases by 29%, so that only half of the sample size is required to attain the same standard error.
- Published
- 2022
4. Removal of low-concentration tetracycline from water by a two-step process of adsorption enrichment and photocatalytic regeneration
- Author
-
Jianyu Xing, Jumei Huang, Xi Wang, Feiying Yang, Yuehao Bai, Sha Li, and Xinhao Zhang
- Subjects
Environmental Engineering ,General Medicine ,Management, Monitoring, Policy and Law ,Waste Management and Disposal - Published
- 2023
5. Inference for Support Vector Regression under ℓ1 Regularization
- Author
-
Joshua Shea, Hung Ho, Guillaume Pouliot, and Yuehao Bai
- Subjects
020209 energy ,Inference ,02 engineering and technology ,General Medicine ,Variance (accounting) ,Regularization (mathematics) ,Support vector machine ,Distribution (mathematics) ,020401 chemical engineering ,Linear regression ,0202 electrical engineering, electronic engineering, information engineering ,Test statistic ,0204 chemical engineering ,Binomial proportion confidence interval ,Algorithm ,Mathematics - Abstract
We provide large-sample distribution theory for support vector regression (SVR) with l1-norm along with error bars for the SVR regression coefficients. Although a classical Wald confidence interval obtains from our theory, its implementation inherently depends on the choice of a tuning parameter that scales the variance estimate and thus the width of the error bars. We address this shortcoming by further proposing an alternative large-sample inference method based on the inversion of a novel test statistic that displays competitive power properties and does not depend on the choice of a tuning parameter.
- Published
- 2021
6. A Two-Step Method for Testing Many Moment Inequalities
- Author
-
Andres Santos, Azeem M. Shaikh, and Yuehao Bai
- Subjects
Statistics and Probability ,Economics and Econometrics ,Inequality ,media_common.quotation_subject ,05 social sciences ,Two step ,01 natural sciences ,Moment (mathematics) ,010104 statistics & probability ,0502 economics and business ,Applied mathematics ,0101 mathematics ,Statistics, Probability and Uncertainty ,Finite set ,Social Sciences (miscellaneous) ,050205 econometrics ,media_common ,Mathematics - Abstract
This article considers the problem of testing a finite number of moment inequalities. For this problem, Romano, Shaikh, and Wolf proposed a two-step testing procedure. In the first step, the proced...
- Published
- 2021
7. Cyclic enrichment of chromium based on valence state transformation in metal-free photocatalytic reductive imprinted composite hydrogel
- Author
-
Jianyu, Xing, Jingchang, Li, Feiying, Yang, Yan, Fu, Jumei, Huang, Yuehao, Bai, and Bo, Bai
- Subjects
Chromium ,Chitosan ,History ,Environmental Engineering ,Polymers and Plastics ,Polymers ,Hydrogels ,Hydrogen-Ion Concentration ,Pollution ,Industrial and Manufacturing Engineering ,Kinetics ,Spectroscopy, Fourier Transform Infrared ,Environmental Chemistry ,Pyrroles ,Adsorption ,Business and International Management ,Waste Management and Disposal ,Water Pollutants, Chemical - Abstract
Cr (VI) exists in anion form and can be reduced to positive charged Cr (III) under certain conditions. Can positive charged Cr (III) be continually used for absorbing Cr (VI) to achieve cyclic accumulation of chromium? In this paper, an ion imprinting material for adsorption of Cr (VI) was prepared by dispersing polypyrrole (PPy) in a gelatin/chitosan (Gel/CS) hydrogel network, named Gel/CS/PPy. Based on the conversion of Cr (VI) to Cr (III), a cyclic enrichment process including adsorption-photoreduction-fixation-readsorption of Cr (VI) was established in Gel/CS/PPy hydrogel. The composition and structure of the Gel/CS/PPy were analyzed by scanning electron microscopy (SEM), Fourier transform-infrared spectroscopy (FT-IR), thermogravimetric (TGA), texture analyzer (Universal TA), zeta potential and ultraviolet-visible-near infrared spectra (UV-vis-NIR). The conversion of Cr (VI) and Cr (III) and its promoting effect on readsorption were verified by XPS. The results showed that Gel/CS/PPy has good adsorption capacity for Cr (VI) and excellent photocatalytic ability to reduce Cr (VI) to Cr (III). Cr (III)-loaded Gel/CS/PPy can be further used to adsorb Cr (VI) and showed good adsorption efficiency even after four cycles. The optimal operating condition for Cr (VI) adsorption is pH = 3; 2 g/L dose of Gel/CS/PPy; and the adsorption capacity of Cr (VI) was about 106.8 mg/g after six adsorption cycles. Since Gel/CS/PPy is composed of organic components, high purity chromium can be recovered by simple calcination method later. Therefore, the synthesized Gel/CS/PPy has great potential in the practical application of low concentration Cr (VI) treatment in water.
- Published
- 2022
8. Inference in Experiments with Matched Pairs
- Author
-
Azeem M. Shaikh, Yuehao Bai, and Joseph P. Romano
- Published
- 2019
9. A Practical Method for Testing Many Moment Inequalities
- Author
-
Andres Santos, Yuehao Bai, and Azeem M. Shaikh
- Subjects
Moment (mathematics) ,Inequality ,Computer science ,Feature (computer vision) ,Sample size determination ,media_common.quotation_subject ,Applied mathematics ,Relevance (information retrieval) ,Finite set ,media_common ,Confidence region - Abstract
This paper considers the problem of testing a finite number of moment inequalities. For this problem, Romano et al. (2014) propose a two-step testing procedure. In the first step, the procedure incorporates information about the location of moments using a confidence region. In the second step, the procedure accounts for the use of the confidence region in the first step by adjusting the significance level of the test appropriately. An important feature of the proposed method is that it is “practical” in the sense that it remains computationally feasible even if the number of moments is large. Its justification, however, has so far been limited to settings in which the number of moments is fixed with the sample size. In this paper, we provide weak assumptions under which the same procedure remains valid even in settings in which there are “many” moments in the sense that the number of moments grows rapidly with the sample size. We confirm the practical relevance of our theoretical guarantees in a simulation study. We additionally provide both numerical and theoretical evidence that the procedure compares favorably with the method proposed by Chernozhukov et al. (2019), which has also been shown to be valid in such settings.
- Published
- 2019
10. Inference in Experiments with Matched Pairs
- Author
-
Joseph P. Romano, Yuehao Bai, and Azeem M. Shaikh
- Subjects
Statistics and Probability ,Average treatment effect ,Population ,Inference ,01 natural sciences ,law.invention ,010104 statistics & probability ,Treatment status ,Randomized controlled trial ,law ,Resampling ,0502 economics and business ,Covariate ,Statistics ,Medicine ,0101 mathematics ,education ,050205 econometrics ,Mathematics ,education.field_of_study ,business.industry ,05 social sciences ,Nominal level ,Statistics, Probability and Uncertainty ,business ,Null hypothesis ,Student's t-test - Abstract
This paper studies inference for the average treatment effect in randomized controlled trials where treatment status is determined according to a “matched pairs” design. By a “matched pairs” design, we mean that units are sampled i.i.d. from the population of interest, paired according to observed, baseline covariates and finally, within each pair, one unit is selected at random for treatment. This type of design is used routinely throughout the sciences, but results about its implications for inference about the average treatment effect are not available. The main requirement underlying our analysis is that pairs are formed so that units within pairs are suitably “close” in terms of the baseline covariates, and we develop novel results to ensure that pairs are formed in a way that satisfies this condition. Under this assumption, we show that, for the problem of testing the null hypothesis that the average treatment effect equals a pre-specified value in such settings, the commonly used two-sample t-test and “matched pairs” t-test are conservative in the sense that these tests have limiting rejection probability under the null hypothesis no greater than and typically strictly less than the nominal level. We show, however, that a simple adjustment to the standard errors of these tests leads to a test that is asymptotically exact in the sense that its limiting rejection probability under the null hypothesis equals the nominal level. We also study the behavior of randomization tests that arise naturally in these types of settings. When implemented appropriately, we show that this approach also leads to a test that is asymptotically exact in the sense described previously, but additionally has finite-sample rejection probability no greater than the nominal level for certain distributions satisfying the null hypothesis. A simulation study confirms the practical relevance of our theoretical results.
- Published
- 2019
11. Randomization Under Permutation Invariance
- Author
-
Yuehao Bai
- Subjects
Combinatorics ,Uniform distribution (continuous) ,Distribution (number theory) ,Optimal estimation ,Group (mathematics) ,Joint probability distribution ,Estimator ,Invariant (physics) ,Minimax ,Mathematics - Abstract
This paper studies finite sample minimax optimal randomization schemes and estimation schemes in estimating parameters including the average treat- ment effect, when treatment effects are heterogeneous. A randomization scheme is a distribution over a group of permutations of a given treatment assignment vector. An estimation scheme is a joint distribution over assignment vectors, linear estimators, and permutations of assignment vectors. The key element in the minimax problem is that the worst case is over a class of distributions of the data which is invariant to a group of permutations. First, I show that given any assignment vector and any estimator, the uniform distribution over the same group of permutations, namely the complete randomization scheme, is minimax optimal. Second, under further assumptions on the class of distributions and the objective function, I show the minimax optimal estimation scheme involves completely randomizing an assignment vector, while the optimal estimator is the difference-in-means under complete invariance and a weighted average of within-block differences under a block structure, and the numbers of treated and untreated units are determined by Neyman allocations.
- Published
- 2019
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.