Author: "Yoonsuh Jung" - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Yoonsuh Jung"' showing total 30 results

Start Over Author "Yoonsuh Jung"

30 results on '"Yoonsuh Jung"'

1. In-frame cDNA library combined with protein complementation assay identifies ARL11-binding partners.

Author: Sangkyou Lee, Ilkyun Lee, Yoonsuh Jung, David McConkey, and Bogdan Czerniak
Subjects: Medicine, Science
Abstract: The cDNA expression libraries that produce correct proteins are essential in facilitating the identification of protein-protein interactions. The 5'-untranslated regions (UTRs) that are present in the majority of mammalian and non-mammalian genes are predicted to alter the expression of correct proteins from cDNA libraries. We developed a novel cDNA expression library from which 5'-UTRs were removed using a mixture of polymerase chain reaction primers that complement the Kozak sequences we refer to as an "in-frame cDNA library." We used this library with the protein complementation assay to identify two novel binding partners for ras-related ADP-ribosylation factor-like 11 (ARL11), cellular retinoic acid binding protein 2 (CRABP2), and phosphoglycerate mutase 1 (PGAM1). Thus, the in-frame cDNA library without 5'-UTRs we describe here increases the chance of correctly identifying protein interactions and will have wide applications in both mammalian and non-mammalian detection systems.
Published: 2012
Full Text: View/download PDF

2. Tutorial and applications of convolutional neural network models in image classification

Author: Dong-Gyu Lee and Yoonsuh Jung
Published: 2022

3. A review and comparison of convolution neural network models under a unified framework

Author: Jimin Park and Yoonsuh Jung
Subjects: Statistics and Probability, Applied Mathematics, Modeling and Simulation, Statistics, Probability and Uncertainty, Finance
Published: 2022

4. Deep support vector quantile regression with non-crossing constraints

Author: Wooyoung Shin and Yoonsuh Jung
Subjects: Statistics and Probability, Computational Mathematics, Statistics, Probability and Uncertainty
Published: 2022

5. Regularized linear censored quantile regression

Author: Yoonsuh Jung, Taehwa Choi, Sangbum Choi, Seung Jun Shin, and Minjeong Son
Subjects: Statistics and Probability, Statistics, Mathematics, Quantile regression
Published: 2021

6. Efficient information-based quantile regression model tuning with heteroscedastic errors

Author: Yoonsuh Jung and Wooyoung Shin
Subjects: Heteroscedasticity, Econometrics, Quantile regression model, Mathematics
Published: 2021

7. Weighted validation of heteroscedastic regression models for better selection

Author: Yoonsuh Jung and Ha Young Kim
Subjects: Heteroscedasticity, Model selection, Statistics, Analysis, Selection (genetic algorithm), Cross-validation, Computer Science Applications, Information Systems, Mathematics, Heteroscedastic regression
Published: 2021

8. Efficient information-based criteria for model selection in quantile regression

Author: Yoonsuh Jung, Wooyoung Shin, and Mingang Kim
Subjects: Statistics and Probability, Smoothing spline, Goodness of fit, Sample size determination, Model selection, Statistics, Statistics::Methodology, Variance reduction, Smoothing, Mathematics, Quantile, Quantile regression
Abstract: Information-based model selection criteria such as the AIC and BIC employ check loss functions to measure the goodness of fit for quantile regression models. Model selection using a check loss function is robust due to its resistance to outlying observations. In the present study, we suggest modifying the check loss function to achieve a more efficient goodness of fit. Because the cusp of the check loss is quadratically adjusted in the modified version, greater efficiency (or variance reduction) in the model selection is expected. Because we focus on model selection here, we do not modify the model-fitting process. Generalized cross-validation is another common method for choosing smoothing parameters in quantile smoothing splines. We describe how this can be adjusted using the modified check loss to increase efficiency. The proposed generalized cross-validation is designed to reflect the target quantile and sample size. Two real data sets and simulation studies are presented to evaluate its performance using linear and nonlinear quantile regression models.
Published: 2021

9. Comparative study of prediction models forpublic bicycle demand in Seoul

Author: Yoonsuh Jung and Soah Min
Subjects: Computer science, Econometrics, Predictive modelling
Published: 2021

10. Comparison of audio input representations on piano transcription using neural networks

Author: Yoonsuh Jung and Hye Min Han
Subjects: Artificial neural network, Computer science, Speech recognition, Piano, Spectrogram, Transcription (software)
Published: 2021

11. Modified check loss for efficient estimation via model selection in quantile regression

Author: Yoonsuh Jung, Hang J. Kim, and Steven N. MacEachern
Subjects: Statistics and Probability, Estimation, 021103 operations research, Distribution (number theory), Model selection, 0211 other engineering and technologies, Articles, 02 engineering and technology, Function (mathematics), 01 natural sciences, Cross-validation, Quantile regression, 010104 statistics & probability, Empirical research, Statistics, 0101 mathematics, Statistics, Probability and Uncertainty, Mathematics
Abstract: The check loss function is used to define quantile regression. In cross-validation, it is also employed as a validation function when the true distribution is unknown. However, our empirical study indicates that validation with the check loss often leads to overfitting the data. In this work, we suggest a modified or L2-adjusted check loss which rounds the sharp corner in the middle of check loss. This has the effect of guarding against overfitting to some extent. The adjustment is devised to shrink to zero as sample size grows. Through various simulation settings of linear and nonlinear regressions, the improvement due to modification of the check loss by quadratic adjustment is examined empirically.
Published: 2020

12. A numerical study on group quantile regression models

Author: Doyoen Kim and Yoonsuh Jung
Subjects: Statistics and Probability, Group (mathematics), Applied Mathematics, 05 social sciences, Structure (category theory), Regression analysis, Feature selection, 01 natural sciences, Quantile regression, 010104 statistics & probability, Dummy variable, Modeling and Simulation, 0502 economics and business, Statistics, Covariate, 0101 mathematics, Statistics, Probability and Uncertainty, Categorical variable, Finance, 050205 econometrics, Mathematics
Abstract: Grouping structures in covariates are often ignored in regression models. Recent statistical developments considering grouping structure shows clear advantages; however, reflecting the grouping structure on the quantile regression model has been relatively rare in the literature. Treating the grouping structure is usually conducted by employing a group penalty. In this work, we explore the idea of group penalty to the quantile regression models. The grouping structure is assumed to be known, which is commonly true for some cases. For example, group of dummy variables transformed from one categorical variable can be regarded as one group of covariates. We examine the group quantile regression models via two real data analyses and simulation studies that reveal the beneficial performance of group quantile regression models to the non-group version methods if there exists grouping structures among variables.
Published: 2019

13. Review: Reversed low-rank ANOVA model for transforming high dimensional genetic data into low dimension

Author: Yoonsuh Jung and Jianhua Hu
Subjects: Statistics and Probability, Rank (linear algebra), 05 social sciences, Feature selection, 01 natural sciences, Regression, Term (time), Set (abstract data type), 010104 statistics & probability, Variable (computer science), Dimension (vector space), 0502 economics and business, Statistics, Multiplication, 0101 mathematics, 050205 econometrics, Mathematics
Abstract: A general modeling procedure for analyzing genetic data is reviewed. We review ANOVA type model that can handle both the continuous and discrete genetic variables in one modeling framework. Unlike the regression type models which typically set the phenotype variable as a response, this ANOVA model treats the phenotype variable as an explanatory variable. By reversely treating the phenotype variable, usual high dimensional problem is turned into low dimension. Instead, the ANOVA model always includes interaction term between the genetic locations and phenotype variable to find potential association between them. The interaction term is designed to be low rank with the multiplication of bilinear terms so that the required number of parameters is kept in a manageable degree. We compare the performance of the reviewed ANOVA model to the other popular methods via microarray and SNP data sets.
Published: 2019

14. Optimal regression parameter-specific shrinkage by plug-in estimation

Author: Yoonsuh Jung
Subjects: Statistics and Probability, Shrinkage estimator, Estimation, 021103 operations research, 0211 other engineering and technologies, Estimator, 02 engineering and technology, 01 natural sciences, Regression, 010104 statistics & probability, Statistics, 0101 mathematics, Bias–variance tradeoff, Mathematics, Shrinkage
Abstract: One benefit of the bias-variance tradeoff is that regression estimators do not have to be strictly unbiased. However, to take full advantage of allowing bias, shrinkage regression estimators requir...
Published: 2019

15. Nonlinear regression models for heterogeneous data with massive outliers

Author: Yoonsuh Jung
Subjects: Statistics and Probability, Class (set theory), Heteroscedasticity, 021103 operations research, Generalized additive model, 0211 other engineering and technologies, 02 engineering and technology, 01 natural sciences, Robust regression, 010104 statistics & probability, Nonlinear system, Transformation (function), Outlier, Applied mathematics, 0101 mathematics, Statistics, Probability and Uncertainty, Nonlinear regression, Mathematics
Abstract: The income or expenditure-related data sets are often nonlinear, heteroscedastic, skewed even after the transformation, and contain numerous outliers. We propose a class of robust nonlinear...
Published: 2018

16. Transformed low-rank ANOVA models for high-dimensional variable selection

Author: Yoonsuh Jung, Hong Zhang, and Jianhua Hu
Subjects: Statistics and Probability, Biomedical Research, Genotype, Epidemiology, Gene Expression, Feature selection, High dimensional, Polymorphism, Single Nucleotide, 01 natural sciences, Uterine Cervical Diseases, 010104 statistics & probability, 03 medical and health sciences, 0302 clinical medicine, Health Information Management, Statistics, Humans, 030212 general & internal medicine, 0101 mathematics, Mathematics, Analysis of Variance, Leukemia, Rank (computer programming), Confounding Factors, Epidemiologic, Data Interpretation, Statistical, Female, Analysis of variance, Algorithms
Abstract: High-dimensional data are often encountered in biomedical, environmental, and other studies. For example, in biomedical studies that involve high-throughput omic data, an important problem is to search for genetic variables that are predictive of a particular phenotype. A conventional solution is to characterize such relationships through regression models in which a phenotype is treated as the response variable and the variables are treated as covariates; this approach becomes particularly challenging when the number of variables exceeds the number of samples. We propose a general framework for expressing the transformed mean of high-dimensional variables in an exponential distribution family via ANOVA models in which a low-rank interaction space captures the association between the phenotype and the variables. This alternative method transforms the variable selection problem into a well-posed problem with the number of observations larger than the number of variables. In addition, we propose a model selection criterion for the new model framework with a diverging number of parameters, and establish the consistency of the selection criterion. We demonstrate the appealing performance of the proposed method in terms of prediction and detection accuracy through simulations and real data analyses.
Published: 2018

17. Shrinkage estimation of proportion via logit penalty

Author: Yoonsuh Jung
Subjects: Statistics and Probability, Estimation, Shrinkage estimator, Class (set theory), Bias of an estimator, Logit, Statistics, Econometrics, Estimator, Shrinkage, Mathematics
Abstract: By releasing the unbiasedness condition, we often obtain more accurate estimators due to the bias–variance trade-off. In this paper, we propose a class of shrinkage proportion estimators which show...
Published: 2016

18. Effect of statin and aspirin use on toxicity and pathological complete response rate of neo-adjuvant chemoradiation for rectal cancer

Author: Yoonsuh Jung, Claire Hardie, and Michael B. Jameson
Subjects: 0301 basic medicine, medicine.medical_specialty, Statin, Colorectal cancer, medicine.drug_class, medicine.medical_treatment, Gastroenterology, law.invention, 03 medical and health sciences, 0302 clinical medicine, Randomized controlled trial, law, Internal medicine, Rectal Adenocarcinoma, Medicine, Stage (cooking), Aspirin, business.industry, Retrospective cohort study, General Medicine, medicine.disease, Surgery, Radiation therapy, 030104 developmental biology, Oncology, 030220 oncology & carcinogenesis, business, medicine.drug
Abstract: Aim To retrospectively evaluate the potential impact of statin and aspirin use on acute toxicity and pathological complete response (pCR) rate in rectal cancer patients receiving neo-adjuvant long-course radiation therapy (LCRT) with concurrent chemotherapy. Methods A retrospective review was performed of all patients undergoing neo-adjuvant LCRT for rectal adenocarcinoma at the Regional Cancer Treatment Service between 1 September 2007 and 1 June 2011. Data obtained include demographic details; date and radiological TNM stage at diagnosis; medication taken at time of RT; toxicity during LCRT; and surgical histology to determine if a pCR was obtained following LCRT. Results Neo-adjuvant LCRT was administered to 142 patients for rectal cancer during this period; concurrent chemotherapy was omitted in 13 due to significant comorbidities. TNM stage was 2 or 3 radiologically at diagnosis in 127 (89.4%) of patients. At the time of LCRT, 23% were taking a statin and 25% were taking aspirin. Of 135 assessable patients, 34 (13%) achieved a pCR at surgery. On logistic regression, pCR was not significantly associated with the use of chemotherapy, statins, aspirin, other NSAIDs, T-stage or N-stage. There was no significant correlation between statin or aspirin use with bladder or rectal toxicity. Actuarial time to maximum rectal toxicity was not different in statin users or nonusers. Conclusion In contrast to other larger retrospective series, this study did not find improvements in toxicity or pCR rate through statin or aspirin use in rectal cancer patients undergoing LCRT. Their potential benefits in this setting would be best studied prospectively in a large randomized trial.
Published: 2016

19. Robust regression for highly corrupted response by shifting outliers

Author: Yoonsuh Jung, Jianhua Hu, and Seung Pil Lee
Subjects: Statistics and Probability, Degrees of freedom, Robust statistics, Least trimmed squares, 02 engineering and technology, 01 natural sciences, Robust regression, 010104 statistics & probability, Huber loss, Outlier, Statistics, 0202 electrical engineering, electronic engineering, information engineering, 020201 artificial intelligence & image processing, 0101 mathematics, Statistics, Probability and Uncertainty, Mathematics
Abstract: Outlying observations are often disregarded at the sacrifice of degrees of freedom or downsized via robust loss functions (e.g., Huber's loss) to reduce the undesirable impact on data analysis. In this article, we treat the outlying status of each observation as a parameter and propose a penalization method to automatically adjust the outliers. The proposed method shifts the outliers towards the fitted values, while preserve the non-outlying observations. We also develop a generally applicable algorithm in the iterative fashion to estimate model parameters and demonstrate the connection with the maximum likelihood based estimation procedure in the case of least squares estimation. We establish asymptotic property of the resulting parameter estimators under the condition that the proportion of outliers does not vanish as sample size increases. We apply the proposed outlier adjustment method to ordinary least squares and lasso-type penalization procedure and demonstrate its empirical value via numeric studies. Furthermore, we study applicability of the proposed method to two robust estimators, Huber's robust estimator and Huberized lasso, and demonstrate its noticeable improvement of model fit in the presence of extremely large outliers.
Published: 2016

20. AK-fold averaging cross-validation procedure

Author: Yoonsuh Jung and Jianhua Hu
Subjects: Statistics and Probability, Fold (higher-order function), Model selection, Statistics, Feature selection, Statistics, Probability and Uncertainty, Algorithm, Cross-validation, Mathematics
Abstract: Cross-validation (CV) type of methods have been widely used to facilitate model estimation and variable selection. In this work, we suggest a new K-fold CV procedure to select a candidate ‘optimal’...
Published: 2015

21. Efficient quantile regression for heteroscedastic models

Author: Yoonkyung Lee, Yoonsuh Jung, and Steven N. MacEachern
Subjects: Statistics and Probability, Heteroscedasticity, Applied Mathematics, Rounding, Contrast (statistics), Conditional expectation, Regression, Quantile regression, Modeling and Simulation, Statistics, Range (statistics), Statistics, Probability and Uncertainty, Quantile, Mathematics
Abstract: Quantile regression (QR) provides estimates of a range of conditional quantiles. This stands in contrast to traditional regression techniques, which focus on a single conditional mean function. Lee et al. [Regularization of case-specific parameters for robustness and efficiency. Statist Sci. 2012;27(3):350–372] proposed efficient QR by rounding the sharp corner of the loss. The main modification generally involves an asymmetric l2 adjustment of the loss function around zero. We extend the idea of l2 adjusted QR to linear heterogeneous models. The l2 adjustment is constructed to diminish as sample size grows. Conditions to retain consistency properties are also provided.
Published: 2014

22. Impact of Concomitant Surgical Atrial Fibrillation Ablation in Patients Undergoing Aortic Valve Replacement

Author: Yoonsuh Jung, Jae Won Lee, Sun Kyun Ro, Sung Ho Jung, Joon Bum Kim, Cheol Hyun Chung, Suk Jung Choo, and Jae Suk Yoo
Subjects: Ablation Techniques, Male, Aortic valve, medicine.medical_specialty, Cox maze procedure, medicine.medical_treatment, Disease-Free Survival, Aortic valve replacement, Interquartile range, Internal medicine, Atrial Fibrillation, medicine, Humans, Sinus rhythm, Aged, Heart Valve Prosthesis Implantation, Ejection fraction, business.industry, Atrial fibrillation, General Medicine, Middle Aged, medicine.disease, Survival Rate, medicine.anatomical_structure, Aortic Valve, Concomitant, Cardiology, Female, Cardiology and Cardiovascular Medicine, business
Abstract: BACKGROUND: The clinical benefit of concomitant atrial fibrillation (AF) ablation at the time of aortic valve replacement (AVR) is uncertain. METHODS AND RESULTS: A total of 124 patients with AF who underwent AVR with (n=50) or without (n=74) a concomitant maze procedure, between 2000 and 2011, were evaluated. There were no significant differences in early postoperative outcomes. During a median clinical follow-up of 18.1 months (interquartile range: 6.9-47.8 months), 19 late deaths (15.3%) and 33 valve-related complications (26.6%) occurred, but the differences between groups were not statistically significant. Major event-free survival at 5 years was 60.9±9.9% vs. 57.0±10.3% (P=0.41). After adjustment, the maze group demonstrated similar risks for major adverse cardiac events (hazard ratio, 1.18; 95% confidence interval, 0.56-2.49; P=0.67). However, the rate of sinus rhythm restoration at 4 years was significantly higher in the maze group (80.6% vs. 3.6%, P
Published: 2014

23. A

Author: Yoonsuh, Jung and Jianhua, Hu
Subjects: Article
Abstract: Cross-validation type of methods have been widely used to facilitate model estimation and variable selection. In this work, we suggest a new K-fold cross validation procedure to select a candidate ‘optimal’ model from each hold-out fold and average the K candidate ‘optimal’ models to obtain the ultimate model. Due to the averaging effect, the variance of the proposed estimates can be significantly reduced. This new procedure results in more stable and efficient parameter estimation than the classical K-fold cross validation procedure. In addition, we show the asymptotic equivalence between the proposed and classical cross validation procedures in the linear regression setting. We also demonstrate the broad applicability of the proposed procedure via two examples of parameter sparsity regularization and quantile smoothing splines modeling. We illustrate the promise of the proposed method through simulations and a real data example.
Published: 2016

24. Effect of statin and aspirin use on toxicity and pathological complete response rate of neo-adjuvant chemoradiation for rectal cancer

Author: Claire, Hardie, Yoonsuh, Jung, and Michael, Jameson
Subjects: Adult, Aged, 80 and over, Male, Aspirin, Rectal Neoplasms, Anti-Inflammatory Agents, Non-Steroidal, Chemoradiotherapy, Adjuvant, Adenocarcinoma, Middle Aged, Young Adult, Humans, Female, Hydroxymethylglutaryl-CoA Reductase Inhibitors, Aged, Retrospective Studies
Abstract: To retrospectively evaluate the potential impact of statin and aspirin use on acute toxicity and pathological complete response (pCR) rate in rectal cancer patients receiving neo-adjuvant long-course radiation therapy (LCRT) with concurrent chemotherapy.A retrospective review was performed of all patients undergoing neo-adjuvant LCRT for rectal adenocarcinoma at the Regional Cancer Treatment Service between 1 September 2007 and 1 June 2011. Data obtained include demographic details; date and radiological TNM stage at diagnosis; medication taken at time of RT; toxicity during LCRT; and surgical histology to determine if a pCR was obtained following LCRT.Neo-adjuvant LCRT was administered to 142 patients for rectal cancer during this period; concurrent chemotherapy was omitted in 13 due to significant comorbidities. TNM stage was 2 or 3 radiologically at diagnosis in 127 (89.4%) of patients. At the time of LCRT, 23% were taking a statin and 25% were taking aspirin. Of 135 assessable patients, 34 (13%) achieved a pCR at surgery. On logistic regression, pCR was not significantly associated with the use of chemotherapy, statins, aspirin, other NSAIDs, T-stage or N-stage. There was no significant correlation between statin or aspirin use with bladder or rectal toxicity. Actuarial time to maximum rectal toxicity was not different in statin users or nonusers.In contrast to other larger retrospective series, this study did not find improvements in toxicity or pCR rate through statin or aspirin use in rectal cancer patients undergoing LCRT. Their potential benefits in this setting would be best studied prospectively in a large randomized trial.
Published: 2016

25. Efficient Tuning Parameter Selection By Cross-Validated Score In High Dimensional Models

Author: Yoonsuh Jung
Subjects: Cross Validation, Regularization Parameter Search, Parameter Averaging, Parameter Selection
Abstract: As DNA microarray data contain relatively small sample size compared to the number of genes, high dimensional models are often employed. In high dimensional models, the selection of tuning parameter (or, penalty parameter) is often one of the crucial parts of the modeling. Cross-validation is one of the most common methods for the tuning parameter selection, which selects a parameter value with the smallest cross-validated score. However, selecting a single value as an ‘optimal’ value for the parameter can be very unstable due to the sampling variation since the sample sizes of microarray data are often small. Our approach is to choose multiple candidates of tuning parameter first, then average the candidates with different weights depending on their performance. The additional step of estimating the weights and averaging the candidates rarely increase the computational cost, while it can considerably improve the traditional cross-validation. We show that the selected value from the suggested methods often lead to stable parameter selection as well as improved detection of significant genetic variables compared to the tradition cross-validation via real data and simulated data sets., {"references":["J. Zhu and T. Hastie, \"Classification of gene microarrays by penalized\nlogistic regression,\" Biostatistics, vol. 5, no. 3, pp. 427 – 443, 2004.","L. Shen and E. C. Tan, \"Dimension reduction-based penalized logistic\nregression for cancer classification using microarray data,\" IEEE/ACM\nTransactions on Computational Biology and Bioinformatics, vol. 2,\nno. 2, pp. 166 – 175, 2005.","C. Li and H. Li, \"Network-constrained regularization and variable\nselection for analysis of genomic data,\" Bioinformatics, vol. 24, no. 9,\npp. 1175 – 1182, 2008.","W. Pan, B. Xie, and X. Shen, \"Incorporating predictor network in\npenalized regression with application to microarray data,\" Biometrics,\nvol. 66, pp. 474 – 484, 2010.","G. Fort and S. Lambert-Lacroix, \"Classification using partial least\nsquares with penalized logistic regression,\" Bioinformatics, vol. 21,\nno. 7, pp. 1104 – 1111, 2005.","G. C. Cawley and N. L. C. Talbot, \"Gene selection in cancer\nclassification using sparse logistic regression with bayesian\nregularization,\" Bioinformatics, vol. 22, no. 19, pp. 2348 – 2355,\n2006.","L. Waldron, M. Pintilie, M.-S. Tsao, F. A. Shepherd, C. Huttenhower,\nand I. Jurisica, \"Optimized application of penalized regression methods\nto diverse genomic data,\" Bioinformatics, vol. 27, no. 24, pp. 3399 –\n3406, 2011.","P. Breheny and J. Huang, \"Coordinate descent algorithms for nonconvex\npenalized regression, with applications to biological feature selection,\"\nThe Annals of Applied Statistics, vol. 5, no. 457, pp. 232 – 253, 2011.","R. Tibshirani, \"Regression shrinkage and selection via the lasso,\"\nJournal of the Royal Statistical Society. Series B (Methodological),\nvol. 58, no. 1, pp. 267 – 288, 1996.\n[10] J. Friedman, T. Hastie, and R. Tibshirani, \"Regularization paths for\ngeneralized linear models via coordinate descent,\" Journal of Statistical\nSoftware, vol. 33, no. 1, pp. 1 – 22, 2008. [Online]. Available:\nhttp://www.jstatsoft.org/v33/i01/\n[11] N. Simon, J. Friedman, T. Hastie, and R. Tibshirani, \"Regularization\npaths for cox's proportional hazards model via coordinate descent,\"\nJournal of Statistical Software, vol. 39, no. 5, pp. 1 – 13, 2011.\n[Online]. Available: http://www.jstatsoft.org/v39/i05/\n[12] M. Y. Park and T. Hastie, \"L1 regularization path algorithm for\ngeneralized linear models,\" Journal of the Royal Statistical Society.\nSeries B (Methodological), vol. 69, no. 4, pp. 659 – 677, 2007.\n[13] R. Tibshirani and J. Taylor, \"The solution path of the generalized lasso,\"\nAnnals of Statistics, vol. 39, no. 3, pp. 1335 – 1371, 2011.\n[14] M. Stone, \"Cross-validatory choice and the assessment of statistical\npredictions (with discussion),\" Journal of the Royal Statistical Society.\nSeries B (Methodological), vol. 36, no. 2, pp. 111 – 147, 1974.\n[15] S. Geisser, \"The predictive sample reuse method with applications,\"\nJournal of the American Statistical Association, vol. 70, no. 350, pp.\n320 – 328, 1975.\n[16] L. J. Buturovi´c, \"Pcp: a program for supervised classification of gene\nexpression profiles,\" Bioinformatics, vol. 22, no. 2, pp. 245 – 247, 2006.\n[17] V. V. Belle, K. Pelckmans, S. V. Huffel, and J. A. K. Suykens, \"Improved\nperformance on high-dimensional survival data by application of\nsurvival-svm,\" Bioinformatics, vol. 27, no. 1, pp. 87 – 94, 2011.\n[18] A.-L. Boulesteix, C. Porzelius, and M. Daumer, \"Microarray-based\nclassification and clinical predictors: on combined classifiers and\nadditional predictive value,\" Bioinformatics, vol. 24, no. 15, pp. 1698 –\n1706, 2008.\n[19] W. Pan and X. Shen, \"Penalized model-based clustering with application\nto variable selection,\" Journal of Machine Learning Research, vol. 8, pp.\n1145 – 1164, 2007.\n[20] T. Hancock, I. Takigawa, and H. Mamitsuka, \"Mining metabolic\npathways through gene expression,\" Bioinformatics, vol. 26, no. 17, pp.\n2128 – 2135, 2010.\n[21] S. Arlot and A. Celisse, \"A survey of cross-validation procedures for\nmodel selection,\" Statistics Surveys, vol. 4, pp. 40 – 79, 2010.\n[22] B. Efron and R. Tibshirani, \"Improvements on cross-validation:\nThe .632+ bootstrap method,\" Journal of the American Statistical\nAssociation, vol. 92, no. 438, pp. 548 – 560, 1997.\n[23] U. Braga-Neto, R. Hashimoto, E. R. Dougherty, D. V. Nguyen, and\nR. J. Carroll, \"Is cross-validation better than resubstitution for ranking\ngenes?\" Bioinformatics, vol. 20, no. 2, pp. 253 – 258, 2004.\n[24] B. Scholk¨opf, K. Sung, C. Burges, T. P. F. Girosi, P. Niyogi, and\nV. Vapnik., \"Comparing support vector machines with gaussian kernels\nto radial basis function classifiers,\" IEEE Trans. Sign. Processing,\nvol. 45, pp. 2758 – 2765, 1997.\n[25] E. Dimitriadou, K. Hornik, F. Leisch, D. Meyer, and A. Weingessel,\n\"e1071: Misc functions of the department of statistics (e1071),\" TU\nWien,Version 1.5-11, Tech. Rep., 2005.\n[26] A. Karatzoglou, A. Smola, K. Hornik, and A. Zeileis, \"kernlab –\nan S4 package for kernel methods in R,\" Journal of Statistical\nSoftware, vol. 11, no. 9, pp. 1 – 20, 2004. [Online]. Available:\nhttp://www.jstatsoft.org/v11/i09/\n[27] Y. Guo, T. Hastie, and R. Tibshirani, \"Regularized linear discriminant\nanalysis and its application in microarrays,\" Biostatistics, vol. 8, pp. 86\n– 100, 2007.\n[28] G. Schwarz, \"Estimating the dimension of a model,\" The Annals of\nStatistics, vol. 6, no. 2, pp. 461 – 464, 1978.\n[29] H. Akaike, \"A new look at the statistical model identification,\" IEEE\nTransactions on Automatic Control, vol. 19, no. 6, pp. 716 – 723,\n1974.\n[30] J. Chen and Z. Chen, \"Extended bayesian information criteria for model\nselection with large model spaces,\" Biometrika, vol. 95, no. 3, pp. 759\n– 771, 2008.\n[31] H. Wang, B. Li, and C. Leng, \"Shrinkage tuning parameter selection\nwith a diverging number of parameters,\" Journal of the Royal Statistical\nSociety. Series B (Methodological), vol. 71, no. 3, pp. 671 – 683, 2009.\n[32] J. Chen and Z. Chen, \"Extended BIC for small-n-large-p sparse GLM,\"\nStatistica Sinica, vol. 22, pp. 555 – 574, 2012.\n[33] A. E. Hoerl and R. W. Kennard, \"Ridge regression: Biased estimation\nfor nonorthogonal problems,\" Technometrics, vol. 12, no. 1, pp. 55 –\n67, 1970.\n[34] A. Karatzoglou, D. Meyer, and K. Hornik, \"Support vector machines in\nr,\" Journal of Statistical Software, vol. 15, no. 9, pp. 1 – 28, 4 2006.\n[35] T. Golub, D. Slonim, P. Tamayo, C. Huard, M. Gaasenbeek, J. Mesirov,\nH. Coller, M. Loh, J. Downing, C. Caligiuri, M.A.and Bloomfield, and\nE. Lander, \"Molecular classification of cancer: class discovery and class\nprediction by gene expression monitoring.\" Science, vol. 286, pp. 531 –\n537, 1999.\n[36] U. Alon, N. Barkai, D. Notterman, K. Gish, S. Mack, and J. Levine,\n\"Broad patterns of gene expression revealed by clustering analysis of\ntumor and normal colon tissues probed by oligonucleotide arrays.\"\nProceedings of the National Academy of Sciences of the USA, vol. 96,\npp. 6745 – 6750, 1999. [37] A. Alizadeh, M. Eisen, R. Davis, C. Ma, I. Lossos, A. Rosenwald,\nJ. Boldrick, H. Sabet, T. Tran, and X. e. a. Yu, \"Distinct types of diffuse\nlarge b-cell lymphoma identified by gene expression profiling.\" Nature,\nvol. 403, no. 6769, pp. 503 – 511, 2000.\n[38] J. Khan, J. Wei, M. Ringner, L. Saal, M. Ladanyi, F. Westermann,\nF. Berthold, M. Schwab, and C. e. a. Antonescu, \"Classification and\ndiagnostic prediction of cancer using gene expression profiling and\nartificial neural networks.\" Nature Medicine, vol. 7, pp. 673 – 679, 2001.\n[39] D. Witten and R. Tibshirani, \"Penalized classification using fisher's\nlinear discriminant,\" Journal of the Royal Statistical Society. Series B\n(Methodological), vol. 73, no. 5, pp. 753 – 772, 2011.\n[40] H. Zou, \"The adaptive lasso and its oracle properties,\" Journal of the\nAmerican Statistical Association, vol. 101, no. 476, pp. 1418 – 1429,\n2006.\n[41] L. W. Hahn, M. D. Ritchie, and J. H. Moore, \"Multifactor dimensionality\nreduction software for detecting genegene and geneenvironment\ninteractions,\" Bioinformatics, vol. 19, no. 3, pp. 376 – 382, 2003.\n[42] C. Kooperberg, M. LeBlanc, J. Y. Dai, and I. Rajapakse, \"Structures and\nassumptions: Strategies to harness gene x gene and gene x environment\ninteractions in GWAS,\" Statistical Science, vol. 24, no. 4, pp. 472 – 488,\n2009."]}
Published: 2016
Full Text: View/download PDF

26. Biomarker Detection in Association Studies: Modeling SNPs Simultaneously via Logistic ANOVA

Author: Jianhua Z. Huang, Yoonsuh Jung, and Jianhua Hu
Subjects: Statistics and Probability, Logit, Single-nucleotide polymorphism, Genome-wide association study, Computational biology, Biology, Logistic regression, Article, Sample size determination, Statistics, SNP, Analysis of variance, Statistics, Probability and Uncertainty, Genetic association
Abstract: In genome-wide association studies, the primary task is to detect biomarkers in the form of Single Nucleotide Polymorphisms (SNPs) that have nontrivial associations with a disease phenotype and some other important clinical/environmental factors. However, the extremely large number of SNPs comparing to the sample size inhibits application of classical methods such as the multiple logistic regression. Currently the most commonly used approach is still to analyze one SNP at a time. In this paper, we propose to consider the genotypes of the SNPs simultaneously via a logistic analysis of variance (ANOVA) model, which expresses the logit transformed mean of SNP genotypes as the summation of the SNP effects, effects of the disease phenotype and/or other clinical variables, and the interaction effects. We use a reduced-rank representation of the interaction-effect matrix for dimensionality reduction, and employ the L1-penalty in a penalized likelihood framework to filter out the SNPs that have no associations. We develop a Majorization-Minimization algorithm for computational implementation. In addition, we propose a modified BIC criterion to select the penalty parameters and determine the rank number. The proposed method is applied to a Multiple Sclerosis data set and simulated data sets and shows promise in biomarker detection.
Published: 2015

27. Oncology nurses' knowledge of survivorship care planning: the need for education

Author: Yoonsuh Jung, Andrew L. Wessels, and Joanne Lester
Subjects: Oncology, medicine.medical_specialty, Health Knowledge, Attitudes, Practice, Activities of daily living, media_common.quotation_subject, medicine.medical_treatment, Fertility, Human sexuality, Cancer Care Facilities, Nurse's Role, Nursing, Patient Education as Topic, Internal medicine, Survivorship curve, Neoplasms, Surveys and Questionnaires, medicine, Humans, Survivors, Depression (differential diagnoses), media_common, Advanced Practice Nursing, Rehabilitation, business.industry, Oncology Nursing, Cancer, medicine.disease, Patient Discharge, Lymphedema, Cross-Sectional Studies, Family medicine, business, Nurse-Patient Relations
Abstract: Purpose/objectives To survey nurses about their knowledge of cancer survivorship care. Design Descriptive, cross-sectional. Setting Midwestern comprehensive cancer center. Sample 223 registered and advanced practice nurses. Methods Online survey of survivorship knowledge using a 50-item questionnaire derived from the Institute of Medicine report and related publications. Main research variables Concepts of survivorship care and common long-term symptoms. Findings Most nurses reported having knowledge about healthy lifestyle habits; more than 50% of nurses reported having knowledge about chemotherapy, surgery, and radiation therapy, as well as side effects of fatigue, depression, limitations of daily activities, and weight gain; less than 50% of nurses reported having knowledge of impact on family, biologic agents, lymphedema, immunizations or vaccinations, and osteoporosis screening; less than 40% of nurses reported having knowledge about marital and partner relationships, osteoporosis prevention and care, sexuality, side effects of bone marrow transplantation, employment issues, and angiogenesis agents; and less than 25% of nurses reported having knowledge on genetic risks, as well as fertility, financial, and insurance issues. Conclusions Oncology nurses at an academic comprehensive cancer center reported gaps in knowledge consistent with previous studies about knowledge of survivorship care. Implications for nursing The Institute of Medicine has challenged oncology providers to address cancer survivorship care planning. Gaps in cancer survivorship knowledge are evident and will require focused education for this initiative to be successful.
Published: 2014

28. Efficient quantile regression for heteroscedastic models

Author: Yoonsuh Jung, Yoonkyung Lee, Steven N. MacEachern, Yoonsuh Jung, Yoonkyung Lee, and Steven N. MacEachern
Published: 2015
Full Text: View/download PDF

29. Regularization of Case-Specific Parameters for Robustness and Efficiency

Author: Yoonkyung Lee, Yoonsuh Jung, and Steven N. MacEachern
Subjects: FOS: Computer and information sciences, Case indicator, Statistics and Probability, Mathematical optimization, quantile regression, Mean squared error, Computer science, General Mathematics, LASSO, outlier, Regularization (mathematics), Regression, Quantile regression, Methodology (stat.ME), Robustness (computer science), leverage point, Covariate, Outlier, Statistics::Methodology, Leverage (statistics), Statistics, Probability and Uncertainty, large margin classifier, penalized method, Statistics - Methodology
Abstract: Regularization methods allow one to handle a variety of inferential problems where there are more covariates than cases. This allows one to consider a potentially enormous number of covariates for a problem. We exploit the power of these techniques, supersaturating models by augmenting the "natural" covariates in the problem with an additional indicator for each case in the data set. We attach a penalty term for these case-specific indicators which is designed to produce a desired effect. For regression methods with squared error loss, an $\ell_1$ penalty produces a regression which is robust to outliers and high leverage cases; for quantile regression methods, an $\ell_2$ penalty decreases the variance of the fit enough to overcome an increase in bias. The paradigm thus allows us to robustify procedures which lack robustness and to increase the efficiency of procedures which are robust. We provide a general framework for the inclusion of case-specific parameters in regularization problems, describing the impact on the effective loss for a variety of regression and classification problems. We outline a computational strategy by which existing software can be modified to solve the augmented regularization problem, providing conditions under which such modification will converge to the optimum solution. We illustrate the benefits of including case-specific parameters in the context of mean regression and quantile regression through analysis of NHANES and linguistic data sets., Comment: Published in at http://dx.doi.org/10.1214/11-STS377 the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Published: 2012

30. In-Frame cDNA Library Combined with Protein Complementation Assay Identifies ARL11-Binding Partners

Author: Yoonsuh Jung, David J. McConkey, Bogdan Czerniak, Ilkyun Lee, and Sangkyou Lee
Subjects: Macromolecular Assemblies, Receptors, Retinoic Acid, Immunoprecipitation, Molecular Sequence Data, Protein combining, lcsh:Medicine, Gene Expression, Computational biology, Biology, Biochemistry, Cell Line, Protein–protein interaction, Molecular Genetics, 03 medical and health sciences, 0302 clinical medicine, Nucleic Acids, Phosphoglycerate Mutase 1, Complementary DNA, Chemical Biology, Protein Interaction Mapping, Genetics, Humans, Genomic library, Biomacromolecule-Ligand Interactions, Gene Networks, lcsh:Science, Gene, Gene Library, 030304 developmental biology, 0303 health sciences, Multidisciplinary, Base Sequence, ADP-Ribosylation Factors, cDNA library, lcsh:R, Proteins, 3. Good health, 030220 oncology & carcinogenesis, lcsh:Q, Gene Function, 5' Untranslated Regions, Carrier Proteins, Research Article, Cloning, Protein Binding
Abstract: The cDNA expression libraries that produce correct proteins are essential in facilitating the identification of protein-protein interactions. The 5'-untranslated regions (UTRs) that are present in the majority of mammalian and non-mammalian genes are predicted to alter the expression of correct proteins from cDNA libraries. We developed a novel cDNA expression library from which 5'-UTRs were removed using a mixture of polymerase chain reaction primers that complement the Kozak sequences we refer to as an "in-frame cDNA library." We used this library with the protein complementation assay to identify two novel binding partners for ras-related ADP-ribosylation factor-like 11 (ARL11), cellular retinoic acid binding protein 2 (CRABP2), and phosphoglycerate mutase 1 (PGAM1). Thus, the in-frame cDNA library without 5'-UTRs we describe here increases the chance of correctly identifying protein interactions and will have wide applications in both mammalian and non-mammalian detection systems.
Published: 2012

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

30 results on '"Yoonsuh Jung"'

1. In-frame cDNA library combined with protein complementation assay identifies ARL11-binding partners.

2. Tutorial and applications of convolutional neural network models in image classification

3. A review and comparison of convolution neural network models under a unified framework

4. Deep support vector quantile regression with non-crossing constraints

5. Regularized linear censored quantile regression

6. Efficient information-based quantile regression model tuning with heteroscedastic errors

7. Weighted validation of heteroscedastic regression models for better selection

8. Efficient information-based criteria for model selection in quantile regression

9. Comparative study of prediction models forpublic bicycle demand in Seoul

10. Comparison of audio input representations on piano transcription using neural networks

11. Modified check loss for efficient estimation via model selection in quantile regression

12. A numerical study on group quantile regression models

13. Review: Reversed low-rank ANOVA model for transforming high dimensional genetic data into low dimension

14. Optimal regression parameter-specific shrinkage by plug-in estimation

15. Nonlinear regression models for heterogeneous data with massive outliers

16. Transformed low-rank ANOVA models for high-dimensional variable selection

17. Shrinkage estimation of proportion via logit penalty

18. Effect of statin and aspirin use on toxicity and pathological complete response rate of neo-adjuvant chemoradiation for rectal cancer

19. Robust regression for highly corrupted response by shifting outliers

20. AK-fold averaging cross-validation procedure

21. Efficient quantile regression for heteroscedastic models

22. Impact of Concomitant Surgical Atrial Fibrillation Ablation in Patients Undergoing Aortic Valve Replacement

23. A

24. Effect of statin and aspirin use on toxicity and pathological complete response rate of neo-adjuvant chemoradiation for rectal cancer

25. Efficient Tuning Parameter Selection By Cross-Validated Score In High Dimensional Models

26. Biomarker Detection in Association Studies: Modeling SNPs Simultaneously via Logistic ANOVA

27. Oncology nurses' knowledge of survivorship care planning: the need for education

28. Efficient quantile regression for heteroscedastic models

29. Regularization of Case-Specific Parameters for Robustness and Efficiency

30. In-Frame cDNA Library Combined with Protein Complementation Assay Identifies ARL11-Binding Partners

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

Publisher

30 results on '"Yoonsuh Jung"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources