101 results on '"Weixuan Fu"'
Search Results
2. Novel ATXN1/ATXN1L::NUTM2A fusions identified in aggressive infant sarcomas with gene expression and methylation patterns similar to CIC-rearranged sarcoma
- Author
-
Feng Xu, Angela N. Viaene, Jenny Ruiz, Jeffrey Schubert, Jinhua Wu, Jiani Chen, Kajia Cao, Weixuan Fu, Rochelle Bagatell, Zhiqian Fan, Ariel Long, Luca Pagliaroli, Yiming Zhong, Minjie Luo, Portia A. Kreiger, Lea F. Surrey, Gerald B. Wertheim, Kristina A. Cole, Marilyn M. Li, Mariarita Santi, and Phillip B. Storm
- Subjects
CIC-rearranged sarcoma ,ATXN1/ATXN1L-associated fusions ,Whole transcriptome sequencing ,Neurology. Diseases of the nervous system ,RC346-429 - Abstract
Abstract CIC-rearranged sarcomas are newly defined undifferentiated soft tissue tumors with CIC-associated fusions, and dismal prognosis. CIC fusions activate PEA3 family genes, ETV1/4/5, leading to tumorigenesis and progression. We report two high-grade CNS sarcomas of unclear histological diagnosis and one disseminated tumor of unknown origin with novel fusions and similar gene-expression/methylation patterns without CIC rearrangement. All three patients were infants with aggressive diseases, and two experienced rapid disease deterioration and death. Whole-transcriptome sequencing identified an ATXN1-NUTM2A fusion in the two CNS tumors and an ATXN1L-NUTM2A fusion in case 3. ETV1/4/5 and WT1 overexpression were observed in all three cases. Methylation analyses predicted CIC-rearranged sarcoma for all cases. Retrospective IHC staining on case 2 demonstrated ETV4 and WT1 overexpression. ATXN1 and ATXN1L interact with CIC forming a transcription repressor complex. We propose that ATXN1/ATXN1L-associated fusions disrupt their interaction with CIC and decrease the transcription repressor complex, leading to downstream PEA3 family gene overexpression. These three cases with novel ATXN1/ATXN1L-associated fusions and features of CIC-rearranged sarcomas may further expand the scope of “CIC-rearranged” sarcomas to include non-CIC rearrangements. Additional cases are needed to demonstrate if ATXN1/ATXN1L-NUTM2A fusions are associated with younger age and more aggressive diseases.
- Published
- 2022
- Full Text
- View/download PDF
3. The Inactivated ISKNV-I Vaccine Confers Highly Effective Cross-Protection against Epidemic RSIV-I and RSIV-II from Cultured Spotted Sea Bass Lateolabrax maculatus
- Author
-
Weixuan Fu, Yong Li, Yuting Fu, Wenfeng Zhang, Panpan Luo, Qianqian Sun, Fangzhao Yu, Shaoping Weng, Wangdong Li, Jianguo He, and Chuanfu Dong
- Subjects
Megalocytivirus ,RSIV genotype ,ISKNV genotype ,inactivated vaccine ,cross-protection ,infectious spleen and kidney necrosis virus ,Microbiology ,QR1-502 - Abstract
ABSTRACT The genus Megalocytivirus of the family Iridoviridae is composed of two distinct species, namely, infectious spleen and kidney necrosis virus (ISKNV) and scale drop disease virus (SDDV), and both are important causative agents in a variety of bony fish worldwide. Of them, the ISKNV species is subdivided into three genotypes, namely, red seabream iridovirus (RSIV), ISKNV, and turbot reddish body iridovirus (TRBIV), and a further six subgenotypes, RSIV-I, RSIV-II, ISKNV-I, ISKNV-II, TRBIV-I, and TRBIV-II. Commercial vaccines derived from RSIV-I , RSIV-II and ISKNV-I have been available to several fish species. However, studies regarding the cross-protection effect among different genotype or subgenotype isolates have not been fully elucidated. In this study, RSIV-I and RSIV-II were demonstrated as the causative agents in cultured spotted seabass, Lateolabrax maculatus, through serial robust evidence, including cell culture-based viral isolation, whole-genome determination and phylogeny analysis, artificial challenge, histopathology, immunohistochemistry, and immunofluorescence as well as transmission electron microscope observation. Thereafter, a formalin-killed cell (FKC) vaccine generated from an ISKNV-I isolate was prepared to evaluate the protective effects against two spotted seabass original RSIV-I and RSIV-II. The result showed that the ISKNV-I-based FKC vaccine conferred almost complete cross-protection against RSIV-I and RSIV-II as well as ISKNV-I itself. No serotype difference was observed among RSIV-I, RSIV-II, and ISKNV-I. Additionally, the mandarin fish Siniperca chuatsi is proposed as an ideal infection and vaccination fish species for the study of various megalocytiviral isolates. IMPORTANCE Red seabream iridovirus (RSIV) infects a wide mariculture bony fish and has resulted in significant annual economic loss worldwide. Previous studies showed that the phenotypic diversity of infectious RSIV isolates would lead to different virulence characteristics, viral antigenicity, and vaccine efficacy as well as host range. Importantly, it is still doubted whether a universal vaccine could confer the same highly protective effect against various genotypic isolates. Our study here presented enough experimental evidence that a water in oil (w/o) formation of inactivated ISKNV-I vaccine could confer almost complete protection against RSIV-I and RSIV-II as well as ISKNV-I itself. Our study provides valuable data for better understanding the differential infection and immunity among different genotypes of ISKNV and RSIV isolates in the genus Megalocytivirus.
- Published
- 2023
- Full Text
- View/download PDF
4. P588: Identification of NF1-associated tumor mutations in plasma cfDNA and its clinical application
- Author
-
Feng Xu, Ariel Long, Matthew Lueder, Kajia Cao, Weixuan Fu, Zhiqian Fan, Mateusz Koptyra, Jessica Foster, Thomas De Raedt, Chelsea Kotch, Michael Fisher, and Marilyn Li
- Subjects
Genetics ,QH426-470 ,Medicine - Published
- 2023
- Full Text
- View/download PDF
5. Embedding covariate adjustments in tree-based automated machine learning for biomedical big data analyses
- Author
-
Elisabetta Manduchi, Weixuan Fu, Joseph D. Romano, Stefano Ruberto, and Jason H. Moore
- Subjects
AutoML ,Covariate adjustment ,Genetic programming ,Pathways ,Feature importance ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background A typical task in bioinformatics consists of identifying which features are associated with a target outcome of interest and building a predictive model. Automated machine learning (AutoML) systems such as the Tree-based Pipeline Optimization Tool (TPOT) constitute an appealing approach to this end. However, in biomedical data, there are often baseline characteristics of the subjects in a study or batch effects that need to be adjusted for in order to better isolate the effects of the features of interest on the target. Thus, the ability to perform covariate adjustments becomes particularly important for applications of AutoML to biomedical big data analysis. Results We developed an approach to adjust for covariates affecting features and/or target in TPOT. Our approach is based on regressing out the covariates in a manner that avoids ‘leakage’ during the cross-validation training procedure. We describe applications of this approach to toxicogenomics and schizophrenia gene expression data sets. The TPOT extensions discussed in this work are available at https://github.com/EpistasisLab/tpot/tree/v0.11.1-resAdj . Conclusions In this work, we address an important need in the context of AutoML, which is particularly crucial for applications to bioinformatics and medical informatics, namely covariate adjustments. To this end we present a substantial extension of TPOT, a genetic programming based AutoML approach. We show the utility of this extension by applications to large toxicogenomics and differential gene expression data. The method is generally applicable in many other scenarios from the biomedical field.
- Published
- 2020
- Full Text
- View/download PDF
6. Investigating the parameter space of evolutionary algorithms
- Author
-
Moshe Sipper, Weixuan Fu, Karuna Ahuja, and Jason H. Moore
- Subjects
Evolutionary algorithms ,Genetic programming ,Meta-genetic algorithm ,Parameter tuning ,Hyper-parameter ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Analysis ,QA299.6-433 - Abstract
Abstract Evolutionary computation (EC) has been widely applied to biological and biomedical data. The practice of EC involves the tuning of many parameters, such as population size, generation count, selection size, and crossover and mutation rates. Through an extensive series of experiments over multiple evolutionary algorithm implementations and 25 problems we show that parameter space tends to be rife with viable parameters, at least for the problems studied herein. We discuss the implications of this finding in practice for the researcher employing EC.
- Published
- 2018
- Full Text
- View/download PDF
7. Scale Drop Disease Virus Associated Yellowfin Seabream (Acanthopagrus latus) Ascites Diseases, Zhuhai, Guangdong, Southern China: The First Description
- Author
-
Yuting Fu, Yong Li, Weixuan Fu, Huibing Su, Long Zhang, Congling Huang, Shaoping Weng, Fangzhao Yu, Jianguo He, and Chuanfu Dong
- Subjects
scale drop disease virus ,yellowfin seabream ascites diseases ,genome ,proteome ,pathogenicity ,Microbiology ,QR1-502 - Abstract
Scale drop disease virus (SDDV), an emerging piscine iridovirus prevalent in farmed Asian seabass Lates calcarifer in Southeast Asia, was firstly scientifically descripted in Singapore in 2015. Here, an SDDV isolate ZH-06/20 was isolated by inoculating filtered ascites from diseased juvenile yellowfin seabream into MFF-1 cell. Advanced cytopathic effects were observed 6 days post-inoculation. A transmission electron microscopy examination confirmed that numerous virion particles, about 140 nm in diameter, were observed in infected MFF-1 cell. ZH-06/20 was further purified and both whole genome and virion proteome were determined. The results showed that ZH-06/20 was composed of 131,122 bp with 135 putative viral proteins and 113 of them were further detected by virion proteome. Western blot analysis showed that no (or weak) cross-reaction was observed among several major viral proteins between ZH-06/20 and ISKNV-like megalocytivirus. An artificial challenge showed that ZH-06/20 could cause 100% death to juvenile yellowfin seabream. A typical sign was characterized by severe ascites, but not scale drop, which was considerably different from SDD syndrome in Asian seabass. Collectively, SDDV was confirmed, for the first time, as the causative agent of ascites diseases in farmed yellowfin seabream. Our study offers useful information to better understanding SDDV-associated diseases in farmed fish.
- Published
- 2021
- Full Text
- View/download PDF
8. Correction to: Investigating the parameter space of evolutionary algorithms
- Author
-
Moshe Sipper, Weixuan Fu, Karuna Ahuja, and Jason H. Moore
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Analysis ,QA299.6-433 - Abstract
Following publication of the original article [1], an error was reported in one of the experiments.
- Published
- 2019
- Full Text
- View/download PDF
9. Association of the Porcine Cluster of Differentiation 4 Gene with T Lymphocyte Subpopulations and Its Expression in Immune Tissues
- Author
-
Jingen Xu, Yang Liu, Weixuan Fu, Jiying Wang, Wenwen Wang, Haifei Wang, Jianfeng Liu, Xiangdong Ding, and Qin Zhang
- Subjects
Pig ,CD4 ,Polymorphisms ,T lymphocyte subpopulations ,Expression ,Animal culture ,SF1-1100 ,Animal biochemistry ,QP501-801 - Abstract
Cluster of differentiation 4 (CD4) is mainly expressed on CD4+ T cells, which plays an important role in immune response. The aim of this study was to detect the association between polymorphisms of the CD4 gene and T lymphocyte subpopulations in pigs, and to investigate the effects of genetic variation on the CD4 gene expression level in immune tissues. Five missense mutations in the CD4 gene were identified using DNA pooling sequencing assays, and two main haplotypes (CCTCC and AGCTG) in strong linkage disequilibrium (with frequencies of 50.26% and 46.34%, respectively) were detected in the population of Large White pigs. Our results indicated that the five SNPs and the two haplotypes were significantly associated with the proportions of CD4−CD8−, CD4+CD8+, CD4+CD8−, CD4+ and CD4+/CD8+ in peripheral blood (p0.05). These results indicate that the CD4 gene may influence T lymphocyte subpopulations and can be considered as a candidate gene affecting immunity in pigs.
- Published
- 2013
- Full Text
- View/download PDF
10. Tissues Expression, Polymorphisms of IFN Regulatory Factor 6 (IRF6) Gene and Their Associated with Immune Traits in Three Pig Populations
- Author
-
Yang Liu, Jingeng Xu, Weixuan Fu, Ziqing Weng, Xiaoyan Niu, Jianfeng Liu, Xiangdong Ding, and Qin Zhang
- Subjects
Pig ,IRF6 ,Expression ,Polymorphisms ,Association Analysis ,Animal culture ,SF1-1100 ,Animal biochemistry ,QP501-801 - Abstract
Interferon regulatory factor 6 (IRF6) gene is a member of the IRF-family, and plays functionally diverse roles in the regulation of the immune system. In this report, the 13,720 bp porcine IRF6 genomic DNA structure was firstly identified with a putative IRF6 protein of 467 amino acids. Alignment and phylogenetic analysis of the porcine IRF6 amino acid sequences with their homologies to other species showed high identity (over 96%). Tissues expression of IRF6 mRNA was observed by RT-PCR, the results revealed IRF6 expressed widely in eight tissues. One SNP (HQ026023:1383 G>C) in exon7 and two SNPs (HQ026023:130 G>A; 232 C>T) in the 5 ′ promoter region of porcine IRF6 gene were demonstrated b y DNA sequencing analysis. A further analysis of SNP genotypes associated with immune traits including IFN-γ and IL10 concentrations in serum was carried out in three pig populations including Large White, Landraces and Songliao Black pig (a Chinese indigenous breed). The results showed that the SNP (HQ026023:1383 G>C) was significantly associated with the level of IFN-γ (d 20) in serum (p = 0.038) and the ratio of IFN-γ to IL10 (d 20) in serum (p = 0.041); The other two SNPs (HQ026023:130 G>A; 232 C>T) were highly significantly associated with IL10 level in serum both at the day 20 (p = 0.005; p = 0.001) and the day 35 (p = 0.004; p = 0.006). Identification of the porcine IRF6 gene will help our further understanding of the molecular basis of the IFN regulation pathway in the porcine immune response. All these results should indicate that the IRF6 gene can be regarded as a molecular marker associated with the IL10 level in serum and used for genetic selection in the pig breeding.
- Published
- 2012
- Full Text
- View/download PDF
11. Expression profile and association analysis of the porcine DQB1 gene with peripheral blood T lymphocyte subsets
- Author
-
Jingen Xu, Zhihua Cai, Yang Liu, Chonglong Wang, Weixuan Fu, Haifei Wang, Wenwen Wang, and Qin Zhang
- Subjects
Pig ,DQB1 ,Polymorphism ,T lymphocyte subsets ,Expression ,Animal culture ,SF1-1100 - Abstract
Major histocompatibility complex (MHC) class II molecules play an important role in immunology by presenting antigens to T lymphocytes. As a key member of the MHC class II gene family, the DQB1 gene is involved in interacting with T cells in immune reactions. This study was designed to screen variations in exon 3 of the DQB1 gene in Large White, Landrace and Songliao Black pigs, and to investigate the association between DQB1 gene polymorphisms and peripheral blood T lymphocyte subsets in Large White samples. In addition, the spatial transcription profile of the DQB1 gene was examined, and the effect of gene polymorphisms on its mRNA level was further analyzed. One missense mutation in exon 3 of the DQB1 gene was first identified by DNA pool sequencing, and samples were genotyped by using the polymerase chain reactionrestriction fragment length polymorphism method. Statistical analysis indicated that the DQB1 genotype was significantly associated with CD4+CD8-, CD4-CD8+ and CD4+/CD8+ indexes (PDQB1 was predominantly expressed in the spleen. However, no significant differences in splenic DQB1 mRNA levels of pigs with different genotypes were observed (P>0.05). These results suggest that the DQB1 gene may be a promising candidate gene for porcine disease resistance.
- Published
- 2015
- Full Text
- View/download PDF
12. A multiple-SNP approach for genome-wide association study of milk production traits in Chinese Holstein cattle.
- Author
-
Ming Fang, Weixuan Fu, Dan Jiang, Qin Zhang, Dongxiao Sun, Xiangdong Ding, and Jianfeng Liu
- Subjects
Medicine ,Science - Abstract
The multiple-SNP analysis has been studied by many researchers, in which the effects of multiple SNPs are simultaneously estimated and tested in a multiple linear regression. The multiple-SNP association analysis usually has higher power and lower false-positive rate for detecting causative SNP(s) than single marker analysis (SMA). Several methods have been proposed to simultaneously estimate and test multiple SNP effects. In this research, a fast method called MEML (Mixed model based Expectation-Maximization Lasso algorithm) was developed for simultaneously estimate of multiple SNP effects. An improved Lasso prior was assigned to SNP effects which were estimated by searching the maximum joint posterior mode. The residual polygenic effect was included in the model to absorb many tiny SNP effects, which is treated as missing data in our EM algorithm. A series of simulation experiments were conducted to validate the proposed method, and the results showed that compared with SMMA, the new method can dramatically decrease the false-positive rate. The new method was also applied to the 50k SNP-panel dataset for genome-wide association study of milk production traits in Chinese Holstein cattle. Totally, 39 significant SNPs and their nearby 25 genes were found. The number of significant SNPs is remarkably fewer than that by SMMA which found 105 significant SNPs. Among 39 significant SNPs, 8 were also found by SMMA and several well-known QTLs or genes were confirmed again; furthermore, we also got some positional candidate gene with potential function of effecting milk production traits. These novel findings in our research should be valuable for further investigation.
- Published
- 2014
- Full Text
- View/download PDF
13. Gene silencing of porcine MUC13 and ITGB5: candidate genes towards Escherichia coli F4ac adhesion.
- Author
-
Chuanli Zhou, Zhengzhu Liu, Yang Liu, Weixuan Fu, Xiangdong Ding, Jianfeng Liu, Ying Yu, and Qin Zhang
- Subjects
Medicine ,Science - Abstract
BACKGROUND: Integrin beta-5 (ITGB5) and mucin 13 (MUC13) genes are highly expressed on the apical surface of intestinal epithelia and are thought to be candidate genes for controlling the expression of the receptor for enterotoxigenic Escherichia coli (ETEC) F4ac. Human MUC13 protein has an expected role in protecting intestinal mucosal surfaces and porcine ITGB5 is a newly identified potential receptor for ETEC F4ac. METHODOLOGY/PRINCIPAL FINDINGS: To test the hypothesis that ITGB5 and MUC13 both play key roles in protection of the intestinal mucosa against pathogenic bacterium, porcine intestinal epithelial cells (IPEC-J2) were transfected with ITGB5-targeting, MUC13-targeting or negative control small interfering RNA (siRNA), respectively. Firstly, we measured mRNA expression levels of mucin genes (MUC4, MUC20), pro-inflammatory genes (IL8, IL1A, IL6, CXCL2), anti-inflammatory mediator SLPI, and PLAU after RNAi treatments with and without ETEC infection. Secondly, we compared the adhesions of ETEC to the pre- and post-knockdown IPEC-J2 cells of ITGB5 and MUC13, respectively. We found that ITGB5 and MUC13 knockdown both had small but significant effects in attenuating the inflammation induced by ETEC infection, and both increased bacterial adhesion in response to F4ac ETEC exposure. CONCLUSIONS/SIGNIFICANCE: Our current study first reported that ITGB5 and MUC13 are important adhesion molecules of mucosal epithelial signaling in response to Escherichia coli in pigs. These data suggest that both ITGB5 and MUC13 play key roles in defending the attachment and adhesion of ETEC to porcine jejunal cells and in maintaining epithelial barrier and immunity function.
- Published
- 2013
- Full Text
- View/download PDF
14. Genome-wide association study for cytokines and immunoglobulin G in swine.
- Author
-
Xin Lu, JianFeng Liu, WeiXuan Fu, JiaPeng Zhou, YanRu Luo, XiangDong Ding, Yang Liu, and Qin Zhang
- Subjects
Medicine ,Science - Abstract
Increased disease resistance through improved immune capacity would be beneficial for the welfare and productivity of farm animals. To identify genomic regions responsible for immune capacity traits in swine, a genome-wide association study was conducted. In total, 675 pigs were included. At 21 days of age, all piglets were vaccinated with modified live classical swine fever vaccine. Blood samples were sampled when the piglets were 20 and 35 days of age, respectively. Four traits, including Interferon-gamma (IFN-γ) and Interleukin 10 (IL-10) levels, the ratio of IFN-γ to IL-10 and Immunoglobulin G (IgG) blocking percentage to CSFV in serum were measured. All the samples were genotyped for 62,163 single nucleotide polymorphisms (SNP) using the Illumina porcineSNP60k BeadChip. After quality control, 46,079 SNPs were selected for association tests based on a single-locus regression model. To tackle the issue of multiple testing, 10,000 permutations were performed to determine the chromosome-wise and genome-wise significance level. In total, 32 SNPs with chromosome-wise significance level (including 4 SNPs with genome-wise significance level) were identified. These SNPs account for 3.23% to 13.81% of the total phenotypic variance individually. For the four traits, the numbers of significant SNPs range from 5 to 15, which jointly account for 37.52%, 82.94%, 26.74% and 24.16% of the total phenotypic variance of IFN-γ, IL-10, IFN-γ/IL-10, and IgG, respectively. Several significant SNPs are located within the QTL regions reported in previous studies. Furthermore, several significant SNPs fall into the regions which harbour a number of known immunity-related genes. Results herein lay a preliminary foundation for further identifying the causal mutations affecting swine immune capacity in follow-up studies.
- Published
- 2013
- Full Text
- View/download PDF
15. Genetic Analysis of Coronary Artery Disease Using Tree-Based Automated Machine Learning Informed By Biology-Based Feature Selection.
- Author
-
Elisabetta Manduchi, Trang T. Le, Weixuan Fu, and Jason H. Moore
- Published
- 2022
- Full Text
- View/download PDF
16. PMLB v1.0: an open-source dataset collection for benchmarking machine learning methods.
- Author
-
Joseph D. Romano, Trang T. Le, William G. La Cava, John T. Gregg, Daniel J. Goldberg, Praneel Chakraborty, Natasha L. Ray, Daniel S. Himmelstein, Weixuan Fu, and Jason H. Moore
- Published
- 2022
- Full Text
- View/download PDF
17. Evaluating recommender systems for AI-driven biomedical informatics.
- Author
-
William G. La Cava, Heather Williams, Weixuan Fu, Steven Vitale, Durga Srivatsan, and Jason H. Moore
- Published
- 2021
- Full Text
- View/download PDF
18. TPOT-NN: augmenting tree-based automated machine learning with neural network estimators.
- Author
-
Joseph D. Romano, Trang T. Le, Weixuan Fu, and Jason H. Moore
- Published
- 2021
- Full Text
- View/download PDF
19. Scaling tree-based automated machine learning to biomedical big data with a feature set selector.
- Author
-
Trang T. Le, Weixuan Fu, and Jason H. Moore
- Published
- 2020
- Full Text
- View/download PDF
20. A System for Accessible Artificial Intelligence.
- Author
-
Randal S. Olson, Moshe Sipper, William G. La Cava, Sharon Tartarone, Steven Vitale, Weixuan Fu, Patryk Orzechowski, Ryan J. Urbanowicz, John H. Holmes, and Jason H. Moore
- Published
- 2017
- Full Text
- View/download PDF
21. Large scale biomedical data analysis with tree-based automated machine learning.
- Author
-
Trang T. Le, Weixuan Fu, and Jason H. Moore
- Published
- 2020
- Full Text
- View/download PDF
22. Is deep learning necessary for simple classification tasks?
- Author
-
Joseph D. Romano, Trang T. Le, Weixuan Fu, and Jason H. Moore
- Published
- 2020
23. PMLB v1.0: an open source dataset collection for benchmarking machine learning methods.
- Author
-
Trang T. Le, William G. La Cava, Joseph D. Romano, John T. Gregg, Daniel J. Goldberg, Praneel Chakraborty, Natasha L. Ray, Daniel S. Himmelstein, Weixuan Fu, and Jason H. Moore
- Published
- 2020
24. Integrated machine learning pipeline for aberrant biomarker enrichment (i-mAB): characterizing clusters of differentiation within a compendium of systemic lupus erythematosus patients.
- Author
-
Trang T. Le, Nigel O. Blackwood, Jaclyn N. Taroni, Weixuan Fu, and Matthew K. Breitenstein
- Published
- 2018
25. Evaluating recommender systems for AI-driven data science.
- Author
-
William G. La Cava, Heather Williams, Weixuan Fu, and Jason H. Moore
- Published
- 2019
26. Evolutionary computation: an investigation of parameter space.
- Author
-
Moshe Sipper, Weixuan Fu, Karuna Ahuja, and Jason H. Moore
- Published
- 2018
- Full Text
- View/download PDF
27. From MEGATON to RASCAL: Surfing the Parameter Space of Evolutionary Algorithms.
- Author
-
Moshe Sipper, Weixuan Fu, Karuna Ahuja, and Jason H. Moore
- Published
- 2017
28. A mandarinfish Siniperca chuatsi infection and vaccination model for SDDV and efficacy evaluation of the formalin-killed cell vaccine in yellowfin seabream Acanthopagrus latus
- Author
-
Yuting Fu, Yong Li, Jiaming Chen, Fangzhao Yu, Xiangrong Liu, Weixuan Fu, Hongrun Pan, Wangdong Li, Shaoping Weng, Jianguo He, and Chuanfu Dong
- Subjects
Aquatic Science - Published
- 2023
- Full Text
- View/download PDF
29. PMLB v1.0: an open-source dataset collection for benchmarking machine learning methods
- Author
-
Daniel Himmelstein, Daniel J. Goldberg, Praneel Chakraborty, John T. Gregg, Trang T. Le, Weixuan Fu, Joseph D. Romano, Natasha L. Ray, William La Cava, and Jason H. Moore
- Subjects
Statistics and Probability ,AcademicSubjects/SCI01060 ,Computer science ,Interface (Java) ,business.industry ,Databases and Ontologies ,Benchmarking ,Python (programming language) ,Information repository ,Machine learning ,computer.software_genre ,Biochemistry ,Applications Notes ,Computer Science Applications ,Computational Mathematics ,Documentation ,Computational Theory and Mathematics ,User experience design ,Benchmark (computing) ,Artificial intelligence ,business ,Molecular Biology ,computer ,Categorical variable ,computer.programming_language - Abstract
Motivation Novel machine learning and statistical modeling studies rely on standardized comparisons to existing methods using well-studied benchmark datasets. Few tools exist that provide rapid access to many of these datasets through a standardized, user-friendly interface that integrates well with popular data science workflows. Results This release of PMLB (Penn Machine Learning Benchmarks) provides the largest collection of diverse, public benchmark datasets for evaluating new machine learning and data science methods aggregated in one location. v1.0 introduces a number of critical improvements developed following discussions with the open-source community. Availability and implementation PMLB is available at https://github.com/EpistasisLab/pmlb. Python and R interfaces for PMLB can be installed through the Python Package Index and Comprehensive R Archive Network, respectively.
- Published
- 2021
30. TPOT-NN: augmenting tree-based automated machine learning with neural network estimators
- Author
-
Trang T. Le, Weixuan Fu, Joseph D. Romano, and Jason H. Moore
- Subjects
Artificial neural network ,business.industry ,Computer science ,Estimator ,Context (language use) ,02 engineering and technology ,Machine learning ,computer.software_genre ,Field (computer science) ,Computer Science Applications ,Theoretical Computer Science ,Tree (data structure) ,Software ,Binary classification ,Hardware and Architecture ,020204 information systems ,0202 electrical engineering, electronic engineering, information engineering ,Benchmark (computing) ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,computer - Abstract
Automated machine learning (AutoML) and artificial neural networks (ANNs) have revolutionized the field of artificial intelligence by yielding incredibly high-performing models to solve a myriad of inductive learning tasks. In spite of their successes, little guidance exists on when to use one versus the other. Furthermore, relatively few tools exist that allow the integration of both AutoML and ANNs in the same analysis to yield results combining both of their strengths. Here, we present TPOT-NN—a new extension to the tree-based AutoML software TPOT—and use it to explore the behavior of automated machine learning augmented with neural network estimators (AutoML+NN), particularly when compared to non-NN AutoML in the context of simple binary classification on a number of public benchmark datasets. Our observations suggest that TPOT-NN is an effective tool that achieves greater classification accuracy than standard tree-based AutoML on some datasets, with no loss in accuracy on others. We also provide preliminary guidelines for performing AutoML+NN analyses, and recommend possible future directions for AutoML+NN methods research, especially in the context of TPOT.
- Published
- 2021
- Full Text
- View/download PDF
31. Effectively protecting Asian seabass Lates calcarifer from ISKNV-I, ISKNV-II, RSIV-II and SDDV by an inactivated ISKNV-I and SDDV bivalent vaccine
- Author
-
Yuting Fu, Yong Li, Wenfeng Zhang, Weixuan Fu, Wangdong Li, Zhiming Zhu, Shaoping Weng, Jianguo He, and Chuanfu Dong
- Subjects
Aquatic Science - Published
- 2023
- Full Text
- View/download PDF
32. RF22 | PSUN318 Hepatocytes Exposed to PFOA Prior to Differentiation Leads to Epigenetic Changes in Genes Linked With NAFLD
- Author
-
Weixuan Fu, Apoorva Joshi, Sara Elizabeth Pinney, Pheruza Tarapore, Zhiping Wang, and Stephanie Green
- Subjects
Endocrinology, Diabetes and Metabolism - Abstract
Background Perfluorooctanoic acid (PFOA), is a persistent fluorinated compound with oil and water repelling properties found in cookware, food packaging and municipal water systems. Adult animals exposed to PFOA develop hepatomegaly, fatty liver, peroxisome proliferation, and immunotoxicity. Rodents exposed to PFOA in utero have altered hepatic lipid metabolism, increased hepatic de novo lipogenesis and susceptibility to non-alcoholic fatty liver disease (NAFLD), but underlying molecular mechanisms remain unknown. With increasing rates of obesity, diabetes, and NAFLD it is critical to examine the mechanisms by which in utero exposure to PFOA contributes to the development of metabolic syndrome in offspring. Objectives To characterize how PFOA exposure during hepatocyte differentiation leads to the development of NAFLD through alterations in DNA methylation profiles and changes in the availability of transcription factor binding sites. Methods HepaRG cells (human-derived hepatocyte progenitor cells) were treated with 0.5uM PFOA or vehicle for 48 hours followed by differentiation. Undifferentiated and differentiated hepatocytes exposed to PFOA were assessed relative to controls (n=4). RNASeq was completed; DESeq2 identified differentially expressed genes via false discovery rate (FDR) of 10%, and FDR of Results PFOA treatment resulted in decreased expression of the transcription factors early growth response protein 1 (EGR1), nuclear receptor Nur77 (NR4A1), early growth response protein 2 (EGR2), krueppel-like factor 10 (KLF10) and fos-related antigen 1 (FOSL1), which are key genes linked to impaired hepatic insulin signaling, lipid metabolism, steatosis and fibrosis (q Conclusions We conclude hepatocyte progenitor cells exposed to low dose PFOA results in changes in DNA methylation and expression of key metabolic genes linked to NAFLD, notably EGR1, a gene previously linked to NAFLD, suggesting PFOA exposure in utero has lasting effects. Presentation: Sunday, June 12, 2022 12:30 p.m. - 2:30 p.m., Sunday, June 12, 2022 12:48 p.m. - 12:53 p.m.
- Published
- 2022
- Full Text
- View/download PDF
33. Development of two cell lines from yellowfin seabream Acanthopagrus latus fin and brain suitable for propagating SDDV but not for ISKNV, MRV and GIV
- Author
-
Yuting Fu, Yong Li, Xiangrong Liu, Weixuan Fu, Shaoping Weng, Fangzhao Yu, Jianguo He, and Chuanfu Dong
- Subjects
Aquatic Science - Published
- 2022
- Full Text
- View/download PDF
34. Scale Drop Disease Virus Associated Yellowfin Seabream (Acanthopagrus latus) Ascites Diseases, Zhuhai, Guangdong, Southern China: The First Description
- Author
-
Shaoping Weng, Fangzhao Yu, Jianguo He, Chuanfu Dong, Congling Huang, Yong Li, Yuting Fu, Weixuan Fu, Long Zhang, and Su Huibing
- Subjects
China ,food.ingredient ,Fish farming ,Iridovirus ,proteome ,Acanthopagrus latus ,scale drop disease virus ,Genome, Viral ,Megalocytivirus ,Microbiology ,Article ,Virus ,Fish Diseases ,Viral Proteins ,food ,yellowfin seabream ascites diseases ,Microscopy, Electron, Transmission ,Western blot ,Virology ,medicine ,Animals ,Juvenile ,pathogenicity ,genome ,Phylogeny ,biology ,medicine.diagnostic_test ,Virion ,Ascites ,biology.organism_classification ,DNA Virus Infections ,Sea Bream ,QR1-502 ,Iridoviridae ,Infectious Diseases ,Proteome - Abstract
Scale drop disease virus (SDDV), an emerging piscine iridovirus prevalent in farmed Asian seabass Lates calcarifer in Southeast Asia, was firstly scientifically descripted in Singapore in 2015. Here, an SDDV isolate ZH-06/20 was isolated by inoculating filtered ascites from diseased juvenile yellowfin seabream into MFF-1 cell. Advanced cytopathic effects were observed 6 days post-inoculation. A transmission electron microscopy examination confirmed that numerous virion particles, about 140 nm in diameter, were observed in infected MFF-1 cell. ZH-06/20 was further purified and both whole genome and virion proteome were determined. The results showed that ZH-06/20 was composed of 131,122 bp with 135 putative viral proteins and 113 of them were further detected by virion proteome. Western blot analysis showed that no (or weak) cross-reaction was observed among several major viral proteins between ZH-06/20 and ISKNV-like megalocytivirus. An artificial challenge showed that ZH-06/20 could cause 100% death to juvenile yellowfin seabream. A typical sign was characterized by severe ascites, but not scale drop, which was considerably different from SDD syndrome in Asian seabass. Collectively, SDDV was confirmed, for the first time, as the causative agent of ascites diseases in farmed yellowfin seabream. Our study offers useful information to better understanding SDDV-associated diseases in farmed fish.
- Published
- 2021
- Full Text
- View/download PDF
35. eP313: Too few or too many? Variant reporting burden and diagnostic comparisons of an extensive gene panel with exome-sequencing in immunodeficiency
- Author
-
Erfan Aref-Eshghi, Kajia Cao, Weixuan Fu, Kathleen Wood, Nancy Spinner, Matthew Dulik, Laura Conlin, Jing Wang, and Minjie Luo
- Subjects
Genetics (clinical) - Published
- 2022
- Full Text
- View/download PDF
36. Immunological ignorance is an enabling feature of the oligo-clonal T cell response to melanoma neoantigens
- Author
-
Beatriz M. Carreno, Zachary L. Skidmore, Michelle Becker-Hapak, David H. Spencer, Alexander S. Krupnick, Vincent Magrini, Maya G. Robnett, Obi L. Griffith, Ryan Demeter, Saghar Kaabinejadian, Casey L. Cummins, Gerald P. Linette, Weixuan Fu, Chong Xu, Elaine R. Mardis, William H. Hildebrand, Miren L. Baroja, Jasreet Hundal, and Malachi Griffith
- Subjects
Lung Neoplasms ,Receptors, Antigen, T-Cell, alpha-beta ,T cell ,T-Cell Antigen Receptor Specificity ,Human leukocyte antigen ,CD8-Positive T-Lymphocytes ,Biology ,Cancer Vaccines ,Polymorphism, Single Nucleotide ,Epitope ,Lymphocytes, Tumor-Infiltrating ,Antigens, Neoplasm ,HLA Antigens ,T-Lymphocyte Subsets ,medicine ,Humans ,Cytotoxic T cell ,Retroperitoneal Neoplasms ,Melanoma ,Multidisciplinary ,integumentary system ,Vaccination ,T-cell receptor ,DNA, Neoplasm ,Dendritic Cells ,Sequence Analysis, DNA ,Dendritic cell ,Clone Cells ,medicine.anatomical_structure ,Amino Acid Substitution ,PNAS Plus ,Monoclonal ,Cancer research ,Tumor Escape ,CD8 - Abstract
The impact of intratumoral heterogeneity (ITH) and the resultant neoantigen landscape on T cell immunity are poorly understood. ITH is a widely recognized feature of solid tumors and poses distinct challenges related to the development of effective therapeutic strategies, including cancer neoantigen vaccines. Here, we performed deep targeted DNA sequencing of multiple metastases from melanoma patients and observed ubiquitous sharing of clonal and subclonal single nucleotide variants (SNVs) encoding putative HLA class I-restricted neoantigen epitopes. However, spontaneous antitumor CD8+ T cell immunity in peripheral blood and tumors was restricted to a few clonal neoantigens featuring an oligo-/monoclonal T cell-receptor (TCR) repertoire. Moreover, in various tumors of the 4 patients examined, no neoantigen-specific TCR clonotypes were identified despite clonal neoantigen expression. Mature dendritic cell (mDC) vaccination with tumor-encoded amino acid-substituted (AAS) peptides revealed diverse neoantigen-specific CD8+ T responses, each composed of multiple TCR clonotypes. Isolation of T cell clones by limiting dilution from tumor-infiltrating lymphocytes (TILs) permitted functional validation regarding neoantigen specificity. Gene transfer of TCRαβ heterodimers specific for clonal neoantigens confirmed correct TCR clonotype assignments based on high-throughput TCRBV CDR3 sequencing. Our findings implicate immunological ignorance of clonal neoantigens as the basis for ineffective T cell immunity to melanoma and support the concept that therapeutic vaccination, as an adjunct to checkpoint inhibitor treatment, is required to increase the breadth and diversity of neoantigen-specific CD8+ T cells.
- Published
- 2019
- Full Text
- View/download PDF
37. ATF4 couples MYC-dependent translational activity to bioenergetic demands during tumour progression
- Author
-
Frank Chinga, Serge Y. Fuchs, Zoya Ignatova, Jiangbin Ye, Crystal S. Conn, Weixuan Fu, Constantinos Koumenis, Andrew V. Kossenkov, Carlo Salas Salinas, Ioannis I. Verginadis, Nektaria Maria Leli, Ravi K. Amaravadi, Christine Polte, Paul P. Wang, Alexandra M Monroy, Davide Ruggero, J. Alan Diehl, Feven Tameire, and Rani Ojha
- Subjects
Transcriptional Activation ,Genes, myc ,Cell Cycle Proteins ,Mice, Transgenic ,mTORC1 ,Mechanistic Target of Rapamycin Complex 1 ,Biology ,Activating Transcription Factor 4 ,Medical and Health Sciences ,Article ,Transgenic ,Mice ,03 medical and health sciences ,0302 clinical medicine ,Genetics ,Protein biosynthesis ,2.1 Biological and endogenous factors ,Animals ,Humans ,Initiation factor ,Aetiology ,Phosphorylation ,Cancer ,Adaptor Proteins, Signal Transducing ,030304 developmental biology ,0303 health sciences ,Oncogene ,TOR Serine-Threonine Kinases ,Endoplasmic reticulum ,ATF4 ,Signal Transducing ,Adaptor Proteins ,Translation (biology) ,Cell Biology ,myc ,Biological Sciences ,Endoplasmic Reticulum Stress ,Phosphoproteins ,Cell biology ,Genes ,Protein Biosynthesis ,030220 oncology & carcinogenesis ,Developmental Biology - Abstract
The c-Myc oncogene drives malignant progression and induces robust anabolic and proliferative programmes leading to intrinsic stress. The mechanisms enabling adaptation to MYC-induced stress are not fully understood. Here we reveal an essential role for activating transcription factor 4 (ATF4) in survival following MYC activation. MYC upregulates ATF4 by activating general control nonderepressible 2 (GCN2) kinase through uncharged transfer RNAs. Subsequently, ATF4 co-occupies promoter regions of over 30 MYC-target genes, primarily those regulating amino acid and protein synthesis, including eukaryotic translation initiation factor 4E-binding protein 1 (4E-BP1), a negative regulator of translation. 4E-BP1 relieves MYC-induced proteotoxic stress and is essential to balance protein synthesis. 4E-BP1 activity is negatively regulated by mammalian target of rapamycin complex 1 (mTORC1)-dependent phosphorylation and inhibition of mTORC1 signalling rescues ATF4-deficient cells from MYC-induced endoplasmic reticulum stress. Acute deletion of ATF4 significantly delays MYC-driven tumour progression and increases survival in mouse models. Our results establish ATF4 as a cellular rheostat of MYC activity, which ensures that enhanced translation rates are compatible with survival and tumour progression.
- Published
- 2019
- Full Text
- View/download PDF
38. Genetic analysis of coronary artery disease using tree-based automated machine learning informed by biology-based feature selection
- Author
-
Weixuan Fu, Jason H. Moore, Elisabetta Manduchi, and Trang T. Le
- Subjects
business.industry ,Applied Mathematics ,Single-nucleotide polymorphism ,Genomics ,CAD ,Feature selection ,Coronary Artery Disease ,Biology ,Precision medicine ,Machine learning ,computer.software_genre ,Polymorphism, Single Nucleotide ,Biobank ,Abstract machine ,Machine Learning ,Genetics ,Humans ,SNP ,Relevance (information retrieval) ,Artificial intelligence ,business ,computer ,Algorithms ,Biotechnology - Abstract
Machine Learning (ML) approaches are increasingly being used in biomedical applications. Important challenges of ML include choosing the right algorithm and tuning the parameters for optimal performance. Automated ML (AutoML) methods, such as Tree-based Pipeline Optimization Tool (TPOT), have been developed to take some of the guesswork out of ML thus making this technology available to users from more diverse backgrounds. The goals of this study were to assess applicability of TPOT to genomics and to identify combinations of single nucleotide polymorphisms (SNPs) associated with coronary artery disease (CAD), with a focus on genes with high likelihood of being good CAD drug targets. We leveraged public functional genomic resources to group SNPs into biologically meaningful sets to be selected by TPOT. We applied this strategy to data from the UK Biobank, detecting a strikingly recurrent signal stemming from a group of 28 SNPs. Importance analysis of these uncovered functional relevance of the top SNPs to genes whose association with CAD is supported in the literature and other resources. Furthermore, we employed game-theory based metrics to study SNP contributions to individual level TPOT predictions and discover distinct clusters of well-predicted CAD cases. The latter indicates a promising approach towards precision medicine.
- Published
- 2021
- Full Text
- View/download PDF
39. Abstract 5268: The spectrum of FGFR mutations in pediatric and young adult solid tumor
- Author
-
Jinhua Wu, Jeffrey Schubert, Feng Xu, Ariel Long, Maha Patel, Netta Golenberg, Weixuan Fu, Kajia Cao, Jiani Chen, Elizabeth H. Denenberg, Elizabeth A. Fanning, Rochelle Bagatell, Theodore W. Laetsch, Adam Resnick, Mariarita Santi, Phillip Jay B. Storm, Minjie Luo, Lea F. Surrey, Yiming Zhong, and Marilyn M. Li
- Subjects
Cancer Research ,Oncology - Abstract
Fibroblast growth factor receptors (FGFRs) are a family of receptor tyrosine kinases expressed on the cell membrane that play crucial roles in cellular lineage commitment, differentiation, proliferation, and apoptosis. Deregulated FGFR signaling is observed in a subset of tumors across various histologies, making FGFRs ideal therapeutic targets. We sought to determine the genetic landscape of FGFR-family variations in a cohort of pediatric and young adult patients with solid tumors. The CHOP Comprehensive Solid Tumor Panel was performed on 1,420 patients. The panel covers 238 cancer genes and screens for single nucleotide variants (SNVs), indels, copy number alterations, and 117 fusion gene partners interrogating over 700 exons for known and novel fusions. Identified variants were categorized and reported according to the AMP/ASCO/CAP guidelines. Fifty-six patients (4.1%), including 47 children and 9 young adults, were found to carry at least one FGFR alteration in their tumors. CNS tumors accounted for most of the cases (51 total, 87.9%), with pilomyxoid astrocytoma/pilocytic astrocytoma and dysembryoplastic neuroepithelial tumor the most common (13 and 12 patients, respectively). Non-CNS solid tumors included rhabdomyosarcoma (4 patients), neuroblastoma/ganglioneuroblastoma (2), and follicular thyroid carcinoma (1). FGFR somatic alterations were found in 56 tumors including 41 SNVs and small indels, 6 internal tandem duplications (ITDs), and 15 fusions genes. The most common SNVs observed were hotspot mutations p.K656E and p.N546K of FGFR1. Sequence alterations in FGFR1 contained 35 SNVs and small indels, mostly gain of function mutations located in the kinase domain, and 6 kinase domain ITDs. One SNV was identified in FGFR2 in the immunoglobulin domain. Two SNVs were reported in FGFR3, both of which were in the fibroblast growth factor receptor family domain, and 3 SNVs were identified in FGFR4, all occurring at the p.V550 codon located on the kinase domain. Companion mutations in non-FGFR genes were detected in 27 tumors, predominantly involving RAS signaling pathway genes including NF1 (14 variants), PIK3CA (8), PTPN11 (6) and PIK3R1 (4). Among fusion variants, FGFR1-TACC1 fusions were found in 5 patients, mostly in pediatric patients. One FGFR3-TACC3 fusion was identified in one young adult patient. Seven pediatric patients tested positive for FGFR2 fusions; all with different 3’ partners. The detection of an FGFR alteration defined or changed the histologic diagnosis for 22 patients. Our results reveal that FGFR alterations account for 4.1% (56/1420) of the patients with solid tumors tested in our laboratory. The majority of the FGFR-positive tumors are low-grade CNS tumors. Further, the identification of FGFR alterations can significantly improve the tumor diagnosis and provide genomic evidence for potential targeted treatment with FGFR inhibitors. Citation Format: Jinhua Wu, Jeffrey Schubert, Feng Xu, Ariel Long, Maha Patel, Netta Golenberg, Weixuan Fu, Kajia Cao, Jiani Chen, Elizabeth H. Denenberg, Elizabeth A. Fanning, Rochelle Bagatell, Theodore W. Laetsch, Adam Resnick, Mariarita Santi, Phillip Jay B. Storm, Minjie Luo, Lea F. Surrey, Yiming Zhong, Marilyn M. Li. The spectrum of FGFR mutations in pediatric and young adult solid tumor [abstract]. In: Proceedings of the American Association for Cancer Research Annual Meeting 2022; 2022 Apr 8-13. Philadelphia (PA): AACR; Cancer Res 2022;82(12_Suppl):Abstract nr 5268.
- Published
- 2022
- Full Text
- View/download PDF
40. Key Parameters of Tumor Epitope Immunogenicity Revealed Through a Consortium Approach Improve Neoantigen Prediction
- Author
-
Zhiqin Huang, Pramod K. Srivastava, Song Liu, Jason A. Greenbaum, Dirk Jäger, Jiaqian Wang, Ognjen Milicevic, Willem-Jan Krebber, Barbara Schrörs, Sean Michael Boyle, Michal Bassani-Sternberg, Ana M. Mijalkovic Lazic, Amit A. Lugade, Kristen K. Dang, Han Si, Alessandro Sette, Jeffrey P. Ward, Irsan Kooi, Michael F. Princiotta, Begonya Comin-Anduix, Thierry Schuepbach, Pia Kvistborg, Julia Kodysh, William Chour, Vladimir B. Kovacevic, Jan H. Kessler, Ariella Sasson, Antoni Ribas, Brian Stevenson, Sriram Sridhar, Prateek Tanden, Robert D. Schreiber, Jason Perera, Kathleen C. F. Sheehan, Hira Rizvi, Sachet A. Shukla, Baikang Pei, Han Chang, Bo Li, Ion I. Mandoiu, Cristina Puig-Saus, Beatriz M. Carreno, Si Qiu, Jennifer M. Shelton, Patrick Jongeneel, Qiang Hu, Taha Merghoub, Matthew D. Hellmann, James P. Conway, Francisco Arcila, Ton N. Schumacher, Mathias Vormehr, Christopher A. Morehouse, Patrice Manning, Jonathon Blake, Pornpimol Charoentong, Angela Frentzen, Christopher A. Miller, Michael A. Kuziora, Bin Song, Lei Wei, Martin Löwer, Gabor Bartha, Justin Guinney, Niels Halama, Rolf Hilker, Yinong Sebastian, Veliborka Josipovic, Jason Harris, Geng Liu, Guilhem Richard, Arjun A. Rao, Nikola M. Skundric, Markus Mueller, Daniel K. Wells, Tatiana Shcheglova, Inka Zörnig, Weixuan Fu, John Sidney, Nadine Defranoux, Gabriela Steiner, Joseph D. Szustakowski, Arbel D. Tadmor, Maxim N. Artyomov, Jianmin Wang, George Coukos, Brandon W. Higgs, Milica R. Kojicic, Siranush Sarkizova, Daphne van Beek, Naibo Yang, Robert Ziman, Mignonette H. Macabali, Thomas Yu, Nicolas Guex, Nina Bhardwaj, Lorenzo F. Fanchi, Bjoern Peters, Christian Iseli, Song Wu, Maren Lang, Juliet Forman, Marit M. van Buuren, David Balli, Steven L. C. Ketelaars, Nir Hacohen, Ekaterina Esaulova, Maarten Slagter, Todd Creasy, Robert A. Petit, Yi-Hsiang Hsu, Ravi Gupta, Katie M. Campbell, Pascal Gellert, David Haussler, Jesse M. Zaretsky, Sofie R. Salama, Vanessa M. Hubbard-Lucey, Joel Greshock, Zeynep Kosaloglu Yalcin, Cornelis J. M. Melief, Priyanka Shah, Ioannis Xenarios, Nevena M. Ilic Raicevic, Andrew Lamb, Suchit Jhunjhunwala, Aly A. Khan, David Gfeller, James R. Heath, Richard Chen, Jia M. Chen, Alphonsus H. C. Ng, Elham Sherafat, Ana Belen Blazquez, Leo J. Lee, Beata Berent-Maoz, Cheryl Selinsky, Jasreet Hundal, Eduardo Cortes, Xengie Doan, Sahar Al Seesi, Adam Kolom, Fred Ramsdell, Nicolas Robine, and Andrew J. Rech
- Subjects
medicine.medical_treatment ,T cell ,Programmed Cell Death 1 Receptor ,No reference ,Sequencing data ,Computational biology ,Biology ,General Biochemistry, Genetics and Molecular Biology ,Epitope ,Cohort Studies ,Epitopes ,03 medical and health sciences ,0302 clinical medicine ,Antigens, Neoplasm ,Neoplasms ,Research community ,medicine ,Humans ,Alleles ,030304 developmental biology ,Antigen Presentation ,0303 health sciences ,Immunogenicity ,Reproducibility of Results ,Immunotherapy ,medicine.anatomical_structure ,Peptides ,030217 neurology & neurosurgery - Abstract
Many approaches to identify therapeutically relevant neoantigens couple tumor sequencing with bioinformatic algorithms and inferred rules of tumor epitope immunogenicity. However, there are no reference data to compare these approaches, and the parameters governing tumor epitope immunogenicity remain unclear. Here, we assembled a global consortium wherein each participant predicted immunogenic epitopes from shared tumor sequencing data. 608 epitopes were subsequently assessed for T cell binding in patient-matched samples. By integrating peptide features associated with presentation and recognition, we developed a model of tumor epitope immunogenicity that filtered out 98% of non-immunogenic peptides with a precision above 0.70. Pipelines prioritizing model features had superior performance, and pipeline alterations leveraging them improved prediction performance. These findings were validated in an independent cohort of 310 epitopes prioritized from tumor sequencing data and assessed for T cell binding. This data resource enables identification of parameters underlying effective anti-tumor immunity and is available to the research community.
- Published
- 2020
41. Embedding covariate adjustments in tree-based automated machine learning for biomedical big data analyses
- Author
-
Stefano Ruberto, Elisabetta Manduchi, Joseph D. Romano, Weixuan Fu, and Jason H. Moore
- Subjects
Big Data ,Data Analysis ,Computer science ,Feature importance ,Big data ,Context (language use) ,Genetic programming ,Machine learning ,computer.software_genre ,lcsh:Computer applications to medicine. Medical informatics ,Biochemistry ,Field (computer science) ,Machine Learning ,03 medical and health sciences ,Automation ,0302 clinical medicine ,Structural Biology ,Covariate ,Humans ,Pathways ,Molecular Biology ,lcsh:QH301-705.5 ,AutoML ,030304 developmental biology ,0303 health sciences ,Covariate adjustment ,business.industry ,Applied Mathematics ,Pipeline (software) ,Computer Science Applications ,Data set ,Task (computing) ,Tree (data structure) ,lcsh:Biology (General) ,030220 oncology & carcinogenesis ,Embedding ,lcsh:R858-859.7 ,Artificial intelligence ,business ,computer ,Algorithms - Abstract
Background A typical task in bioinformatics consists of identifying which features are associated with a target outcome of interest and building a predictive model. Automated machine learning (AutoML) systems such as the Tree-based Pipeline Optimization Tool (TPOT) constitute an appealing approach to this end. However, in biomedical data, there are often baseline characteristics of the subjects in a study or batch effects that need to be adjusted for in order to better isolate the effects of the features of interest on the target. Thus, the ability to perform covariate adjustments becomes particularly important for applications of AutoML to biomedical big data analysis. Results We developed an approach to adjust for covariates affecting features and/or target in TPOT. Our approach is based on regressing out the covariates in a manner that avoids ‘leakage’ during the cross-validation training procedure. We describe applications of this approach to toxicogenomics and schizophrenia gene expression data sets. The TPOT extensions discussed in this work are available at https://github.com/EpistasisLab/tpot/tree/v0.11.1-resAdj. Conclusions In this work, we address an important need in the context of AutoML, which is particularly crucial for applications to bioinformatics and medical informatics, namely covariate adjustments. To this end we present a substantial extension of TPOT, a genetic programming based AutoML approach. We show the utility of this extension by applications to large toxicogenomics and differential gene expression data. The method is generally applicable in many other scenarios from the biomedical field.
- Published
- 2020
42. Additional file 5 of Embedding covariate adjustments in tree-based automated machine learning for biomedical big data analyses
- Author
-
Manduchi, Elisabetta, Weixuan Fu, Romano, Joseph D., Ruberto, Stefano, and Moore, Jason H.
- Subjects
ComputerSystemsOrganization_PROCESSORARCHITECTURES - Abstract
Additional file 5. resAdj TPOT pre-processor output. Structure and column description of the output from this pre-processor.
- Published
- 2020
- Full Text
- View/download PDF
43. Additional file 4 of Embedding covariate adjustments in tree-based automated machine learning for biomedical big data analyses
- Author
-
Manduchi, Elisabetta, Weixuan Fu, Romano, Joseph D., Ruberto, Stefano, and Moore, Jason H.
- Abstract
Additional file 4. Additional methods. Details for the leakage-free covariate adjustment.
- Published
- 2020
- Full Text
- View/download PDF
44. A Novel Graph Based Semi-Supervised Learning Approach to Identify Pathways Contributing to the Development of Diabetes and Obesity
- Author
-
Sara E. Pinney, Weixuan Fu, Yonghyan Nam, Apoorva Joshi, Manu Shivakumar, Rebecca A. Simmons, Dokyoon Kim, Garam Lee, and Paul P. Wang
- Subjects
Computer science ,business.industry ,Endocrinology, Diabetes and Metabolism ,Graph based ,Semi-supervised learning ,Machine learning ,computer.software_genre ,medicine.disease ,Obesity ,Development (topology) ,Diabetes mellitus ,medicine ,Artificial intelligence ,business ,computer - Abstract
Background: Gestational diabetes (GDM) has profound effects on the intrauterine metabolic milieu, induces marked abnormalities in fetal glucose and insulin secretion and is linked to obesity and diabetes in the offspring, but the mechanisms remain largely unknown. Epigenetic modifications in stems cells may be one mechanism by which an in utero exposure can lead to the development of diabetes and obesity later in life. Objective: To identify novel pathways contributing to the development of diabetes and obesity in offspring exposed to GDM in utero by integrating data generated from transcriptome and methylome analysis from second trimester human amniocytes exposed to GDM in utero. Methods: We analyzed RNAseq and genome wide DNA methylation data (ERRBS) generated from second trimester amniocytes obtained from women with GDM (n=14). Amniocytes have stem cells-like characteristics and are derived from the fetus. Expression data of 22,271 genes were retrieved from RNAseq data. CpGs with significant changes in DNA methylation were mapped into 20,028 genes by collapsing methylation probes into promoter and gene regions. To better understand the associations among diverse gene sets or among gene sets and GDM,we first constructed two weighted co-expression networks from RNAseq and DNA methylation data, respectively. Then, two co-expression networks were integrated using a linear combination. With the integrated co-expression network, graph-based label propagation algorithm was employed to prioritize GDM-associated genes. Results: From the differential expression analysis between GDM and control, the top 20 query genes, including 11 genes and 9 methylated genes, were selected for label propagation. Finally, the top 100 genes were picked up for the pathway enrichment testusing an over-representation analysis approach. Significantly enriched pathways included: Interferon Signaling, N-glycan Antennae Elongation, Sphingolipid Pathway and Metabolism, Classical Complement Pathway, Complement and Coagulation Cascades, Tryptophan Metabolism, Peroxisomal Protein Import, Unsaturated Fatty Acid Metabolism, Complement Activation, Human Innate Immune Response, Ceramide Metabolism, Fertilization Pathway, Keratan Sulfate Biosynthesis Pathway, Transport to the Golgi and Modification Pathway (FDR q Conclusion: Using a novel bioinformatic approach to synthesize transcriptome and methylome data derived from human amniocytes exposed to GDM in utero, we identified potential pathways involved in programming of diabetes and obesity in offspring including pathways involving the immune response, complex lipid metabolism, the complement pathway, and protein transport and processing. Further investigation of these pathways may yield new mechanisms contributing to diabetes and obesity.
- Published
- 2021
- Full Text
- View/download PDF
45. PFOA Exposure Prior to Hepatocyte Differentiation Leads to Gene Expression Changes Implicated in Non-Alcoholic Fatty Liver Disease
- Author
-
Weixuan Fu, Paul P. Wang, Sara E. Pinney, Apoorva Joshi, Rebecca A. Simmons, Pheruza Tarapore, and Stephanie Hanke
- Subjects
Hepatocyte differentiation ,medicine.medical_specialty ,Endocrinology, Diabetes and Metabolism ,Fatty liver ,Peroxisome Proliferation ,Lipid metabolism ,Biology ,medicine.disease ,Endocrine Disrupting Compounds: Mechanisms of Action and Clinical Implications ,Endocrinology ,Endocrine Disruption ,Internal medicine ,Lipogenesis ,Gene expression ,medicine ,Hepatic stellate cell ,Hepatic fibrosis ,AcademicSubjects/MED00250 - Abstract
Background: Perfluorooctanoic acid (PFOA), is a persistent fluorinated compound with oil and water repelling properties found in cookware, food packaging and municipal water systems. Adult animals exposed to PFOA develop hepatomegaly, fatty liver, peroxisome proliferation, and immunotoxicity. Rodents exposed to PFCs in utero have altered hepatic lipid metabolism, increased hepatic de novo lipogenesis and susceptibility to non-alcoholic fatty liver disease (NAFLD), but underlying molecular mechanisms remain unknown. With increasing rates of obesity, diabetes, and NAFLD it is critical to examine the mechanisms by which in utero exposure to PFOA contributes to the development of metabolic syndrome in offspring. Objective: To determine mechanism by which PFOA alters gene expression in undifferentiated hepatic progenitor cells. Design/methods: HepaRG cells, a human derived hepatocyte progenitor cell line, was treated with 0.5uM PFOA or vehicle for 48 hours followed by differentiation into hepatocytes. Total RNA was extracted using the RNeasy (Qiagen) [total RNA A260/280>2 and RNA integrity number >7 (Agilent Bioanalyzer)] to generate libraries with the Illumina TruSeq stranded total RNA kit. RNA-Seq was performed using 85 bp single-end read sequencing to generate >20 million reads per sample. RNAseq data was aligned to hg38 using STAR v2.6.1a and then quantified with featureCounts v1.6.2. DESeq2 identified differentially expressed genes via FDR (false discovery rate) after Bonferroni correction. Differentially expressed gene lists were used for Ingenuity Pathway Analysis (IPA) to identify pathways of biological significance. Results: PFOA treatment resulted in increased expression of transcription factors EGR1 (early growth response protein 1), NR4A1 (nuclear receptor Nur77), EGR2 (early growth response protein 2), KLF10 (Krueppel-like factor 10) and FOSL1 (Fos-related antigen 1), key genes linked to impaired hepatic insulin signaling, hepatic lipid metabolism, steatosis and fibrosis (fold change > 1.5; q
- Published
- 2021
- Full Text
- View/download PDF
46. Correction to: Investigating the parameter space of evolutionary algorithms
- Author
-
Karuna Ahuja, Jason H. Moore, Moshe Sipper, and Weixuan Fu
- Subjects
0303 health sciences ,Computer science ,030302 biochemistry & molecular biology ,Evolutionary algorithm ,lcsh:QA299.6-433 ,Correction ,lcsh:Analysis ,Parameter space ,lcsh:Computer applications to medicine. Medical informatics ,Biochemistry ,Computer Science Applications ,03 medical and health sciences ,Computational Mathematics ,Computational Theory and Mathematics ,Genetics ,lcsh:R858-859.7 ,Molecular Biology ,Algorithm ,030304 developmental biology - Abstract
Evolutionary computation (EC) has been widely applied to biological and biomedical data. The practice of EC involves the tuning of many parameters, such as population size, generation count, selection size, and crossover and mutation rates. Through an
- Published
- 2019
- Full Text
- View/download PDF
47. Comparison of protective efficacy between two DNA vaccines encoding DnaK and GroEL against fish nocardiosis
- Author
-
Jia Cai, Weixuan Fu, Suying Hou, Wenji Wang, Jianlin Chen, Xia Liqun, and Yishan Lu
- Subjects
0301 basic medicine ,Nocardia Infections ,Aquaculture ,Aquatic Science ,Biology ,Nocardia ,Microbiology ,DNA vaccination ,03 medical and health sciences ,Fish Diseases ,Immune system ,Antigen ,Bacterial Proteins ,Immunity ,medicine ,Vaccines, DNA ,Environmental Chemistry ,Animals ,Immunity, Cellular ,Bacterial disease ,Nocardiosis ,04 agricultural and veterinary sciences ,General Medicine ,Chaperonin 60 ,medicine.disease ,GroEL ,Immunity, Innate ,Immunity, Humoral ,030104 developmental biology ,Immunization ,biological sciences ,Bacterial Vaccines ,040102 fisheries ,bacteria ,0401 agriculture, forestry, and fisheries ,Molecular Chaperones - Abstract
Fish nocardiosis is a chronic granulomatous bacterial disease mainly caused by three pathogenic bacteria, including Nocardia seriolae, N. asteroids and N. salmonicida. Molecular chaperone DnaK and GroEL were identified to be the common antigens of the three pathogenic Nocardia species in our previous studies. To evaluate the immune protective effect of two DNA vaccines encoding DnaK or GroEL against fish nocardiosis, hybrid snakehead were vaccinated and the immune responses induced by these two vaccines were comparatively analyzed. The results suggested it needed at least 7 d to transport DnaK or GroEL gene from injected muscle to head kidney, spleen and liver and stimulate host's immune system for later protection after immunization by DNA vaccines. Additionally, non-specific immunity parameters (serum lysozyme (LYZ), peroxidase (POD), acid phosphatase (ACP), alkaline phosphatase (AKP) and superoxide dismutase (SOD) activities), specific antibody (IgM) production and immune-related genes (MHCIα, MHCIIα, CD4, CD8α, IL-1β and TNFα) were used to evaluate the immune responses induced in vaccinated hybrid snakehead. It proved that all the above-mentioned immune activities were significantly enhanced after immunization with these two DNA vaccines. The protective efficacy of pcDNA-DnaK and pcDNA-GroEL DNA vaccines, in terms of relative percentage survival (RPS), were 53.01% and 80.71% respectively. It demonstrated that these two DNA vaccines could increase the survival rate of hybrid snakehead against fish nocardiosis, albeit with variations in immunoprotective effects. Taken together, these results indicated that both pcDNA-DnaK and pcDNA-GroEL DNA vaccines could boost the innate, humoral and cellular immune response in hybrid snakehead and show highly protective efficacy against fish nocardiosis, suggesting that DnaK and GroEL were promising vaccine candidates. These findings will promote the development of DNA vaccines against fish nocardiosis in aquaculture.
- Published
- 2019
48. Evaluating recommender systems for AI-driven biomedical informatics
- Author
-
Heather Williams, Durga Srivatsan, Weixuan Fu, William La Cava, Steven Vitale, and Jason H. Moore
- Subjects
Statistics and Probability ,FOS: Computer and information sciences ,Computer Science - Machine Learning ,Informatics ,AcademicSubjects/SCI01060 ,Computer science ,02 engineering and technology ,Recommender system ,Machine learning ,computer.software_genre ,Biochemistry ,Health informatics ,Computer Science - Information Retrieval ,Matrix decomposition ,Machine Learning (cs.LG) ,Machine Learning ,03 medical and health sciences ,0202 electrical engineering, electronic engineering, information engineering ,Humans ,Molecular Biology ,030304 developmental biology ,0303 health sciences ,business.industry ,Deep learning ,Original Papers ,Metalearning ,3. Good health ,Computer Science Applications ,Computational Mathematics ,Subject-matter expert ,Computational Theory and Mathematics ,020201 artificial intelligence & image processing ,Artificial intelligence ,Data and Text Mining ,business ,computer ,Algorithms ,Information Retrieval (cs.IR) - Abstract
Motivation: Many researchers with domain expertise are unable to easily apply machine learning to their bioinformatics data due to a lack of machine learning and/or coding expertise. Methods that have been proposed thus far to automate machine learning mostly require programming experience as well as expert knowledge to tune and apply the algorithms correctly. Here, we study a method of automating biomedical data science using a web-based platform that uses AI to recommend model choices and conduct experiments. We have two goals in mind: first, to make it easy to construct sophisticated models of biomedical processes; and second, to provide a fully automated AI agent that can choose and conduct promising experiments for the user, based on the user's experiments as well as prior knowledge. To validate this framework, we experiment with hundreds of classification problems, comparing to state-of-the-art, automated approaches. Finally, we use this tool to develop predictive models of septic shock in critical care patients. Results: We find that matrix factorization-based recommendation systems outperform meta-learning methods for automating machine learning. This result mirrors the results of earlier recommender systems research in other domains. The proposed AI is competitive with state-of-the-art automated machine learning methods in terms of choosing optimal algorithm configurations for datasets. In our application to prediction of septic shock, the AI-driven analysis produces a competent machine learning model (AUROC 0.85 +/- 0.02) that performs on par with state-of-the-art deep learning results for this task, with much less computational effort., Comment: 17 pages, 8 figures. this version fixes link to pennai in abstract
- Published
- 2019
- Full Text
- View/download PDF
49. Evolutionary computation
- Author
-
Karuna Ahuja, Moshe Sipper, Jason H. Moore, and Weixuan Fu
- Subjects
0301 basic medicine ,Hyperparameter ,03 medical and health sciences ,030104 developmental biology ,Series (mathematics) ,Computer science ,Evolutionary algorithm ,Contrast (statistics) ,Genetic programming ,Parameter space ,Algorithm ,Evolutionary computation - Abstract
Through an extensive series of experiments over multiple evolutionary algorithm implementations and 25 problems we showed that parameter space tends to be rife with viable parameters, somewhat in contrast with common lore [6].
- Published
- 2018
- Full Text
- View/download PDF
50. A System for Accessible Artificial Intelligence
- Author
-
Patryk Orzechowski, Sharon Tartarone, Jason H. Moore, John H. Holmes, Moshe Sipper, Randal S. Olson, Ryan J. Urbanowicz, Weixuan Fu, Steven Vitale, and William La Cava
- Subjects
0301 basic medicine ,Complex data type ,Point (typography) ,Computer science ,business.industry ,Genetic programming ,02 engineering and technology ,03 medical and health sciences ,030104 developmental biology ,Open source ,Health care ,0202 electrical engineering, electronic engineering, information engineering ,Deep knowledge ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Ai systems - Abstract
While artificial intelligence (AI) has become widespread, many commercial AI systems are not yet accessible to individual researchers nor the general public due to the deep knowledge of the systems required to use them. We believe that AI has matured to the point where it should be an accessible technology for everyone. We present an ongoing project whose ultimate goal is to deliver an open source, user-friendly AI system that is specialized for machine learning analysis of complex data in the biomedical and health care domains. We discuss how genetic programming can aid in this endeavor, and highlight specific examples where genetic programming has automated machine learning analyses in previous projects.
- Published
- 2018
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.