105 results on '"Rajapakse JC"'
Search Results
2. Reconstruction of protein-protein interaction pathways by mining subject-verb-objects intermediates
- Author
-
Rajapakse, JC, Schmidt, B, Volkert, G, Ling, MHT, Lefevre, C, Nicholas, KR, Lin, F, Rajapakse, JC, Schmidt, B, Volkert, G, Ling, MHT, Lefevre, C, Nicholas, KR, and Lin, F
- Published
- 2007
3. The Potential of Using Cost-Effective Compost Mixtures for Oyster Mushroom (Pleurotus spp) Cultivation in Sri Lanka
- Author
-
Rajapakse, JC, primary, Rubasingha, P, additional, and Dissanayake, NN, additional
- Published
- 2010
- Full Text
- View/download PDF
4. Increasing the topological quality of Kohonen's self-organising map by using a hit term
- Author
-
E. Germen, Wang, L, Rajapakse, JC, Fukushima, K, Anadolu Üniversitesi, Mühendislik Fakültesi, Elektrik ve Elektronik Mühendisliği Bölümü, and Germen, Emin
- Subjects
Self-organizing map ,media_common.quotation_subject ,Process (computing) ,Function (mathematics) ,Self organising maps ,computer.software_genre ,Topology ,Term (time) ,Quality (business) ,Data mining ,Adaptation (computer science) ,computer ,Topology (chemistry) ,media_common ,Mathematics - Abstract
9th International Conference on Neural Information Processing -- NOV 18-22, 2002 -- SINGAPORE, SINGAPORE, WOS: 000182832400193, The quality of the topology obtained at the end of the training period of Kohonen's Self Organizing Map (SOM) is highly dependent on the learning rate and neighborhood function that are chosen at the beginning. The conventional approaches to determine those parameters do not account for the data statistics and the topological characterization of the neurons. This paper proposes a new parameter, which depends on the hit ratio among the updated. neuron and the best matching neuron. It has been shown that by using this parameter with the conventional learning rate and neighborhood functions, much more adequate solution can be obtained since it deserves an information about data statistics during adaptation process., Asia Pacific Neural Network Assembly, Singapore Neurosci Assoc, SEAL & FSKD Conf Steering Comm, Nanyang Technol Univ, Sch Elect & Electr Engn
- Published
- 2002
- Full Text
- View/download PDF
5. Quantification of muscle fiber malformations using edge detection to investigate chronic muscle pressure ulcers.
- Author
-
Ong CZL, Nasir NJM, Welsch RE, Tucker-Kellogg L, and Rajapakse JC
- Abstract
Background: Microscopy of regenerated tissue shows different morphologies between the healing of acute wounds and chronic wounds. This difference can be seen manually by biologists, but computational methods are needed to automate the characterization of morphology and regenerative quality in regenerated muscle tissue., Results: From the detected edge segments, we computed several imaging biomarkers of interest, such as median tortuosity, number of edge segments normalized by area, median edge segment distance and interquartile range of orientation angles of edge segments of the microscope images of successful and unsuccessful muscle regeneration. We observed that muscle fibers in saline-treated pressure ulcers had a larger interquartile range of orientation angles of the edge segments (p = 0.05) and shorter edge segment distances (p = 0.003) compared to those of acute cardiotoxin injuries., Conclusion: Our edge detection method was able to identify statistically significant differences in some of the imaging biomarkers between saline-treated pressure ulcers and cardiotoxin injuries, suggesting that chronic pressure ulcers have increased muscle fiber malformations compared to cardiotoxin injuries., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2024 Ong, Nasir, Welsch, Tucker-Kellogg and Rajapakse.)
- Published
- 2024
- Full Text
- View/download PDF
6. SKGC: A General Semantic-Level Knowledge Guided Classification Framework for Fetal Congenital Heart Disease.
- Author
-
Lu Y, Tan G, Pu B, Wang H, Liang B, Li K, and Rajapakse JC
- Subjects
- Humans, Female, Pregnancy, Fetal Heart diagnostic imaging, Algorithms, Deep Learning, Heart Defects, Congenital diagnostic imaging, Heart Defects, Congenital classification, Semantics, Ultrasonography, Prenatal methods, Image Interpretation, Computer-Assisted methods
- Abstract
Congenital heart disease (CHD) is the most common congenital disability affecting healthy development and growth, even resulting in pregnancy termination or fetal death. Recently, deep learning techniques have made remarkable progress to assist in diagnosing CHD. One very popular method is directly classifying fetal ultrasound images, recognized as abnormal and normal, which tends to focus more on global features and neglects semantic knowledge of anatomical structures. The other approach is segmentation-based diagnosis, which requires a large number of pixel-level annotation masks for training. However, the detailed pixel-level segmentation annotation is costly or even unavailable. Based on the above analysis, we propose SKGC, a universal framework to identify normal or abnormal four-chamber heart (4CH) images, guided by a few annotation masks, while improving accuracy remarkably. SKGC consists of a semantic-level knowledge extraction module (SKEM), a multi-knowledge fusion module (MFM), and a classification module (CM). SKEM is responsible for obtaining high-level semantic knowledge, serving as an abstract representation of the anatomical structures that obstetricians focus on. MFM is a lightweight but efficient module that fuses semantic-level knowledge with the original specific knowledge in ultrasound images. CM classifies the fused knowledge and can be replaced by any advanced classifier. Moreover, we design a new loss function that enhances the constraint between the foreground and background predictions, improving the quality of the semantic-level knowledge. Experimental results on the collected real-world NA-4CH and the publicly FEST datasets show that SKGC achieves impressive performance with the best accuracy of 99.68% and 95.40%, respectively. Notably, the accuracy improves from 74.68% to 88.14% using only 10 labeled masks.
- Published
- 2024
- Full Text
- View/download PDF
7. Comparative Analysis of Vision Transformers and Conventional Convolutional Neural Networks in Detecting Referable Diabetic Retinopathy.
- Author
-
Goh JHL, Ang E, Srinivasan S, Lei X, Loh J, Quek TC, Xue C, Xu X, Liu Y, Cheng CY, Rajapakse JC, and Tham YC
- Abstract
Objective: Vision transformers (ViTs) have shown promising performance in various classification tasks previously dominated by convolutional neural networks (CNNs). However, the performance of ViTs in referable diabetic retinopathy (DR) detection is relatively underexplored. In this study, using retinal photographs, we evaluated the comparative performances of ViTs and CNNs on detection of referable DR., Design: Retrospective study., Participants: A total of 48 269 retinal images from the open-source Kaggle DR detection dataset, the Messidor-1 dataset and the Singapore Epidemiology of Eye Diseases (SEED) study were included., Methods: Using 41 614 retinal photographs from the Kaggle dataset, we developed 5 CNN (Visual Geometry Group 19, ResNet50, InceptionV3, DenseNet201, and EfficientNetV2S) and 4 ViTs models (VAN_small, CrossViT_small, ViT_small, and Hierarchical Vision transformer using Shifted Windows [SWIN]_tiny) for the detection of referable DR. We defined the presence of referable DR as eyes with moderate or worse DR. The comparative performance of all 9 models was evaluated in the Kaggle internal test dataset (with 1045 study eyes), and in 2 external test sets, the SEED study (5455 study eyes) and the Messidor-1 (1200 study eyes)., Main Outcome Measures: Area under operating characteristics curve (AUC), specificity, and sensitivity., Results: Among all models, the SWIN transformer displayed the highest AUC of 95.7% on the internal test set, significantly outperforming the CNN models (all P < 0.001). The same observation was confirmed in the external test sets, with the SWIN transformer achieving AUC of 97.3% in SEED and 96.3% in Messidor-1. When specificity level was fixed at 80% for the internal test, the SWIN transformer achieved the highest sensitivity of 94.4%, significantly better than all the CNN models (sensitivity levels ranging between 76.3% and 83.8%; all P < 0.001). This trend was also consistently observed in both external test sets., Conclusions: Our findings demonstrate that ViTs provide superior performance over CNNs in detecting referable DR from retinal photographs. These results point to the potential of utilizing ViT models to improve and optimize retinal photo-based deep learning for referable DR detection., Financial Disclosures: Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article., (© 2024 by the American Academy of Ophthalmology.)
- Published
- 2024
- Full Text
- View/download PDF
8. GLDM: hit molecule generation with constrained graph latent diffusion model.
- Author
-
Wang C, Ong HH, Chiba S, and Rajapakse JC
- Subjects
- Diffusion, Benchmarking, Drug Discovery
- Abstract
Discovering hit molecules with desired biological activity in a directed manner is a promising but profound task in computer-aided drug discovery. Inspired by recent generative AI approaches, particularly Diffusion Models (DM), we propose Graph Latent Diffusion Model (GLDM)-a latent DM that preserves both the effectiveness of autoencoders of compressing complex chemical data and the DM's capabilities of generating novel molecules. Specifically, we first develop an autoencoder to encode the molecular data into low-dimensional latent representations and then train the DM on the latent space to generate molecules inducing targeted biological activity defined by gene expression profiles. Manipulating DM in the latent space rather than the input space avoids complicated operations to map molecule decomposition and reconstruction to diffusion processes, and thus improves training efficiency. Experiments show that GLDM not only achieves outstanding performances on molecular generation benchmarks, but also generates samples with optimal chemical properties and potentials to induce desired biological activity., (© The Author(s) 2024. Published by Oxford University Press.)
- Published
- 2024
- Full Text
- View/download PDF
9. Sparse Deep Neural Network for Encoding and Decoding the Structural Connectome.
- Author
-
Singh SP, Gupta S, and Rajapakse JC
- Subjects
- Humans, Magnetic Resonance Imaging methods, Neural Networks, Computer, Neuroimaging methods, Biomarkers, Connectome methods, Alzheimer Disease
- Abstract
Brain state classification by applying deep learning techniques on neuroimaging data has become a recent topic of research. However, unlike domains where the data is low dimensional or there are large number of available training samples, neuroimaging data is high dimensional and has few training samples. To tackle these issues, we present a sparse feedforward deep neural architecture for encoding and decoding the structural connectome of the human brain. We use a sparsely connected element-wise multiplication as the first hidden layer and a fixed transform layer as the output layer. The number of trainable parameters and the training time is significantly reduced compared to feedforward networks. We demonstrate superior performance of this architecture in encoding the structural connectome implicated in Alzheimer's disease (AD) and Parkinson's disease (PD) from DTI brain scans. For decoding, we propose recursive feature elimination (RFE) algorithm based on DeepLIFT, layer-wise relevance propagation (LRP), and Integrated Gradients (IG) algorithms to remove irrelevant features and thereby identify key biomarkers associated with AD and PD. We show that the proposed architecture reduces 45.1% and 47.1% of the trainable parameters compared to a feedforward DNN with an increase in accuracy by 2.6 % and 3.1% for cognitively normal (CN) vs AD and CN vs PD classification, respectively. We also show that the proposed RFE method leads to a further increase in accuracy by 2.1% and 4% for CN vs AD and CN vs PD classification, while removing approximately 90% to 95% irrelevant features. Furthermore, we argue that the biomarkers (i.e., key brain regions and connections) identified are consistent with previous literature. We show that relevancy score-based methods can yield high discriminative power and are suitable for brain decoding. We also show that the proposed approach led to a reduction in the number of trainable network parameters, an increase in classification accuracy, and a detection of brain connections and regions that were consistent with earlier studies., (© 2024 The Authors.)
- Published
- 2024
- Full Text
- View/download PDF
10. Hybrid UNet transformer architecture for ischemic stoke segmentation with MRI and CT datasets.
- Author
-
Soh WK and Rajapakse JC
- Abstract
A hybrid UNet and Transformer (HUT) network is introduced to combine the merits of the UNet and Transformer architectures, improving brain lesion segmentation from MRI and CT scans. The HUT overcomes the limitations of conventional approaches by utilizing two parallel stages: one based on UNet and the other on Transformers. The Transformer-based stage captures global dependencies and long-range correlations. It uses intermediate feature vectors from the UNet decoder and improves segmentation accuracy by enhancing the attention and relationship modeling between voxel patches derived from the 3D brain volumes. In addition, HUT incorporates self-supervised learning on the transformer network. This allows the transformer network to learn by maintaining consistency between the classification layers of the different resolutions of patches and augmentations. There is an improvement in the rate of convergence of the training and the overall capability of segmentation. Experimental results on benchmark datasets, including ATLAS and ISLES2018, demonstrate HUT's advantage over the state-of-the-art methods. HUT achieves higher Dice scores and reduced Hausdorff Distance scores in single-modality and multi-modality lesion segmentation. HUT outperforms the state-the-art network SPiN in the single-modality MRI segmentation on Anatomical Tracings of lesion After Stroke (ATLAS) dataset by 4.84% of Dice score and a large margin of 40.7% in the Hausdorff Distance score. HUT also performed well on CT perfusion brain scans in the Ischemic Stroke Lesion Segmentation (ISLES2018) dataset and demonstrated an improvement over the recent state-of-the-art network USSLNet by 3.3% in the Dice score and 12.5% in the Hausdorff Distance score. With the analysis of both single and multi-modalities datasets (ATLASR12 and ISLES2018), we show that HUT can perform and generalize well on different datasets. Code is available at: https://github.com/vicsohntu/HUT_CT., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2023 Soh and Rajapakse.)
- Published
- 2023
- Full Text
- View/download PDF
11. Elucidating salient site-specific functional connectivity features and site-invariant biomarkers in schizophrenia via deep neural networks.
- Author
-
Chan YH, Yew WC, Chew QH, Sim K, and Rajapakse JC
- Subjects
- Humans, Brain pathology, Magnetic Resonance Imaging methods, Frontal Lobe, Neural Networks, Computer, Brain Mapping methods, Schizophrenia
- Abstract
Schizophrenia is a highly heterogeneous disorder and salient functional connectivity (FC) features have been observed to vary across study sites, warranting the need for methods that can differentiate between site-invariant FC biomarkers and site-specific salient FC features. We propose a technique named Semi-supervised learning with data HaRmonisation via Encoder-Decoder-classifier (SHRED) to examine these features from resting state functional magnetic resonance imaging scans gathered from four sites. Our approach involves an encoder-decoder-classifier architecture that simultaneously performs data harmonisation and semi-supervised learning (SSL) to deal with site differences and labelling inconsistencies across sites respectively. The minimisation of reconstruction loss from SSL was shown to improve model performance even within small datasets whilst data harmonisation often led to lower model generalisability, which was unaffected using the SHRED technique. We show that our proposed model produces site-invariant biomarkers, most notably the connection between transverse temporal gyrus and paracentral lobule. Site-specific salient FC features were also elucidated, especially implicating the paracentral lobule for our local dataset. Our examination of these salient FC features demonstrates how site-specific features and site-invariant biomarkers can be differentiated, which can deepen our understanding of the neurobiology of schizophrenia., (© 2023. The Author(s).)
- Published
- 2023
- Full Text
- View/download PDF
12. HUT: Hybrid UNet transformer for brain lesion and tumour segmentation.
- Author
-
Soh WK, Yuen HY, and Rajapakse JC
- Abstract
A supervised deep learning network like the UNet has performed well in segmenting brain anomalies such as lesions and tumours. However, such methods were proposed to perform on single-modality or multi-modality images. We use the Hybrid UNet Transformer (HUT) to improve performance in single-modality lesion segmentation and multi-modality brain tumour segmentation. The HUT consists of two pipelines running in parallel, one of which is UNet-based and the other is Transformer-based. The Transformer-based pipeline relies on feature maps in the intermediate layers of the UNet decoder during training. The HUT network takes in the available modalities of 3D brain volumes and embeds the brain volumes into voxel patches. The transformers in the system improve global attention and long-range correlation between the voxel patches. In addition, we introduce a self-supervised training approach in the HUT framework to enhance the overall segmentation performance. We demonstrate that HUT performs better than the state-of-the-art network SPiN in the single-modality segmentation on Anatomical Tracings of Lesions After Stroke (ATLAS) dataset by 4.84% of Dice score and a significant 41% in the Hausdorff Distance score. HUT also performed well on brain scans in the Brain Tumour Segmentation (BraTS20) dataset and demonstrated an improvement over the state-of-the-art network nnUnet by 0.96% in the Dice score and 4.1% in the Hausdorff Distance score., Competing Interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (© 2023 The Authors.)
- Published
- 2023
- Full Text
- View/download PDF
13. Graph Neural Networks With Multiple Prior Knowledge for Multi-Omics Data Analysis.
- Author
-
Xiao S, Lin H, Wang C, Wang S, and Rajapakse JC
- Subjects
- Humans, Algorithms, Biotechnology, Data Analysis, Multiomics, Neural Networks, Computer
- Abstract
With the development of biotechnology, a large amount of multi-omics data have been collected for precision medicine. There exists multiple graph-based prior biological knowledge about omics data, such as gene-gene interaction networks. Recently, there has been an increasing interest in introducing graph neural networks (GNNs) into multi-omics learning. However, existing methods have not fully exploited these graphical priors since none have been able to integrate knowledge from multiple sources simultaneously. To solve this problem, we propose a multi-omics data analysis framework by incorporating multiple prior knowledge into graph neural network (MPK-GNN). To the best of our knowledge, this is the first attempt to introduce multiple prior graphs into multi-omics data analysis. Specifically, the proposed method contains four parts: (1) a feature-level learning module to aggregate information from prior graphs; (2) a projection module to maximize the agreement among prior networks by optimizing a contrastive loss; (3) a sample-level module to learn a global representation from input multi-omics features; (4) a task-specific module to flexibly extend MPK-GNN for various downstream multi-omics analysis tasks. Finally, we verify the effectiveness of the proposed multi-omics learning algorithm on the cancer molecular subtype classification task. Experimental results show that MPK-GNN outperforms other state-of-the-art algorithms, including multi-view learning methods and multi-omics integrative approaches.
- Published
- 2023
- Full Text
- View/download PDF
14. Multi-modal graph neural network for early diagnosis of Alzheimer's disease from sMRI and PET scans.
- Author
-
Zhang Y, He X, Chan YH, Teng Q, and Rajapakse JC
- Subjects
- Humans, Magnetic Resonance Imaging methods, Neural Networks, Computer, Positron-Emission Tomography methods, Neuroimaging methods, Early Diagnosis, Alzheimer Disease diagnostic imaging
- Abstract
In recent years, deep learning models have been applied to neuroimaging data for early diagnosis of Alzheimer's disease (AD). Structural magnetic resonance imaging (sMRI) and positron emission tomography (PET) images provide structural and functional information about the brain, respectively. Combining these features leads to improved performance than using a single modality alone in building predictive models for AD diagnosis. However, current multi-modal approaches in deep learning, based on sMRI and PET, are mostly limited to convolutional neural networks, which do not facilitate integration of both image and phenotypic information of subjects. We propose to use graph neural networks (GNN) that are designed to deal with problems in non-Euclidean domains. In this study, we demonstrate how brain networks are created from sMRI or PET images and can be used in a population graph framework that combines phenotypic information with imaging features of the brain networks. Then, we present a multi-modal GNN framework where each modality has its own branch of GNN and a technique that combines the multi-modal data at both the level of node vectors and adjacency matrices. Finally, we perform late fusion to combine the preliminary decisions made in each branch and produce a final prediction. As multi-modality data becomes available, multi-source and multi-modal is the trend of AD diagnosis. We conducted explorative experiments based on multi-modal imaging data combined with non-imaging phenotypic information for AD diagnosis and analyzed the impact of phenotypic information on diagnostic performance. Results from experiments demonstrated that our proposed multi-modal approach improves performance for AD diagnosis. Our study also provides technical reference and support the need for multivariate multi-modal diagnosis methods., Competing Interests: Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2023 Elsevier Ltd. All rights reserved.)
- Published
- 2023
- Full Text
- View/download PDF
15. Deep learning and multi-omics approach to predict drug responses in cancer.
- Author
-
Wang C, Lye X, Kaalia R, Kumar P, and Rajapakse JC
- Subjects
- Humans, DNA Copy Number Variations, Mutation, Genomics, Deep Learning, Neoplasms drug therapy, Neoplasms genetics
- Abstract
Background: Cancers are genetically heterogeneous, so anticancer drugs show varying degrees of effectiveness on patients due to their differing genetic profiles. Knowing patient's responses to numerous cancer drugs are needed for personalized treatment for cancer. By using molecular profiles of cancer cell lines available from Cancer Cell Line Encyclopedia (CCLE) and anticancer drug responses available in the Genomics of Drug Sensitivity in Cancer (GDSC), we will build computational models to predict anticancer drug responses from molecular features., Results: We propose a novel deep neural network model that integrates multi-omics data available as gene expressions, copy number variations, gene mutations, reverse phase protein array expressions, and metabolomics expressions, in order to predict cellular responses to known anti-cancer drugs. We employ a novel graph embedding layer that incorporates interactome data as prior information for prediction. Moreover, we propose a novel attention layer that effectively combines different omics features, taking their interactions into account. The network outperformed feedforward neural networks and reported 0.90 for [Formula: see text] values for prediction of drug responses from cancer cell lines data available in CCLE and GDSC., Conclusion: The outstanding results of our experiments demonstrate that the proposed method is capable of capturing the interactions of genes and proteins, and integrating multi-omics features effectively. Furthermore, both the results of ablation studies and the investigations of the attention layer imply that gene mutation has a greater influence on the prediction of drug responses than other omics data types. Therefore, we conclude that our approach can not only predict the anti-cancer drug response precisely but also provides insights into reaction mechanisms of cancer cell lines and drugs as well., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
16. DrDimont: explainable drug response prediction from differential analysis of multi-omics networks.
- Author
-
Hiort P, Hugo J, Zeinert J, Müller N, Kashyap S, Rajapakse JC, Azuaje F, Renard BY, and Baum K
- Subjects
- Female, Humans, Proteomics, Receptors, Estrogen, Transcriptome, Breast Neoplasms drug therapy, Software
- Abstract
Motivation: While it has been well established that drugs affect and help patients differently, personalized drug response predictions remain challenging. Solutions based on single omics measurements have been proposed, and networks provide means to incorporate molecular interactions into reasoning. However, how to integrate the wealth of information contained in multiple omics layers still poses a complex problem., Results: We present DrDimont, Drug response prediction from Differential analysis of multi-omics networks. It allows for comparative conclusions between two conditions and translates them into differential drug response predictions. DrDimont focuses on molecular interactions. It establishes condition-specific networks from correlation within an omics layer that are then reduced and combined into heterogeneous, multi-omics molecular networks. A novel semi-local, path-based integration step ensures integrative conclusions. Differential predictions are derived from comparing the condition-specific integrated networks. DrDimont's predictions are explainable, i.e. molecular differences that are the source of high differential drug scores can be retrieved. We predict differential drug response in breast cancer using transcriptomics, proteomics, phosphosite and metabolomics measurements and contrast estrogen receptor positive and receptor negative patients. DrDimont performs better than drug prediction based on differential protein expression or PageRank when evaluating it on ground truth data from cancer cell lines. We find proteomic and phosphosite layers to carry most information for distinguishing drug response., Availability and Implementation: DrDimont is available on CRAN: https://cran.r-project.org/package=DrDimont., Supplementary Information: Supplementary data are available at Bioinformatics online., (© The Author(s) 2022. Published by Oxford University Press.)
- Published
- 2022
- Full Text
- View/download PDF
17. Network-based integration of multi-omics data for clinical outcome prediction in neuroblastoma.
- Author
-
Wang C, Lue W, Kaalia R, Kumar P, and Rajapakse JC
- Subjects
- Algorithms, Humans, Machine Learning, Neural Networks, Computer, Prognosis, Neuroblastoma genetics
- Abstract
Multi-omics data are increasingly being gathered for investigations of complex diseases such as cancer. However, high dimensionality, small sample size, and heterogeneity of different omics types pose huge challenges to integrated analysis. In this paper, we evaluate two network-based approaches for integration of multi-omics data in an application of clinical outcome prediction of neuroblastoma. We derive Patient Similarity Networks (PSN) as the first step for individual omics data by computing distances among patients from omics features. The fusion of different omics can be investigated in two ways: the network-level fusion is achieved using Similarity Network Fusion algorithm for fusing the PSNs derived for individual omics types; and the feature-level fusion is achieved by fusing the network features obtained from individual PSNs. We demonstrate our methods on two high-risk neuroblastoma datasets from SEQC project and TARGET project. We propose Deep Neural Network and Machine Learning methods with Recursive Feature Elimination as the predictor of survival status of neuroblastoma patients. Our results indicate that network-level fusion outperformed feature-level fusion for integration of different omics data whereas feature-level fusion is more suitable incorporating different feature types derived from same omics type. We conclude that the network-based methods are capable of handling heterogeneity and high dimensionality well in the integration of multi-omics., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
18. Decoding task specific and task general functional architectures of the brain.
- Author
-
Gupta S, Lim M, and Rajapakse JC
- Subjects
- Brain diagnostic imaging, Humans, Language, Magnetic Resonance Imaging methods, Nerve Net, Rest, Connectome methods
- Abstract
Functional magnetic resonance imaging (fMRI) is used to capture complex and dynamic interactions between brain regions while performing tasks. Task related alterations in the brain have been classified as task specific and task general, depending on whether they are particular to a task or common across multiple tasks. Using recent attempts in interpreting deep learning models, we propose an approach to determine both task specific and task general architectures of the functional brain. We demonstrate our methods with a reference-based decoder on deep learning classifiers trained on 12,500 rest and task fMRI samples from the Human Connectome Project (HCP). The decoded task general and task specific motor and language architectures were validated with findings from previous studies. We found that unlike intersubject variability that is characteristic of functional pathology of neurological diseases, a small set of connections are sufficient to delineate the rest and task states. The nodes and connections in the task general architecture could serve as potential disease biomarkers as alterations in task general brain modulations are known to be implicated in several neuropsychiatric disorders., (© 2022 The Authors. Human Brain Mapping published by Wiley Periodicals LLC.)
- Published
- 2022
- Full Text
- View/download PDF
19. Combining Neuroimaging and Omics Datasets for Disease Classification Using Graph Neural Networks.
- Author
-
Chan YH, Wang C, Soh WK, and Rajapakse JC
- Abstract
Both neuroimaging and genomics datasets are often gathered for the detection of neurodegenerative diseases. Huge dimensionalities of neuroimaging data as well as omics data pose tremendous challenge for methods integrating multiple modalities. There are few existing solutions that can combine both multi-modal imaging and multi-omics datasets to derive neurological insights. We propose a deep neural network architecture that combines both structural and functional connectome data with multi-omics data for disease classification. A graph convolution layer is used to model functional magnetic resonance imaging (fMRI) and diffusion tensor imaging (DTI) data simultaneously to learn compact representations of the connectome. A separate set of graph convolution layers are then used to model multi-omics datasets, expressed in the form of population graphs, and combine them with latent representations of the connectome. An attention mechanism is used to fuse these outputs and provide insights on which omics data contributed most to the model's classification decision. We demonstrate our methods for Parkinson's disease (PD) classification by using datasets from the Parkinson's Progression Markers Initiative (PPMI). PD has been shown to be associated with changes in the human connectome and it is also known to be influenced by genetic factors. We combine DTI and fMRI data with multi-omics data from RNA Expression, Single Nucleotide Polymorphism (SNP), DNA Methylation and non-coding RNA experiments. A Matthew Correlation Coefficient of greater than 0.8 over many combinations of multi-modal imaging data and multi-omics data was achieved with our proposed architecture. To address the paucity of paired multi-modal imaging data and the problem of imbalanced data in the PPMI dataset, we compared the use of oversampling against using CycleGAN on structural and functional connectomes to generate missing imaging modalities. Furthermore, we performed ablation studies that offer insights into the importance of each imaging and omics modality for the prediction of PD. Analysis of the generated attention matrices revealed that DNA Methylation and SNP data were the most important omics modalities out of all the omics datasets considered. Our work motivates further research into imaging genetics and the creation of more multi-modal imaging and multi-omics datasets to study PD and other complex neurodegenerative diseases., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2022 Chan, Wang, Soh and Rajapakse.)
- Published
- 2022
- Full Text
- View/download PDF
20. Graph embeddings on gene ontology annotations for protein-protein interaction prediction.
- Author
-
Zhong X and Rajapakse JC
- Subjects
- Animals, Area Under Curve, Computational Biology methods, Humans, Mice, ROC Curve, Saccharomyces cerevisiae genetics, Task Performance and Analysis, Gene Ontology, Molecular Sequence Annotation, Protein Interaction Mapping methods
- Abstract
Background: Protein-protein interaction (PPI) prediction is an important task towards the understanding of many bioinformatics functions and applications, such as predicting protein functions, gene-disease associations and disease-drug associations. However, many previous PPI prediction researches do not consider missing and spurious interactions inherent in PPI networks. To address these two issues, we define two corresponding tasks, namely missing PPI prediction and spurious PPI prediction, and propose a method that employs graph embeddings that learn vector representations from constructed Gene Ontology Annotation (GOA) graphs and then use embedded vectors to achieve the two tasks. Our method leverages on information from both term-term relations among GO terms and term-protein annotations between GO terms and proteins, and preserves properties of both local and global structural information of the GO annotation graph., Results: We compare our method with those methods that are based on information content (IC) and one method that is based on word embeddings, with experiments on three PPI datasets from STRING database. Experimental results demonstrate that our method is more effective than those compared methods., Conclusion: Our experimental results demonstrate the effectiveness of using graph embeddings to learn vector representations from undirected GOA graphs for our defined missing and spurious PPI tasks.
- Published
- 2020
- Full Text
- View/download PDF
21. Iterative consensus spectral clustering improves detection of subject and group level brain functional modules.
- Author
-
Gupta S and Rajapakse JC
- Abstract
Specialized processing in the brain is performed by multiple groups of brain regions organized as functional modules. Although, in vivo studies of brain functional modules involve multiple functional Magnetic Resonance Imaging (fMRI) scans, the methods used to derive functional modules from functional networks of the brain ignore individual differences in the functional architecture and use incomplete functional connectivity information. To correct this, we propose an Iterative Consensus Spectral Clustering (ICSC) algorithm that detects the most representative modules from individual dense weighted connectivity matrices derived from multiple scans. The ICSC algorithm derives group-level modules from modules of multiple individuals by iteratively minimizing the consensus-cost between the two. We demonstrate that the ICSC algorithm can be used to derive biologically plausible group-level (for multiple subjects) and subject-level (for multiple subject scans) brain modules, using resting-state fMRI scans of 589 subjects from the Human Connectome Project. We employed a multipronged strategy to show the validity of the modularizations obtained from the ICSC algorithm. We show a heterogeneous variability in the modular structure across subjects where modules involved in visual and motor processing were highly stable across subjects. Conversely, we found a lower variability across scans of the same subject. The performance of our algorithm was compared with existing functional brain modularization methods and we show that our method detects group-level modules that are more representative of the modules of multiple individuals. Finally, the experiments on synthetic images quantitatively demonstrate that the ICSC algorithm detects group-level and subject-level modules accurately under varied conditions. Therefore, besides identifying functional modules for a population of subjects, the proposed method can be used for applications in personalized neuroscience. The ICSC implementation is available at https://github.com/SCSE-Biomedical-Computing-Group/ICSC.
- Published
- 2020
- Full Text
- View/download PDF
22. Ambivert degree identifies crucial brain functional hubs and improves detection of Alzheimer's Disease and Autism Spectrum Disorder.
- Author
-
Gupta S, Rajapakse JC, and Welsch RE
- Subjects
- Adolescent, Adult, Aged, Cerebral Cortex physiopathology, Child, Humans, Magnetic Resonance Imaging, Nerve Net physiopathology, Young Adult, Alzheimer Disease diagnostic imaging, Alzheimer Disease physiopathology, Autism Spectrum Disorder diagnostic imaging, Autism Spectrum Disorder physiopathology, Cerebral Cortex diagnostic imaging, Connectome methods, Deep Learning, Nerve Net diagnostic imaging
- Abstract
Functional modules in the human brain support its drive for specialization whereas brain hubs act as focal points for information integration. Brain hubs are brain regions that have a large number of both within and between module connections. We argue that weak connections in brain functional networks lead to misclassification of brain regions as hubs. In order to resolve this, we propose a new measure called ambivert degree that considers the node's degree as well as connection weights in order to identify nodes with both high degree and high connection weights as hubs. Using resting-state functional MRI scans from the Human Connectome Project, we show that ambivert degree identifies brain hubs that are not only crucial but also invariable across subjects. We hypothesize that nodal measures based on ambivert degree can be effectively used to classify patients from healthy controls for diseases that are known to have widespread hub disruption. Using patient data for Alzheimer's Disease and Autism Spectrum Disorder, we show that the hubs in the patient and healthy groups are very different for both the diseases and deep feedforward neural networks trained on nodal hub features lead to a significantly higher classification accuracy with significantly fewer trainable weights compared to using functional connectivity features. Thus, the ambivert degree improves identification of crucial brain hubs in healthy subjects and can be used as a diagnostic feature to detect neurological diseases characterized by hub disruption., Competing Interests: Declaration of Competing Interest The authors declare that they have no competing interests., (Copyright © 2020 The Authors. Published by Elsevier Inc. All rights reserved.)
- Published
- 2020
- Full Text
- View/download PDF
23. GO2Vec: transforming GO terms and proteins to vector representations via graph embeddings.
- Author
-
Zhong X, Kaalia R, and Rajapakse JC
- Subjects
- Humans, Saccharomyces cerevisiae Proteins metabolism, Gene Ontology, Protein Interaction Mapping methods
- Abstract
Background: Semantic similarity between Gene Ontology (GO) terms is a fundamental measure for many bioinformatics applications, such as determining functional similarity between genes or proteins. Most previous research exploited information content to estimate the semantic similarity between GO terms; recently some research exploited word embeddings to learn vector representations for GO terms from a large-scale corpus. In this paper, we proposed a novel method, named GO2Vec, that exploits graph embeddings to learn vector representations for GO terms from GO graph. GO2Vec combines the information from both GO graph and GO annotations, and its learned vectors can be applied to a variety of bioinformatics applications, such as calculating functional similarity between proteins and predicting protein-protein interactions., Results: We conducted two kinds of experiments to evaluate the quality of GO2Vec: (1) functional similarity between proteins on the Collaborative Evaluation of GO-based Semantic Similarity Measures (CESSM) dataset and (2) prediction of protein-protein interactions on the Yeast and Human datasets from the STRING database. Experimental results demonstrate the effectiveness of GO2Vec over the information content-based measures and the word embedding-based measures., Conclusion: Our experimental results demonstrate the effectiveness of using graph embeddings to learn vector representations from undirected GO and GOA graphs. Our results also demonstrate that GO annotations provide useful information for computing the similarity between GO terms and between proteins.
- Published
- 2019
- Full Text
- View/download PDF
24. Refining modules to determine functionally significant clusters in molecular networks.
- Author
-
Kaalia R and Rajapakse JC
- Subjects
- Cluster Analysis, Humans, Algorithms, Protein Interaction Mapping methods
- Abstract
Background: Module detection algorithms relying on modularity maximization suffer from an inherent resolution limit that hinders detection of small topological modules, especially in molecular networks where most biological processes are believed to form small and compact communities. We propose a novel modular refinement approach that helps finding functionally significant modules of molecular networks., Results: The module refinement algorithm improves the quality of topological modules in protein-protein interaction networks by finding biologically functionally significant modules. The algorithm is based on the fact that functional modules in biology do not necessarily represent those corresponding to maximum modularity. Larger modules corresponding to maximal modularity are incrementally re-modularized again under specific constraints so that smaller yet topologically and biologically valid modules are recovered. We show improvement in quality and functional coverage of modules using experiments on synthetic and real protein-protein interaction networks. We also compare our results with six existing methods available for clustering biological networks., Conclusion: The proposed algorithm finds smaller but functionally relevant modules that are undetected by classical quality maximization approaches for modular detection. The refinement procedure helps to detect more functionally enriched modules in protein-protein interaction networks, which are also more coherent with functionally characterised gene sets.
- Published
- 2019
- Full Text
- View/download PDF
25. A deep neural network approach to predicting clinical outcomes of neuroblastoma patients.
- Author
-
Tranchevent LC, Azuaje F, and Rajapakse JC
- Subjects
- Gene Expression Profiling, Humans, Neuroblastoma genetics, Prognosis, Computational Biology methods, Deep Learning, Neuroblastoma diagnosis
- Abstract
Background: The availability of high-throughput omics datasets from large patient cohorts has allowed the development of methods that aim at predicting patient clinical outcomes, such as survival and disease recurrence. Such methods are also important to better understand the biological mechanisms underlying disease etiology and development, as well as treatment responses. Recently, different predictive models, relying on distinct algorithms (including Support Vector Machines and Random Forests) have been investigated. In this context, deep learning strategies are of special interest due to their demonstrated superior performance over a wide range of problems and datasets. One of the main challenges of such strategies is the "small n large p" problem. Indeed, omics datasets typically consist of small numbers of samples and large numbers of features relative to typical deep learning datasets. Neural networks usually tackle this problem through feature selection or by including additional constraints during the learning process., Methods: We propose to tackle this problem with a novel strategy that relies on a graph-based method for feature extraction, coupled with a deep neural network for clinical outcome prediction. The omics data are first represented as graphs whose nodes represent patients, and edges represent correlations between the patients' omics profiles. Topological features, such as centralities, are then extracted from these graphs for every node. Lastly, these features are used as input to train and test various classifiers., Results: We apply this strategy to four neuroblastoma datasets and observe that models based on neural networks are more accurate than state of the art models (DNN: 85%-87%, SVM/RF: 75%-82%). We explore how different parameters and configurations are selected in order to overcome the effects of the small data problem as well as the curse of dimensionality., Conclusions: Our results indicate that the deep neural networks capture complex features in the data that help predicting patient clinical outcomes.
- Published
- 2019
- Full Text
- View/download PDF
26. Analysis of correlation-based biomolecular networks from different omics data by fitting stochastic block models.
- Author
-
Baum K, Rajapakse JC, and Azuaje F
- Abstract
Background: Biological entities such as genes, promoters, mRNA, metabolites or proteins do not act alone, but in concert in their network context. Modules, i.e., groups of nodes with similar topological properties in these networks characterize important biological functions of the underlying biomolecular system. Edges in such molecular networks represent regulatory and physical interactions, and comparing them between conditions provides valuable information on differential molecular mechanisms. However, biological data is inherently noisy and network reduction techniques can propagate errors particularly to the level of edges. We aim to improve the analysis of networks of biological molecules by deriving modules together with edge relevance estimations that are based on global network characteristics. Methods: We propose to fit the networks to stochastic block models (SBM), a method that has not yet been investigated for the analysis of biomolecular networks. This procedure both delivers modules of the networks and enables the derivation of edge confidence scores. We apply it to correlation-based networks of breast cancer data originating from high-throughput measurements of diverse molecular layers such as transcriptomics, proteomics, and metabolomics. The networks were reduced by thresholding for correlation significance or by requirements on scale-freeness. Results and discussion: We find that the networks are best represented by the hierarchical version of the SBM, and many of the predicted blocks have a biological meaning according to functional annotation. The edge confidence scores are overall in concordance with the biological evidence given by the measurements. As they are based on global network connectivity characteristics and potential hierarchies within the biomolecular networks are taken into account, they could be used as additional, integrated features in network-based data comparisons. Their tight relationship to edge existence probabilities can be exploited to predict missing or spurious edges in order to improve the network representation of the underlying biological system., Competing Interests: No competing interests were disclosed.
- Published
- 2019
- Full Text
- View/download PDF
27. Fusing gene expressions and transitive protein-protein interactions for inference of gene regulatory networks.
- Author
-
Liu W and Rajapakse JC
- Subjects
- Bayes Theorem, Normal Distribution, Gene Expression Profiling, Gene Regulatory Networks, Protein Interaction Maps, Systems Biology methods
- Abstract
Background: Systematic fusion of multiple data sources for Gene Regulatory Networks (GRN) inference remains a key challenge in systems biology. We incorporate information from protein-protein interaction networks (PPIN) into the process of GRN inference from gene expression (GE) data. However, existing PPIN remain sparse and transitive protein interactions can help predict missing protein interactions. We therefore propose a systematic probabilistic framework on fusing GE data and transitive protein interaction data to coherently build GRN., Results: We use a Gaussian Mixture Model (GMM) to soft-cluster GE data, allowing overlapping cluster memberships. Next, a heuristic method is proposed to extend sparse PPIN by incorporating transitive linkages. We then propose a novel way to score extended protein interactions by combining topological properties of PPIN and correlations of GE. Following this, GE data and extended PPIN are fused using a Gaussian Hidden Markov Model (GHMM) in order to identify gene regulatory pathways and refine interaction scores that are then used to constrain the GRN structure. We employ a Bayesian Gaussian Mixture (BGM) model to refine the GRN derived from GE data by using the structural priors derived from GHMM. Experiments on real yeast regulatory networks demonstrate both the feasibility of the extended PPIN in predicting transitive protein interactions and its effectiveness on improving the coverage and accuracy the proposed method of fusing PPIN and GE to build GRN., Conclusion: The GE and PPIN fusion model outperforms both the state-of-the-art single data source models (CLR, GENIE3, TIGRESS) as well as existing fusion models under various constraints.
- Published
- 2019
- Full Text
- View/download PDF
28. Functional homogeneity and specificity of topological modules in human proteome.
- Author
-
Kaalia R and Rajapakse JC
- Subjects
- Algorithms, Humans, Reproducibility of Results, Protein Interaction Maps, Proteome metabolism
- Abstract
Background: Functional modules in protein-protein interaction networks (PPIN) are defined by maximal sets of functionally associated proteins and are vital to understanding cellular mechanisms and identifying disease associated proteins. Topological modules of the human proteome have been shown to be related to functional modules of PPIN. However, the effects of the weights of interactions between protein pairs and the integration of physical (direct) interactions with functional (indirect expression-based) interactions have not been investigated in the detection of functional modules of the human proteome., Results: We investigated functional homogeneity and specificity of topological modules of the human proteome and validated them with known biological and disease pathways. Specifically, we determined the effects on functional homogeneity and heterogeneity of topological modules (i) with both physical and functional protein-protein interactions; and (ii) with incorporation of functional similarities between proteins as weights of interactions. With functional enrichment analyses and a novel measure for functional specificity, we evaluated functional relevance and specificity of topological modules of the human proteome., Conclusions: The topological modules ranked using specificity scores show high enrichment with gene sets of known functions. Physical interactions in PPIN contribute to high specificity of the topological modules of the human proteome whereas functional interactions contribute to high homogeneity of the modules. Weighted networks result in more number of topological modules but did not affect their functional propensity. Modules of human proteome are more homogeneous for molecular functions than biological processes.
- Published
- 2019
- Full Text
- View/download PDF
29. Human Wharton's Jelly Mesenchymal Stem Cells Show Unique Gene Expression Compared with Bone Marrow Mesenchymal Stem Cells Using Single-Cell RNA-Sequencing.
- Author
-
Barrett AN, Fong CY, Subramanian A, Liu W, Feng Y, Choolani M, Biswas A, Rajapakse JC, and Bongso A
- Subjects
- Cells, Cultured, Gene Expression Profiling, Humans, Wharton Jelly cytology, Bone Marrow Cells metabolism, Mesenchymal Stem Cells metabolism, Single-Cell Analysis, Transcriptome
- Abstract
Human Wharton's jelly stem cells (hWJSCs) isolated from the human umbilical cord are a unique population of mesenchymal stem cells (MSCs) with significant clinical utility. Their broad differentiation potential, high rate of proliferation, ready availability from discarded cords, and prolonged maintenance of stemness properties in culture make them an attractive alternative source of MSCs with therapeutic value compared with human bone marrow MSCs (hBMMSCs). We aimed to characterize the differences in gene expression profiles between these two stem cell types using single-cell RNA sequencing (scRNA-Seq) to determine which pathways are involved in conferring hWJSCs with their unique properties. We identified 436 significantly differentially expressed genes between the two cell types, playing roles in processes, including immunomodulation, angiogenesis, wound healing, apoptosis, antitumor activity, and chemotaxis. Expression of immune molecules is particularly high in hWJSCs compared with hBMMSCs. These differences in gene expression may help to explain many of the advantages that hWJSCs have over hBMMSCs for clinical application. Although cell surface protein marker expression indicates that isolated hWJSCs and hBMMSCs are both homogenous populations, using scRNA-Seq we can clearly identify extreme variability in expression levels between individual cells within a certain cell type. If the cells are examined as bulk populations, it is not possible to appreciate that a single cell may be making a major unique contribution to the apparent overall expression level. We demonstrated how the fine tuning of expression within hWJSCs and hBMMSCs may be achieved by expression of molecules with opposing function between two cells. We hypothesize that a greater understanding of these differences in gene expression between the two cell types may aid in the development of new therapies using hWJSCs.
- Published
- 2019
- Full Text
- View/download PDF
30. Computational Analysis of Protein-Protein Interactions in Motile T-Cells.
- Author
-
Kumar S, Fazil MHUT, Ahmad K, Tripathy M, Rajapakse JC, and Verma NK
- Subjects
- Glycogen Synthase Kinase 3 beta chemistry, Glycogen Synthase Kinase 3 beta metabolism, Humans, Models, Molecular, Molecular Docking Simulation, Molecular Dynamics Simulation, Protein Conformation, Protein Interaction Domains and Motifs, Receptor, Notch1 chemistry, Receptor, Notch1 metabolism, T-Lymphocytes cytology, Cell Movement, Computational Biology methods, T-Lymphocytes metabolism
- Abstract
Analysis of protein-protein interactions is important for better understanding of molecular mechanisms involved in immune regulation and has potential for elaborating avenues for drug discovery targeting T-cell motility. Currently, only a small fraction of protein-protein interactions have been characterized in T-lymphocytes although there are several detection methods available. In this regard, computational approaches garner importance, with the continued explosion of genomic and proteomic data, for handling protein modeling and protein-protein interactions in large scale. Here, we describe a computational method to identify protein-protein interactions based on in silico protein design.
- Published
- 2019
- Full Text
- View/download PDF
31. Gene Ontology Enrichment Improves Performances of Functional Similarity of Genes.
- Author
-
Liu W, Liu J, and Rajapakse JC
- Subjects
- Base Sequence genetics, Datasets as Topic, Gene Expression Profiling methods, Molecular Sequence Annotation methods, Protein Interaction Mapping methods, Semantics, Computational Biology methods, Gene Ontology
- Abstract
There exists a plethora of measures to evaluate functional similarity (FS) between genes, which is a widely used in many bioinformatics applications including detecting molecular pathways, identifying co-expressed genes, predicting protein-protein interactions, and prioritization of disease genes. Measures of FS between genes are mostly derived from Information Contents (IC) of Gene Ontology (GO) terms annotating the genes. However, existing measures evaluating IC of terms based either on the representations of terms in the annotating corpus or on the knowledge embedded in the GO hierarchy do not consider the enrichment of GO terms by the querying pair of genes. The enrichment of a GO term by a pair of gene is dependent on whether the term is annotated by one gene (i.e., partial annotation) or by both genes (i.e. complete annotation) in the pair. In this paper, we propose a method that incorporate enrichment of GO terms by a gene pair in computing their FS and show that GO enrichment improves the performances of 46 existing FS measures in the prediction of sequence homologies, gene expression correlations, protein-protein interactions, and disease associated genes.
- Published
- 2018
- Full Text
- View/download PDF
32. Predicting clinical outcome of neuroblastoma patients using an integrative network-based approach.
- Author
-
Tranchevent LC, Nazarov PV, Kaoma T, Schmartz GP, Muller A, Kim SY, Rajapakse JC, and Azuaje F
- Subjects
- Algorithms, Gene Regulatory Networks genetics, Gene Regulatory Networks physiology, Genomics methods, Humans, Computational Biology methods, Neuroblastoma genetics
- Abstract
Background: One of the main current challenges in computational biology is to make sense of the huge amounts of multidimensional experimental data that are being produced. For instance, large cohorts of patients are often screened using different high-throughput technologies, effectively producing multiple patient-specific molecular profiles for hundreds or thousands of patients., Results: We propose and implement a network-based method that integrates such patient omics data into Patient Similarity Networks. Topological features derived from these networks were then used to predict relevant clinical features. As part of the 2017 CAMDA challenge, we have successfully applied this strategy to a neuroblastoma dataset, consisting of genomic and transcriptomic data. In particular, we observe that models built on our network-based approach perform at least as well as state of the art models. We furthermore explore the effectiveness of various topological features and observe, for instance, that redundant centrality metrics can be combined to build more powerful models., Conclusion: We demonstrate that the networks inferred from omics data contain clinically relevant information and that patient clinical outcomes can be predicted using only network topological data., Reviewers: This article was reviewed by Yang-Yu Liu, Tomislav Smuc and Isabel Nepomuceno.
- Published
- 2018
- Full Text
- View/download PDF
33. Profiling heterogeneity of Alzheimer's disease using white-matter impairment factors.
- Author
-
Sui X and Rajapakse JC
- Subjects
- Aged, Aged, 80 and over, Cognitive Dysfunction pathology, Corpus Callosum pathology, Diffusion Tensor Imaging methods, Disease Progression, Executive Function physiology, Female, Humans, Magnetic Resonance Imaging methods, Male, Middle Aged, Neuropsychological Tests, Alzheimer Disease pathology, Atrophy pathology, Gray Matter pathology, White Matter pathology
- Abstract
The clinical presentation of Alzheimer's disease (AD) is not unitary as heterogeneity exists in the disease's clinical and anatomical characteristics. MRI studies have revealed that heterogeneous gray matter atrophy patterns are associated with specific traits of cognitive decline. Although white matter (WM) impairment also contributes to AD pathology, its heterogeneity remains unclear. The Latent Dirichlet Allocation (LDA) method is a suitable framework to study heterogeneity and allows to identify latent impairment factors of AD instead of simply mapping an overall disease effect. By exploring whole brain WM skeleton images by using LDA, three latent factors were revealed in AD: a temporal-frontal impairment factor (temporal and frontal lobes, especially hippocampus and para-hippocampus), a parietal factor (parietal lobe, especially precuneus), and a long fibre bundle factor (corpus callosum and superior longitudinal fasciculus). As revealed by longitudinal analysis, the latent factors have distinct impact on cognitive decline: for executive function (EF), the temporal-frontal factor was more strongly associated with baseline EF compared with the parietal factor, while the long-fibre bundle factor was most associated with decline rate of EF; for memory, the three factors showed almost equal effect on the baseline memory and decline rate. For each participant, LDA estimates his/her composition profile of latent impairment factors, which indicates disease subtype. We also found that the APOE genotype affects the AD subtype. Specifically, APOE ε4 was more associated with the long fibre bundle factor and APOE ε2 was more associated with temporal-frontal factor. By investigating heterogeneity and subtypes of AD through white matter impairment factors, our study could facilitate precision medicine., (Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.)
- Published
- 2018
- Full Text
- View/download PDF
34. MultiDCoX: Multi-factor analysis of differential co-expression.
- Author
-
Liany H, Rajapakse JC, and Karuturi RKM
- Subjects
- Breast Neoplasms genetics, Chemokine CXCL13 genetics, Computer Simulation, Female, Gene Expression Profiling, Gene Expression Regulation, Neoplastic, Humans, Matrix Metalloproteinase 1 genetics, Mutation genetics, Receptors, Estrogen metabolism, Survival Analysis, Tumor Suppressor Protein p53 genetics, Algorithms, Factor Analysis, Statistical, Gene Expression Regulation, Gene Regulatory Networks
- Abstract
Background: Differential co-expression (DCX) signifies change in degree of co-expression of a set of genes among different biological conditions. It has been used to identify differential co-expression networks or interactomes. Many algorithms have been developed for single-factor differential co-expression analysis and applied in a variety of studies. However, in many studies, the samples are characterized by multiple factors such as genetic markers, clinical variables and treatments. No algorithm or methodology is available for multi-factor analysis of differential co-expression., Results: We developed a novel formulation and a computationally efficient greedy search algorithm called MultiDCoX to perform multi-factor differential co-expression analysis. Simulated data analysis demonstrates that the algorithm can effectively elicit differentially co-expressed (DCX) gene sets and quantify the influence of each factor on co-expression. MultiDCoX analysis of a breast cancer dataset identified interesting biologically meaningful differentially co-expressed (DCX) gene sets along with genetic and clinical factors that influenced the respective differential co-expression., Conclusions: MultiDCoX is a space and time efficient procedure to identify differentially co-expressed gene sets and successfully identify influence of individual factors on differential co-expression.
- Published
- 2017
- Full Text
- View/download PDF
35. Mixed Spectrum Analysis on fMRI Time-Series.
- Author
-
Kumar A, Lin F, and Rajapakse JC
- Subjects
- Algorithms, Brain diagnostic imaging, Brain physiology, Humans, Memory physiology, ROC Curve, Brain Mapping methods, Image Processing, Computer-Assisted methods, Magnetic Resonance Imaging methods
- Abstract
Temporal autocorrelation present in functional magnetic resonance image (fMRI) data poses challenges to its analysis. The existing approaches handling autocorrelation in fMRI time-series often presume a specific model of autocorrelation such as an auto-regressive model. The main limitation here is that the correlation structure of voxels is generally unknown and varies in different brain regions because of different levels of neurogenic noises and pulsatile effects. Enforcing a universal model on all brain regions leads to bias and loss of efficiency in the analysis. In this paper, we propose the mixed spectrum analysis of the voxel time-series to separate the discrete component corresponding to input stimuli and the continuous component carrying temporal autocorrelation. A mixed spectral analysis technique based on M-spectral estimator is proposed, which effectively removes autocorrelation effects from voxel time-series and identify significant peaks of the spectrum. As the proposed method does not assume any prior model for the autocorrelation effect in voxel time-series, varying correlation structure among the brain regions does not affect its performance. We have modified the standard M-spectral method for an application on a spatial set of time-series by incorporating the contextual information related to the continuous spectrum of neighborhood voxels, thus reducing considerably the computation cost. Likelihood of the activation is predicted by comparing the amplitude of discrete component at stimulus frequency of voxels across the brain by using normal distribution and modeling spatial correlations among the likelihood with a conditional random field. We also demonstrate the application of the proposed method in detecting other desired frequencies.
- Published
- 2016
- Full Text
- View/download PDF
36. Quantification of liver fibrosis via second harmonic imaging of the Glisson's capsule from liver surface.
- Author
-
Xu S, Kang CH, Gou X, Peng Q, Yan J, Zhuo S, Cheng CL, He Y, Kang Y, Xia W, So PT, Welsch R, Rajapakse JC, and Yu H
- Subjects
- Animals, Collagen metabolism, Liver metabolism, Liver pathology, Liver Cirrhosis metabolism, Liver Cirrhosis pathology, Male, Rats, Rats, Wistar, Surface Properties, Liver diagnostic imaging, Liver Cirrhosis diagnostic imaging, Microscopy
- Abstract
Liver surface is covered by a collagenous layer called the Glisson's capsule. The structure of the Glisson's capsule is barely seen in the biopsy samples for histology assessment, thus the changes of the collagen network from the Glisson's capsule during the liver disease progression are not well studied. In this report, we investigated whether non-linear optical imaging of the Glisson's capsule at liver surface would yield sufficient information to allow quantitative staging of liver fibrosis. In contrast to conventional tissue sections whereby tissues are cut perpendicular to the liver surface and interior information from the liver biopsy samples were used, we have established a capsule index based on significant parameters extracted from the second harmonic generation (SHG) microscopy images of capsule collagen from anterior surface of rat livers. Thioacetamide (TAA) induced liver fibrosis animal models was used in this study. The capsule index is capable of differentiating different fibrosis stages, with area under receiver operating characteristics curve (AUC) up to 0.91, making it possible to quantitatively stage liver fibrosis via liver surface imaging potentially with endomicroscopy., (© 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.)
- Published
- 2016
- Full Text
- View/download PDF
37. Gene and sample selection using T-score with sample selection.
- Author
-
Mundra PA and Rajapakse JC
- Subjects
- Algorithms, Databases, Genetic, Logistic Models, Computational Biology methods, Gene Expression Profiling methods, Support Vector Machine
- Abstract
Gene selection from high-dimensional microarray gene-expression data is statistically a challenging problem. Filter approaches to gene selection have been popular because of their simplicity, efficiency, and accuracy. Due to small sample size, all samples are generally used to compute relevant ranking statistics and selection of samples in filter-based gene selection methods has not been addressed. In this paper, we extend previously-proposed simultaneous sample and gene selection approach. In a backward elimination method, a modified logistic regression loss function is used to select relevant samples at each iteration, and these samples are used to compute the T-score to rank genes. This method provides a compromise solution between T-score and other support vector machine (SVM) based algorithms. The performance is demonstrated on both simulated and real datasets with criteria such as classification performance, stability and redundancy. Results indicate that computational complexity and stability of the method are improved compared to SVM based methods without compromising the classification performance., (Copyright © 2015 Elsevier Inc. All rights reserved.)
- Published
- 2016
- Full Text
- View/download PDF
38. Aging exacerbates damage and delays repair of alveolar epithelia following influenza viral pneumonia.
- Author
-
Yin L, Zheng D, Limmon GV, Leung NH, Xu S, Rajapakse JC, Yu H, Chow VT, and Chen J
- Subjects
- Age Factors, Alveolar Epithelial Cells drug effects, Alveolar Epithelial Cells virology, Animals, Antiviral Agents pharmacology, Cell Proliferation, Disease Models, Animal, Female, Influenza A Virus, H1N1 Subtype drug effects, Mice, Inbred C57BL, Orthomyxoviridae Infections drug therapy, Orthomyxoviridae Infections physiopathology, Orthomyxoviridae Infections virology, Oseltamivir pharmacology, Pneumonia, Viral drug therapy, Pneumonia, Viral physiopathology, Pneumonia, Viral virology, Pulmonary Alveoli drug effects, Pulmonary Alveoli physiopathology, Pulmonary Alveoli virology, Regeneration, Risk Factors, Time Factors, Viral Load, Aging pathology, Alveolar Epithelial Cells pathology, Influenza A Virus, H1N1 Subtype pathogenicity, Orthomyxoviridae Infections pathology, Pneumonia, Viral pathology, Pulmonary Alveoli pathology
- Abstract
Background: Influenza virus infection causes significantly higher levels of morbidity and mortality in the elderly. Studies have shown that impaired immunity in the elderly contributes to the increased susceptibility to influenza virus infection, however, how aging affects the lung tissue damage and repair has not been completely elucidated., Methods: Aged (16-18 months old) and young (2-3 months old) mice were infected with influenza virus intratracheally. Body weight and mortality were monitored. Different days after infection, lung sections were stained to estimate the overall lung tissue damage and for club cells, pro-SPC+ bronchiolar epithelial cells, alveolar type I and II cells to quantify their frequencies using automated image analysis algorithms., Results: Following influenza infection, aged mice lose more weight and die from otherwise sub-lethal influenza infection in young mice. Although there is no difference in damage and regeneration of club cells between the young and the aged mice, damage to alveolar type I and II cells (AT1s and AT2s) is exacerbated, and regeneration of AT2s and their precursors (pro-SPC-positive bronchiolar epithelial cells) is significantly delayed in the aged mice. We further show that oseltamivir treatment reduces virus load and lung damage, and promotes pulmonary recovery from infection in the aged mice., Conclusions: These findings show that aging increases susceptibility of the distal lung epithelium to influenza infection and delays the emergence of pro-SPC positive progenitor cells during the repair process. Our findings also shed light on possible approaches to enhance the clinical management of severe influenza pneumonia in the elderly.
- Published
- 2014
- Full Text
- View/download PDF
39. qFibrosis: a fully-quantitative innovative method incorporating histological features to facilitate accurate fibrosis scoring in animal model and chronic hepatitis B patients.
- Author
-
Xu S, Wang Y, Tai DCS, Wang S, Cheng CL, Peng Q, Yan J, Chen Y, Sun J, Liang X, Zhu Y, Rajapakse JC, Welsch RE, So PTC, Wee A, Hou J, and Yu H
- Subjects
- Animals, Biopsy, Collagen analysis, Disease Models, Animal, Humans, Liver pathology, Liver Cirrhosis, Experimental pathology, Rats, Hepatitis B, Chronic complications, Liver Cirrhosis, Experimental diagnosis
- Abstract
Background & Aims: There is increasing need for accurate assessment of liver fibrosis/cirrhosis. We aimed to develop qFibrosis, a fully-automated assessment method combining quantification of histopathological architectural features, to address unmet needs in core biopsy evaluation of fibrosis in chronic hepatitis B (CHB) patients., Methods: qFibrosis was established as a combined index based on 87 parameters of architectural features. Images acquired from 25 Thioacetamide-treated rat samples and 162 CHB core biopsies were used to train and test qFibrosis and to demonstrate its reproducibility. qFibrosis scoring was analyzed employing Metavir and Ishak fibrosis staging as standard references, and collagen proportionate area (CPA) measurement for comparison., Results: qFibrosis faithfully and reliably recapitulates Metavir fibrosis scores, as it can identify differences between all stages in both animal samples (p<0.001) and human biopsies (p<0.05). It is robust to sampling size, allowing for discrimination of different stages in samples of different sizes (area under the curve (AUC): 0.93-0.99 for animal samples: 1-16 mm(2); AUC: 0.84-0.97 for biopsies: 10-44 mm in length). qFibrosis can significantly predict staging underestimation in suboptimal biopsies (<15 mm) and under- and over-scoring by different pathologists (p<0.001). qFibrosis can also differentiate between Ishak stages 5 and 6 (AUC: 0.73, p=0.008), suggesting the possibility of monitoring intra-stage cirrhosis changes. Best of all, qFibrosis demonstrates superior performance to CPA on all counts., Conclusions: qFibrosis can improve fibrosis scoring accuracy and throughput, thus allowing for reproducible and reliable analysis of efficacies of anti-fibrotic therapies in clinical research and practice., (Copyright © 2014 European Association for the Study of the Liver. Published by Elsevier B.V. All rights reserved.)
- Published
- 2014
- Full Text
- View/download PDF
40. Identification of a new export signal in Plasmodium yoelii: identification of a new exportome.
- Author
-
Siau A, Huang X, Yam XY, Bob NS, Sun H, Rajapakse JC, Renia L, and Preiser PR
- Subjects
- Amino Acid Sequence, Protein Conformation, Protein Transport, Protozoan Proteins chemistry, Protozoan Proteins genetics, Plasmodium yoelii genetics, Plasmodium yoelii metabolism, Protein Sorting Signals, Protozoan Proteins metabolism
- Abstract
Development of the erythrocytic malaria parasite requires targeting of parasite proteins into multiple compartments located within and beyond the parasite confine. Beyond the PEXEL/VTS pathway and its characterized players, increasing amount of evidence has highlighted the existence of proteins exported using alternative export-signal(s)/pathway(s); hence, the exportomes currently predicted are incomplete. The nature of these exported proteins which could have a prominent role in most of the Plasmodium species remains elusive. Using P. yoelii variant proteins, we identified a signal associated to lipophilic region that mediates export of P. yoelii proteins. This non-PEXEL signal termed PLASMED is defined by semi-conserved residues and possibly a secondary structure. In vivo characterization of exported-proteins indicated that PLASMED is a bona fide export-signal that allowed us to identify an unseen P. yoelii exportome. The repertoire of the newly predicted exported proteins opens up perspectives for unravelling the remodelling of the host-cell by the parasite, against which new therapies could be elaborated., (© 2014 John Wiley & Sons Ltd.)
- Published
- 2014
- Full Text
- View/download PDF
41. Extracting rate changes in transcriptional regulation from MEDLINE abstracts.
- Author
-
Liu W, Miao K, Li G, Chang K, Zheng J, and Rajapakse JC
- Subjects
- Artificial Intelligence, Biological Ontologies, Gene Regulatory Networks, Humans, Knowledge Bases, MEDLINE, PubMed, Time Factors, Data Mining methods, Gene Expression Regulation, Transcription, Genetic
- Abstract
Background: Time delays are important factors that are often neglected in gene regulatory network (GRN) inference models. Validating time delays from knowledge bases is a challenge since the vast majority of biological databases do not record temporal information of gene regulations. Biological knowledge and facts on gene regulations are typically extracted from bio-literature with specialized methods that depend on the regulation task. In this paper, we mine evidences for time delays related to the transcriptional regulation of yeast from the PubMed abstracts., Results: Since the vast majority of abstracts lack quantitative time information, we can only collect qualitative evidences of time delays. Specifically, the speed-up or delay in transcriptional regulation rate can provide evidences for time delays (shorter or longer) in GRN. Thus, we focus on deriving events related to rate changes in transcriptional regulation. A corpus of yeast regulation related abstracts was manually labeled with such events. In order to capture these events automatically, we create an ontology of sub-processes that are likely to result in transcription rate changes by combining textual patterns and biological knowledge. We also propose effective feature extraction methods based on the created ontology to identify the direct evidences with specific details of these events. Our ontologies outperform existing state-of-the-art gene regulation ontologies in the automatic rule learning method applied to our corpus. The proposed deterministic ontology rule-based method can achieve comparable performance to the automatic rule learning method based on decision trees. This demonstrates the effectiveness of our ontology in identifying rate-changing events. We also tested the effectiveness of the proposed feature mining methods on detecting direct evidence of events. Experimental results show that the machine learning method on these features achieves an F1-score of 71.43%., Conclusions: The manually labeled corpus of events relating to rate changes in transcriptional regulation for yeast is available in https://sites.google.com/site/wentingntu/data. The created ontologies summarized both biological causes of rate changes in transcriptional regulation and corresponding positive and negative textual patterns from the corpus. They are demonstrated to be effective in identifying rate-changing events, which shows the benefits of combining textual patterns and biological knowledge on extracting complex biological events.
- Published
- 2014
- Full Text
- View/download PDF
42. Spatiotemporal quantification of cell dynamics in the lung following influenza virus infection.
- Author
-
Yin L, Xu S, Cheng J, Zheng D, Limmon GV, Leung NH, Rajapakse JC, Chow VT, Chen J, and Yu H
- Subjects
- Algorithms, Animals, Bronchioles immunology, Bronchioles pathology, Computational Biology methods, Disease Models, Animal, Epithelial Cells immunology, Epithelial Cells pathology, Female, Histocytochemistry, Image Processing, Computer-Assisted methods, Influenza A Virus, H1N1 Subtype physiology, Lung chemistry, Lung immunology, Lung virology, Mice, Mice, Inbred C57BL, Neutrophil Infiltration immunology, Orthomyxoviridae Infections physiopathology, Orthomyxoviridae Infections virology, Pulmonary Alveoli immunology, Pulmonary Alveoli pathology, Viral Load, Lung pathology, Orthomyxoviridae Infections pathology
- Abstract
Lung injury caused by influenza virus infection is widespread. Understanding lung damage and repair progression post infection requires quantitative spatiotemporal information on various cell types mapping into the tissue structure. Based on high content images acquired from an automatic slide scanner, we have developed algorithms to quantify cell infiltration in the lung, loss and recovery of Clara cells in the damaged bronchioles and alveolar type II cells (AT2s) in the damaged alveolar areas, and induction of pro-surfactant protein C (pro-SPC)-expressing bronchiolar epithelial cells (SBECs). These quantitative analyses reveal: prolonged immune cell infiltration into the lung that persisted long after the influenza virus was cleared and paralleled with Clara cell recovery; more rapid loss and recovery of Clara cells as compared to AT2s; and two stages of SBECs from Scgb1a1⁺ to Scgb1a1⁻. These results provide evidence supporting a new mechanism of alveolar repair where Clara cells give rise to AT2s through the SBEC intermediates and shed light on the understanding of the lung damage and repair process. The approach and algorithms in quantifying cell-level changes in the tissue context (cell-based tissue informatics) to gain mechanistic insights into the damage and repair process can be expanded and adapted in studying other disease models.
- Published
- 2013
- Full Text
- View/download PDF
43. Neutrophils infected with highly virulent influenza H3N2 virus exhibit augmented early cell death and rapid induction of type I interferon signaling pathways.
- Author
-
Ivan FX, Tan KS, Phoon MC, Engelward BP, Welsch RE, Rajapakse JC, and Chow VT
- Subjects
- Animals, Cell Line, Dogs, Humans, Influenza A Virus, H3N2 Subtype physiology, Madin Darby Canine Kidney Cells, Neutrophils cytology, Transcriptome, Virus Replication, Apoptosis, Influenza, Human immunology, Interferon Type I immunology, Neutrophils virology, Signal Transduction
- Abstract
We developed a model of influenza virus infection of neutrophils by inducing differentiation of the MPRO promyelocytic cell line. After 5 days of differentiation, about 20-30% of mature neutrophils could be detected. Only a fraction of neutrophils were infected by highly virulent influenza (HVI) virus, but were unable to support active viral replication compared with MDCK cells. HVI infection of neutrophils augmented early and late apoptosis as indicated by annexin V and TUNEL assays. Comparison between the global transcriptomic responses of neutrophils to HVI and low virulent influenza (LVI) revealed that the IFN regulatory factor and IFN signaling pathways were the most significantly overrepresented pathways, with activation of related genes in HVI as early as 3 h. Relatively consistent results were obtained by real-time RT-PCR of selected genes associated with the type I IFN pathway. Early after HVI infection, comparatively enhanced expression of apoptosis-related genes was also elicited., (Copyright © 2012 Elsevier Inc. All rights reserved.)
- Published
- 2013
- Full Text
- View/download PDF
44. Multiclass gene selection using Pareto-fronts.
- Author
-
Rajapakse JC and Mundra PA
- Subjects
- Algorithms, Humans, Models, Genetic, Neoplasms genetics, Neoplasms metabolism, Statistics, Nonparametric, Computational Biology methods, Databases, Genetic, Gene Expression Profiling, Models, Statistical
- Abstract
Filter methods are often used for selection of genes in multiclass sample classification by using microarray data. Such techniques usually tend to bias toward a few classes that are easily distinguishable from other classes due to imbalances of strong features and sample sizes of different classes. It could therefore lead to selection of redundant genes while missing the relevant genes, leading to poor classification of tissue samples. In this manuscript, we propose to decompose multiclass ranking statistics into class-specific statistics and then use Pareto-front analysis for selection of genes. This alleviates the bias induced by class intrinsic characteristics of dominating classes. The use of Pareto-front analysis is demonstrated on two filter criteria commonly used for gene selection: F-score and KW-score. A significant improvement in classification performance and reduction in redundancy among top-ranked genes were achieved in experiments with both synthetic and real-benchmark data sets.
- Published
- 2013
- Full Text
- View/download PDF
45. Differential pulmonary transcriptomic profiles in murine lungs infected with low and highly virulent influenza H3N2 viruses reveal dysregulation of TREM1 signaling, cytokines, and chemokines.
- Author
-
Ivan FX, Rajapakse JC, Welsch RE, Rozen SG, Narasaraju T, Xiong GM, Engelward BP, and Chow VT
- Subjects
- Animals, Apoptosis Regulatory Proteins genetics, Apoptosis Regulatory Proteins metabolism, Bronchoalveolar Lavage Fluid, Chemokines genetics, Chemokines metabolism, Cytokines genetics, Female, Gene Expression Profiling, Gene Regulatory Networks, Host-Pathogen Interactions, Influenza A Virus, H3N2 Subtype genetics, Influenza A Virus, H3N2 Subtype physiology, Lung immunology, Lung pathology, Lung virology, Membrane Glycoproteins genetics, Mice, Mice, Inbred BALB C, Orthomyxoviridae Infections genetics, Orthomyxoviridae Infections immunology, Orthomyxoviridae Infections virology, Receptors, Immunologic genetics, Signal Transduction, Systems Biology, Triggering Receptor Expressed on Myeloid Cells-1, Virulence genetics, Cytokines metabolism, Influenza A Virus, H3N2 Subtype pathogenicity, Lung metabolism, Membrane Glycoproteins metabolism, Orthomyxoviridae Infections metabolism, Receptors, Immunologic metabolism, Transcriptome
- Abstract
Investigating the relationships between critical influenza viral mutations contributing to increased virulence and host expression factors will shed light on the process of severe pathogenesis from the systems biology perspective. We previously generated a mouse-adapted, highly virulent influenza (HVI) virus through serial lung-to-lung passaging of a human influenza H3N2 virus strain that causes low virulent influenza (LVI) in murine lungs. This HVI virus is characterized by enhanced replication kinetics, severe lung injury, and systemic spread to major organs. Our gene microarray investigations compared the host transcriptomic responses of murine lungs to LVI virus and its HVI descendant at 12, 48, and 96 h following infection. More intense expression of genes associated with cytokine activity, type 1 interferon response, and apoptosis was evident in HVI at all time-points. We highlighted dysregulation of the TREM1 signaling pathway (an amplifier of cytokine production) that is likely to be upregulated in infiltrating neutrophils in HVI-infected lungs. The cytokine gene expression changes were corroborated by elevated levels of multiple cytokine and chemokine proteins in the bronchoalveolar lavage fluid of infected mice, especially at 12 h post-infection. Concomitantly, the downregulation of genes that mediate proliferative, developmental, and metabolic processes likely contributed to the lethality of HVI as well as lack of lung repair. Overall, our comparative transcriptomic study provided insights into key host factors that influence the dynamics, pathogenesis, and outcome of severe influenza.
- Published
- 2012
- Full Text
- View/download PDF
46. Serum proteome and cytokine analysis in a longitudinal cohort of adults with primary dengue infection reveals predictive markers of DHF.
- Author
-
Kumar Y, Liang C, Bo Z, Rajapakse JC, Ooi EE, and Tannenbaum SR
- Subjects
- Adult, Cohort Studies, Female, Humans, Longitudinal Studies, Male, Middle Aged, Prognosis, Biomarkers blood, Cytokines blood, Dengue Virus pathogenicity, Proteome analysis, Serum chemistry, Severe Dengue diagnosis, Severe Dengue pathology
- Abstract
Background: Infections caused by dengue virus are a major cause of morbidity and mortality in tropical and subtropical regions of the world. Factors that control transition from mild forms of disease such as dengue fever (DF) to more life-threatening forms such as dengue hemorrhagic fever (DHF) are poorly understood. Consequently, there are no reliable methods currently available for early triage of DHF patients resulting in significant over-hospitalization., Methodology/principal Findings: We have systematically examined the proteome, cytokines and inflammatory markers in sera from 62 adult dengue patients (44 DF; 18 DHF) with primary DENV infection, at three different times of infection representing the early febrile, defervescence and convalescent stages. Using fluorescent bioplex assays, we measured 27 cytokines in these serum samples. Additionally, we used multiple mass spectrometry methods for iTRAQ-based comparative analysis of serum proteome as well as measurements of protein adducts- 3-nitrotyrosine and 3-chlorotyrosine as surrogate measures of free radical activity. Using multiple methods such as OPLS, MRMR and MSVM-RFE for multivariate feature selection and classification, we report molecular markers that allow prediction of primary DHF with sensitivity and specificity of >80%., Conclusions/significance: This report constitutes a comprehensive analysis of molecular signatures of dengue disease progression and will help unravel mechanisms of dengue disease progression. Our analysis resulted in the identification of markers that may be useful for early prediction of DHF during the febrile phase. The combination of highly sensitive analytical methods and novel statistical approaches described here forms a robust platform for biomarker discovery.
- Published
- 2012
- Full Text
- View/download PDF
47. Improving signal-to-noise ratio of structured light microscopy based on photon reassignment.
- Author
-
Singh VR, Choi H, Yew EY, Bhattacharya D, Yuan L, Sheppard CJ, Rajapakse JC, Barbastathis G, and So PT
- Abstract
In this paper, we report a method for 3D visualization of a biological specimen utilizing a structured light wide-field microscopic imaging system. This method improves on existing structured light imaging modalities by reassigning fluorescence photons generated from off-focal plane excitation, improving in-focus signal strength. Utilizing a maximum likelihood approach, we identify the most likely fluorophore distribution in 3D that will produce the observed image stacks under structured and uniform illumination using an iterative maximization algorithm. Our results show the optical sectioning capability of tissue specimens while mostly preserving image stack photon count, which is usually not achievable with other existing structured light imaging methods., (2011 Optical Society of America)
- Published
- 2012
- Full Text
- View/download PDF
48. Levodopa and the feedback process on set-shifting in Parkinson's disease.
- Author
-
Au WL, Zhou J, Palmes P, Sitoh YY, Tan LC, and Rajapakse JC
- Subjects
- Aged, Antiparkinson Agents therapeutic use, Brain drug effects, Brain physiopathology, Cognition physiology, Female, Humans, Levodopa therapeutic use, Male, Middle Aged, Neuropsychological Tests, Parkinson Disease drug therapy, Parkinson Disease physiopathology, Psychomotor Performance drug effects, Psychomotor Performance physiology, Antiparkinson Agents pharmacology, Cognition drug effects, Levodopa pharmacology, Parkinson Disease psychology, Set, Psychology
- Abstract
Objective: To study the interaction between levodopa and the feedback process on set-shifting in Parkinson's disease (PD)., Methods: Functional magnetic resonance imaging (fMRI) studies were performed on 13 PD subjects and 17 age-matched healthy controls while they performed a modified card-sorting task. Experimental time periods were defined based on the types of feedback provided. PD subjects underwent the fMRI experiment twice, once during "off" medication (PDoff) and again after levodopa replacement (PDon)., Results: Compared with normal subjects, the cognitive processing times were prolonged in PDoff but not in PDon subjects during learning through positive outcomes. The ability to set-shift through negative outcomes was not affected in PD subjects, even when "off" medication. Intergroup comparisons showed the lateral prefrontal cortex was deactivated in PDoff subjects during positive feedback learning, especially following internal feedback cues. The cortical activations were increased in the posterior brain regions in PDoff subjects following external feedback learning, especially when negative feedback cues were provided. Levodopa replacement did not completely restore the activation patterns in PD subjects to normal although activations in the corticostriatal loops were restored., Conclusion: PD subjects showed differential ability to set-shift, depending on the dopamine status as well as the types of feedback cues provided. PD subjects had difficulty performing set-shift tasks through positive outcomes when "off" medication, and showed improvement after levodopa replacement. The ability to set-shift through negative feedback was not affected in PD subjects even when "off" medication, possibly due to compensatory changes outside the nigrostriatal dopaminergic pathway., (Copyright © 2011 Wiley Periodicals, Inc.)
- Published
- 2012
- Full Text
- View/download PDF
49. Tree-structured algorithm for long weak motif discovery.
- Author
-
Sun HQ, Low MY, Hsu WJ, Tan CW, and Rajapakse JC
- Subjects
- Base Sequence, Models, Genetic, Algorithms, Amino Acid Motifs genetics, Gene Expression Regulation genetics, Transcription Factors genetics
- Abstract
Motivation: Motifs in DNA sequences often appear in degenerate form, so there has been an increased interest in computational algorithms for weak motif discovery. Probabilistic algorithms are unable to detect weak motifs while exact methods have been able to detect only short weak motifs. This article proposes an exact tree-based motif detection (TreeMotif) algorithm capable of discovering longer and weaker motifs than by the existing methods., Results: TreeMotif converts the graphical representation of motifs into a tree-structured representation in which a tree that branches with nodes from every sequence represents motif instances. The method of tree construction is novel to motif discovery based on graphical representation. TreeMotif is more efficient and scalable in handling longer and weaker motifs than the existing algorithms in terms of accuracy and execution time. The performances of TreeMotif were demonstrated on synthetic data as well as on real biological data., Availability: https://sites.google.com/site/shqssw/treemotif, Contact: sunh0013@e.ntu.edu.sg, Supplementary Information: Supplementary data are available at Bioinformatics online.
- Published
- 2011
- Full Text
- View/download PDF
50. Toward better understanding of protein secondary structure: extracting prediction rules.
- Author
-
Nguyen MN, Zurada JM, and Rajapakse JC
- Subjects
- Algorithms, Data Mining, Databases, Protein, Decision Trees, Proteins classification, Artificial Intelligence, Computational Biology methods, Protein Structure, Secondary, Proteins chemistry
- Abstract
Although numerous computational techniques have been applied to predict protein secondary structure (PSS), only limited studies have dealt with discovery of logic rules underlying the prediction itself. Such rules offer interesting links between the prediction model and the underlying biology. In addition, they enhance interpretability of PSS prediction by providing a degree of transparency to the predicting model usually regarded as a black box. In this paper, we explore the generation and use of C4.5 decision trees to extract relevant rules from PSS predictions modeled with two-stage support vector machines (TS-SVM). The proposed rules were derived on the RS126 data set of 126 nonhomologous globular proteins and on the PSIPRED data set of 1,923 protein sequences. Our approach has produced sets of comprehensible, and often interpretable, rules underlying the PSS predictions. Moreover, many of the rules seem to be strongly supported by biological evidence. Further, our approach resulted in good prediction accuracy, few and usually compact rules, and rules that are generally of higher confidence levels than those generated by other rule extraction techniques.
- Published
- 2011
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.