99 results on '"Taguchi YH"'
Search Results
2. Universal Nature of Drug Treatment Responses in Drug-Tissue-Wide Model-Animal Experiments Using Tensor Decomposition-Based Unsupervised Feature Extraction
- Author
-
Taguchi, Yh., primary and Turki, Turki, additional
- Published
- 2020
- Full Text
- View/download PDF
3. Novel method for the prediction of drug-drug Interaction based on Gene Expression profiles
- Author
-
Taguchi, Yh., primary and Turki, Turki, additional
- Published
- 2020
- Full Text
- View/download PDF
4. Novel artificial intelligence-based identification of drug-gene-disease interaction using protein-protein interaction.
- Author
-
Taguchi YH and Turki T
- Subjects
- Humans, Neoplasms drug therapy, Neoplasms genetics, Neoplasms metabolism, Drug Repositioning methods, Antineoplastic Agents pharmacology, Protein Interaction Mapping methods, Computational Biology methods, Artificial Intelligence
- Abstract
The evaluation of drug-gene-disease interactions is key for the identification of drugs effective against disease. However, at present, drugs that are effective against genes that are critical for disease are difficult to identify. Following a disease-centric approach, there is a need to identify genes critical to disease function and find drugs that are effective against them. By contrast, following a drug-centric approach comprises identifying the genes targeted by drugs, and then the diseases in which the identified genes are critical. Both of these processes are complex. Using a gene-centric approach, whereby we identify genes that are effective against the disease and can be targeted by drugs, is much easier. However, how such sets of genes can be identified without specifying either the target diseases or drugs is not known. In this study, a novel artificial intelligence-based approach that employs unsupervised methods and identifies genes without specifying neither diseases nor drugs is presented. To evaluate its feasibility, we applied tensor decomposition (TD)-based unsupervised feature extraction (FE) to perform drug repositioning from protein-protein interactions (PPI) without any other information. Proteins selected by TD-based unsupervised FE include many genes related to cancers, as well as drugs that target the selected proteins. Thus, we were able to identify cancer drugs using only PPI. Because the selected proteins had more interactions, we replaced the selected proteins with hub proteins and found that hub proteins themselves could be used for drug repositioning. In contrast to hub proteins, which can only identify cancer drugs, TD-based unsupervised FE enables the identification of drugs for other diseases. In addition, TD-based unsupervised FE can be used to identify drugs that are effective in in vivo experiments, which is difficult when hub proteins are used. In conclusion, TD-based unsupervised FE is a useful tool for drug repositioning using only PPI without other information., Competing Interests: Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Conflict of interest: The authors declare no Conflict of interest., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
5. Gene Selection of Methionine-Dependent Melanoma and Independent Melanoma by Variable Selection Using Tensor Decomposition.
- Author
-
Kobayashi K and Taguchi YH
- Subjects
- Humans, Cell Line, Tumor, Transcriptome, Methionine metabolism, Melanoma genetics, Melanoma metabolism, Melanoma pathology, Gene Expression Regulation, Neoplastic
- Abstract
Methionine is an essential amino acid. Dietary methionine restriction is associated with decreased tumor growth in preclinical studies and extended lifespans in animal models. The mechanism by which methionine restriction inhibits tumor growth while sparing normal cells is not fully understood. In this study, we applied tensor decomposition-based feature extraction for gene selection from the gene expression profiles of two cell lines of RNA sequencing. We compared two human melanoma cell lines, A101D and MeWo. A101D is a typical cancer cell line that exhibits methionine dependence. MeWo is a methionine-independent cell line. We used the application on R, TDbasedUFE, to perform an enrichment analysis of the selected gene set. Consequently, concordance with existing research on the differences between methionine-dependent melanoma and methionine-independent melanoma was confirmed. Targeting methionine metabolism is considered a promising strategy for treating melanoma and other cancers.
- Published
- 2024
- Full Text
- View/download PDF
6. Novel large empirical study of deep transfer learning for COVID-19 classification based on CT and X-ray images.
- Author
-
Almutaani M, Turki T, and Taguchi YH
- Subjects
- Humans, SARS-CoV-2 isolation & purification, Databases, Factual, Image Processing, Computer-Assisted methods, COVID-19 diagnostic imaging, Deep Learning, Tomography, X-Ray Computed methods
- Abstract
The early and highly accurate prediction of COVID-19 based on medical images can speed up the diagnostic process and thereby mitigate disease spread; therefore, developing AI-based models is an inevitable endeavor. The presented work, to our knowledge, is the first to expand the model space and identify a better performing model among 10,000 constructed deep transfer learning (DTL) models as follows. First, we downloaded and processed 4481 CT and X-ray images pertaining to COVID-19 and non-COVID-19 patients, obtained from the Kaggle repository. Second, we provide processed images as inputs to four pre-trained deep learning models (ConvNeXt, EfficientNetV2, DenseNet121, and ResNet34) on more than a million images from the ImageNet database, in which we froze the convolutional and pooling layers pertaining to the feature extraction part while unfreezing and training the densely connected classifier with the Adam optimizer. Third, we generate and take a majority vote of two, three, and four combinations from the four DTL models, resulting in [Formula: see text] DTL models. Then, we combine the 11 DTL models, followed by consecutively generating and taking the majority vote of [Formula: see text] DTL models. Finally, we select [Formula: see text] DTL models from [Formula: see text] Experimental results from the whole datasets using five-fold cross-validation demonstrate that the best generated DTL model, named HC, achieving the best AUC of 0.909 when applied to the CT dataset, while ConvNeXt yielded a higher marginal AUC of 0.933 compared to 0.93 for HX when considering the X-ray dataset. These promising results set the foundation for promoting the large generation of models (LGM) in AI., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
7. Tracing ALS Degeneration: Insights from Spinal Cord and Cortex Transcriptomes.
- Author
-
Sneha NP, Dharshini SAP, Taguchi YH, and Gromiha MM
- Subjects
- Humans, Frontal Lobe metabolism, Frontal Lobe pathology, Gene Regulatory Networks, Motor Neurons metabolism, Motor Neurons pathology, Amyotrophic Lateral Sclerosis genetics, Amyotrophic Lateral Sclerosis metabolism, Amyotrophic Lateral Sclerosis pathology, Transcriptome, Spinal Cord metabolism, Spinal Cord pathology
- Abstract
Background/objectives: Amyotrophic Lateral Sclerosis is a progressive neurodegenerative disorder characterized by the loss of upper and lower motor neurons. Key factors contributing to neuronal death include mitochondrial energy damage, oxidative stress, and excitotoxicity. The frontal cortex is crucial for action initiation, planning, and voluntary movements whereas the spinal cord facilitates communication with the brain, walking, and reflexes. By investigating transcriptome data from the frontal cortex and spinal cord, we aim to elucidate common pathological mechanisms and pathways involved in ALS for understanding the disease progression and identifying potential therapeutic targets., Methods: In this study, we quantified gene and transcript expression patterns, predicted variants, and assessed their functional effects using computational tools. It also includes predicting variant-associated regulatory effects, constructing functional interaction networks, and performing a gene enrichment analysis., Results: We found novel genes for the upregulation of immune response, and the downregulation of metabolic-related and defective degradation processes in both the spinal cord and frontal cortex. Additionally, we observed the dysregulation of histone regulation and blood pressure-related genes specifically in the frontal cortex., Conclusions: These results highlight the distinct and shared molecular disruptions in ALS, emphasizing the critical roles of immune response and metabolic dysfunction in neuronal degeneration. Targeting these pathways may provide new therapeutic avenues to combat neurodegeneration and preserve neuronal health.
- Published
- 2024
- Full Text
- View/download PDF
8. Characterization of circulating miRNAs in the treatment of primary liver tumors.
- Author
-
Umezu T, Tanaka S, Kubo S, Enomoto M, Tamori A, Ochiya T, Taguchi YH, Kuroda M, and Murakami Y
- Subjects
- Humans, Hepacivirus genetics, Carcinogenesis, Liver Neoplasms diagnosis, Liver Neoplasms genetics, Liver Neoplasms therapy, Carcinoma, Hepatocellular diagnosis, Carcinoma, Hepatocellular genetics, Carcinoma, Hepatocellular therapy, MicroRNAs genetics, Hepatitis C
- Abstract
Background and Aim: Circulating micro RNAs (miRNAs) indicate clinical pathologies such as inflammation and carcinogenesis. In this study, we aimed to investigate whether miRNA expression level patterns in could be used to diagnose hepatocellular carcinoma (HCC) and biliary tract cancer (BTC), and the relationship miRNA expression patterns and cancer etiology., Methods: Patients with HCC and BTC with indications for surgery were selected for the study. Total RNA was extracted from the extracellular vesicle (EV)-rich fraction of the serum and analyzed using Toray miRNA microarray. Samples were divided into two cohorts in order of collection, the first 85 HCC were analyzed using a microarray based on miRBase ver.2.0 (hereafter v20 cohort), and the second 177 HCC and 43 BTC were analyzed using a microarray based on miRBase ver.21 (hereafter v21 cohort)., Results: Using miRNA expression patterns, we found that HCC and BTC could be identified with an area under curve (AUC) 0.754 (v21 cohort). Patients with anti-hepatitis C virus (HCV) treatment (SVR-HCC) and without antiviral treatment (HCV-HCC) could be distinguished by an AUC 0.811 (v20 cohort) and AUC 0.798 (v21 cohort), respectively., Conclusions: In this study, we could diagnose primary hepatic malignant tumor using miRNA expression patterns. Moreover, the difference of miRNA expression in SVR-HCC and HCV-HCC can be important information for enclosing cases that are prone to carcinogenesis after being cured with antiviral agents, but also for uncovering the mechanism for some carcinogenic potential remains even after persistent virus infection has disappeared., (© 2023 The Authors. Cancer Reports published by Wiley Periodicals LLC.)
- Published
- 2024
- Full Text
- View/download PDF
9. Investigating Neuron Degeneration in Huntington's Disease Using RNA-Seq Based Transcriptome Study.
- Author
-
Sneha NP, Dharshini SAP, Taguchi YH, and Gromiha MM
- Subjects
- Humans, RNA-Seq, Transcriptome genetics, Nerve Degeneration, Huntington Disease genetics, Chorea
- Abstract
Huntington's disease (HD) is a progressive neurodegenerative disorder caused due to a CAG repeat expansion in the huntingtin ( HTT ) gene. The primary symptoms of HD include motor dysfunction such as chorea, dystonia, and involuntary movements. The primary motor cortex (BA4) is the key brain region responsible for executing motor/movement activities. Investigating patient and control samples from the BA4 region will provide a deeper understanding of the genes responsible for neuron degeneration and help to identify potential markers. Previous studies have focused on overall differential gene expression and associated biological functions. In this study, we illustrate the relationship between variants and differentially expressed genes/transcripts. We identified variants and their associated genes along with the quantification of genes and transcripts. We also predicted the effect of variants on various regulatory activities and found that many variants are regulating gene expression. Variants affecting miRNA and its targets are also highlighted in our study. Co-expression network studies revealed the role of novel genes. Function interaction network analysis unveiled the importance of genes involved in vesicle-mediated transport. From this unified approach, we propose that genes expressed in immune cells are crucial for reducing neuron death in HD.
- Published
- 2023
- Full Text
- View/download PDF
10. Application note: TDbasedUFE and TDbasedUFEadv: bioconductor packages to perform tensor decomposition based unsupervised feature extraction.
- Author
-
Taguchi YH and Turki T
- Abstract
Motivation: Tensor decomposition (TD)-based unsupervised feature extraction (FE) has proven effective for a wide range of bioinformatics applications ranging from biomarker identification to the identification of disease-causing genes and drug repositioning. However, TD-based unsupervised FE failed to gain widespread acceptance due to the lack of user-friendly tools for non-experts., Results: We developed two bioconductor packages-TDbasedUFE and TDbasedUFEadv-that enable researchers unfamiliar with TD to utilize TD-based unsupervised FE. The packages facilitate the identification of differentially expressed genes and multiomics analysis. TDbasedUFE was found to outperform two state-of-the-art methods, such as DESeq2 and DIABLO., Availability and Implementation: TDbasedUFE and TDbasedUFEadv are freely available as R/Bioconductor packages, which can be accessed at https://bioconductor.org/packages/TDbasedUFE and https://bioconductor.org/packages/TDbasedUFEadv, respectively., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2023 Taguchi and Turki.)
- Published
- 2023
- Full Text
- View/download PDF
11. Integrated analysis of human DNA methylation, gene expression, and genomic variation in iMETHYL database using kernel tensor decomposition-based unsupervised feature extraction.
- Author
-
Taguchi YH, Komaki S, Sutoh Y, Ohmomo H, Otsuka-Yamasaki Y, and Shimizu A
- Subjects
- Humans, Databases, Factual, Genomics, Gene Expression, DNA Methylation, Transcription Factors
- Abstract
Integrating gene expression, DNA methylation, and genomic variants simultaneously without location coincidence (i.e., irrespective of distance from each other) or pairwise coincidence (i.e., direct identification of triplets of gene expression, DNA methylation, and genomic variants, and not integration of pairwise coincidences) is difficult. In this study, we integrated gene expression, DNA methylation, and genome variants from the iMETHYL database using the recently proposed kernel tensor decomposition-based unsupervised feature extraction method with limited computational resources (i.e., short CPU time and small memory requirements). Our methods do not require prior knowledge of the subjects because they are fully unsupervised in that unsupervised tensor decomposition is used. The selected genes and genomic variants were significantly targeted by transcription factors that were biologically enriched in KEGG pathway terms as well as in the intra-related regulatory network. The proposed method is promising for integrated analyses of gene expression, methylation, and genomic variants with limited computational resources., Competing Interests: The authors have declared that no competing interests exist., (Copyright: © 2023 Taguchi et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2023
- Full Text
- View/download PDF
12. Tensor decomposition discriminates tissues using scATAC-seq.
- Author
-
Taguchi YH and Turki T
- Subjects
- Gene Expression Regulation, Transcription Factors metabolism, Genome, Chromatin
- Abstract
ATAC-seq is a powerful tool for measuring the landscape structure of a chromosome. scATAC-seq is a recently updated version of ATAC-seq performed in a single cell. The problem with scATAC-seq is data sparsity and most of the genomic sites are inaccessible. Here, tensor decomposition (TD) was used to fill in missing values. In this study, TD was applied to massive scATAC-seq datasets generated by approximately 200 bp intervals, and this number can reach 13,627,618. Currently, no other methods can deal with large sparse matrices. The proposed method could not only provide UMAP embedding that coincides with tissue specificity, but also select genes associated with various biological enrichment terms and transcription factor targeting. This suggests that TD is a useful tool to process a large sparse matrix generated from scATAC-seq., Competing Interests: Declaration of Competing Interest The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: Y.-H. Taguchi reports financial support was provided by Japan Society for the Promotion of Science., (Copyright © 2023 Elsevier B.V. All rights reserved.)
- Published
- 2023
- Full Text
- View/download PDF
13. Principal component analysis- and tensor decomposition-based unsupervised feature extraction to select more suitable differentially methylated cytosines: Optimization of standard deviation versus state-of-the-art methods.
- Author
-
Taguchi YH and Turki T
- Subjects
- CpG Islands, Principal Component Analysis, DNA Methylation, Genome
- Abstract
In contrast to RNA-seq analysis, which has various standard methods, no standard methods for identifying differentially methylated cytosines (DMCs) exist. To identify DMCs, we tested principal component analysis and tensor decomposition-based unsupervised feature extraction with optimized standard deviation, which has been shown to be effective for differentially expressed gene (DEG) identification. The proposed method outperformed certain conventional methods, including those that assume beta-binomial distribution for methylation as the proposed method does not require this, especially when applied to methylation profiles measured using high throughput sequencing. DMCs identified by the proposed method also significantly overlapped with various functional sites, including known differentially methylated regions, enhancers, and DNase I hypersensitive sites. The proposed method was applied to data sets retrieved from The Cancer Genome Atlas to identify DMCs using American Joint Committee on Cancer staging system edition labels. This suggests that the proposed method is a promising standard method for identifying DMCs., Competing Interests: Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2023 The Authors. Published by Elsevier Inc. All rights reserved.)
- Published
- 2023
- Full Text
- View/download PDF
14. Features extracted using tensor decomposition reflect the biological features of the temporal patterns of human blood multimodal metabolome.
- Author
-
Fujita S, Karasawa Y, Hironaka KI, Taguchi YH, and Kuroda S
- Subjects
- Humans, Blood Chemical Analysis, Metabolome, Blood
- Abstract
High-throughput omics technologies have enabled the profiling of entire biological systems. For the biological interpretation of such omics data, two analyses, hypothesis- and data-driven analyses including tensor decomposition, have been used. Both analyses have their own advantages and disadvantages and are mutually complementary; however, a direct comparison of these two analyses for omics data is poorly examined.We applied tensor decomposition (TD) to a dataset representing changes in the concentrations of 562 blood molecules at 14 time points in 20 healthy human subjects after ingestion of 75 g oral glucose. We characterized each molecule by individual dependence (constant or variable) and time dependence (later peak or early peak). Three of the four features extracted by TD were characterized by our previous hypothesis-driven study, indicating that TD can extract some of the same features obtained by hypothesis-driven analysis in a non-biased manner. In contrast to the years taken for our previous hypothesis-driven analysis, the data-driven analysis in this study took days, indicating that TD can extract biological features in a non-biased manner without the time-consuming process of hypothesis generation., Competing Interests: The authors have no competing interests to declare., (Copyright: © 2023 Fujita et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2023
- Full Text
- View/download PDF
15. A new machine learning based computational framework identifies therapeutic targets and unveils influential genes in pancreatic islet cells.
- Author
-
Turki T and Taguchi YH
- Subjects
- Blood Glucose metabolism, Insulin genetics, Insulin metabolism, Glucagon, Gene Expression Profiling, Islets of Langerhans metabolism, Insulin-Secreting Cells metabolism
- Abstract
Pancreatic islets comprise a group of cells that produce hormones regulating blood glucose levels. Particularly, the alpha and beta islet cells produce glucagon and insulin to stabilize blood glucose. When beta islet cells are dysfunctional, insulin is not secreted, inducing a glucose metabolic disorder. Identifying effective therapeutic targets against the disease is a complicated task and is not yet conclusive. To close the wide gap between understanding the molecular mechanism of pancreatic islet cells and providing effective therapeutic targets, we present a computational framework to identify potential therapeutic targets against pancreatic disorders. First, we downloaded three transcriptome expression profiling datasets pertaining to pancreatic islet cells (GSE87375, GSE79457, GSE110154) from the Gene Expression Omnibus database. For each dataset, we extracted expression profiles for two cell types. We then provided these expression profiles along with the cell types to our proposed constrained optimization problem of a support vector machine and to other existing methods, selecting important genes from the expression profiles. Finally, we performed (1) an evaluation from a classification perspective which showed the superiority of our methods against the baseline; and (2) an enrichment analysis which indicated that our methods achieved better outcomes. Results for the three datasets included 44 unique genes and 10 unique transcription factors (SP1, HDAC1, EGR1, E2F1, AR, STAT6, RELA, SP3, NFKB1, and ESR1) which are reportedly related to pancreatic islet functions, diseases, and therapeutic targets., Competing Interests: Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2022 Elsevier B.V. All rights reserved.)
- Published
- 2023
- Full Text
- View/download PDF
16. Bioinformatic tools for epitranscriptomics.
- Author
-
Taguchi YH
- Subjects
- Computational Biology methods, Genomics, High-Throughput Nucleotide Sequencing, Transcriptome genetics, RNA Processing, Post-Transcriptional
- Abstract
The epitranscriptome, defined as RNA modifications that do not involve alterations in the nucleotide sequence, is a popular topic in the genomic sciences. Because we need massive computational techniques to identify epitranscriptomes within individual transcripts, many tools have been developed to infer epitranscriptomic sites as well as to process datasets using high-throughput sequencing. In this review, we summarize recent developments in epitranscriptome spatial detection and data analysis and discuss their progression.
- Published
- 2023
- Full Text
- View/download PDF
17. Integrative Meta-Analysis of Huntington's Disease Transcriptome Landscape.
- Author
-
Sneha NP, Dharshini SAP, Taguchi YH, and Gromiha MM
- Subjects
- Humans, Genome-Wide Association Study, Neuroinflammatory Diseases, Gene Expression Regulation, Transcriptome genetics, Huntington Disease genetics
- Abstract
Huntington's disease (HD) is a neurodegenerative disorder with autosomal dominant inheritance caused by glutamine expansion in the Huntingtin gene (HTT). Striatal projection neurons (SPNs) in HD are more vulnerable to cell death. The executive striatal population is directly connected with the Brodmann Area (BA9), which is mainly involved in motor functions. Analyzing the disease samples from BA9 from the SRA database provides insights related to neuron degeneration, which helps to identify a promising therapeutic strategy. Most gene expression studies examine the changes in expression and associated biological functions. In this study, we elucidate the relationship between variants and their effect on gene/downstream transcript expression. We computed gene and transcript abundance and identified variants from RNA-seq data using various pipelines. We predicted the effect of genome-wide association studies (GWAS)/novel variants on regulatory functions. We found that many variants affect the histone acetylation pattern in HD, thereby perturbing the transcription factor networks. Interestingly, some variants affect miRNA binding as well as their downstream gene expression. Tissue-specific network analysis showed that mitochondrial, neuroinflammation, vasculature, and angiogenesis-related genes are disrupted in HD. From this integrative omics analysis, we propose that abnormal neuroinflammation acts as a two-edged sword that indirectly affects the vasculature and associated energy metabolism. Rehabilitation of blood-brain barrier functionality and energy metabolism may secure the neuron from cell death.
- Published
- 2022
- Full Text
- View/download PDF
18. A tensor decomposition-based integrated analysis applicable to multiple gene expression profiles without sample matching.
- Author
-
Taguchi YH and Turki T
- Subjects
- Transcriptome
- Abstract
The integrated analysis of multiple gene expression profiles previously measured in distinct studies is problematic since missing both sample matches and common labels prevent their integration in fully data-driven, unsupervised training. In this study, we propose a strategy to enable the integration of multiple gene expression profiles among multiple independent studies with neither labeling nor sample matching using tensor decomposition unsupervised feature extraction. We apply this strategy to Alzheimer's disease (AD)-related gene expression profiles that lack precise correspondence among samples, including AD single-cell RNA sequence (scRNA-seq) data. We were able to select biologically reasonable genes using the integrated analysis. Overall, integrated gene expression profiles can function analogously to prior- and/or transfer-learning strategies in other machine-learning applications. For scRNA-seq, the proposed approach significantly reduces the required computational memory., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
19. Suppression of intrahepatic cholangiocarcinoma cell growth by SKI via upregulation of the CDK inhibitor p21.
- Author
-
Kawamura E, Matsubara T, Daikoku A, Deguchi S, Kinoshita M, Yuasa H, Urushima H, Odagiri N, Motoyama H, Kotani K, Kozuka R, Hagihara A, Fujii H, Uchida-Kobayashi S, Tanaka S, Takemura S, Iwaisako K, Enomoto M, Taguchi YH, Tamori A, Kubo S, Ikeda K, and Kawada N
- Subjects
- Humans, Cyclin-Dependent Kinase Inhibitor p21 genetics, Cyclin-Dependent Kinase Inhibitor p21 metabolism, Up-Regulation genetics, Cell Proliferation genetics, Cell Cycle Proteins metabolism, Bile Ducts, Intrahepatic metabolism, Bile Ducts, Intrahepatic pathology, RNA, Messenger, Cholangiocarcinoma genetics, Cholangiocarcinoma metabolism, Cholangiocarcinoma pathology, Bile Duct Neoplasms genetics, Bile Duct Neoplasms metabolism, Bile Duct Neoplasms pathology, MicroRNAs
- Abstract
Cholangiocarcinoma (CC) has a poor prognosis and different driver genes depending on the site of onset. Intrahepatic CC is the second-most common liver cancer after hepatocellular carcinoma, and novel therapeutic targets are urgently needed. The present study was conducted to identify novel therapeutic targets by exploring differentially regulated genes in human CC. MicroRNA (miRNA) and mRNA microarrays were performed using tissue and serum samples obtained from 24 surgically resected hepatobiliary tumor cases, including 10 CC cases. We conducted principal component analysis to identify differentially expressed miRNA, leading to the identification of miRNA-3648 as a differentially expressed miRNA. We used an in silico screening approach to identify its target mRNA, the tumor suppressor Sloan Kettering Institute (SKI). SKI protein expression was decreased in human CC cells overexpressing miRNA-3648, endogenous SKI protein expression was decreased in human CC tumor tissues, and endogenous SKI mRNA expression was suppressed in human CC cells characterized by rapid growth. SKI-overexpressing OZ cells (human intrahepatic CC cells) showed upregulation of cyclin-dependent kinase inhibitor p21 mRNA and protein expression and suppressed cell proliferation. Nuclear expression of CDT1 (chromatin licensing and DNA replication factor 1), which is required for the G1/S transition, was suppressed in SKI-overexpressing OZ cells. SKI knockdown resulted in the opposite effects. Transgenic p21-luciferase was activated in SKI-overexpressing OZ cells. These data indicate SKI involvement in p21 transcription and that SKI-p21 signaling causes cell cycle arrest in G1, suppressing intrahepatic CC cell growth. Therefore, SKI may be a potential therapeutic target for intrahepatic CC., (© 2022 The Authors. FEBS Open Bio published by John Wiley & Sons Ltd on behalf of Federation of European Biochemical Societies.)
- Published
- 2022
- Full Text
- View/download PDF
20. microRNA Bioinformatics.
- Author
-
Taguchi YH
- Subjects
- Computational Biology, MicroRNAs genetics
- Abstract
Firstly, I apologize for the delayed publication of this Special Issue in the form of a book title [...]., Competing Interests: The author declares no conflict of interest.
- Published
- 2022
- Full Text
- View/download PDF
21. Adapted tensor decomposition and PCA based unsupervised feature extraction select more biologically reasonable differentially expressed genes than conventional methods.
- Author
-
Taguchi YH and Turki T
- Subjects
- Reproducibility of Results, Principal Component Analysis, Normal Distribution, Biomarkers, Algorithms
- Abstract
Tensor decomposition- and principal component analysis-based unsupervised feature extraction were proposed almost 5 and 10 years ago, respectively; although these methods have been successfully applied to a wide range of genome analyses, including drug repositioning, biomarker identification, and disease-causing genes' identification, some fundamental problems have been identified: the number of genes identified was too small to assume that there were no false negatives, and the histogram of P values derived was not fully coincident with the null hypothesis that principal component and singular value vectors follow the Gaussian distribution. Optimizing the standard deviation such that the histogram of P values is as much as possible coincident with the null hypothesis results in an increase in the number and biological reliability of the selected genes. Our contribution was that we improved these methods so as to be able to select biologically more reasonable differentially expressed genes than the state of art methods that must empirically assume negative binomial distributions and dispersion relation, which is required for the selecting more expressed genes than less expressed ones, which can be achieved by the proposed methods that do not have to assume these., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
22. Projection in genomic analysis: A theoretical basis to rationalize tensor decomposition and principal component analysis as feature selection tools.
- Author
-
Taguchi YH and Turki T
- Subjects
- Gene Order, Genomics, Humans, Principal Component Analysis, Projection, COVID-19
- Abstract
Identifying differentially expressed genes is difficult because of the small number of available samples compared with the large number of genes. Conventional gene selection methods employing statistical tests have the critical problem of heavy dependence of P-values on sample size. Although the recently proposed principal component analysis (PCA) and tensor decomposition (TD)-based unsupervised feature extraction (FE) has often outperformed these statistical test-based methods, the reason why they worked so well is unclear. In this study, we aim to understand this reason in the context of projection pursuit (PP) that was proposed a long time ago to solve the problem of dimensions; we can relate the space spanned by singular value vectors with that spanned by the optimal cluster centroids obtained from K-means. Thus, the success of PCA- and TD-based unsupervised FE can be understood by this equivalence. In addition to this, empirical threshold adjusted P-values of 0.01 assuming the null hypothesis that singular value vectors attributed to genes obey the Gaussian distribution empirically corresponds to threshold-adjusted P-values of 0.1 when the null distribution is generated by gene order shuffling. For this purpose, we newly applied PP to the three data sets to which PCA and TD based unsupervised FE were previously applied; these data sets treated two topics, biomarker identification for kidney cancers (the first two) and the drug discovery for COVID-19 (the thrid one). Then we found the coincidence between PP and PCA or TD based unsupervised FE is pretty well. Shuffling procedures described above are also successfully applied to these three data sets. These findings thus rationalize the success of PCA- and TD-based unsupervised FE for the first time., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2022
- Full Text
- View/download PDF
23. Estimation of Metabolic Effects upon Cadmium Exposure during Pregnancy Using Tensor Decomposition.
- Author
-
Amakura Y and Taguchi YH
- Subjects
- Pregnancy, Humans, Female, Reactive Oxygen Species metabolism, Insulin Receptor Substrate Proteins, Endoplasmic Reticulum-Associated Degradation, Insulin genetics, Insulin metabolism, Ceramides, Sphingolipids, Cadmium toxicity, Cadmium metabolism, Tumor Necrosis Factor-alpha metabolism
- Abstract
A simple tensor decomposition model was applied to the liver transcriptome analysis data to elucidate the cause of cadmium-induced gene overexpression. In addition, we estimated the mechanism by which prenatal Cd exposure disrupts insulin metabolism in offspring. Numerous studies have reported on the toxicity of Cd. A liver transcriptome analysis revealed that Cd toxicity induces intracellular oxidative stress and mitochondrial dysfunction via changes in gene expression, which in turn induces endoplasmic reticulum-associated degradation via abnormal protein folding. However, the specific mechanisms underlying these effects remain unknown. In this study, we found that Cd-induced endoplasmic reticulum stress may promote increased expression of tumor necrosis factor-α (TNF-α). Based on the high expression of genes involved in the production of sphingolipids, it was also found that the accumulation of ceramide may induce intracellular oxidative stress through the overproduction of reactive oxygen species. In addition, the high expression of a set of genes involved in the electron transfer system may contribute to oxidative stress. These findings allowed us to identify the mechanisms by which intracellular oxidative stress leads to the phosphorylation of insulin receptor substrate 1, which plays a significant role in the insulin signaling pathway.
- Published
- 2022
- Full Text
- View/download PDF
24. Integrated Analysis of Tissue-Specific Gene Expression in Diabetes by Tensor Decomposition Can Identify Possible Associated Diseases.
- Author
-
Taguchi YH and Turki T
- Subjects
- Humans, Liver, Transcriptome genetics, Diabetes Mellitus genetics
- Abstract
In the field of gene expression analysis, methods of integrating multiple gene expression profiles are still being developed and the existing methods have scope for improvement. The previously proposed tensor decomposition-based unsupervised feature extraction method was improved by introducing standard deviation optimization. The improved method was applied to perform an integrated analysis of three tissue-specific gene expression profiles (namely, adipose, muscle, and liver) for diabetes mellitus, and the results showed that it can detect diseases that are associated with diabetes (e.g., neurodegenerative diseases) but that cannot be predicted by individual tissue expression analyses using state-of-the-art methods. Although the selected genes differed from those identified by the individual tissue analyses, the selected genes are known to be expressed in all three tissues. Thus, compared with individual tissue analyses, an integrated analysis can provide more in-depth data and identify additional factors, namely, the association with other diseases.
- Published
- 2022
- Full Text
- View/download PDF
25. Tumor Heterogeneity and Molecular Characteristics of Glioblastoma Revealed by Single-Cell RNA-Seq Data Analysis.
- Author
-
Yesudhas D, Dharshini SAP, Taguchi YH, and Gromiha MM
- Subjects
- Biomarkers, DNA Copy Number Variations, Data Analysis, Gene Expression Regulation, Neoplastic, High-Temperature Requirement A Serine Peptidase 1 genetics, Humans, RNA-Seq, Brain Neoplasms metabolism, Glioblastoma genetics, Glioblastoma pathology
- Abstract
Glioblastoma multiforme (GBM) is the most common infiltrating lethal tumor of the brain. Tumor heterogeneity and the precise characterization of GBM remain challenging, and the disease-specific and effective biomarkers are not available at present. To understand GBM heterogeneity and the disease prognosis mechanism, we carried out a single-cell transcriptome data analysis of 3389 cells from four primary IDH-WT (isocitrate dehydrogenase wild type) glioblastoma patients and compared the characteristic features of the tumor and periphery cells. We observed that the marker gene expression profiles of different cell types and the copy number variations (CNVs) are heterogeneous in the GBM samples. Further, we have identified 94 differentially expressed genes (DEGs) between tumor and periphery cells. We constructed a tissue-specific co-expression network and protein-protein interaction network for the DEGs and identified several hub genes, including CX3CR1, GAPDH, FN1, PDGFRA, HTRA1, ANXA2 THBS1, GFAP, PTN, TNC , and VIM . The DEGs were significantly enriched with proliferation and migration pathways related to glioblastoma. Additionally, we were able to identify the differentiation state of microglia and changes in the transcriptome in the presence of glioblastoma that might support tumor growth. This study provides insights into GBM heterogeneity and suggests novel potential disease-specific biomarkers which could help to identify the therapeutic targets in GBM.
- Published
- 2022
- Full Text
- View/download PDF
26. Novel feature selection method via kernel tensor decomposition for improved multi-omics data analysis.
- Author
-
Taguchi YH and Turki T
- Subjects
- Data Analysis, Genomics, Proteomics
- Abstract
Background: Feature selection of multi-omics data analysis remains challenging owing to the size of omics datasets, comprising approximately [Formula: see text]-[Formula: see text] features. In particular, appropriate methods to weight individual omics datasets are unclear, and the approach adopted has substantial consequences for feature selection. In this study, we extended a recently proposed kernel tensor decomposition (KTD)-based unsupervised feature extraction (FE) method to integrate multi-omics datasets obtained from common samples in a weight-free manner., Method: KTD-based unsupervised FE was reformatted as the collection of kernelized tensors sharing common samples, which was applied to synthetic and real datasets., Results: The proposed advanced KTD-based unsupervised FE method showed comparative performance to that of the previously proposed KTD method, as well as tensor decomposition-based unsupervised FE, but required reduced memory and central processing unit time. Moreover, this advanced KTD method, specifically designed for multi-omics analysis, attributes P values to features, which is rare for existing multi-omics-oriented methods., Conclusions: The sample R code is available at https://github.com/tagtag/MultiR/ ., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
27. Exploring Plausible Therapeutic Targets for Alzheimer's Disease using Multi-omics Approach, Machine Learning and Docking.
- Author
-
Parvathy Dharshini SA, Sneha NP, Yesudhas D, Kulandaisamy A, Rangaswamy U, Shanmugam A, Taguchi YH, and Gromiha MM
- Subjects
- Humans, Genomics methods, Proteome, Machine Learning, Biomarkers, Alzheimer Disease drug therapy, Alzheimer Disease metabolism
- Abstract
The progressive deterioration of neurons leads to Alzheimer's disease (AD), and developing a drug for this disorder is challenging. Substantial gene/transcriptome variability from multiple cell types leads to downstream pathophysiologic consequences that represent the heterogeneity of this disease. Identifying potential biomarkers for promising therapeutics is strenuous due to the fact that the transcriptome, epigenetic, or proteome changes detected in patients are not clear whether they are the cause or consequence of the disease, which eventually makes the drug discovery efforts intricate. The advancement in scRNA-sequencing technologies helps to identify cell type-specific biomarkers that may guide the selection of the pathways and related targets specific to different stages of the disease progression. This review is focussed on the analysis of multi-omics data from various perspectives (genomic and transcriptomic variants, and single-cell expression), which provide insights to identify plausible molecular targets to combat this complex disease. Further, we briefly outlined the developments in machine learning techniques to prioritize the risk-associated genes, predict probable mutations and identify promising drug candidates from natural products., (Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.net.)
- Published
- 2022
- Full Text
- View/download PDF
28. Effects of Collagen-Glycosaminoglycan Mesh on Gene Expression as Determined by Using Principal Component Analysis-Based Unsupervised Feature Extraction.
- Author
-
Taguchi YH and Turki T
- Abstract
The development of the medical applications for substances or materials that contact cells is important. Hence, it is necessary to elucidate how substances that surround cells affect gene expression during incubation. In the current study, we compared the gene expression profiles of cell lines that were in contact with collagen-glycosaminoglycan mesh and control cells. Principal component analysis-based unsupervised feature extraction was applied to identify genes with altered expression during incubation in the treated cell lines but not in the controls. The identified genes were enriched in various biological terms. Our method also outperformed a conventional methodology, namely, gene selection based on linear regression with time course.
- Published
- 2021
- Full Text
- View/download PDF
29. Identification of Enhancers and Promoters in the Genome by Multidimensional Scaling.
- Author
-
Ishibashi R and Taguchi YH
- Subjects
- Algorithms, Databases, Genetic, Genomics methods, Humans, Multidimensional Scaling Analysis, Nucleic Acid Conformation, Tumor Cells, Cultured, Bone Neoplasms genetics, Chromosomes, Human chemistry, Enhancer Elements, Genetic, Osteosarcoma genetics, Promoter Regions, Genetic
- Abstract
The positions of enhancers and promoters on genomic DNA remain poorly understood. Chromosomes cannot be observed during the cell division cycle because the genome forms a chromatin structure and spreads within the nucleus. However, high-throughput chromosome conformation capture (Hi-C) measures the physical interactions of genomes. In previous studies, DNA extrusion loops were directly derived from Hi-C heat maps. Multidimensional Scaling (MDS) is used in this assessment to more precisely locate enhancers and promoters. MDS is a multivariate analysis method that reproduces the original coordinates from the distance matrix between elements. We used Hi-C data of cultured osteosarcoma cells and applied MDS as the distance matrix of the genome. In addition, we selected columns 2 and 3 of the orthogonal matrix U as the desired structure. Overall, the DNA loops from the reconstructed genome structure contained bioprocesses involved in transcription, such as the pre-transcriptional initiation complex and RNA polymerase II initiation complex, and transcription factors involved in cancer, such as Foxm1 and CREB3. Therefore, our results are consistent with the biological findings. Our method is suitable for identifying enhancers and promoters in the genome.
- Published
- 2021
- Full Text
- View/download PDF
30. Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis.
- Author
-
Taguchi YH and Turki T
- Subjects
- DNA Methylation, Databases, Factual, Gene Expression Profiling methods, Histones genetics, Histones metabolism, Computational Biology methods, Single-Cell Analysis methods
- Abstract
Analysis of single-cell multiomics datasets is a novel topic and is considerably challenging because such datasets contain a large number of features with numerous missing values. In this study, we implemented a recently proposed tensor-decomposition (TD)-based unsupervised feature extraction (FE) technique to address this difficult problem. The technique can successfully integrate single-cell multiomics data composed of gene expression, DNA methylation, and accessibility. Although the last two have large dimensions, as many as ten million, containing only a few percentage of nonzero values, TD-based unsupervised FE can integrate three omics datasets without filling in missing values. Together with UMAP, which is used frequently when embedding single-cell measurements into two-dimensional space, TD-based unsupervised FE can produce two-dimensional embedding coincident with classification when integrating single-cell omics datasets. Genes selected based on TD-based unsupervised FE are also significantly related to reasonable biological roles.
- Published
- 2021
- Full Text
- View/download PDF
31. PCA-based unsupervised feature extraction for gene expression analysis of COVID-19 patients.
- Author
-
Fujisawa K, Shimo M, Taguchi YH, Ikematsu S, and Miyata R
- Subjects
- Binding Sites, COVID-19 metabolism, Case-Control Studies, Epigenesis, Genetic, Gene Expression Regulation, Genetic Predisposition to Disease, Humans, Machine Learning, Signal Transduction, COVID-19 genetics, Gene Expression Profiling methods, Gene Regulatory Networks, Histones metabolism, NF-kappa B p50 Subunit metabolism, Transcription Factor RelA metabolism
- Abstract
Coronavirus disease 2019 (COVID-19) is raging worldwide. This potentially fatal infectious disease is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). However, the complete mechanism of COVID-19 is not well understood. Therefore, we analyzed gene expression profiles of COVID-19 patients to identify disease-related genes through an innovative machine learning method that enables a data-driven strategy for gene selection from a data set with a small number of samples and many candidates. Principal-component-analysis-based unsupervised feature extraction (PCAUFE) was applied to the RNA expression profiles of 16 COVID-19 patients and 18 healthy control subjects. The results identified 123 genes as critical for COVID-19 progression from 60,683 candidate probes, including immune-related genes. The 123 genes were enriched in binding sites for transcription factors NFKB1 and RELA, which are involved in various biological phenomena such as immune response and cell survival: the primary mediator of canonical nuclear factor-kappa B (NF-κB) activity is the heterodimer RelA-p50. The genes were also enriched in histone modification H3K36me3, and they largely overlapped the target genes of NFKB1 and RELA. We found that the overlapping genes were downregulated in COVID-19 patients. These results suggest that canonical NF-κB activity was suppressed by H3K36me3 in COVID-19 patient blood., (© 2021. The Author(s).)
- Published
- 2021
- Full Text
- View/download PDF
32. Unsupervised tensor decomposition-based method to extract candidate transcription factors as histone modification bookmarks in post-mitotic transcriptional reactivation.
- Author
-
Taguchi YH and Turki T
- Subjects
- Cell Cycle genetics, Genome, Human genetics, Humans, Mitosis genetics, Histones genetics, Protein Processing, Post-Translational genetics, Transcription Factors genetics, Transcriptional Activation genetics
- Abstract
The histone group added to a gene sequence must be removed during mitosis to halt transcription during the DNA replication stage of the cell cycle. However, the detailed mechanism of this transcription regulation remains unclear. In particular, it is not realistic to reconstruct all appropriate histone modifications throughout the genome from scratch after mitosis. Thus, it is reasonable to assume that there might be a type of "bookmark" that retains the positions of histone modifications, which can be readily restored after mitosis. We developed a novel computational approach comprising tensor decomposition (TD)-based unsupervised feature extraction (FE) to identify transcription factors (TFs) that bind to genes associated with reactivated histone modifications as candidate histone bookmarks. To the best of our knowledge, this is the first application of TD-based unsupervised FE to the cell division context and phases pertaining to the cell cycle in general. The candidate TFs identified with this approach were functionally related to cell division, suggesting the suitability of this method and the potential of the identified TFs as bookmarks for histone modification during mitosis., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2021
- Full Text
- View/download PDF
33. Discriminating the single-cell gene regulatory networks of human pancreatic islets: A novel deep learning application.
- Author
-
Turki T and Taguchi YH
- Subjects
- Gene Regulatory Networks, Humans, Deep Learning, Islets of Langerhans
- Abstract
Analysis of single-cell pancreatic data can play an important role in understanding various metabolic diseases and health conditions. Due to the sparsity and noise present in such single-cell gene expression data, inference of single-cell gene regulatory networks remains a challenge. Since recent studies have reported the reliable inference of single-cell gene regulatory networks (SCGRNs), the current study focused on discriminating the SCGRNs of T2D patients from those of healthy controls. By accurately distinguishing SCGRNs of healthy pancreas from those of T2D pancreas, it would be possible to annotate, organize, visualize, and identify common patterns of SCGRNs in metabolic diseases. Such annotated SCGRNs could play an important role in accelerating the process of building large data repositories. This study aimed to contribute to the development of a novel deep learning (DL) application. First, we generated a dataset consisting of 224 SCGRNs belonging to both T2D and healthy pancreas and made it freely available. Next, we chose seven DL architectures, including VGG16, VGG19, Xception, ResNet50, ResNet101, DenseNet121, and DenseNet169, trained each of them on the dataset, and checked their prediction based on a test set. Of note, we evaluated the DL architectures on a single NVIDIA GeForce RTX 2080Ti GPU. Experimental results on the whole dataset, using several performance measures, demonstrated the superiority of VGG19 DL model in the automatic classification of SCGRNs, derived from the single-cell pancreatic data., (Copyright © 2021 Elsevier Ltd. All rights reserved.)
- Published
- 2021
- Full Text
- View/download PDF
34. Identification of genes associated with altered gene expression and m6A profiles during hypoxia using tensor decomposition based unsupervised feature extraction.
- Author
-
Roy SS and Taguchi YH
- Subjects
- Humans, Algorithms, Databases, Nucleic Acid, Gene Expression Profiling, Gene Expression Regulation, Hypoxia genetics, Hypoxia metabolism, Models, Biological
- Abstract
Although hypoxia is a critical factor that can drive the progression of various diseases, the mechanism underlying hypoxia itself remains unclear. Recently, m6A has been proposed as an important factor driving hypoxia. Despite successful analyses, potential genes were not selected with statistical significance but were selected based solely on fold changes. Because the number of genes is large while the number of samples is small, it was impossible to select genes using conventional feature selection methods with statistical significance. In this study, we applied the recently proposed principal component analysis (PCA), tensor decomposition (TD), and kernel tensor decomposition (KTD)-based unsupervised feature extraction (FE) to a hypoxia data set. We found that PCA, TD, and KTD-based unsupervised FE could successfully identify a limited number of genes associated with altered gene expression and m6A profiles, as well as the enrichment of hypoxia-related biological terms, with improved statistical significance.
- Published
- 2021
- Full Text
- View/download PDF
35. Editorial: miRNAs and Neurological Diseases.
- Author
-
Wang H, Taguchi YH, and Liu X
- Abstract
Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
- Published
- 2021
- Full Text
- View/download PDF
36. Exploring Common Therapeutic Targets for Neurodegenerative Disorders Using Transcriptome Study.
- Author
-
Dharshini SAP, Jemimah S, Taguchi YH, and Gromiha MM
- Abstract
Alzheimer's disease (AD) and Parkinson's disease (PD) are well-known neuronal degenerative disorders that share common pathological events. Approved medications alleviate symptoms but do not address the root cause of the disease. Energy dysfunction in the neuronal population leads to various pathological events and ultimately results in neuronal death. Identifying common therapeutic targets for these disorders may help in the drug discovery process. The Brodmann area 9 (BA9) region is affected in both the disease conditions and plays an essential role in cognitive, motor, and memory-related functions. Analyzing transcriptome data of BA9 provides deep insights related to common pathological pathways involved in AD and PD. In this work, we map the preprocessed BA9 fastq files generated by RNA-seq for disease and control samples with reference hg38 genomic assembly and identify common variants and differentially expressed genes (DEG). These variants are predominantly located in the 3' UTR (non-promoter) region, affecting the conserved transcription factor (TF) binding motifs involved in the methylation and acetylation process. We have constructed BA9-specific functional interaction networks, which show the relationship between TFs and DEGs. Based on expression signature analysis, we propose that MAPK1, VEGFR1/FLT1, and FGFR1 are promising drug targets to restore blood-brain barrier functionality by reducing neuroinflammation and may save neurons., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2021 Dharshini, Jemimah, Taguchi and Gromiha.)
- Published
- 2021
- Full Text
- View/download PDF
37. Application of Tensor Decomposition to Gene Expression of Infection of Mouse Hepatitis Virus Can Identify Critical Human Genes and Efffective Drugs for SARS-CoV-2 Infection.
- Author
-
Taguchi YH and Turki T
- Abstract
To better understand the genes with altered expression caused by infection with the novel coronavirus strain SARS-CoV-2 causing COVID-19 infectious disease, a tensor decomposition (TD)-based unsupervised feature extraction (FE) approach was applied to a gene expression profile dataset of the mouse liver and spleen with experimental infection of mouse hepatitis virus, which is regarded as a suitable model of human coronavirus infection. TD-based unsupervised FE selected 134 altered genes, which were enriched in protein-protein interactions with orf1ab, polyprotein, and 3C-like protease that are well known to play critical roles in coronavirus infection, suggesting that these 134 genes can represent the coronavirus infectious process. We then selected compounds targeting the expression of the 134 selected genes based on a public domain database. The identified drug compounds were mainly related to known antiviral drugs, several of which were also included in those previously screened with an in silico method to identify candidate drugs for treating COVID-19., (This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.)
- Published
- 2021
- Full Text
- View/download PDF
38. Tensor-Decomposition-Based Unsupervised Feature Extraction Applied to Prostate Cancer Multiomics Data.
- Author
-
Taguchi YH and Turki T
- Subjects
- Humans, Male, Algorithms, Databases, Genetic, Gene Expression Profiling, Gene Expression Regulation, Neoplastic, Neoplasm Proteins biosynthesis, Neoplasm Proteins genetics, Prostatic Neoplasms genetics, Prostatic Neoplasms metabolism, Transcription Factors biosynthesis, Transcription Factors genetics
- Abstract
The large p small n problem is a challenge without a de facto standard method available to it. In this study, we propose a tensor-decomposition (TD)-based unsupervised feature extraction (FE) formalism applied to multiomics datasets, in which the number of features is more than 100,000 whereas the number of samples is as small as about 100, hence constituting a typical large p small n problem. The proposed TD-based unsupervised FE outperformed other conventional supervised feature selection methods, random forest, categorical regression (also known as analysis of variance, or ANOVA), penalized linear discriminant analysis, and two unsupervised methods, multiple non-negative matrix factorization and principal component analysis (PCA) based unsupervised FE when applied to synthetic datasets and four methods other than PCA based unsupervised FE when applied to multiomics datasets. The genes selected by TD-based unsupervised FE were enriched in genes known to be related to tissues and transcription factors measured. TD-based unsupervised FE was demonstrated to be not only the superior feature selection method but also the method that can select biologically reliable genes. To our knowledge, this is the first study in which TD-based unsupervised FE has been successfully applied to the integration of this variety of multiomics measurements.
- Published
- 2020
- Full Text
- View/download PDF
39. Comprehensive analysis of liver and blood miRNA in precancerous conditions.
- Author
-
Umezu T, Tsuneyama K, Kanekura K, Hayakawa M, Tanahashi T, Kawano M, Taguchi YH, Toyoda H, Tamori A, Kuroda M, and Murakami Y
- Subjects
- Animals, Carcinoma, Hepatocellular blood, Exosomes metabolism, Humans, Liver Neoplasms blood, Mice, Precancerous Conditions blood, Predictive Value of Tests, Biomarkers, Tumor analysis, Biomarkers, Tumor blood, Carcinoma, Hepatocellular diagnosis, Carcinoma, Hepatocellular genetics, Liver metabolism, Liver Neoplasms diagnosis, Liver Neoplasms genetics, MicroRNAs analysis, MicroRNAs blood, Precancerous Conditions diagnosis, Precancerous Conditions genetics
- Abstract
Streptozotocin administration to mice (STZ-mice) induces type I diabetes and hepatocellular carcinoma (HCC). We attempted to elucidate the carcinogenic mechanism and the miRNA expression status in the liver and blood during the precancerous state. Serum and liver tissues were collected from STZ-mice and non-treated mice (CTL-mice) at 6, 10, and 12 W. The exosome enriched fraction extracted from serum was used. Hepatic histological examination and hepatic and exosomal miRNA expression analysis were serially performed using next-generation sequencing (NGS). Human miRNA expression analysis of chronic hepatitis liver tissue and exosomes, which were collected before starting the antiviral treatment, were also performed. No inflammation or fibrosis was found in the liver of CTL-mice during the observation period. In STZ-mice, regeneration and inflammation of hepatocytes was found at 6 W and nodules of atypical hepatocytes were found at 10 and 12 W. In the liver tissue, during 6-12 W, the expression levels of let-7f-5p, miR-143-3p, 148a-3p, 191-5p, 192-5p, 21a-5p, 22-3p, 26a-5p, and 92a-3p was significantly increased in STZ-mice, and anti-oncogenes of their target gene candidates were down-regulated. miR-122-5p was also significantly down-regulated in STZ-mice. Fifteen exosomal miRNAs were upregulated in STZ-mice. Six miRNAs (let-7f-5p, miR-10b-5p, 143-3p, 191-5p, 21a-5p, and 26a-5p) were upregulated, similarly to human HCC cases. From the precancerous state, aberrant expression of hepatic miRNAs has already occurred, and then, it can promote carcinogenesis. In exosomes, the expression pattern of common miRNAs between mice and humans before carcinogenesis was observed and can be expected to be developed as a cancer predictive marker.
- Published
- 2020
- Full Text
- View/download PDF
40. Identification of miRNA signatures for kidney renal clear cell carcinoma using the tensor-decomposition method.
- Author
-
Ng KL and Taguchi YH
- Subjects
- Biomarkers, Tumor genetics, Carcinoma, Renal Cell mortality, Databases, Genetic statistics & numerical data, Gene Expression Profiling statistics & numerical data, Gene Expression Regulation, Neoplastic, Gene Regulatory Networks, Humans, Kaplan-Meier Estimate, Kidney Neoplasms mortality, Prognosis, RNA, Messenger genetics, Unsupervised Machine Learning, Carcinoma, Renal Cell genetics, Gene Expression Profiling methods, Kidney Neoplasms genetics, MicroRNAs genetics
- Abstract
Cancer is a highly complex disease caused by multiple genetic factors. MicroRNA (miRNA) and mRNA expression profiles are useful for identifying prognostic biomarkers for cancer. Kidney renal clear cell carcinoma (KIRC), which accounts for more than 70% of all renal malignant tumour cases, was selected for our analysis. Traditional methods of identifying cancer prognostic markers may not be accurate. Tensor decomposition (TD) is a useful method uncovering the underlying low-dimensional structures in the tensor. The TD-based unsupervised feature extraction method was applied to analyse mRNA and miRNA expression profiles. Biological annotations of the prognostic miRNAs and mRNAs were examined utilizing the pathway and oncogenic signature databases DIANA-miRPath and MSigDB. TD identified the miRNA signatures and the associated genes. These genes were found to be involved in cancer-related pathways, and 23 genes were significantly correlated with the survival of KIRC patients. We demonstrated that the results are robust and not highly dependent upon the databases we selected. Compared with traditional supervised methods tested, TD achieves much better performance in selecting prognostic miRNAs and mRNAs. These results suggest that integrated analysis using the TD-based unsupervised feature extraction technique is an effective strategy for identifying prognostic signatures in cancer studies.
- Published
- 2020
- Full Text
- View/download PDF
41. A new advanced in silico drug discovery method for novel coronavirus (SARS-CoV-2) with tensor decomposition-based unsupervised feature extraction.
- Author
-
Taguchi YH and Turki T
- Subjects
- A549 Cells, Antiviral Agents chemistry, Antiviral Agents classification, Humans, SARS-CoV-2, Antiviral Agents pharmacology, Betacoronavirus drug effects, Drug Discovery methods, Unsupervised Machine Learning
- Abstract
Background: COVID-19 is a critical pandemic that has affected human communities worldwide, and there is an urgent need to develop effective drugs. Although there are a large number of candidate drug compounds that may be useful for treating COVID-19, the evaluation of these drugs is time-consuming and costly. Thus, screening to identify potentially effective drugs prior to experimental validation is necessary., Method: In this study, we applied the recently proposed method tensor decomposition (TD)-based unsupervised feature extraction (FE) to gene expression profiles of multiple lung cancer cell lines infected with severe acute respiratory syndrome coronavirus 2. We identified drug candidate compounds that significantly altered the expression of the 163 genes selected by TD-based unsupervised FE., Results: Numerous drugs were successfully screened, including many known antiviral drug compounds such as C646, chelerythrine chloride, canertinib, BX-795, sorafenib, sorafenib, QL-X-138, radicicol, A-443654, CGP-60474, alvocidib, mitoxantrone, QL-XII-47, geldanamycin, fluticasone, atorvastatin, quercetin, motexafin gadolinium, trovafloxacin, doxycycline, meloxicam, gentamicin, and dibromochloromethane. The screen also identified ivermectin, which was first identified as an anti-parasite drug and recently the drug was included in clinical trials for SARS-CoV-2., Conclusions: The drugs screened using our strategy may be effective candidates for treating patients with COVID-19., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2020
- Full Text
- View/download PDF
42. Identifying suitable tools for variant detection and differential gene expression using RNA-seq data.
- Author
-
Dharshini SAP, Taguchi YH, and Gromiha MM
- Subjects
- Alzheimer Disease genetics, Alzheimer Disease metabolism, Genome-Wide Association Study, Genomics methods, Hippocampus metabolism, Humans, Sequence Alignment, Genetic Variation, RNA-Seq methods
- Abstract
Neurodegenerative diseases are the most predominate brain disorders around the globe and the affected populations are rapidly increasing. Recently, these diseases have been addressed using the data obtained from RNA-sequencing technology to reveal the changes in gene/transcript expression, effect of variants, and pathways involved in disease mechanisms. However, the observations mainly depend on the aligners/tools and the performance of existing RNA-seq tools on hg38 genome assembly has not yet been documented. In this study, we performed a systematic analysis of various spliced aligners, transcript assembling and variant calling tools based on both genomic assemblies (hg19/hg38) from hippocampus brain tissue. This helps to identify the best possible combination tools for hg38 annotation. In order to evaluate the identified variants from various pipelines, we compared them with expression Quantitative Trait Loci (eQTL) and Genome-Wide Association Study (GWAS). In addition, the identified differentially expressed genes (DG) were compared with microarray studies. From our analysis of variant calling, the combination of GATK (Genome Analysis Tool-kit) and STAR (Spliced Transcripts Alignment to a Reference) protocol yields a larger number of GWAS/eQTL variants compared to SAMtools (Sequence Alignment Map). We also identified a higher number of non-coding variants in hg38 compared to hg19 due to enhanced annotation. In the case of various DG pipelines, we found that the Salmon-based hg38 transcriptomic quantification yields a higher number of reported DG compared to other genome-based quantification methods. This study revealed that higher number of reads maps to multiple location of the genome with hg38 compared to hg19, and these spurious multi-mapped reads may affect the gene quantification techniques. We suggest that it is necessary to develop efficient algorithms, which can handle the multi-mapped reads and improve the performance of genome-based alignment quantification., (Copyright © 2019 Elsevier Inc. All rights reserved.)
- Published
- 2020
- Full Text
- View/download PDF
43. Author Correction: Development of a novel anti-hepatitis B virus agent via Sp1.
- Author
-
Hayakawa M, Umeyama H, Iwadate M, Taguchi YH, Yano Y, Honda T, Itami-Matsumoto S, Kozuka R, Enomoto M, Tamori A, Kawada N, and Murakami Y
- Abstract
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
- Published
- 2020
- Full Text
- View/download PDF
44. SCGRNs: Novel supervised inference of single-cell gene regulatory networks of complex diseases.
- Author
-
Turki T and Taguchi YH
- Subjects
- Algorithms, Gene Expression Regulation, Sequence Analysis, RNA, Gene Regulatory Networks genetics, Transcription Factors genetics, Transcription Factors metabolism
- Abstract
Single-cell gene regulatory network (SCGRN) inference refers to the process of inferring gene regulatory networks from single-cell data, which are generated via single-cell RNA-sequencing (scRNA-seq) technologies. Although scRNA-seq leads to the generation of data pertaining to cells of particular interest, the single-cell data are noisy and highly sparse, which makes the analysis of such data a challenging task. In this study, we model an SCGRN as a directed graph where an edge from a source node (also called transcription factor (TF)) to a target node (also called target gene) indicates that a TF regulates a target gene. Inferring the SCGRN via predicting TF-target gene regulations would help biologists better understand various diseases in terms of networks. Following the modeling step, we propose three machine learning approaches. The first approach considers feature vectors encoding regulatory relationships of expressed TFs-target genes as input. The resulting model is then used to predict unseen TF-target gene regulations. The second machine learning approach constructs new feature vectors via incorporating features obtained from stacked autoencoders, which are provided to a machine learning algorithm to induce a model and predict unseen regulations of TFs-target genes. The third approach extends the second approach via including topological features extracted from an SCGRN. We perform an experimental study comparing our approaches against adapted unsupervised approaches. Experimental results on SCGRNs pertaining to healthy and type 2 pancreatic diabetes demonstrate the clinical importance and the accurate prediction performance of the proposed approaches., Competing Interests: Declaration of competing interest None declared., (Copyright © 2020 Elsevier Ltd. All rights reserved.)
- Published
- 2020
- Full Text
- View/download PDF
45. Development of a novel anti-hepatitis B virus agent via Sp1.
- Author
-
Hayakawa M, Umeyama H, Iwadate M, Taguchi YH, Yano Y, Honda T, Itami-Matsumoto S, Kozuka R, Enomoto M, Tamori A, Kawada N, and Murakami Y
- Subjects
- Antiviral Agents chemistry, DNA, Circular genetics, DNA, Viral genetics, Glycoside Hydrolase Inhibitors chemistry, Hepatitis B virology, Hepatitis B virus isolation & purification, Hepatocytes drug effects, Hepatocytes virology, High-Throughput Screening Assays, Humans, alpha-Glucosidases chemistry, Antiviral Agents pharmacology, Drug Development, Glycoside Hydrolase Inhibitors pharmacology, Hepatitis B drug therapy, Hepatitis B virus drug effects, Sp1 Transcription Factor metabolism, Virus Replication drug effects
- Abstract
Nucleos(t)ide analog (NA) therapy has proven effective in treating chronic hepatitis B. However, NAs frequently result in viral relapse after the cessation of therapy. This is because NAs cannot fully eliminate the viral episomal covalently closed circular DNA (cccDNA) in the nucleus. In this study, we identified small molecular compounds that control host factors related to viral replication using in silico screening with simulated annealing based on bioinformatics for protein-ligand flexible docking. Twelve chemical compound candidates for alpha-glucosidase (AG) inhibitors were identified from a library of chemical compounds and used to treat fresh human hepatocytes infected with HBV. They were then monitored for their anti-viral effects. HBV replication was inhibited by one candidate (1-[3-(4-tert-butylcyclohexyl)oxy-2-hydroxypropyl]-2,2,6,6-tetramethylpiperidin-4-ol) in a dose-dependent manner. This compound significantly reduced ccc DNA production, compared to Entecavir (p < 0.05), and had a lower anti-AG effect. Gene expression analysis and structural analysis of this compound showed that its inhibitive effect on HBV was via interaction with Sp1. The nuclear transcription factor Sp1 acts on multiple regions of HBV to suppress HBV replication. Identifying candidates that control nuclear transcription factors facilitate the development of novel therapies. Drugs with a mechanism different from NA are promising for the elimination of HBV.
- Published
- 2020
- Full Text
- View/download PDF
46. Neurological Disorder Drug Discovery from Gene Expression with Tensor Decomposition.
- Author
-
Taguchi YH and Turki T
- Subjects
- Animals, Gene Expression, Humans, Mice, RNA-Seq, Alzheimer Disease genetics, Drug Discovery, Gene Expression Profiling
- Abstract
Background: Identifying effective candidate drug compounds in patients with neurological disorders based on gene expression data is of great importance to the neurology field. By identifying effective candidate drugs to a given neurological disorder, neurologists would (1) reduce the time searching for effective treatments; and (2) gain additional useful information that leads to a better treatment outcome. Although there are many strategies to screen drug candidate in pre-clinical stage, it is not easy to check if candidate drug compounds can also be effective to human., Objective: We tried to propose a strategy to screen genes whose expression is altered in model animal experiments to be compared with gene expressed differentially with drug treatment to human cell lines., Methods: Recently proposed tensor decomposition (TD) based unsupervised feature extraction (FE) is applied to single cell (sc) RNA-seq experiments of Alzheimer's disease model animal mouse brain., Results: Four hundreds and one genes are screened as those differentially expressed during Aβ accumulation as age progresses. These genes are significantly overlapped with those expressed differentially with the known drug treatments for three independent data sets: LINCS, DrugMatrix, and GEO., Conclusion: Our strategy, application of TD based unsupervised FE, is useful one to screen drug candidate compounds using scRNA-seq data set., (Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.net.)
- Published
- 2020
- Full Text
- View/download PDF
47. Investigating the energy crisis in Alzheimer disease using transcriptome study.
- Author
-
Dharshini SAP, Taguchi YH, and Gromiha MM
- Subjects
- Acetylation, Aged, Aged, 80 and over, Alzheimer Disease physiopathology, Computational Biology, Computer Simulation, Female, Gene Expression Profiling, Genome-Wide Association Study, Histones metabolism, Humans, Lactic Acid, Male, Methylation, Protein Binding, RNA-Seq, Transcription Factors metabolism, Transcriptome, Alzheimer Disease metabolism, Energy Metabolism, Gene Expression Regulation, Hippocampus metabolism, Neurons metabolism
- Abstract
Alzheimer disease (AD) is a devastating neurological disorder, which initiates from hippocampus and proliferates to cortical regions. The neurons of hippocampus require higher energy to preserve the firing pattern. In AD, aberrant energy metabolism is the critical factor for neurodegeneration. However, the reason for the energy crisis in hippocampus neurons is still unresolved. Transcriptome analysis enables us in understanding the underlying mechanism of energy crisis. In this study, we identified variants/differential gene/transcript expression profiles from hippocampus RNA-seq data. We predicted the effect of variants in transcription factor (TF) binding using in silico tools. Further, a hippocampus-specific co-expression and functional interaction network were designed to decipher the relationships between TF and differentially expressed genes (DG). Identified variants predominantly influence TF binding, which subsequently regulates the DG. From the results, we hypothesize that the loss of vascular integrity is the fundamental attribute for the energy crisis, which leads to neurodegeneration.
- Published
- 2019
- Full Text
- View/download PDF
48. Tensor Decomposition-Based Unsupervised Feature Extraction Applied to Single-Cell Gene Expression Analysis.
- Author
-
Taguchi YH and Turki T
- Abstract
Although single-cell RNA sequencing (scRNA-seq) technology is newly invented and a promising one, but because of lack of enough information that labels individual cells, it is hard to interpret the obtained gene expression of each cell. Because of insufficient information available, unsupervised clustering, for example, t -distributed stochastic neighbor embedding and uniform manifold approximation and projection, is usually employed to obtain low-dimensional embedding that can help to understand cell-cell relationship. One possible drawback of this strategy is that the outcome is highly dependent upon genes selected for the usage of clustering. In order to fulfill this requirement, there are many methods that performed unsupervised gene selection. In this study, a tensor decomposition (TD)-based unsupervised feature extraction (FE) was applied to the integration of two scRNA-seq expression profiles that measure human and mouse midbrain development. TD-based unsupervised FE could select not only coincident genes between human and mouse but also biologically reliable genes. Coincidence between two species as well as biological reliability of selected genes is increased compared with that using principal component analysis (PCA)-based FE applied to the same data set in the previous study. Since PCA-based unsupervised FE outperformed the other three popular unsupervised gene selection methods, highly variable genes, bimodal genes, and dpFeature, TD-based unsupervised FE can do so as well. In addition to this, 10 transcription factors (TFs) that might regulate selected genes and might contribute to midbrain development were identified. These 10 TFs, BHLHE40, EGR1, GABPA, IRF3, PPARG, REST, RFX5, STAT3, TCF7L2, and ZBTB33, were previously reported to be related to brain functions and diseases. TD-based unsupervised FE is a promising method to integrate two scRNA-seq profiles effectively., (Copyright © 2019 Taguchi and Turki.)
- Published
- 2019
- Full Text
- View/download PDF
49. Exploring the selective vulnerability in Alzheimer disease using tissue specific variant analysis.
- Author
-
Akila Parvathy Dharshini S, Taguchi YH, and Michael Gromiha M
- Subjects
- 3' Untranslated Regions, Alzheimer Disease pathology, Brain pathology, Humans, Metabolic Networks and Pathways genetics, Organ Specificity, Quantitative Trait Loci, Alzheimer Disease genetics, Brain metabolism, Polymorphism, Single Nucleotide, Transcriptome
- Abstract
The selective vulnerability of distinct regions of the brain is a critical factor in neurodegenerative disorders. In Alzheimer's disease (AD), neurons in hippocampus situated in medial temporal lobe are immensely damaged. Identifying tissue-specific variants is essential in order to perceive the selective vulnerability in AD. In current work, we aligned mRNA-seq data with HG19/HG38 genomic assembly and identified specific variations present in temporal, frontal and other lobes of the AD using sequence alignment map tools. We compared the results with the genome-wide association and gene expression quantitative trait loci studies of the various neurological disorders. We also distinguished variants and epitranscriptomic modifications through the RNA-modification database and evaluated the variant effect in the coding/UTR regions. In addition, we developed genetic and functional interaction networks to understand the relationship between predicted vulnerable variations and differentially expressed genes. We found that genes involved in gliogenesis, intermediate filament organization are altered in the temporal lobe. Oxidative phosphorylation, and calcium ion homeostasis are modified in the frontal lobe, and protein degradation, apoptotic signaling are altered in other lobes. From this study, we propose that disruption of glial cell structural integrity, defective gliogenesis, and failure in glia-neuron communication are the primary factors for selective vulnerability., (Copyright © 2018 Elsevier Inc. All rights reserved.)
- Published
- 2019
- Full Text
- View/download PDF
50. Drug candidate identification based on gene expression of treated cells using tensor decomposition-based unsupervised feature extraction for large-scale data.
- Author
-
Taguchi YH
- Subjects
- Cell Line, Computer Simulation, Humans, Principal Component Analysis, Protein Interaction Maps, Transcription Factors metabolism, Algorithms, Drug Discovery, Transcriptome
- Abstract
Background: Although in silico drug discovery is necessary for drug development, two major strategies, a structure-based and ligand-based approach, have not been completely successful. Currently, the third approach, inference of drug candidates from gene expression profiles obtained from the cells treated with the compounds under study requires the use of a training dataset. Here, the purpose was to develop a new approach that does not require any pre-existing knowledge about the drug-protein interactions, but these interactions can be inferred by means of an integrated approach using gene expression profiles obtained from the cells treated with the analysed compounds and the existing data describing gene-gene interactions., Results: In the present study, using tensor decomposition-based unsupervised feature extraction, which represents an extension of the recently proposed principal-component analysis-based feature extraction, gene sets and compounds with a significant dose-dependent activity were screened without any training datasets. Next, after these results were combined with the data showing perturbations in single-gene expression profiles, genes targeted by the analysed compounds were inferred. The set of target genes thus identified was shown to significantly overlap with known target genes of the compounds under study., Conclusions: The method is specifically designed for large-scale datasets (including hundreds of treatments with compounds), not for conventional small-scale datasets. The obtained results indicate that two compounds that have not been extensively studied, WZ-3105 and CGP-60474, represent promising drug candidates targeting multiple cancers, including melanoma, adenocarcinoma, liver carcinoma, and breast, colon, and prostate cancers, which were analysed in this in silico study.
- Published
- 2019
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.