78 results on '"Taguchi YH"'
Search Results
2. Novel artificial intelligence-based identification of drug-gene-disease interaction using protein-protein interaction.
- Author
-
Taguchi YH and Turki T
- Subjects
- Humans, Neoplasms drug therapy, Neoplasms genetics, Neoplasms metabolism, Drug Repositioning methods, Antineoplastic Agents pharmacology, Protein Interaction Mapping methods, Computational Biology methods, Artificial Intelligence
- Abstract
The evaluation of drug-gene-disease interactions is key for the identification of drugs effective against disease. However, at present, drugs that are effective against genes that are critical for disease are difficult to identify. Following a disease-centric approach, there is a need to identify genes critical to disease function and find drugs that are effective against them. By contrast, following a drug-centric approach comprises identifying the genes targeted by drugs, and then the diseases in which the identified genes are critical. Both of these processes are complex. Using a gene-centric approach, whereby we identify genes that are effective against the disease and can be targeted by drugs, is much easier. However, how such sets of genes can be identified without specifying either the target diseases or drugs is not known. In this study, a novel artificial intelligence-based approach that employs unsupervised methods and identifies genes without specifying neither diseases nor drugs is presented. To evaluate its feasibility, we applied tensor decomposition (TD)-based unsupervised feature extraction (FE) to perform drug repositioning from protein-protein interactions (PPI) without any other information. Proteins selected by TD-based unsupervised FE include many genes related to cancers, as well as drugs that target the selected proteins. Thus, we were able to identify cancer drugs using only PPI. Because the selected proteins had more interactions, we replaced the selected proteins with hub proteins and found that hub proteins themselves could be used for drug repositioning. In contrast to hub proteins, which can only identify cancer drugs, TD-based unsupervised FE enables the identification of drugs for other diseases. In addition, TD-based unsupervised FE can be used to identify drugs that are effective in in vivo experiments, which is difficult when hub proteins are used. In conclusion, TD-based unsupervised FE is a useful tool for drug repositioning using only PPI without other information., Competing Interests: Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Conflict of interest: The authors declare no Conflict of interest., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
3. Gene Selection of Methionine-Dependent Melanoma and Independent Melanoma by Variable Selection Using Tensor Decomposition.
- Author
-
Kobayashi K and Taguchi YH
- Subjects
- Humans, Cell Line, Tumor, Transcriptome, Methionine metabolism, Melanoma genetics, Melanoma metabolism, Melanoma pathology, Gene Expression Regulation, Neoplastic
- Abstract
Methionine is an essential amino acid. Dietary methionine restriction is associated with decreased tumor growth in preclinical studies and extended lifespans in animal models. The mechanism by which methionine restriction inhibits tumor growth while sparing normal cells is not fully understood. In this study, we applied tensor decomposition-based feature extraction for gene selection from the gene expression profiles of two cell lines of RNA sequencing. We compared two human melanoma cell lines, A101D and MeWo. A101D is a typical cancer cell line that exhibits methionine dependence. MeWo is a methionine-independent cell line. We used the application on R, TDbasedUFE, to perform an enrichment analysis of the selected gene set. Consequently, concordance with existing research on the differences between methionine-dependent melanoma and methionine-independent melanoma was confirmed. Targeting methionine metabolism is considered a promising strategy for treating melanoma and other cancers.
- Published
- 2024
- Full Text
- View/download PDF
4. Novel large empirical study of deep transfer learning for COVID-19 classification based on CT and X-ray images.
- Author
-
Almutaani M, Turki T, and Taguchi YH
- Subjects
- Humans, SARS-CoV-2 isolation & purification, Databases, Factual, Image Processing, Computer-Assisted methods, COVID-19 diagnostic imaging, Deep Learning, Tomography, X-Ray Computed methods
- Abstract
The early and highly accurate prediction of COVID-19 based on medical images can speed up the diagnostic process and thereby mitigate disease spread; therefore, developing AI-based models is an inevitable endeavor. The presented work, to our knowledge, is the first to expand the model space and identify a better performing model among 10,000 constructed deep transfer learning (DTL) models as follows. First, we downloaded and processed 4481 CT and X-ray images pertaining to COVID-19 and non-COVID-19 patients, obtained from the Kaggle repository. Second, we provide processed images as inputs to four pre-trained deep learning models (ConvNeXt, EfficientNetV2, DenseNet121, and ResNet34) on more than a million images from the ImageNet database, in which we froze the convolutional and pooling layers pertaining to the feature extraction part while unfreezing and training the densely connected classifier with the Adam optimizer. Third, we generate and take a majority vote of two, three, and four combinations from the four DTL models, resulting in [Formula: see text] DTL models. Then, we combine the 11 DTL models, followed by consecutively generating and taking the majority vote of [Formula: see text] DTL models. Finally, we select [Formula: see text] DTL models from [Formula: see text] Experimental results from the whole datasets using five-fold cross-validation demonstrate that the best generated DTL model, named HC, achieving the best AUC of 0.909 when applied to the CT dataset, while ConvNeXt yielded a higher marginal AUC of 0.933 compared to 0.93 for HX when considering the X-ray dataset. These promising results set the foundation for promoting the large generation of models (LGM) in AI., (© 2024. The Author(s).)
- Published
- 2024
- Full Text
- View/download PDF
5. Tracing ALS Degeneration: Insights from Spinal Cord and Cortex Transcriptomes.
- Author
-
Sneha NP, Dharshini SAP, Taguchi YH, and Gromiha MM
- Subjects
- Humans, Frontal Lobe metabolism, Frontal Lobe pathology, Gene Regulatory Networks, Motor Neurons metabolism, Motor Neurons pathology, Amyotrophic Lateral Sclerosis genetics, Amyotrophic Lateral Sclerosis metabolism, Amyotrophic Lateral Sclerosis pathology, Transcriptome, Spinal Cord metabolism, Spinal Cord pathology
- Abstract
Background/objectives: Amyotrophic Lateral Sclerosis is a progressive neurodegenerative disorder characterized by the loss of upper and lower motor neurons. Key factors contributing to neuronal death include mitochondrial energy damage, oxidative stress, and excitotoxicity. The frontal cortex is crucial for action initiation, planning, and voluntary movements whereas the spinal cord facilitates communication with the brain, walking, and reflexes. By investigating transcriptome data from the frontal cortex and spinal cord, we aim to elucidate common pathological mechanisms and pathways involved in ALS for understanding the disease progression and identifying potential therapeutic targets., Methods: In this study, we quantified gene and transcript expression patterns, predicted variants, and assessed their functional effects using computational tools. It also includes predicting variant-associated regulatory effects, constructing functional interaction networks, and performing a gene enrichment analysis., Results: We found novel genes for the upregulation of immune response, and the downregulation of metabolic-related and defective degradation processes in both the spinal cord and frontal cortex. Additionally, we observed the dysregulation of histone regulation and blood pressure-related genes specifically in the frontal cortex., Conclusions: These results highlight the distinct and shared molecular disruptions in ALS, emphasizing the critical roles of immune response and metabolic dysfunction in neuronal degeneration. Targeting these pathways may provide new therapeutic avenues to combat neurodegeneration and preserve neuronal health.
- Published
- 2024
- Full Text
- View/download PDF
6. Characterization of circulating miRNAs in the treatment of primary liver tumors.
- Author
-
Umezu T, Tanaka S, Kubo S, Enomoto M, Tamori A, Ochiya T, Taguchi YH, Kuroda M, and Murakami Y
- Subjects
- Humans, Hepacivirus genetics, Carcinogenesis, Liver Neoplasms diagnosis, Liver Neoplasms genetics, Liver Neoplasms therapy, Carcinoma, Hepatocellular diagnosis, Carcinoma, Hepatocellular genetics, Carcinoma, Hepatocellular therapy, MicroRNAs genetics, Hepatitis C
- Abstract
Background and Aim: Circulating micro RNAs (miRNAs) indicate clinical pathologies such as inflammation and carcinogenesis. In this study, we aimed to investigate whether miRNA expression level patterns in could be used to diagnose hepatocellular carcinoma (HCC) and biliary tract cancer (BTC), and the relationship miRNA expression patterns and cancer etiology., Methods: Patients with HCC and BTC with indications for surgery were selected for the study. Total RNA was extracted from the extracellular vesicle (EV)-rich fraction of the serum and analyzed using Toray miRNA microarray. Samples were divided into two cohorts in order of collection, the first 85 HCC were analyzed using a microarray based on miRBase ver.2.0 (hereafter v20 cohort), and the second 177 HCC and 43 BTC were analyzed using a microarray based on miRBase ver.21 (hereafter v21 cohort)., Results: Using miRNA expression patterns, we found that HCC and BTC could be identified with an area under curve (AUC) 0.754 (v21 cohort). Patients with anti-hepatitis C virus (HCV) treatment (SVR-HCC) and without antiviral treatment (HCV-HCC) could be distinguished by an AUC 0.811 (v20 cohort) and AUC 0.798 (v21 cohort), respectively., Conclusions: In this study, we could diagnose primary hepatic malignant tumor using miRNA expression patterns. Moreover, the difference of miRNA expression in SVR-HCC and HCV-HCC can be important information for enclosing cases that are prone to carcinogenesis after being cured with antiviral agents, but also for uncovering the mechanism for some carcinogenic potential remains even after persistent virus infection has disappeared., (© 2023 The Authors. Cancer Reports published by Wiley Periodicals LLC.)
- Published
- 2024
- Full Text
- View/download PDF
7. Investigating Neuron Degeneration in Huntington's Disease Using RNA-Seq Based Transcriptome Study.
- Author
-
Sneha NP, Dharshini SAP, Taguchi YH, and Gromiha MM
- Subjects
- Humans, RNA-Seq, Transcriptome genetics, Nerve Degeneration, Huntington Disease genetics, Chorea
- Abstract
Huntington's disease (HD) is a progressive neurodegenerative disorder caused due to a CAG repeat expansion in the huntingtin ( HTT ) gene. The primary symptoms of HD include motor dysfunction such as chorea, dystonia, and involuntary movements. The primary motor cortex (BA4) is the key brain region responsible for executing motor/movement activities. Investigating patient and control samples from the BA4 region will provide a deeper understanding of the genes responsible for neuron degeneration and help to identify potential markers. Previous studies have focused on overall differential gene expression and associated biological functions. In this study, we illustrate the relationship between variants and differentially expressed genes/transcripts. We identified variants and their associated genes along with the quantification of genes and transcripts. We also predicted the effect of variants on various regulatory activities and found that many variants are regulating gene expression. Variants affecting miRNA and its targets are also highlighted in our study. Co-expression network studies revealed the role of novel genes. Function interaction network analysis unveiled the importance of genes involved in vesicle-mediated transport. From this unified approach, we propose that genes expressed in immune cells are crucial for reducing neuron death in HD.
- Published
- 2023
- Full Text
- View/download PDF
8. Application note: TDbasedUFE and TDbasedUFEadv: bioconductor packages to perform tensor decomposition based unsupervised feature extraction.
- Author
-
Taguchi YH and Turki T
- Abstract
Motivation: Tensor decomposition (TD)-based unsupervised feature extraction (FE) has proven effective for a wide range of bioinformatics applications ranging from biomarker identification to the identification of disease-causing genes and drug repositioning. However, TD-based unsupervised FE failed to gain widespread acceptance due to the lack of user-friendly tools for non-experts., Results: We developed two bioconductor packages-TDbasedUFE and TDbasedUFEadv-that enable researchers unfamiliar with TD to utilize TD-based unsupervised FE. The packages facilitate the identification of differentially expressed genes and multiomics analysis. TDbasedUFE was found to outperform two state-of-the-art methods, such as DESeq2 and DIABLO., Availability and Implementation: TDbasedUFE and TDbasedUFEadv are freely available as R/Bioconductor packages, which can be accessed at https://bioconductor.org/packages/TDbasedUFE and https://bioconductor.org/packages/TDbasedUFEadv, respectively., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2023 Taguchi and Turki.)
- Published
- 2023
- Full Text
- View/download PDF
9. Integrated analysis of human DNA methylation, gene expression, and genomic variation in iMETHYL database using kernel tensor decomposition-based unsupervised feature extraction.
- Author
-
Taguchi YH, Komaki S, Sutoh Y, Ohmomo H, Otsuka-Yamasaki Y, and Shimizu A
- Subjects
- Humans, Databases, Factual, Genomics, Gene Expression, DNA Methylation, Transcription Factors
- Abstract
Integrating gene expression, DNA methylation, and genomic variants simultaneously without location coincidence (i.e., irrespective of distance from each other) or pairwise coincidence (i.e., direct identification of triplets of gene expression, DNA methylation, and genomic variants, and not integration of pairwise coincidences) is difficult. In this study, we integrated gene expression, DNA methylation, and genome variants from the iMETHYL database using the recently proposed kernel tensor decomposition-based unsupervised feature extraction method with limited computational resources (i.e., short CPU time and small memory requirements). Our methods do not require prior knowledge of the subjects because they are fully unsupervised in that unsupervised tensor decomposition is used. The selected genes and genomic variants were significantly targeted by transcription factors that were biologically enriched in KEGG pathway terms as well as in the intra-related regulatory network. The proposed method is promising for integrated analyses of gene expression, methylation, and genomic variants with limited computational resources., Competing Interests: The authors have declared that no competing interests exist., (Copyright: © 2023 Taguchi et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2023
- Full Text
- View/download PDF
10. Principal component analysis- and tensor decomposition-based unsupervised feature extraction to select more suitable differentially methylated cytosines: Optimization of standard deviation versus state-of-the-art methods.
- Author
-
Taguchi YH and Turki T
- Subjects
- CpG Islands, Principal Component Analysis, DNA Methylation, Genome
- Abstract
In contrast to RNA-seq analysis, which has various standard methods, no standard methods for identifying differentially methylated cytosines (DMCs) exist. To identify DMCs, we tested principal component analysis and tensor decomposition-based unsupervised feature extraction with optimized standard deviation, which has been shown to be effective for differentially expressed gene (DEG) identification. The proposed method outperformed certain conventional methods, including those that assume beta-binomial distribution for methylation as the proposed method does not require this, especially when applied to methylation profiles measured using high throughput sequencing. DMCs identified by the proposed method also significantly overlapped with various functional sites, including known differentially methylated regions, enhancers, and DNase I hypersensitive sites. The proposed method was applied to data sets retrieved from The Cancer Genome Atlas to identify DMCs using American Joint Committee on Cancer staging system edition labels. This suggests that the proposed method is a promising standard method for identifying DMCs., Competing Interests: Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2023 The Authors. Published by Elsevier Inc. All rights reserved.)
- Published
- 2023
- Full Text
- View/download PDF
11. A new machine learning based computational framework identifies therapeutic targets and unveils influential genes in pancreatic islet cells.
- Author
-
Turki T and Taguchi YH
- Subjects
- Blood Glucose metabolism, Insulin genetics, Insulin metabolism, Glucagon, Gene Expression Profiling, Islets of Langerhans metabolism, Insulin-Secreting Cells metabolism
- Abstract
Pancreatic islets comprise a group of cells that produce hormones regulating blood glucose levels. Particularly, the alpha and beta islet cells produce glucagon and insulin to stabilize blood glucose. When beta islet cells are dysfunctional, insulin is not secreted, inducing a glucose metabolic disorder. Identifying effective therapeutic targets against the disease is a complicated task and is not yet conclusive. To close the wide gap between understanding the molecular mechanism of pancreatic islet cells and providing effective therapeutic targets, we present a computational framework to identify potential therapeutic targets against pancreatic disorders. First, we downloaded three transcriptome expression profiling datasets pertaining to pancreatic islet cells (GSE87375, GSE79457, GSE110154) from the Gene Expression Omnibus database. For each dataset, we extracted expression profiles for two cell types. We then provided these expression profiles along with the cell types to our proposed constrained optimization problem of a support vector machine and to other existing methods, selecting important genes from the expression profiles. Finally, we performed (1) an evaluation from a classification perspective which showed the superiority of our methods against the baseline; and (2) an enrichment analysis which indicated that our methods achieved better outcomes. Results for the three datasets included 44 unique genes and 10 unique transcription factors (SP1, HDAC1, EGR1, E2F1, AR, STAT6, RELA, SP3, NFKB1, and ESR1) which are reportedly related to pancreatic islet functions, diseases, and therapeutic targets., Competing Interests: Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper., (Copyright © 2022 Elsevier B.V. All rights reserved.)
- Published
- 2023
- Full Text
- View/download PDF
12. Features extracted using tensor decomposition reflect the biological features of the temporal patterns of human blood multimodal metabolome.
- Author
-
Fujita S, Karasawa Y, Hironaka KI, Taguchi YH, and Kuroda S
- Subjects
- Humans, Blood Chemical Analysis, Metabolome, Blood
- Abstract
High-throughput omics technologies have enabled the profiling of entire biological systems. For the biological interpretation of such omics data, two analyses, hypothesis- and data-driven analyses including tensor decomposition, have been used. Both analyses have their own advantages and disadvantages and are mutually complementary; however, a direct comparison of these two analyses for omics data is poorly examined.We applied tensor decomposition (TD) to a dataset representing changes in the concentrations of 562 blood molecules at 14 time points in 20 healthy human subjects after ingestion of 75 g oral glucose. We characterized each molecule by individual dependence (constant or variable) and time dependence (later peak or early peak). Three of the four features extracted by TD were characterized by our previous hypothesis-driven study, indicating that TD can extract some of the same features obtained by hypothesis-driven analysis in a non-biased manner. In contrast to the years taken for our previous hypothesis-driven analysis, the data-driven analysis in this study took days, indicating that TD can extract biological features in a non-biased manner without the time-consuming process of hypothesis generation., Competing Interests: The authors have no competing interests to declare., (Copyright: © 2023 Fujita et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
- Published
- 2023
- Full Text
- View/download PDF
13. Bioinformatic tools for epitranscriptomics.
- Author
-
Taguchi YH
- Subjects
- Computational Biology methods, Genomics, High-Throughput Nucleotide Sequencing, Transcriptome genetics, RNA Processing, Post-Transcriptional
- Abstract
The epitranscriptome, defined as RNA modifications that do not involve alterations in the nucleotide sequence, is a popular topic in the genomic sciences. Because we need massive computational techniques to identify epitranscriptomes within individual transcripts, many tools have been developed to infer epitranscriptomic sites as well as to process datasets using high-throughput sequencing. In this review, we summarize recent developments in epitranscriptome spatial detection and data analysis and discuss their progression.
- Published
- 2023
- Full Text
- View/download PDF
14. Integrative Meta-Analysis of Huntington's Disease Transcriptome Landscape.
- Author
-
Sneha NP, Dharshini SAP, Taguchi YH, and Gromiha MM
- Subjects
- Humans, Genome-Wide Association Study, Neuroinflammatory Diseases, Gene Expression Regulation, Transcriptome genetics, Huntington Disease genetics
- Abstract
Huntington's disease (HD) is a neurodegenerative disorder with autosomal dominant inheritance caused by glutamine expansion in the Huntingtin gene (HTT). Striatal projection neurons (SPNs) in HD are more vulnerable to cell death. The executive striatal population is directly connected with the Brodmann Area (BA9), which is mainly involved in motor functions. Analyzing the disease samples from BA9 from the SRA database provides insights related to neuron degeneration, which helps to identify a promising therapeutic strategy. Most gene expression studies examine the changes in expression and associated biological functions. In this study, we elucidate the relationship between variants and their effect on gene/downstream transcript expression. We computed gene and transcript abundance and identified variants from RNA-seq data using various pipelines. We predicted the effect of genome-wide association studies (GWAS)/novel variants on regulatory functions. We found that many variants affect the histone acetylation pattern in HD, thereby perturbing the transcription factor networks. Interestingly, some variants affect miRNA binding as well as their downstream gene expression. Tissue-specific network analysis showed that mitochondrial, neuroinflammation, vasculature, and angiogenesis-related genes are disrupted in HD. From this integrative omics analysis, we propose that abnormal neuroinflammation acts as a two-edged sword that indirectly affects the vasculature and associated energy metabolism. Rehabilitation of blood-brain barrier functionality and energy metabolism may secure the neuron from cell death.
- Published
- 2022
- Full Text
- View/download PDF
15. A tensor decomposition-based integrated analysis applicable to multiple gene expression profiles without sample matching.
- Author
-
Taguchi YH and Turki T
- Subjects
- Transcriptome
- Abstract
The integrated analysis of multiple gene expression profiles previously measured in distinct studies is problematic since missing both sample matches and common labels prevent their integration in fully data-driven, unsupervised training. In this study, we propose a strategy to enable the integration of multiple gene expression profiles among multiple independent studies with neither labeling nor sample matching using tensor decomposition unsupervised feature extraction. We apply this strategy to Alzheimer's disease (AD)-related gene expression profiles that lack precise correspondence among samples, including AD single-cell RNA sequence (scRNA-seq) data. We were able to select biologically reasonable genes using the integrated analysis. Overall, integrated gene expression profiles can function analogously to prior- and/or transfer-learning strategies in other machine-learning applications. For scRNA-seq, the proposed approach significantly reduces the required computational memory., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
16. Suppression of intrahepatic cholangiocarcinoma cell growth by SKI via upregulation of the CDK inhibitor p21.
- Author
-
Kawamura E, Matsubara T, Daikoku A, Deguchi S, Kinoshita M, Yuasa H, Urushima H, Odagiri N, Motoyama H, Kotani K, Kozuka R, Hagihara A, Fujii H, Uchida-Kobayashi S, Tanaka S, Takemura S, Iwaisako K, Enomoto M, Taguchi YH, Tamori A, Kubo S, Ikeda K, and Kawada N
- Subjects
- Humans, Cyclin-Dependent Kinase Inhibitor p21 genetics, Cyclin-Dependent Kinase Inhibitor p21 metabolism, Up-Regulation genetics, Cell Proliferation genetics, Cell Cycle Proteins metabolism, Bile Ducts, Intrahepatic metabolism, Bile Ducts, Intrahepatic pathology, RNA, Messenger, Cholangiocarcinoma genetics, Cholangiocarcinoma metabolism, Cholangiocarcinoma pathology, Bile Duct Neoplasms genetics, Bile Duct Neoplasms metabolism, Bile Duct Neoplasms pathology, MicroRNAs
- Abstract
Cholangiocarcinoma (CC) has a poor prognosis and different driver genes depending on the site of onset. Intrahepatic CC is the second-most common liver cancer after hepatocellular carcinoma, and novel therapeutic targets are urgently needed. The present study was conducted to identify novel therapeutic targets by exploring differentially regulated genes in human CC. MicroRNA (miRNA) and mRNA microarrays were performed using tissue and serum samples obtained from 24 surgically resected hepatobiliary tumor cases, including 10 CC cases. We conducted principal component analysis to identify differentially expressed miRNA, leading to the identification of miRNA-3648 as a differentially expressed miRNA. We used an in silico screening approach to identify its target mRNA, the tumor suppressor Sloan Kettering Institute (SKI). SKI protein expression was decreased in human CC cells overexpressing miRNA-3648, endogenous SKI protein expression was decreased in human CC tumor tissues, and endogenous SKI mRNA expression was suppressed in human CC cells characterized by rapid growth. SKI-overexpressing OZ cells (human intrahepatic CC cells) showed upregulation of cyclin-dependent kinase inhibitor p21 mRNA and protein expression and suppressed cell proliferation. Nuclear expression of CDT1 (chromatin licensing and DNA replication factor 1), which is required for the G1/S transition, was suppressed in SKI-overexpressing OZ cells. SKI knockdown resulted in the opposite effects. Transgenic p21-luciferase was activated in SKI-overexpressing OZ cells. These data indicate SKI involvement in p21 transcription and that SKI-p21 signaling causes cell cycle arrest in G1, suppressing intrahepatic CC cell growth. Therefore, SKI may be a potential therapeutic target for intrahepatic CC., (© 2022 The Authors. FEBS Open Bio published by John Wiley & Sons Ltd on behalf of Federation of European Biochemical Societies.)
- Published
- 2022
- Full Text
- View/download PDF
17. microRNA Bioinformatics.
- Author
-
Taguchi YH
- Subjects
- Computational Biology, MicroRNAs genetics
- Abstract
Firstly, I apologize for the delayed publication of this Special Issue in the form of a book title [...]., Competing Interests: The author declares no conflict of interest.
- Published
- 2022
- Full Text
- View/download PDF
18. Adapted tensor decomposition and PCA based unsupervised feature extraction select more biologically reasonable differentially expressed genes than conventional methods.
- Author
-
Taguchi YH and Turki T
- Subjects
- Reproducibility of Results, Principal Component Analysis, Normal Distribution, Biomarkers, Algorithms
- Abstract
Tensor decomposition- and principal component analysis-based unsupervised feature extraction were proposed almost 5 and 10 years ago, respectively; although these methods have been successfully applied to a wide range of genome analyses, including drug repositioning, biomarker identification, and disease-causing genes' identification, some fundamental problems have been identified: the number of genes identified was too small to assume that there were no false negatives, and the histogram of P values derived was not fully coincident with the null hypothesis that principal component and singular value vectors follow the Gaussian distribution. Optimizing the standard deviation such that the histogram of P values is as much as possible coincident with the null hypothesis results in an increase in the number and biological reliability of the selected genes. Our contribution was that we improved these methods so as to be able to select biologically more reasonable differentially expressed genes than the state of art methods that must empirically assume negative binomial distributions and dispersion relation, which is required for the selecting more expressed genes than less expressed ones, which can be achieved by the proposed methods that do not have to assume these., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
19. Projection in genomic analysis: A theoretical basis to rationalize tensor decomposition and principal component analysis as feature selection tools.
- Author
-
Taguchi YH and Turki T
- Subjects
- Gene Order, Genomics, Humans, Principal Component Analysis, Projection, COVID-19
- Abstract
Identifying differentially expressed genes is difficult because of the small number of available samples compared with the large number of genes. Conventional gene selection methods employing statistical tests have the critical problem of heavy dependence of P-values on sample size. Although the recently proposed principal component analysis (PCA) and tensor decomposition (TD)-based unsupervised feature extraction (FE) has often outperformed these statistical test-based methods, the reason why they worked so well is unclear. In this study, we aim to understand this reason in the context of projection pursuit (PP) that was proposed a long time ago to solve the problem of dimensions; we can relate the space spanned by singular value vectors with that spanned by the optimal cluster centroids obtained from K-means. Thus, the success of PCA- and TD-based unsupervised FE can be understood by this equivalence. In addition to this, empirical threshold adjusted P-values of 0.01 assuming the null hypothesis that singular value vectors attributed to genes obey the Gaussian distribution empirically corresponds to threshold-adjusted P-values of 0.1 when the null distribution is generated by gene order shuffling. For this purpose, we newly applied PP to the three data sets to which PCA and TD based unsupervised FE were previously applied; these data sets treated two topics, biomarker identification for kidney cancers (the first two) and the drug discovery for COVID-19 (the thrid one). Then we found the coincidence between PP and PCA or TD based unsupervised FE is pretty well. Shuffling procedures described above are also successfully applied to these three data sets. These findings thus rationalize the success of PCA- and TD-based unsupervised FE for the first time., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2022
- Full Text
- View/download PDF
20. Estimation of Metabolic Effects upon Cadmium Exposure during Pregnancy Using Tensor Decomposition.
- Author
-
Amakura Y and Taguchi YH
- Subjects
- Pregnancy, Humans, Female, Reactive Oxygen Species metabolism, Insulin Receptor Substrate Proteins, Endoplasmic Reticulum-Associated Degradation, Insulin genetics, Insulin metabolism, Ceramides, Sphingolipids, Cadmium toxicity, Cadmium metabolism, Tumor Necrosis Factor-alpha metabolism
- Abstract
A simple tensor decomposition model was applied to the liver transcriptome analysis data to elucidate the cause of cadmium-induced gene overexpression. In addition, we estimated the mechanism by which prenatal Cd exposure disrupts insulin metabolism in offspring. Numerous studies have reported on the toxicity of Cd. A liver transcriptome analysis revealed that Cd toxicity induces intracellular oxidative stress and mitochondrial dysfunction via changes in gene expression, which in turn induces endoplasmic reticulum-associated degradation via abnormal protein folding. However, the specific mechanisms underlying these effects remain unknown. In this study, we found that Cd-induced endoplasmic reticulum stress may promote increased expression of tumor necrosis factor-α (TNF-α). Based on the high expression of genes involved in the production of sphingolipids, it was also found that the accumulation of ceramide may induce intracellular oxidative stress through the overproduction of reactive oxygen species. In addition, the high expression of a set of genes involved in the electron transfer system may contribute to oxidative stress. These findings allowed us to identify the mechanisms by which intracellular oxidative stress leads to the phosphorylation of insulin receptor substrate 1, which plays a significant role in the insulin signaling pathway.
- Published
- 2022
- Full Text
- View/download PDF
21. Integrated Analysis of Tissue-Specific Gene Expression in Diabetes by Tensor Decomposition Can Identify Possible Associated Diseases.
- Author
-
Taguchi YH and Turki T
- Subjects
- Humans, Liver, Transcriptome genetics, Diabetes Mellitus genetics
- Abstract
In the field of gene expression analysis, methods of integrating multiple gene expression profiles are still being developed and the existing methods have scope for improvement. The previously proposed tensor decomposition-based unsupervised feature extraction method was improved by introducing standard deviation optimization. The improved method was applied to perform an integrated analysis of three tissue-specific gene expression profiles (namely, adipose, muscle, and liver) for diabetes mellitus, and the results showed that it can detect diseases that are associated with diabetes (e.g., neurodegenerative diseases) but that cannot be predicted by individual tissue expression analyses using state-of-the-art methods. Although the selected genes differed from those identified by the individual tissue analyses, the selected genes are known to be expressed in all three tissues. Thus, compared with individual tissue analyses, an integrated analysis can provide more in-depth data and identify additional factors, namely, the association with other diseases.
- Published
- 2022
- Full Text
- View/download PDF
22. Tumor Heterogeneity and Molecular Characteristics of Glioblastoma Revealed by Single-Cell RNA-Seq Data Analysis.
- Author
-
Yesudhas D, Dharshini SAP, Taguchi YH, and Gromiha MM
- Subjects
- Biomarkers, DNA Copy Number Variations, Data Analysis, Gene Expression Regulation, Neoplastic, High-Temperature Requirement A Serine Peptidase 1 genetics, Humans, RNA-Seq, Brain Neoplasms metabolism, Glioblastoma genetics, Glioblastoma pathology
- Abstract
Glioblastoma multiforme (GBM) is the most common infiltrating lethal tumor of the brain. Tumor heterogeneity and the precise characterization of GBM remain challenging, and the disease-specific and effective biomarkers are not available at present. To understand GBM heterogeneity and the disease prognosis mechanism, we carried out a single-cell transcriptome data analysis of 3389 cells from four primary IDH-WT (isocitrate dehydrogenase wild type) glioblastoma patients and compared the characteristic features of the tumor and periphery cells. We observed that the marker gene expression profiles of different cell types and the copy number variations (CNVs) are heterogeneous in the GBM samples. Further, we have identified 94 differentially expressed genes (DEGs) between tumor and periphery cells. We constructed a tissue-specific co-expression network and protein-protein interaction network for the DEGs and identified several hub genes, including CX3CR1, GAPDH, FN1, PDGFRA, HTRA1, ANXA2 THBS1, GFAP, PTN, TNC , and VIM . The DEGs were significantly enriched with proliferation and migration pathways related to glioblastoma. Additionally, we were able to identify the differentiation state of microglia and changes in the transcriptome in the presence of glioblastoma that might support tumor growth. This study provides insights into GBM heterogeneity and suggests novel potential disease-specific biomarkers which could help to identify the therapeutic targets in GBM.
- Published
- 2022
- Full Text
- View/download PDF
23. Novel feature selection method via kernel tensor decomposition for improved multi-omics data analysis.
- Author
-
Taguchi YH and Turki T
- Subjects
- Data Analysis, Genomics, Proteomics
- Abstract
Background: Feature selection of multi-omics data analysis remains challenging owing to the size of omics datasets, comprising approximately [Formula: see text]-[Formula: see text] features. In particular, appropriate methods to weight individual omics datasets are unclear, and the approach adopted has substantial consequences for feature selection. In this study, we extended a recently proposed kernel tensor decomposition (KTD)-based unsupervised feature extraction (FE) method to integrate multi-omics datasets obtained from common samples in a weight-free manner., Method: KTD-based unsupervised FE was reformatted as the collection of kernelized tensors sharing common samples, which was applied to synthetic and real datasets., Results: The proposed advanced KTD-based unsupervised FE method showed comparative performance to that of the previously proposed KTD method, as well as tensor decomposition-based unsupervised FE, but required reduced memory and central processing unit time. Moreover, this advanced KTD method, specifically designed for multi-omics analysis, attributes P values to features, which is rare for existing multi-omics-oriented methods., Conclusions: The sample R code is available at https://github.com/tagtag/MultiR/ ., (© 2022. The Author(s).)
- Published
- 2022
- Full Text
- View/download PDF
24. Effects of Collagen-Glycosaminoglycan Mesh on Gene Expression as Determined by Using Principal Component Analysis-Based Unsupervised Feature Extraction.
- Author
-
Taguchi YH and Turki T
- Abstract
The development of the medical applications for substances or materials that contact cells is important. Hence, it is necessary to elucidate how substances that surround cells affect gene expression during incubation. In the current study, we compared the gene expression profiles of cell lines that were in contact with collagen-glycosaminoglycan mesh and control cells. Principal component analysis-based unsupervised feature extraction was applied to identify genes with altered expression during incubation in the treated cell lines but not in the controls. The identified genes were enriched in various biological terms. Our method also outperformed a conventional methodology, namely, gene selection based on linear regression with time course.
- Published
- 2021
- Full Text
- View/download PDF
25. Identification of Enhancers and Promoters in the Genome by Multidimensional Scaling.
- Author
-
Ishibashi R and Taguchi YH
- Subjects
- Algorithms, Databases, Genetic, Genomics methods, Humans, Multidimensional Scaling Analysis, Nucleic Acid Conformation, Tumor Cells, Cultured, Bone Neoplasms genetics, Chromosomes, Human chemistry, Enhancer Elements, Genetic, Osteosarcoma genetics, Promoter Regions, Genetic
- Abstract
The positions of enhancers and promoters on genomic DNA remain poorly understood. Chromosomes cannot be observed during the cell division cycle because the genome forms a chromatin structure and spreads within the nucleus. However, high-throughput chromosome conformation capture (Hi-C) measures the physical interactions of genomes. In previous studies, DNA extrusion loops were directly derived from Hi-C heat maps. Multidimensional Scaling (MDS) is used in this assessment to more precisely locate enhancers and promoters. MDS is a multivariate analysis method that reproduces the original coordinates from the distance matrix between elements. We used Hi-C data of cultured osteosarcoma cells and applied MDS as the distance matrix of the genome. In addition, we selected columns 2 and 3 of the orthogonal matrix U as the desired structure. Overall, the DNA loops from the reconstructed genome structure contained bioprocesses involved in transcription, such as the pre-transcriptional initiation complex and RNA polymerase II initiation complex, and transcription factors involved in cancer, such as Foxm1 and CREB3. Therefore, our results are consistent with the biological findings. Our method is suitable for identifying enhancers and promoters in the genome.
- Published
- 2021
- Full Text
- View/download PDF
26. Tensor-Decomposition-Based Unsupervised Feature Extraction in Single-Cell Multiomics Data Analysis.
- Author
-
Taguchi YH and Turki T
- Subjects
- DNA Methylation, Databases, Factual, Gene Expression Profiling methods, Histones genetics, Histones metabolism, Computational Biology methods, Single-Cell Analysis methods
- Abstract
Analysis of single-cell multiomics datasets is a novel topic and is considerably challenging because such datasets contain a large number of features with numerous missing values. In this study, we implemented a recently proposed tensor-decomposition (TD)-based unsupervised feature extraction (FE) technique to address this difficult problem. The technique can successfully integrate single-cell multiomics data composed of gene expression, DNA methylation, and accessibility. Although the last two have large dimensions, as many as ten million, containing only a few percentage of nonzero values, TD-based unsupervised FE can integrate three omics datasets without filling in missing values. Together with UMAP, which is used frequently when embedding single-cell measurements into two-dimensional space, TD-based unsupervised FE can produce two-dimensional embedding coincident with classification when integrating single-cell omics datasets. Genes selected based on TD-based unsupervised FE are also significantly related to reasonable biological roles.
- Published
- 2021
- Full Text
- View/download PDF
27. PCA-based unsupervised feature extraction for gene expression analysis of COVID-19 patients.
- Author
-
Fujisawa K, Shimo M, Taguchi YH, Ikematsu S, and Miyata R
- Subjects
- Binding Sites, COVID-19 metabolism, Case-Control Studies, Epigenesis, Genetic, Gene Expression Regulation, Genetic Predisposition to Disease, Humans, Machine Learning, Signal Transduction, COVID-19 genetics, Gene Expression Profiling methods, Gene Regulatory Networks, Histones metabolism, NF-kappa B p50 Subunit metabolism, Transcription Factor RelA metabolism
- Abstract
Coronavirus disease 2019 (COVID-19) is raging worldwide. This potentially fatal infectious disease is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). However, the complete mechanism of COVID-19 is not well understood. Therefore, we analyzed gene expression profiles of COVID-19 patients to identify disease-related genes through an innovative machine learning method that enables a data-driven strategy for gene selection from a data set with a small number of samples and many candidates. Principal-component-analysis-based unsupervised feature extraction (PCAUFE) was applied to the RNA expression profiles of 16 COVID-19 patients and 18 healthy control subjects. The results identified 123 genes as critical for COVID-19 progression from 60,683 candidate probes, including immune-related genes. The 123 genes were enriched in binding sites for transcription factors NFKB1 and RELA, which are involved in various biological phenomena such as immune response and cell survival: the primary mediator of canonical nuclear factor-kappa B (NF-κB) activity is the heterodimer RelA-p50. The genes were also enriched in histone modification H3K36me3, and they largely overlapped the target genes of NFKB1 and RELA. We found that the overlapping genes were downregulated in COVID-19 patients. These results suggest that canonical NF-κB activity was suppressed by H3K36me3 in COVID-19 patient blood., (© 2021. The Author(s).)
- Published
- 2021
- Full Text
- View/download PDF
28. Unsupervised tensor decomposition-based method to extract candidate transcription factors as histone modification bookmarks in post-mitotic transcriptional reactivation.
- Author
-
Taguchi YH and Turki T
- Subjects
- Cell Cycle genetics, Genome, Human genetics, Humans, Mitosis genetics, Histones genetics, Protein Processing, Post-Translational genetics, Transcription Factors genetics, Transcriptional Activation genetics
- Abstract
The histone group added to a gene sequence must be removed during mitosis to halt transcription during the DNA replication stage of the cell cycle. However, the detailed mechanism of this transcription regulation remains unclear. In particular, it is not realistic to reconstruct all appropriate histone modifications throughout the genome from scratch after mitosis. Thus, it is reasonable to assume that there might be a type of "bookmark" that retains the positions of histone modifications, which can be readily restored after mitosis. We developed a novel computational approach comprising tensor decomposition (TD)-based unsupervised feature extraction (FE) to identify transcription factors (TFs) that bind to genes associated with reactivated histone modifications as candidate histone bookmarks. To the best of our knowledge, this is the first application of TD-based unsupervised FE to the cell division context and phases pertaining to the cell cycle in general. The candidate TFs identified with this approach were functionally related to cell division, suggesting the suitability of this method and the potential of the identified TFs as bookmarks for histone modification during mitosis., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2021
- Full Text
- View/download PDF
29. Identification of genes associated with altered gene expression and m6A profiles during hypoxia using tensor decomposition based unsupervised feature extraction.
- Author
-
Roy SS and Taguchi YH
- Subjects
- Humans, Algorithms, Databases, Nucleic Acid, Gene Expression Profiling, Gene Expression Regulation, Hypoxia genetics, Hypoxia metabolism, Models, Biological
- Abstract
Although hypoxia is a critical factor that can drive the progression of various diseases, the mechanism underlying hypoxia itself remains unclear. Recently, m6A has been proposed as an important factor driving hypoxia. Despite successful analyses, potential genes were not selected with statistical significance but were selected based solely on fold changes. Because the number of genes is large while the number of samples is small, it was impossible to select genes using conventional feature selection methods with statistical significance. In this study, we applied the recently proposed principal component analysis (PCA), tensor decomposition (TD), and kernel tensor decomposition (KTD)-based unsupervised feature extraction (FE) to a hypoxia data set. We found that PCA, TD, and KTD-based unsupervised FE could successfully identify a limited number of genes associated with altered gene expression and m6A profiles, as well as the enrichment of hypoxia-related biological terms, with improved statistical significance.
- Published
- 2021
- Full Text
- View/download PDF
30. Editorial: miRNAs and Neurological Diseases.
- Author
-
Wang H, Taguchi YH, and Liu X
- Abstract
Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
- Published
- 2021
- Full Text
- View/download PDF
31. Exploring Common Therapeutic Targets for Neurodegenerative Disorders Using Transcriptome Study.
- Author
-
Dharshini SAP, Jemimah S, Taguchi YH, and Gromiha MM
- Abstract
Alzheimer's disease (AD) and Parkinson's disease (PD) are well-known neuronal degenerative disorders that share common pathological events. Approved medications alleviate symptoms but do not address the root cause of the disease. Energy dysfunction in the neuronal population leads to various pathological events and ultimately results in neuronal death. Identifying common therapeutic targets for these disorders may help in the drug discovery process. The Brodmann area 9 (BA9) region is affected in both the disease conditions and plays an essential role in cognitive, motor, and memory-related functions. Analyzing transcriptome data of BA9 provides deep insights related to common pathological pathways involved in AD and PD. In this work, we map the preprocessed BA9 fastq files generated by RNA-seq for disease and control samples with reference hg38 genomic assembly and identify common variants and differentially expressed genes (DEG). These variants are predominantly located in the 3' UTR (non-promoter) region, affecting the conserved transcription factor (TF) binding motifs involved in the methylation and acetylation process. We have constructed BA9-specific functional interaction networks, which show the relationship between TFs and DEGs. Based on expression signature analysis, we propose that MAPK1, VEGFR1/FLT1, and FGFR1 are promising drug targets to restore blood-brain barrier functionality by reducing neuroinflammation and may save neurons., Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest., (Copyright © 2021 Dharshini, Jemimah, Taguchi and Gromiha.)
- Published
- 2021
- Full Text
- View/download PDF
32. Tensor-Decomposition-Based Unsupervised Feature Extraction Applied to Prostate Cancer Multiomics Data.
- Author
-
Taguchi YH and Turki T
- Subjects
- Humans, Male, Algorithms, Databases, Genetic, Gene Expression Profiling, Gene Expression Regulation, Neoplastic, Neoplasm Proteins biosynthesis, Neoplasm Proteins genetics, Prostatic Neoplasms genetics, Prostatic Neoplasms metabolism, Transcription Factors biosynthesis, Transcription Factors genetics
- Abstract
The large p small n problem is a challenge without a de facto standard method available to it. In this study, we propose a tensor-decomposition (TD)-based unsupervised feature extraction (FE) formalism applied to multiomics datasets, in which the number of features is more than 100,000 whereas the number of samples is as small as about 100, hence constituting a typical large p small n problem. The proposed TD-based unsupervised FE outperformed other conventional supervised feature selection methods, random forest, categorical regression (also known as analysis of variance, or ANOVA), penalized linear discriminant analysis, and two unsupervised methods, multiple non-negative matrix factorization and principal component analysis (PCA) based unsupervised FE when applied to synthetic datasets and four methods other than PCA based unsupervised FE when applied to multiomics datasets. The genes selected by TD-based unsupervised FE were enriched in genes known to be related to tissues and transcription factors measured. TD-based unsupervised FE was demonstrated to be not only the superior feature selection method but also the method that can select biologically reliable genes. To our knowledge, this is the first study in which TD-based unsupervised FE has been successfully applied to the integration of this variety of multiomics measurements.
- Published
- 2020
- Full Text
- View/download PDF
33. Comprehensive analysis of liver and blood miRNA in precancerous conditions.
- Author
-
Umezu T, Tsuneyama K, Kanekura K, Hayakawa M, Tanahashi T, Kawano M, Taguchi YH, Toyoda H, Tamori A, Kuroda M, and Murakami Y
- Subjects
- Animals, Carcinoma, Hepatocellular blood, Exosomes metabolism, Humans, Liver Neoplasms blood, Mice, Precancerous Conditions blood, Predictive Value of Tests, Biomarkers, Tumor analysis, Biomarkers, Tumor blood, Carcinoma, Hepatocellular diagnosis, Carcinoma, Hepatocellular genetics, Liver metabolism, Liver Neoplasms diagnosis, Liver Neoplasms genetics, MicroRNAs analysis, MicroRNAs blood, Precancerous Conditions diagnosis, Precancerous Conditions genetics
- Abstract
Streptozotocin administration to mice (STZ-mice) induces type I diabetes and hepatocellular carcinoma (HCC). We attempted to elucidate the carcinogenic mechanism and the miRNA expression status in the liver and blood during the precancerous state. Serum and liver tissues were collected from STZ-mice and non-treated mice (CTL-mice) at 6, 10, and 12 W. The exosome enriched fraction extracted from serum was used. Hepatic histological examination and hepatic and exosomal miRNA expression analysis were serially performed using next-generation sequencing (NGS). Human miRNA expression analysis of chronic hepatitis liver tissue and exosomes, which were collected before starting the antiviral treatment, were also performed. No inflammation or fibrosis was found in the liver of CTL-mice during the observation period. In STZ-mice, regeneration and inflammation of hepatocytes was found at 6 W and nodules of atypical hepatocytes were found at 10 and 12 W. In the liver tissue, during 6-12 W, the expression levels of let-7f-5p, miR-143-3p, 148a-3p, 191-5p, 192-5p, 21a-5p, 22-3p, 26a-5p, and 92a-3p was significantly increased in STZ-mice, and anti-oncogenes of their target gene candidates were down-regulated. miR-122-5p was also significantly down-regulated in STZ-mice. Fifteen exosomal miRNAs were upregulated in STZ-mice. Six miRNAs (let-7f-5p, miR-10b-5p, 143-3p, 191-5p, 21a-5p, and 26a-5p) were upregulated, similarly to human HCC cases. From the precancerous state, aberrant expression of hepatic miRNAs has already occurred, and then, it can promote carcinogenesis. In exosomes, the expression pattern of common miRNAs between mice and humans before carcinogenesis was observed and can be expected to be developed as a cancer predictive marker.
- Published
- 2020
- Full Text
- View/download PDF
34. Identification of miRNA signatures for kidney renal clear cell carcinoma using the tensor-decomposition method.
- Author
-
Ng KL and Taguchi YH
- Subjects
- Biomarkers, Tumor genetics, Carcinoma, Renal Cell mortality, Databases, Genetic statistics & numerical data, Gene Expression Profiling statistics & numerical data, Gene Expression Regulation, Neoplastic, Gene Regulatory Networks, Humans, Kaplan-Meier Estimate, Kidney Neoplasms mortality, Prognosis, RNA, Messenger genetics, Unsupervised Machine Learning, Carcinoma, Renal Cell genetics, Gene Expression Profiling methods, Kidney Neoplasms genetics, MicroRNAs genetics
- Abstract
Cancer is a highly complex disease caused by multiple genetic factors. MicroRNA (miRNA) and mRNA expression profiles are useful for identifying prognostic biomarkers for cancer. Kidney renal clear cell carcinoma (KIRC), which accounts for more than 70% of all renal malignant tumour cases, was selected for our analysis. Traditional methods of identifying cancer prognostic markers may not be accurate. Tensor decomposition (TD) is a useful method uncovering the underlying low-dimensional structures in the tensor. The TD-based unsupervised feature extraction method was applied to analyse mRNA and miRNA expression profiles. Biological annotations of the prognostic miRNAs and mRNAs were examined utilizing the pathway and oncogenic signature databases DIANA-miRPath and MSigDB. TD identified the miRNA signatures and the associated genes. These genes were found to be involved in cancer-related pathways, and 23 genes were significantly correlated with the survival of KIRC patients. We demonstrated that the results are robust and not highly dependent upon the databases we selected. Compared with traditional supervised methods tested, TD achieves much better performance in selecting prognostic miRNAs and mRNAs. These results suggest that integrated analysis using the TD-based unsupervised feature extraction technique is an effective strategy for identifying prognostic signatures in cancer studies.
- Published
- 2020
- Full Text
- View/download PDF
35. A new advanced in silico drug discovery method for novel coronavirus (SARS-CoV-2) with tensor decomposition-based unsupervised feature extraction.
- Author
-
Taguchi YH and Turki T
- Subjects
- A549 Cells, Antiviral Agents chemistry, Antiviral Agents classification, Humans, SARS-CoV-2, Antiviral Agents pharmacology, Betacoronavirus drug effects, Drug Discovery methods, Unsupervised Machine Learning
- Abstract
Background: COVID-19 is a critical pandemic that has affected human communities worldwide, and there is an urgent need to develop effective drugs. Although there are a large number of candidate drug compounds that may be useful for treating COVID-19, the evaluation of these drugs is time-consuming and costly. Thus, screening to identify potentially effective drugs prior to experimental validation is necessary., Method: In this study, we applied the recently proposed method tensor decomposition (TD)-based unsupervised feature extraction (FE) to gene expression profiles of multiple lung cancer cell lines infected with severe acute respiratory syndrome coronavirus 2. We identified drug candidate compounds that significantly altered the expression of the 163 genes selected by TD-based unsupervised FE., Results: Numerous drugs were successfully screened, including many known antiviral drug compounds such as C646, chelerythrine chloride, canertinib, BX-795, sorafenib, sorafenib, QL-X-138, radicicol, A-443654, CGP-60474, alvocidib, mitoxantrone, QL-XII-47, geldanamycin, fluticasone, atorvastatin, quercetin, motexafin gadolinium, trovafloxacin, doxycycline, meloxicam, gentamicin, and dibromochloromethane. The screen also identified ivermectin, which was first identified as an anti-parasite drug and recently the drug was included in clinical trials for SARS-CoV-2., Conclusions: The drugs screened using our strategy may be effective candidates for treating patients with COVID-19., Competing Interests: The authors have declared that no competing interests exist.
- Published
- 2020
- Full Text
- View/download PDF
36. Identifying suitable tools for variant detection and differential gene expression using RNA-seq data.
- Author
-
Dharshini SAP, Taguchi YH, and Gromiha MM
- Subjects
- Alzheimer Disease genetics, Alzheimer Disease metabolism, Genome-Wide Association Study, Genomics methods, Hippocampus metabolism, Humans, Sequence Alignment, Genetic Variation, RNA-Seq methods
- Abstract
Neurodegenerative diseases are the most predominate brain disorders around the globe and the affected populations are rapidly increasing. Recently, these diseases have been addressed using the data obtained from RNA-sequencing technology to reveal the changes in gene/transcript expression, effect of variants, and pathways involved in disease mechanisms. However, the observations mainly depend on the aligners/tools and the performance of existing RNA-seq tools on hg38 genome assembly has not yet been documented. In this study, we performed a systematic analysis of various spliced aligners, transcript assembling and variant calling tools based on both genomic assemblies (hg19/hg38) from hippocampus brain tissue. This helps to identify the best possible combination tools for hg38 annotation. In order to evaluate the identified variants from various pipelines, we compared them with expression Quantitative Trait Loci (eQTL) and Genome-Wide Association Study (GWAS). In addition, the identified differentially expressed genes (DG) were compared with microarray studies. From our analysis of variant calling, the combination of GATK (Genome Analysis Tool-kit) and STAR (Spliced Transcripts Alignment to a Reference) protocol yields a larger number of GWAS/eQTL variants compared to SAMtools (Sequence Alignment Map). We also identified a higher number of non-coding variants in hg38 compared to hg19 due to enhanced annotation. In the case of various DG pipelines, we found that the Salmon-based hg38 transcriptomic quantification yields a higher number of reported DG compared to other genome-based quantification methods. This study revealed that higher number of reads maps to multiple location of the genome with hg38 compared to hg19, and these spurious multi-mapped reads may affect the gene quantification techniques. We suggest that it is necessary to develop efficient algorithms, which can handle the multi-mapped reads and improve the performance of genome-based alignment quantification., (Copyright © 2019 Elsevier Inc. All rights reserved.)
- Published
- 2020
- Full Text
- View/download PDF
37. Author Correction: Development of a novel anti-hepatitis B virus agent via Sp1.
- Author
-
Hayakawa M, Umeyama H, Iwadate M, Taguchi YH, Yano Y, Honda T, Itami-Matsumoto S, Kozuka R, Enomoto M, Tamori A, Kawada N, and Murakami Y
- Abstract
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
- Published
- 2020
- Full Text
- View/download PDF
38. Development of a novel anti-hepatitis B virus agent via Sp1.
- Author
-
Hayakawa M, Umeyama H, Iwadate M, Taguchi YH, Yano Y, Honda T, Itami-Matsumoto S, Kozuka R, Enomoto M, Tamori A, Kawada N, and Murakami Y
- Subjects
- Antiviral Agents chemistry, DNA, Circular genetics, DNA, Viral genetics, Glycoside Hydrolase Inhibitors chemistry, Hepatitis B virology, Hepatitis B virus isolation & purification, Hepatocytes drug effects, Hepatocytes virology, High-Throughput Screening Assays, Humans, alpha-Glucosidases chemistry, Antiviral Agents pharmacology, Drug Development, Glycoside Hydrolase Inhibitors pharmacology, Hepatitis B drug therapy, Hepatitis B virus drug effects, Sp1 Transcription Factor metabolism, Virus Replication drug effects
- Abstract
Nucleos(t)ide analog (NA) therapy has proven effective in treating chronic hepatitis B. However, NAs frequently result in viral relapse after the cessation of therapy. This is because NAs cannot fully eliminate the viral episomal covalently closed circular DNA (cccDNA) in the nucleus. In this study, we identified small molecular compounds that control host factors related to viral replication using in silico screening with simulated annealing based on bioinformatics for protein-ligand flexible docking. Twelve chemical compound candidates for alpha-glucosidase (AG) inhibitors were identified from a library of chemical compounds and used to treat fresh human hepatocytes infected with HBV. They were then monitored for their anti-viral effects. HBV replication was inhibited by one candidate (1-[3-(4-tert-butylcyclohexyl)oxy-2-hydroxypropyl]-2,2,6,6-tetramethylpiperidin-4-ol) in a dose-dependent manner. This compound significantly reduced ccc DNA production, compared to Entecavir (p < 0.05), and had a lower anti-AG effect. Gene expression analysis and structural analysis of this compound showed that its inhibitive effect on HBV was via interaction with Sp1. The nuclear transcription factor Sp1 acts on multiple regions of HBV to suppress HBV replication. Identifying candidates that control nuclear transcription factors facilitate the development of novel therapies. Drugs with a mechanism different from NA are promising for the elimination of HBV.
- Published
- 2020
- Full Text
- View/download PDF
39. Investigating the energy crisis in Alzheimer disease using transcriptome study.
- Author
-
Dharshini SAP, Taguchi YH, and Gromiha MM
- Subjects
- Acetylation, Aged, Aged, 80 and over, Alzheimer Disease physiopathology, Computational Biology, Computer Simulation, Female, Gene Expression Profiling, Genome-Wide Association Study, Histones metabolism, Humans, Lactic Acid, Male, Methylation, Protein Binding, RNA-Seq, Transcription Factors metabolism, Transcriptome, Alzheimer Disease metabolism, Energy Metabolism, Gene Expression Regulation, Hippocampus metabolism, Neurons metabolism
- Abstract
Alzheimer disease (AD) is a devastating neurological disorder, which initiates from hippocampus and proliferates to cortical regions. The neurons of hippocampus require higher energy to preserve the firing pattern. In AD, aberrant energy metabolism is the critical factor for neurodegeneration. However, the reason for the energy crisis in hippocampus neurons is still unresolved. Transcriptome analysis enables us in understanding the underlying mechanism of energy crisis. In this study, we identified variants/differential gene/transcript expression profiles from hippocampus RNA-seq data. We predicted the effect of variants in transcription factor (TF) binding using in silico tools. Further, a hippocampus-specific co-expression and functional interaction network were designed to decipher the relationships between TF and differentially expressed genes (DG). Identified variants predominantly influence TF binding, which subsequently regulates the DG. From the results, we hypothesize that the loss of vascular integrity is the fundamental attribute for the energy crisis, which leads to neurodegeneration.
- Published
- 2019
- Full Text
- View/download PDF
40. Tensor Decomposition-Based Unsupervised Feature Extraction Applied to Single-Cell Gene Expression Analysis.
- Author
-
Taguchi YH and Turki T
- Abstract
Although single-cell RNA sequencing (scRNA-seq) technology is newly invented and a promising one, but because of lack of enough information that labels individual cells, it is hard to interpret the obtained gene expression of each cell. Because of insufficient information available, unsupervised clustering, for example, t -distributed stochastic neighbor embedding and uniform manifold approximation and projection, is usually employed to obtain low-dimensional embedding that can help to understand cell-cell relationship. One possible drawback of this strategy is that the outcome is highly dependent upon genes selected for the usage of clustering. In order to fulfill this requirement, there are many methods that performed unsupervised gene selection. In this study, a tensor decomposition (TD)-based unsupervised feature extraction (FE) was applied to the integration of two scRNA-seq expression profiles that measure human and mouse midbrain development. TD-based unsupervised FE could select not only coincident genes between human and mouse but also biologically reliable genes. Coincidence between two species as well as biological reliability of selected genes is increased compared with that using principal component analysis (PCA)-based FE applied to the same data set in the previous study. Since PCA-based unsupervised FE outperformed the other three popular unsupervised gene selection methods, highly variable genes, bimodal genes, and dpFeature, TD-based unsupervised FE can do so as well. In addition to this, 10 transcription factors (TFs) that might regulate selected genes and might contribute to midbrain development were identified. These 10 TFs, BHLHE40, EGR1, GABPA, IRF3, PPARG, REST, RFX5, STAT3, TCF7L2, and ZBTB33, were previously reported to be related to brain functions and diseases. TD-based unsupervised FE is a promising method to integrate two scRNA-seq profiles effectively., (Copyright © 2019 Taguchi and Turki.)
- Published
- 2019
- Full Text
- View/download PDF
41. Exploring the selective vulnerability in Alzheimer disease using tissue specific variant analysis.
- Author
-
Akila Parvathy Dharshini S, Taguchi YH, and Michael Gromiha M
- Subjects
- 3' Untranslated Regions, Alzheimer Disease pathology, Brain pathology, Humans, Metabolic Networks and Pathways genetics, Organ Specificity, Quantitative Trait Loci, Alzheimer Disease genetics, Brain metabolism, Polymorphism, Single Nucleotide, Transcriptome
- Abstract
The selective vulnerability of distinct regions of the brain is a critical factor in neurodegenerative disorders. In Alzheimer's disease (AD), neurons in hippocampus situated in medial temporal lobe are immensely damaged. Identifying tissue-specific variants is essential in order to perceive the selective vulnerability in AD. In current work, we aligned mRNA-seq data with HG19/HG38 genomic assembly and identified specific variations present in temporal, frontal and other lobes of the AD using sequence alignment map tools. We compared the results with the genome-wide association and gene expression quantitative trait loci studies of the various neurological disorders. We also distinguished variants and epitranscriptomic modifications through the RNA-modification database and evaluated the variant effect in the coding/UTR regions. In addition, we developed genetic and functional interaction networks to understand the relationship between predicted vulnerable variations and differentially expressed genes. We found that genes involved in gliogenesis, intermediate filament organization are altered in the temporal lobe. Oxidative phosphorylation, and calcium ion homeostasis are modified in the frontal lobe, and protein degradation, apoptotic signaling are altered in other lobes. From this study, we propose that disruption of glial cell structural integrity, defective gliogenesis, and failure in glia-neuron communication are the primary factors for selective vulnerability., (Copyright © 2018 Elsevier Inc. All rights reserved.)
- Published
- 2019
- Full Text
- View/download PDF
42. Drug candidate identification based on gene expression of treated cells using tensor decomposition-based unsupervised feature extraction for large-scale data.
- Author
-
Taguchi YH
- Subjects
- Cell Line, Computer Simulation, Humans, Principal Component Analysis, Protein Interaction Maps, Transcription Factors metabolism, Algorithms, Drug Discovery, Transcriptome
- Abstract
Background: Although in silico drug discovery is necessary for drug development, two major strategies, a structure-based and ligand-based approach, have not been completely successful. Currently, the third approach, inference of drug candidates from gene expression profiles obtained from the cells treated with the compounds under study requires the use of a training dataset. Here, the purpose was to develop a new approach that does not require any pre-existing knowledge about the drug-protein interactions, but these interactions can be inferred by means of an integrated approach using gene expression profiles obtained from the cells treated with the analysed compounds and the existing data describing gene-gene interactions., Results: In the present study, using tensor decomposition-based unsupervised feature extraction, which represents an extension of the recently proposed principal-component analysis-based feature extraction, gene sets and compounds with a significant dose-dependent activity were screened without any training datasets. Next, after these results were combined with the data showing perturbations in single-gene expression profiles, genes targeted by the analysed compounds were inferred. The set of target genes thus identified was shown to significantly overlap with known target genes of the compounds under study., Conclusions: The method is specifically designed for large-scale datasets (including hundreds of treatments with compounds), not for conventional small-scale datasets. The obtained results indicate that two compounds that have not been extensively studied, WZ-3105 and CGP-60474, represent promising drug candidates targeting multiple cancers, including melanoma, adenocarcinoma, liver carcinoma, and breast, colon, and prostate cancers, which were analysed in this in silico study.
- Published
- 2019
- Full Text
- View/download PDF
43. Exploring MicroRNA Biomarkers for Parkinson's Disease from mRNA Expression Profiles.
- Author
-
Taguchi YH and Wang H
- Abstract
Parkinson's disease (PD) is a chronic, progressive neurodegenerative disease characterized by both motor and nonmotor features. The diagnose of PD is based on a review of patients' signs and symptoms, and neurological and physical examinations. So far, no tests have been devised that can conclusively diagnose PD. In this study, we explore both microRNA and gene biomarkers for PD. Microarray gene expression profiles for PD patients and healthy control are analyzed using a principal component analysis (PCA)-based unsupervised feature extraction (FE). 244 genes are selected to be potential gene biomarkers for PD. In addition, we implement these genes into Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways, and find that the 15 microRNAs (miRNAs), hsa-miR-92a-3p, 16-5p, 615-3p, 877-3p, 100-5p, 320a, 877-5p, 23a-3p, 484, 23b-3p, 15a-5p, 324-3p, 19b-3p, 7b-5p and 505-3p, significantly target these 244 genes. These miRNAs are shown to be significantly related to PD. This reveals that both selected genes and miRNAs are potential biomarkers for PD.
- Published
- 2018
- Full Text
- View/download PDF
44. Correction: Tensor decomposition-based unsupervised feature extraction applied to matrix products for multi-view data processing.
- Author
-
Taguchi YH
- Abstract
[This corrects the article DOI: 10.1371/journal.pone.0183933.].
- Published
- 2018
- Full Text
- View/download PDF
45. Tensor Decomposition-Based Unsupervised Feature Extraction Can Identify the Universal Nature of Sequence-Nonspecific Off-Target Regulation of mRNA Mediated by MicroRNA Transfection.
- Author
-
Taguchi YH
- Abstract
MicroRNA (miRNA) transfection is known to degrade target mRNAs and to decrease mRNA expression. In contrast to the notion that most of the gene expression alterations caused by miRNA transfection involve downregulation, they often involve both up- and downregulation; this phenomenon is thought to be, at least partially, mediated by sequence-nonspecific off-target effects. In this study, I used tensor decomposition-based unsupervised feature extraction to identify genes whose expression is likely to be altered by miRNA transfection. These gene sets turned out to largely overlap with one another regardless of the type of miRNA or cell lines used in the experiments. These gene sets also overlap with the gene set associated with altered expression induced by a Dicer knockout. This result suggests that the off-target effect is at least as important as the canonical function of miRNAs that suppress translation. The off-target effect is also suggested to consist of competition for the protein machinery between transfected miRNAs and miRNAs in the cell. Because the identified genes are enriched in various biological terms, these genes are likely to play critical roles in diverse biological processes., Competing Interests: The author declares no conflict of interest.
- Published
- 2018
- Full Text
- View/download PDF
46. Tensor decomposition-based and principal-component-analysis-based unsupervised feature extraction applied to the gene expression and methylation profiles in the brains of social insects with multiple castes.
- Author
-
Taguchi YH
- Subjects
- Animals, Down-Regulation genetics, Finite Element Analysis, Gene Expression Profiling, Gene Ontology, Reproducibility of Results, Social Behavior, Algorithms, Brain metabolism, DNA Methylation genetics, Gene Expression Regulation, Hierarchy, Social, Principal Component Analysis, Wasps genetics
- Abstract
Background: Even though coexistence of multiple phenotypes sharing the same genomic background is interesting, it remains incompletely understood. Epigenomic profiles may represent key factors, with unknown contributions to the development of multiple phenotypes, and social-insect castes are a good model for elucidation of the underlying mechanisms. Nonetheless, previous studies have failed to identify genes associated with aberrant gene expression and methylation profiles because of the lack of suitable methodology that can address this problem properly., Methods: A recently proposed principal component analysis (PCA)-based and tensor decomposition (TD)-based unsupervised feature extraction (FE) can solve this problem because these two approaches can deal with gene expression and methylation profiles even when a small number of samples is available., Results: PCA-based and TD-based unsupervised FE methods were applied to the analysis of gene expression and methylation profiles in the brains of two social insects, Polistes canadensis and Dinoponera quadriceps. Genes associated with differential expression and methylation between castes were identified, and analysis of enrichment of Gene Ontology terms confirmed reliability of the obtained sets of genes from the biological standpoint., Conclusions: Biologically relevant genes, shown to be associated with significant differential gene expression and methylation between castes, were identified here for the first time. The identification of these genes may help understand the mechanisms underlying epigenetic control of development of multiple phenotypes under the same genomic conditions.
- Published
- 2018
- Full Text
- View/download PDF
47. Exploring microRNA Biomarker for Amyotrophic Lateral Sclerosis.
- Author
-
Taguchi YH and Wang H
- Subjects
- Biomarkers blood, Case-Control Studies, Humans, Amyotrophic Lateral Sclerosis blood, MicroRNAs blood
- Abstract
Amyotrophic lateral sclerosis (ALS) is among the severe neuro degenerative diseases that lack widely available effective treatments. As the disease progresses, patients lose the control of voluntary muscles. Although the neuronal degeneration is the cause of this disease, the failure mechanism is still unknown. In order to seek genetic mechanisms that initiate and progress ALS, the association of microRNA (miRNA) expression with this disease was considered. Serum miRNAs from healthy controls, sporadic ALS (sALS), familial ALS (fALS) and ALS mutation carriers were investigated. Principal component analysis (PCA)-based unsupervised feature extraction (FE) was applied to these serum miRNA profiles. As a result, we predict miRNAs that can discriminate patients from healthy controls with high accuracy. Thus, these miRNAs can be potential prognosis miRNA biomarkers for ALS.
- Published
- 2018
- Full Text
- View/download PDF
48. Tensor decomposition-based unsupervised feature extraction identifies candidate genes that induce post-traumatic stress disorder-mediated heart diseases.
- Author
-
Taguchi YH
- Subjects
- Animals, DNA-Binding Proteins genetics, Gene Expression Profiling, Genes, Heart Diseases etiology, Heart Diseases genetics, Mice, Principal Component Analysis, RNA-Binding Proteins genetics, Stress Disorders, Post-Traumatic complications, Brain metabolism, Myocardium metabolism, Stress, Physiological genetics
- Abstract
Background: Although post-traumatic stress disorder (PTSD) is primarily a mental disorder, it can cause additional symptoms that do not seem to be directly related to the central nervous system, which PTSD is assumed to directly affect. PTSD-mediated heart diseases are some of such secondary disorders. In spite of the significant correlations between PTSD and heart diseases, spatial separation between the heart and brain (where PTSD is primarily active) prevents researchers from elucidating the mechanisms that bridge the two disorders. Our purpose was to identify genes linking PTSD and heart diseases., Methods: In this study, gene expression profiles of various murine tissues observed under various types of stress or without stress were analyzed in an integrated manner using tensor decomposition (TD)., Results: Based upon the obtained features, ∼ 400 genes were identified as candidate genes that may mediate heart diseases associated with PTSD. Various gene enrichment analyses supported biological reliability of the identified genes. Ten genes encoding protein-, DNA-, or mRNA-interacting proteins-ILF2, ILF3, ESR1, ESR2, RAD21, HTT, ATF2, NR3C1, TP53, and TP63-were found to be likely to regulate expression of most of these ∼ 400 genes and therefore are candidate primary genes that cause PTSD-mediated heart diseases. Approximately 400 genes in the heart were also found to be strongly affected by various drugs whose known adverse effects are related to heart diseases and/or fear memory conditioning; these data support the reliability of our findings., Conclusions: TD-based unsupervised feature extraction turned out to be a useful method for gene selection and successfully identified possible genes causing PTSD-mediated heart diseases.
- Published
- 2017
- Full Text
- View/download PDF
49. Identification of candidate drugs using tensor-decomposition-based unsupervised feature extraction in integrated analysis of gene expression between diseases and DrugMatrix datasets.
- Author
-
Taguchi YH
- Subjects
- Drug Evaluation, Preclinical, Humans, Data Analysis, Databases, Pharmaceutical, Gene Expression Regulation drug effects, Molecular Targeted Therapy, Unsupervised Machine Learning
- Abstract
Identifying drug target genes in gene expression profiles is not straightforward. Because a drug targets proteins and not mRNAs, the mRNA expression of drug target genes is not always altered. In addition, the interaction between a drug and protein can be context dependent; this means that simple drug incubation experiments on cell lines do not always reflect the real situation during active disease. In this paper, I applied tensor-decomposition-based unsupervised feature extraction to the integrated analysis using a mathematical product of gene expression in various diseases and gene expression in the DrugMatrix dataset, where comprehensive data on gene expression during various drug treatments of rats are reported. I found that this strategy, in a fully unsupervised manner, enables researchers to identify a combined set of genes and compounds that significantly overlap with gene and drug interactions identified in the past. As an example illustrating the usefulness of this strategy in drug discovery experiments, I considered cirrhosis, for which no effective drugs have ever been proposed. The present strategy identified two promising therapeutic-target genes, CYPOR and HNFA4; for their protein products, bezafibrate was identified as a promising candidate drug, supported by in silico docking analysis.
- Published
- 2017
- Full Text
- View/download PDF
50. Genetic Association between Amyotrophic Lateral Sclerosis and Cancer.
- Author
-
Taguchi YH and Wang H
- Abstract
Amyotrophic lateral sclerosis (ALS) is a fatal neurodegenerative disease. An ALS drug, Riluzole, has been shown to induce two different anticancer effects on hepatocellular carcinoma (HCC). In light of this finding, we explore the relationship between ALS and cancer, especially for HCC, from the molecular biological viewpoint. We establish biomarkers that can discriminate between ALS patients and healthy controls. A principal component analysis (PCA) based unsupervised feature extraction (FE) is used to find gene biomarkers of ALS based on microarray gene expression data. Based on this method, 101 probes were selected as biomarkers for ALS with 95% high accuracy to discriminate between ALS patients and controls. Most of the genes corresponding to these probes are shown to be related to various cancers. These findings might provide a new insight for developing new therapeutic options or drugs for both ALS and cancer., Competing Interests: The authors declare that they have no conflicts of interest.
- Published
- 2017
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.