36 results on '"Wakaguri H"'
Search Results
2. High-Resolution Characterization of Toxoplasma gondii Transcriptome with a Massive Parallel Sequencing Method
- Author
-
Yamagishi, J., primary, Wakaguri, H., additional, Ueno, A., additional, Goo, Y.-K., additional, Tolba, M., additional, Igarashi, M., additional, Nishikawa, Y., additional, Sugimoto, C., additional, Sugano, S., additional, Suzuki, Y., additional, Watanabe, J., additional, and Xuan, X., additional
- Published
- 2010
- Full Text
- View/download PDF
3. Full-Malaria/Parasites and Full-Arthropods: databases of full-length cDNAs of parasites and arthropods, update 2009
- Author
-
Wakaguri, H., primary, Suzuki, Y., additional, Katayama, T., additional, Kawashima, S., additional, Kibukawa, E., additional, Hiranuka, K., additional, Sasaki, M., additional, Sugano, S., additional, and Watanabe, J., additional
- Published
- 2009
- Full Text
- View/download PDF
4. DBTSS: database of transcription start sites, progress report 2008
- Author
-
Wakaguri, H., primary, Yamashita, R., additional, Suzuki, Y., additional, Sugano, S., additional, and Nakai, K., additional
- Published
- 2007
- Full Text
- View/download PDF
5. Comparasite: a database for comparative study of transcriptomes of parasites defined by full-length cDNAs
- Author
-
Watanabe, J., primary, Wakaguri, H., additional, Sasaki, M., additional, Suzuki, Y., additional, and Sugano, S., additional
- Published
- 2007
- Full Text
- View/download PDF
6. Diverse DNA Methylation Statuses at Alternative Promoters of Human Genes in Various Tissues
- Author
-
Cheong, J., primary, Yamada, Y., additional, Yamashita, R., additional, Irie, T., additional, Kanai, A., additional, Wakaguri, H., additional, Nakai, K., additional, Ito, T., additional, Saito, I., additional, Sugano, S., additional, and Suzuki, Y., additional
- Published
- 2006
- Full Text
- View/download PDF
7. Inconsistencies of genome annotations in apicomplexan parasites revealed by 5'-end-one-pass and full-length sequences of oligo-capped cDNAs
- Author
-
Sugano Sumio, Sasaki Masahide, Suzuki Yutaka, Wakaguri Hiroyuki, and Watanabe Junichi
- Subjects
Biotechnology ,TP248.13-248.65 ,Genetics ,QH426-470 - Abstract
Abstract Background Apicomplexan parasites are causative agents of various diseases including malaria and have been targets of extensive genomic sequencing. We generated 5'-EST collections for six apicomplexa parasites using our full-length oligo-capping cDNA library method. To improve upon the current genome annotations, as well as to validate the importance for physical cDNA clone resources, we generated a large-scale collection of full-length cDNAs for several apicomplexa parasites. Results In this study, we used a total of 61,056 5'-end-single-pass cDNA sequences from Plasmodium falciparum, P. vivax, P. yoelii, P. berghei, Cryptosporidium parvum, and Toxoplasma gondii. We compared these partially sequenced cDNA sequences with the currently annotated gene models and observed significant inconsistencies between the two datasets. In particular, we found that on average 14% of the exons in the current gene models were not supported by any cDNA evidence, and that 16% of the current gene models may contain at least one mis-annotation and should be re-evaluated. We also identified a large number of transcripts that had been previously unidentified. For 732 cDNAs in T. gondii, the entire sequences were determined in order to evaluate the annotated gene models at the complete full-length transcript level. We found that 41% of the T. gondii gene models contained at least one inconsistency. We also identified and confirmed by RT-PCR 140 previously unidentified transcripts found in the intergenic regions of the current gene annotations. We show that the majority of these discrepancies are due to questionable predictions of one or two extra exons in the upstream or downstream regions of the genes. Conclusion Our data indicates that the current gene models are likely to still be incomplete and have much room for improvement. Our unique full-length cDNA information is especially useful for further refinement of the annotations for the genomes of apicomplexa parasites.
- Published
- 2009
- Full Text
- View/download PDF
8. Ago2 and a miRNA reduce Topoisomerase 1 for enhancing DNA cleavage in antibody diversification by activation-induced cytidine deaminase.
- Author
-
Kobayashi M, Wakaguri H, Shimizu M, Higasa K, Matsuda F, and Honjo T
- Subjects
- 3' Untranslated Regions, DNA Cleavage, Cytidine Deaminase genetics, Immunoglobulin Class Switching, Antibodies genetics, Somatic Hypermutation, Immunoglobulin, MicroRNAs genetics
- Abstract
Activation-induced cytidine deaminase (AID) is the essential enzyme for imprinting immunological memory through class switch recombination (CSR) and somatic hypermutation (SHM) of the immunoglobulin (Ig) gene. AID-dependent reduction of Topoisomerase 1 (Top1) promotes DNA cleavage that occurs upon Ig gene diversification, whereas the mechanism behind AID-induced Top1 reduction remains unclear. Here, we clarified the contribution of the microRNA-Ago2 complex in AID-dependent Top1 decrease. Ago2 binds to Top1 3'UTR with two regions of AID-dependent Ago2-binding sites (5'- and 3'dABs). Top1 3'UTR knockout (3'UTRKO) in B lymphoma cells leads to decreases in DNA break efficiency in the IgH gene accompanied by a reduction in CSR and SHM frequencies. Furthermore, AID-dependent Top1 protein reduction and Ago2-binding to Top1 mRNA are down-regulated in 3'UTRKO cells. Top1 mRNA in the highly translated fractions of the sucrose gradient is decreased in an AID-dependent and Top1 3'UTR-mediated manner, resulting in a decrease in Top1 protein synthesis. Both AID and Ago2 localize in the mRNA-binding protein fractions and they interact with each other. Furthermore, we found some candidate miRNAs which possibly bind to 5'- and 3'dAB in Top1 mRNA. Among them, miR-92a-3p knockdown induces the phenotypes of 3'UTRKO cells to wild-type cells whereas it does not impact on 3'UTRKO cells. Taken together, the Ago2-miR-92a-3p complex will be recruited to Top1 3'UTR in an AID-dependent manner and posttranscriptionally reduces Top1 protein synthesis. These consequences cause the increase in a non-B-DNA structure, enhance DNA cleavage by Top1 in the Ig gene and contribute to immunological memory formation.
- Published
- 2023
- Full Text
- View/download PDF
9. De Novo Genome Assembly of Japanese Black Cattle as Model of an Economically Relevant Animal.
- Author
-
Sasaki S, Haga Y, Wakaguri H, Abe K, and Suzuki Y
- Subjects
- Animals, Cattle genetics, Genome, Sequence Analysis, DNA methods, High-Throughput Nucleotide Sequencing methods, Computational Biology methods
- Abstract
A genetic analysis of Japanese Black cattle using short reads and guided by the reference genome from Western breeds would miss the structural variation and/or other unique characteristics of Japanese Black cattle. To overcome this difficulty, a de novo genome assembly independent from the reference genome is required. This chapter describes the technical developments, with respect to both experimental and bioinformatics procedures, including the use of short and long reads, required for de novo genome assembly of Japanese Black cattle., (© 2023. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.)
- Published
- 2023
- Full Text
- View/download PDF
10. Comprehensive analysis of 124 transcriptomes from 31 tissues in developing, juvenile, and adult Japanese Black cattle.
- Author
-
Arishima T, Wakaguri H, Nakashima R, Sakakihara S, Kawashima K, Sugimoto Y, Suzuki Y, and Sasaki S
- Subjects
- Animals, Cattle genetics, Phenotype, Protein Isoforms, Gene Expression Profiling, Transcriptome
- Abstract
Omic analyses of economically important animals, including Japanese Black cattle, are currently underway worldwide. In particular, tissue and developmental stage-specific transcriptome characterization is essential for understanding the molecular mechanisms underlying the phenotypic expression of genetic disorders and economic traits. Here, we conducted a comprehensive analysis of 124 transcriptomes across 31 major tissues from fetuses, juvenile calves, and adult Japanese Black cattle using short-read sequencing. We found that genes exhibiting high tissue-specific expression tended to increase after 60 days from fertilization and significantly reflected tissue-relevant biology. Based on gene expression variation and inflection points during development, we categorized gene expression patterns as stable, increased, decreased, temporary, or complex in each tissue. We also analysed the expression profiles of causative genes (e.g. SLC12A1, ANXA10, and MYH6) for genetic disorders in cattle, revealing disease-relevant expression patterns. In addition, to directly analyse the structure of full-length transcripts without transcript reconstruction, we performed RNA sequencing analysis of 22 tissues using long-read sequencing and identified 232 novel non-RefSeq isoforms. Collectively, our comprehensive transcriptomic analysis can serve as an important resource for the biological and functional interpretation of gene expression and enable the mechanistic interpretation of genetic disorders and economic traits in Japanese Black cattle., (© The Author(s) 2022. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.)
- Published
- 2022
- Full Text
- View/download PDF
11. Identification of deleterious recessive haplotypes and candidate deleterious recessive mutations in Japanese Black cattle.
- Author
-
Sasaki S, Watanabe T, Ibi T, Hasegawa K, Sakamoto Y, Moriwaki S, Kurogi K, Ogino A, Yasumori T, Wakaguri H, Muraki E, Miki Y, Yoshida Y, Inoue Y, Tabuchi I, Iwao K, Arishima T, Kawashima K, Watanabe M, Sugano S, Sugimoto Y, and Suzuki Y
- Subjects
- Animals, Biomarkers, Breeding, Cattle, Cell Cycle Proteins genetics, Cell Cycle Proteins metabolism, DNA Copy Number Variations, Embryonic Development, Homozygote, Polymorphism, Single Nucleotide, RNA Splicing, Exome Sequencing, Alleles, Genes, Recessive, Haplotypes, Mutation
- Abstract
Intensive use of a few elite sires has increased the risk of the manifestation of deleterious recessive traits in cattle. Substantial genotyping data gathered using single-nucleotide polymorphism (SNP) arrays have identified the haplotypes with homozygous deficiency, which may compromise survival. We developed Japanese Black cattle haplotypes (JBHs) using SNP array data (4843 individuals) and identified deleterious recessive haplotypes using exome sequencing of 517 sires. We identified seven JBHs with homozygous deficiency. JBH_10 and JBH_17 were associated with the resuming of estrus after artificial insemination, indicating that these haplotypes carried deleterious mutations affecting embryonic survival. The exome data of 517 Japanese Black sires revealed that AC_000165.1:g.85341291C>G of IARS in JBH_8_2, AC_000174.1:g.74743512G>T of CDC45 in JBH_17, and a copy variation region (CNVR_27) of CLDN16 in JBH_1_1 and JBH_1_2 were the candidate mutations. A novel variant AC_000174.1:g.74743512G>T of CDC45 in JBH_17 was located in a splicing donor site at a distance of 5 bp, affecting pre-mRNA splicing. Mating between heterozygotes of JBH_17 indicated that homozygotes carrying the risk allele died around the blastocyst stage. Analysis of frequency of the CDC45 risk allele revealed that its carriers were widespread throughout the tested Japanese Black cattle population. Our approach can effectively manage the inheritance of recessive risk alleles in a breeding population.
- Published
- 2021
- Full Text
- View/download PDF
12. A 44-kb deleted-type copy number variation is associated with decreasing complement component activity and calf mortality in Japanese Black cattle.
- Author
-
Sasaki S, Miki Y, Ibi T, Wakaguri H, Yoshida Y, Sugimoto Y, and Suzuki Y
- Subjects
- Alleles, Animals, Cattle, Female, Homozygote, Japan, Weaning, Cattle Diseases genetics, DNA Copy Number Variations
- Abstract
Background: Calf mortality generally occurs in calves prior to weaning, which is a serious problem in cattle breeding. Several causative variants of monogenic Mendelian disorders in calf mortality have been identified, whereas genetic factors affecting the susceptibility of calves to death are not well known. To identify variants associated with calf mortality in Japanese Black cattle, we evaluated calf mortality as a categorical trait with a threshold model and performed a genome-wide copy number variation (CNV) association study on calf mortality., Results: We identified a 44-kb deleted-type CNV ranging from 103,317,687 to 103,361,802 bp on chromosome 5, which was associated with the mortality of 1-180-day-old calves. The CNV harbored C1RL, a pseudogene, and an IncRNA localized in the C1R and C1S gene cluster, which is a component of the classical complement activation pathway for immune complexes for infectious pathogens. The average complement activity in CNVR_221 homozygotes at postnatal day 7 was significantly lower than that of wild-type animals and heterozygotes. The frequency of the risk allele in dead calves suffering from diarrhea and pneumonia and in healthy cows was 0.35 and 0.28, respectively (odds ratio = 2.2, P = 0.016), suggesting that CNVR_221 was associated with the mortality of Japanese Black calves suffering from an infectious disease., Conclusions: This study identified a deleted-type CNV associated with the mortality of 1-180-day-old calves. The complement activity in CNVR_221 homozygotes was significantly lower than that in heterozygotes and wild type animals. The frequency of the risk allele was higher in dead calves suffering from an infectious disease than in healthy cows. These results suggest that the existence of CNVR_221 in calves could be attributed to a reduction in complement activity, which in turn leads to susceptibility to infections. Thus, the risk allele could serve as a useful marker to reduce the mortality of infected Japanese Black calves.
- Published
- 2021
- Full Text
- View/download PDF
13. New type of encephalomyelitis responsive to trimethoprim/sulfamethoxazole treatment in Japan.
- Author
-
Sakiyama Y, Kanda N, Higuchi Y, Yoshimura M, Wakaguri H, Takata Y, Watanabe O, Yuan J, Tashiro Y, Saigo R, Nozuma S, Yoshimura A, Arishima S, Ikeda K, Shinohara K, Arata H, Michizono K, Higashi K, Hashiguchi A, Okamoto Y, Hirano R, Shiraishi T, Matsuura E, Okubo R, Higuchi I, Goto M, Hirano H, Sano A, Iwasaki T, Matsuda F, Izumo S, and Takashima H
- Abstract
Objective: To determine the causative pathogen and investigate the effective treatment of a new type of encephalomyelitis with an unknown pathogen in Japan and report the preliminary ultrastructural and genomic characterization of the causative agent., Methods: From 2005 to 2012, we treated 4 Japanese patients with geographic clustering and comparable clinical features, serum/CSF cytology, and radiologic findings. Brain biopsy was conducted in all patients to analyze neuropathologic changes by histology, and electron microscopy was applied to reveal the features of the putative pathogen. Genomic DNA was obtained from the affected brain tissues and CSF, and an unbiased high-throughput sequencing approach was used to screen for specific genomic sequences indicative of the pathogen origin., Results: All patients exhibited progressive dementia with involuntary tongue movements. Cytologic examination of CSF revealed elevated mononuclear cells. Abnormal MRI signals were observed in temporal lobes, subcortical white matter, and spinal cord. Biopsied brain tissue exhibited aggregated periodic acid-Schiff-positive macrophages and 2-7 μm diameter round/oval bodies without nuclei or cell walls scattered around the vessels. Unbiased high-throughput sequencing identified more than 100 archaea-specific DNA fragments. All patients were responsive to trimethoprim/sulfamethoxazole (TMP-SMX) plus corticosteroid therapy., Conclusions: We report 4 cases of encephalomyelitis due to an unknown pathogen. On the basis of ultrastructural and genomic studies, we propose a new disease entity resulting from a causative pathogen having archaeal features. TMP-SMX therapy was effective against this new type of encephalomyelitis.
- Published
- 2015
- Full Text
- View/download PDF
14. Non-high-density cholesterol level as a predictor of maximum carotid intima-media thickness in Japanese subjects with type 2 diabetes: a comparison with low-density lipoprotein level.
- Author
-
Bando Y, Wakaguri H, Aoki K, Kanehara H, Hisada A, Okafuji K, Toya D, and Tanaka N
- Abstract
Aim: To determine whether non-high-density lipoprotein cholesterol (non-HDL-C) level, in comparison with low-density lipoprotein cholesterol (LDL-C) level, is useful for predicting the values of various surrogate atherosclerosis markers in Japanese subjects with type 2 diabetes (T2DM)., Methods: Data were retrieved from medical records of 265 subjects with T2DM who underwent laboratory tests to evaluate for atherosclerosis by using the following parameters: brachial-ankle pulse wave velocity, mean and maximum carotid intima-media thickness (mean CIMT and max-CIMT), and ankle-brachial index, with simultaneous fasting blood sampling for routine lipid parameters., Results: In a multiple stepwise regression analysis, non-HDL-C level, but not LDL-C level, positively correlated with max-CIMT ( β coefficient = 0.14, F = 6.84). Stepwise logistic regression analysis revealed that a 0.26 mmol/L (10 mg/dL) increase in non-HDL-C level, but not LDL-C level, was significantly associated with high risk of max-CIMT (≥1.1 mm; odds ratio, 1.096; 95 % confidence interval, 1.003-1.202; p = 0.046). However, in a receiver operating characteristic curve (ROC) analysis, the addition of non-HDL-C level to the three significant independent variables obtained from the stepwise analyses did not significantly increased the area under the ROC curve (from 0.7789 to 0.7864, p = 0.4343)., Conclusions: Non-HDL-C levels may be non-inferior to LDL-C level for the prediction of high-risk max-CIMT in Japanese subjects with T2DM.
- Published
- 2015
- Full Text
- View/download PDF
15. DB-AT: a 2015 update to the Full-parasites database brings a multitude of new transcriptomic data for apicomplexan parasites.
- Author
-
Jąkalski M, Wakaguri H, Kischka TG, Nishikawa Y, Kawazu S, Matsubayashi M, Kawahara F, Tsuji N, Cao S, Sunaga F, Xuan X, Okubo K, Igarashi I, Tuda J, Mongan AE, Eshita Y, Maeda R, Makałowski W, Suzuki Y, and Yamagishi J
- Subjects
- Humans, Internet, Malaria, Falciparum parasitology, Plasmodium falciparum genetics, Sequence Analysis, RNA, Apicomplexa genetics, Databases, Genetic, Gene Expression Profiling
- Abstract
The previous release of our Full-parasites database (http://fullmal.hgc.jp/) brought enhanced functionality, an expanded full-length cDNA content, and new RNA-Seq datasets from several important apicomplexan parasites. The 2015 update witnesses the major shift in the databases content with focus on diverse transcriptomes of the apicomplexan parasites. The content of the database was substantially enriched with transcriptome information for new apicomplexan parasites. The latest version covers a total of 17 species, with addition of our newly generated RNA-Seq data of a total of 909,150,388 tags. Moreover, we have generated and included two novel and unique datasets, which represent diverse nature of transcriptomes in individual parasites in vivo and in vitro. One is the data collected from 116 Indonesian patients infected with Plasmodium falciparum. The other is a series of transcriptome data collected from a total of 38 single cells of P. falciparum cultured in vitro. We believe that with the recent advances our database becomes an even better resource and a unique platform in the analysis of apicomplexan parasites and their interaction with their hosts. To adequately reflect the recent modifications and the current content we have changed the database name to DB-AT--DataBase of Apicomplexa Transcriptomes., (© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2015
- Full Text
- View/download PDF
16. DBTSS as an integrative platform for transcriptome, epigenome and genome sequence variation data.
- Author
-
Suzuki A, Wakaguri H, Yamashita R, Kawano S, Tsuchihara K, Sugano S, Suzuki Y, and Nakai K
- Subjects
- Genomics, Humans, Internet, Mutation Rate, Neoplasms genetics, Databases, Nucleic Acid, Epigenesis, Genetic, Gene Expression Profiling, Genetic Variation, Transcription Initiation Site
- Abstract
DBTSS (http://dbtss.hgc.jp/) was originally constructed as a collection of uniquely determined transcriptional start sites (TSSs) in humans and some other species in 2002. Since then, it has been regularly updated and in recent updates epigenetic information has also been incorporated because such information is useful for characterizing the biological relevance of these TSSs/downstream genes. In the newest release, Release 9, we further integrated public and original single nucleotide variation (SNV) data into our database. For our original data, we generated SNV data from genomic analyses of various cancer types, including 97 lung adenocarcinomas and 57 lung small cell carcinomas from Japanese patients as well as 26 cell lines of lung cancer origin. In addition, we obtained publically available SNV data from other cancer types and germline variations in total of 11,322 individuals. With these updates, users can examine the association between sequence variation pattern in clinical lung cancers with its corresponding TSS-seq, RNA-seq, ChIP-seq and BS-seq data. Consequently, DBTSS is no longer a mere storage site for TSS information but has evolved into an integrative platform of a variety of genome activity data., (© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2015
- Full Text
- View/download PDF
17. Aberrant transcriptional regulations in cancers: genome, transcriptome and epigenome analysis of lung adenocarcinoma cell lines.
- Author
-
Suzuki A, Makinoshima H, Wakaguri H, Esumi H, Sugano S, Kohno T, Tsuchihara K, and Suzuki Y
- Subjects
- Adenocarcinoma of Lung, Cell Line, Tumor, Chromatin Immunoprecipitation, DNA Methylation, Gene Expression Profiling, Genome, Human, Genomics, Histones metabolism, Humans, Mutation, RNA Polymerase II metabolism, Sequence Analysis, DNA, Sequence Analysis, RNA, Adenocarcinoma genetics, Epigenesis, Genetic, Gene Expression Regulation, Neoplastic, Lung Neoplasms genetics, Transcriptome
- Abstract
Here we conducted an integrative multi-omics analysis to understand how cancers harbor various types of aberrations at the genomic, epigenomic and transcriptional levels. In order to elucidate biological relevance of the aberrations and their mutual relations, we performed whole-genome sequencing, RNA-Seq, bisulfite sequencing and ChIP-Seq of 26 lung adenocarcinoma cell lines. The collected multi-omics data allowed us to associate an average of 536 coding mutations and 13,573 mutations in promoter or enhancer regions with aberrant transcriptional regulations. We detected the 385 splice site mutations and 552 chromosomal rearrangements, representative cases of which were validated to cause aberrant transcripts. Averages of 61, 217, 3687 and 3112 mutations are located in the regulatory regions which showed differential DNA methylation, H3K4me3, H3K4me1 and H3K27ac marks, respectively. We detected distinct patterns of aberrations in transcriptional regulations depending on genes. We found that the irregular histone marks were characteristic to EGFR and CDKN1A, while a large genomic deletion and hyper-DNA methylation were most frequent for CDKN2A. We also used the multi-omics data to classify the cell lines regarding their hallmarks of carcinogenesis. Our datasets should provide a valuable foundation for biological interpretations of interlaced genomic and epigenomic aberrations., (© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2014
- Full Text
- View/download PDF
18. The Babesia bovis gene and promoter model: an update from full-length EST analysis.
- Author
-
Yamagishi J, Wakaguri H, Yokoyama N, Yamashita R, Suzuki Y, Xuan X, and Igarashi I
- Subjects
- Base Sequence, Consensus Sequence, Contig Mapping, DNA, Complementary genetics, DNA, Protozoan genetics, Expressed Sequence Tags, Genes, Protozoan, Models, Genetic, Molecular Sequence Annotation, Molecular Sequence Data, Nucleosomes genetics, Sequence Analysis, DNA, Transcription Initiation Site, Babesia bovis genetics, Promoter Regions, Genetic
- Abstract
Background: Babesia bovis is an apicomplexan parasite that causes babesiosis in infected cattle. Genomes of pathogens contain promising information that can facilitate the development of methods for controlling infections. Although the genome of B. bovis is publically available, annotated gene models are not highly reliable prior to experimental validation. Therefore, we validated a preproposed gene model of B. bovis and extended the associated annotations on the basis of experimentally obtained full-length expressed sequence tags (ESTs)., Results: From in vitro cultured merozoites, 12,286 clones harboring full-length cDNAs were sequenced from both ends using the Sanger method, and 6,787 full-length cDNAs were assembled. These were then clustered, and a nonredundant referential data set of 2,115 full-length cDNA sequences was constructed. The comparison of the preproposed gene model with our data set identified 310 identical genes, 342 almost identical genes, 1,054 genes with potential structural inconsistencies, and 409 novel genes. The median length of 5' untranslated regions (UTRs) was 152 nt. Subsequently, we identified 4,086 transcription start sites (TSSs) and 2,023 transcriptionally active regions (TARs) by examining 5' ESTs. We identified ATGGGG and CCCCAT sites as consensus motifs in TARs that were distributed around -50 bp from TSSs. In addition, we found ACACA, TGTGT, and TATAT sites, which were distributed periodically around TSSs in cycles of approximately 150 bp. Moreover, related periodical distributions were not observed in mammalian promoter regions., Conclusions: The observations in this study indicate the utility of integrated bioinformatics and experimental data for improving genome annotations. In particular, full-length cDNAs with one-base resolution for TSSs enabled the identification of consensus motifs in promoter sequences and demonstrated clear distributions of identified motifs. These observations allowed the illustration of a model promoter composition, which supports the differences in transcriptional regulation frameworks between apicomplexan parasites and mammals.
- Published
- 2014
- Full Text
- View/download PDF
19. Construction of mate pair full-length cDNAs libraries and characterization of transcriptional start sites and termination sites.
- Author
-
Matsumoto K, Suzuki A, Wakaguri H, Sugano S, and Suzuki Y
- Subjects
- Cell Line, Chromatin chemistry, DNA, Complementary, HEK293 Cells, HeLa Cells, High-Throughput Nucleotide Sequencing, Humans, MCF-7 Cells, Poly A, RNA, Messenger chemistry, 3' Untranslated Regions, Gene Library, Transcription Initiation Site
- Abstract
To identify and characterize transcript structures ranging from transcriptional start sites (TSSs) to poly(A)-addition sites (PASs), we constructed and analyzed human TSS/PAS mate pair full-length cDNA libraries from 14 tissue types and four cell lines. The collected information enabled us to define TSS cluster (TSC) and PAS cluster (PAC) relationships for a total of 8530/9400 RefSeq genes, as well as 4251/5618 of their putative alternative promoters/terminators and 4619/4605 intervening transcripts, respectively. Analyses of the putative alternative TSCs and alternative PACs revealed that their selection appeared to be mostly independent, with rare exceptions. In those exceptional cases, pairs of transcript units rarely overlapped one another and were occasionally separated by Rad21/CTCF. We also identified a total of 172 similar cases in which TSCs and PACs spanned adjacent but distinct genes. In these cases, different transcripts may utilize different functional units of a particular gene or of adjacent genes. This approach was also useful for identifying fusion gene transcripts in cancerous cells. Furthermore, we could construct cDNA libraries in which 3'-end mate pairs were distributed randomly over the transcripts. These libraries were useful for assembling the internal structure of previously uncharacterized alternative promoter products, as well as intervening transcripts., (© The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.)
- Published
- 2014
- Full Text
- View/download PDF
20. Comparative genome analysis of three eukaryotic parasites with differing abilities to transform leukocytes reveals key mediators of Theileria-induced leukocyte transformation.
- Author
-
Hayashida K, Hara Y, Abe T, Yamasaki C, Toyoda A, Kosuge T, Suzuki Y, Sato Y, Kawashima S, Katayama T, Wakaguri H, Inoue N, Homma K, Tada-Umezaki M, Yagi Y, Fujii Y, Habara T, Kanehisa M, Watanabe H, Ito K, Gojobori T, Sugawara H, Imanishi T, Weir W, Gardner M, Pain A, Shiels B, Hattori M, Nene V, and Sugimoto C
- Subjects
- Animals, Cattle, Cell Proliferation, Leukocytes parasitology, Synteny, Genome, Protozoan, Theileria genetics, Theileria pathogenicity, Virulence Factors genetics
- Abstract
We sequenced the genome of Theileria orientalis, a tick-borne apicomplexan protozoan parasite of cattle. The focus of this study was a comparative genome analysis of T. orientalis relative to other highly pathogenic Theileria species, T. parva and T. annulata. T. parva and T. annulata induce transformation of infected cells of lymphocyte or macrophage/monocyte lineages; in contrast, T. orientalis does not induce uncontrolled proliferation of infected leukocytes and multiplies predominantly within infected erythrocytes. While synteny across homologous chromosomes of the three Theileria species was found to be well conserved overall, subtelomeric structures were found to differ substantially, as T. orientalis lacks the large tandemly arrayed subtelomere-encoded variable secreted protein-encoding gene family. Moreover, expansion of particular gene families by gene duplication was found in the genomes of the two transforming Theileria species, most notably, the TashAT/TpHN and Tar/Tpr gene families. Gene families that are present only in T. parva and T. annulata and not in T. orientalis, Babesia bovis, or Plasmodium were also identified. Identification of differences between the genome sequences of Theileria species with different abilities to transform and immortalize bovine leukocytes will provide insight into proteins and mechanisms that have evolved to induce and regulate this process. The T. orientalis genome database is available at http://totdb.czc.hokudai.ac.jp/.
- Published
- 2012
- Full Text
- View/download PDF
21. Monitoring endoplasmic reticulum stress responsive mRNAs by RNA sequencing.
- Author
-
Okuda T, Wakaguri H, Suzuki Y, and Sugano S
- Subjects
- Cell Line, Tumor, Humans, Polyribosomes, Endoplasmic Reticulum Stress, High-Throughput Nucleotide Sequencing methods, Ribonucleoproteins analysis, Sequence Analysis, RNA methods
- Abstract
Gene expression profile upon endoplasmic reticulum (ER) stress was analyzed by deep shotgun sequencing of mRNAs (DSSR) using RNAs from polysomes or cytoplasm of the HT29 cell. Two time points, 4h after tunicamycin treatment when IRE1α signaling pathway is active and 16h after the treatment when it is inactive, were used. There was a transient decrease in the proportion of shorter mRNA species (<1000bp) in polysome, while it increased transiently in the cytoplasm. Despite such an overall change and decrease in total amount of polysomes, the majority of the 6966 genes analyzed had less than 2 fold change in their expressions. We searched for the genes whose expression was elevated by 2 folds or more in both polysome and cytoplasm and confirmed the results with RT-PCR. There were 7 genes elevated only at 4h (Group I), 20 genes only at 16h (Group II) and 7 genes both at 4 and 16h (Group III). There were 3 genes involved in ribosomal RNA biogenesis in Group I and 2 genes involved mTOR control in Group III. This was consistent with the concept that the ribosome is the essential site for managing ER stress. DSSR is a useful tool for the search of candidates of ER stress responsive genes., (Copyright © 2012 Elsevier B.V. All rights reserved.)
- Published
- 2012
- Full Text
- View/download PDF
22. A pilot study on developing mucosal vaccine against alveolar echinococcosis (AE) using recombinant tetraspanin 3: Vaccine efficacy and immunology.
- Author
-
Dang Z, Yagi K, Oku Y, Kouguchi H, Kajino K, Matsumoto J, Nakao R, Wakaguri H, Toyoda A, Yin H, and Sugimoto C
- Subjects
- Adjuvants, Immunologic administration & dosage, Animals, Antibodies, Helminth blood, Antigens, Helminth genetics, Echinococcosis, Echinococcus multilocularis isolation & purification, Enzyme-Linked Immunosorbent Assay, Freund's Adjuvant administration & dosage, Glycoproteins genetics, Immunoglobulin A analysis, Immunoglobulin G blood, Intestinal Mucosa immunology, Liver parasitology, Male, Mice, Mice, Inbred BALB C, Nasal Mucosa immunology, Oligodeoxyribonucleotides administration & dosage, Pilot Projects, Tetraspanins genetics, Vaccines, Synthetic administration & dosage, Vaccines, Synthetic genetics, Vaccines, Synthetic immunology, Antigens, Helminth immunology, Echinococcosis, Hepatic prevention & control, Glycoproteins immunology, Immunity, Mucosal, Tetraspanins immunology
- Abstract
Background: We have previously evaluated the vaccine efficacies of seven tetraspanins of Echinococcus multilocularis (Em-TSP1-7) against alveolar echinococcosis (AE) by subcutaneous (s.c.) administration with Freund's adjuvant. Over 85% of liver cyst lesion number reductions (CLNR) were achieved by recombinant Em-TSP1 (rEm-TSP1) and -TSP3 (rEm-TSP3). However, to develop an efficient and safe human vaccine, the efficacy of TSP mucosal vaccines must be thoroughly evaluated., Methodology/principal Findings: rEm-TSP1 and -TSP3 along with nontoxic CpG ODN (CpG oligodeoxynucleotides) adjuvant were intranasally (i.n.) immunized to BALB/c mice and their vaccine efficacies were evaluated by counting liver CLNR (experiment I). 37.1% (p < 0.05) and 62.1% (p < 0.001) of CLNR were achieved by these two proteins, respectively. To study the protection-associated immune responses induced by rEm-TSP3 via different immunization routes (i.n. administration with CpG or s.c. immunization with Freund's adjuvant), the systemic and mucosal antibody responses were detected by ELISA (experiment II). S.c. and i.n. administration of rEm-TSP3 achieved 81.9% (p < 0.001) and 62.8% (p < 0.01) CLNR in the liver, respectively. Both the immunization routes evoked strong serum IgG, IgG1 and IgG2α responses; i.n. immunization induced significantly higher IgA responses in nasal cavity and intestine compared with s.c. immunization (p < 0.001). Both immunization routes induced extremely strong liver IgA antibody responses (p < 0.001). The Th1 and Th2 cell responses were assessed by examining the IgG1/IgG2α ratio at two and three weeks post-immunization. S.c. immunization resulted in a reduction in the IgG1/IgG2α ratio (Th1 tendency), whereas i.n. immunization caused a shift from Th1 to Th2. Moreover, immunohistochemistry showed that Em-TSP1 and -TSP3 were extensively located on the surface of E. multilocularis cysts, protoscoleces and adult worms with additional expression of Em-TSP3 in the inner part of protoscoleces and oncospheres., Conclusions: Our study indicated that i.n. administration of rEm-TSP3 with CpG is able to induce both systemic and local immune responses and thus provides significant protection against AE.
- Published
- 2012
- Full Text
- View/download PDF
23. Construction and analysis of full-length cDNA library of Cryptosporidium parvum.
- Author
-
Yamagishi J, Wakaguri H, Sugano S, Kawano S, Fujisaki K, Sugimoto C, Watanabe J, Suzuki Y, Kimata I, and Xuan X
- Subjects
- 5' Untranslated Regions, Animals, Cloning, Molecular, Cryptosporidium parvum growth & development, DNA, Complementary metabolism, Female, Gene Expression Profiling, Genes, Protozoan, Humans, Mice, Mice, SCID, Molecular Sequence Annotation, Oocytes metabolism, RNA, Protozoan genetics, RNA, Protozoan isolation & purification, Sequence Analysis, DNA methods, Cryptosporidium parvum genetics, Gene Library, Genome, Protozoan
- Abstract
A full-length cDNA library was constructed from the sporozoite of Cryptosporidium parvum. Normalized clones were subjected to Solexa shotgun sequencing, and then complete sequences for 1066 clones were reconfigured. Detailed analyses of the sequences revealed that 13.5% of the transcripts were spliced; the average and median 5' UTR lengths were 213.5 and 122 nucleotides, respectively. There were 148 inconsistencies out of 562 examined genes between the experimentally described cDNA sequence and the predicted sequence from its genome. In addition, we identified 118 sequences that had little homology against annotated genes of C. parvum as prospective candidates for addable genes. These observations should improve the reliability of C. parvum transcriptome and provide a versatile resource for further studies., (Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.)
- Published
- 2011
- Full Text
- View/download PDF
24. Full-parasites: database of full-length cDNAs of apicomplexa parasites, 2010 update.
- Author
-
Tuda J, Mongan AE, Tolba ME, Imada M, Yamagishi J, Xuan X, Wakaguri H, Sugano S, Sugimoto C, and Suzuki Y
- Subjects
- Expressed Sequence Tags, Gene Expression Profiling, Host-Parasite Interactions, Humans, Plasmodium falciparum genetics, Sequence Analysis, RNA, Toxoplasma genetics, Transcription Initiation Site, Apicomplexa genetics, DNA, Complementary chemistry, Databases, Nucleic Acid, RNA, Protozoan chemistry
- Abstract
Full-Parasites (http://fullmal.hgc.jp/) is a transcriptome database of apicomplexa parasites, which include Plasmodium and Toxoplasma species. The latest version of Full-Parasites contains a total of 105,786 EST sequences from 12 parasites, of which 5925 full-length cDNAs have been completely sequenced. Full-Parasites also contain more than 30 million transcription start sites (TSS) for Plasmodium falciparum (Pf) and Toxoplasma gondii (Tg), which were identified using our novel oligo-capping-based protocol. Various types of cDNA data resources were interconnected with our original database functionalities. Specifically, in this update, we have included two unique RNA-Seq data sets consisting of 730 million mapped RNA-Seq tags. One is a dataset of 16 time-lapse experiments of cultured bradyzoite differentiation for Tg. The other dataset includes 31 clinical samples of Pf. Parasite RNA was extracted together with host human RNA, and the extracted mixed RNA was used for RNA sequencing, with the expectation that gene expression information from the host and parasite would be simultaneously represented. By providing the largest unique full-length cDNA and dynamic transcriptome data, Full-Parasites is useful for understanding host-parasite interactions and will help to eventually elucidate how monophyletic organisms have evolved to become parasites by adopting complex life cycles.
- Published
- 2011
- Full Text
- View/download PDF
25. DBTSS provides a tissue specific dynamic view of Transcription Start Sites.
- Author
-
Yamashita R, Wakaguri H, Sugano S, Suzuki Y, and Nakai K
- Subjects
- Algorithms, Animals, Computational Biology trends, Databases, Protein, Gene Expression Profiling, Genomics, Humans, Information Storage and Retrieval methods, Internet, Mice, NIH 3T3 Cells, User-Computer Interface, Computational Biology methods, Databases, Genetic, Databases, Nucleic Acid, Transcription Initiation Site
- Abstract
DataBase of Transcription Start Sites (DBTSS) is a database which contains precise positional information for transcription start sites (TSSs) of eukaryotic mRNAs. In this update, we included 330 million new tags generated by massively sequencing the 5'-end of oligo-cap selected cDNAs in humans and mice. The tags were collected from normal fetal or adult human tissues, including brain, thymus, liver, kidney and heart, from 6 human cell lines in 21 diverse growth conditions as well as from mouse NIH3T3 cell line: altogether 31 different cell types or culture conditions are represented. This unprecedented increase in depth of data now allows DBTSS to faithfully represent the dynamically changing landscape of TSSs in different cell types and conditions, during development and in the course of evolution. Differential usage of alternative 5'-ends across cell types and conditions can be viewed in a series of new interfaces. Promoter sequence information is now displayed in a comparative genomics viewer where evolutionary turnover of the TSSs can be evaluated. DBTSS can be accessed at http://dbtss.hgc.jp/.
- Published
- 2010
- Full Text
- View/download PDF
26. Evaluation of Echinococcus multilocularis tetraspanins as vaccine candidates against primary alveolar echinococcosis.
- Author
-
Dang Z, Yagi K, Oku Y, Kouguchi H, Kajino K, Watanabe J, Matsumoto J, Nakao R, Wakaguri H, Toyoda A, and Sugimoto C
- Subjects
- Amino Acid Sequence, Animals, Antibodies, Helminth blood, Cloning, Molecular, Echinococcosis, Pulmonary immunology, Echinococcus multilocularis immunology, Female, Gene Library, Male, Mice, Mice, Inbred BALB C, Molecular Sequence Data, Rabbits, Rats, Recombinant Proteins immunology, Sequence Alignment, Sequence Homology, Amino Acid, Antigens, Helminth immunology, Echinococcosis, Pulmonary prevention & control, Membrane Proteins immunology, Vaccines immunology
- Abstract
Echinococcus multilocularis causes an important zoonotic cestode disease. The metacestode stage proliferates in the liver of intermediate hosts including human and rodents and forms multiple cysts. Recently, members of a transmembrane protein tetraspanin (TSP) family have been used as vaccines against schistosomosis, or as diagnostic antigens for cysticercosis. In this study, seven tetraspanins of E. multilocularis, designated as TSP1 to TSP7, were evaluated for their protective potential against primary alveolar echinococcosis. The large extracellular loop (LEL) region of these tetraspanins was cloned from a full-length enriched cDNA library of E. multilocularis metacestodes and expressed in Escherichia coli as a fusion protein with thioredoxin. Recombinant TSPs were applied as vaccines against an E. multilocularis primary experimental infection in BALB/c mice. Cyst lesions in the livers of vaccinated and non-vaccinated mice were counted. The cyst lesion reduction rates induced by the seven tetraspanins in vaccinated vis-à-vis non-vaccinated mice were: 87.9%, 65.8%, 85.1%, 66.9%, 73.7%, 72.9% and 37.6%. Vaccination conferred protective rates to mice ranging from 0% (TSP5, 6, 7) to maximally 33% (TSP1, 3). The results indicated that recombinant tetraspanins have varying protective effects against primary alveolar echinococcosis and could be used in vaccine development.
- Published
- 2009
- Full Text
- View/download PDF
27. Inconsistencies of genome annotations in apicomplexan parasites revealed by 5'-end-one-pass and full-length sequences of oligo-capped cDNAs.
- Author
-
Wakaguri H, Suzuki Y, Sasaki M, Sugano S, and Watanabe J
- Subjects
- 5' Untranslated Regions, Animals, Chromosome Mapping, Cluster Analysis, DNA, Protozoan genetics, Gene Library, Models, Genetic, Sequence Analysis, DNA, Transcription Initiation Site, Apicomplexa genetics, DNA, Complementary genetics, Expressed Sequence Tags, Genome, Protozoan
- Abstract
Background: Apicomplexan parasites are causative agents of various diseases including malaria and have been targets of extensive genomic sequencing. We generated 5'-EST collections for six apicomplexa parasites using our full-length oligo-capping cDNA library method. To improve upon the current genome annotations, as well as to validate the importance for physical cDNA clone resources, we generated a large-scale collection of full-length cDNAs for several apicomplexa parasites., Results: In this study, we used a total of 61,056 5'-end-single-pass cDNA sequences from Plasmodium falciparum, P. vivax, P. yoelii, P. berghei, Cryptosporidium parvum, and Toxoplasma gondii. We compared these partially sequenced cDNA sequences with the currently annotated gene models and observed significant inconsistencies between the two datasets. In particular, we found that on average 14% of the exons in the current gene models were not supported by any cDNA evidence, and that 16% of the current gene models may contain at least one mis-annotation and should be re-evaluated. We also identified a large number of transcripts that had been previously unidentified. For 732 cDNAs in T. gondii, the entire sequences were determined in order to evaluate the annotated gene models at the complete full-length transcript level. We found that 41% of the T. gondii gene models contained at least one inconsistency. We also identified and confirmed by RT-PCR 140 previously unidentified transcripts found in the intergenic regions of the current gene annotations. We show that the majority of these discrepancies are due to questionable predictions of one or two extra exons in the upstream or downstream regions of the genes., Conclusion: Our data indicates that the current gene models are likely to still be incomplete and have much room for improvement. Our unique full-length cDNA information is especially useful for further refinement of the annotations for the genomes of apicomplexa parasites.
- Published
- 2009
- Full Text
- View/download PDF
28. Massive transcriptional start site analysis of human genes in hypoxia cells.
- Author
-
Tsuchihara K, Suzuki Y, Wakaguri H, Irie T, Tanimoto K, Hashimoto S, Matsushima K, Mizushima-Sugano J, Yamashita R, Nakai K, Bentley D, Esumi H, and Sugano S
- Subjects
- Basic Helix-Loop-Helix Transcription Factors metabolism, Cell Hypoxia, Cell Line, Cell Line, Tumor, Colonic Neoplasms genetics, Gene Library, Gene Regulatory Networks, Genome, Human, Humans, Hypoxia-Inducible Factor 1, alpha Subunit metabolism, Promoter Regions, Genetic, Sequence Analysis, DNA, Transcription, Genetic, Gene Expression Regulation, Neoplastic, Transcription Initiation Site
- Abstract
Combining our full-length cDNA method and the massively parallel sequencing technology, we developed a simple method to collect precise positional information of transcriptional start sites (TSSs) together with digital information of the gene-expression levels in a high throughput manner. We applied this method to observe gene-expression changes in a colon cancer cell line cultured in normoxic and hypoxic conditions. We generated more than 100 million 36-base TSS-tag sequences and revealed comprehensive features of hypoxia responsive alterations in the transcriptional landscape of the human genome. The features include presence of inducible 'hot regions' in 54 genomic regions, 220 novel hypoxia inducible promoters that may drive non-protein-coding transcripts, 191 hypoxia responsive alternative promoters and detailed views of 120 novel as well as known hypoxia responsive genes. We further analyzed hypoxic response of different cells using additional 60 million TSS-tags and found that the degree of the gene-expression changes were different among cell lines, possibly reflecting cellular robustness against hypoxia. The novel dynamic figure of the human gene transcriptome will deepen our understanding of the transcriptional program of the human genome as well as bringing new insights into the biology of cancer cells in hypoxia.
- Published
- 2009
- Full Text
- View/download PDF
29. Comprehensive detection of human terminal oligo-pyrimidine (TOP) genes and analysis of their characteristics.
- Author
-
Yamashita R, Suzuki Y, Takeuchi N, Wakaguri H, Ueda T, Sugano S, and Nakai K
- Subjects
- Animals, Gene Expression Profiling, Genome, Human, HL-60 Cells, Humans, Mice, RNA, Messenger chemistry, Ribosomal Proteins genetics, Transcription Initiation Site, Gene Expression Regulation, Protein Biosynthesis, RNA 5' Terminal Oligopyrimidine Sequence
- Abstract
Although the knowledge accumulated on the transcriptional regulations of eukaryotes is significant, the knowledge on their translational regulations remains limited. Thus, we performed a comprehensive detection of terminal oligo-pyrimidine (TOP), which is one of the well-characterized cis-regulatory motifs for translational controls located immediately downstream of the transcriptional start sites of mRNAs. Utilizing our precise 5'-end information of the full-length cDNAs, we could screen 1645 candidate TOP genes by position specific matrix search. Among them, not only 75 out of 78 ribosomal protein genes but also eight previously identified non-ribosomal-protein TOP genes were included. We further experimentally validated the translational activities of 83 TOP candidate genes. Clear translational regulations exerted on the stimulation of 12-O-tetradecanoyl-1-phorbol-13-acetate for at least 41 of them was observed, indicating that there should be a few hundreds of human genes which are subjected to regulation at translation levels via TOPs. Our result suggests that TOP genes code not only formerly characterized ribosomal proteins and translation-related proteins but also a wider variety of proteins, such as lysosome-related proteins and metabolism-related proteins, playing pivotal roles in gene expression controls in the majority of cellular mRNAs.
- Published
- 2008
- Full Text
- View/download PDF
30. Expressed sequence tags from cynomolgus monkey (Macaca fascicularis) liver: a systematic identification of drug-metabolizing enzymes.
- Author
-
Uno Y, Suzuki Y, Wakaguri H, Sakamoto Y, Sano H, Osada N, Hashimoto K, Sugano S, and Inoue I
- Subjects
- Amino Acid Sequence, Animals, Aryl Hydrocarbon Hydroxylases chemistry, DNA Primers, Liver enzymology, Macaca fascicularis, Molecular Sequence Data, Sequence Homology, Amino Acid, Steroid Hydroxylases chemistry, Aryl Hydrocarbon Hydroxylases genetics, Expressed Sequence Tags, Liver metabolism, Steroid Hydroxylases genetics
- Abstract
The liver, a major organ for drug metabolism, is physiologically similar between monkeys and humans. However, the paucity of identified genes has hampered a deep understanding of drug metabolism in monkeys. To provide such a genetic resource, 28655 expressed sequence tags (ESTs) were generated from a cynomolgus monkey liver full-length enriched cDNA library, which contained 23 unique ESTs homologous to human drug-metabolizing enzymes. Our comparative genomics approach identified nine lineage-specific candidate ESTs, including three drug-metabolizing enzymes, which could be important for understanding the physiological differences between monkeys and humans.
- Published
- 2008
- Full Text
- View/download PDF
31. DBTSS: database of transcription start sites, progress report 2008.
- Author
-
Wakaguri H, Yamashita R, Suzuki Y, Sugano S, and Nakai K
- Subjects
- Animals, Binding Sites, Evolution, Molecular, Gene Expression, Humans, Internet, Promoter Regions, Genetic, Sequence Analysis, DNA, Software, Species Specificity, Transcription Factors metabolism, Databases, Nucleic Acid, Transcription Initiation Site
- Abstract
DBTSS is a database of transcriptional start sites, based on our unique collection of precise, experimentally determined 5'-end sequences of full-length cDNAs. Since its first release in 2002, several major updates have been made. In this update, we expanded the human transcriptional start site dataset by 19 million uniquely mapped, and RefSeq-associated, 5'-end sequences, which were generated by a newly introduced Solexa sequencer. Moreover, in order to provide means for interpreting those massive TSS data, we implemented two new analytical tools: one for connecting expression information with predicted transcription factor binding sites; the other for examining evolutionary conservation or species-specificity of promoters and transcripts, which can be browsed by our own comparative genome viewer. With the expanded dataset and the enhanced functionalities, DBTSS provides a unique platform that enables in-depth transcriptome analyses. DBTSS is accessible at http://dbtss.hgc.jp/.
- Published
- 2008
- Full Text
- View/download PDF
32. Distinct class of putative "non-conserved" promoters in humans: comparative studies of alternative promoters of human and mouse genes.
- Author
-
Tsuritani K, Irie T, Yamashita R, Sakakibara Y, Wakaguri H, Kanai A, Mizushima-Sugano J, Sugano S, Nakai K, and Suzuki Y
- Subjects
- Alternative Splicing, Animals, Conserved Sequence, Gene Expression Regulation, Humans, Molecular Sequence Data, Species Specificity, Mice genetics, Promoter Regions, Genetic genetics
- Abstract
Although recent studies have revealed that the majority of human genes are subject to regulation of alternative promoters, the biological relevance of this phenomenon remains unclear. We have also demonstrated that roughly half of the human RefSeq genes examined contain putative alternative promoters (PAPs). Here we report large-scale comparative studies of PAPs between human and mouse counterpart genes. Detailed sequence comparison of the 17,245 putative promoter regions (PPRs) in 5463 PAP-containing human genes revealed that PPRs in only a minor fraction of genes (807 genes) showed clear evolutionary conservation as one or more pairs. Also, we found that there were substantial qualitative differences between conserved and non-conserved PPRs, with the latter class being AT-rich PPRs of relative minor usage, enriched in repetitive elements and sometimes producing transcripts that encode small or no proteins. Systematic luciferase assays of these PPRs revealed that both classes of PPRs did have promoter activity, but that their strength ranges were significantly different. Furthermore, we demonstrate that these characteristic features of the non-conserved PPRs are shared with the PPRs of previously discovered putative non-protein coding transcripts. Taken together, our data suggest that there are two distinct classes of promoters in humans, with the latter class of promoters emerging frequently during evolution.
- Published
- 2007
- Full Text
- View/download PDF
33. Intrinsic promoter activities of primary DNA sequences in the human genome.
- Author
-
Sakakibara Y, Irie T, Suzuki Y, Yamashita R, Wakaguri H, Kanai A, Chiba J, Takagi T, Mizushima-Sugano J, Hashimoto S, Nakai K, and Sugano S
- Subjects
- Base Composition, Cell Line, CpG Islands, DNA, Complementary genetics, Genes, Reporter, Humans, Luciferases genetics, RNA, Untranslated classification, RNA, Untranslated genetics, TATA Box, Transcription, Genetic, DNA genetics, Genome, Human, Promoter Regions, Genetic
- Abstract
In order to understand an overview of promoter activities intrinsic to primary DNA sequences in the human genome within a particular cell type, we carried out systematic quantitative luciferase assays of DNA fragments corresponding to putative promoters for 472 human genes which are expressed in HEK (human embryonic kidney epithelial) 293 cells. We observed the promoter activities of them were distributed in a bimodal manner; putative promoters belonging to the first group (with strong promoter activities) were designated as P1 and the latter (with weak promoter activities) as P2. The frequencies of the TATA-boxes, the CpG islands, and the overall G + C-contents were significantly different between these two populations, indicating there are two separate groups of promoters. Interestingly, similar analysis using 251 randomly isolated genomic DNA fragments showed that P2-type promoter occasionally occurs within the human genome. Furthermore, 35 DNA fragments corresponding to putative promoters of non-protein-coding transcripts (ncRNAs) shared similar features with the P2 in both promoter activities and sequence compositions. At least, a part of ncRNAs, which have been massively identified by full-length cDNA projects with no functional relevance inferred, may have originated from those sporadic promoter activities of primary DNA sequences inherent to the human genome.
- Published
- 2007
- Full Text
- View/download PDF
34. Common inheritance of chromosome Ia associated with clonal expansion of Toxoplasma gondii.
- Author
-
Khan A, Böhme U, Kelly KA, Adlem E, Brooks K, Simmonds M, Mungall K, Quail MA, Arrowsmith C, Chillingworth T, Churcher C, Harris D, Collins M, Fosker N, Fraser A, Hance Z, Jagels K, Moule S, Murphy L, O'Neil S, Rajandream MA, Saunders D, Seeger K, Whitehead S, Mayr T, Xuan X, Watanabe J, Suzuki Y, Wakaguri H, Sugano S, Sugimoto C, Paulsen I, Mackey AJ, Roos DS, Hall N, Berriman M, Barrell B, Sibley LD, and Ajioka JW
- Subjects
- Animals, Crosses, Genetic, Genetic Variation, Genetics, Population, Inheritance Patterns, Meiosis, Molecular Sequence Data, Recombination, Genetic, Toxoplasma classification, Chromosomes, Evolution, Molecular, Toxoplasma genetics
- Abstract
Toxoplasma gondii is a globally distributed protozoan parasite that can infect virtually all warm-blooded animals and humans. Despite the existence of a sexual phase in the life cycle, T. gondii has an unusual population structure dominated by three clonal lineages that predominate in North America and Europe, (Types I, II, and III). These lineages were founded by common ancestors approximately10,000 yr ago. The recent origin and widespread distribution of the clonal lineages is attributed to the circumvention of the sexual cycle by a new mode of transmission-asexual transmission between intermediate hosts. Asexual transmission appears to be multigenic and although the specific genes mediating this trait are unknown, it is predicted that all members of the clonal lineages should share the same alleles. Genetic mapping studies suggested that chromosome Ia was unusually monomorphic compared with the rest of the genome. To investigate this further, we sequenced chromosome Ia and chromosome Ib in the Type I strain, RH, and the Type II strain, ME49. Comparative genome analyses of the two chromosomal sequences revealed that the same copy of chromosome Ia was inherited in each lineage, whereas chromosome Ib maintained the same high frequency of between-strain polymorphism as the rest of the genome. Sampling of chromosome Ia sequence in seven additional representative strains from the three clonal lineages supports a monomorphic inheritance, which is unique within the genome. Taken together, our observations implicate a specific combination of alleles on chromosome Ia in the recent origin and widespread success of the clonal lineages of T. gondii.
- Published
- 2006
- Full Text
- View/download PDF
35. DBTSS: DataBase of Human Transcription Start Sites, progress report 2006.
- Author
-
Yamashita R, Suzuki Y, Wakaguri H, Tsuritani K, Nakai K, and Sugano S
- Subjects
- Animals, Humans, Internet, Mice, Plasmodium falciparum genetics, Rhodophyta genetics, Zebrafish genetics, Databases, Nucleic Acid statistics & numerical data, Promoter Regions, Genetic, Transcription Initiation Site
- Abstract
DBTSS was first constructed in 2002 based on precise, experimentally determined 5' end clones. Several major updates and additions have been made since the last report. First, the number of human clones has drastically increased, going from 190,964 to 1,359,000. Second, information about potential alternative promoters is presented because the number of 5' end clones is now sufficient to determine several promoters for one gene. Namely, we defined putative promoter groups by clustering transcription start sites (TSSs) separated by <500 bases. A total of 8308 human genes and 4276 mouse genes were found to have putative multiple promoters. Third, DBTSS provides detailed sequence comparisons of user-specified TSSs. Finally, we have added TSS information for zebrafish, malaria and schyzon (a red algae model organism). DBTSS is accessible at http://dbtss.hgc.jp.
- Published
- 2006
- Full Text
- View/download PDF
36. Diversification of transcriptional modulation: large-scale identification and characterization of putative alternative promoters of human genes.
- Author
-
Kimura K, Wakamatsu A, Suzuki Y, Ota T, Nishikawa T, Yamashita R, Yamamoto J, Sekine M, Tsuritani K, Wakaguri H, Ishii S, Sugiyama T, Saito K, Isono Y, Irie R, Kushida N, Yoneyama T, Otsuka R, Kanda K, Yokoi T, Kondo H, Wagatsuma M, Murakawa K, Ishida S, Ishibashi T, Takahashi-Fujii A, Tanase T, Nagai K, Kikuchi H, Nakai K, Isogai T, and Sugano S
- Subjects
- Base Sequence, Exons genetics, Humans, Molecular Sequence Data, Organ Specificity, Signal Transduction genetics, CpG Islands genetics, Gene Library, Multigene Family genetics, Promoter Regions, Genetic genetics, Quantitative Trait Loci genetics, Transcription, Genetic genetics
- Abstract
By analyzing 1,780,295 5'-end sequences of human full-length cDNAs derived from 164 kinds of oligo-cap cDNA libraries, we identified 269,774 independent positions of transcriptional start sites (TSSs) for 14,628 human RefSeq genes. These TSSs were clustered into 30,964 clusters that were separated from each other by more than 500 bp and thus are very likely to constitute mutually distinct alternative promoters. To our surprise, at least 7674 (52%) human RefSeq genes were subject to regulation by putative alternative promoters (PAPs). On average, there were 3.1 PAPs per gene, with the composition of one CpG-island-containing promoter per 2.6 CpG-less promoters. In 17% of the PAP-containing loci, tissue-specific use of the PAPs was observed. The richest tissue sources of the tissue-specific PAPs were testis and brain. It was also intriguing that the PAP-containing promoters were enriched in the genes encoding signal transduction-related proteins and were rarer in the genes encoding extracellular proteins, possibly reflecting the varied functional requirement for and the restricted expression of those categories of genes, respectively. The patterns of the first exons were highly diverse as well. On average, there were 7.7 different splicing types of first exons per locus partly produced by the PAPs, suggesting that a wide variety of transcripts can be achieved by this mechanism. Our findings suggest that use of alternate promoters and consequent alternative use of first exons should play a pivotal role in generating the complexity required for the highly elaborated molecular systems in humans.
- Published
- 2006
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.