259 results on '"Sequence similarity searching"'
Search Results
2. ProphET, prophage estimation tool: A stand-alone prophage sequence prediction tool with self-updating reference database.
- Author
-
Reis-Cunha, João L., Bartholomeu, Daniella C., Manson, Abigail L., Earl, Ashlee M., and Cerqueira, Gustavo C.
- Subjects
- *
BACTERIAL genomes , *GENE expression , *BACTERIAL genetics , *MICROBIAL genetics , *WEB-based user interfaces - Abstract
Background: Prophages play a significant role in prokaryotic evolution, often altering the function of the cell that they infect via transfer of new genes e.g., virulence or antibiotic resistance factors, inactivation of existing genes or by modifying gene expression. Recently, phage therapy has gathered renewed interest as a promising alternative to control bacterial infections. Cataloging the repertoire of prophages in large collections of species’ genomes is an important initial step in understanding their evolution and potential therapeutic utility. However, current widely-used tools for identifying prophages within bacterial genome sequences are mainly web-based, can have long response times, and do not scale to keep pace with the many thousands of genomes currently being sequenced routinely. Methodology: In this work, we present ProphET, an easy to install prophage predictor to be used in Linux operation system, without the constraints associated with a web-based tool. ProphET predictions rely on similarity searches against a database of prophage genes, taking as input a bacterial genome sequence in FASTA format and its corresponding gene annotation in GFF. ProphET identifies prophages in three steps: similarity search, calculation of the density of prophage genes, and edge refinement. ProphET performance was evaluated and compared with other phage predictors based on a set of 54 bacterial genomes containing 267 manually annotated prophages. Findings and conclusions: ProphET identifies prophages in bacterial genomes with high precision and offers a fast, highly scalable alternative to widely-used web-based applications for prophage detection. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
3. A combined strategy of neuropeptide prediction and tandem mass spectrometry identifies evolutionarily conserved ancient neuropeptides in the sea anemone Nematostella vectensis.
- Author
-
Hayakawa, Eisuke, Watanabe, Hiroshi, Menschaert, Gerben, Holstein, Thomas W., Baggerman, Geert, and Schoofs, Liliane
- Subjects
- *
NEUROPEPTIDES , *TANDEM mass spectrometry , *SEA anemones , *MASS spectrometry , *SIGNAL peptides , *AMINO acid sequence , *ANIMAL species - Abstract
Neuropeptides are a class of bioactive peptides shown to be involved in various physiological processes, including metabolism, development, and reproduction. Although neuropeptide candidates have been predicted from genomic and transcriptomic data, comprehensive characterization of neuropeptide repertoires remains a challenge owing to their small size and variable sequences. De novo prediction of neuropeptides from genome or transcriptome data is difficult and usually only efficient for those peptides that have identified orthologs in other animal species. Recent peptidomics technology has enabled systematic structural identification of neuropeptides by using the combination of liquid chromatography and tandem mass spectrometry. However, reliable identification of naturally occurring peptides using a conventional tandem mass spectrometry approach, scanning spectra against a protein database, remains difficult because a large search space must be scanned due to the absence of a cleavage enzyme specification. We developed a pipeline consisting of in silico prediction of candidate neuropeptides followed by peptide-spectrum matching. This approach enables highly sensitive and reliable neuropeptide identification, as the search space for peptide-spectrum matching is highly reduced. Nematostella vectensis is a basal eumetazoan with one of the most ancient nervous systems. We scanned the Nematostella protein database for sequences displaying structural hallmarks typical of eumetazoan neuropeptide precursors, including amino- and carboxyterminal motifs and associated modifications. Peptide-spectrum matching was performed against a dataset of peptides that are cleaved in silico from these putative peptide precursors. The dozens of newly identified neuropeptides display structural similarities to bilaterian neuropeptides including tachykinin, myoinhibitory peptide, and neuromedin-U/pyrokinin, suggesting these neuropeptides occurred in the eumetazoan ancestor of all animal species. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
4. Identification of flowering-time genes in mast flowering plants using De Novo transcriptomic analysis.
- Author
-
Samarth, null, Lee, Robyn, Song, Jiancheng, Macknight, Richard C., and Jameson, Paula E.
- Subjects
- *
FLOWERING of plants , *ARABIDOPSIS proteins , *MOUNTAIN plants , *FLOWERING time , *WILD plants , *PLANT conservation - Abstract
Mast flowering is synchronised highly variable flowering by a population of perennial plants over a wide geographical area. High seeding years are seen as a threat to native and endangered species due to high predator density caused by the abundance of seed. An understanding of the molecular pathways that influence masting behaviour in plants could provide better prediction of a forthcoming masting season and enable conservation strategies to be deployed. The goal of this study was to identify candidate flowering genes that might be involved in regulating mast flowering. To achieve this, high-throughput large-scale RNA-sequencing was performed on two masting plant species, Celmisia lyallii (Asteraceae), and Chionochloa pallens (Poaceae) to develop a reference transcriptome for functional and molecular analysis. An average total of 33 million 150 base-paired reads, for both species, were assembled using the Trinity pipeline, resulting in 151,803 and 348,649 transcripts respectively for C. lyallii and C. pallens. For both species, about 56% of the unigenes were annotated with gene descriptions to known proteins followed by Gene Ontology analysis, categorising them on the basis of putative biological processes, molecular function, and cellular localization. A total of 543 transcripts from C. lyallii and 470 transcripts from C. pallens were also mapped to unique flowering-time proteins identified in Arabidopsis thaliana, suggesting the conservation of the flowering network in these wild alpine plants growing in natural field conditions. Expression analysis of several selected homologous flowering-pathway genes showed seasonal and photoperiodic variations. These genes can further be analysed to understand why seasonal cues, such as the increasing photoperiod in spring, that triggers the annual flowering of most plants, are insufficient to always trigger flowering in masting plants and to uncover the molecular basis of how additional cues (such as temperature during the previous growing seasons) then determines flowering in mast years. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
5. Detection of an invasive aquatic plant in natural water bodies using environmental DNA.
- Author
-
Anglès d’Auriac, Marc B., Strand, David A., Mjelde, Marit, Demars, Benoit O. L., and Thaulow, Jens
- Subjects
- *
BODIES of water , *AQUATIC plants , *INVASIVE plants , *WATER use , *CHLOROPLAST DNA , *ENDANGERED species - Abstract
The ability to detect founding populations of invasive species or rare species with low number of individuals is important for aquatic ecosystem management. Traditional approaches use historical data, knowledge of the species’ ecology and time-consuming surveys. Within the past decade, environmental DNA (eDNA) has emerged as a powerful additional tracking tool. While much work has been done with animals, comparatively very little has been done with aquatic plants. Here we investigated the transportation and seasonal changes in eDNA concentrations for an invasive aquatic species, Elodea canadensis, in Norway. A specific probe assay was developed using chloroplast DNA to study the fate of the targeted eDNA through space and time. The spatial study used a known source of Elodea canadensis within Lake Nordbytjern 400 m away from the lake outlet flowing into the stream Tveia. The rate of disappearance of E. canadensis eDNA was an order of magnitude loss over about 230 m in the lake and 1550 m in the stream. The time series study was performed monthly from May to October in lake Steinsfjorden harbouring E. canadensis, showing that eDNA concentrations varied by up to three orders of magnitude, peaking during fall. In both studies, the presence of suspended clay or turbidity for some samples did not hamper eDNA analysis. This study shows how efficient eDNA tools may be for tracking aquatic plants in the environment and provides key spatial and temporal information on the fate of eDNA. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
6. Ligand-guided homology modeling drives identification of novel histamine H3 receptor ligands.
- Author
-
Schaller, David, Hagenow, Stefanie, Stark, Holger, and Wolber, Gerhard
- Subjects
- *
HISTAMINE receptors , *BINDING site assay , *G protein coupled receptors , *PHYSICAL & theoretical chemistry , *HOMOLOGY (Biology) , *LIGANDS (Biochemistry) - Abstract
In this study, we report a ligand-guided homology modeling approach allowing the analysis of relevant binding site residue conformations and the identification of two novel histamine H3 receptor ligands with binding affinity in the nanomolar range. The newly developed method is based on exploiting an essential charge interaction characteristic for aminergic G-protein coupled receptors for ranking 3D receptor models appropriate for the discovery of novel compounds through virtual screening. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
7. Microbial origin of bioflocculation components within a promising natural bioflocculant resource of Ruditapes philippinarum conglutination mud from an aquaculture farm in Zhoushan, China.
- Author
-
Mu, Jun, Cui, Xia, Shao, Mingjiao, Wang, Yuxia, Yang, Qiao, Yang, Guangfeng, and Zheng, Liying
- Subjects
- *
FLOCCULANTS , *MANILA clam , *NATURAL resources , *AQUACULTURE , *MUD , *PHYSICAL sciences , *EARTH sciences - Abstract
Ruditapes philippinarum conglutination mud (RPM) is a byproduct from the aquiculture of an important commercially bivalve mollusk R. philippinarum and has been recently reported as a promising natural bioflocculant resource. However the origin of bioflocculation components within RPM is still a pending doubt and impedes its effective exploitation. This study investigated the probability that RPM bioflocculation components originate from its associated microbes. RPM samples from an aquaculture farm in Zhoushan of China were applied to characterize its microbial community structure, screen associated bioflocculant-producing strains, and explore the homology between extracellular polysaccharides (EPS) from bioflocculant-producing isolates and RPM flocculation components. Results showed that RPM exhibited high bacterial biodiversity, with Proteobacteria, Bacteroidetes and Actinobacteria as the most abundant phyla; hgcI_clade, CL500_29_marine_group, Fusibacter, MWH_UniP1_aquatic_group and Arcobacter as the dominant genera. Fourteen highly efficient bioflocculant-producing strains were screened and phylogenetically identified as Pseudoalteromonas sp. (5), Psychrobacter sp. (3), Halomonas sp. (2), Albirhodobacter sp. (1), Celeribacter sp. (1), Kocuria sp. (1) and Bacillus sp. (1), all of which except Bacillus sp. were reported for the first time for their excellent flocculation capability. Furthermore, EPS from the bioflocculant-producing strains exhibited highly similar monosaccharide composition to the reported flocculation-effective RPM polysaccharides. On the other hand, the existence of fungi in RPM was rare and showed no flocculation functionality. Findings from Zhoushan RPM strongly supported that RPM flocculation components were of bacterial origin and make RPM reproduction possible by fermentation approach. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
8. Assessment of BOLD and GenBank – Their accuracy and reliability for the identification of biological materials.
- Author
-
Meiklejohn, Kelly A., Damaso, Natalie, and Robertson, James M.
- Subjects
- *
BIOMEDICAL materials , *IDENTIFICATION , *REFERENCE sources , *GENETIC barcoding , *BIOLOGICAL databases , *NUCLEOTIDE sequence - Abstract
Taxonomic identification of biological materials can be achieved through DNA barcoding, where an unknown “barcode” sequence is compared to a reference database. In many disciplines, obtaining accurate taxonomic identifications can be imperative (e.g., evolutionary biology, food regulatory compliance, forensics). The Barcode of Life DataSystems (BOLD) and GenBank are the main public repositories of DNA barcode sequences. In this study, an assessment of the accuracy and reliability of sequences in these databases was performed. To achieve this, 1) curated reference materials for plants, macro-fungi and insects were obtained from national collections, 2) relevant barcode sequences (rbcL, matK, trnH-psbA, ITS and COI) from these reference samples were generated and used for searching against both databases, and 3) optimal search parameters were determined that ensure the best match to the known species in either database. While GenBank outperformed BOLD for species-level identification of insect taxa (53% and 35%, respectively), both databases performed comparably for plants and macro-fungi (~81% and ~57%, respectively). Results illustrated that using a multi-locus barcode approach increased identification success. This study outlines the utility of the BLAST search tool in GenBank and the BOLD identification engine for taxonomic identifications and identifies some precautions needed when using public sequence repositories in applied scientific disciplines. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
9. Microfilariae infestation of goliath frogs (Conraua goliath) from Cameroon.
- Author
-
Nguete Nguiffo, Daniel, Wondji, Charles S., Pone Wabo, Josué, and Mpoame, Mbida
- Subjects
- *
BLOOD cell count , *FROGS , *LEUCOCYTES , *BLOOD parasites , *ERYTHROCYTES , *MITOCHONDRIAL DNA - Abstract
The goliath frog (Conraua goliath) is endemic to Equatorial Guinea and Cameroon. It is an endangered species but little information is known about its parasites. To understand the impact of blood parasites on this species, we microscopically examined blood smears from 78 goliath frogs in February and November 2016 (dry and wet seasons) from six localities in Littoral Region of Cameroon, and we sequenced mitochondrial DNA from positive samples. Microfilariae were found in 33/78 (42.3%) goliath frogs at six locations. No other haemoparasite species was detected. Morphological characteristics of microfilariae were also described, and specimens from each frog species were similar. DNA sequencing data from the mitochondrial Cytochrome Oxidases sub unit I (COI) gene revealed a close relationship with Icosiella neglecta, a microfilaria documented in other European, Asian, and African frogs. However, sequences were sufficiently genetically distant (0.118) that they may define a new species of Icosiella. The infection burden of microfilariae varied by site, with season (65% in dry season to 23% in rainy season), and by sex, (male frogs had significantly higher parasite burdens than females (p < 0.0001)). However, this may have been confounded by size as the microfilaria intensity increased with frog weight (p < 0.0001), and males were larger than females. Microfilaria infection intensity varied from 1 to 120 per 50 μl of blood. Microfilaria induced a significant increase (p < 0.05) in the number of white blood cells (WBC) counted compared to uninfected frogs, but there was no statistically significant variation in red blood cell (RBC) count, plasma cholesterol level (p = 0.210) or plasma glucose level (p = 0.100). [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
10. Characterization of a cold-active, detergent-stable metallopeptidase purified from Bacillus sp. S1DI 10 using Response Surface Methodology.
- Author
-
Singh, Drishtant, Thakur, Sharad, Thayil, Seema Madhumal, and Kesavan, Anup Kumar
- Subjects
- *
PEPTIDASE , *SODIUM dodecyl sulfate , *RECOMBINANT proteins , *ORGANIC solvents , *BACILLUS (Bacteria) , *LIPASES - Abstract
The colder regions of Earth are inhabited by cold-adapted microorganisms designated as psychrophiles that are known to produce cold-active enzymes, such as peptidases, chaperones, lipases, cellulases, and phosphatases. These types of enzymes are a major part of the market of industrial enzymes. Bacteria isolated from water samples collected from the Chamba region in the Himalayas were screened for peptidase production using skim milk agar plates. Among the peptidase-producing bacteria isolated, 20% of the isolates exhibited fast growth and maximum zones of clearance, and thus, were used for further studies. The 16S rDNA sequence analysis of isolate S1DI 10 identified it as a Bacillus sp. The peptidase was cloned in pET28a vector and expressed in Escherichia coli BL21(DE3) and the His-tagged recombinant protein was purified using Ni-NTA column. The purified peptidase of SIDI 10 was found to be an alkaline, cold-active peptidase with optimal enzyme activity at 10°C and pH 8. An approach of one variable at a time was used to further study the effect of various metal ions, organic solvents and detergents on the peptidase enzyme. The peptidase activity was enhanced in the presence of Fe2+ and Mn2+ (metal ions), hexane (organic solvent), SDS- sodium dodecyl sulfate (anionic detergent) and Tween 80 (nonionic detergent). Response surface methodology (RSM) was used to determine the cumulative effect of these five variables. A 25 full factorial central composite design was applied for the five independent variables to determine the optimal combinations of these constituents at the maximum peptidase activity. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
11. Does social context impact metacognition? Evidence from stereotype threat in a visual search task.
- Author
-
Gajdos, Thibault, Régner, Isabelle, Huguet, Pascal, Hainguerlot, Marine, Vergnaud, Jean-Christophe, Sackur, Jérôme, and de Gardelle, Vincent
- Subjects
- *
STEREOTYPES , *METACOGNITION , *SOCIAL impact , *SOCIAL context , *SENSORY perception , *INFORMATION science - Abstract
While recent studies have emphasized the role of metacognitive judgments in social interactions, whether social context might reciprocally impact individuals’ metacognition remains an open question. It has been proposed that such might be the case in situations involving stereotype threat. Here, we provide the first empirical test of this hypothesis. Using a visual search task, we asked participants, on a trial-by-trial basis, to monitor the unfolding and accuracy of their search processes, and we developed a computational model to measure the accuracy of their metacognition. Results indicated that stereotype threat enhanced metacognitive monitoring of both outcomes and processes. Our study thus shows that social context can actually affect metacognition. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
12. Prediction of Sphingosine protein-coding regions with a self adaptive spectral rotation method.
- Author
-
Li, Zhongwei, Guan, Yanan, Yuan, Xiang, Zheng, Pan, and Zhu, Hu
- Subjects
- *
CONCRETE construction industry , *SPHINGOSINE , *AMINO acid sequence , *PHYSICAL sciences , *ROTATIONAL motion - Abstract
Identifying protein coding regions in DNA sequences by computational methods is an active research topic. Welan gum produced by Sphingomonas sp. WG has great application potential in oil recovery and concrete construction industry. Predicting the coding regions in the Sphingomonas sp. WG genome and addressing the mechanism underlying the explanation for the synthesis of Welan gum metabolism is an important issue at present. In this study, we apply a self adaptive spectral rotation (SASR, for short) method, which is based on the investigation of the Triplet Periodicity property, to predict the coding regions of the whole-genome data of Sphingomonas sp. WG without any previous training process, and 1115 suspected gene fragments are obtained. Suspected gene fragments are subjected to a similarity search against the non-redundant protein sequences (nr) database of NCBI with blastx, and 762 suspected gene fragments have been labeled as genes in the nr database. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
13. A combined computational strategy of sequence and structural analysis predicts the existence of a functional eicosanoid pathway in Drosophila melanogaster.
- Author
-
Scarpati, Michael, Qi, Yan, Govind, Shubha, and Singh, Shaneen
- Subjects
- *
STRUCTURAL analysis (Engineering) , *EICOSANOIDS , *DROSOPHILA melanogaster , *LIPOXYGENASES , *BIOINFORMATICS - Abstract
This study reports on a putative eicosanoid biosynthesis pathway in Drosophila melanogaster and challenges the currently held view that mechanistic routes to synthesize eicosanoid or eicosanoid-like biolipids do not exist in insects, since to date, putative fly homologs of most mammalian enzymes have not been identified. Here we use systematic and comprehensive bioinformatics approaches to identify most of the mammalian eicosanoid synthesis enzymes. Sensitive sequence analysis techniques identified candidate Drosophila enzymes that share low global sequence identities with their human counterparts. Twenty Drosophila candidates were selected based upon (a) sequence identity with human enzymes of the cyclooxygenase and lipoxygenase branches, (b) similar domain architecture and structural conservation of the catalytic domain, and (c) presence of potentially equivalent functional residues. Evaluation of full-length structural models for these 20 top-scoring Drosophila candidates revealed a surprising degree of conservation in their overall folds and potential analogs for functional residues in all 20 enzymes. Although we were unable to identify any suitable candidate for lipoxygenase enzymes, we report structural homology models of three fly cyclooxygenases. Our findings predict that the D. melanogaster genome likely codes for one or more pathways for eicosanoid or eicosanoid-like biolipid synthesis. Our study suggests that classical and/or novel eicosanoids mediators must regulate biological functions in insects–predictions that can be tested with the power of Drosophila genetics. Such experimental analysis of eicosanoid biology in a simple model organism will have high relevance to human development and health. [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
14. Molecular tools for identification of shark species involved in depredation incidents in Western Australian fisheries.
- Author
-
Fotedar, Seema, Lukehurst, Sherralee, Jackson, Gary, and Snow, Michael
- Subjects
- *
SHARKS , *FISHERIES , *GENETIC barcoding , *CHONDRICHTHYES , *NUCLEIC acid isolation methods - Abstract
Shark depredation is an issue of concern in some Western Australian recreational and commercial fisheries where it can have economic, social and ecological consequences. Knowledge of the shark species involved is fundamental to developing effective management strategies to mitigate the impacts of depredation. Identification of the species responsible is difficult as direct observation of depredation events is uncommon and evaluating bite marks on fish has a high degree of uncertainty. The use of trace DNA techniques has provided an alternative method for species identification. We demonstrate proof of concept for a targeted DNA barcoding approach to identify shark species using trace DNA found at bite marks on recovered remains of hooked fish. Following laboratory validation, forensic analysis of swabs collected from samples of bitten demersal fish, led to the definitive identification of shark species involved in 100% of the incidences of depredation (n = 16). [ABSTRACT FROM AUTHOR]
- Published
- 2019
- Full Text
- View/download PDF
15. Coinvasion by the ladybird Harmonia axyridis (Coleoptera: Coccinellidae) and its parasites, Hesperomyces virescens (Ascomycota: Laboulbeniales) and Parasitylenchus bifurcatus (Nematoda: Tylenchida, Allantonematidae), in the Caucasus.
- Author
-
Orlova-Bienkowskaja, Marina J., Spiridonov, Sergei E., Butorina, Natalia N., and Bieńkowski, Andrzej O.
- Subjects
- *
HARMONIA axyridis , *LADYBUGS , *TYLENCHIDA , *RECOMBINANT DNA , *PHYLOGENETIC models - Abstract
The study of parasites in recently established populations of invasive species can shed light on the sources of invasion and possible indirect interactions between the alien species and native ones. We studied parasites of the global invader Harmonia axyridis (Coleoptera: Coccinellidae) in the Caucasus. In 2012, the first established population of Ha. axyridis was recorded in the Caucasus in Sochi (south of European Russia, Black Sea coast). By 2018, the ladybird had spread to a vast area: Armenia, Georgia and south Russia (Adygea, the Krasnodar territory, the Stavropol territory, Dagestan, Kabardino-Balkaria and North Ossetia). The examination of 213 adults collected in Sochi in 2018 showed that 53% were infested with Hesperomyces virescens fungi (Ascomycota: Laboulbeniales) and that 8% were infested with Parasitylenchus bifurcatus nematodes (Nematoda: Tylenchida, Allantonematidae). The examined Ha. axyridis specimens were free of the parasitic mite Coccipolipus hippodamiae. An analysis of the phylogenetic relationships of P. bifurcatus based on 18S rDNA confirmed the morphological identification of this species. Hesperomyces virescens and P. bifurcatus were first recorded in the Caucasus and Russia, although they are rather widespread in Europe. This likely indicates that they appeared as a result of coinvasion with their host because the populations of Ha. axyridis, He. virescens and P. bifurcatus in the Caucasus are isolated from the main parts of the ranges of these species in Europe. The nearest localities of Ha. axyridis is on another shore of the Black Sea, and the nearest localities of He. virescens and P. bifurcatus are more than 1000 km from the Caucasus. It is impossible to determine whether the first founders of the Caucasian population were infested with the parasites or whether the parasites were introduced by specimens of Ha. axyridis that arrived later from Europe. Harmonia axyridis was released in the region for pest control, but laboratory cultures are always free of He. virescens and P. bifurcatus. Therefore, the detection of He. virescens and P. bifurcatus indicates that the population of Ha. axyridis in the Caucasus could not have derived exclusively from released specimens. We did not find He. virescens on 400 specimens of 29 other ladybird species collected from the same localities as Ha. axyridis in the Caucasus. No reliable correlation between infestation by He. virescens and that by P. bifurcatus has been found. In addition to these two parasites, an unidentified species of the order Mermithida was recorded. This is the first documented case of Ha. axyridis infestation by a parasitic nematode of this order in nature. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
16. BlasterJS: A novel interactive JavaScript visualisation component for BLAST alignment results.
- Author
-
Blanco-Míguez, Aitor, Fdez-Riverola, Florentino, Sánchez, Borja, and Lourenço, Anália
- Subjects
- *
JAVASCRIPT programming language , *BIOINFORMATICS software , *MOLECULAR biology , *DATA modeling , *WEB-based user interfaces - Abstract
Background: The wide range of potential applications has made the Basic Local Alignment Search Tool (BLAST) a ubiquitous tool in the field of Molecular Biology. Within this context, it is increasingly appealing to embed BLAST services within larger Web applications. Results: This work introduces BlasterJS viewer, a new JavaScript library for the lightweight development of Web-based applications supporting the visualisation of BLAST outputs. BlasterJS detaches from similar data viewers by focusing on the visual and interactive display of sequence similarity results and being completely independent of BLAST services. BlasterJS is compatible with the text outputs generated by the BLAST family of programs, namely BLASTp, BLASTn, BLASTx, tBLASTn, and tBLASTx, and works in all major Web browsers. Furthermore, BlasterJS is available through the EBI’s BioJS registry 5, which extends its potential use to a wider scope of bioinformatics applications. Conclusions: BlasterJS is new Javascript library that enables easy and seamless integration of visual and interactive representations of BLAST outputs in Web-based applications supporting sequence similarity search. BlasterJS is free accessible at . [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
17. Predicting position along a looping immune response trajectory.
- Author
-
Rath, Poonam, Allen, Jessica A., and Schneider, David S.
- Subjects
- *
ECOLOGICAL resilience , *IMMUNE response , *PREVENTIVE medicine , *MONOCYTES , *PREDICTION models - Abstract
When we get sick, we want to be resilient and recover our original health. To measure resilience, we need to quantify a host's position along its disease trajectory. Here we present Looper, a computational method to analyze longitudinally gathered datasets and identify gene pairs that form looping trajectories when plotted in the space described by these phases. These loops enable us to track where patients lie on a typical trajectory back to health. We analyzed two publicly available, longitudinal human microarray datasets that describe self-resolving immune responses. Looper identified looping gene pairs expressed by human donor monocytes stimulated by immune elicitors, and in YF17D-vaccinated individuals. Using loops derived from training data, we found that we could predict the time of perturbation in withheld test samples with accuracies of 94% in the human monocyte data, and 65–83% within the same cohort and in two independent cohorts of YF17D vaccinated individuals. We suggest that Looper will be useful in building maps of resilient immune processes across organisms. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
18. A PageRank-based heuristic for the minimization of open stacks problem.
- Author
-
Frinhani, Rafael de Magalhães Dias, Carvalho, Marco Antonio Moreira de, and Soma, Nei Yoshihiro
- Subjects
- *
HEURISTIC , *INFORMATION science , *DATA visualization , *PROBLEM solving , *GRAPH theory - Abstract
The minimization of open stacks problem (MOSP) aims to determine the ideal production sequence to optimize the occupation of physical space in manufacturing settings. Most of current methods for solving the MOSP were not designed to work with large instances, precluding their use in specific cases of similar modeling problems. We therefore propose a PageRank-based heuristic to solve large instances modeled in graphs. In computational experiments, both data from the literature and new datasets up to 25 times fold larger in input size than current datasets, totaling 1330 instances, were analyzed to compare the proposed heuristic with state-of-the-art methods. The results showed the competitiveness of the proposed heuristic in terms of quality, as it found optimal solutions in several cases, and in terms of shorter run times compared with the fastest available method. Furthermore, based on specific graph densities, we found that the difference in the value of solutions between methods was small, thus justifying the use of the fastest method. The proposed heuristic is scalable and is more affected by graph density than by size. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
19. De novo assembly of a transcriptome for the cricket Gryllus bimaculatus prothoracic ganglion: An invertebrate model for investigating adult central nervous system compensatory plasticity.
- Author
-
Fisher, Harrison P., Pascual, Micah G., Jimenez, Sylvia I., Michaelson, David A., Joncas, Colby T., Quenzer, Eleanor D., Christie, Andrew E., and Horch, Hadley W.
- Subjects
- *
GRYLLUS bimaculatus , *NEUROPLASTICITY , *INSECT anatomy , *CENTRAL nervous system , *AUDITORY pathways - Abstract
The auditory system of the cricket, Gryllus bimaculatus, demonstrates an unusual amount of anatomical plasticity in response to injury, even in adults. Unilateral removal of the ear causes deafferented auditory neurons in the prothoracic ganglion to sprout dendrites across the midline, a boundary they typically respect, and become synaptically connected to the auditory afferents of the contralateral ear. The molecular basis of this sprouting and novel synaptogenesis in the adult is not understood. We hypothesize that well-conserved developmental guidance cues may recapitulate their guidance functions in the adult in order to facilitate this compensatory growth. As a first step in testing this hypothesis, we have generated a de novo assembly of a prothoracic ganglion transcriptome derived from control and deafferented adult individuals. We have mined this transcriptome for orthologues of guidance molecules from four well-conserved signaling families: Slit, Netrin, Ephrin, and Semaphorin. Here we report that transcripts encoding putative orthologues of most of the candidate developmental ligands and receptors from these signaling families were present in the assembly, indicating expression in the adult G. bimaculatus prothoracic ganglion. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
20. Solving the RNA design problem with reinforcement learning.
- Author
-
Eastman, Peter, Shi, Jade, Ramsundar, Bharath, and Pande, Vijay S.
- Subjects
- *
REINFORCEMENT learning , *NUCLEOTIDE sequence , *BIOENGINEERING , *COMPUTER science , *COMPUTATIONAL biology - Abstract
We use reinforcement learning to train an agent for computational RNA design: given a target secondary structure, design a sequence that folds to that structure in silico. Our agent uses a novel graph convolutional architecture allowing a single model to be applied to arbitrary target structures of any length. After training it on randomly generated targets, we test it on the Eterna100 benchmark and find it outperforms all previous algorithms. Analysis of its solutions shows it has successfully learned some advanced strategies identified by players of the game Eterna, allowing it to solve some very difficult structures. On the other hand, it has failed to learn other strategies, possibly because they were not required for the targets in the training set. This suggests the possibility that future improvements to the training protocol may yield further gains in performance. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
21. Xenin is a novel anorexigen in goldfish (Carassius auratus).
- Author
-
Kerbel, Brent, Badal, Kimberly, Sundarrajan, Lakshminarasimhan, Blanco, Ayelen, and Unniappan, Suraj
- Subjects
- *
AMINO acids , *PEPTIDES , *APPETITE depressants , *GOLDFISH , *MESSENGER RNA , *FISH feeds - Abstract
Xenin, a highly conserved 25 amino acid peptide cleaved from the N-terminus of the coatomer protein alpha (COPA), is emerging as a food intake regulator in mammals and birds. To date, no research has been conducted on xenin biology in fish. This study aims to identify the copa mRNA encoding xenin in goldfish (Carassius auratus) as a model, to elucidate its regulation by feeding, and to describe the role of xenin on appetite. First, a partial sequence of copa cDNA, a region encoding xenin, was identified from goldfish brain. This sequence is highly conserved among both vertebrates and invertebrates. RT-qPCR revealed that copa mRNAs are widely distributed in goldfish tissues, with the highest levels detected in the brain, gill, pituitary and J-loop. Immunohistochemistry confirmed also the presence of COPA peptide in the hypothalamus and enteroendocrine cells on the J-loop mucosa. In line with its anorexigenic effects, we found important periprandial fluctuations in copa mRNA expression in the hypothalamus, which were mainly characterized by a gradually decrease in copa mRNA levels as the feeding time was approached, and a gradual increase after feeding. Additionally, fasting differently modulated the expression of copa mRNA in a tissue-dependent manner. Peripheral and central injections of xenin reduce food intake in goldfish. This research provides the first report of xenin in fish, and shows that this peptide is a novel anorexigen in goldfish. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
22. A family of long intergenic non-coding RNA genes in human chromosomal region 22q11.2 carry a DNA translocation breakpoint/AT-rich sequence.
- Author
-
Delihas, Nicholas
- Subjects
- *
NON-coding RNA , *HUMAN chromosomes , *CHROMOSOMAL translocation , *SEQUENCE alignment , *SATELLITE DNA - Abstract
FAM230C, a long intergenic non-coding RNA (lincRNA) gene in human chromosome 13 (chr13) is a member of lincRNA genes termed family with sequence similarity 230. An analysis using bioinformatics search tools and alignment programs was undertaken to determine properties of FAM230C and its related genes. Results reveal that the DNA translocation element, the Translocation Breakpoint Type A (TBTA) sequence, which consists of satellite DNA, Alu elements, and AT-rich sequences is embedded in the FAM230C gene. Eight lincRNA genes related to FAM230C also carry the TBTA sequences. These genes were formed from a large segment of the 3’ half of the FAM230C sequence duplicated in chr22, and are specifically in regions of low copy repeats (LCR22)s, in or close to the 22q.11.2 region. 22q11.2 is a chromosomal segment that undergoes a high rate of DNA translocation and is prone to genetic deletions. FAM230C-related genes present in other chromosomes do not carry the TBTA motif and were formed from the 5’ half region of the FAM230C sequence. These findings identify a high specificity in lincRNA gene formation by gene sequence duplication in different chromosomes. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
23. The redox-sensing protein Rex modulates ethanol production in Thermoanaerobacterium saccharolyticum.
- Author
-
Zheng, Tianyong, Lanahan, Anthony A., Lynd, Lee R., and Olson, Daniel G.
- Subjects
- *
OXIDATION-reduction reaction , *THERMOPHILIC bacteria , *ETHANOL , *DELETION mutation , *ALCOHOL dehydrogenase , *COMPUTATIONAL biology - Abstract
Thermoanaerobacterium saccharolyticum is a thermophilic anaerobe that has been engineered to produce high amounts of ethanol, reaching ~90% theoretical yield at a titer of 70 g/L. Here we report the physiological changes that occur upon deleting the redox-sensing transcriptional regulator Rex in wild type T. saccharolyticum: a single deletion of rex resulted in a two-fold increase in ethanol yield (from 40% to 91% theoretical yield), but the resulting strains grew only about a third as fast as the wild type strain. Deletion of the rex gene also had the effect of increasing expression of alcohol dehydrogenase genes, adhE and adhA. After several serial transfers, the ethanol yield decreased from an average of 91% to 55%, and the growth rates had increased. We performed whole-genome resequencing to identify secondary mutations in the Δrex strains adapted for faster growth. In several cases, secondary mutations had appeared in the adhE gene. Furthermore, in these strains the NADH-linked alcohol dehydrogenase activity was greatly reduced. Complementation studies were done to reintroduce rex into the Δrex strains: reintroducing rex decreased ethanol yield to below wild type levels in the Δrex strain without adhE mutations, but did not change the ethanol yield in the Δrex strain where an adhE mutation occurred. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
24. New computational approaches to understanding molecular protein function.
- Author
-
Fetrow, Jacquelyn S. and Babbitt, Patricia C.
- Subjects
- *
COMPUTATIONAL biology , *PHYSIOLOGICAL effects of proteins , *PROTEIN genetics , *MOLECULAR biology , *ONTOLOGIES (Information retrieval) - Abstract
The author discusses new computational approaches to understanding molecular protein function. The author states that Gene Ontology (GO) system of classifying function recognizes ways of defining function, using distinct cellular components, molecular function, and mentions the need of understanding molecular function involving motifs, like sequence motifs exemplified by PRINTS and PROSITE. The author notes that contemporary protein superfamilies are the result of numerous genetic events.
- Published
- 2018
- Full Text
- View/download PDF
25. HMMER Cut-off Threshold Tool (HMMERCTTER): Supervised classification of superfamily protein sequences with a reliable cut-off threshold.
- Author
-
Pagnuco, Inti Anabela, Revuelta, María Victoria, Bondino, Hernán Gabriel, Brun, Marcel, and ten Have, Arjen
- Subjects
- *
AMINO acid sequence , *BIOLOGICAL evolution , *BIOINFORMATICS , *TASK performance , *COMPUTATIONAL biology , *DECISION support systems - Abstract
Background: Protein superfamilies can be divided into subfamilies of proteins with different functional characteristics. Their sequences can be classified hierarchically, which is part of sequence function assignation. Typically, there are no clear subfamily hallmarks that would allow pattern-based function assignation by which this task is mostly achieved based on the similarity principle. This is hampered by the lack of a score cut-off that is both sensitive and specific. Results: HMMER Cut-off Threshold Tool (HMMERCTTER) adds a reliable cut-off threshold to the popular HMMER. Using a high quality superfamily phylogeny, it clusters a set of training sequences such that the cluster-specific HMMER profiles show cluster or subfamily member detection with 100% precision and recall (P&R), thereby generating a specific threshold as inclusion cut-off. Profiles and thresholds are then used as classifiers to screen a target dataset. Iterative inclusion of novel sequences to groups and the corresponding HMMER profiles results in high sensitivity while specificity is maintained by imposing 100% P&R self detection. In three presented case studies of protein superfamilies, classification of large datasets with 100% precision was achieved with over 95% recall. Limits and caveats are presented and explained. Conclusions: HMMERCTTER is a promising protein superfamily sequence classifier provided high quality training datasets are used. It provides a decision support system that aids in the difficult task of sequence function assignation in the twilight zone of sequence similarity. All relevant data and source codes are available from the Github repository at the following URL: . [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
26. Taxonomy and physiology of Pseudoxanthomonas arseniciresistens sp. nov., an arsenate and nitrate-reducing novel gammaproteobacterium from arsenic contaminated groundwater, India.
- Author
-
Mohapatra, Balaram, Sar, Pinaki, Kazy, Sufia Khannam, Maiti, Mrinal Kumar, and Satyanarayana, Tulasi
- Subjects
- *
ARSENIC content in groundwater , *PROTEOBACTERIA , *BACTERIAL DNA , *NUCLEOTIDE sequencing , *POLYAMINES - Abstract
Reductive transformation of toxic arsenic (As) species by As reducing bacteria (AsRB) is a key process in As-biogeochemical-cycling within the subsurface aquifer environment. In this study, we have characterized a Gram-stain-negative, non-spore-forming, rod-shaped As reducing bacterium designated KAs 5-3T, isolated from highly As-contaminated groundwater of India. Strain KAs 5-3T displayed high 16S rRNA gene sequence similarity to the members of the genus Pseudoxanthomonas, with P. mexicana AMX 26BT (99.25% similarity), P. japonensis 12-3T (98.9 0%), P. putridarboris WD-12T (98.02%), and P. indica P15T (97.27%) as closest phylogenetic neighbours. DNA-DNA hybridization study unambiguously indicated that strain KAs 5-3T represented a novel species that was separate from reference strains of P. mexicana AMX 26BT (35.7%), P. japonensis 12-3T (35.5%), P. suwonensis 4M1T (35.5%), P. wuyuanensis XC21-2T (35.0%), P. indica P15T (32.5%), P. daejeonensis TR6-08T (32.0%), and P. putridarboris WD12T (22.1%). The DNA G+C content of strain KAs 5-3T was 64.9 mol %. The predominant fatty acids were C15:0 (37.4%), C16:0 iso (12.6%), C17:1 iso ω9c (10.5%), C15:0 anteiso (9.5%), C11:0 iso 3-OH (8.5%), and C16:1 ω7c/ C16:1 ω6c (7.5%). The major polar lipids were diphosphatidylglycerol, phosphatidyldimethylethanolamine, phosphatidylcholine, and two unknown phospholipids (PL1, PL2). Ubiquinone 8 (Q8) was the predominant respiratory quinone and spermidine was the major polyamine of the strain KAs 5-3T. Cells of strain KAs 5-3T showed the ability to use O2, As5+, NO3-, NO2-, and Fe3+ as terminal electron acceptors as well as to reduce As5+ through the cytosolic process under aerobic incubations. Genes encoding arsenate reductase (arsC) for As-detoxification, nitrate- and nitrite reductase (narG and nirS) for denitrification were detected in the strain KAs 5-3T. Based on taxonomic and physiological data, strain KAs 5-3T is described as a new representative member of the genus Pseudoxanthomonas, for which the name Pseudoxanthomonas arseniciresistens sp. nov. is proposed. The type strain is KAs 5-3T (= LMG 29169T = MTCC 12116T = MCC 3121T). [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
27. Crystal structure of a cold-active protease (Pro21717) from the psychrophilic bacterium, Pseudoalteromonas arctica PAMC 21717, at 1.4 Å resolution: Structural adaptations to cold and functional analysis of a laundry detergent enzyme.
- Author
-
Park, Ha Ju, Lee, Chang Woo, Kim, Dockyu, Do, Hackwon, Han, Se Jong, Kim, Jung Eun, Koo, Bon-Hun, Lee, Jun Hyuck, and Yim, Joung Han
- Subjects
- *
ENZYMES , *CRYSTAL structure , *CATALYTIC domains , *LAUNDRY detergents , *FUNCTIONAL analysis , *PROTEOLYTIC enzymes - Abstract
Enzymes isolated from organisms found in cold habitats generally exhibit higher catalytic activity at low temperatures than their mesophilic homologs and are therefore known as cold-active enzymes. Cold-active proteases are very useful in a variety of biotechnological applications, particularly as active ingredients in laundry and dishwashing detergents, where they provide strong protein-degrading activity in cold water. We identified a cold-active protease (Pro21717) from a psychrophilic bacterium, Pseudoalteromonas arctica PAMC 21717, and determined the crystal structure of its catalytic domain (CD) at a resolution of 1.4 Å. The Pro21717-CD structure shows a conserved subtilisin-like fold with a typical catalytic triad (Asp185, His244, and Ser425) and contains four calcium ions and three disulfide bonds. Interestingly, we observed an unexpected electron density at the substrate-binding site from a co-purified peptide. Although the sequence of this peptide is unknown, analysis of the peptide-complexed structure nonetheless provides some indication of the substrate recognition and binding mode of Pro21717. Moreover, various parameters, including a wide substrate pocket size, an abundant active-site loop content, and a flexible structure provide potential explanations for the cold-adapted properties of Pro21717. In conclusion, this is first structural characterization of a cold-adapted subtilisin-like protease, and these findings provide a structural and functional basis for industrial applications of Pro21717 as a cold-active laundry or dishwashing detergent enzyme. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
28. TopicalPdb: A database of topically delivered peptides.
- Author
-
Mathur, Deepika, Mehta, Ayesha, Firmal, Priyanka, Bedi, Gursimran, Sood, Charu, Gautam, Ankur, and Raghava, Gajendra P. S.
- Subjects
- *
MOLECULAR structure of peptides , *C-terminal residues , *BIOINFORMATICS , *PHARMACOLOGY , *COMPUTATIONAL biology - Abstract
TopicalPdb () is a repository of experimentally verified topically delivered peptides. Data was manually collected from research articles. The current release of TopicalPdb consists of 657 entries, which includes peptides delivered through the skin (462 entries), eye (173 entries), and nose (22 entries). Each entry provides comprehensive information related to these peptides like the source of origin, nature of peptide, length, N- and C-terminal modifications, mechanism of penetration, type of assays, cargo and biological properties of peptides, etc. In addition to natural peptides, TopicalPdb contains information of peptides having non-natural, chemically modified residues and D-amino acids. Besides this primary information, TopicalPdb stores predicted tertiary structures as well as peptide sequences in SMILE format. Tertiary structures of peptides were predicted using state-of-art method PEPstrMod. In order to assist users, a number of web-based tools have been integrated that includes keyword search, data browsing, similarity search and structural similarity. We believe that TopicalPdb is a unique database of its kind and it will be very useful in designing peptides for non-invasive topical delivery. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
29. Recombinant expression, purification and biochemical characterization of kievitone hydratase from Nectria haematococca.
- Author
-
Engleder, Matthias, Horvat, Melissa, Emmerstorfer-Augustin, Anita, Wriessnegger, Tamara, Gabriel, Stefanie, Strohmeier, Gernot, Weber, Hansjörg, Müller, Monika, Kaluzna, Iwona, Mink, Daniel, Schürmann, Martin, and Pichler, Harald
- Subjects
- *
HYDRATASES , *ISOFLAVONOIDS , *GLYCOPROTEINS , *RECOMBINANT proteins , *BIOCHEMICAL substrates - Abstract
Kievitone hydratase catalyzes the addition of water to the double bond of the prenyl moiety of plant isoflavonoid kievitone and, thereby, forms the tertiary alcohol hydroxy-kievitone. In nature, this conversion is associated with a defense mechanism of fungal pathogens against phytoalexins generated by host plants after infection. As of today, a gene sequence coding for kievitone hydratase activity has only been identified and characterized in Fusarium solani f. sp. phaseoli. Here, we report on the identification of a putative kievitone hydratase sequence in Nectria haematococca (NhKHS), the teleomorph state of F. solani, based on in silico sequence analyses. After heterologous expression of the enzyme in the methylotrophic yeast Pichia pastoris, we have confirmed its kievitone hydration activity and have assessed its biochemical properties and substrate specificity. Purified recombinant NhKHS is obviously a homodimeric glycoprotein. Due to its good activity for the readily available chalcone derivative xanthohumol (XN), this compound was selected as a model substrate for biochemical studies. The optimal pH and temperature for hydratase activity were 6.0 and 35°C, respectively, and apparent Vmax and Km values for hydration of XN were 7.16 μmol min-1 mg-1 and 0.98 ± 0.13 mM, respectively. Due to its catalytic properties and apparent substrate promiscuity, NhKHS is a promising enzyme for the biocatalytic production of tertiary alcohols. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
30. Distribution of triclosan-resistant genes in major pathogenic microorganisms revealed by metagenome and genome-wide analysis.
- Author
-
Khan, Raees, Roy, Nazish, Choi, Kihyuck, and Lee, Seon-Woo
- Subjects
- *
TRICLOSAN , *PHENOTYPES , *GENE mapping , *CARRIER proteins , *PATHOGENIC microorganisms - Abstract
The substantial use of triclosan (TCS) has been aimed to kill pathogenic bacteria, but TCS resistance seems to be prevalent in microbial species and limited knowledge exists about TCS resistance determinants in a majority of pathogenic bacteria. We aimed to evaluate the distribution of TCS resistance determinants in major pathogenic bacteria (N = 231) and to assess the enrichment of potentially pathogenic genera in TCS contaminated environments. A TCS-resistant gene (TRG) database was constructed and experimentally validated to predict TCS resistance in major pathogenic bacteria. Genome-wide in silico analysis was performed to define the distribution of TCS-resistant determinants in major pathogens. Microbiome analysis of TCS contaminated soil samples was also performed to investigate the abundance of TCS-resistant pathogens. We experimentally confirmed that TCS resistance could be accurately predicted using genome-wide in silico analysis against TRG database. Predicted TCS resistant phenotypes were observed in all of the tested bacterial strains (N = 17), and heterologous expression of selected TCS resistant genes from those strains conferred expected levels of TCS resistance in an alternative host Escherichia coli. Moreover, genome-wide analysis revealed that potential TCS resistance determinants were abundant among the majority of human-associated pathogens (79%) and soil-borne plant pathogenic bacteria (98%). These included a variety of enoyl-acyl carrier protein reductase (ENRs) homologues, AcrB efflux pumps, and ENR substitutions. FabI ENR, which is the only known effective target for TCS, was either co-localized with other TCS resistance determinants or had TCS resistance-associated substitutions. Furthermore, microbiome analysis revealed that pathogenic genera with intrinsic TCS-resistant determinants exist in TCS contaminated environments. We conclude that TCS may not be as effective against the majority of bacterial pathogens as previously presumed. Further, the excessive use of this biocide in natural environments may selectively enrich for not only TCS-resistant bacterial pathogens, but possibly for additional resistance to multiple antibiotics. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
31. Comparing fixed sampling with minimizer sampling when using k-mer indexes to find maximal exact matches.
- Author
-
Almutairy, Meznah and Torng, Eric
- Subjects
- *
STATISTICAL sampling , *BIOINFORMATICS , *HUMAN genome , *SEQUENCE analysis , *COMPUTATIONAL biology - Abstract
Bioinformatics applications and pipelines increasingly use k-mer indexes to search for similar sequences. The major problem with k-mer indexes is that they require lots of memory. Sampling is often used to reduce index size and query time. Most applications use one of two major types of sampling: fixed sampling and minimizer sampling. It is well known that fixed sampling will produce a smaller index, typically by roughly a factor of two, whereas it is generally assumed that minimizer sampling will produce faster query times since query k-mers can also be sampled. However, no direct comparison of fixed and minimizer sampling has been performed to verify these assumptions. We systematically compare fixed and minimizer sampling using the human genome as our database. We use the resulting k-mer indexes for fixed sampling and minimizer sampling to find all maximal exact matches between our database, the human genome, and three separate query sets, the mouse genome, the chimp genome, and an NGS data set. We reach the following conclusions. First, using larger k-mers reduces query time for both fixed sampling and minimizer sampling at a cost of requiring more space. If we use the same k-mer size for both methods, fixed sampling requires typically half as much space whereas minimizer sampling processes queries only slightly faster. If we are allowed to use any k-mer size for each method, then we can choose a k-mer size such that fixed sampling both uses less space and processes queries faster than minimizer sampling. The reason is that although minimizer sampling is able to sample query k-mers, the number of shared k-mer occurrences that must be processed is much larger for minimizer sampling than fixed sampling. In conclusion, we argue that for any application where each shared k-mer occurrence must be processed, fixed sampling is the right sampling method. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
32. Extension of the viral ecology in humans using viral profile hidden Markov models.
- Author
-
Bzhalava, Zurab, Hultin, Emilie, and Dillner, Joakim
- Subjects
- *
VIRAL ecology , *VIRAL proteins , *NUCLEOTIDE sequence , *GEMINIVIRIDAE , *MARKOV processes - Abstract
When human samples are sequenced, many assembled contigs are “unknown”, as conventional alignments find no similarity to known sequences. Hidden Markov models (HMM) exploit the positions of specific nucleotides in protein-encoding codons in various microbes. The algorithm HMMER3 implements HMM using a reference set of sequences encoding viral proteins, “vFam”. We used HMMER3 analysis of “unknown” human sample-derived sequences and identified 510 contigs distantly related to viruses (Anelloviridae (n = 1), Baculoviridae (n = 34), Circoviridae (n = 35), Caulimoviridae (n = 3), Closteroviridae (n = 5), Geminiviridae (n = 21), Herpesviridae (n = 10), Iridoviridae (n = 12), Marseillevirus (n = 26), Mimiviridae (n = 80), Phycodnaviridae (n = 165), Poxviridae (n = 23), Retroviridae (n = 6) and 89 contigs related to described viruses not yet assigned to any taxonomic family). In summary, we find that analysis using the HMMER3 algorithm and the “vFam” database greatly extended the detection of viruses in biospecimens from humans. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
33. Large-scale mapping of bioactive peptides in structural and sequence space.
- Author
-
Nardo, Agustina E., Añón, M. Cristina, and Parisi, Gustavo
- Subjects
- *
BIOACTIVE compounds , *PROTEIN content of food , *PROTEIN folding , *NUCLEOTIDE sequence , *BIOINFORMATICS - Abstract
Health-enhancing potential bioactive peptide (BP) has driven an interest in food proteins as well as in the development of predictive methods. Research in this area has been especially active to use them as components in functional foods. Apparently, BPs do not have a given biological function in the containing proteins and they do not evolve under independent evolutionary constraints. In this work we performed a large-scale mapping of BPs in sequence and structural space. Using well curated BP deposited in BIOPEP database, we searched for exact matches in non-redundant sequences databases. Proteins containing BPs, were used in fold-recognition methods to predict the corresponding folds and BPs occurrences were mapped. We found that fold distribution of BP occurrences possibly reflects sequence relative abundance in databases. However, we also found that proteins with 5 or more than 5 BP in their sequences correspond to well populated protein folds, called superfolds. Also, we found that in well populated superfamilies, BPs tend to adopt similar locations in the protein fold, suggesting the existence of hotspots. We think that our results could contribute to the development of new bioinformatics pipeline to improve BP detection. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
34. Three unrelated and unexpected amino acids determine the susceptibility of the interface cysteine to a sulfhydryl reagent in the triosephosphate isomerases of two trypanosomes.
- Author
-
Díaz-Mazariegos, Selma, Cabrera, Nallely, and Perez-Montfort, Ruy
- Subjects
- *
AMINO acids , *CYSTEINE , *SULFHYDRYL group , *TRIOSE-phosphate isomerase , *AMINO acid sequence - Abstract
Proteins with great sequence similarity usually have similar structure, function and other physicochemical properties. But in many cases, one or more of the physicochemical or functional characteristics differ, sometimes very considerably, among these homologous proteins. To better understand how critical amino acids determine quantitative properties of function in proteins, the responsible residues must be located and identified. This can be difficult to achieve, particularly in cases where multiple amino acids are involved. In this work, two triosephosphate isomerases with very high similarity from two related human parasites were used to address one such problem. We demonstrate that a seventy-fold difference in the reactivity of an interface cysteine to the sulfhydryl reagent methylmethane sulfonate in these two enzymes depends on three amino acids located far away from this critical residue and which could not have been predicted using other current methods. Starting from previous observations with chimeric proteins involving these two triosephosphate isomerases, we developed a strategy involving additive mutant enzymes and selected site directed mutants to locate and identify the three amino acids. These three residues seem to induce changes in the interface cysteine in reactivity by increasing (or decreasing) its apparent pKa. Some enzymes with four to seven mutations also exhibited altered reactivity. This study completes a strategy for identifying key residues in the sequences of proteins that can have applications in future protein structure-function studies. [ABSTRACT FROM AUTHOR]
- Published
- 2018
- Full Text
- View/download PDF
35. Conservation of a microRNA cluster in parasitic nematodes and profiling of miRNAs in excretory-secretory products and microvesicles of Haemonchus contortus.
- Author
-
Gu, Henry Y., Marks, Neil D., Winter, Alan D., Weir, William, Tzelos, Thomas, McNeilly, Tom N., Britton, Collette, and Devaney, Eileen
- Subjects
- *
MICRORNA , *NEMATODES , *HAEMONCHUS contortus , *ASCARIDIDA , *HOST-parasite relationships , *GENE expression - Abstract
microRNAs are small non-coding RNAs that are important regulators of gene expression in a range of animals, including nematodes. We have analysed a cluster of four miRNAs from the pathogenic nematode species Haemonchus contortus that are closely linked in the genome. We find that the cluster is conserved only in clade V parasitic nematodes and in some ascarids, but not in other clade III species nor in clade V free-living nematodes. Members of the cluster are present in parasite excretory-secretory products and can be detected in the abomasum and draining lymph nodes of infected sheep, indicating their release in vitro and in vivo. As observed for other parasitic nematodes, H. contortus adult worms release extracellular vesicles (EV). Small RNA libraries were prepared from vesicle-enriched and vesicle-depleted supernatants from both adult worms and L4 stage larvae. Comparison of the miRNA species in the different fractions indicated that specific miRNAs are packaged within vesicles, while others are more abundant in vesicle-depleted supernatant. Hierarchical clustering analysis indicated that the gut is the likely source of vesicle-associated miRNAs in the L4 stage, but not in the adult worm. These findings add to the growing body of work demonstrating that miRNAs released from parasitic helminths may play an important role in host-parasite interactions. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
36. Target-similarity search using Plasmodium falciparum proteome identifies approved drugs with anti-malarial activity and their possible targets.
- Author
-
Mogire, Reagan M., Akala, Hoseah M., Macharia, Rosaline W., Juma, Dennis W., Cheruiyot, Agnes C., Andagalu, Ben, Brown, Mathew L., El-Shemy, Hany A., and Nyanjom, Steven G.
- Subjects
- *
MALARIA treatment , *ANTIMALARIALS , *PLASMODIUM falciparum , *PROTEOMICS , *CAUSES of death , *ARTEMISININ - Abstract
Malaria causes about half a million deaths annually, with Plasmodium falciparum being responsible for 90% of all the cases. Recent reports on artemisinin resistance in Southeast Asia warrant urgent discovery of novel drugs for the treatment of malaria. However, most bioactive compounds fail to progress to treatments due to safety concerns. Drug repositioning offers an alternative strategy where drugs that have already been approved as safe for other diseases could be used to treat malaria. This study screened approved drugs for antimalarial activity using an in silico chemogenomics approach prior to in vitro verification. All the P. falciparum proteins sequences available in NCBI RefSeq were mined and used to perform a similarity search against DrugBank, TTD and STITCH databases to identify similar putative drug targets. Druggability indices of the potential P. falciparum drug targets were obtained from TDR targets database. Functional amino acid residues of the drug targets were determined using ConSurf server which was used to fine tune the similarity search. This study predicted 133 approved drugs that could target 34 P. falciparum proteins. A literature search done at PubMed and Google Scholar showed 105 out of the 133 drugs to have been previously tested against malaria, with most showing activity. For further validation, drug susceptibility assays using SYBR Green I method were done on a representative group of 10 predicted drugs, eight of which did show activity against P. falciparum 3D7 clone. Seven had IC50 values ranging from 1 μM to 50 μM. This study also suggests drug-target association and hence possible mechanisms of action of drugs that did show antiplasmodial activity. The study results validate the use of proteome-wide target similarity approach in identifying approved drugs with activity against P. falciparum and could be adapted for other pathogens. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
37. The transcriptional response to the olive fruit fly (Bactrocera oleae) reveals extended differences between tolerant and susceptible olive (Olea europaea L.) varieties.
- Author
-
Grasso, Filomena, Coppola, Mariangela, Carbone, Fabrizio, Baldoni, Luciana, Alagna, Fiammetta, Perrotta, Gaetano, Pérez-Pulido, Antonio J., Garonna, Antonio, Facella, Paolo, Daddiego, Loretta, Lopez, Loredana, Vitiello, Alessia, Rao, Rosa, and Corrado, Giandomenico
- Subjects
- *
OLIVE fly , *OLIVE diseases & pests , *GENE expression in plants , *PYROSEQUENCING ,OLIVE varieties - Abstract
The olive fruit fly Bactrocera oleae (Diptera: Tephritidae) is the most devastating pest of cultivated olive (Olea europaea L.). Intraspecific variation in plant resistance to B. oleae has been described only at phenotypic level. In this work, we used a transcriptomic approach to study the molecular response to the olive fruit fly in two olive cultivars with contrasting level of susceptibility. Using next-generation pyrosequencing, we first generated a catalogue of more than 80,000 sequences expressed in drupes from approximately 700k reads. The assembled sequences were used to develop a microarray layout with over 60,000 olive-specific probes. The differential gene expression analysis between infested (i.e. with II or III instar larvae) and control drupes indicated a significant intraspecific variation between the more tolerant and susceptible cultivar. Around 2500 genes were differentially regulated in infested drupes of the tolerant variety. The GO annotation of the differentially expressed genes implies that the inducible resistance to the olive fruit fly involves a number of biological functions, cellular processes and metabolic pathways, including those with a known role in defence, oxidative stress responses, cellular structure, hormone signalling, and primary and secondary metabolism. The difference in the induced transcriptional changes between the cultivars suggests a strong genetic role in the olive inducible defence, which can ultimately lead to the discovery of factors associated with a higher level of tolerance to B. oleae. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
38. Study of three interesting Amanita species from Thailand: Morphology, multiple-gene phylogeny and toxin analysis.
- Author
-
Thongbai, Benjarong, Miller, Steven L., Stadler, Marc, Wittstein, Kathrin, Hyde, Kevin D., Lumyong, Saisamorn, and Raspé, Olivier
- Subjects
- *
AMANITA phalloides , *AMATOXINS , *PHYLOGENY , *TOXICOLOGY , *BIOINFORMATICS - Abstract
Amanita ballerina and A. brunneitoxicaria spp. nov. are introduced from Thailand. Amanita fuligineoides is also reported for the first time from Thailand, increasing the known distribution of this taxon. Together, those findings support our view that many taxa are yet to be discovered in the region. While both morphological characters and a multiple-gene phylogeny clearly place A. brunneitoxicaria and A. fuligineoides in sect. Phalloideae (Fr.) Quél., the placement of A. ballerina is problematic. On the one hand, the morphology of A. ballerina shows clear affinities with stirps Limbatula of sect. Lepidella. On the other hand, in a multiple-gene phylogeny including taxa of all sections in subg. Lepidella, A. ballerina and two other species, including A. zangii, form a well-supported clade sister to the Phalloideae sensu Bas 1969, which include the lethal “death caps” and “destroying angels”. Together, the A. ballerina-A. zangii clade and the Phalloideae sensu Bas 1969 also form a well-supported clade. We therefore screened for two of the most notorious toxins by HPLC-MS analysis of methanolic extracts from the basidiomata. Interestingly, neither α-amanitin nor phalloidin was found in A. ballerina, whereas Amanita fuligineoides was confirmed to contain both α-amanitin and phalloidin, and A. brunneitoxicaria contained only α-amanitin. Together with unique morphological characteristics, the position in the phylogeny indicates that A. ballerina is either an important link in the evolution of the deadly Amanita sect. Phalloideae species, or a member of a new section also including A. zangii. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
39. THPdb: Database of FDA-approved peptide and protein therapeutics.
- Author
-
Usmani, Salman Sadullah, Bedi, Gursimran, Samuel, Jesse S., Singh, Sandeep, Kalra, Sourav, Kumar, Pawan, Ahuja, Anjuman Arora, Sharma, Meenu, Gautam, Ankur, and Raghava, Gajendra P. S.
- Subjects
- *
THERAPEUTIC use of proteins , *PEPTIDES , *PHARMACODYNAMICS , *CHEMICAL properties , *THERAPEUTICS - Abstract
THPdb () is a manually curated repository of Food and Drug Administration (FDA) approved therapeutic peptides and proteins. The information in THPdb has been compiled from 985 research publications, 70 patents and other resources like DrugBank. The current version of the database holds a total of 852 entries, providing comprehensive information on 239 US-FDA approved therapeutic peptides and proteins and their 380 drug variants. The information on each peptide and protein includes their sequences, chemical properties, composition, disease area, mode of activity, physical appearance, category or pharmacological class, pharmacodynamics, route of administration, toxicity, target of activity, etc. In addition, we have annotated the structure of most of the protein and peptides. A number of user-friendly tools have been integrated to facilitate easy browsing and data analysis. To assist scientific community, a web interface and mobile App have also been developed. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
40. Community detection in sequence similarity networks based on attribute clustering.
- Author
-
Chowdhary, Janamejaya, Löffler, Frank E., and Smith, Jeremy C.
- Subjects
- *
AMINO acid sequence , *SEQUENCE analysis , *PROTEIN structure , *SEQUENCE alignment , *CLUSTER analysis (Statistics) , *LINEAR algebra - Abstract
Networks are powerful tools for the presentation and analysis of interactions in multi-component systems. A commonly studied mesoscopic feature of networks is their community structure, which arises from grouping together similar nodes into one community and dissimilar nodes into separate communities. Here, the community structure of protein sequence similarity networks is determined with a new method: Attribute Clustering Dependent Communities (ACDC). Sequence similarity has hitherto typically been quantified by the alignment score or its expectation value. However, pair alignments with the same score or expectation value cannot thus be differentiated. To overcome this deficiency, the method constructs, for pair alignments, an extended alignment metric, the link attribute vector, which includes the score and other alignment characteristics. Rescaling components of the attribute vectors qualitatively identifies a systematic variation of sequence similarity within protein superfamilies. The problem of community detection is then mapped to clustering the link attribute vectors, selection of an optimal subset of links and community structure refinement based on the partition density of the network. ACDC-predicted communities are found to be in good agreement with gold standard sequence databases for which the “ground truth” community structures (or families) are known. ACDC is therefore a community detection method for sequence similarity networks based entirely on pair similarity information. A serial implementation of ACDC is available from [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
41. Molecular mechanics of Staphylococcus aureus adhesin, CNA, and the inhibition of bacterial adhesion by stretching collagen.
- Author
-
Madani, Ali, Garakani, Kiavash, and Mofrad, Mohammad R. K.
- Subjects
- *
STAPHYLOCOCCUS aureus , *BACTERIAL adhesins , *RESPONSE inhibition , *COLLAGEN , *BACTERIAL diseases - Abstract
Bacterial adhesion to collagen, the most abundant protein in humans, is a critical step in the initiation and persistence of numerous bacterial infections. In this study, we explore the collagen binding mechanism of the multi-modular cell wall anchored collagen adhesin (CNA) in Staphylococcus aureus and examine how applied mechanical forces can modulate adhesion ability. The common structural-functional elements and domain organization of CNA are present across over 50 genera of bacteria. Through the use of molecular dynamics models and normal mode analysis, we shed light on the CNA’s structural and conformational dynamics and its interactions with collagen that lead to collagen binding. Our results suggest that the linker region, CNA165-173, acts as a hinge exhibiting bending, extensional, and torsional modes of structural flexibility and its residues are key in the interaction of the CNA-collagen complex. Steered molecular dynamics simulations were conducted with umbrella sampling. During the course of these simulations, the ‘locking’ latch from the CNA N2 domain was dissociated from its groove in the CNA N1 domain, implying the importance of the latch for effective ligand binding. Finally, we observed that the binding efficiency of the CNA N1-N2 domains to collagen decreases greatly with increasing tensile force application to the collagen peptides. Thus, CNA and similar adhesins might preferentially bind to sites in which collagen fibers are cleaved, such as in wounded, injured, or inflamed tissues, or in which the collagenous tissue is less mature. As alternative techniques for control of bacterial infection are in-demand due to the rise of bacterial antibiotic resistance, results from our computational studies with respect to the mechanoregulation of the collagen binding site may inspire new therapeutics and engineering solutions by mechanically preventing colonization and/or further pathogenesis. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
42. A TALE-inspired computational screen for proteins that contain approximate tandem repeats.
- Author
-
Perycz, Malgorzata, Krwawicz, Joanna, and Bochtler, Matthias
- Subjects
- *
TANDEM repeats , *BACTERIAL proteins , *PLANT cells & tissues , *AMINO acid sequence , *REPEATED sequence (Genetics) - Abstract
TAL (transcription activator-like) effectors (TALEs) are bacterial proteins that are secreted from bacteria to plant cells to act as transcriptional activators. TALEs and related proteins (RipTALs, BurrH, MOrTL1 and MOrTL2) contain approximate tandem repeats that differ in conserved positions that define specificity. Using PERL, we screened ~47 million protein sequences for TALE-like architecture characterized by approximate tandem repeats (between 30 and 43 amino acids in length) and sequence variability in conserved positions, without requiring sequence similarity to TALEs. Candidate proteins were scored according to their propensity for nuclear localization, secondary structure, repeat sequence complexity, as well as covariation and predicted structural proximity of variable residues. Biological context was tentatively inferred from co-occurrence of other domains and interactome predictions. Approximate repeats with TALE-like features that merit experimental characterization were found in a protein of chestnut blight fungus, a eukaryotic plant pathogen. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
43. DNA reference libraries of French Guianese mosquitoes for barcoding and metabarcoding.
- Author
-
Talaga, Stanislas, Leroy, Céline, Guidez, Amandine, Dusfour, Isabelle, Girod, Romain, Dejean, Alain, and Murienne, Jérôme
- Subjects
- *
FRENCH Guianese , *MOSQUITOES , *ARTHROPODA , *GENETIC barcoding , *TAXONOMY - Abstract
The mosquito family (Diptera: Culicidae) constitutes the most medically important group of arthropods because certain species are vectors of human pathogens. In some parts of the world, the diversity is so high that the accurate delimitation and/or identification of species is challenging. A DNA-based identification system for all animals has been proposed, the so-called DNA barcoding approach. In this study, our objectives were (i) to establish DNA barcode libraries for the mosquitoes of French Guiana based on the COI and the 16S markers, (ii) to compare distance-based and tree-based methods of species delimitation to traditional taxonomy, and (iii) to evaluate the accuracy of each marker in identifying specimens. A total of 266 specimens belonging to 75 morphologically identified species or morphospecies were analyzed allowing us to delimit 86 DNA clusters with only 21 of them already present in the BOLD database. We thus provide a substantial contribution to the global mosquito barcoding initiative. Our results confirm that DNA barcodes can be successfully used to delimit and identify mosquito species with only a few cases where the marker could not distinguish closely related species. Our results also validate the presence of new species identified based on morphology, plus potential cases of cryptic species. We found that both COI and 16S markers performed very well, with successful identifications at the species level of up to 98% for COI and 97% for 16S when compared to traditional taxonomy. This shows great potential for the use of metabarcoding for vector monitoring and eco-epidemiological studies. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
44. Genet-specific DNA methylation probabilities detected in a spatial epigenetic analysis of a clonal plant population.
- Author
-
Araki, Kiwako S., Kubo, Takuya, and Kudoh, Hiroshi
- Subjects
- *
PLANT epigenetics , *DNA methylation , *PLANT population genetics , *GENETIC polymorphisms in plants , *GENOTYPE-environment interaction - Abstract
In sessile organisms such as plants, spatial genetic structures of populations show long-lasting patterns. These structures have been analyzed across diverse taxa to understand the processes that determine the genetic makeup of organismal populations. For many sessile organisms that mainly propagate via clonal spread, epigenetic status can vary between clonal individuals in the absence of genetic changes. However, fewer previous studies have explored the epigenetic properties in comparison to the genetic properties of natural plant populations. Here, we report the simultaneous evaluation of the spatial structure of genetic and epigenetic variation in a natural population of the clonal plant Cardamine leucantha. We applied a hierarchical Bayesian model to evaluate the effects of membership of a genet (a group of individuals clonally derived from a single seed) and vegetation cover on the epigenetic variation between ramets (clonal plants that are physiologically independent individuals). We sampled 332 ramets in a 20 m × 20 m study plot that contained 137 genets (identified using eight SSR markers). We detected epigenetic variation in DNA methylation at 24 methylation-sensitive amplified fragment length polymorphism (MS-AFLP) loci. There were significant genet effects at all 24 MS-AFLP loci in the distribution of subepiloci. Vegetation cover had no statistically significant effect on variation in the majority of MS-AFLP loci. The spatial aggregation of epigenetic variation is therefore largely explained by the aggregation of ramets that belong to the same genets. By applying hierarchical Bayesian analyses, we successfully identified a number of genet-specific changes in epigenetic status within a natural plant population in a complex context, where genotypes and environmental factors are unevenly distributed. This finding suggests that it requires further studies on the spatial epigenetic structure of natural populations of diverse organisms, particularly for sessile clonal species. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
45. Environmental DNA (eDNA) metabarcoding assays to detect invasive invertebrate species in the Great Lakes.
- Author
-
Klymus, Katy E., Marshall, Nathaniel T., and Stepien, Carol A.
- Subjects
- *
DNA analysis , *ECOSYSTEM management , *BIODIVERSITY , *BIOLOGICAL assay - Abstract
Describing and monitoring biodiversity comprise integral parts of ecosystem management. Recent research coupling metabarcoding and environmental DNA (eDNA) demonstrate that these methods can serve as important tools for surveying biodiversity, while significantly decreasing the time, expense and resources spent on traditional survey methods. The literature emphasizes the importance of genetic marker development, as the markers dictate the applicability, sensitivity and resolution ability of an eDNA assay. The present study developed two metabarcoding eDNA assays using the mtDNA 16S RNA gene with Illumina MiSeq platform to detect invertebrate fauna in the Laurentian Great Lakes and surrounding waterways, with a focus for use on invasive bivalve and gastropod species monitoring. We employed careful primer design and in vitro testing with mock communities to assess ability of the markers to amplify and sequence targeted species DNA, while retaining rank abundance information. In our mock communities, read abundances reflected the initial input abundance, with regressions having significant slopes (p<0.05) and high coefficients of determination (R2) for all comparisons. Tests on field environmental samples revealed similar ability of our markers to measure relative abundance. Due to the limited reference sequence data available for these invertebrate species, care must be taken when analyzing results and identifying sequence reads to species level. These markers extend eDNA metabarcoding research for molluscs and appear relevant to other invertebrate taxa, such as rotifers and bryozoans. Furthermore, the sphaeriid mussel assay is group-specific, exclusively amplifying bivalves in the Sphaeridae family and providing species-level identification. Our assays provide useful tools for managers and conservation scientists, facilitating early detection of invasive species as well as improving resolution of mollusc diversity. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
46. Mitochondrial genome evolution in Alismatales: Size reduction and extensive loss of ribosomal protein genes.
- Author
-
Petersen, Gitte, Cuenca, Argelia, Zervas, Athanasios, Ross, Gregory T., Graham, Sean W., Barrett, Craig F., Davis, Jerrold I., and Seberg, Ole
- Subjects
- *
HELOBIAE , *PLANT mitochondria , *MITOCHONDRIAL RNA , *GENOMES , *RIBOSOMAL proteins - Abstract
The order Alismatales is a hotspot for evolution of plant mitochondrial genomes characterized by remarkable differences in genome size, substitution rates, RNA editing, retrotranscription, gene loss and intron loss. Here we have sequenced the complete mitogenomes of Zostera marina and Stratiotes aloides, which together with previously sequenced mitogenomes from Butomus and Spirodela, provide new evolutionary evidence of genome size reduction, gene loss and transfer to the nucleus. The Zostera mitogenome includes a large portion of DNA transferred from the plastome, yet it is the smallest known mitogenome from a non-parasitic plant. Using a broad sample of the Alismatales, the evolutionary history of ribosomal protein gene loss is analyzed. In Zostera almost all ribosomal protein genes are lost from the mitogenome, but only some can be found in the nucleus. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
47. Mechanisms of fast and stringent search in homologous pairing of double-stranded DNA.
- Author
-
Bitran, Amir, Chiang, Wei-Yin, Levine, Erel, and Prentiss, Mara
- Subjects
- *
DOUBLE-stranded RNA , *BACTERIAL DNA , *MOLECULES , *ESCHERICHIA coli DNA , *CHROMOSOMES - Abstract
Self-organization in the cell relies on the rapid and specific binding of molecules to their cognate targets. Correct bindings must be stable enough to promote the desired function even in the crowded and fluctuating cellular environment. In systems with many nearly matched targets, rapid and stringent formation of stable products is challenging. Mechanisms that overcome this challenge have been previously proposed, including separating the process into multiple stages; however, how particular in vivo systems overcome the challenge remains unclear. Here we consider a kinetic system, inspired by homology dependent pairing between double stranded DNA in bacteria. By considering a simplified tractable model, we identify different homology testing stages that naturally occur in the system. In particular, we first model dsDNA molecules as short rigid rods containing periodically spaced binding sites. The interaction begins when the centers of two rods collide at a random angle. For most collision angles, the interaction energy is weak because only a few binding sites near the collision point contribute significantly to the binding energy. We show that most incorrect pairings are rapidly rejected at this stage. In rare cases, the two rods enter a second stage by rotating into parallel alignment. While rotation increases the stability of matched and nearly matched pairings, subsequent rotational fluctuations reduce kinetic trapping. Finally, in vivo chromosome are much longer than the persistence length of dsDNA, so we extended the model to include multiple parallel collisions between long dsDNA molecules, and find that those additional interactions can greatly accelerate the searching. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
48. Discovery of a new family of relaxases in Firmicutes bacteria.
- Author
-
Ramachandran, Gayetri, Miguel-Arribas, Andrés, Abia, David, Singh, Praveen K., Crespo, Isidro, Gago-Córdoba, César, Hao, Jian An, Luque-Ortega, Juan Roman, Alfonso, Carlos, Wu, Ling J., Boer, D. Roeland, and Meijer, Wilfried J. J.
- Subjects
- *
DRUG resistance in microorganisms , *PATHOGENIC bacteria , *HORIZONTAL gene transfer , *GUT microbiome , *BACILLUS subtilis - Abstract
Antibiotic resistance is a serious global problem. Antibiotic resistance genes (ARG), which are widespread in environmental bacteria, can be transferred to pathogenic bacteria via horizontal gene transfer (HGT). Gut microbiomes are especially apt for the emergence and dissemination of ARG. Conjugation is the HGT route that is predominantly responsible for the spread of ARG. Little is known about conjugative elements of Gram-positive bacteria, including those of the phylum Firmicutes, which are abundantly present in gut microbiomes. A critical step in the conjugation process is the relaxase-mediated site- and strand-specific nick in the oriT region of the conjugative element. This generates a single-stranded DNA molecule that is transferred from the donor to the recipient cell via a connecting channel. Here we identified and characterized the relaxosome components oriT and the relaxase of the conjugative plasmid pLS20 of the Firmicute Bacillus subtilis. We show that the relaxase gene, named relLS20, is essential for conjugation, that it can function in trans and provide evidence that Tyr26 constitutes the active site residue. In vivo and in vitro analyses revealed that the oriT is located far upstream of the relaxase gene and that the nick site within oriT is located on the template strand of the conjugation genes. Surprisingly, the RelLS20 shows very limited similarity to known relaxases. However, more than 800 genes to which no function had been attributed so far are predicted to encode proteins showing significant similarity to RelLS20. Interestingly, these putative relaxases are encoded almost exclusively in Firmicutes bacteria. Thus, RelLS20 constitutes the prototype of a new family of relaxases. The identification of this novel relaxase family will have an important impact in different aspects of future research in the field of HGT in Gram-positive bacteria in general, and specifically in the phylum of Firmicutes, and in gut microbiome research. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
49. An Atlas of Peroxiredoxins Created Using an Active Site Profile-Based Approach to Functionally Relevant Clustering of Proteins.
- Author
-
Harper, Angela F., Leuthaeuser, Janelle B., Babbitt, Patricia C., Morris, John H., Ferrin, Thomas E., Poole, Leslie B., and Fetrow, Jacquelyn S.
- Subjects
- *
PEROXIREDOXINS , *PROTEINS , *ANTIOXIDANTS , *ENZYMES , *CELLULAR signal transduction - Abstract
Peroxiredoxins (Prxs or Prdxs) are a large protein superfamily of antioxidant enzymes that rapidly detoxify damaging peroxides and/or affect signal transduction and, thus, have roles in proliferation, differentiation, and apoptosis. Prx superfamily members are widespread across phylogeny and multiple methods have been developed to classify them. Here we present an updated atlas of the Prx superfamily identified using a novel method called MISST (Multi-level Iterative Sequence Searching Technique). MISST is an iterative search process developed to be both agglomerative, to add sequences containing similar functional site features, and divisive, to split groups when functional site features suggest distinct functionally-relevant clusters. Superfamily members need not be identified initially—MISST begins with a minimal representative set of known structures and searches GenBank iteratively. Further, the method’s novelty lies in the manner in which isofunctional groups are selected; rather than use a single or shifting threshold to identify clusters, the groups are deemed isofunctional when they pass a self-identification criterion, such that the group identifies itself and nothing else in a search of GenBank. The method was preliminarily validated on the Prxs, as the Prxs presented challenges of both agglomeration and division. For example, previous sequence analysis clustered the Prx functional families Prx1 and Prx6 into one group. Subsequent expert analysis clearly identified Prx6 as a distinct functionally relevant group. The MISST process distinguishes these two closely related, though functionally distinct, families. Through MISST search iterations, over 38,000 Prx sequences were identified, which the method divided into six isofunctional clusters, consistent with previous expert analysis. The results represent the most complete computational functional analysis of proteins comprising the Prx superfamily. The feasibility of this novel method is demonstrated by the Prx superfamily results, laying the foundation for potential functionally relevant clustering of the universe of protein sequences. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
50. KrillDB: A de novo transcriptome database for the Antarctic krill (Euphausia superba).
- Author
-
Sales, Gabriele, Deagle, Bruce E., Calura, Enrica, Martini, Paolo, Biscontin, Alberto, De Pittà, Cristiano, Kawaguchi, So, Romualdi, Chiara, Meyer, Bettina, Costa, Rodolfo, and Jarman, Simon
- Subjects
- *
EUPHAUSIA superba , *BIOMASS , *CATASTROPHIC illness , *GENETIC databases , *BIOINFORMATICS - Abstract
Antarctic krill (Euphausia superba) is a key species in the Southern Ocean with an estimated biomass between 100 and 500 million tonnes. Changes in krill population viability would have catastrophic effect on the Antarctic ecosystem. One looming threat due to elevated levels of anthropogenic atmospheric carbon dioxide (CO2) is ocean acidification (lowering of sea water pH by CO2 dissolving into the oceans). The genetics of Antarctic krill has long been of scientific interest for both for the analysis of population structure and analysis of functional genetics. However, the genetic resources available for the species are relatively modest. We have developed the most advanced genetic database on Euphausia superba, KrillDB, which includes comprehensive data sets of former and present transcriptome projects. In particular, we have built a de novo transcriptome assembly using more than 360 million Illumina sequence reads generated from larval krill including individuals subjected to different CO2 levels. The database gives access to: 1) the full list of assembled genes and transcripts; 2) their level of similarity to transcripts and proteins from other species; 3) the predicted protein domains contained within each transcript; 4) their predicted GO terms; 5) the level of expression of each transcript in the different larval stages and CO2 treatments. All references to external entities (sequences, domains, GO terms) are equipped with a link to the appropriate source database. Moreover, the software implements a full-text search engine that makes it possible to submit free-form queries. KrillDB represents the first large-scale attempt at classifying and annotating the full krill transcriptome. For this reason, we believe it will constitute a cornerstone of future approaches devoted to physiological and molecular study of this key species in the Southern Ocean food web. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.