104 results on '"Jason R Miller"'
Search Results
2. Machine learning on alignment features for parent-of-origin classification of simulated hybrid RNA-seq
- Author
-
Jason R. Miller and Donald A. Adjeroh
- Subjects
Machine learning ,RNA-seq ,Allele-specific expression ,Sequence alignment ,Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Parent-of-origin allele-specific gene expression (ASE) can be detected in interspecies hybrids by virtue of RNA sequence variants between the parental haplotypes. ASE is detectable by differential expression analysis (DEA) applied to the counts of RNA-seq read pairs aligned to parental references, but aligners do not always choose the correct parental reference. Results We used public data for species that are known to hybridize. We measured our ability to assign RNA-seq read pairs to their proper transcriptome or genome references. We tested software packages that assign each read pair to a reference position and found that they often favored the incorrect species reference. To address this problem, we introduce a post process that extracts alignment features and trains a random forest classifier to choose the better alignment. On each simulated hybrid dataset tested, our machine-learning post-processor achieved higher accuracy than the aligner by itself at choosing the correct parent-of-origin per RNA-seq read pair. Conclusions For the parent-of-origin classification of RNA-seq, machine learning can improve the accuracy of alignment-based methods. This approach could be useful for enhancing ASE detection in interspecies hybrids, though RNA-seq from real hybrids may present challenges not captured by our simulations. We believe this is the first application of machine learning to this problem domain.
- Published
- 2024
- Full Text
- View/download PDF
3. Predicting the functional effect of amino acid substitutions and indels.
- Author
-
Yongwook Choi, Gregory E Sims, Sean Murphy, Jason R Miller, and Agnes P Chan
- Subjects
Medicine ,Science - Abstract
As next-generation sequencing projects generate massive genome-wide sequence variation data, bioinformatics tools are being developed to provide computational predictions on the functional effects of sequence variations and narrow down the search of casual variants for disease phenotypes. Different classes of sequence variations at the nucleotide level are involved in human diseases, including substitutions, insertions, deletions, frameshifts, and non-sense mutations. Frameshifts and non-sense mutations are likely to cause a negative effect on protein function. Existing prediction tools primarily focus on studying the deleterious effects of single amino acid substitutions through examining amino acid conservation at the position of interest among related sequences, an approach that is not directly applicable to insertions or deletions. Here, we introduce a versatile alignment-based score as a new metric to predict the damaging effects of variations not limited to single amino acid substitutions but also in-frame insertions, deletions, and multiple amino acid substitutions. This alignment-based score measures the change in sequence similarity of a query sequence to a protein sequence homolog before and after the introduction of an amino acid variation to the query sequence. Our results showed that the scoring scheme performs well in separating disease-associated variants (n = 21,662) from common polymorphisms (n = 37,022) for UniProt human protein variations, and also in separating deleterious variants (n = 15,179) from neutral variants (n = 17,891) for UniProt non-human protein variations. In our approach, the area under the receiver operating characteristic curve (AUC) for the human and non-human protein variation datasets is ∼0.85. We also observed that the alignment-based score correlates with the deleteriousness of a sequence variation. In summary, we have developed a new algorithm, PROVEAN (Protein Variation Effect Analyzer), which provides a generalized approach to predict the functional effects of protein sequence variations including single or multiple amino acid substitutions, and in-frame insertions and deletions. The PROVEAN tool is available online at http://provean.jcvi.org.
- Published
- 2012
- Full Text
- View/download PDF
4. New assembly, reannotation and analysis of the Entamoeba histolytica genome reveal new genomic features and protein content information.
- Author
-
Hernan A Lorenzi, Daniela Puiu, Jason R Miller, Lauren M Brinkac, Paolo Amedeo, Neil Hall, and Elisabet V Caler
- Subjects
Arctic medicine. Tropical medicine ,RC955-962 ,Public aspects of medicine ,RA1-1270 - Abstract
In order to maintain genome information accurately and relevantly, original genome annotations need to be updated and evaluated regularly. Manual reannotation of genomes is important as it can significantly reduce the propagation of errors and consequently diminishes the time spent on mistaken research. For this reason, after five years from the initial submission of the Entamoeba histolytica draft genome publication, we have re-examined the original 23 Mb assembly and the annotation of the predicted genes.The evaluation of the genomic sequence led to the identification of more than one hundred artifactual tandem duplications that were eliminated by re-assembling the genome. The reannotation was done using a combination of manual and automated genome analysis. The new 20 Mb assembly contains 1,496 scaffolds and 8,201 predicted genes, of which 60% are identical to the initial annotation and the remaining 40% underwent structural changes. Functional classification of 60% of the genes was modified based on recent sequence comparisons and new experimental data. We have assigned putative function to 3,788 proteins (46% of the predicted proteome) based on the annotation of predicted gene families, and have identified 58 protein families of five or more members that share no homology with known proteins and thus could be entamoeba specific. Genome analysis also revealed new features such as the presence of segmental duplications of up to 16 kb flanked by inverted repeats, and the tight association of some gene families with transposable elements.This new genome annotation and analysis represents a more refined and accurate blueprint of the pathogen genome, and provides an upgraded tool as reference for the study of many important aspects of E. histolytica biology, such as genome evolution and pathogenesis.
- Published
- 2010
- Full Text
- View/download PDF
5. Multi-symptom Relief with Propylene Glycol-Hydroxypropyl-Guar Nanoemulsion Lubricant Eye Drops in Subjects with Dry Eye Disease: A Post-Marketing Prospective Study
- Author
-
Katherine Bickle, Jason R. Miller, Joseph Tauber, and Deborah Awisi-Gyau
- Subjects
Artificial tears ,Dry eye symptoms ,Multi-symptom relief ,Tired ,Watery ,Ophthalmology ,RE1-994 - Abstract
Abstract Introduction The study aimed to evaluate multi-symptom relief of dry eye manifestations with the use of propylene glycol-hydroxypropyl-guar (PG-HPG) nanoemulsion lubricant eye drops, among subjects with dry eye disease (DED). Methods This was a post-marketing, prospective, single-arm study conducted in the USA. Subjects aged ≥ 18 years, with tear breakup time (TBUT) ≤ 10 s for both eyes, dry eye questionnaire-5 (DEQ-5) “watery eyes” symptom score 1–4, symptoms of burning/stinging, sore and tired eyes as determined by impact of dry eye on everyday living—symptom bother (IDEEL-SB) questionnaire, and IDEEL-SB score 16–65 were included. Subjects were required to complete IDEEL-SB and DEQ-5 at days 0, 14 ± 2, and 28 ± 2, and self-administer one drop of PG-HPG four times daily for 28 ± 2 days. Primary endpoints were change from baseline at day 28 in symptoms of sore, stinging/burning, and tired eyes on IDEEL-SB; and symptom of watery eyes on DEQ-5. Other endpoints evaluated were corneal staining and TBUT at baseline and day 28 ± 2; symptom relief (5-point Likert scale) at day 28 ± 2, and safety. Results Of 119 subjects enrolled, 95 completed the study (mean ± SD age 61.2 ± 13.0 years; female 69.5%). Mean IDEEL-SB scores reduced significantly from baseline at day 28 for symptoms of aching/sore eyes (change from baseline − 1.0 ± 1.1), burning/stinging eyes (change from baseline − 1.1 ± 0.9), and tired eyes (change from baseline − 1.1 ± 1.0) (all p
- Published
- 2023
- Full Text
- View/download PDF
6. Approach to the Ankle in Adult Acquired Flatfoot Deformity
- Author
-
Mark J. Capuzzi, Jason R. Miller, and Tymoteusz Siwy
- Subjects
Orthopedics and Sports Medicine ,Surgery - Published
- 2023
- Full Text
- View/download PDF
7. A host subtraction database for virus discovery in human cell line sequencing data [version 3; peer review: 2 approved]
- Author
-
Jason R. Miller, Kari A. Dilley, Derek M. Harkins, Timothy B. Stockwell, Reed S. Shabman, and Granger G. Sutton
- Subjects
Method Article ,Articles ,RNAseq ,human cell lines ,HepG2 ,HuH-7 ,Jurkat ,host subtraction - Abstract
The human cell lines HepG2, HuH-7, and Jurkat are commonly used for amplification of the RNA viruses present in environmental samples. To assist with assays by RNAseq, we sequenced these cell lines and developed a subtraction database that contains sequences expected in sequence data from uninfected cells. RNAseq data from cell lines infected with Sendai virus were analyzed to test host subtraction. The process of mapping RNAseq reads to our subtraction database vastly reduced the number non-viral reads in the dataset to allow for efficient secondary analyses.
- Published
- 2019
- Full Text
- View/download PDF
8. A host subtraction database for virus discovery in human cell line sequencing data [version 2; referees: 1 approved, 1 approved with reservations]
- Author
-
Jason R. Miller, Kari A. Dilley, Derek M. Harkins, Timothy B. Stockwell, Reed S. Shabman, and Granger G. Sutton
- Subjects
Method Article ,Articles ,RNAseq ,human cell lines ,HepG2 ,HuH-7 ,Jurkat ,host subtraction - Abstract
The human cell lines HepG2, HuH-7, and Jurkat are commonly used for amplification of the RNA viruses present in environmental samples. To assist with assays by RNAseq, we sequenced these cell lines and developed a subtraction database that contains sequences expected in sequence data from uninfected cells. RNAseq data from cell lines infected with Sendai virus were analyzed to test host subtraction. The process of mapping RNAseq reads to our subtraction database vastly reduced the number non-viral reads in the dataset to allow for efficient secondary analyses.
- Published
- 2018
- Full Text
- View/download PDF
9. A draft genome sequence for the Ixodes scapularis cell line, ISE6 [version 1; referees: 2 approved]
- Author
-
Jason R. Miller, Sergey Koren, Kari A. Dilley, Derek M. Harkins, Timothy B. Stockwell, Reed S. Shabman, and Granger G. Sutton
- Subjects
Research Article ,Articles ,tick ,genome ,cell line ,ISE6 ,Ixodes scapularis - Abstract
Background: The tick cell line ISE6, derived from Ixodes scapularis, is commonly used for amplification and detection of arboviruses in environmental or clinical samples. Methods: To assist with sequence-based assays, we sequenced the ISE6 genome with single-molecule, long-read technology. Results: The draft assembly appears near complete based on gene content analysis, though it appears to lack some instances of repeats in this highly repetitive genome. The assembly appears to have separated the haplotypes at many loci. DNA short read pairs, used for validation only, mapped to the cell line assembly at a higher rate than they mapped to the Ixodes scapularis reference genome sequence. Conclusions: The assembly could be useful for filtering host genome sequence from sequence data obtained from cells infected with pathogens.
- Published
- 2018
- Full Text
- View/download PDF
10. Structural evidence for MADS-box type I family expansion seen in new assemblies of A. arenosa and A. lyrata
- Author
-
Jonathan Bramsiepe, Anders K. Krabberod, Katrine N. Bjerkan, Renate M. Alling, Ida M. Johannessen, Karina S. Hornslien, Jason R. Miller, Anne K. Brysting, and Paul E. Grini
- Abstract
Arabidopsis thaliana diverged from A. arenosa and A. lyrata at least 6 million years ago and are identified by genome-wide polymorphisms or morphological traits. The species are to a high degree reproductively isolated, but hybridization barriers are incomplete. A special type of hybridization barrier is based in the triploid endosperm of the seed, where embryo lethality is caused by endosperm failure to support the developing embryo. The MADS-box type I family of transcription factors are specifically expressed in the endosperm and has been proposed to play a role in endosperm-based hybridization barriers. The gene family is well known for a high evolutionary duplication rate, as well as being regulated by genomic imprinting. Here we address MADS-box type I gene family evolution and the role of type I genes in the context of hybridization. Using two de-novo assembled and annotated chromosome-level genomes of A. arenosa and A. lyrata ssp. petraea we analyzed the MADS-box type I gene family in Arabidopsis to predict orthologs, copy number and structural genomic variation related to the type I loci. Our findings were compared to gene expression profiles sampled before and after the transition to endosperm cellularization in order to investigate the involvement of MADS-box type I loci in endosperm-based hybridization barriers. We observed substantial differences in type-I expression between A. arenosa and A. lyrata ssp. petraea in the endosperm, suggesting a genetic cause for the endosperm-based hybridization barrier in A. arenosa and A. lyrata ssp. petraea hybrid seeds.
- Published
- 2023
- Full Text
- View/download PDF
11. The arabidopsis information portal: an application platform for data discovery.
- Author
-
Matthew R. Hanlon, Matthew Vaughn, Stephen A. Mock, Rion Dooley, Walter Moreira, Joe Stubbs, Christopher Town 0002, Jason R. Miller, Vivek Krishnakumar, Erik S. Ferlanti, and Eleanor Pence
- Published
- 2014
- Full Text
- View/download PDF
12. Spatial and temporal regulation of parent-of-origin allelic expression in the endosperm
- Author
-
Yuri S van Ekelenburg, Karina S Hornslien, Tom Van Hautegem, Matyáš Fendrych, Gert Van Isterdael, Katrine N Bjerkan, Jason R Miller, Moritz K Nowack, and Paul E Grini
- Subjects
Physiology ,fungi ,Genetics ,food and beverages ,Biology and Life Sciences ,Plant Science - Abstract
Genomic imprinting promotes differential expression of parental alleles in the endosperm of flowering plants and is regulated by epigenetic modification such as DNA methylation and histone tail modifications in chromatin. After fertilization, the endosperm develops through a syncytial stage before it cellularizes and becomes a nutrient source for the growing embryo. Regional compartmentalization has been shown both in early and late endosperm development, and different transcriptional domains suggest divergent spatial and temporal regional functions. The analysis of the role of parent-of-origin allelic expression in the endosperm as a whole and the investigation of domain-specific functions have been hampered by the inaccessibility of the tissue for high-throughput transcriptome analyses and contamination from surrounding tissue. Here, we used fluorescence-activated nuclear sorting (FANS) of nuclear targeted GFP fluorescent genetic markers to capture parental-specific allelic expression from different developmental stages and specific endosperm domains. This approach allowed us to successfully identify differential genomic imprinting with temporal and spatial resolution. We used a systematic approach to report temporal regulation of imprinted genes in the endosperm, as well as region-specific imprinting in endosperm domains. Analysis of our data identified loci that are spatially differentially imprinted in one domain of the endosperm, while biparentally expressed in other domains. These findings suggest that the regulation of genomic imprinting is dynamic and challenge the canonical mechanisms for genomic imprinting.
- Published
- 2023
13. A host subtraction database for virus discovery in human cell line sequencing data [version 1; referees: 1 approved with reservations]
- Author
-
Jason R. Miller, Kari A. Dilley, Derek M. Harkins, Timothy B. Stockwell, Reed S. Shabman, and Granger G. Sutton
- Subjects
Method Article ,Articles ,RNAseq ,human cell lines ,HepG2 ,HuH-7 ,Jurkat ,host subtraction - Abstract
The human cell lines HepG2, HuH-7, and Jurkat are commonly used for amplification of the RNA viruses present in environmental samples. To assist with assays by RNAseq, we sequenced these cell lines and developed a subtraction database that contains sequences expected in sequence data from uninfected cells. RNAseq data from cell lines infected with Sendai virus were analyzed to test host subtraction. The process of mapping RNAseq reads to our subtraction database vastly reduced the number non-viral reads in the dataset to allow for efficient secondary analyses.
- Published
- 2018
- Full Text
- View/download PDF
14. Araport: the Arabidopsis Information Portal.
- Author
-
Vivek Krishnakumar, Matthew R. Hanlon, Sergio Contrino, Erik S. Ferlanti, Svetlana Karamycheva, Maria Kim, Benjamin D. Rosen, Chia-Yi Cheng, Walter Moreira, Stephen A. Mock, Joseph Stubbs, Julie M. Sullivan, Konstantinos Krampis, Jason R. Miller, Gos Micklem, Matthew Vaughn, and Christopher D. Town
- Published
- 2015
- Full Text
- View/download PDF
15. Araport: an application platform for data discovery.
- Author
-
Matthew R. Hanlon, Matthew Vaughn, Stephen A. Mock, Rion Dooley, Walter Moreira, Joe Stubbs, Christopher Town 0002, Jason R. Miller, Vivek Krishnakumar, Erik S. Ferlanti, and Eleanor Pence
- Published
- 2015
- Full Text
- View/download PDF
16. Initial genome sequencing of the sugarcane CP 96-1252 complex hybrid [version 1; referees: 2 approved]
- Author
-
Jason R. Miller, Kari A. Dilley, Derek M. Harkins, Manolito G. Torralba, Kelvin J. Moncera, Karen Beeri, Karrie Goglin, Timothy B. Stockwell, Granger G. Sutton, and Reed S. Shabman
- Subjects
Data Note ,Articles ,Agriculture & Biotechnology ,Plant Genetics & Gene Expression ,Sugarcane genome ,DNA sequencing ,sequencing reads - Abstract
The CP 96-1252 cultivar of sugarcane is a complex hybrid of commercial importance. DNA was extracted from lab-grown leaf tissue and sequenced. The raw Illumina DNA sequencing results provide 101 Gbp of genome sequence reads. The dataset is available from https://www.ncbi.nlm.nih.gov/bioproject/PRJNA345486/.
- Published
- 2017
- Full Text
- View/download PDF
17. Visualization challenges for a new cyberpharmaceutical computing paradigm.
- Author
-
Russell J. Turner, Kabir Chaturvedi, Nathan Edwards, Daniel P. Fasulo, Aaron L. Halpern, Daniel H. Huson, Oliver Kohlbacher, Jason R. Miller, Knut Reinert, Karin A. Remington, Russell Schwartz, Brian Walenz, Shibu Yooseph, and Sorin Istrail
- Published
- 2001
- Full Text
- View/download PDF
18. Consensus generation and variant detection by Celera Assembler.
- Author
-
Gennady Denisov, Brian Walenz, Aaron L. Halpern, Jason R. Miller, Nelson Axelrod, Samuel Levy, and Granger G. Sutton
- Published
- 2008
- Full Text
- View/download PDF
19. Aggressive assembly of pyrosequencing reads with mates.
- Author
-
Jason R. Miller, Arthur L. Delcher, Sergey Koren, Eli Venter, Brian Walenz, Anushka Brownley, Justin Johnson 0002, Kelvin Li, Clark M. Mobarry, and Granger G. Sutton
- Published
- 2008
- Full Text
- View/download PDF
20. ThurGood: Evaluating Assembly-to-Assembly Mapping.
- Author
-
Hagit Shatkay, Jason R. Miller, Clark M. Mobarry, Michael Flanigan, Shibu Yooseph, and Granger G. Sutton
- Published
- 2004
- Full Text
- View/download PDF
21. Regulation of Parent-of-Origin Allelic Expression in the Endosperm
- Author
-
Karina S. Hornslien, Paul E. Grini, and Jason R. Miller
- Subjects
Epigenomics ,0106 biological sciences ,Physiology ,Arabidopsis ,Plant Science ,Biology ,01 natural sciences ,Genomic Imprinting ,Magnoliopsida ,Gene Expression Regulation, Plant ,Histone methylation ,Genetics ,Epigenetics ,Imprinting (psychology) ,Gene ,Alleles ,Regulation of gene expression ,Arabidopsis Proteins ,Computational Biology ,DNA Methylation ,Endosperm ,Seeds ,DNA methylation ,Genomic imprinting ,Research Article ,Signal Transduction ,010606 plant biology & botany - Abstract
Genomic imprinting is an epigenetic phenomenon established in the gametes prior to fertilization that causes differential expression of parental alleles, mainly in the endosperm of flowering plants. The overlap between previously identified panels of imprinted genes is limited. To investigate imprinting, we used high-resolution sequencing data acquired with sequence-capture technology. We present a bioinformatics pipeline to assay parent-of-origin allele-specific expression and report more than 300 loci with parental expression bias in Arabidopsis (Arabidopsis thaliana). In most cases, the level of expression from maternal and paternal alleles was not binary, instead supporting a differential dosage hypothesis for the evolution of imprinting in plants. To address imprinting regulation, we systematically employed mutations in regulative epigenetic pathways suggested to be major players in the process. We established the mechanistic mode of imprinting for more than 50 loci regulated by DNA methylation and Polycomb-dependent histone methylation. However, the imprinting patterns of most genes were not affected by these mechanisms. To this end, we also demonstrated that the RNA-directed DNA methylation pathway alone does not substantially influence imprinting patterns, suggesting that more complex epigenetic pathways regulate most of the identified imprinted genes.
- Published
- 2019
- Full Text
- View/download PDF
22. Permanent Polymethyl Methacrylate Antibiotic Spacer for Definitive Management of Failed Total Ankle Replacements
- Author
-
Benjamin L. Marder and Jason R. Miller
- Subjects
medicine.medical_specialty ,Debridement ,Polymethyl methacrylate ,medicine.drug_class ,business.industry ,medicine.medical_treatment ,Antibiotics ,Ankle replacement ,Total Ankle Replacements ,Soft tissue ,Surgery ,medicine.anatomical_structure ,medicine ,Implant ,Ankle ,business - Abstract
Advances in implant design, patient familiarity, and surgeon experience have led to a large cohort of patients undergoing total ankle replacement (TAR). Unfortunately, early or late failure can result from soft tissue-related wound problems, deep infection, or technical error in implantation. When a TAR becomes infected, treatment remains controversial with no level 1 evidence to support management. Individual patient characteristics are important when determining the treatment approach to the infected TAR as soft tissue coverage, patient comorbidities, and the ultimate surgical goal of the patient need to be taken into account. Salvage of an infected TAR will consist of eradication of infection, obtaining soft tissue coverage and maintaining function of the limb. Treatment strategies described in the literature consist of irrigation and debridement with or without polyethylene exchange, one-stage revision arthroplasty, and two-stage revision arthroplasty with antibiotic cement spacer. Permanent antibiotic cement spacer has been infrequently reported in the literature; however it does have a significant role in the treatment approach to the failed TAR. There are instances where patient morbidity, significant bone loss, or prolonged recovery severely limit a patient’s appetite to undergo further attempts at implant salvage. In cases where ankle fusion or revision arthroplasty is not within the goals of the patient or surgeon, permanent polymethyl methacrylate (PMMA) antibiotic-impregnated spacer can assist the patient in ultimate limb retention following the devastating complications associated with an infected TAR.
- Published
- 2021
- Full Text
- View/download PDF
23. Exploring Neural Network Models for LncRNA Sequence Identification
- Author
-
Jason R. Miller and Donald A. Adjeroh
- Subjects
0303 health sciences ,Artificial neural network ,business.industry ,Computer science ,Perceptron ,Machine learning ,computer.software_genre ,Data modeling ,Network planning and design ,03 medical and health sciences ,Identification (information) ,ComputingMethodologies_PATTERNRECOGNITION ,0302 clinical medicine ,Recurrent neural network ,Encoding (memory) ,Artificial intelligence ,business ,computer ,030217 neurology & neurosurgery ,Selection (genetic algorithm) ,030304 developmental biology - Abstract
Distinguishing long non-coding RNA from protein-coding RNA is important to molecular and cellular biology. The problem can be addressed with machine learning in general and with artificial neural networks in particular. We explore the effects of various network design choices on the accuracy of human LncRNA identification. Perceptron-based neural network models were found to be almost as accurate as more complex recurrent neural networks, and K-mer representations of the data seemed to assist both. Size selection of training data affected results. These explorations could assist in neural network design for RNA analysis.
- Published
- 2020
- Full Text
- View/download PDF
24. Novel isoforms of influenza virus PA-X and PB1-F2 indicated by automatic annotation
- Author
-
Margaret Rush, Andrew J. Burnham, Gillian M. Air, Stephan Bour, Emily A. Billings, Jason R. Miller, and Indresh Singh
- Subjects
Gene isoform ,Cancer Research ,Base Sequence ,Nonsense mutation ,virus diseases ,Computational biology ,Gene Annotation ,Biology ,medicine.disease_cause ,Genome ,Stop codon ,Viral Proteins ,Infectious Diseases ,Influenza A virus ,Virology ,GenBank ,Influenza, Human ,medicine ,Humans ,Protein Isoforms ,Gene - Abstract
The influenza A virus genome contains 8 gene segments encoding 10 commonly recognized proteins. Additional protein products have been identified, including PB1-F2 and PA-X. We report the in-silico identification of novel isoforms of PB1-F2 and PA-X in influenza virus genomes sequenced from avian samples. The isoform observed in PA-X includes a mutated stop codon that should extend the protein product by 8 amino acids. The isoform observed in PB1-F2 includes two nonsense mutations that should truncate the N-terminal region of the protein product and remove the entire mitochondrial targeting domain. Both isoforms were uncovered during automatic annotation of CEIRS sequence data. Nominally termed PA-X8 and PB1-F2-Cterm, both predicted isoforms were subsequently found in other annotated influenza genomes previously deposited in GenBank. Both isoforms were noticed due to discrepant annotations output by two annotation engines, indicating a benefit of incorporating multiple algorithms during gene annotation.
- Published
- 2020
25. Substrate Spatial Complexity Analysis for the Prediction of Ventricular Arrhythmias in Patients with Ischemic Cardiomyopathy
- Author
-
Jason R. Miller, Mauro Maggioni, Jonathan Chrispin, Adityo Prakosa, David R. Okada, Natalia A. Trayanova, Steven J.M. Jones, and Katherine C. Wu
- Subjects
Gadolinium DTPA ,Male ,medicine.medical_specialty ,Cardiomyopathy ,Myocardial Ischemia ,Action Potentials ,Contrast Media ,Risk Assessment ,Article ,Ventricular Function, Left ,Sudden cardiac death ,Machine Learning ,Imaging, Three-Dimensional ,Heart Rate ,Predictive Value of Tests ,Risk Factors ,Physiology (medical) ,Internal medicine ,medicine ,Humans ,In patient ,Diagnosis, Computer-Assisted ,Registries ,Aged ,Retrospective Studies ,Ischemic cardiomyopathy ,Spatial complexity ,medicine.diagnostic_test ,Fourier Analysis ,business.industry ,Magnetic resonance imaging ,Arrhythmias, Cardiac ,Stroke Volume ,Middle Aged ,medicine.disease ,Prognosis ,Magnetic Resonance Imaging ,United States ,Death, Sudden, Cardiac ,Cardiology ,Female ,Cardiology and Cardiovascular Medicine ,business ,Cardiomyopathies - Abstract
Background: Transition zones between healthy myocardium and scar form a spatially complex substrate that may give rise to reentrant ventricular arrhythmias (VAs). We sought to assess the utility of a novel machine learning approach for quantifying 3-dimensional spatial complexity of grayscale patterns on late gadolinium enhanced cardiac magnetic resonance images to predict VAs in patients with ischemic cardiomyopathy. Methods: One hundred twenty-two consecutive ischemic cardiomyopathy patients with left ventricular ejection fraction ≤35% without prior history of VAs underwent late gadolinium enhanced cardiac magnetic resonance images. From raw grayscale data, we generated graphs encoding the 3-dimensional geometry of the left ventricle. A novel technique, adapted to these graphs, assessed global regularity of signal intensity patterns using Fourier-like analysis and generated a substrate spatial complexity profile for each patient. A machine learning statistical algorithm was employed to discern which substrate spatial complexity profiles correlated with VA events (appropriate implantable cardioverter-defibrillator firings and arrhythmic sudden cardiac death) at 5 years of follow-up. From the statistical machine learning results, a complexity score ranging from 0 to 1 was calculated for each patient and tested using multivariable Cox regression models. Results: At 5 years of follow-up, 40 patients had VA events. The machine learning algorithm classified with 81% overall accuracy and correctly classified 86% of those without VAs. Overall negative predictive value was 91%. Average complexity score was significantly higher in patients with VA events versus those without (0.5±0.5 versus 0.1±0.2; P P =0.002). Conclusions: Substrate spatial complexity analysis of late gadolinium enhanced cardiac magnetic resonance images may be helpful in refining VA risk in patients with ischemic cardiomyopathy, particularly to identify low-risk patients who may not benefit from prophylactic implantable cardioverter-defibrillator therapy. Visual Overview: A visual overview is available for this article.
- Published
- 2020
26. The Ring Verruca Plantaris in Cantharidin Use
- Author
-
Jason R. Miller and Christopher R. Hood
- Subjects
Adult ,Male ,medicine.medical_specialty ,Indoles ,Population ,Foot Diseases ,Young Adult ,030207 dermatology & venereal diseases ,03 medical and health sciences ,chemistry.chemical_compound ,0302 clinical medicine ,Plantar warts ,Biopsy ,medicine ,Anticarcinogenic Agents ,Humans ,030212 general & internal medicine ,Enzyme Inhibitors ,Human papillomavirus ,Child ,education ,Verruca Plantaris ,Cantharidin ,education.field_of_study ,medicine.diagnostic_test ,Foot ,business.industry ,virus diseases ,General Medicine ,medicine.disease ,Dermatology ,chemistry ,Female ,Warts ,Complication ,business - Abstract
Verrucae (warts) are the most common viral infections of the skin, affecting 7% to 10% of the general population. Typically caused by human papillomavirus type 1, plantar warts manifest as benign proliferation of the epithelial cells on the feet. It has been cited that up to one-third of nongenital warts become recalcitrant, and biopsy is often required to confirm diagnosis and direct appropriate treatment. These treatments can vary from various types of oral medications, acids, ablative modalities, and injections. In this article, we present a case of a recalcitrant plantar wart that appeared to circumferentially spread from the initial site after first-line treatment and presumed resolution with the product cantharidin. The development of ring warts is a known complication associated with cantharidin use, with little described rationale to the presentation.
- Published
- 2018
- Full Text
- View/download PDF
27. Radiographic evaluation of naviculo-cuneiform sag deformity following triplane bunion correction via first tarsometatarsal joint arthrodesis
- Author
-
Jason R. Miller, Mark J. Capuzzi, Tymoteusz Siwy, Alexandra Parish, and Marcus Duvall
- Published
- 2022
- Full Text
- View/download PDF
28. Diagnostic Value of Early Magnetic Resonance Imaging After Acute Lateral Ankle Injury
- Author
-
Lauren D. Reed, Louis J. Ciliberti, Stephanie Eldridge, Jason R. Miller, and Karl W. Dunn
- Subjects
Adult ,Male ,medicine.medical_specialty ,Syndesmosis ,Lateral ankle ,Delayed Diagnosis ,Ankle Fractures ,Sensitivity and Specificity ,Talus ,Young Adult ,03 medical and health sciences ,0302 clinical medicine ,Blunt ,Patient age ,Humans ,Medicine ,Orthopedics and Sports Medicine ,In patient ,Ankle Injuries ,Diagnostic Errors ,Retrospective Studies ,030222 orthopedics ,medicine.diagnostic_test ,business.industry ,Age Factors ,Magnetic resonance imaging ,Retrospective cohort study ,030229 sport sciences ,Middle Aged ,Prognosis ,Magnetic Resonance Imaging ,Surgery ,medicine.anatomical_structure ,Sprains and Strains ,Female ,Radiology ,business ,Ankle sprain ,Ankle Joint - Abstract
We report a retrospective study of 171 consecutive patients with a lateral ankle sprain. All the patients with direct or blunt force trauma were excluded. Within 21 days of injury, 115 (67.25%) patients had undergone magnetic resonance imaging to evaluate for more serious or significant injuries. The average patient age was 44.09 years. Of the 115 patients, 75 (65.23%) had findings noted to be "significant." MRI can serve as a valuable and underused tool in the evaluation of acute lateral ankle injuries. The underuse of MRI might explain the high degree of variability in patients recovering from a lateral ankle sprain.
- Published
- 2017
- Full Text
- View/download PDF
29. ePlant: Visualizing and Exploring Multiple Levels of Data for Hypothesis Generation in Plant Biology
- Author
-
Lawrence A. Kelley, Erik S. Ferlanti, Nicholas J. Provart, Vivek Krishnakumar, Hans Yu, Christopher D. Town, Matthew Cumming, Michael J.E. Sternberg, Jim Fan, Wolfgang Stuerzlinger, Geoffrey Fucile, Jason R. Miller, Ruian Shi, Jamie Waese, Asher Pasha, and Biotechnology and Biological Sciences Research Council (BBSRC)
- Subjects
0301 basic medicine ,Biochemistry & Molecular Biology ,BROWSER ,Process (engineering) ,Systems biology ,Plant Biology & Botany ,Statistics as Topic ,0607 Plant Biology ,Plant Science ,Biology ,Bioinformatics ,computer.software_genre ,Data type ,Chromosomes, Plant ,In Brief ,Set (abstract data type) ,User-Computer Interface ,03 medical and health sciences ,Gene Expression Regulation, Plant ,REVEALS ,NETWORK ,Zoom ,0604 Genetics ,Science & Technology ,Base Sequence ,DATA SET ,Systems Biology ,Plant Sciences ,Botany ,0601 Biochemistry And Cell Biology ,Cell Biology ,Data science ,TRANSCRIPTOME ANALYSIS ,Visualization ,PROTEIN MICROARRAYS ,030104 developmental biology ,Workflow ,ARABIDOPSIS-THALIANA ,GENE-EXPRESSION MAP ,MODEL DATA-ANALYSIS ,Web service ,Life Sciences & Biomedicine ,computer ,Software ,RESPONSES ,Subcellular Fractions - Abstract
A big challenge in current systems biology research arises when different types of data must be accessed from separate sources and visualized using separate tools. The high cognitive load required to navigate such a workflow is detrimental to hypothesis generation. Accordingly, there is a need for a robust research platform that incorporates all data and provides integrated search, analysis, and visualization features through a single portal. Here, we present ePlant (http://bar.utoronto.ca/eplant), a visual analytic tool for exploring multiple levels of Arabidopsis thaliana data through a zoomable user interface. ePlant connects to several publicly available web services to download genome, proteome, interactome, transcriptome, and 3D molecular structure data for one or more genes or gene products of interest. Data are displayed with a set of visualization tools that are presented using a conceptual hierarchy from big to small, and many of the tools combine information from more than one data type. We describe the development of ePlant in this article and present several examples illustrating its integrative features for hypothesis generation. We also describe the process of deploying ePlant as an “app” on Araport. Building on readily available web services, the code for ePlant is freely available for any other biological species research.
- Published
- 2017
- Full Text
- View/download PDF
30. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation
- Author
-
Sergey Koren, Brian P. Walenz, Adam M. Phillippy, Nicholas H. Bergman, Konstantin Berlin, and Jason R. Miller
- Subjects
0301 basic medicine ,Source code ,media_common.quotation_subject ,030106 microbiology ,Method ,Sequence assembly ,Genomics ,MinHash ,Computational biology ,Biology ,Contig Mapping ,03 medical and health sciences ,Genetics ,Animals ,Humans ,Genetics (clinical) ,Repetitive Sequences, Nucleic Acid ,media_common ,Contig ,Sequence Analysis, DNA ,Drosophila melanogaster ,030104 developmental biology ,k-mer ,Eukaryotic chromosome fine structure ,Minion ,Graph (abstract data type) ,Nanopore sequencing ,Genome, Bacterial ,Software - Abstract
Long-read single-molecule sequencing has revolutionizedde novogenome assembly and enabled the automated reconstruction of reference-quality genomes. However, given the relatively high error rates of such technologies, efficient and accurate assembly of large repeats and closely related haplotypes remains challenging. We address these issues with Canu, a successor of Celera Assembler that is specifically designed for noisy single-molecule sequences. Canu introduces support for nanopore sequencing, halves depth-of-coverage requirements, and improves assembly continuity while simultaneously reducing runtime by an order of magnitude on large genomes versus Celera Assembler 8.2. These advances result from new overlapping and assembly algorithms, including an adaptive overlapping strategy based ontf-idfweighted MinHash and a sparse assembly graph construction that avoids collapsing diverged repeats and haplotypes. We demonstrate that Canu can reliably assemble complete microbial genomes and near-complete eukaryotic chromosomes using either PacBio or Oxford Nanopore technologies, and achieves a contig NG50 of greater than 21 Mbp on both human andDrosophila melanogasterPacBio datasets. For assembly structures that cannot be linearly represented, Canu provides graph-based assembly outputs in graphical fragment assembly (GFA) format for analysis or integration with complementary phasing and scaffolding techniques. The combination of such highly resolved assembly graphs with long-range scaffolding information promises the complete and automated assembly of complex genomes.
- Published
- 2017
- Full Text
- View/download PDF
31. Exploring structural variation and gene family architecture with De Novo assemblies of 15 Medicago genomes
- Author
-
Jason R. Miller, Kelly P. Steele, Nevin D. Young, Robert M. Stupar, Junqi Liu, Roxanne Denny, Thiruvarangan Ramaraj, Peng Zhou, Joseph Guhlin, Peter Tiffin, Joann Mudge, Kevin A. T. Silverstein, and Andrew Farmer
- Subjects
0301 basic medicine ,DNA Copy Number Variations ,lcsh:QH426-470 ,lcsh:Biotechnology ,Pairwise Genome Comparison ,Biology ,Leucine-Rich Repeat Proteins ,Ortholog Group ,Genome ,Dispensable Genome ,Nucleotide diversity ,Short InDels ,Structural variation ,03 medical and health sciences ,lcsh:TP248.13-248.65 ,Medicago truncatula ,Genetics ,Gene family ,Copy-number variation ,Gene ,Heat-Shock Proteins ,Plant Proteins ,2. Zero hunger ,Comparative Genomic Hybridization ,High-Throughput Nucleotide Sequencing ,Proteins ,food and beverages ,Sequence Analysis, DNA ,biology.organism_classification ,Syntenic Block ,lcsh:Genetics ,030104 developmental biology ,RNA, Plant ,Sequence Alignment ,Genome, Plant ,Reference genome ,Research Article ,Biotechnology - Abstract
Background Previous studies exploring sequence variation in the model legume, Medicago truncatula, relied on mapping short reads to a single reference. However, read-mapping approaches are inadequate to examine large, diverse gene families or to probe variation in repeat-rich or highly divergent genome regions. De novo sequencing and assembly of M. truncatula genomes enables near-comprehensive discovery of structural variants (SVs), analysis of rapidly evolving gene families, and ultimately, construction of a pan-genome. Results Genome-wide synteny based on 15 de novo M. truncatula assemblies effectively detected different types of SVs indicating that as much as 22% of the genome is involved in large structural changes, altogether affecting 28% of gene models. A total of 63 million base pairs (Mbp) of novel sequence was discovered, expanding the reference genome space for Medicago by 16%. Pan-genome analysis revealed that 42% (180 Mbp) of genomic sequences is missing in one or more accession, while examination of de novo annotated genes identified 67% (50,700) of all ortholog groups as dispensable – estimates comparable to recent studies in rice, maize and soybean. Rapidly evolving gene families typically associated with biotic interactions and stress response were found to be enriched in the accession-specific gene pool. The nucleotide-binding site leucine-rich repeat (NBS-LRR) family, in particular, harbors the highest level of nucleotide diversity, large effect single nucleotide change, protein diversity, and presence/absence variation. However, the leucine-rich repeat (LRR) and heat shock gene families are disproportionately affected by large effect single nucleotide changes and even higher levels of copy number variation. Conclusions Analysis of multiple M. truncatula genomes illustrates the value of de novo assemblies to discover and describe structural variation, something that is often under-estimated when using read-mapping approaches. Comparisons among the de novo assemblies also indicate that different large gene families differ in the architecture of their structural variation. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3654-1) contains supplementary material, which is available to authorized users.
- Published
- 2017
- Full Text
- View/download PDF
32. Structural Design and Optimization of Slotted Waveguide Antenna Stiffened Structures under Compressive Load
- Author
-
William Baron, Robert A. Canfield, Jason R. Miller, Woon Kyung Kim, and James Tuss
- Subjects
Core (optical fiber) ,Waveguide (electromagnetism) ,Materials science ,Acoustics ,Glass fiber ,Solid modeling ,Radio frequency ,Antenna (radio) ,Finite element method ,Slotted waveguide - Abstract
Structural design and optimization of slotted waveguide antenna stiffened structures (SWASS) are considered. Radio frequency (RF) performance was considered by modeling waveguide slots for an antenna imbedded in an aircraft panel subject to external loading. An equivalent two-dimensional plate model was shown to be adequate for simplifying the waveguide core as an equivalent plate layer; however, smearing the stiffness effect of the slots was shown to be inaccurate. For higher fidelity, plate finite elements were used in a three-dimensional model with slots to evaluate the mechanical failure. This modeling technique proved cost-effective, employing few elements compared with three-dimensional solid modeling found in the literature. Validation with published experiments and verification with other published simulations demonstrated that it was sufficiently accurate for design purposes. Four SWASS configurations were compared. A slotted metallic foil waveguide tube wrapped with glass fiber and a carbon fiber inner mold line face sheet was the most effective concept, followed closely by all carbon fiber construction with slots cut through the outer mold line face sheet. Glass fiber face sheets for both inner and outer mold lines were the least effective concepts, whether or not the waveguide tubes were constructed of metallic foil or carbon fiber.
- Published
- 2020
- Full Text
- View/download PDF
33. A host subtraction database for virus discovery in human cell line sequencing data
- Author
-
Granger G. Sutton, Jason R. Miller, Kari A. Dilley, Timothy B. Stockwell, Derek M. Harkins, and Reed S. Shabman
- Subjects
0301 basic medicine ,HepG2 ,viruses ,Sequencing data ,Human cell line ,computer.software_genre ,Jurkat cells ,Virus ,General Biochemistry, Genetics and Molecular Biology ,Cell Line ,03 medical and health sciences ,0302 clinical medicine ,Databases, Genetic ,Humans ,General Pharmacology, Toxicology and Pharmaceutics ,030304 developmental biology ,0303 health sciences ,biology ,Database ,General Immunology and Microbiology ,Host (biology) ,Jurkat ,Subtraction ,HuH-7 ,RNA ,High-Throughput Nucleotide Sequencing ,General Medicine ,Articles ,Method Article ,host subtraction ,biology.organism_classification ,RNAseq ,Sendai virus ,030104 developmental biology ,Viruses ,human cell lines ,computer ,030217 neurology & neurosurgery - Abstract
The human cell lines HepG2, HuH-7, and Jurkat are commonly used for amplification of the RNA viruses present in environmental samples. To assist with assays by RNAseq, we sequenced these cell lines and developed a subtraction database that contains sequences expected in sequence data from uninfected cells. RNAseq data from cell lines infected with Sendai virus were analyzed to test host subtraction. The process of mapping RNAseq reads to our subtraction database vastly reduced the number non-viral reads in the dataset to allow for efficient secondary analyses.
- Published
- 2019
34. Lyme Disease Manifestations in the Foot and Ankle: A Retrospective Case Series
- Author
-
Jason R. Miller, Louis J. Ciliberti, Joshua K. Hollinger, Shelley M. Brand, Karl W. Dunn, Dina K. Becker, and Domenick Braccia
- Subjects
Adult ,Male ,0301 basic medicine ,medicine.medical_specialty ,Adolescent ,Arthritis ,Lyme Arthritis ,Young Adult ,03 medical and health sciences ,0302 clinical medicine ,Lyme disease ,Pathognomonic ,Foot Joints ,Borrelia ,medicine ,Humans ,Orthopedics and Sports Medicine ,030212 general & internal medicine ,Borrelia burgdorferi ,Child ,Retrospective Studies ,Lyme Disease ,biology ,business.industry ,Middle Aged ,bacterial infections and mycoses ,biology.organism_classification ,medicine.disease ,Surgery ,030104 developmental biology ,medicine.anatomical_structure ,Erythema migrans ,Female ,Joint Diseases ,Ankle ,business - Abstract
Lyme disease is the result of Borrelia burgdorferi bacterial infection after exposure from a tick bite. A pathognomonic finding in early-stage Lyme disease is an expanding, red macular ring known as erythema migrans. Lyme arthritis is a late-stage manifestation of this disease, affecting the large, weightbearing joints with intermittent pain and swelling. The existing data on Lyme disease and subsequent arthritis have reported manifestations in the lower extremity, primarily in the knee and ankle and less commonly the small joints of the foot. We present a retrospective case series of 11 cases of painful arthritis in the foot and ankle with confirmatory Lyme disease testing.
- Published
- 2016
- Full Text
- View/download PDF
35. An algorithm for automated closure during assembly.
- Author
-
Sergey Koren, Jason R. Miller, Brian Walenz, and Granger G. Sutton
- Published
- 2010
- Full Text
- View/download PDF
36. The Rare Lesser Metatarsal Bipartite Sesamoid
- Author
-
Stephanie Eldridge, Jason R. Miller, Matthew Tran, and Christopher R. Hood
- Subjects
Adult ,0301 basic medicine ,Ossicles ,business.industry ,food and beverages ,General Medicine ,Anatomy ,Magnetic Resonance Imaging ,030218 nuclear medicine & medical imaging ,Radiography ,03 medical and health sciences ,0302 clinical medicine ,medicine.anatomical_structure ,Rare case ,Sesamoid bone ,Humans ,Medicine ,medicine.bone ,Female ,030101 anatomy & morphology ,Sesamoid Bones ,Metatarsal bones ,business ,Metatarsal Bones - Abstract
Sesamoid bones and accessory ossicles are common incidental findings on radiographs. These can occasionally become symptomatic, usually after a precipitating event such as an injury or overuse, or they can be incidental findings unrelated to the presenting pathology. The aim of this study was to highlight a rare case of a bipartite fifth metatarsal sesamoid bone and to review previous literature regarding sesamoid bones and accessory ossicles.
- Published
- 2017
- Full Text
- View/download PDF
37. Association of Vitamin D With Stress Fractures: A Retrospective Cohort Study
- Author
-
Brock A. Swanson, Jason R. Miller, Louis J. Ciliberti, Karl W. Dunn, and Rikhil D. Patel
- Subjects
Adult ,Male ,medicine.medical_specialty ,Time Factors ,Fractures, Stress ,Bone density ,Population ,March fracture ,030209 endocrinology & metabolism ,Bone health ,vitamin D deficiency ,03 medical and health sciences ,0302 clinical medicine ,Bone Density ,medicine ,Vitamin D and neurology ,Humans ,Orthopedics and Sports Medicine ,Vitamin D ,education ,Metatarsal Bones ,Retrospective Studies ,education.field_of_study ,Stress fractures ,business.industry ,Retrospective cohort study ,030229 sport sciences ,Vitamin D Deficiency ,medicine.disease ,Magnetic Resonance Imaging ,Surgery ,Female ,business ,Follow-Up Studies - Abstract
Vitamin D is an essential, fat-soluble nutrient that is a key modulator of bone health. Despite the gaining popularity throughout published medical studies, no consensus has been reached regarding a serum vitamin D level that will guarantee adequate skeletal health in a patient with an increased functional demand. The purpose of the present investigation was to examine the serum concentrations of vitamin D in patients with confirmed stress fractures. A total of 124 patients were included in our retrospective cohort study. Of the 124 patients, 53 had vitamin D levels measured within 3 months of diagnosis. An association was seen in patients with a stress fracture and vitamin D level measured, as 44 (83.02%) of the 53 patients had a serum 25-hydroxyvitamin D level
- Published
- 2016
- Full Text
- View/download PDF
38. Analysis of the Aedes albopictus C6/36 genome provides insight into cell line utility for viral propagation
- Author
-
Vinita Puri, Benjamin D. Rosen, Sergey Koren, Adam M. Phillippy, Jason R. Miller, Françoise Thibaud-Nissen, Zhijian Tu, David M. Brown, Igor V. Sharakhov, Granger G. Sutton, Maria V. Sharakhova, Nicholas H. Bergman, Reed S. Shabman, Derek M. Harkins, Xiao-Guang Chen, Peter M. Piermarini, Kari A. Dilley, Timothy B. Stockwell, and Robert Sebra
- Subjects
0301 basic medicine ,C6/36 ,Aedes albopictus ,сборка генома ,Genome, Insect ,Sequence assembly ,Health Informatics ,Locus (genetics) ,Computational biology ,Mosquito Vectors ,Biology ,Virus Replication ,Genome ,Transcriptome ,03 medical and health sciences ,кровососущие комары ,Aedes ,Animals ,Humans ,Whole genome sequencing ,030102 biochemistry & molecular biology ,Contig ,Base Sequence ,Zika Virus Infection ,Research ,fungi ,Gene Annotation ,cell line ,Zika Virus ,biology.organism_classification ,3. Good health ,Computer Science Applications ,030104 developmental biology ,Larva ,genome assembly ,клеточная линия - Abstract
Background The 50-year-old Aedes albopictus C6/36 cell line is a resource for the detection, amplification, and analysis of mosquito-borne viruses including Zika, dengue, and chikungunya. The cell line is derived from an unknown number of larvae from an unspecified strain of Aedes albopictus mosquitoes. Toward improved utility of the cell line for research in virus transmission, we present an annotated assembly of the C6/36 genome. Results The C6/36 genome assembly has the largest contig N50 (3.3 Mbp) of any mosquito assembly, presents the sequences of both haplotypes for most of the diploid genome, reveals independent null mutations in both alleles of the Dicer locus, and indicates a male-specific genome. Gene annotation was computed with publicly available mosquito transcript sequences. Gene expression data from cell line RNA sequence identified enrichment of growth-related pathways and conspicuous deficiency in aquaporins and inward rectifier K+ channels. As a test of utility, RNA sequence data from Zika-infected cells were mapped to the C6/36 genome and transcriptome assemblies. Host subtraction reduced the data set by 89%, enabling faster characterization of nonhost reads. Conclusions The C6/36 genome sequence and annotation should enable additional uses of the cell line to study arbovirus vector interactions and interventions aimed at restricting the spread of human disease.
- Published
- 2018
39. Hybrid assembly with long and short reads improves discovery of gene family expansions
- Author
-
Roxanne Denny, Hayan Lee, Li Song, Peter Tiffin, Junqi Liu, James Gurtowski, W. Richard McCombie, Michael C. Schatz, Jason R. Miller, Kevin A. T. Silverstein, Lyza G. Maron, Robert M. Stupar, Namrata Singh, Joann Mudge, Susan R. McCouch, Nevin D. Young, Peng Zhou, Brian P. Walenz, and Thiruvarangan Ramaraj
- Subjects
0106 biological sciences ,0301 basic medicine ,DNA Copy Number Variations ,lcsh:QH426-470 ,lcsh:Biotechnology ,Sequence assembly ,Genomics ,Computational biology ,Biology ,Genes, Plant ,01 natural sciences ,Genome ,03 medical and health sciences ,Hybrid assembly pipeline ,Tandem repeat ,lcsh:TP248.13-248.65 ,Medicago truncatula ,Genetics ,Copy-number variation ,2. Zero hunger ,Comparative genomics ,Genome assembly ,Contig ,Shotgun sequencing ,Methodology Article ,Tandem repeats ,food and beverages ,Oryza ,lcsh:Genetics ,Phenotype ,030104 developmental biology ,Tandem Repeat Sequences ,Multigene Family ,010606 plant biology & botany ,Biotechnology - Abstract
Background Long-read and short-read sequencing technologies offer competing advantages for eukaryotic genome sequencing projects. Combinations of both may be appropriate for surveys of within-species genomic variation. Methods We developed a hybrid assembly pipeline called “Alpaca” that can operate on 20X long-read coverage plus about 50X short-insert and 50X long-insert short-read coverage. To preclude collapse of tandem repeats, Alpaca relies on base-call-corrected long reads for contig formation. Results Compared to two other assembly protocols, Alpaca demonstrated the most reference agreement and repeat capture on the rice genome. On three accessions of the model legume Medicago truncatula, Alpaca generated the most agreement to a conspecific reference and predicted tandemly repeated genes absent from the other assemblies. Conclusion Our results suggest Alpaca is a useful tool for investigating structural and copy number variation within de novo assemblies of sampled populations. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3927-8) contains supplementary material, which is available to authorized users.
- Published
- 2017
- Full Text
- View/download PDF
40. Analysis of the Aedes albopictus C6/36 genome provides insight into cell line adaptations to in vitro viral propagation
- Author
-
Vinita Puri, Kari A. Dilley, Benjamin D. Rosen, Timothy B. Stockwell, Maria V. Sharakhova, Adam M. Phillippy, Sergey Koren, Robert Sebra, Jason R. Miller, Reed S. Shabman, Xiao-Guang Chen, David M. Brown, Françoise Thibaud-Nissen, Peter M. Piermarini, Nicholas H. Bergman, Zhijian Tu, Igor V. Sharakhov, Derek M. Harkins, and Granger G. Sutton
- Subjects
Whole genome sequencing ,Genetics ,Genome evolution ,Aedes albopictus ,biology ,Sequence assembly ,Locus (genetics) ,Genomics ,Gene Annotation ,biology.organism_classification ,Genome - Abstract
BackgroundThe 50-year old Aedes albopictus C6/36 cell line is a resource for the detection, amplification, and analysis of mosquito-borne viruses including Zika, dengue, and chikungunya. The cell line is derived from an unknown number of larvae from an unspecified strain of Aedes albopictus mosquitoes. Toward improved utility of the cell line for research in virus transmission, we present an annotated assembly of the C6/36 genome.ResultsThe C6/36 genome assembly has the largest contig N50 (3.3 Mbp) of any mosquito assembly, presents the sequences of both haplotypes for most of the diploid genome, reveals independent null mutations in both alleles of the Dicer locus, and indicates a male-specific genome. Gene annotation was computed with publicly available mosquito transcript sequences. Gene expression data from cell line RNA sequence identified enrichment of growth-related pathways and conspicuous deficiency in aquaporins and inward rectifier K+ channels. As a test of utility, RNA sequence data from Zika-infected cells was mapped to the C6/36 genome and transcriptome assemblies. Host subtraction reduced the data set by 89%, enabling faster characterization of non-host reads.ConclusionsThe C6/36 genome sequence and annotation should enable additional uses of the cell line to study arbovirus vector interactions and interventions aimed at restricting the spread of human disease.
- Published
- 2017
- Full Text
- View/download PDF
41. Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model, Medicago truncatula
- Author
-
Michael J. Sadowsky, Peter Tiffin, Robert M. Stupar, Nicholas P. Devitt, Jason R. Miller, Diego Fajardo, Kevin A. T. Silverstein, Nevin D. Young, Thiruvarangan Ramaraj, Peng Zhou, Joann Mudge, and Karen M. Moll
- Subjects
0106 biological sciences ,0301 basic medicine ,Quality Control ,Time Factors ,lcsh:QH426-470 ,lcsh:Biotechnology ,Cost-Benefit Analysis ,Dovetail ,Sequence assembly ,BioNano ,Computational biology ,Biology ,01 natural sciences ,Genome ,DNA sequencing ,Chromosomes, Plant ,03 medical and health sciences ,lcsh:TP248.13-248.65 ,Next generation sequencing ,Medicago truncatula ,Genetics ,2. Zero hunger ,Whole genome sequencing ,PacBio ,Genome assembly ,Genomics ,Reference Standards ,biology.organism_classification ,Dovetail joint ,lcsh:Genetics ,030104 developmental biology ,DNA microarray ,Functional genomics ,Genome, Plant ,010606 plant biology & botany ,Biotechnology ,Research Article - Abstract
Background Third generation sequencing technologies, with sequencing reads in the tens- of kilo-bases, facilitate genome assembly by spanning ambiguous regions and improving continuity. This has been critical for plant genomes, which are difficult to assemble due to high repeat content, gene family expansions, segmental and tandem duplications, and polyploidy. Recently, high-throughput mapping and scaffolding strategies have further improved continuity. Together, these long-range technologies enable quality draft assemblies of complex genomes in a cost-effective and timely manner. Results Here, we present high quality genome assemblies of the model legume plant, Medicago truncatula (R108) using PacBio, Dovetail Chicago (hereafter, Dovetail) and BioNano technologies. To test these technologies for plant genome assembly, we generated five assemblies using all possible combinations and ordering of these three technologies in the R108 assembly. While the BioNano and Dovetail joins overlapped, they also showed complementary gains in continuity and join numbers. Both technologies spanned repetitive regions that PacBio alone was unable to bridge. Combining technologies, particularly Dovetail followed by BioNano, resulted in notable improvements compared to Dovetail or BioNano alone. A combination of PacBio, Dovetail, and BioNano was used to generate a high quality draft assembly of R108, a M. truncatula accession widely used in studies of functional genomics. As a test for the usefulness of the resulting genome sequence, the new R108 assembly was used to pinpoint breakpoints and characterize flanking sequence of a previously identified translocation between chromosomes 4 and 8, identifying more than 22.7 Mb of novel sequence not present in the earlier A17 reference assembly. Conclusions Adding Dovetail followed by BioNano data yielded complementary improvements in continuity over the original PacBio assembly. This strategy proved efficient and cost-effective for developing a quality draft assembly compared to traditional reference assemblies. Electronic supplementary material The online version of this article (doi:10.1186/s12864-017-3971-4) contains supplementary material, which is available to authorized users.
- Published
- 2017
42. An Examination of Intervention Research with Secondary Students with EBD in Light of Common Core State Standards for Mathematics
- Author
-
Jason R. Miller, Paula Maccini, Kenneth Wright, and Candace A. Mulcahy
- Subjects
050103 clinical psychology ,05 social sciences ,050301 education ,Integrated mathematics ,Academic standards ,National Science Education Standards ,Education ,Reform mathematics ,Clinical Psychology ,Interactive Mathematics Program ,Connected Mathematics ,Developmental and Educational Psychology ,Mathematics education ,0501 psychology and cognitive sciences ,Math wars ,Core-Plus Mathematics Project ,Psychology ,0503 education - Abstract
In this review, the authors offer a critical analysis of published interventions for improving mathematics performance among middle and high school students with EBD in light of the Common Core State Standards. An exhaustive review of literature from 1975 to December 2012 yielded 20 articles that met criteria for inclusion. The authors analyzed the body of research for the nature of participants, settings, interventionists, interventions, and alignment with Common Core State Standards. Despite limitations with the body of research, the results of the review indicate a number of promising practices exist for secondary students with EBD in mathematics, including: (a) peer-mediated approaches, (b) interventions that address academic performance as well as behavior, (d) use of technology, and (e) use of strategy instruction to support conceptual understanding. Implications for research and practice are provided.
- Published
- 2014
- Full Text
- View/download PDF
43. An improved genome assembly uncovers prolific tandem repeats in Atlantic cod
- Author
-
William Brynildsen Reinar, Kjetill S. Jakobsen, Alexander J. Nederbragt, James R. Knight, Jason R. Miller, Brian P. Walenz, Morten Skage, Paul Peluso, Sigbjørn Lien, Sissel Jentoft, Jenny M. Ekholm, Ave Tooming-Klunderud, Bastiaan Star, Harald Grove, Rolf B. Edvardsen, and Ole K. Tørresen
- Subjects
0301 basic medicine ,Heterozygote ,Assembly algorithms ,Sequence assembly ,Genomics ,Repetitive DNA ,Biology ,Genome ,03 medical and health sciences ,0302 clinical medicine ,Assembly consolidation ,Tandem repeat ,Indel polymorphism ,Genetics ,Animals ,Gadus ,Long-read sequencing technology ,Promoter Regions, Genetic ,Microsatellites ,Repeated sequence ,Gene ,PacBio ,Heterozygosity ,Dinucleotide repeats ,Molecular Sequence Annotation ,Sequence Analysis, DNA ,biology.organism_classification ,030104 developmental biology ,Gadus morhua ,Tandem Repeat Sequences ,Evolutionary biology ,Pyrosequencing ,Atlantic cod ,030217 neurology & neurosurgery ,Research Article ,Biotechnology - Abstract
Background: The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes, although many of these are of a fragmented nature with a significant fraction of bases in gaps. The development of long-read sequencing and improved software now enable the generation of more contiguous genome assemblies.Results: By combining data from Illumina, 454 and the longer PacBio sequencing technologies, as well as integrating the results of multiple assembly programs, we have created a substantially improved version of the Atlantic cod genome assembly. The sequence contiguity of this assembly is increased fifty-fold and the proportion of gap-bases has been reduced fifteen-fold. Compared to other vertebrates, the assembly contains an unusual high density of tandem repeats (TRs). Indeed, retrospective analyses reveal that gaps in the first genome assembly were largely associated with these TRs. We show that 21 % of the TRs across the assembly, 19 % in the promoter regions and 12 % in the coding sequences are heterozygous in the sequenced individual.Conclusions: The inclusion of PacBio reads combined with the use of multiple assembly programs drastically improved the Atlantic cod genome assembly by successfully resolving long TRs. The high frequency of heterozygous TRs within or in the vicinity of genes in the genome indicate a considerable standing genomic variation in Atlantic cod populations, which is likely of evolutionary importance.
- Published
- 2017
44. Genomic innovations, transcriptional plasticity and gene loss underlying the evolution and divergence of two highly polyphagous and invasive Helicoverpa pest species
- Author
-
Peter M. Campbell, David F. Clarke, H-J. Zhang, Christopher W. Coppin, Brian P. Walenz, L. B. Han, Irene Horne, Stephen L. Pearce, Philip G. Board, Peter D. East, Annette McGrath, Heiko Vogel, René Feyereisen, Sharon Downes, G. Duan, Trent Perry, S. Li, Rahul V. Rane, Z. Zou, Emmanuelle Jacquin-Joly, Owain R. Edwards, Klas Hatje, Tom Walsh, Daniel S.T. Hughes, Samia Elfekih, Omaththage P. Perera, N-Y. Liu, Steven E. Scherer, Granger G. Sutton, Richard A. Gibbs, Suyog S. Kuwar, David G. Heckel, Shalini N. Jhangiani, John G. Oakeshott, Thomas Chertemps, Nicolas Montagné, Alisha Anderson, Y. P. Huang, John T. Christeller, Martin Kollmar, Angela McGaughran, Craig Anderson, William James, Karl H.J. Gordon, Robert T. Good, A. Papanikolaou, Wei Xu, Stephen Richards, Anne Bretschneider, Lars S. Jermiin, Claire A. Farnsworth, Charles Robin, Jiaxin Qu, Y. C. Han, S. V. Song, Martine Maïbèche, Yidong Wu, Sassan Asgari, Wee Tek Tay, Kim C. Worley, Philip J. Batterham, Jason R. Miller, Commonwealth Scientific and Industrial Research Organisation [Canberra] (CSIRO), University of Melbourne, Research School of Biology, Australian National University (ANU), USDA-ARS : Agricultural Research Service, Human Genome Sequencing Center, Baylor College of Medicine (BCM), Baylor University-Baylor University, Biological and Environmental Sciences, University of Stirling, School of Biological Sciences, University of Queensland [Brisbane], John Curtin School of Medical Research, Max Planck Institute for Chemical Ecology, Max-Planck-Gesellschaft, Institut d'écologie et des sciences de l'environnement de Paris (iEES), Institut National de la Recherche Agronomique (INRA)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Université Paris-Est Créteil Val-de-Marne - Paris 12 (UPEC UP12)-Centre National de la Recherche Scientifique (CNRS), Plant & Food Research, School Biology Science, Royal Holloway [University of London] (RHUL), State key laboratory of Integrated Management of pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences [Changchun Branch] (CAS), Nanjing Agricultural University (NAU), Max Planck Institute for Biophysical Chemistry (MPI-BPC), Chinese Academy of Sciences (CAS), Southwest Forestry University (SWFU), J. Craig Venter Institute [La Jolla, USA] (JCVI), Université Pierre et Marie Curie - Paris 6 (UPMC), School of Veterinary and Life Sciences, Murdoch University, Chongqing Medical University, University of Copenhagen = Københavns Universitet (UCPH), CSIRO Transformational Biology Program University of Melbourne Max-Planck-Gesellschaft Agricultural Research Service, US Department of Agriculture NIH (USA) 2 R01 GM0//11/-04A1 National Natural Science Foundation of China 31530060 New Zealand Government Public Good Science Fund FRST C06X0804 Chinese Scholarship Council, Centre National de la Recherche Scientifique (CNRS)-Université Paris-Est Créteil Val-de-Marne - Paris 12 (UPEC UP12)-Université Pierre et Marie Curie - Paris 6 (UPMC)-Institut National de la Recherche Agronomique (INRA), Nanjing Agricultural University, and University of Copenhagen = Københavns Universitet (KU)
- Subjects
0106 biological sciences ,0301 basic medicine ,Physiology ,ARMIGERA HUBNER LEPIDOPTERA ,HELIOTHIS-VIRESCENS LEPIDOPTERA ,MAXIMUM-LIKELIHOOD PHYLOGENIES ,DNA-SEQUENCING DATA ,WILD HOST PLANTS ,RNA-SEQ DATA ,INSECTICIDE RESISTANCE ,BACILLUS-THURINGIENSIS ,ZEA LEPIDOPTERA ,COTTON BOLLWORM ,[SDV]Life Sciences [q-bio] ,Genomics ,Plant Science ,Helicoverpa armigera ,01 natural sciences ,Genome ,General Biochemistry, Genetics and Molecular Biology ,03 medical and health sciences ,Structural Biology ,Botany ,Journal Article ,Gene family ,Gene ,Helicoverpa ,lcsh:QH301-705.5 ,Ecology, Evolution, Behavior and Systematics ,Genetics ,Comparative genomics ,biology ,fungi ,Cell Biology ,biology.organism_classification ,010602 entomology ,030104 developmental biology ,lcsh:Biology (General) ,[SDE]Environmental Sciences ,Helicoverpa zea ,General Agricultural and Biological Sciences ,Developmental Biology ,Biotechnology - Abstract
International audience; Background: Helicoverpa armigera and Helicoverpa zea are major caterpillar pests of Old and New World agriculture, respectively. Both, particularly H. armigera, are extremely polyphagous, and H. armigera has developed resistance to many insecticides. Here we use comparative genomics, transcriptomics and resequencing to elucidate the genetic basis for their properties as pests. Results: We find that, prior to their divergence about 1.5 Mya, the H. armigera/H. zea lineage had accumulated up to more than 100 more members of specific detoxification and digestion gene families and more than 100 extra gustatory receptor genes, compared to other lepidopterans with narrower host ranges. The two genomes remain very similar in gene content and order, but H. armigera is more polymorphic overall, and H. zea has lost several detoxification genes, as well as about 50 gustatory receptor genes. It also lacks certain genes and alleles conferring insecticide resistance found in H. armigera. Non-synonymous sites in the expanded gene families above are rapidly diverging, both between paralogues and between orthologues in the two species. Whole genome transcriptomic analyses of H. armigera larvae show widely divergent responses to different host plants, including responses among many of the duplicated detoxification and digestion genes. Conclusions: The extreme polyphagy of the two heliothines is associated with extensive amplification and neofunctionalisation of genes involved in host finding and use, coupled with versatile transcriptional responses on different hosts. H. armigera's invasion of the Americas in recent years means that hybridisation could generate populations that are both locally adapted and insecticide resistant.
- Published
- 2017
- Full Text
- View/download PDF
45. ThaleMine: A Warehouse for Arabidopsis Data Integration and Discovery
- Author
-
Jason R. Miller, Irina Belyaeva, Agnes P. Chan, Gos Micklem, Sergio Contrino, Christopher D. Town, Matthew W. Vaughn, Chia Yi Cheng, Vivek Krishnakumar, Erik S. Ferlanti, Micklem, Gos [0000-0002-6883-6168], and Apollo - University of Cambridge Repository
- Subjects
0301 basic medicine ,Arabidopsis thaliana ,Physiology ,Computer science ,Arabidopsis ,data warehouse ,Information Storage and Retrieval ,Genomics ,InterMine ,Plant Science ,computer.software_genre ,Bioinformatics ,World Wide Web ,03 medical and health sciences ,Gene Expression Regulation, Plant ,Databases, Genetic ,Protein Interaction Mapping ,Protein Interaction Maps ,data integration ,Internet ,Data collection ,biology ,business.industry ,Arabidopsis Proteins ,Sequence Analysis, RNA ,Gene Expression Profiling ,Computational Biology ,Reproducibility of Results ,Cell Biology ,General Medicine ,biology.organism_classification ,Data warehouse ,Software framework ,030104 developmental biology ,web services ,Gene Ontology ,The Internet ,Web service ,business ,computer ,Data integration - Abstract
ThaleMine (https://apps.araport.org/thalemine/) is a comprehensive data warehouse that integrates a wide array of genomic information of the model plant Arabidopsis thaliana. The data collection currently includes the latest structural and functional annotation from the Araport11 update, the Col-0 genome sequence, RNA-seq and array expression, co-expression, protein interactions, homologs, pathways, publications, alleles, germplasm and phenotypes. The data are collected from a wide variety of public resources. Users can browse gene-specific data through Gene Report pages, identify and create gene lists based on experiments or indexed keywords, and run GO enrichment analysis to investigate the biological significance of selected gene sets. Developed by the Arabidopsis Information Portal project (Araport, https://www.araport.org/), ThaleMine uses the InterMine software framework, which builds well-structured data, and provides powerful data query and analysis functionality. The warehoused data can be accessed by users via graphical interfaces, as well as programmatically via web-services. Here we describe recent developments in ThaleMine including new features and extensions, and discuss future improvements. InterMine has been broadly adopted by the model organism research community including nematode, rat, mouse, zebrafish, budding yeast, the modENCODE project, as well as being used for human data. ThaleMine is the first InterMine developed for a plant model. As additional new plant InterMines are developed by the legume and other plant research communities, the potential of cross-organism integrative data analysis will be further enabled.
- Published
- 2016
46. Genome sequence of Aedes aegypti, a major arbovirus vector
- Author
-
Sergio Verjovski-Almeida, James E. Galagan, Ryan C. Kennedy, Zhiyong Xi, Jason R. Miller, Eric Eisenstadt, Kyanne R. Reidenbach, Robert V. Bruggner, Yu-Hui Rogers, Hadi Quesneville, Doreen Werner, Owen White, Alexander S. Raikhel, Mario Stanke, J. Spencer Johnston, Diane D. Lovin, Evgenia V. Kriventseva, Ian T. Paulsen, Kathryn S. Campbell, Norman H. Lee, Ewan Birney, Karyn Megy, Hean Koo, David Kulp, Shelby L. Bidwell, Jingsong Zhu, Philip Montgomery, Paolo Amedeo, Yongmei Zhao, Chinnappa D. Kodira, Javier Costas, Michael C. Schatz, Steven P. Sinkins, Claire M. Fraser-Liggett, Martin Shumway, Kurt LaButti, Akio Mori, Brendan J. Loftus, Manfred Grabherr, Eric O. Stinson, Frank H. Collins, Zhijian Jake Tu, Monique R. Coy, Matt Crawford, Janice P. Vanzee, William M. Gelbart, Joshua Orvis, Peter Arensburger, Chunhong Mao, Evgeny M. Zdobnov, Saul A. Kravitz, Suely Lopes Gomes, David DeCaprio, David G. Hogenkamp, Daniel Lawson, Dennis L. Knudson, David W. Severson, George Dimopoulos, Marcelo B. Soares, Sinéad B. O'Leary, Peter W. Atkinson, David M. Jaffe, Becky deBruyn, Martin Hammond, Ana L. T. O. Nascimento, Jim Biedler, Stefan Wyder, Jose M. C. Tubio, Bruce W. Birren, Catherine A. Hill, Chad Nusbaum, Eduardo Lee, Song Li, Susan E. Brown, Jennifer R. Wortman, James R. Hogan, Hamza El-Dorry, Qi Zhao, Linda Hannick, Carlos Frederico Martins Menck, Vishvanath Nene, Jonathan Crabtree, Steven L. Salzberg, Michael H. Holmes, Maria de Fatima Bonaldo, Quinghu Ren, Mihaela Pertea, Charles Roth, Evan Mauceli, Karin Eiglmeier, Horacio Naveira, Brian J. Haas, Qiandong Zeng, Neil F. Lobo, Jennifer R. Schneider, The Institute for Genomic Research, The Institute for Genomic Research, Rockville, European Bioinformatics Institute [Hinxton] ( EMBL-EBI ), European Molecular Biology Laboratory [Hinxton], Broad Institute of MIT and Harvard ( BROAD INSTITUTE ), Broad Institute of MIT and Harvard, Virginia Polytechnic Institute and State University [Blacksburg], University College Dublin [Dublin] ( UCD ), Bloomberg School of Public Health, Johns Hopkins University ( JHU ) -Bloomberg School of Public Health, University of Geneva Medical School, Swiss Institute of Bioinformatics-University of Geneva Medical School, University of Notre Dame ( UND ), Harvard University [Cambridge], College of Agricultural Sciences Colorado State University, Colorado State University [Fort Collins] ( CSU ) -College of Agricultural Sciences, Northwestern University [Evanston], University of California [Riverside] ( UCR ), Department of Atmospheric, Oceanic and Planetary Physics [Oxford] ( AOPP ), University of Oxford [Oxford], Purdue University [West Lafayette], Centro Nacional de Genotipado Fundación Pública Galega de Medicina Xenómica Hospital Clínico Universitario de Santiago, Centro Nacional de Genotipado-Fundación Pública Galega de Medicina Xenómica-Hospital Clínico Universitario de Santiago, Institut Pasteur [Paris], Universidade de Sao Paulo Instituto de Quimica, Universidade de São Paulo ( USP ) -Instituto de Quimica, Texas A&M University [College Station], Joint Technology Center, University of Massachusetts [Amherst] ( UMass Amherst ), Universidade de Sao Paulo, Institute of Biomedical Sciences, Universidade de São Paulo ( USP ) -Institute of Biomedical Sciences, Instituto Butantan [São Paulo], Universidade da Coruña, University of Maryland [College Park], Institut Jacques Monod ( IJM ), Université Paris Diderot - Paris 7 ( UPD7 ) -Centre National de la Recherche Scientifique ( CNRS ), University of California [Santa Cruz] ( UCSC ), Complexo Hospitalario Universitario de Santiago, Universität Göttingen, Georg-August-Universität Göttingen, George Washington University Medical Center, George Washington University ( GW ), The Institute for Genomic Research (TIGR), European Bioinformatics Institute [Hinxton] (EMBL-EBI), EMBL Heidelberg, Broad Institute of MIT and Harvard (BROAD INSTITUTE), Harvard Medical School [Boston] (HMS)-Massachusetts Institute of Technology (MIT)-Massachusetts General Hospital [Boston], University College Dublin [Dublin] (UCD), Johns Hopkins Bloomberg School of Public Health [Baltimore], Johns Hopkins University (JHU), Swiss Institute of Bioinformatics [Lausanne] (SIB), Université de Lausanne (UNIL)-Université de Lausanne (UNIL)-University of Geneva Medical School, University of Notre Dame [Indiana] (UND), Colorado State University [Fort Collins] (CSU)-College of Agricultural Sciences, University of California [Riverside] (UCR), University of California, Department of Atmospheric, Oceanic and Planetary Physics [Oxford] (AOPP), Universidade de São Paulo (USP)-Instituto de Quimica, University of Massachusetts [Amherst] (UMass Amherst), University of Massachusetts System (UMASS), Universidade de São Paulo (USP)-Institute of Biomedical Sciences (ICB/USP), Universidade de São Paulo (USP), University of Maryland System, Institut Jacques Monod (IJM (UMR_7592)), Université Paris Diderot - Paris 7 (UPD7)-Centre National de la Recherche Scientifique (CNRS), University of California [Santa Cruz] (UCSC), Georg-August-University [Göttingen], The George Washington University (GW), George Washington University (GW), Université de Lausanne = University of Lausanne (UNIL)-Université de Lausanne = University of Lausanne (UNIL)-University of Geneva Medical School, Harvard University, University of California [Riverside] (UC Riverside), University of California (UC), University of Oxford, Institut Pasteur [Paris] (IP), Universidade de São Paulo = University of São Paulo (USP)-Instituto de Quimica, Universidade de São Paulo = University of São Paulo (USP)-Institute of Biomedical Sciences (ICB/USP), Universidade de São Paulo = University of São Paulo (USP), University of California [Santa Cruz] (UC Santa Cruz), Georg-August-University = Georg-August-Universität Göttingen, Zdobnov, Evgeny, and Wyder, Stefan
- Subjects
0106 biological sciences ,Male ,Transcription, Genetic ,Genome, Insect ,transposons ,receptors ,Genes, Insect ,Aedes/ genetics/metabolism ,MESH: Genes, Insect ,Yellow Fever/prevention & control/transmission ,MESH: Base Sequence ,01 natural sciences ,Dengue/prevention & control/transmission ,MESH: Protein Structure, Tertiary ,MESH: Arboviruses ,MESH : Insect Vectors ,MESH: Insect Proteins ,MESH : Anopheles gambiae ,MESH: Animals ,MESH : Arboviruses ,insects ,MESH: Yellow Fever ,superfamily ,ddc:616 ,0303 health sciences ,Anopheles ,MESH : Genes, Insect ,3. Good health ,yellow-fever mosquito ,anopheles-gambiae ,drosophila-melanogaster ,expression ,evolution ,organization ,Drosophila melanogaster ,MESH: DNA Transposable Elements ,[ SDV.BBM.GTP ] Life Sciences [q-bio]/Biochemistry, Molecular Biology/Genomics [q-bio.GN] ,Multigene Family ,Public Health ,MESH : Protein Structure, Tertiary ,MESH: Sex Characteristics ,Molecular Sequence Data ,MESH : Multigene Family ,MESH: Sex Determination (Genetics) ,MESH : Sex Determination (Genetics) ,Arbovirus ,Article ,03 medical and health sciences ,Species Specificity ,[SDV.BBM.GTP]Life Sciences [q-bio]/Biochemistry, Molecular Biology/Genomics [q-bio.GN] ,MESH: Anopheles gambiae ,Yellow Fever ,MESH: Species Specificity ,Humans ,MESH: Humans ,MESH: Molecular Sequence Data ,fungi ,MESH : Humans ,MESH : Sex Characteristics ,Membrane Transport Proteins ,Sex Determination Processes ,medicine.disease ,Insect Vectors ,Protein Structure, Tertiary ,Sex Determination (Genetics) ,MESH: Multigene Family ,MESH: Female ,MESH : Sequence Analysis, DNA ,MESH: Sequence Analysis, DNA ,Drosophila melanogaster/genetics ,MESH : Molecular Sequence Data ,Odorant binding ,Insect Proteins/genetics ,Anopheles gambiae ,MESH: Dengue ,Genome ,MESH: Membrane Transport Proteins ,Dengue ,MESH : Membrane Transport Proteins ,Aedes ,Insect Vectors/ genetics/metabolism ,MESH : Drosophila melanogaster ,MESH : Female ,Membrane Transport Proteins/genetics ,Genetics ,MESH : Insect Proteins ,Sex Characteristics ,Multidisciplinary ,MESH: Synteny ,MESH: Aedes ,MESH : Genome, Insect ,MESH : DNA Transposable Elements ,Insect Proteins ,Female ,Orthologous Gene ,MESH : Male ,MESH : Dengue ,Aedes aegypti ,MESH: Insect Vectors ,Biology ,010603 evolutionary biology ,Synteny ,MESH: Drosophila melanogaster ,Protein Structure, Tertiary/genetics ,MESH : Yellow Fever ,parasitic diseases ,medicine ,MESH : Species Specificity ,Animals ,030304 developmental biology ,MESH : Aedes ,Base Sequence ,MESH: Genome, Insect ,MESH: Transcription, Genetic ,MESH : Synteny ,MESH : Transcription, Genetic ,Sequence Analysis, DNA ,biology.organism_classification ,MESH: Male ,DNA Transposable Elements ,MESH : Base Sequence ,MESH : Animals ,Arboviruses ,Anopheles gambiae/genetics/metabolism - Abstract
We present a draft sequence of the genome of Aedes aegypti , the primary vector for yellow fever and dengue fever, which at ∼1376 million base pairs is about 5 times the size of the genome of the malaria vector Anopheles gambiae . Nearly 50% of the Ae. aegypti genome consists of transposable elements. These contribute to a factor of ∼4 to 6 increase in average gene length and in sizes of intergenic regions relative to An. gambiae and Drosophila melanogaster . Nonetheless, chromosomal synteny is generally maintained among all three insects, although conservation of orthologous gene order is higher (by a factor of ∼2) between the mosquito species than between either of them and the fruit fly. An increase in genes encoding odorant binding, cytochrome P450, and cuticle domains relative to An. gambiae suggests that members of these protein families underpin some of the biological differences between the two mosquito species.
- Published
- 2016
- Full Text
- View/download PDF
47. A proposed common format for electromagnetics data interchange
- Author
-
Jason R. Miller
- Subjects
Electromagnetics ,Computer science ,Operating system ,computer.software_genre ,File format ,computer ,Data interchange ,JSON ,computer.programming_language - Abstract
The lack of a common data interchange format for CEM is hampering progress, causing duplicate workloads and opening the door for error. We propose a standard format based on JSON for flexible data interchange. The format is designed to be self-describing, extensible, open, and freely available.
- Published
- 2016
- Full Text
- View/download PDF
48. Realistic randomized water surface generation for computational electromagnetics
- Author
-
Jason R. Miller
- Subjects
Physics ,Surface (mathematics) ,Optics ,business.industry ,Situated ,Computational electromagnetics ,Aerospace engineering ,business ,Physics::Atmospheric and Oceanic Physics ,Ground plane - Abstract
Traditionally, an infinite ground plane is used to represent the water surface for analysis of antennas installed on ships or otherwise situated over water. While this is sufficient for some scenarios, we have found the need for more accuracy in representing short-range propagation close to a water surface. In this paper, we generate randomized, realistic water surfaces and use these in CEM simulations.
- Published
- 2016
- Full Text
- View/download PDF
49. A framework for human microbiome research
- Author
-
Rosamond Rhodes, Asif T. Chinwalla, Tessa Madden, Ashlee M. Earl, Maria C. Rivera, Candace N. Farmer, Jonathan M. Goldberg, Karthik Kota, Victor Felix, Nicholas B. King, Shibu Yooseph, Erica Sodergren, Monika Bihan, Martin J. Blaser, Dirk Gevers, Dan Knights, Pamela Sankar, Anup Mahurkar, Heather Huot Creasy, Veena Bhonagiri, Thomas M. Schmidt, Curtis Huttenhower, Mina Rho, Todd J. Treangen, Thomas J. Sharpton, I. Min A. Chen, Bo Liu, Sarah K. Highlander, Catherine C. Davis, Susan M. Huse, Richard A. Gibbs, Noam J. Davidovics, Patricio S. La Rosa, Carsten Russ, Wesley C. Warren, Richard K. Wilson, Patrick Minx, Jean E. McEwen, Alyxandria M. Schubert, Scott Anderson, Bonnie P. Youmans, Jamison McCorrison, Kathie A. Mihindukulasuriya, Vandita Joshi, Peter J. Mannon, Brandi L. Cantarel, Joseph F. Petrosino, Jack D. Sobel, Chandri Yandava, Sharvari Gujja, Janet K. Jansson, David J. Dooling, Daniel McDonald, Rob Knight, Granger G. Sutton, Gary C. Armitage, Larry J. Forney, Robert S. Fulton, Yuan Qing Wu, Jonathan Crabtree, Susan Kinder-Haake, Lu Wang, Liang Ye, Victor M. Markowitz, Narmada Shenoy, Elizabeth A. Lobos, Ruth M. Farrell, Tatiana A. Vishnivetskaya, Patrick S. G. Chain, Jacques Ravel, Katherine H. Huang, Sergey Koren, Yan Ding, Christina Giblin, Jason R. Miller, Michelle G. Giglio, Gina A. Simone, Chad Nusbaum, Lynn M. Schriml, Matthew C. Ross, Daniel D. Sommer, Sandra L. Lee, Theresa A. Hepburn, Michael Holder, Shaila Chhibba, Patrick D. Schloss, Omry Koren, Lan Zhang, Catrina Fronick, Richard R. Sharp, Diana Tabbaa, Yuzhen Ye, Dennis C. Friedrich, Christie Kovar, Owen White, A. Scott Durkin, Michael Feldgarden, Gary L. Andersen, Makedonka Mitreva, Todd Wylie, Nihar U. Sheth, Sheila Fisher, John Martin, Jose C. Clemente, Xiang Qin, James Versalovic, Dana A. Busam, Bruce W. Birren, Jeremy Zucker, Yu-Hui Rogers, Shannon Dugan, Kristine M. Wylie, Katherine P. Lemon, Floyd E. Dewhirst, Nicola Segata, Konstantinos Liolios, Anthony A. Fodor, Elizabeth L. Appelbaum, Ramana Madupu, W. Michael Dunne, Katherine S. Pollard, Leslie Foster, Olukemi O. Abolude, Yue Liu, Nikos C. Kyrpides, Christopher Wellington, Yanjiao Zhou, Lita M. Proctor, Tsegahiwot Belachew, Mircea Podar, Julia A. Segre, Holli A. Hamilton, Aye Wollam, Paul Spicer, Lei Chen, Sarah Young, Beltran Rodriguez-Mueller, Todd Z. DeSantis, Sean M. Sykes, Toby Bloom, Kelvin Li, Shane Canon, Catherine Jordan, Manolito Torralba, Brandi Herter, R. Dwayne Lunsford, Krishna Palaniappan, Jeroen Raes, Hongyu Gao, Barbara A. Methé, Kjersti Aagaard, Amy L. McGuire, Jonathan Friedman, Matthew D. Pearson, Jason Walker, Mary A. Cutting, Jonathan H. Badger, Diane E. Hoffmann, Tulin Ayvaz, Michael Fitzgerald, Brian J. Haas, Ravi Sanka, Doyle V. Ward, Kris A. Wetterstrand, Mark A. Watson, Christopher Smillie, Lucinda Fulton, Zhengyuan Wang, Lisa Begg, James R. White, Konstantinos Mavrommatis, Lucia Alvarado, Pamela McInnes, Emily L. Harris, Harindra Arachchi, Craig Pohl, Catherine A. Lozupone, Ruth E. Ley, Clinton Howarth, Yiming Zhu, Huaiyang Jiang, Gregory A. Buck, Carl C. Baker, Kimberley D. Delehaunty, Cristyn Kells, Katarzyna Wilczek-Boney, Kim C. Worley, Cesar Arze, J. Fah Sathirapongsasuti, Carolyn Deal, Sandra W. Clifton, Ken Chu, Rachel L. Erlich, Elaine R. Mardis, Cecil M. Lewis, Niall Lennon, Margaret Priest, Scott T. Kelley, Kymberlie Hallsworth-Pepin, Jane Peterson, Allison D. Griggs, Michelle O'Laughlin, Heidi H. Kong, Joshua Orvis, Maria Y. Giovanni, Sahar Abubucker, Dawn Ciulla, Sean Conlan, Chien Chi Lo, Antonio Gonzalez, Georgia Giannoukos, Jennifer R. Wortman, Paul Brooks, Jacques Izard, Chad Tomlinson, Donna M. Muzny, Shital M. Patel, Eric J. Alm, George M. Weinstock, Irene Newsham, Jeffrey G. Reid, Karoline Faust, Qiandong Zeng, Elena Deych, Nathalia Garcia, Mathangi Thiagarajan, James A. Katancik, Vivien Bonazzi, Robert C. Edgar, Christian J. Buhay, Indresh Singh, Johannes B. Goll, Ioanna Pagani, Vincent Magrini, Wendy A. Keitel, Emma Allen-Vercoe, Teena Mehta, Jeffery A. Schloss, William D. Shannon, Mihai Pop, Matthew B. Scholz, Valentina Di Francesco, Rebecca Truty, Karen E. Nelson, Kevin Riehle, Lora Lewis, Joseph L. Campbell, Laurie Zoloth, Massachusetts Institute of Technology. Computational and Systems Biology Program, Massachusetts Institute of Technology. Department of Biological Engineering, Massachusetts Institute of Technology. Department of Civil and Environmental Engineering, Friedman, Jonathan, Smillie, Chris Scott, and Alm, Eric J.
- Subjects
Adult ,Male ,Adolescent ,Statistics as Topic ,Population ,Computational biology ,Biology ,Genome ,Article ,Young Adult ,03 medical and health sciences ,Human health ,0302 clinical medicine ,RNA, Ribosomal, 16S ,Humans ,Microbiome ,education ,030304 developmental biology ,Genetics ,0303 health sciences ,education.field_of_study ,Multidisciplinary ,Bacteria ,Human microbiome ,Reference Standards ,Metagenomics ,030220 oncology & carcinogenesis ,Earth Microbiome Project ,Metagenome ,Female ,Human Microbiome Project - Abstract
A variety of microbial communities and their genes (the microbiome) exist throughout the human body, with fundamental roles in human health and disease. The National Institutes of Health (NIH)-funded Human Microbiome Project Consortium has established a population-scale framework to develop metagenomic protocols, resulting in a broad range of quality-controlled resources and data including standardized methods for creating, processing and interpreting distinct types of high-throughput metagenomic data available to the scientific community. Here we present resources from a population of 242 healthy adults sampled at 15 or 18 body sites up to three times, which have generated 5,177 microbial taxonomic profiles from 16S ribosomal RNA genes and over 3.5 terabases of metagenomic sequence so far. In parallel, approximately 800 reference strains isolated from the human body have been sequenced. Collectively, these data represent the largest resource describing the abundance and variety of the human microbiome, while providing a framework for current and future studies.
- Published
- 2012
50. Bibliotherapy: Helping Children Cope with Life's Challenges
- Author
-
Jason R. Miller, Angela Stewart, and Michael Rozalski
- Subjects
Stress management ,Coping (psychology) ,Psychotherapist ,Values education ,medicine.medical_treatment ,Teaching method ,Bibliotherapy ,medicine ,Psychology ,Education - Abstract
Using carefully selected thematic books, teachers can use literature to reach students experiencing difficult situations.
- Published
- 2010
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.