134 results on '"Stoltzfus, Arlin"'
Search Results
2. Avoidance of Self during CRISPR Immunization
- Author
-
Weissman, JL, Stoltzfus, Arlin, Westra, Edze R., and Johnson, Philip L.F.
- Published
- 2020
- Full Text
- View/download PDF
3. The role of mutation bias in adaptive molecular evolution : insights from convergent changes in protein function
- Author
-
Storz, Jay F., Natarajan, Chandrasekhar, Signore, Anthony V., Witt, Christopher C., McCandlish, David M., and Stoltzfus, Arlin
- Published
- 2019
4. Evaluating targeted COVID-19 vaccination strategies with agent-based modeling.
- Author
-
Hladish, Thomas J., Pillai, Alexander N., Pearson, Carl A. B., Toh, Kok Ben, Tamayo, Andrea C., Stoltzfus, Arlin, and Longini, Ira M.
- Subjects
COVID-19 vaccines ,BREAKTHROUGH infections ,CONTACT tracing ,SARS-CoV-2 ,SARS-CoV-2 Omicron variant - Abstract
We evaluate approaches to vaccine distribution using an agent-based model of human activity and COVID-19 transmission calibrated to detailed trends in cases, hospitalizations, deaths, seroprevalence, and vaccine breakthrough infections in Florida, USA. We compare the incremental effectiveness for four different distribution strategies at four different levels of vaccine supply, starting in late 2020 through early 2022. Our analysis indicates that the best strategy to reduce severe outcomes would be to actively target high disease-risk individuals. This was true in every scenario, although the advantage was greatest for the intermediate vaccine availability assumptions and relatively modest compared to a simple mass vaccination approach under high vaccine availability. Ring vaccination, while generally the most effective strategy for reducing infections, ultimately proved least effective at preventing deaths. We also consider using age group as a practical surrogate measure for actual disease-risk targeting; this approach also outperforms both simple mass distribution and ring vaccination. We find that quantitative effectiveness of a strategy depends on whether effectiveness is assessed after the alpha, delta, or omicron wave. However, these differences in absolute benefit for the strategies do not change the ranking of their performance at preventing severe outcomes across vaccine availability assumptions. Author Summary: We use our agent-based model of SARS-CoV-2 transmission to evaluate alternative vaccine distribution strategies over a range of vaccine supply scenarios. We find that strategies targeting transmission (e.g., ring vaccination) perform best in preventing infections, but targeting disease risk prevents more instances of severe outcomes. Specifically, strategies based on age, or age and comorbidities—which do not require contact tracing—resulted in the fewest hospitalizations in our model. These strategy rankings held true across all vaccine supply scenarios and were robust to the introduction of SARS-CoV-2 variants. While the quantitative results cannot be directly applied to other settings (as we used a synthetic population calibrated to the State of Florida), the rankings of strategies should be more generalizable. [ABSTRACT FROM AUTHOR]
- Published
- 2024
- Full Text
- View/download PDF
5. Phylotastic! Making tree-of-life knowledge accessible, reusable and convenient
- Author
-
Stoltzfus, Arlin, Lapp, Hilmar, Matasci, Naim, Deus, Helena, Sidlauskas, Brian, Zmasek, Christian M, Vaidya, Gaurav, Pontelli, Enrico, Cranston, Karen, Vos, Rutger, Webb, Campbell O, Harmon, Luke J, Pirrung, Megan, O'Meara, Brian, Pennell, Matthew W, Mirarab, Siavash, Rosenberg, Michael S, Balhoff, James P, Bik, Holly M, Heath, Tracy A, Midford, Peter E, Brown, Joseph W, McTavish, Emily Jane, Sukumaran, Jeet, Westneat, Mark, Alfaro, Michael E, Steele, Aaron, and Jordan, Greg
- Abstract
Abstract Background Scientists rarely reuse expert knowledge of phylogeny, in spite of years of effort to assemble a great “Tree of Life” (ToL). A notable exception involves the use of Phylomatic, which provides tools to generate custom phylogenies from a large, pre-computed, expert phylogeny of plant taxa. This suggests great potential for a more generalized system that, starting with a query consisting of a list of any known species, would rectify non-standard names, identify expert phylogenies containing the implicated taxa, prune away unneeded parts, and supply branch lengths and annotations, resulting in a custom phylogeny suited to the user’s needs. Such a system could become a sustainable community resource if implemented as a distributed system of loosely coupled parts that interact through clearly defined interfaces. Results With the aim of building such a “phylotastic” system, the NESCent Hackathons, Interoperability, Phylogenies (HIP) working group recruited 2 dozen scientist-programmers to a weeklong programming hackathon in June 2012. During the hackathon (and a three-month follow-up period), 5 teams produced designs, implementations, documentation, presentations, and tests including: (1) a generalized scheme for integrating components; (2) proof-of-concept pruners and controllers; (3) a meta-API for taxonomic name resolution services; (4) a system for storing, finding, and retrieving phylogenies using semantic web technologies for data exchange, storage, and querying; (5) an innovative new service, DateLife.org, which synthesizes pre-computed, time-calibrated phylogenies to assign ages to nodes; and (6) demonstration projects. These outcomes are accessible via a public code repository (GitHub.com), a website (http://www.phylotastic.org), and a server image. Conclusions Approximately 9 person-months of effort (centered on a software development hackathon) resulted in the design and implementation of proof-of-concept software for 4 core phylotastic components, 3 controllers, and 3 end-user demonstration tools. While these products have substantial limitations, they suggest considerable potential for a distributed system that makes phylogenetic knowledge readily accessible in computable form. Widespread use of phylotastic systems will create an electronic marketplace for sharing phylogenetic knowledge that will spur innovation in other areas of the ToL enterprise, such as annotation of sources and methods and third-party methods of quality assessment.
- Published
- 2013
6. Intron "Sliding" and the Diversity of Intron Positions
- Author
-
Stoltzfus, Arlin, Logsdon, John M., Palmer, Jeffrey D., and Doolittle, W. Ford
- Published
- 1997
7. Mutation and Selection Induce Correlations between Selection Coefficients and Mutation Rates.
- Author
-
Gitschlag, Bryan L., Cano, Alejandro V., Payne, Joshua L., McCandlish, David M., and Stoltzfus, Arlin
- Subjects
MODULES (Algebra) ,GENETIC mutation - Abstract
The joint distribution of selection coefficients and mutation rates is a key determinant of the genetic architecture of molecular adaptation. Three different distributions are of immediate interest: (1) the "nominal" distribution of possible changes, prior to mutation or selection; (2) the "de novo" distribution of realized mutations; and (3) the "fixed" distribution of selectively established mutations. Here, we formally characterize the relationships between these joint distributions under the strong-selection/weak-mutation (SSWM) regime. The de novo distribution is enriched relative to the nominal distribution for the highest rate mutations, and the fixed distribution is further enriched for the most highly beneficial mutations. Whereas mutation rates and selection coefficients are often assumed to be uncorrelated, we show that even with no correlation in the nominal distribution, the resulting de novo and fixed distributions can have correlations with any combination of signs. Nonetheless, we suggest that natural systems with a finite number of beneficial mutations will frequently have the kind of nominal distribution that induces negative correlations in the fixed distribution. We apply our mathematical framework, along with population simulations, to explore joint distributions of selection coefficients and mutation rates from deep mutational scanning and cancer informatics. Finally, we consider the evolutionary implications of these joint distributions together with two additional joint distributions relevant to parallelism and the rate of adaptation. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
8. Mutation-biased adaptation in Andean house wrens
- Author
-
Stoltzfus, Arlin and McCandlish, David M.
- Published
- 2015
9. Mendelian-Mutationism: The Forgotten Evolutionary Synthesis
- Author
-
STOLTZFUS, ARLIN and CABLE, KELE
- Published
- 2014
10. Modeling Evolution Using the Probability of Fixation: History and Implications
- Author
-
McCandlish, David M. and Stoltzfus, Arlin
- Published
- 2014
- Full Text
- View/download PDF
11. Mutational Biases Influence Parallel Adaptation
- Author
-
Stoltzfus, Arlin and McCandlish, David M.
- Published
- 2017
- Full Text
- View/download PDF
12. Mutation bias and the predictability of evolution.
- Author
-
Cano, Alejandro V., Gitschlag, Bryan L., Rozhoňová, Hana, Stoltzfus, Arlin, McCandlish, David M., and Payne, Joshua L.
- Subjects
GENETIC mutation ,COMMUNICABLE diseases ,POPULATION genetics - Abstract
Predicting evolutionary outcomes is an important research goal in a diversity of contexts. The focus of evolutionary forecasting is usually on adaptive processes, and efforts to improve prediction typically focus on selection. However, adaptive processes often rely on new mutations, which can be strongly influenced by predictable biases in mutation. Here, we provide an overview of existing theory and evidence for such mutation-biased adaptation and consider the implications of these results for the problem of prediction, in regard to topics such as the evolution of infectious diseases, resistance to biochemical agents, as well as cancer and other kinds of somatic evolution. We argue that empirical knowledge of mutational biases is likely to improve in the near future, and that this knowledge is readily applicable to the challenges of short-term prediction. This article is part of the theme issue 'Interdisciplinary approaches to predicting evolutionary biology'. [ABSTRACT FROM AUTHOR]
- Published
- 2023
- Full Text
- View/download PDF
13. NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata
- Author
-
Vos, Rutger A., Balhoff, James P., Caravas, Jason A., Holder, Mark T., Lapp, Hilmar, Maddison, Wayne P., Midford, Peter E., Priyam, Anurag, Sukumaran, Jeet, Xia, Xuhua, and Stoltzfus, Arlin
- Published
- 2012
14. Testing the Exon Theory of Genes: The Evidence from Protein Structure
- Author
-
Stoltzfus, Arlin, Spencer, David F., Zuker, Michael, Logsdon, John M., and Doolittle, W. Ford
- Published
- 1994
15. Introns and the Origin of Protein-Coding Genes
- Author
-
Senapathy, Periannan, Bertolaet, Bonnie L., Seidel, H. Martin, Knowles, Jeremy R., Stoltzfus, Arlin, Spencer, David F., Zuker, Michael, Logsdon, John M., and Doolittle, W. Ford
- Published
- 1995
16. Gene identification in black cohosh (Actaea racemosa L.): expressed sequence tag profiling and genetic screening yields candidate genes for production of bioactive secondary metabolites
- Author
-
Spiering, Martin J., Urban, Lori A., Nuss, Donald L., Gopalan, Vivek, Stoltzfus, Arlin, and Eisenstein, Edward
- Published
- 2011
- Full Text
- View/download PDF
17. Amino Acid Exchangeability and the Adaptive Code Hypothesis
- Author
-
Stoltzfus, Arlin and Yampolsky, Lev Y.
- Published
- 2007
- Full Text
- View/download PDF
18. The exchangeability of amino acids in proteins
- Author
-
Yampolsky, Lev Y. and Stoltzfus, Arlin
- Subjects
Amino acids -- Research ,Genetics -- Research ,Proteins -- Research ,Biological sciences - Abstract
The comparative analysis of protein sequences depends crucially on measures of amino acid similarity or distance. Many such measures exist, yet it is not known how well these measures reflect the operational exchangeability of amino acids in proteins, since most are derived by methods that confound a variety of effects, including effects of mutation. In pursuit of a pure measure of exchangeability, we present (1) a compilation of data on the effects of 9671 amino acid exchanges engineered and assayed in a set of 12 proteins; (2) a statistical procedure to combine results from diverse assays of exchange effects; (3) a matrix of 'experimental exchangeability' values E[X.sub.ij] derived from applying this procedure to the compiled data; and (4) a set of three tests designed to evaluate the power of an exchangeability measure to (i) predict the effects of amino acid exchanges in the laboratory, (ii) account for the disease-causing potential of missense mutations in the human population, and (iii) model the probability of fixation of missense mutations in evolution. EX not only captures useful information on exchangeability while remaining free of other effects, but also outperforms all measures tested except for the best-performing alignment scoring matrix, which is comparable in performance.
- Published
- 2005
19. On the Possibility of Constructive Neutral Evolution
- Author
-
Stoltzfus, Arlin
- Published
- 1999
- Full Text
- View/download PDF
20. Climbing Mount Probable: Mutation as a Cause of Nonrandomness in Evolution
- Author
-
Stoltzfus, Arlin and Yampolsky, Lev Y.
- Published
- 2009
21. Mutation bias shapes the spectrum of adaptive substitutions.
- Author
-
Cano, Alejandro V., Rozhoňová, Hana, Stoltzfus, Arlin, McCandlishe, David M., and Payne, Joshua L.
- Subjects
GENETIC mutation ,MYCOBACTERIUM tuberculosis ,SACCHAROMYCES cerevisiae ,ESCHERICHIA coli ,AMINO acids - Abstract
Evolutionary adaptation often occurs by the fixation of beneficial mutations. This mode of adaptation can be characterized quantitatively by a spectrum of adaptive substitutions, i.e., a distribution for types of changes fixed in adaptation. Recent work establishes that the changes involved in adaptation reflect common types of mutations, raising the question of how strongly the mutation spectrum shapes the spectrum of adaptive substitutions. We address this question with a codon-based model for the spectrum of adaptive amino acid substitutions, applied to three large datasets covering thousands of amino acid changes identified in natural and experimental adaptation in Saccharomyces cerevisiae, Escherichia coli, and Mycobacterium tuberculosis. Using speciesspecific mutation spectra based on prior knowledge, we find that the mutation spectrum has a proportional influence on the spectrum of adaptive substitutions in all three species. Indeed, we find that by inferring the mutation rates that best explain the spectrum of adaptive substitutions, we can accurately recover the species-specific mutation spectra. However, we also find that the predictive power of the model differs substantially between the three species. To better understand these differences, we use population simulations to explore the factors that influence how closely the spectrum of adaptive substitutions mirrors the mutation spectrum. The results show that the influence of the mutation spectrum decreases with increasing mutational supply (Nμ) and that predictive power is strongly affected by the number and diversity of beneficial mutations. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
22. A Sequence-Based Model Accounts Largely for the Relationship of Intron Positions to Protein Structural Features
- Author
-
De Kee, Danny W., Gopalan, Vivek, and Stoltzfus, Arlin
- Published
- 2007
23. Mutation-Biased Adaptation in a Protein NK Model
- Author
-
Stoltzfus, Arlin
- Published
- 2006
24. Nexplorer: phylogeny-based exploration of sequence family data
- Author
-
Gopalan, Vivek, Qiu, Wei-Gang, Chen, Michael Z., and Stoltzfus, Arlin
- Published
- 2006
25. The Evolutionary Gain of Spliceosomal Introns: Sequence and Phase Preferences
- Author
-
Qiu, Wei-Gang, Schisler, Nick, and Stoltzfus, Arlin
- Published
- 2004
26. Sharing and re-use of phylogenetic trees (and associated data) to facilitate synthesis
- Author
-
Stoltzfus Arlin, O'Meara Brian, Whitacre Jamie, Mounce Ross, Gillespie Emily L, Kumar Sudhir, Rosauer Dan F, and Vos Rutger A
- Subjects
Evolution ,Phylogeny ,Data sharing ,Bioinformatics ,Phyloinformatics ,Standards ,Medicine ,Biology (General) ,QH301-705.5 ,Science (General) ,Q1-390 - Abstract
Abstract Background Recently, various evolution-related journals adopted policies to encourage or require archiving of phylogenetic trees and associated data. Such attention to practices that promote sharing of data reflects rapidly improving information technology, and rapidly expanding potential to use this technology to aggregate and link data from previously published research. Nevertheless, little is known about current practices, or best practices, for publishing trees and associated data so as to promote re-use. Findings Here we summarize results of an ongoing analysis of current practices for archiving phylogenetic trees and associated data, current practices of re-use, and current barriers to re-use. We find that the technical infrastructure is available to support rudimentary archiving, but the frequency of archiving is low. Currently, most phylogenetic knowledge is not easily re-used due to a lack of archiving, lack of awareness of best practices, and lack of community-wide standards for formatting data, naming entities, and annotating data. Most attempts at data re-use seem to end in disappointment. Nevertheless, we find many positive examples of data re-use, particularly those that involve customized species trees generated by grafting to, and pruning from, a much larger tree. Conclusions The technologies and practices that facilitate data re-use can catalyze synthetic and integrative research. However, success will require engagement from various stakeholders including individual scientists who produce or consume shareable data, publishers, policy-makers, technology developers and resource-providers. The critical challenges for facilitating re-use of phylogenetic trees and associated data, we suggest, include: a broader commitment to public archiving; more extensive use of globally meaningful identifiers; development of user-friendly technology for annotating, submitting, searching, and retrieving data and their metadata; and development of a minimum reporting standard (MIAPA) indicating which kinds of data and metadata are most important for a re-useable phylogenetic record.
- Published
- 2012
- Full Text
- View/download PDF
27. Constructive neutral evolution: exploring evolutionary theory’s curious disconnect
- Author
-
Stoltzfus Arlin
- Subjects
Evolutionary theory ,Constructive neutral evolution ,Neo-Darwinism ,Mutation ,Evolutionary genetics ,Mutation bias ,Modern Synthesis ,Biology (General) ,QH301-705.5 - Abstract
Abstract Constructive neutral evolution (CNE) suggests that neutral evolution may follow a stepwise path to extravagance. Whether or not CNE is common, the mere possibility raises provocative questions about causation: in classical neo-Darwinian thinking, selection is the sole source of creativity and direction, the only force that can cause trends or build complex features. However, much of contemporary evolutionary genetics departs from the conception of evolution underlying neo-Darwinism, resulting in a widening gap between what formal models allow, and what the prevailing view of the causes of evolution suggests. In particular, a mutationist conception of evolution as a 2-step origin-fixation process has been a source of theoretical innovation for 40 years, appearing not only in the Neutral Theory, but also in recent breakthroughs in modeling adaptation (the “mutational landscape” model), and in practical software for sequence analysis. In this conception, mutation is not a source of raw materials, but an agent that introduces novelty, while selection is not an agent that shapes features, but a stochastic sieve. This view, which now lays claim to important theoretical, experimental, and practical results, demands our attention. CNE provides a way to explore its most significant implications about the role of variation in evolution. Reviewers Alex Kondrashov, Eugene Koonin and Johann Peter Gogarten reviewed this article.
- Published
- 2012
- Full Text
- View/download PDF
28. Bio::NEXUS: a Perl API for the NEXUS format for comparative biological data
- Author
-
Liang Chengzhi, Gopalan Vivek, Hladish Thomas, Qiu Weigang, Yang Peter, and Stoltzfus Arlin
- Subjects
Computer applications to medicine. Medical informatics ,R858-859.7 ,Biology (General) ,QH301-705.5 - Abstract
Abstract Background Evolutionary analysis provides a formal framework for comparative analysis of genomic and other data. In evolutionary analysis, observed data are treated as the terminal states of characters that have evolved (via transitions between states) along the branches of a tree. The NEXUS standard of Maddison, et al. (1997; Syst. Biol. 46: 590–621) provides a portable, expressive and flexible text format for representing character-state data and trees. However, due to its complexity, NEXUS is not well supported by software and is not easily accessible to bioinformatics users and developers. Results Bio::NEXUS is an application programming interface (API) implemented in Perl, available from CPAN and SourceForge. The 22 Bio::NEXUS modules define 351 methods in 4229 lines of code, with 2706 lines of POD (Plain Old Documentation). Bio::NEXUS provides an object-oriented interface to reading, writing and manipulating the contents of NEXUS files. It closely follows the extensive explanation of the NEXUS format provided by Maddison et al., supplemented with a few extensions such as support for the NHX (New Hampshire Extended) tree format. Conclusion In spite of some limitations owing to the complexity of NEXUS files and the lack of a formal grammar, NEXUS will continue to be useful for years to come. Bio::NEXUS provides a user-friendly API for NEXUS supplemented with an extensive set of methods for manipulations such as re-rooting trees and selecting subsets of data. Bio::NEXUS can be used as glue code for connecting existing software that uses NEXUS, or as a framework for new applications.
- Published
- 2007
- Full Text
- View/download PDF
29. Phylotastic: Improving Access to Tree-of-Life Knowledge With Flexible, on-the-Fly Delivery of Trees.
- Author
-
Nguyen, Van D, Nguyen, Thanh H, Tayeen, Abu Saleh Md, Laughinghouse IV, H Dail, Sánchez-Reyes, Luna L, Wiggins, Jodie, Pontelli, Enrico, Mozzherin, Dmitry, O'Meara, Brian, and Stoltzfus, Arlin
- Subjects
PYTHON programming language ,GEOLOGICAL time scales ,CARTOGRAPHY software ,WEB portals ,WEB services ,TREES - Abstract
A comprehensive phylogeny of species, i.e., a tree of life, has potential uses in a variety of contexts, including research, education, and public policy. Yet, accessing the tree of life typically requires special knowledge, complex software, or long periods of training. The Phylotastic project aims make it as easy to get a phylogeny of species as it is to get driving directions from mapping software. In prior work, we presented a design for an open system to validate and manage taxon names, find phylogeny resources, extract subtrees matching a user's taxon list, scale trees to time, and integrate related resources such as species images. Here, we report the implementation of a set of tools that together represent a robust, accessible system for on-the-fly delivery of phylogenetic knowledge. This set of tools includes a web portal to execute several customizable workflows to obtain species phylogenies (scaled by geologic time and decorated with thumbnail images); more than 30 underlying web services (accessible via a common registry); and code toolkits in R and Python (allowing others to develop custom applications using Phylotastic services). The Phylotastic system, accessible via http://www.phylotastic.org, provides a unique resource to access the current state of phylogenetic knowledge, useful for a variety of cases in which a tree extracted quickly from online resources (as distinct from a tree custom-made from character data) is sufficient, as it is for many casual uses of trees identified here. [ABSTRACT FROM AUTHOR]
- Published
- 2020
- Full Text
- View/download PDF
30. Publishing re-usable phylogenetic trees, in theory and practice
- Author
-
O’Meara, Brian, Whitacre, Jamie, Mounce, Ross, Rosauer, Dan, Vos, Rutger, and Stoltzfus, Arlin
- Published
- 2011
- Full Text
- View/download PDF
31. EvoIO: Community-driven standards for sustainable interoperability
- Author
-
Stoltzfus, Arlin, Cranston, Karen, Lapp, Hilmar, McKay, Sheldon, Pontelli, Enrico, Vos, Rutger, and Cellinese, Nico
- Published
- 2010
- Full Text
- View/download PDF
32. NeXML: Rich, Extensible, and Verifiable Representation of Comparative Data and Metadata
- Author
-
Lapp, Hilmar, Sukumaran, Jeet, Caravas, Jason A., Maddison, Wayne P., Stoltzfus, Arlin, Xia, Xuhua, Balhoff, James P., Midford, Peter E., Priyam, Anurag, Holder, Mark T., and Vos, Rutger A.
- Abstract
In scientific research, integration and synthesis require a common understanding of where data come from, how much they can be trusted, and what they may be used for. To make such an understanding computer-accessible requires standards for exchanging richly annotated data. The challenges of conveying reusable data are particularly acute in regard to evolutionary comparative analysis, which comprises an ever-expanding list of data types, methods, research aims, and subdisciplines. To facilitate interoperability in evolutionary comparative analysis, we present NeXML, an XML standard (inspired by the current standard, NEXUS) that supports exchange of richly annotated comparative data. NeXML defines syntax for operational taxonomic units, character-state matrices, and phylogenetic trees and networks. Documents can be validated unambiguously. Importantly, any data element can be annotated, to an arbitrary degree of richness, using a system that is both flexible and rigorous. We describe how the use of NeXML by the TreeBASE and Phenoscape projects satisfies user needs that cannot be satisfied with other available file formats. By relying on XML Schema Definition, the design of NeXML facilitates the development and deployment of software for processing, transforming, and querying documents. The adoption of NeXML for practical use is facilitated by the availability of (1) an online manual with code samples and a reference to all defined elements and attributes, (2) programming toolkits in most of the languages used commonly in evolutionary informatics, and (3) input–output support in several widely used software applications. An active, open, community-based development process enables future revision and expansion of NeXML.
- Published
- 2012
- Full Text
- View/download PDF
33. Publishing re-usable phylogenetic trees, in theory and practice
- Author
-
Stoltzfus, Arlin, O'Meara, Brian, Whitacre, Jamie, Mounce, Ross, Rosauer, Dan, and Vos, Rutger
- Subjects
Evolutionary Biology ,Bioinformatics ,Computer science ,business.industry ,media_common.quotation_subject ,Data Standards ,Nature of Science ,Appropriate technology ,Reuse ,Data science ,Unique identifier ,Metadata ,World Wide Web ,Documentation ,Publishing ,Quality (business) ,General Materials Science ,business ,media_common - Abstract
Sharing and re-use of data are essential to the progressive and self-correcting nature of science. In recognition of this principle, journals and funding agencies have adopted policies to encourage sharing of information ('data'), including empirical data as well as computed inferences such as phylogenetic trees.Here we summarize an ongoing analysis of 1) current practices for sharing phylogenetic trees and associated data; 2) current barriers to effective sharing and reuse of such data; and 3) prospects for reducing these barriers to promote more widespread sharing and re-use. Currently, the technical infrastructure is available to support (with some limitations) rudimentary archiving in conjunction with manuscript publication. Yet, most published trees are not archived, and there is no community standard governing the recommended format or content to ensure a re-usable phylogenetic record. Without a shift in emphasis toward re-usability, along with technology and standards to support such a shift, the value of trees (whether disseminated via public archives, or by other means) will be limited. Interviews with actual or potential secondary consumers of phylogenetic results suggest that there is a considerable market for re-use, but that most attempts end in disappointment. Phylogenetic results available via author requests, journal web sites, archival repositories and project web sites rarely include the critical information that secondary consumers seek, such as unique identifiers for biological sources (including species sources and accession numbers), indicators of quality, and documentation of the analytical methods used to obtain the results.Based on the analysis presented here, we suggest that enabling effective re-use entails a commitment by the research community to several changes from current practice: 1) using globally unique identifiers (GUIDs) to reference informational and material entities; 2) developing and using technology for documenting and exchanging the metadata that facilitate re-use; and 3) supporting development and use of a minimal reporting standard that indicates what data and metadata are considered essential for a re-useable phylogenetic record. We suggest that re-use may be catalyzed most rapidly by identifying and targeting (with appropriate technology) the most promising circumstances for re-use. These might include the extraction of sub-trees from large trees (for use in reconciliation, classification, and comparative analysis); the re-use of seed alignments, sub-alignments and homologized characters; the linking of phylogenies to geographic information (for use in ecology, phylogeography and biogeography); and the construction of supertrees and supermatrices.
- Published
- 2011
34. Why we don't want another "Synthesis".
- Author
-
Stoltzfus, Arlin
- Subjects
- *
CHEMICAL synthesis , *BIOLOGICAL evolution , *GENETIC mutation , *ALLELES , *POPULATION genetics - Abstract
High-level debates in evolutionary biology often treat the Modern Synthesis as a framework of population genetics, or as an intellectual lineage with a changing distribution of beliefs. Unfortunately, these flexible notions, used to negotiate decades of innovations, are now thoroughly detached from their historical roots in the original Modern Synthesis (OMS), a falsifiable scientific theory. The OMS held that evolution can be adequately understood as a process of smooth adaptive change by shifting the frequencies of small-effect alleles at many loci simultaneously, without the direct involvement of new mutations. This shifting gene frequencies theory was designed to support a Darwinian view in which the course of evolution is governed by selection, and to exclude a mutation-driven view in which the timing and character of evolutionary change may reflect the timing and character of events of mutation. The OMS is not the foundation of current thinking, but a special case of a broader conception that includes (among other things) a mutation-driven view introduced by biochemists in the 1960s, and now widely invoked. This innovation is evident in mathematical models relating the rate of evolution directly to the rate of mutation, which emerged in 1969, and now represent a major branch of theory with many applications. In evo-devo, mutationist thinking is reflected by a concern for the "arrival of the fittest". Though evolutionary biology is not governed by any master theory, and incorporates views excluded from the OMS, the recognition of these changes has been hindered by woolly conceptions of theories, and by historical accounts, common in the evolutionary literature, that misrepresent the disputes that defined the OMS. [ABSTRACT FROM AUTHOR]
- Published
- 2017
- Full Text
- View/download PDF
35. On the Causes of Evolutionary Transition:Transversion Bias.
- Author
-
Stoltzfus, Arlin and Norris, Ryan W.
- Abstract
A pattern in which nucleotide transitions are favored several fold over transversions is common in molecular evolution. When this pattern occurs among amino acid replacements, explanations often invoke an effect of selection, on the grounds that transitions are more conservative in their effects on proteins. However, the underlying hypothesis of conservative transitions has never been tested directly. Here we assess support for this hypothesis using direct evidence: the fitness effects of mutations in actual proteins measured via individual or paired growth experiments. We assembled data from 8 published studies, ranging in size from 24 to 757 single-nucleotide mutations that change an amino acid. Every study has the statistical power to reveal significant effects of amino acid exchangeability, and most studies have the power to discern a binary conservative-vs-radical distinction. However, only one study suggests that transitions are significantly more conservative than transversions. In the combined set of 1,239 replacements (544 transitions, 695 transversions), the chance that a transition is more conservative than a transversion is 53 % (95 % confidence interval 50 to 56) compared with the null expectation of 50 %. We show that this effect is not large compared with that of most biochemical factors, and is not large enough to explain the several-fold bias observed in evolution. In short, the available data have the power to verify the "conservative transitions" hypothesis if true, but suggest instead that selection on proteins plays at best a minor role in the observed bias. [ABSTRACT FROM AUTHOR]
- Published
- 2016
- Full Text
- View/download PDF
36. PhylOnt: A domain-specific ontology for phylogeny analysis.
- Author
-
Panahiazar, Maryam, Ranabahu, Ajith, Taslimi, Vahid, Yalamanchili, Hima, Stoltzfus, Arlin, Leebens-Mack, Jim, and Sheth, Amit P.
- Abstract
Phylogenetic analyses can resolve historical relationships among genes, organisms or higher taxa. Understanding such relationships can elucidate a wide range of biological phenomena including the role of adaptation as a driver of diversification, the importance of gene and genome duplications in the evolution gene function, or the evolutionary consequences of biogeographic shifts. The variety of methods of analysis and data types typically employed in phylogenetic analyses can pose challenges for semantic reasoning due to significant representational and computational complexity. These challenges could be ameliorated with the development of an ontology designed to capture and organize the variety of concepts used to describe phylogenetic data, methods of analysis and the results of phylogenetic analyses. In this paper, we discuss the development of PhylOnt — an ontology for phylogenetic analyses, which establishes a foundation for semantics-based workflows including meta-analyses of phylogentic data and trees. PhylOnt is an extensible ontology, which describes the methods employed to estimate trees given a data matrix, models and programs used for phylogenetic analysis and descriptions of phylogenetic trees including branch-length information and support values. The relational vocabulary included in PhylOnt will facilitate the integration of heterogeneous data types derived from both structured and unstructured sources. To illustrate the utility of PhylOnt, we annotated scientific literature to support semantic search. The semantic annotations can subsequently support workflows that requiring the exchange and integration of heterogeneous phylogenetic information. [ABSTRACT FROM PUBLISHER]
- Published
- 2012
- Full Text
- View/download PDF
37. Population Diversity of ORFan Genes in Escherichia coli.
- Author
-
Yu, Guoqin and Stoltzfus, Arlin
- Subjects
- *
ESCHERICHIA coli , *BACTERIAL genetics , *BACTERIAL population genetics , *SHIGELLA , *GENETIC mutation , *GENETICS of bacterial diversity - Abstract
The origin and evolution of “ORFans” (suspected genes without known relatives) remain unclear. Here, we take advantage of a unique opportunity to examine the population diversity of thousands of ORFans, based on a collection of 35 complete genomes of isolates of Escherichia coli and Shigella (which is included phylogenetically within E. coli). As expected from previous studies, ORFans are shorter and AT-richer in sequence than non-ORFans. We find that ORFans often are very narrowly distributed: the most common pattern is for an ORFan to be found in only one genome. We compared within-species population diversity of ORFan genes with those of two control groups of non-ORFan genes. Patterns of population variation suggest that most ORFans are not artifacts, but encode real genes whose protein-coding capacity is conserved, reflecting selection against nonsynonymous mutations. Nevertheless, nonsynonymous nucleotide diversity is higher than for non-ORFans, whereas synonymous diversity is roughly the same. In particular, there is a several-fold excess of ORFans in the highest decile of diversity relative to controls, which might be due to weaker purifying selection, positive selection, or a subclass of ORFans that are decaying. [ABSTRACT FROM AUTHOR]
- Published
- 2012
- Full Text
- View/download PDF
38. Bio::NEXUS: a Perl API for the NEXUS format for comparative biological data.
- Author
-
Hladish, Thomas, Gopalan, Vivek, Chengzhi Liang, Weigang Qiu, Yang, Peter, and Stoltzfus, Arlin
- Subjects
BIOINFORMATICS ,COMPUTERS in biology ,USER interfaces ,COMPUTER files ,INFORMATION resources management - Abstract
Background: Evolutionary analysis provides a formal framework for comparative analysis of genomic and other data. In evolutionary analysis, observed data are treated as the terminal states of characters that have evolved (via transitions between states) along the branches of a tree. The NEXUS standard of Maddison, et al. (1997; Syst. Biol. 46: 590-621) provides a portable, expressive and flexible text format for representing character-state data and trees. However, due to its complexity, NEXUS is not well supported by software and is not easily accessible to bioinformatics users and developers. Results: Bio::NEXUS is an application programming interface (API) implemented in Perl, available from CPAN and SourceForge. The 22 Bio::NEXUS modules define 351 methods in 4229 lines of code, with 2706 lines of POD (Plain Old Documentation). Bio::NEXUS provides an object-oriented interface to reading, writing and manipulating the contents of NEXUS files. It closely follows the extensive explanation of the NEXUS format provided by Maddison et al., supplemented with a few extensions such as support for the NHX (New Hampshire Extended) tree format. Conclusion: In spite of some limitations owing to the complexity of NEXUS files and the lack of a formal grammar, NEXUS will continue to be useful for years to come. Bio::NEXUS provides a user-friendly API for NEXUS supplemented with an extensive set of methods for manipulations such as re-rooting trees and selecting subsets of data. Bio::NEXUS can be used as glue code for connecting existing software that uses NEXUS, or as a framework for new applications. [ABSTRACT FROM AUTHOR]
- Published
- 2007
- Full Text
- View/download PDF
39. Mutationism and the dual causation of evolutionary change.
- Author
-
Stoltzfus, Arlin
- Subjects
- *
GENETICS , *GENETIC mutation , *MOLECULAR evolution , *BIOLOGICAL variation , *CAUSATION (Philosophy) , *BIOLOGICAL evolution - Abstract
The rediscovery of Mendel's laws a century ago launched the science that William Bateson called “genetics,” and led to a new view of evolution combining selection, particulate inheritance, and the newly characterized phenomenon of “mutation.” This “mutationist” view clashed with the earlier view of Darwin, and the later “Modern Synthesis,” by allowing discontinuity, and by recognizing mutation (or more properly, mutation-and-altered-development) as a source of creativity, direction, and initiative. By the mid-20th century, the opposing Modern Synthesis view was a prevailing orthodoxy: under its influence, “evolution” was redefined as “shifting gene frequencies,” that is, the sorting out of pre-existing variation without new mutations; and the notion that mutation-and-altered-development can exert a predictable influence on the course of evolutionary change was seen as heretical. Nevertheless, mutationist ideas re-surfaced: the notion of mutational determinants of directionality emerged in molecular evolution by 1962, followed in the 1980s by an interest among evolutionary developmental biologists in a shaping or creative role of developmental propensities of variation, and more recently, a recognition by theoretical evolutionary geneticists of the importance of discontinuity and of new mutations in adaptive dynamics. The synthetic challenge presented by these innovations is to integrate mutation-and-altered-development into a new understanding of the dual causation of evolutionary change—a broader and more predictive understanding that already can lay claim to important empirical and theoretical results—and to develop a research program appropriately emphasizing the emergence of variation as a cause of propensities of evolutionary change. [ABSTRACT FROM AUTHOR]
- Published
- 2006
- Full Text
- View/download PDF
40. The Evolutionary Gain of Spliceosomal Introns: Sequence and Phase Preferences.
- Author
-
Wei-Gang Qiu, Schisler, Nick, and Stoltzfus, Arlin
- Published
- 2004
- Full Text
- View/download PDF
41. Bias in the introduction of variation as an orienting factor in evolution.
- Author
-
Yampolsky, Lev Y. and Stoltzfus, Arlin
- Subjects
- *
BIOLOGICAL variation , *BIOLOGICAL evolution , *POPULATION genetics - Abstract
SUMMARY According to New Synthesis doctrine, the direction of evolution is determined by selection and not by “internal causes” that act by way of propensities of variation. This doctrine rests on the theoretical claim that because mutation rates are small in comparison to selection coefficients, mutation is powerless to overcome opposing selection. Using a simple population-genetic model, this claim is shown to depend on assuming the prior availability of variation, so that mutation may act only as a “pressure” on the frequencies of existing alleles, and not as the evolutionary process that introduces novelty. As shown here, mutational bias in the introduction of novelty can strongly influence the course of evolution, even when mutation rates are small in comparison to selection coefficients. Recognizing this mode of causation provides a distinct mechanistic basis for an “internalist” approach to determining the contribution of mutational and developmental factors to evolutionary phenomena such as homoplasy, parallelism, and directionality. [ABSTRACT FROM AUTHOR]
- Published
- 2001
- Full Text
- View/download PDF
42. Intron `sliding' and the diversity of intron positions.
- Author
-
Stoltzfus, Arlin and Logsdon, John M.
- Subjects
- *
INTRONS - Abstract
Presents information on the intron `sliding' and the diversity of intron positions. Information on the locations of introns in homologous genes; Reference to the arguments of intron sliding; Suggestions of results.
- Published
- 1997
- Full Text
- View/download PDF
43. In search of mutation-driven evolution Mutation-Driven Evolution.
- Author
-
Stoltzfus, Arlin
- Subjects
- *
DEVELOPMENTAL biology , *NONFICTION - Abstract
The article reviews the book "Mutation?Driven Evolution," by Masatoshi Nei.
- Published
- 2014
- Full Text
- View/download PDF
44. Evidence for a predominant role of oxidative damage in germline mutation in mammals
- Author
-
Stoltzfus, Arlin
- Subjects
- *
BIOCHEMICAL genetics , *FREE radicals , *DNA damage , *GENETICS - Abstract
Abstract: Spontaneous copying errors in replication often are assumed to be the main source of germline mutations in humans and other mammals. However, when laboratory data on context-dependent patterns of oxidative DNA damage are compared with patterns of mutation inferred from mammalian sequence evolution, the strength of the correlation suggests that damage is the main source of mutations. Analysis of damage susceptibility holds promise for improving models of mutational specificity. [Copyright &y& Elsevier]
- Published
- 2008
- Full Text
- View/download PDF
45. Molecular Evolution: Introns Fall into Place
- Author
-
Stoltzfus, Arlin
- Subjects
- *
MOLECULAR evolution , *INTRONS , *SPLIT genes , *GENES - Abstract
The evolutionary origin of spliceosomal introns remains elusive. The startling success of a new way of predicting intron sites suggests that the splicing machinery determines where introns are added to genes. [Copyright &y& Elsevier]
- Published
- 2004
- Full Text
- View/download PDF
46. Genes-in-pieces revisited.
- Author
-
Doolittle, W. Ford and Stoltzfus, Arlin
- Subjects
- *
GENETICS - Abstract
Comments on a study by Tittiger et al published in the February 4, 1993 issue of `Nature' which describes an intron in the triosephosphate isomerase (TIM or TPI) gene of the mosquito Culex tarsalis, an intron that neatly subdivides an exon that would otherwise encode two protein `modules.' Confirmation of a prediction by Gilbert et al; `Introns early' theory of gene evolution; History of introns; Explaining split genes (`introns late'); More.
- Published
- 1993
- Full Text
- View/download PDF
47. A fitness distribution law for amino-acid replacements.
- Author
-
Sun M, Stoltzfus A, and McCandlish DM
- Abstract
The effect of replacing the amino acid at a given site in a protein is difficult to predict. Yet, evolutionary comparisons have revealed highly regular patterns of interchangeability between pairs of amino acids, and such patterns have proved enormously useful in a range of applications in bioinformatics, evolutionary inference, and protein design. Here we reconcile these apparently contradictory observations using fitness data from over 350,000 experimental amino acid replacements. Almost one-quarter of the 20 × 19 = 380 types of replacements have broad distributions of fitness effects (DFEs) that closely resemble the background DFE for random changes, indicating an overwhelming influence of protein context in determining mutational effects. However, we also observe that the 380 pair-specific DFEs closely follow a maximum entropy distribution, specifically a truncated exponential distribution. The shape of this distribution is determined entirely by its mean, which is equivalent to the chance that a replacement of the given type is fitter than a random replacement. In this type of distribution, modest deviations in the mean correspond to much larger changes in the probability of falling in the far right tail, so that modest differences in mean exchangeability may result in much larger differences in the chance of a highly fit mutation. Indeed, we show that under the assumption that purifying selection filters out the vast majority of mutations, the maximum entropy distributions of fitness effects inferred from deep mutational scanning experiments predict the characteristic patterns of amino acid change observed in molecular evolution. These maximum entropy distributions of mutational effects not only provide a tuneable model for molecular evolution, but also have implications for mutational effect prediction and protein engineering., Competing Interests: Author Declaration The authors declare that they have no conflicts of interest.
- Published
- 2024
- Full Text
- View/download PDF
48. Multiple models for outbreak decision support in the face of uncertainty.
- Author
-
Shea K, Borchering RK, Probert WJM, Howerton E, Bogich TL, Li SL, van Panhuis WG, Viboud C, Aguás R, Belov AA, Bhargava SH, Cavany SM, Chang JC, Chen C, Chen J, Chen S, Chen Y, Childs LM, Chow CC, Crooker I, Del Valle SY, España G, Fairchild G, Gerkin RC, Germann TC, Gu Q, Guan X, Guo L, Hart GR, Hladish TJ, Hupert N, Janies D, Kerr CC, Klein DJ, Klein EY, Lin G, Manore C, Meyers LA, Mittler JE, Mu K, Núñez RC, Oidtman RJ, Pasco R, Pastore Y Piontti A, Paul R, Pearson CAB, Perdomo DR, Perkins TA, Pierce K, Pillai AN, Rael RC, Rosenfeld K, Ross CW, Spencer JA, Stoltzfus AB, Toh KB, Vattikuti S, Vespignani A, Wang L, White LJ, Xu P, Yang Y, Yogurtcu ON, Zhang W, Zhao Y, Zou D, Ferrari MJ, Pannell D, Tildesley MJ, Seifarth J, Johnson E, Biggerstaff M, Johansson MA, Slayton RB, Levander JD, Stazer J, Kerr J, and Runge MC
- Subjects
- Humans, Uncertainty, Disease Outbreaks prevention & control, Public Health, Pandemics prevention & control, COVID-19 epidemiology, COVID-19 prevention & control
- Abstract
Policymakers must make management decisions despite incomplete knowledge and conflicting model projections. Little guidance exists for the rapid, representative, and unbiased collection of policy-relevant scientific input from independent modeling teams. Integrating approaches from decision analysis, expert judgment, and model aggregation, we convened multiple modeling teams to evaluate COVID-19 reopening strategies for a mid-sized United States county early in the pandemic. Projections from seventeen distinct models were inconsistent in magnitude but highly consistent in ranking interventions. The 6-mo-ahead aggregate projections were well in line with observed outbreaks in mid-sized US counties. The aggregate results showed that up to half the population could be infected with full workplace reopening, while workplace restrictions reduced median cumulative infections by 82%. Rankings of interventions were consistent across public health objectives, but there was a strong trade-off between public health outcomes and duration of workplace closures, and no win-win intermediate reopening strategies were identified. Between-model variation was high; the aggregate results thus provide valuable risk quantification for decision making. This approach can be applied to the evaluation of management interventions in any setting where models are used to inform decision making. This case study demonstrated the utility of our approach and was one of several multimodel efforts that laid the groundwork for the COVID-19 Scenario Modeling Hub, which has provided multiple rounds of real-time scenario projections for situational awareness and decision making to the Centers for Disease Control and Prevention since December 2020.
- Published
- 2023
- Full Text
- View/download PDF
49. Evaluating targeted COVID-19 vaccination strategies with agent-based modeling.
- Author
-
Hladish TJ, Pillai AN, Pearson CAB, Toh KB, Tamayo A, Stoltzfus A, and Longini IM Jr
- Abstract
We evaluate approaches to vaccine distribution using an agent-based model of human activity and COVID-19 transmission calibrated to detailed trends in cases, hospitalizations, deaths, seroprevalence, and vaccine breakthrough infections in Florida, USA. We compare the incremental effectiveness for four different distribution strategies at four different levels of vaccine availability, reflecting different income settings' historical COVID-19 vaccine distribution. Our analysis indicates that the best strategy to reduce severe outcomes is to actively target high disease-risk individuals. This was true in every scenario, although the advantage was greatest for the middle-income-country availability assumptions, and relatively modest compared to a simple mass vaccination approach for rapid, high levels of vaccine availability. Ring vaccination, while generally the most effective strategy for reducing infections, ultimately proved least effective at preventing deaths. We also consider using age group as a practical, surrogate measure for actual disease-risk targeting; this approach still outperforms both simple mass distribution and ring vaccination. We also find that the magnitude of strategy effectiveness depends on when assessment occurs ( e.g ., after delta vs. after omicron variants). However, these differences in absolute benefit for the strategies do not change the ranking of their performance at preventing severe outcomes across vaccine availability assumptions.
- Published
- 2023
- Full Text
- View/download PDF
50. COVID-19 reopening strategies at the county level in the face of uncertainty: Multiple Models for Outbreak Decision Support.
- Author
-
Shea K, Borchering RK, Probert WJM, Howerton E, Bogich TL, Li S, van Panhuis WG, Viboud C, Aguás R, Belov A, Bhargava SH, Cavany S, Chang JC, Chen C, Chen J, Chen S, Chen Y, Childs LM, Chow CC, Crooker I, Valle SYD, España G, Fairchild G, Gerkin RC, Germann TC, Gu Q, Guan X, Guo L, Hart GR, Hladish TJ, Hupert N, Janies D, Kerr CC, Klein DJ, Klein E, Lin G, Manore C, Meyers LA, Mittler J, Mu K, Núñez RC, Oidtman R, Pasco R, Piontti APY, Paul R, Pearson CAB, Perdomo DR, Perkins TA, Pierce K, Pillai AN, Rael RC, Rosenfeld K, Ross CW, Spencer JA, Stoltzfus AB, Toh KB, Vattikuti S, Vespignani A, Wang L, White L, Xu P, Yang Y, Yogurtcu ON, Zhang W, Zhao Y, Zou D, Ferrari M, Pannell D, Tildesley M, Seifarth J, Johnson E, Biggerstaff M, Johansson M, Slayton RB, Levander J, Stazer J, Salerno J, and Runge MC
- Abstract
Policymakers make decisions about COVID-19 management in the face of considerable uncertainty. We convened multiple modeling teams to evaluate reopening strategies for a mid-sized county in the United States, in a novel process designed to fully express scientific uncertainty while reducing linguistic uncertainty and cognitive biases. For the scenarios considered, the consensus from 17 distinct models was that a second outbreak will occur within 6 months of reopening, unless schools and non-essential workplaces remain closed. Up to half the population could be infected with full workplace reopening; non-essential business closures reduced median cumulative infections by 82%. Intermediate reopening interventions identified no win-win situations; there was a trade-off between public health outcomes and duration of workplace closures. Aggregate results captured twice the uncertainty of individual models, providing a more complete expression of risk for decision-making purposes.
- Published
- 2020
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.