Back to Search Start Over

aPhyloGeo-Covid: A web interface for phylogeographic analysis of SARS-CoV-2 variation using Neo4j and Snakemake

Authors :
Wanlin Li
Tahiri, Nadia
Publication Year :
2023
Publisher :
Zenodo, 2023.

Abstract

The gene sequencing data, along with the associated lineage tracing and research data generatedthroughout the Coronavirus disease 2019 (COVID-19) pandemic, constitute invaluable resources that profoundlyempower phylogeography research. To optimize the utilization of these resources, we have developed an interactiveanalysis platform called aPhyloGeo-Covid, leveraging the capabilities of Neo4j, Snakemake, and Python. This platform enables researchersto explore and visualize diverse data sources specifically relevant to SARS-CoV-2 for phylogeographic analysis. The integrated Neo4j database acts as a comprehensive repository, consolidating COVID-19 pandemic-related sequences information,climate data, and demographic data obtained from public databases, facilitating efficient filtering and organization of input data forphylogeographical studies. Presently, the database encompasses over 113,774 nodes and 194,381 relationships. Additionally, aPhyloGeo-Covid provides a scalable and reproducible phylogeographic workflow for investigating the intricate relationship between geographic features and the patterns of variation in diverse SARS-CoV-2 variants. The code repository of platform is publicly accessible onGitHub (https://github.com/tahiri-lab/iPhyloGeo/tree/iPhylooGeo-neo4j), providing researchers with a valuable tool to analyzeand explore the intricate dynamics of SARS-CoV-2 within a phylogeographic context.

Details

Language :
English
Database :
OpenAIRE
Accession number :
edsair.doi.dedup.....6217174f6818d3325e118c79ee2f6dcb
Full Text :
https://doi.org/10.5281/zenodo.8136026