Back to Search
Start Over
Loqusdb: added value of an observations database of local genomic variation
- Source :
- BMC Bioinformatics, Vol 21, Iss 1, Pp 1-10 (2020), BMC Bioinformatics
- Publication Year :
- 2020
- Publisher :
- Springer Science and Business Media LLC, 2020.
-
Abstract
- Background Exome and genome sequencing is becoming the method of choice for rare disease diagnostics. One of the key challenges remaining is distinguishing the disease causing variants from the benign background variation. After analysis and annotation of the sequencing data there are typically thousands of candidate variants requiring further investigation. One of the most effective and least biased ways to reduce this number is to assess the rarity of a variant in any population. Currently, there are a number of reliable sources of information for major population frequencies when considering single nucleotide variants (SNVs) and small insertion and deletions (INDELs), with gnomAD as the most prominent public resource available. However, local variation or frequencies in sub-populations may be underrepresented in these public resources. In contrast, for structural variation (SV), the background frequency in the general population is more or less unknown mostly due to challenges in calling SVs in a consistent way. Keeping track of local variation is one way to overcome these problems and significantly reduce the number of potential disease causing variants retained for manual inspection, both for SNVs and SVs. Results Here, we present loqusdb, a tool to solve the challenge of keeping track of any type of variant observations from genome sequencing data. Loqusdb was designed to handle a large flow of samples and unlike other solutions, samples can be added continuously to the database without rebuilding it, facilitating improvements and additions. We assessed the added value of a local observations database using 98 samples annotated with information from a background of 888 unrelated individuals. Conclusions We show both how powerful SV analysis can be when filtering for population frequencies and how the number of apparently rare SNVs/INDELs can be reduced by adding local population information even after annotating the data with other large frequency databases, such as gnomAD. In conclusion, we show that a local frequency database is an attractive, and a necessary addition to the publicly available databases that facilitate the analysis of exome and genome data in a clinical setting.
- Subjects :
- medicine.medical_specialty
Computer science
Population
Population frequency
Genomics
Mendelian
lcsh:Computer applications to medicine. Medical informatics
computer.software_genre
Polymorphism, Single Nucleotide
Biochemistry
Genome
DNA sequencing
Structural variation
User-Computer Interface
03 medical and health sciences
INDEL Mutation
Structural Biology
Intellectual Disability
Databases, Genetic
medicine
Humans
Nucleotide
Indel
education
lcsh:QH301-705.5
Molecular Biology
Exome
030304 developmental biology
chemistry.chemical_classification
0303 health sciences
education.field_of_study
Database
Applied Mathematics
030305 genetics & heredity
Genetic Variation
Structural variant
Computer Science Applications
Single nucleotide variant
lcsh:Biology (General)
chemistry
lcsh:R858-859.7
Medical genetics
DNA microarray
Rare disease
computer
Software
Subjects
Details
- ISSN :
- 14712105
- Volume :
- 21
- Database :
- OpenAIRE
- Journal :
- BMC Bioinformatics
- Accession number :
- edsair.doi.dedup.....a24eaefdc205ed5ded17f60cf6ba2214
- Full Text :
- https://doi.org/10.1186/s12859-020-03609-z