Back to Search Start Over

Phased diploid genome assembly with single-molecule real-time sequencing.

Authors :
Chin CS
Peluso P
Sedlazeck FJ
Nattestad M
Concepcion GT
Clum A
Dunn C
O'Malley R
Figueroa-Balderas R
Morales-Cruz A
Cramer GR
Delledonne M
Luo C
Ecker JR
Cantu D
Rank DR
Schatz MC
Source :
Nature methods [Nat Methods] 2016 Dec; Vol. 13 (12), pp. 1050-1054. Date of Electronic Publication: 2016 Oct 17.
Publication Year :
2016

Abstract

While genome assembly projects have been successful in many haploid and inbred species, the assembly of noninbred or rearranged heterozygous genomes remains a major challenge. To address this challenge, we introduce the open-source FALCON and FALCON-Unzip algorithms (https://github.com/PacificBiosciences/FALCON/) to assemble long-read sequencing data into highly accurate, contiguous, and correctly phased diploid genomes. We generate new reference sequences for heterozygous samples including an F1 hybrid of Arabidopsis thaliana, the widely cultivated Vitis vinifera cv. Cabernet Sauvignon, and the coral fungus Clavicorona pyxidata, samples that have challenged short-read assembly approaches. The FALCON-based assemblies are substantially more contiguous and complete than alternate short- or long-read approaches. The phased diploid assembly enabled the study of haplotype structure and heterozygosities between homologous chromosomes, including the identification of widespread heterozygous structural variation within coding sequences.

Details

Language :
English
ISSN :
1548-7105
Volume :
13
Issue :
12
Database :
MEDLINE
Journal :
Nature methods
Publication Type :
Academic Journal
Accession number :
27749838
Full Text :
https://doi.org/10.1038/nmeth.4035