Back to Search Start Over

The complete sequence of a human Y chromosome.

Authors :
Rhie A
Nurk S
Cechova M
Hoyt SJ
Taylor DJ
Altemose N
Hook PW
Koren S
Rautiainen M
Alexandrov IA
Allen J
Asri M
Bzikadze AV
Chen NC
Chin CS
Diekhans M
Flicek P
Formenti G
Fungtammasan A
Garcia Giron C
Garrison E
Gershman A
Gerton JL
Grady PGS
Guarracino A
Haggerty L
Halabian R
Hansen NF
Harris R
Hartley GA
Harvey WT
Haukness M
Heinz J
Hourlier T
Hubley RM
Hunt SE
Hwang S
Jain M
Kesharwani RK
Lewis AP
Li H
Logsdon GA
Lucas JK
Makalowski W
Markovic C
Martin FJ
Mc Cartney AM
McCoy RC
McDaniel J
McNulty BM
Medvedev P
Mikheenko A
Munson KM
Murphy TD
Olsen HE
Olson ND
Paulin LF
Porubsky D
Potapova T
Ryabov F
Salzberg SL
Sauria MEG
Sedlazeck FJ
Shafin K
Shepelev VA
Shumate A
Storer JM
Surapaneni L
Taravella Oill AM
Thibaud-Nissen F
Timp W
Tomaszkiewicz M
Vollger MR
Walenz BP
Watwood AC
Weissensteiner MH
Wenger AM
Wilson MA
Zarate S
Zhu Y
Zook JM
Eichler EE
O'Neill RJ
Schatz MC
Miga KH
Makova KD
Phillippy AM
Source :
Nature [Nature] 2023 Sep; Vol. 621 (7978), pp. 344-354. Date of Electronic Publication: 2023 Aug 23.
Publication Year :
2023

Abstract

The human Y chromosome has been notoriously difficult to sequence and assemble because of its complex repeat structure that includes long palindromes, tandem repeats and segmental duplications <superscript>1-3</superscript> . As a result, more than half of the Y chromosome is missing from the GRCh38 reference sequence and it remains the last human chromosome to be finished <superscript>4,5</superscript> . Here, the Telomere-to-Telomere (T2T) consortium presents the complete 62,460,029-base-pair sequence of a human Y chromosome from the HG002 genome (T2T-Y) that corrects multiple errors in GRCh38-Y and adds over 30 million base pairs of sequence to the reference, showing the complete ampliconic structures of gene families TSPY, DAZ and RBMY; 41 additional protein-coding genes, mostly from the TSPY family; and an alternating pattern of human satellite 1 and 3 blocks in the heterochromatic Yq12 region. We have combined T2T-Y with a previous assembly of the CHM13 genome <superscript>4</superscript> and mapped available population variation, clinical variants and functional genomics data to produce a complete and comprehensive reference sequence for all 24 human chromosomes.<br /> (© 2023. This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply.)

Details

Language :
English
ISSN :
1476-4687
Volume :
621
Issue :
7978
Database :
MEDLINE
Journal :
Nature
Publication Type :
Academic Journal
Accession number :
37612512
Full Text :
https://doi.org/10.1038/s41586-023-06457-y