Skip to main content

Genome sequence of YH: the first diploid genome sequence of a Han Chinese individual.

Dataset type: Genomic
Data released on July 06, 2011

Wang J; Wang W; Li R; Li Y; Tian G; Goodman L; Fan W; Zhang J; Li J; Zhang J; Guo Y; Feng B; Li H; Lu Y; Fang X; Liang H; Du Z; Li D; Zhao Y; Hu Y; Yang Z; Zheng H; Hellmann I; Inouye M; Pool J; Yi X; Zhao J; Duan J; Zhou Y; Qin J; Ma L; Li G; Yang Z; Zhang G; Yang B; Yu C; Liang F; Li W; Li S; Li D; Ni P; Ruan J; Li Q; Zhu H; Liu D; Lu Z; Li N; Guo G; Zhang J; Ye J; Fang L; Hao Q; Chen Q; Liang Y; Su Y; Asan ; Ping C; Yang S; Chen F; Li L; Zhou K; Zheng H; Ren Y; Yang L; Gao Y; Yang G; Li Z; Feng X; Kristiansen K; Wong GK; Nielsen R; Durbin R; Bolund L; Zhang X; Li S; Yang H; Wang J (2011): Genome sequence of YH: the first diploid genome sequence of a Han Chinese individual. GigaScience. https://doi.org/10.5524/100015

DOI10.5524/100015

Genomic data from the YH (Homo sapiens) genome – first diploid genome sequence of a Han Chinese, a representative of the Asian population. The genomic DNA used in this study came from an anonymous male Han Chinese individual who has no known genetic diseases. The YH genome was assembled based on 3.3 billion reads using the Illumina Genome Analyzer. We achieved 117.7G nucleotides data and the genome was sequenced to 36-fold average coverage. By aligning the short reads with SOAP, 102.9G nucleotides were mapped onto the NCBI reference genome and 99.97% of the genome was covered. The raw sequences, alignments, consensus genome, variants and relevant tools are released for public use under a CC0 license.

View citations on Google ScholarView citations on Europe PubMed CentralView citations on Dimensions

Additional details

Read the peer-reviewed publication(s):

  • Wang, J., Wang, W., Li, R., Li, Y., Tian, G., Goodman, L., Fan, W., Zhang, J., Li, J., Zhang, J., Guo, Y., Feng, B., Li, H., Lu, Y., Fang, X., Liang, H., Du, Z., Li, D., Zhao, Y., … Wang, J. (2008). The diploid genome sequence of an Asian individual. Nature, 456(7218), 60–65. https://doi.org/10.1038/nature07484 (PubMed:18987735)
Related datasets:

doi:10.5524/100015 IsSupplementedBy doi:10.5524/100014
doi:10.5524/100015 IsSupplementedBy doi:10.5524/100013
doi:10.5524/100015 IsPreviousVersionOf doi:10.5524/100038(It is a more recent version of this dataset)


Additional information:

http://yh.genomics.org.cn/

Genome browser:

http://yh.genomics.org.cn/mapview.jsp?

Accessions (data included in GigaDB):

ENA: ERP000053
BioProject: PRJEA39173

Projects:

Go to 1000 Genomes website

Click on a table column to sort the results.

Table Settings
Sample ID Common Name Scientific Name Sample Attributes Taxonomic ID Genbank Name
YH Human Homo sapiens 9606 human

Click on a table column to sort the results.

Table Settings

File Name Description Sample ID Data Type File Format Size Release Date File Attributes Download
Sequence assembly FASTA 714.29 MB 2011-07-06 MD5 checksum: 6df82e9acf4473c09181adb475e6899d
Sequence assembly FASTA 737.12 MB 2011-07-06 MD5 checksum: 4c1b3c01a3f4b813b671cb2eeffca4bf
SNPs UNKNOWN 278.38 kB 2011-07-06 MD5 checksum: 873c1850e862f9f09d1acf1a90cfbb7b
SNPs UNKNOWN 265.93 kB 2011-07-06 MD5 checksum: b4175c99a71a579e9536caddf0789375
SNPs UNKNOWN 224.77 kB 2011-07-06 MD5 checksum: 5a73f20d238cc46195e91fad17d4901b
SNPs UNKNOWN 197.59 kB 2011-07-06 MD5 checksum: 3272f2c3b560238001942098b7261d2c
SNPs UNKNOWN 200.17 kB 2011-07-06 MD5 checksum: 8d0d62ab65508830b628c9004938a6fa
SNPs UNKNOWN 218.70 kB 2011-07-06 MD5 checksum: 08b9195125e343d76e474b97ded21c24
SNPs UNKNOWN 179.65 kB 2011-07-06 MD5 checksum: bdf50969e6a9ae8c97386918dd4785a3
SNPs UNKNOWN 173.50 kB 2011-07-06 MD5 checksum: 60a0b2bb67ead907553a854daca625ff
Date Action