Genome sequence of YH: the first diploid genome sequence of a Han Chinese individual.

Dataset type: Genomic
Data released on July 06, 2011

Wang J; Wang W; Li R; Li Y; Tian G; Goodman L; Fan W; Zhang J; Li J; Zhang J; Guo Y; Feng B; Li H; Lu Y; Fang X; Liang H; Du Z; Li D; Zhao Y; Hu Y; Yang Z; Zheng H; Hellmann I; Inouye M; Pool J; Yi X; Zhao J; Duan J; Zhou Y; Qin J; Ma L; Li G; Yang Z; Zhang G; Yang B; Yu C; Liang F; Li W; Li S; Li D; Ni P; Ruan J; Li Q; Zhu H; Liu D; Lu Z; Li N; Guo G; Zhang J; Ye J; Fang L; Hao Q; Chen Q; Liang Y; Su Y; Asan ; Ping C; Yang S; Chen F; Li L; Zhou K; Zheng H; Ren Y; Yang L; Gao Y; Yang G; Li Z; Feng X; Kristiansen K; Wong GK; Nielsen R; Durbin R; Bolund L; Zhang X; Li S; Yang H; Wang J (2011): Genome sequence of YH: the first diploid genome sequence of a Han Chinese individual. GigaScience. https://doi.org/10.5524/100015

DOI10.5524/100015

Genomic data from the YH (Homo sapiens) genome – first diploid genome sequence of a Han Chinese, a representative of the Asian population. The genomic DNA used in this study came from an anonymous male Han Chinese individual who has no known genetic diseases. The YH genome was assembled based on 3.3 billion reads using the Illumina Genome Analyzer. We achieved 117.7G nucleotides data and the genome was sequenced to 36-fold average coverage. By aligning the short reads with SOAP, 102.9G nucleotides were mapped onto the NCBI reference genome and 99.97% of the genome was covered. The raw sequences, alignments, consensus genome, variants and relevant tools are released for public use under a CC0 license.

Additional details

Read the peer-reviewed publication(s):

Wang, J., Wang, W., Li, R., Li, Y., Tian, G., Goodman, L., Fan, W., Zhang, J., Li, J., Zhang, J., Guo, Y., Feng, B., Li, H., Lu, Y., Fang, X., Liang, H., Du, Z., Li, D., Zhao, Y., … Wang, J. (2008). The diploid genome sequence of an Asian individual. Nature, 456(7218), 60–65. https://doi.org/10.1038/nature07484 (PubMed:18987735)

Related datasets:

doi:10.5524/100015 IsSupplementedBy doi:10.5524/100014
doi:10.5524/100015 IsSupplementedBy doi:10.5524/100013
doi:10.5524/100015 IsPreviousVersionOf doi:10.5524/100038(It is a more recent version of this dataset)

Projects:

Click on a table column to sort the results.

Table Settings

Sample ID	Common Name	Scientific Name	Sample Attributes	Taxonomic ID	Genbank Name
YH	Human	Homo sapiens		9606	human

Click on a table column to sort the results.

Table Settings

File Name	Data Type	File Format	Size	Release Date	File Attributes
african2.scafSeq.closure.gz	Sequence assembly	FASTA	714.29 MB	2011-07-06	MD5 checksum: 6df82e9acf4473c09181adb475e6899d
asm_yanh.scafSeq.closure.gz	Sequence assembly	FASTA	737.12 MB	2011-07-06	MD5 checksum: 4c1b3c01a3f4b813b671cb2eeffca4bf
CHB_Illuminus_dbSNP_01.tab.gz	SNPs	UNKNOWN	278.38 kB	2011-07-06	MD5 checksum: 873c1850e862f9f09d1acf1a90cfbb7b
CHB_Illuminus_dbSNP_02.tab.gz	SNPs	UNKNOWN	265.93 kB	2011-07-06	MD5 checksum: b4175c99a71a579e9536caddf0789375
CHB_Illuminus_dbSNP_03.tab.gz	SNPs	UNKNOWN	224.77 kB	2011-07-06	MD5 checksum: 5a73f20d238cc46195e91fad17d4901b
CHB_Illuminus_dbSNP_04.tab.gz	SNPs	UNKNOWN	197.59 kB	2011-07-06	MD5 checksum: 3272f2c3b560238001942098b7261d2c
CHB_Illuminus_dbSNP_05.tab.gz	SNPs	UNKNOWN	200.17 kB	2011-07-06	MD5 checksum: 8d0d62ab65508830b628c9004938a6fa
CHB_Illuminus_dbSNP_06.tab.gz	SNPs	UNKNOWN	218.70 kB	2011-07-06	MD5 checksum: 08b9195125e343d76e474b97ded21c24
CHB_Illuminus_dbSNP_07.tab.gz	SNPs	UNKNOWN	179.65 kB	2011-07-06	MD5 checksum: bdf50969e6a9ae8c97386918dd4785a3
CHB_Illuminus_dbSNP_08.tab.gz	SNPs	UNKNOWN	173.50 kB	2011-07-06	MD5 checksum: 60a0b2bb67ead907553a854daca625ff