Genomic data from Escherichia coli O104:H4 isolate TY-2482
Dataset type: Genomic
Data released on June 03, 2011
Li D; Xi F; Zhao M; Chen W; Cao S; Xu R; Wang G; Wang J; Zhang Z; Li Y; Cui C; Chang C; Cui C; Luo Y; Qin J; Li S; Li J; Peng Y; Pu F; Sun Y; Chen Y; Zong Y; Ma X; Yang X; Cen Z; Song Y; Zhao X; Chen F; Yin X; Rohde H; Liang Y; Li Y; the Escherichia coli O104:H4 TY-2482 isolate genome sequencing consortium (2011): Genomic data from Escherichia coli O104:H4 isolate TY-2482 BGI Shenzhen. https://doi.org/10.5524/100001
The May 2011 outbreak of an E. coli infection in Europe resulted in serious concerns about the potential appearance of a new deadly strain of bacteria, Escherichia coli O104:H4 TY-2482. In response to this situation, and immediately after the reports of deaths, the University Medical Centre Hamburg-Eppendorf and BGI-Shenzhen worked together to sequence the bacterium and assess its human health risk.
The bacterium’s genome was first sequenced using Life Technologies; Ion Torrent sequencing platform. According to the results of the draft assembly, the estimated genome size of this new E. coli strain is about 5.2 Mb. Sequence analysis indicated this bacterium is an EHEC serotype O104 E. coli strain. Comparative analysis showed that this bacterium has 93% sequence similarity with the EAEC 55989 E. coli strain, which was isolated in the Central African Republic and known to cause serious diarrhea. This strain of E. coli, however, has also acquired specific sequences that appear to be similar to those involved in the pathogenicity of hemorrhagic colitis and hemolytic-uremic syndrome. The acquisition of these genes may have occurred through horizontal gene transfer.
To maximize its utility to the research community and aid those fighting the epidemic, this genomic data was released into the public domain under a CC0 license.
To the extent possible under law, BGI Shenzhen has waived all copyright and related or neighboring rights to genomic data from the 2011 E. coli outbreak. This work is published from China.
Additional details
Read the peer-reviewed publication(s):
- Rohde, H., Qin, J., Cui, Y., Li, D., Loman, N. J., Hentschke, M., Chen, W., Pu, F., Peng, Y., Li, J., Xi, F., Li, S., Li, Y., Zhang, Z., Yang, X., Zhao, M., Wang, P., Guan, Y., Cen, Z., … Yang, R. (2011). Open-Source Genomic Analysis of Shiga-Toxin–ProducingE. coliO104:H4. New England Journal of Medicine, 365(8), 718–724. https://doi.org/10.1056/nejmoa1107643 (PubMed:21793736)
- Bennett, N., Plečko, D., Ukor, I.-F., Meinshausen, N., & Bühlmann, P. (2022). ricu: R’s interface to intensive care data. GigaScience, 12. https://doi.org/10.1093/gigascience/giad041 (PubMed:37318234)
Related datasets:
doi:10.5524/100001 IsNewVersionOf doi:10.5524/200179
Additional information:
https://github.com/ehec-outbreak-crowdsourced/BGI-data-analysis/wiki/
Accessions (data included in GigaDB):
SRA: SRP006916
BioProject: PRJNA67657
Click on a table column to sort the results.
Table SettingsSample ID | Common Name | Scientific Name | Sample Attributes | Taxonomic ID | Genbank Name |
---|---|---|---|---|---|
TY-2482 | E. coli | Escherichia coli | Isolate:TY-2482 Isolation source:stool sample from patient with he... Alternative accession-SRA Sample:SRS211184 ... |
562 |
Click on a table column to sort the results.
Table SettingsFile Name | Description | Sample ID | Data Type | File Format | Size | Release Date | File Attributes | Download |
---|---|---|---|---|---|---|---|---|
Readme | TEXT | 1.43 kB | 2012-02-28 | MD5 checksum: afe7a86cd0e0e0ccd3a51330987774d1 |
||||
03/06/11 Ion Torrent mapped assembly | Sequence assembly | FASTA | 1.54 MB | 2012-02-28 | MD5 checksum: d3bbc2fb16340d1ce927610cd41a539c |
|||
06/06/11 Ion Torrent+Illumina hybrid assembly | Sequence assembly | FASTA | 1.76 MB | 2012-02-28 | ||||
06/06/11 Ion Torrent+Illumina hybrid assembly (NCBI version) | Sequence assembly | FASTA | 1.64 MB | 2012-02-28 | MD5 checksum: c96bab74dbbcbaced5c7d1b43f2ba154 |
|||
11/06/11 Illumina de novo assembly | Sequence assembly | FASTA | 1.65 MB | 2012-02-28 | MD5 checksum: 18b3a2349fc9ad6fa8646ef142a4c842 |
|||
16/06/11 Gapless Illumina de novo assembly (chromosome) | Sequence assembly | FASTA | 1.63 MB | 2012-02-28 | MD5 checksum: b0ccd63bb28547a184572be8b51f2e4c |
|||
16/06/11 Gapless Illumina de novo assembly (plasmid) | Sequence assembly | FASTA | 51.20 kB | 2012-02-28 | MD5 checksum: feb40a51c32a0f3ef3124ad0c05ad824 |
|||
11/06/11 Illumina reads | Genome sequence | FASTQ | 1.62 GB | 2012-02-28 | MD5 checksum: 9cc180feee23eb506e28e08f888cdc56 |
|||
02/06/11 Ion Torrent run 1 | Genome sequence | FASTQ | 8.18 MB | 2012-02-28 | MD5 checksum: 0e45f130055ccc0471f67c3b502bf19a |
|||
02/06/11 Ion Torrent run 2 | Genome sequence | FASTQ | 10.69 MB | 2012-02-28 | MD5 checksum: 1f049ec2ce48637cdbe102dc25bda337 |
Funding body | Awardee | Award ID | Comments |
---|---|---|---|
National Natural Science Foundation of China | M Li | 31530068 |
Date | Action |
---|---|
September 15, 2017 | Relationship added : DOI 200029 |
September 15, 2017 | Relationship removed : DOI 200029 |
October 2, 2017 | Manuscript Link added : 10.1093/gigascience/gix082 |
October 2, 2017 | Manuscript Link added : 10.1093/gigascience/gix078 |
March 12, 2018 | Funder added : National Institutes of Health |
July 30, 2018 | Funder added : National Natural Science Foundation of China |
August 21, 2018 | Link added : PRJEB21098 |
August 21, 2018 | Link removed : PRJEB21098 |
December 5, 2018 | External Link removed : http://climb.genomics.cn/Ecoli_TY-2482 - no longer active, no alternative found |
January 28, 2019 | Additional file readme_100001.txt added |
January 28, 2019 | File readme_100001.txt updated |
August 1, 2019 | Relationship added : DOI 200097 |
July 3, 2019 | Relationship added : DOI 200096 |
July 3, 2019 | Relationship removed : DOI 200096 |
August 1, 2019 | Relationship removed : DOI 200097 |
November 16, 2019 | Relationship added : DOI 200099 |
November 16, 2019 | Relationship removed : DOI 200099 |
December 14, 2019 | Relationship added : DOI 100676 |
December 14, 2019 | Relationship removed : DOI 100676 |
April 14, 2020 | Relationship added : DOI 100721 |
April 14, 2020 | Relationship removed : DOI 100721 |
August 18, 2020 | Relationship added : DOI 100780 |
August 18, 2020 | Relationship removed : DOI 100780 |
August 20, 2020 | Additional file readme_100782.txt added |
August 20, 2020 | readme_100782.txt: additional file attribute added |
August 24, 2020 | Relationship added : DOI 200125 |
November 17, 2020 | File 110601_I238_FCB067HABXX_L3_ESCqslRAADIAAPEI-2_1.fq.gz updated |
November 23, 2020 | Relationship added : DOI 200129 |
November 23, 2020 | Relationship removed : DOI 200129 |
August 20, 2021 | Additional file short_summary_output2.txt added |
October 11, 2021 | Additional file Drought_tolerance_horsegram_genes.csv added |
January 5, 2022 | Additional file Pfla.chr.gff added |
January 5, 2022 | Additional file Pfla.contig.gff added |
January 17, 2022 | Additional file DENTIST.part3.tar added |
January 17, 2022 | Additional file DENTIST.part5.tar added |
July 6, 2022 | Funder added : CESAM |
July 6, 2022 | Funder added : CESAM |
September 8, 2022 | Additional file BenTiger_SIreference_minQ30_minGQ30_minDP10_rmvIndel_mac3.recode.vcf.gz added |
September 8, 2022 | File BenTiger_SIreference_minQ30_minGQ30_minDP10_rmvIndel_mac3.recode.vcf.gz removed |
September 8, 2022 | File removed : BenTiger_SIreference_minQ30_minGQ30_minDP10_rmvIndel_mac3.recode.vcf.gz |
September 13, 2022 | Funder added : Washington Tree Fruit Research Commission (WTFRC) |
October 7, 2022 | Manuscript Link removed : 10.1093/gigascience/gix078 |
October 7, 2022 | Manuscript Link removed : 10.1093/gigascience/gix082 |
June 13, 2023 | Description updated from : The May 2011 outbreak of an E. coli infection in Europe resulted in serious concerns about the potential appearance of a new deadly strain of bacteria, Escherichia coli O104:H4 TY-2482. In response to this situation, and immediately after the reports of deaths, the University Medical Centre Hamburg-Eppendorf and BGI-Shenzhen worked together to sequence the bacterium and assess its human health risk. The bacterium’s genome was first sequenced using Life Technologies; Ion Torrent sequencing platform. According to the results of the draft assembly, the estimated genome size of this new E. coli strain is about 5.2 Mb. Sequence analysis indicated this bacterium is an EHEC serotype O104 E. coli strain. Comparative analysis showed that this bacterium has 93% sequence similarity with the EAEC 55989 E. coli strain, which was isolated in the Central African Republic and known to cause serious diarrhea. This strain of E. coli, however, has also acquired specific sequences that appear to be similar to those involved in the pathogenicity of hemorrhagic colitis and hemolytic-uremic syndrome. The acquisition of these genes may have occurred through horizontal gene transfer. To maximize its utility to the research community and aid those fighting the epidemic, this genomic data was released into the public domain under a CC0 license. To the extent possible under law, BGI Shenzhen has waived all copyright and related or neighboring rights to genomic data from the 2011 E. coli outbreak. This work is published from: China. |
June 13, 2023 | Relationship removed : DOI 200125 |
June 15, 2023 | File 110601_I238_FCB067HABXX_L3_ESCqslRAADIAAPEI-2_1.fq.gz updated |
June 15, 2023 | File 110601_I238_FCB067HABXX_L3_ESCqslRAADIAAPEI-2_1.fq.gz updated |
June 15, 2023 | File 110601_I238_FCB067HABXX_L3_ESCqslRAADIAAPEI-2_1.fq.gz updated |
September 26, 2023 | Manuscript Link added : 10.1093/gigascience/giad041 |
October 20, 2023 | Relationship added : DOI 200179 |
December 7, 2023 | Relationship added : DOI 200184 |
December 7, 2023 | Relationship removed : DOI 200184 |
February 1, 2024 | Link added : BioProject:PRJEB53517 |
February 1, 2024 | Link removed : BioProject:PRJEB53517 |