ABSTRACT
Here, we report the completed genome sequences for two non-O1/non-O139 Vibrio cholerae isolates. Each isolate has only a single chromosome, as opposed to the normal paradigm of two chromosomes found in all other V. cholerae isolates.
GENOME ANNOUNCEMENT
Vibrio cholerae is a comma-shaped Gram-negative bacterium best known as the causative agent of cholera. Cholera represents an estimated burden of 1.4 to 4.3 million cases and 28,000 to 142,000 deaths per year worldwide (1). Historically, cholera outbreaks have been linked to V. cholerae O1 serogroup strains or its derivatives of the O37 and O139 serogroups. A genomic study on the 2010 Haitian cholera outbreak strains highlighted the putative role of non O1/non-O139 V. cholerae in causing cholera (2). In a recent study, we examined the genomic diversity of a large collection of V. cholerae strains belonging to different non-O1/non-0139 serogroups using whole-genome mapping (3). In that study, we found pervasive genetic and genomic structural diversity, including indels, duplications, and fusions of the usual two chromosomes. Here, we report the complete genome assemblies of two single-chromosome V. cholerae isolates.
Each genome was drafted and assembled using four data types: Illumina short-read, Roche 454 standard and long-insert reads, and PacBio long reads. Short-read and long-insert paired 454 data were assembled together in Newbler, and consensus sequences were computationally shredded into 2-kbp overlapping shreds. The Illumina short-read data were assembled with Velvet (4), and consensus sequences were computationally shredded into 1.5-kb overlapping shreds. The PacBio long-read data were assembled using Hierarchical Genome Assembly Process (HGAP) (5). Consensus sequences from HGAP were computationally shredded into 10-kbp overlapping pieces. All shreds were integrated using Phrap. Possible misassemblies were corrected and repeat regions verified using in-house scripts and manual editing in Consed (6–8). All genomes were assembled to finished-quality completion (9), and each assembly was annotated using an Ergatis-based (10) workflow with minor manual curation.
The finding of a single chromosome was independently verified by whole-genome mapping and pulsed-field gel electrophoresis of intact chromosomes of these two strains (3). Due to the nonstandard topology of these genomes, we reviewed OpGen Argus-generated in silico optical maps for orthogonal confirmation of the single-chromosome nature of each genome.
The V. cholerae strain 1154-74 genome is 3.928 Mb (47.8% G+C content), and the V. cholerae strain 10432-62 genome is 4.077 Mb (47.7% G+C content). Annotation located 3,430 and 3,645 coding sequences, respectively, with similar gene profiles.
ACKNOWLEDGMENTS
Funding for this effort was provided by the Defense Threat Reduction Agency's Joint Science and Technology Office (DTRA J9-CB/JSTO).
This paper is approved by the LANL for unlimited release (LA-UR-15-21518). The views expressed in this article are those of the authors and do not necessarily reflect the official policy or position of the Department of the Navy, Department of Defense, or the U.S. Government.
FOOTNOTES
- Received 3 April 2015.
- Accepted 7 April 2015.
- Published 14 May 2015.
- Copyright © 2015 Johnson et al.
This is an open-access article distributed under the terms of the Creative Commons Attribution 3.0 Unported license.