ABSTRACT
We report here the draft genome sequences of Sphingomonas sp. IBVSS1 and IBVSS2, two bacteria assembled from the metagenomes of surface samples from freshwater lakes. The genomes are >99% complete and may represent new species within the Sphingomonas genus, indicating a larger diversity than currently identified.
GENOME ANNOUNCEMENT
Advances in high-throughput sequencing, coupled with decreasing costs, have led to the number of available bacterial genomes increasing almost exponentially. Genome sequencing, however, has traditionally been limited to species that can be held and grown in culture due to the high DNA volumes needed. A predominant focus on cultivable species has led to a genome bias, and true bacterial diversity is poorly represented. Metagenomic studies are rectifying this bias and have already revealed a large novel diversity (1). However, metagenomic studies remain limited, with many ecosystems yet to be sampled. We attempt to expand species richness in a bioproject with a goal to identify novel bacteria from various environmental samples. Here, we present the draft genomes of two unclassified Sphingomonas bacteria, isolated from the surface of freshwater lakes in Norway (Årungen, Ås) and Japan (Tsukuba, Ibaraki).
DNA was isolated using a standard phenol-chloroform protocol with ethanol precipitation and subsequent cleaning using Zymo genomic clean and concentrator. DNA was prepared and sequenced on the Illumina HiSeq2500 platform (150-bp paired-end reads; 350-bp insert size) and PacBio RS2 P6-C4 chemistry (20 kb) at the Norwegian Sequencing Centre. Metagenome drafts were assembled using SPAdes version 3.9.0 (2), with single genomes separated with MetaBAT (3), and quality was assessed with CheckM (4). Separate genomes were scaffolded using LINKS (5), and gaps were closed with Sealer (6). Genome assemblies were evaluated with PROmer (7) and REAPER (8) before being improved with Pilon (9). Genomes were annotated using the NCBI Prokaryotic Genome Annotation Pipeline (10). Taxonomical rank was established on evaluation of CheckM (4), PhyloSift (11), and megaBLAST against the NCBInr database.
Sphingomonas sp. IBVSS1 was assembled into 11 scaffolds, constituting 14 contigs with a sequence length of 3.15 Mb and a GC content of 66.67%. The scaffold N50 value was 0.73 Mb with Illumina coverage of 12× and PacBio coverage of 13×. CheckM estimated genome completeness at 99.54% with no contamination or strain heterogeneity. The genome constitutes 2,909 genes, 49 RNAs, 43 tRNAs, 3 noncoding RNAs (ncRNAs), and 50 pseudogenes.
Sphingomonas sp. IBVSS2 was assembled into nine scaffolds, constituting 10 contigs with a total sequence length of 4.29 Mb and a GC content of 67.81%. The scaffold N50 value was 0.75 Mb with Illumina coverage of 84× and PacBio coverage of 16×. CheckM estimated genome completeness at 99.54% with no contamination or strain heterogeneity. The genome constitutes 3,925 genes, 52 RNAs, 46 tRNAs, 3 ncRNAs, and 29 pseudogenes.
Genomes were confirmed as novel Sphingomonas spp. by using 16S queries to perform a BLASTn search against the NCBInr database: IBVSS1 had a 95% identity to S. sanxanigenens (GenBank accession number CP006644), and IBVSS2 had a 96% identity to the 16S rRNA of S. taxi (CP009571). IBVSS1 and IBVSS2 had a 92% 16S rRNA identity to each other. The low identity to known Sphingomonas bacteria may suggest IBVSS1 and IBVSS2 as new species, indicating a larger diversity than currently identified.
Accession number(s).The draft genomes of Sphingomonas sp. strains IBVSS1 and IBVSS2 sequenced under this project have been deposited at DDBJ/EMBL/GenBank under the accession numbers NFUS00000000 and NFUR00000000 , respectively. These biosamples (SAMN06840509 and SAMN06840510, respectively) are part of BioProject PRJNA384425.
ACKNOWLEDGMENTS
We thank Abel (UiO) for providing computing resources, in particular projects nn9244k and nn9404k. We also thank the Midas cluster at VIB for providing additional computing resources. We are grateful to the Norwegian Sequencing Centre (NSC) for providing sequencing support and analysis for both the Illumina and PacBio platforms. We thank Thomas Haverkamp (IBV, UiO) for discussions related to the assembly and analysis of prokaryote genomes.
This work was supported by research grants from the Norwegian Research Council to R.J.S.O. (project 230868).
FOOTNOTES
- Received 18 July 2017.
- Accepted 19 July 2017.
- Published 24 August 2017.
- Copyright © 2017 Orr et al.
This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license .