Metagenome of the Siberian Underground Water Reservoir

ABSTRACT We report on the metagenome of a deep subsurface aquifer in the Tomsk region of Russia, sampled via an oil exploration borehole drilled to a depth of 2.8 km. Methanogenic archaea were present in the water along with members of various bacterial phyla, including Firmicutes, Chloroflexi, Bacteroidetes, Ignavibacteriae, and uncultured candidate divisions.

T he deep subsurface biosphere, one of the least studied ecosystems on Earth, contains communities of extremophilic microorganisms thriving under conditions of high temperature and pressure. Microbial communities in deep subsurface ecosystems can be diverse in composition, depending on geochemical conditions (1, 2). These ecosystems remain isolated from the surface biosphere for thousands to millions years and do not depend on the input of organic substances from the surface (3,4). Some deep subsurface microbial communities comprise mostly chemolithoautrophic microorganisms that use abiotically produced molecular hydrogen as an energy source (5). In other subsurface ecosystems, buried organic matter supports the growth of organotrophs (6).
The thermal artesian waters of the West Siberian oil basin can be accessed via oil exploration boreholes. The aquifers are located at depths of 1 to 3 km and presumably were formed in the sedimentary rocks of the Mesozoic Era. Our previous studies revealed diverse microbial communities, the composition of which differed when different boreholes were sampled and analyzed (7,8). Here, we report a metagenome from borehole 5P, drilled to a depth of 2.8 km near the village of Chazhemto in the Tomsk region of Russia.
Contigs comprising partial or complete 16S rRNA gene sequences were identified using CheckM (13). The RDP Classifier (14) and a BLASTN search against the NCBI nonredundant (NR) database were used for the taxonomic assignment of these sequences. Consistently with results of 16S rRNA profiling (9), we found methanogenic Archaea (Methanosaeta, Methanothermobacter, Methanobacterium, and Methanomassiliicoccus) and various groups of bacteria including representatives of the phyla Firmicutes, Chloroflexi, Ignavibacteriae, Bacteroidetes, and uncultured candidate divi-sions. Binning of contigs using CONCOCT (15) allowed us to obtain genome bins representing composite genomes of dominant microbial species with more than 90% completeness, as estimated by CheckM (13). The obtained genomic data will be used to describe the phylogenetic and functional diversity of the aquifer and to predict the biogeochemical reactions in the deep subsurface biosphere that may be performed by uncultured prokaryotes.
Accession number(s). The sequences obtained in this project have been deposited in the NCBI Sequence Read Archive under the accession number SRR6186653.

ACKNOWLEDGMENTS
This study was carried out with the participation of the team of the Russian leading scientific school NSh-9998.2016.4 and supported by the Russian Foundation for Basic Research (grant 16-34-60124).