ABSTRACT
“Candidatus Arthromitus” UMNCA01 was recovered from ileal samples of commercial turkey poults and may have probiotic capabilities. The complete genome was determined using the Illumina MiSeq and HiSeq sequencing platforms. The complete genome consists of 1,631,326 bp and has a G+C content of 26.14%, 1,540 coding sequences (CDS), and 37 RNA coding genes.
ANNOUNCEMENT
A candidate genus of segmented filamentous bacteria, “Candidatus Arthromitus” belongs to the family Clostridiaceae. These commensal organisms promote adaptive and innate immune responses in murine models in a host-specific manner, prevent diseases, and promote animal growth (1, 2). Different strains of “Candidatus Arthromitus” inhabit the ileum region in many vertebrate animals, such as cattle, pigs, chickens, humans, and, as shown more recently, turkeys (3). In an attempt to discern the microbial basis of light turkey syndrome (LTS), a condition where commercial turkey flocks fail to meet their genetic potential weights despite standardized diets and growth conditions (4), Danzeisen et al. performed 16S rRNA sequencing of intestinal microbiome analysis of high-performing and low-performing (based upon flock weights) turkey flocks (5). This analysis revealed that at the age of 2 to 3 weeks, high-performing turkey flocks harbored significantly higher proportions of “Candidatus Arthromitus” bacteria than their low-performing counterparts (5). In this study, the genome of a turkey-specific strain of “Candidatus Arthromitus” was sequenced from the gut microbiome.
“Candidatus Arthromitus” UMNCA01, a Gram-positive bacterium, was recovered from ileal samples harvested from 2-week-old turkey poults from a research turkey flock in barns at the University of Minnesota. The sample was identified for shotgun sequencing by previous 16S rRNA amplicon profiling indicating a high relative abundance of “Candidatus Arthromitus” bacteria and light microscopy confirming the presence of high levels of segmented filamentous bacteria. For metagenomic shotgun sequencing, the total genomic DNA was isolated using a Qiagen stool kit (Hilden, Germany). The quantity of the genomic DNA was determined by measuring A260 using a UV-visible (UV-Vis) spectrophotometer (A260 = 1 corresponds to 50 ng/μl of double-stranded DNA [dsDNA]). The quality of the genomic DNA was determined by measuring the A260/A280 ratio, and a value of 1.8 indicated pure DNA preparation as described (6). Twenty micrograms of metagenomic DNA was used to prepare a paired-end (PE) sequencing library (Nextera XT, Illumina, San Diego, CA), and a PCR amplified library was sequenced using the Illumina MiSeq and HiSeq platforms. The shotgun data were assembled using CLC Genomics Workbench v. 9.0/APRIL-2016, with default parameters, and then contigs were mapped to an existing mouse “Candidatus Arthromitus” genome using Mauve (7) to retrieve and arrange “Candidatus Arthromitus” sequences (sourced from turkeys) that mapped to those genomes. Following manual curation, unmapped contigs were then filtered from the metagenomic assembly. The final “Candidatus Arthromitus” assembly resulted in an average 100× genome coverage with a total number of 1,631,326 bp arranged into 41 contigs. The G+C content of these contigs was 26.14%, with an average contig size of 39,788 bp and an N50 value of 57,760 bp. The draft genome contains 1,480 protein coding sequences, 37 RNA genes, and 60 pseudogenes. The genome sequence of “Candidatus Arthromitus” UMNCA01 was annotated using the National Center for Biological Information (NCBI) Prokaryotic Genome Annotation Pipeline and the best-placed reference protein set of GeneMarkS+ (annotation software v. 4.6) as described (8, 9); the results are summarized in Table 1.
Global statistics of the “Candidatus Arthromitus” UMNCA01 genome
Data availability.This “Candidatus Arthromitus” UMNCA01 whole-genome shotgun (WGS) project has the GenBank accession number NZ_LXFF00000000. The version of this project is NZ_LXFF01000000 and consists of sequences LXFF01000001 through LXFF01000041. The filtered assembly and raw sequencing reads can be accessed through BioProject accession number PRJNA319431 and BioSample accession numbers SAMN04889864 and SAMN13392129, respectively.
ACKNOWLEDGMENTS
We acknowledge Holly Reiland for assistance in surveying the genome contents.
This work was funded by an Agriculture and Food Research Initiative competitive grant (2016-67015-24911) from the USDA National Institute of Food and Agriculture and by the University of Minnesota through Global Food Ventures.
FOOTNOTES
- Received 13 September 2019.
- Accepted 26 December 2019.
- Published 23 January 2020.
- Copyright © 2020 Hedblom et al.
This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International license.