Draft Genome Sequence of the Tumor-Targeting Salmonella enterica Serovar Typhimurium Strain SL7207

ABSTRACT Salmonella enterica serovar Typhimurium strain SL7207 is a genetically modified derivative of strain SL1344, which preferentially accumulates in tumors and can be used as a vehicle for tissue-specific gene delivery in vivo. Here, we report the draft genome sequence of SL7207, confirming a purported aroA deletion and four single-nucleotide polymorphisms compared to SL1344.

S almonella enterica serovar Typhimurium is most well known for causing acute gastroenteritis. If left untreated, S. Typhimurium infection can lead to systemic disease resulting in host septic shock and, in extreme cases, death. Despite these adverse effects, multiple Salmonella serovars have been demonstrated to preferentially colonize tumor tissue when administered intravenously and delay tumor growth. SL7207 is one of the most promising strains utilized for this purpose (1). SL7207 is reported to have a deletion in aroA, resulting in bacterial auxotrophy for two compounds: -aminobenzoic acid and 2,3-dihydroxybenzoate (2). These compounds are not found in mammalian tissue, rendering the bacteria attenuated in the mammalian host and lending to its suitability as a therapeutic agent. The parent strain of SL7207, SL3261, was engineered as a vaccine strain, and it has been used in cancer therapy studies (1,3,4). However, many studies have identified the tumor-preferential localization of SL7207 when administered intravenously (5)(6)(7). Multiple studies have employed SL7207 for its tumor disruption and tumor-specific DNA vaccine delivery, as well as identified tumor-specific Salmonella promoters and immune components involved in Salmonella-tumor localization. Furthermore, bioluminescent SL7207 has been employed as an agent to identify the location of tumors and metastasis in vivo (8,9).
However, a detailed analysis of the genetic makeup of this strain is lacking. To gain a deeper understanding of the mechanisms underlying the antitumor properties of SL7207, its entire genome was sequenced.
Genomic DNA of S. Typhimurium SL7207 was extracted from a freshly grown single colony using an Illumina Nextera XT DNA sample kit per the manufacturer's protocol (Illumina, USA). Sequencing was performed by Illumina MiSeq using a 2 ϫ 250 paired-end protocol. Read quality analysis and trimming were conducted using Trimmomatic and then quality assessed using in-house scripts combined with SAMtools, BedTools, and BWA-MEM. De novo assembly was conducted with SPAdes version 3.5, resulting in a total of 68 contigs, with 30 larger than 1,000 bp. The draft genome of strain SL7207 contains 5,026,283 bp, with 52.16% GϩC content, and encodes 4,754 coding sequences (CDSs), nine rRNAs, and 84 tRNAs. S. Typhimurium SL1344, in comparison, encodes 4,605 CDSs, 22 rRNAs, and 85 tRNAs. The contigs of SL7207 were reordered against the complete genome of SL1344, and aligned using progressive-Mauve (version 20150226, build 10) (10). A list of single-nucleotide polymorphisms (SNPs) was generated from Mauve and visually inspected using CLC Genomics Workbench version 7.
The genome sequence of SL7207 provides the foundation for further molecular characterization of the strain in its use as a therapeutic agent in the treatment of cancer.
Accession number(s). This draft genome project has been deposited at DDBJ/ EMBL/GenBank under the accession number MPJV00000000 (BioProject PRJNA350897; BioSample SAMN05949357). The version described in this paper is the first version, MPJV01000000.

ACKNOWLEDGMENTS
Strain SL7207 was kindly provided by Siegfried Weiss, Helmoholtz Centre for Infection Research, Germany. Genome sequencing was provided by MicrobesNG (http:// www.microbesng.uk), which is supported by the Biotechnology and Biological Sciences Research Council (BBSRC; grant number BB/L024209/1). This work was funded by the Wellcome Trust through a Wellcome Trust PhD studentship to S.A.J. (102460/Z/13/Z) and by BBSRC grants BB/K008005/1 and BB/ P003281/1 awarded to D.M.W.