SUPPLEMENTARY DATA
Bioinformatic analysis of bifidobacterial prophage genomes
Prophage-like element Binf-1.The predicted prophage-like element Binf-1 in B. longumsubsp. infantisATCC15697 extends from ORF1149 (putative integrase-encoding gene) to ORF1099 (predicted holin-encoding gene) (Fig. 2). The prophage-like element Binf-1 contains a lysogeny module whose encoded proteins display significant BLAST similarities to a phage integrase (the product of ORF1149; displaying a DNA-BRE-C superfamily motif), and an anti-repressor protein (ORF1140). Notably, the lysogeny module contains two genes coding for a putative two-component system histidine protein kinase and a methyltransferase protein. Downstream of the lysogeny module, a small number of ORFs are present that specify proteins with similarities to proteins involved in DNA replication (e.g., recombinase proteins and primase proteins) from Rhodococcus and Mycobacteriumphages. Identification of likely structural modules such as the DNA packaging genes (from ORF1116 to ORF1120) and the tail morphogenesis genes (ORF1107) was possible on the basis of observed homology to genes in high G+C Gram-positive bacterial and lactococcalphages.
Prophage-like element Binf-2.In this prophage-like element the lysogeny module was predicted to extend from ORF1490 to ORF1497, and includes the predicted integrase gene and genetic switch region. Downstream of the lysogeny module, a small number of anonymous ORFs and genes encoding proteins with similarities to DNA replication proteins from Enterococcusphages were identified. Database matches included a RecA ortholog, a replication initiation protein and a single-stranded DNA binding protein, defining this genome segment as the likely DNA replication module. Further downstream, several ORFs encoding hypothetical proteins were detected (Fig. 2). Their closest relatives were found in genomes from phages that infect low G+C content bacteria, such as enterobacteria, Lactococcus lactis, as well as high G+C content bacteria, such as Mycobacterium and Sinorhizobium. Notably, two adjacent DNA methyltransferase-encoding genes (ORF1526 and ORF1527) were surrounded on both sides by an antirepressor-encoding gene (ORF1525andORF1528) and on one side by an integrase-encoding gene (ORF1531), suggesting acquisition of this DNA region by horizontal gene transfer.
Prophage-like element Binf-3.This prophage-like element represents what appears to be one of the most rearranged and disintegrated prophage genome identified on the genome of B. longumsubsp. infantisATCC 15697, suggesting that this prophage-like element represents a phage remnant. In fact, only modules for structural phage components and host lysis were detected (Fig. 2). The former include the putative morphogenesis genes (encoding a minor tail component, and a prohead protease), genes putatively involved in DNA packaging (encoding the large terminase subunit, as well as a portal protein), while the latter was represented by genes predicted to specify the lysin and holin (Fig. 2). The closest relatives of these DNA regions were from phages infecting Actinobacteria,such as Corynebacterium and Mycobacterium bacteriophages.
Prophage-like element Binf-4. The predicted prophage Binf-4 in B. longum subsp. infantisATCC 15697 extends from ORF1780 (integrase gene) to ORF1831 (lysin gene) (Fig. 2). Database matches allowed the subdivision of the Binf-4 prophage genome into functional modules (Fig. 2). In this prophage-like element, the lysogeny module is limited to a region extending from ORF1780 to ORF1784, and encompassing genes that specify homologs of the integrase of Leifsonia xyli and the cI repressor of M. tuberculosis. The next ORFs (ORF1791-1795) constitute the likely DNA replication module, as it includes genes whose encoded protein products display similarity to single-stranded DNA binding proteins of mycobacteriophage, the partitioning protein of Clostridium, a putative replication protein of an Enterobacterium phage and an oligoribonuclease. Identification of structural modules such as the DNA packaging genes and head morphogenesis genes (from ORF1809 to ORF1815), and tail morphogenesis genes (from ORF1821 to ORF1823) was possible on the basis of observed homology to genes present in genomes of high G+C mycobacterial prophages. The general organization of the regions spanning part of the tail fiber and the host lysis cassette (from ORF1831 to ORF1832) showed a one to one correspondence to similarly sized genes in the tail fiber and host lysis clusters of the Binf-3 prophage-like element (Fig. 2). Notably, the lysis cassette, which is constituted by the lysin and holin-encoding genes, was identified downstream of the tail fiber module. The order of the lysin and holin-encoding genes is opposite to what is generally described for prophage genomes (Canchaya et al., 2003) and this genetic constellation is conserved in all identified B. longum subsp. infantis prophage genomes.
Prophage-like element Binf-rmn-3.As shown in Figure 2, the genetic structure of Binf-rmn-3 appears to have been the result of multiple DNA deletion and rearrangement events. In fact, its small size and the absence of a region encoding structural phage components suggest that one or more major DNA deletion events have occurred in this prophage-like element.
Prophage-like element Binf-5. The 37,747 bp genome of this prophage-like element extends from ORF2467 to ORF2406, which encode an integrase protein and a phage holin, respectively. This genome was predicted to contain functional modules such as the lysogeny module (ORF2467 to ORF2459) with a typical genetic switch region, and DNA replication module (ORF2456 to ORF2447) (Fig. 2). Putative tail morphogenesis and head morphogenesis genes were identified on the basis of observed homology to similar genes in high G+CCorynebacterium phages. The rightmost part of the Binf-5 genome represents a likely host-lysis module, comprised of the holin (ORF2406) and lysin (ORF2407) genes showing database matches to homologous genes of the B. longum subsp. longumBlj-1 prophage (Ventura et al., 2005).
Prophage-like element Binf-6.In this prophage-like element the lysogeny module is limited to a phage integrase gene (ORF1137), and to a superinfection exclusion gene (ORF1133) whose deduced protein product shares significant similarity with the Lp1 prophage superinfection immunity gene of Lactobacillus plantarum (Ventura et al., 2003). The lysogeny module is followed by a DNA replication module containing genes that are predicted to encode an oligoribonuclease (ORF1119) and a primase (ORF1098). The next ORFs (ORF1090 to ORF1078) constitute the likely morphogenesis module. Notably, the Binf-6 prophage-like element contains a transposon-like element encompassing a transposase (ORF1075) and a putative site-specific integrase/resolvase (ORF1073) resembling a B. longum subsp. longumDJO10A transposon. The presence of such a composite mobile element inserted into a prophage genomic sequence is not unusual. In fact, in B. breveUCC2003 a prophage genomic sequence has been described that carries a transposon element encompassing genes encoding a putative ABC transporter system and two glycosyl hydrolases (Ventura et al., 2005). The transposon element in the Binf-6 prophage is followed by ORF1070-1069 and ORF1067, encoding hypothetical products and a minor tail protein, respectively.
Prophage-like element Binf-7.The analysis of the genome of B. longum subsp. infantisCCUG52486 revealed, apart from Binf-6, one additional prophage-like element, Binf-7. The genomic organization of this prophage-like element resembles that of the B. longum subsp.longumBl-1 prophage (Ventura et al., 2005). The Binf-7 integrase gene is preceded by a predicted holin and a lysin gene with sequence similarity to the B. longum subsp.longumprophage Bl-1. Of note, the deduced lysin sequence contains a typical amidase motif (pfam01510). Further upstream, putative morphogenesis genes (encoding a minor tail component, and a prohead protease) and genes putatively involved in DNA packaging (encoding a terminase, as well as a portal protein) were identified (Fig. 2).
Prophage-like element Bbif-1.The genome organization of Bbif-1 resembles a typical lambdoid phage genome structure. Downstream of the lysogeny module, several ORFs are predicted to encode proteins with homology to DNA replication proteins, thus defining this genome segment as the likely DNA replication module. Further database matches allowed a distinction of likely DNA packaging and head morphogenesis genes, possible head-to-tail joining genes and putative tail genes. A lysis cassette was identified downstream of the tail fiber module, which is constituted by the classical genetic constellation of a holin and lysin gene.
Prophage-like element Bbif-2.Database matches allowed the identification in the genome of Bbif-2 prophage-like element of ORFs displaying similarities to phage-related genes, some of which could be clustered into functional modules (Fig. 3). In this prophage-like element, the lysogeny module is limited to a region spanning ORF1254 to ORF1273. The rest of the Bbif-2 genome contains ORFs displaying similarities to morphogenesis genes but apparently distributed in a non-modular fashion. Only the rightmost part of the genome contains a typical host lysis module which encompasses the presumptive holin gene (ORF1338) and an endolysin gene (ORF 1339).
Prophage-like element Bado-1.The predicted integrase-encoding gene (ORF958) is located next to the host lysis cassette in a genetic constellation that is similar to that of Blj-1 in B. longum subsp.longumDJO10A strain (Ventura et al., 2005). The database matches of the predicted ORFs allowed a distinction of DNA packaging and head morphogenesis genes producing the highest similarity scores with the analogous genes (DNA packaging and head morphogenesis) from phages infecting Mycobacterium and Streptomyces. Notably, a small number of genes that encode proteins with similarity to bacterial proteins were identified between the host lysis cassette and the predicted morphogenesis module. Notably, these include a complete Type I R/M system and genes involved in polyketide biosynthesis (e.g., beta-lactamase and enoyl-CoA hydratase) (Fig. 5).
Prophage-like element Bado-2.The predicted prophage Bado-2 extends from ORF1182 (integrase gene) to ORF1212 (putative terminase) on the genome of B. adolescentisL2-32. The genome sequences of this prophage includes ORFs encoding predicted structural components of the phage and genes of the lysogeny module (encoding a predicted integrase and Cro-like protein), host lysis cassette (lysin gene) and two transcriptional regulators (ORF1189 and ORF1193) (Fig. 5).
Prophage-like element Bado-3.The genome architecture of prophage Bado-3 resembles that of Bado-2, despite the lack of significant DNA similarities between these two prophage genomes (Fig. 5). The lysogeny module of Bado-2 includes two genes (ORF1370 and ORF1372) whose deduced protein products show a high level of identity to an integrase and a cI-like repressor, respectively. Two adjacent genes (ORF1373 and ORF1374) were assigned to the host lysis cassette, which is constituted by the classical genetic constellation of holin and lysin-encoding genes. Further database matches allowed the distinction of likely DNA packaging and head morphogenesis genes, putative head-to-tail joining and tail genes.
Prophage-like element Bdent-1.The predicted prophage-like element Bdent-1 extends from ORF1446 (integrase gene) to ORF1477 (lysin gene) (Fig. 6). In this prophage-like element, the integrase-encoding gene is placed immediately upstream of the likely DNA replication module consisting of two ORFs (ORF1450 and ORF1453), which encode proteins with similarities to a primase and DNA helicase from Firmicutes. Identification of likely structural modules such as the head morphogenesis and DNA packaging (from ORF1460 to ORF1462) and tail morphogenesis genes (from ORF1469 to ORF1472) was possible on the basis of observed homology to lactococcal and streptococcal phages. Further upstream, a gene (ORF1474) similar to an endopeptidase detected in the genome of Streptococcus pyogenes was identified, which could be part of a structural tail component required for cell wall lysis to allow phage entry during infection (Kenny et al., 2004); separated from this gene by an anonymous ORF, a gene encoding a putative lysin (ORF 1477) was identified.
Prophage-like element Bdent-2. Database matches indicated that the Bdent-2 prophage genome is subdivided into functional modules (Fig. 6). In this prophage-like element the lysogenic module is limited to a phage integrase gene (ORF0601), whose deduced protein product displays significant similarity to phage integrases of Mycobacterium and Corynebacterium species. The integrase-encoding gene is followed by genes putatively involved in DNA packaging (encoding a small and large terminase subunit, as well as a portal protein), and putative tail morphogenesis genes (encoding a minor tail component, and a prohead protease). Their closest relatives are from phages infecting low G+C content bacteria like L. lactisbIL170, Lactobacillusdelbrueckii bacteriophage LL-H and from phages infecting high G+C content bacteria like Mycobacterium and Corynebacterium. Further upstream, two genes (ORF1598 and ORF1604) are located that encode proteins with weak similarity to an excisionase protein of Frankia and a cI-like repressor of Listeria (Fig. 6), thus constituting the predicted lysogeny module.
References
Canchaya, C., Proux, C., Fournous, G., Bruttin, A., and Brussow, H. (2003).Prophage genomics.Microbiol Mol Biol Rev67(2), 238-76, table of contents.
Kenny, J. G., McGrath, S., Fitzgerald, G. F., and van Sinderen, D. (2004). Bacteriophage Tuc2009 encodes a tail-associated cell wall-degrading activity. J Bacteriol186(11), 3480-91.
Ventura, M., Canchaya, C., Kleerebezem, M., de Vos, W. M., Siezen, R. J., and Brussow, H. (2003). The prophage sequences of Lactobacillus plantarum strain WCFS1. Virology316(2), 245-55.
Ventura, M., Lee, J. H., Canchaya, C., Zink, R., Leahy, S., Moreno-Munoz, J. A., O'Connell-Motherway, M., Higgins, D., Fitzgerald, G. F., O'Sullivan, D. J., and van Sinderen, D. (2005). Prophage-like elements in bifidobacteria: insights from genomics, transcription, integration, distribution, and phylogenetic analysis. Appl Environ Microbiol71(12), 8692-705.