FWC Sector Competitiveness Studies - Competitiveness of the ...
Discovery of a new family of relaxases in Firmicutes bacteria€¦ · Competitiveness, and through...
Transcript of Discovery of a new family of relaxases in Firmicutes bacteria€¦ · Competitiveness, and through...
RESEARCH ARTICLE
Discovery of a new family of relaxases in
Firmicutes bacteria
Gayetri Ramachandran1☯¤a, Andres Miguel-Arribas1☯, David Abia1☯, Praveen K. Singh1¤b,
Isidro Crespo2, Cesar Gago-Cordoba1, Jian An Hao1¤c, Juan Roman Luque-Ortega3,
Carlos Alfonso3, Ling J. Wu4, D. Roeland Boer2, Wilfried J. J. Meijer1*
1 Centro de Biologıa Molecular "Severo Ochoa" (CSIC-UAM), Instituto de Biologıa Molecular "Eladio
Viñuela" (CSIC), Universidad Autonoma, Canto Blanco, Madrid, Spain, 2 XALOC beamline, ALBA
synchrotron Light Source, Cerdanyola del Vallès, Barcelona, Spain, 3 Centro de Investigaciones Biologicas
(CSIC), Madrid, Spain, 4 Centre for Bacterial Cell Biology, Institute for Cell and Molecular Biosciences,
Newcastle University, Newcastle Upon Tyne, United Kingdom
☯ These authors contributed equally to this work.
¤a Current address: Institute Pasteur, Synthetic Biology (G-5), Paris, France
¤b Current address: Max Planck Institute for Terrestrial Microbiology, Marburg, Germany
¤c Current address: The Institute of Seawater Desalination and Multipurpose Utilization, SOA, Tianjin, China
Abstract
Antibiotic resistance is a serious global problem. Antibiotic resistance genes (ARG), which
are widespread in environmental bacteria, can be transferred to pathogenic bacteria via hor-
izontal gene transfer (HGT). Gut microbiomes are especially apt for the emergence and dis-
semination of ARG. Conjugation is the HGT route that is predominantly responsible for the
spread of ARG. Little is known about conjugative elements of Gram-positive bacteria,
including those of the phylum Firmicutes, which are abundantly present in gut microbiomes.
A critical step in the conjugation process is the relaxase-mediated site- and strand-specific
nick in the oriT region of the conjugative element. This generates a single-stranded DNA
molecule that is transferred from the donor to the recipient cell via a connecting channel.
Here we identified and characterized the relaxosome components oriT and the relaxase of
the conjugative plasmid pLS20 of the Firmicute Bacillus subtilis. We show that the relaxase
gene, named relLS20, is essential for conjugation, that it can function in trans and provide evi-
dence that Tyr26 constitutes the active site residue. In vivo and in vitro analyses revealed
that the oriT is located far upstream of the relaxase gene and that the nick site within oriT is
located on the template strand of the conjugation genes. Surprisingly, the RelLS20 shows
very limited similarity to known relaxases. However, more than 800 genes to which no func-
tion had been attributed so far are predicted to encode proteins showing significant similarity
to RelLS20. Interestingly, these putative relaxases are encoded almost exclusively in Firmi-
cutes bacteria. Thus, RelLS20 constitutes the prototype of a new family of relaxases. The
identification of this novel relaxase family will have an important impact in different aspects
of future research in the field of HGT in Gram-positive bacteria in general, and specifically in
the phylum of Firmicutes, and in gut microbiome research.
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 1 / 23
a1111111111
a1111111111
a1111111111
a1111111111
a1111111111
OPENACCESS
Citation: Ramachandran G, Miguel-Arribas A, Abia
D, Singh PK, Crespo I, Gago-Cordoba C, et al.
(2017) Discovery of a new family of relaxases in
Firmicutes bacteria. PLoS Genet 13(2): e1006586.
doi:10.1371/journal.pgen.1006586
Editor: Diarmaid Hughes, Uppsala University,
SWEDEN
Received: June 9, 2016
Accepted: January 16, 2017
Published: February 16, 2017
Copyright: © 2017 Ramachandran et al. This is an
open access article distributed under the terms of
the Creative Commons Attribution License, which
permits unrestricted use, distribution, and
reproduction in any medium, provided the original
author and source are credited.
Data Availability Statement: All relevant data are
within the paper and its Supporting Information
files.
Funding: Work in the Meijer lab was funded by the
Spanish government through grant Bio2013-
41489-P of the Ministry of Economy and
Competitiveness, and through grant Bio2016-
77883-C2-1-P of the Ministry of Economy,
Industry and Competitiveness; the former grant
also funded AMA and CGC. The Spanish
government also supported DRB, JRLO, and CA.
DRB was funded by grant Bio2016-77883-C2-2-P
Author summary
Antibiotics have saved the lives of millions. However, the emergence and spread of antibi-
otic resistance compromises the effectiveness of antibiotics. Genes conferring antibiotic
resistance are often located on genetic elements that can be transferred to other bacteria.
Conjugation is the predominant route responsible for spreading antibiotic resistance
genes, and depends critically on a class of proteins called relaxases. These enzymes initiate
conjugation by processing the DNA of the mobile element and are therefore an Achilles´
heel of the conjugation process. Although antibiotic resistance is an important health
threat in both Gram-negative and Gram-positive bacteria, conjugation has so far been
studied primarily in Gram-negative bacteria. Due to the extremely high concentration of
bacteria, which favors conjugation, the intestinal gut is a hotspot for spreading antibiotic
resistance. It is now known that a large part of the gut microbiome corresponds to Gram-
positive bacteria, and many of these belong to the phylum Firmicutes. To better under-
stand conjugation and specifically relaxases of Gram-positive conjugative elements we
have identified and characterized the relaxase and the DNA region at which it acts of the
conjugative plasmid pLS20 from the Gram-positive Firmicute Bacillus subtilis. We also
show that the relaxase of pLS20 is the founding member of a new and large family of
relaxases that is almost exclusively present in Firmicutes bacteria. This work will have
important implications for the transfer of genes in Gram-positive bacteria in general and
specifically for conjugation-mediated spread of antibiotic resistance in Firmicutes bacteria
of the intestinal gut.
Introduction
Horizontal gene transfer (HGT) processes are responsible for the spread of antibiotic resis-
tance (AbR) which poses a serious economic and health problem worldwide. Conjugation,
which is the process by which a DNA element is actively transferred from a donor cell to a
recipient cell through a dedicated transportation pore connecting the cells, is the main HGT
route responsible for spreading AbR genes [1–4]. Conjugative elements can be embedded in a
bacterial chromosome or they can be present on plasmids. Several aspects of conjugation have
been studied in considerable depth over the last few decades. However, most conjugation stud-
ies concern Gram-negative (G-) bacteria. Recent studies provide evidence that the gut micro-
biome of humans and animals functions as a pool of AbR genes [5–9]. Bacteria from diverse
environments and from food enter the gut, and the enormous numbers and density of
microbes in the gut favors HGT, especially conjugation. The gut microbiome contains both G-
and Gram-positive (G+) bacteria. A large fraction of the G+ bacteria of the gut microbiome
belong to the phylum of Firmicutes [10]. Moreover, Firmicutes constitute a large fraction of
the microbiota of fermented foods, many of which also thrive in the gut, and for which it has
been shown that they can harbor AbR genes [11]. Together, this illustrates the need to better
understand the biology of plasmid-mediated conjugation in G+ bacteria in general, and in the
phylum of Firmicutes in particular.
We have been studying the low-copy number conjugative plasmid pLS20, originally isolated
from the Firmicutes bacterium B. subtilis natto strain IFO3335 that is used in the fermentation
of soybeans to produce “natto”, a popular dish in South Asia [12]. pLS20 is known to be conju-
gative in liquid media as well as on solid media [13–15]. The replication region of pLS20 has
been determined [16] and is flanked at its right side by three genes that are involved in
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 2 / 23
of the Ministry of Economy, Industry and
Competitiveness, and JRLO and CA were
supported by grant BFU2014-52070-C2-2-P of the
Ministry of Economy and Competitiveness to CA.
LJW’s work was supported by Wellcome Trust
grant WT098374AIA to Jeff Errington. PKS was
holder of a JaePre fellowship from the Spanish
Research Council (CSIC). JAH received a State
Scholarship Fund from the China Scholarship
Council. The funders had no role in study design,
data collection and analysis, decision to publish, or
preparation of the manuscript.
Competing Interests: The authors have declared
that no competing interests exist.
regulating the expression of the large conjugation operon that is located immediately down-
stream of these three genes [17,18].
The basis of the conjugation process is conserved in plasmids of G+ and G- bacteria [19–
21]. Thus, in most cases, only one DNA strand, called the T-strand, is transferred into the
recipient cell through a sophisticated, multi-component pore, -a type IV secretion system
(T4SS)-, that connects the donor and recipient cells. Initiation of the generation of the T-
strand and its delivery to the pore involves a specific nucleoprotein complex, called the relaxo-
some. The pivotal component of the relaxosome is a relaxase that recognizes and binds to spe-
cific sequences within a region of several hundred base pairs on the plasmid named the origin
of transfer (oriT). Often, the oriT region is also recognized by auxiliary proteins, which are gen-
erally encoded by the plasmid but can also involve host-encoded proteins [for review see, 21].
The relaxase cleaves the DNA in a strand- and site-specific manner at a specific position called
the nic site within the oriT region. The generated hydroxyl group at 3´-end of the nick site
functions as a primer for DNA elongation. Upon nicking, the relaxase remains covalently
attached to the 5´-end of the nicked T-strand via a tyrosine residue, and this complex is deliv-
ered to the so called T4 Coupling Protein (T4CP) that is located at the cytoplasmic side of the
transferosome and which is actively involved in the transfer of the relaxase and the attached T-
strand into the recipient cell. Besides conjugative plasmids, there is a group of plasmids that
lack the T4SS but contain a relaxase gene and an oriT, and sometimes auxiliary relaxasome
genes. These plasmids, called mobilizable plasmids, can be transferred to recipient cells when
they are co-resident with a conjugative plasmid that provides the T4SS.
The essential role of relaxases in the transfer of plasmids and other conjugative elements
gain them much attention and several relaxases have been studied at the functional, biochemi-
cal and structural level. In addition, they have been chosen as a target to develop drugs inter-
fering with its activity [22]. Based on their relationship, relaxases have been classified into six
families [23,24]. However, as for other aspects of conjugation, most of our knowledge on con-
jugative relaxases is based on those that are encoded by mobilizable or conjugative plasmids of
G- origin, or those on mobilizable plasmids of G+ origin [23,24].
Here we describe the identification of relaxosome components of the B. subtilis plasmid
pLS20. Contrary to many other plasmids, the relaxosome components of pLS20 are located
within its large conjugation operon. We identified the relaxase gene of pLS20cat, which we
name relLS20, and showed that it is essential for conjugation. We also defined the minimal func-
tional oriT region of pLS20cat, oriTLS20, and demonstrate that it contains a pronounced static
bent. We then demonstrated that RelLS20 has nicking-activity and determined the nick site in
oriTLS20. Interestingly, RelLS20 shows only very limited homology with known relaxases and
cannot be classified into one of the six existing relaxase families. Importantly, we found that
more than 800 deduced proteins present in public databases for which no function had been
attributed, and which are almost all encoded by bacteria belonging to the phylum of Firmi-
cutes, show significant homology with RelLS20. Most probably, many or all of these proteins
are relaxases, and hence RelLS20 constitutes the prototype of a new family of relaxases that we
name MOBL. Thus, besides important progress in the understanding of the relaxosome com-
ponents of pLS20, our studies have important implications for a large number of related
relaxases within the phylum Firmicutes.
Results
Identification of putative relaxase gene of pLS20cat by in silico analysis
Putative function(s) of newly sequenced genes are commonly assigned based upon similarity
of the entire or partial region(s) of the deduced protein sequence to proteins, protein domains
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 3 / 23
or signatures present in the conserved domain database (CDD) of the NCBI [25]. Surprisingly,
none of the deduced pLS20cat proteins was identified as being a putative relaxase after search-
ing pLS20cat against the CDD. We therefore screened all the putatively encoded proteins of
pLS20cat for the presence of one or more motifs that are distinctive of relaxases previously
described and classified into six MOB families, MOBP, MOBQ, MOBF, MOBC, MOBH and
MOBV [23,24], considering that these motifs could include sequences that are not covered by
the twelve relaxase-specific motifs present in the CDD. Generally, these motifs concern regions
that have a crucial role in the enzymatic activity of the protein. For instance, motif I and III
correspond respectively to regions that contain the catalytically active tyrosine(s), and the
three histidines essential for coordinating a divalent metal ion [24]. This screening led to the
identification of one candidate, gene 58, as a potential relaxase gene. Interestingly, unlike
many other conjugative plasmids, the pLS20 putative relaxase gene is not located adjacent to
the gene predicted to encode T4CPs [24], pLS20cat gene 48.
Although the overall similarity is limited, a stretch of 17 amino acids (residues 147–163) of
the deduced protein p58 sequence showed similarity to part of motifs III present in many
relaxases. Relaxase motif III embraces the “HUH” sequence that is characteristic of the HUH
endonuclease superfamily, which includes many relaxases [26]. The “HUH” sequence is essen-
tial for enzymatic activity of the protein. Within the HUH motif the “U” represents a bulky
hydrophobic residue, and “H” a Histidine. The two His residues in this motif correspond to
two of the three ligands that coordinate a divalent metal ion that is required for the enzymatic
nicking-closing activity. In the case of relaxases, the third ligand is often also a Histidine. This
third His residue is located near the HUH motif in the primary sequence and is included in
the motif III of relaxases, which therefore is also referred to as the 3H motif. Thus, except those
belonging to the MOBH or MOBC family, all other relaxases contain a motif III [24]. Fig 1
shows the similarity between residues 141–171 of the deduced pLS20cat protein p58 and the
consensus sequences of motif III regions present in relaxases belonging to the MOBP, MOBQ
and MOBV families. Interestingly, no significant similarity exists between p58 and the consen-
sus sequence of motif III of the MOBF type relaxases (see Discussion). In summary, in silicoanalyses suggested that pLS0cat gene 58 may encode a relaxase.
pLS20cat gene 58 is the prototype of a new family of genes whose
orthologs are widespread in Firmicutes bacteria
If pLS20cat gene 58 indeed encodes for a relaxase then it could be the founding member of a
novel relaxase family. To test this possibility we performed a psi-blastp search of the NCBI nr
database using the deduced protein p58 sequence as a query (see Materials and methods for
details). After removing redundant sequences this search resulted in the identification of 817
hits showing significant similarity with putative pLS20cat protein p58 (threshold E-value 1e-15, see S1 Table for a list of these hits). Interestingly, almost all of the identified sequences cor-
responded to products of (putative) genes of bacteria belonging to the phylum Firmicutes
(99.6% Firmicutes, 0.1% bacteriodetes, 0.1% tenericutes and 0.1% undefined). The vast major-
ity (99.6%) of these hits were designated as hypothetical protein. However, three hits were
labelled as (putative) relaxase: WP_014386599.1 (putative relaxase of Lactococcus garvieae21881 plasmid pGL5 [28]), WP_008381272.1 (putative relaxase of Enterococcus sp. C1 [29]),
and WP_011377372.1 (putative relaxase of Enterococcus faecium plasmid pHTβ [30]). In the
latter case, genetic evidence suggests that the corresponding gene may encode the relaxase of
plasmid pHTβ. Firstly, these observations lend further support to the view that pLS20cat gene
58 encodes a relaxase. Based on this and on results described below we name gene 58 relLS20.Secondly, these observations provide evidence that RelLS20 is the prototype of a large, new
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 4 / 23
family of relaxases, which we name MOBL, and which contains over 800 members that are
almost exclusively encoded by Firmicutes bacteria.
We next performed additional analyses to gain insights into the evolutionary relationship
between the newly identified MOBL family of relaxases and the previously described relaxase
families. The motif-identification program MEME [27] was used to identify up to 10 con-
served motifs in sequences of the MOBL members (E-value <1e-15, for details see Materials
and methods). To discriminate the motifs identified in our work from those described previ-
ously for the six MOB families [24], we refer to the MEME motifs determined in this work as
signatures. Weblogo presentations of the ten identified signatures of the MOBL type relaxases
are shown in S1A Fig. The positions of the signatures in the RelLS20 primary sequence are
given in S1B Fig.
Similarly, we analysed the relaxases of the six previously defined MOB families and identi-
fied up to ten signatures in each family (see S2 Fig). We then used the Motif Alignment and
Search Tool (MAST) of the MEME suite to determine the presence of (i) MOBL signatures in
the relaxases of the other six MOB families (S2 Table), and (ii) signatures of those six MOB
families in members of the MOBL family (S3 Table). None of the MOBL signatures were
detected in members of the MOBC and MOBH families. As explained above, MOBL signature
1 -containing the conserved histidines-, shares similarity with motif III regions of MOBQ,
MOBV and MOBP type relaxases (see Fig 1). Not surprisingly therefore, the MOBL signature 1
was detected in a substantial number of relaxases of these MOB families but not in the mem-
bers of the other three families. Importantly, other MOBL signatures were either not detected
or detected in only a limited number of relaxases from the six existing MOB families (see S2
Table). A similar tendency was observed in the reciprocal analyses (see S3 Table). Cross
Fig 1. A 17 residue region of the deduced protein sequence of pLS20cat gene 58 shows similarity to the motif III
regions of MOBP, MOBQ and MOBV type relaxases. pLS20cat has been sequenced independently in our lab and in the
lab of M. Itaya (Keio University, Japan) who deposited the sequence in public databases where it was given the accession
number NC_015148.1. pLS20cat protein p58, according to our nomenclature, has been assigned the accession number
YP_004243525.1. Residues 141–171 of the deduced pLS20cat protein p58 are aligned with the consensus sequences,
represented as Weblogos, of the motif III region of relaxases belonging to the MOBP, MOBQ and MOBV family. The
consensus sequence of the motif III regions of the published relaxases of each MOB family [24] was identified here with the
motif-identification program MEME [27]. The position of the HUH signature is given at the top. Residues of the deduced
pLS20cat p58 sequence are highlighted against a light grey, dark grey and black background when they are conserved
with respect to the consensus sequence present in one, two or three of the MOB families, respectively.
doi:10.1371/journal.pgen.1006586.g001
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 5 / 23
detection of signatures was most frequently observed between the MOBL and MOBP families.
Altogether, these data corroborate the view that RelLS20 and the identified related (putative)
proteins constitute a new MOB family, and that MOBL is closest related to the MOBP family of
relaxases.
Members of the MOBL family can be divided into two clades
RelLS20 contains eight out of the ten MOBL signatures (S1B Fig). Most other MOBL members
also contain most but not all MOBL signatures, indicating that the MOBL family is composed
of different clades. To test this statistically we calculated a phylogenetic tree using neighbor-
joining analysis, applying bootstrap values of 1000 replicates (see Materials and methods, S3
Fig). The phylogenetic tree obtained shows that the MOBL members can be divided into two
clades, and that RelLS20 belongs to clade 1.
Bioinformatic analyses were performed to study whether, besides RelLS20, other MOBL
members are encoded by a plasmid. For this we identified all circular plasmids deposited in
public databases for which the complete sequence has been determined (see Materials and
methods). This resulted in a database of 10,904 plasmids. Next, the sequences of the 817 MOBL
relaxases were used as query to search the translated sequences of the generated plasmid data-
base for similarity with stringent settings (>95% similarity over >80% of the entire protein
sequence). This analysis indicated that at least 306 of the 817 MOBL members are located on a
plasmid. Interestingly, 98% of identified plasmid-located mobL genes (300 genes) correspond
to members classified in clade 1. Most of the DNA sequences currently present in databases
was generated from metagenomic or by shotgun sequencing approaches, which do not dis-
criminate between chromosomal or extrachromosomal DNA. It is therefore plausible that
(many) more clade 1 MOBL genes are located on plasmids. To gain insight into the nature of
the relaxases belonging to MOBL clade 2, we performed in silico analysis and manually
inspected DNA regions surrounding these genes. However, we were not able to draw firm con-
clusions about the nature of the clade 2 MOBL members, largely because very little functional
information is available about these genes.
In summary, members of the newly identified MOBL family can be subdivided into two
clades and many members belonging to clade 1 are located on plasmids. Strikingly, all these
genes are almost exclusively present in the phylum Firmicutes.
pLS20cat gene 58 is essential for conjugation
Since processing of DNA to produce the T-DNA strand that is transferred into the recipient
cell is a vital step in the conjugation process, it was expected that gene 58 would be essential for
conjugation if it encoded the relaxase of pLS20cat. Gene 58 is translationally coupled to the pre-
ceding gene 57, which in turn is closely linked to gene 56 [17]. Therefore, we constructed a dele-
tion derivative of pLS20cat lacking these three genes (see Materials and methods) and named
the resulting plasmid pLS20Δ56–58. Next, B. subtilis 168 containing pLS20cat (strain PKS11) or
pLS20Δ56–58 (strain GR149) were used as donor cells to determine their conjugation efficien-
cies using a standard protocol (see Materials and methods). The efficiency of conjugation
observed for pLS20cat was in the range of 10−3, which is similar to values reported previously
under these conditions [17,18]. However, no transconjugants were observed for pLS20Δ56–58
(see Table 1). These results suggested that at least one of the genes of 56 to 58 is required for
conjugation. However, it was also possible that the deleted region contained the oriT, which is
essential for conjugation. To rule out this possibility, we re-introduced genes 56–58, under the
control of an IPTG-inducible Pspank promoter, into strain GR149 (harbouring pLS20Δ56–58) at
the amyE locus. As expected, no transconjugants were obtained when the resulting strain,
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 6 / 23
GR150, was grown in the absence of IPTG. Importantly, conjugation efficiencies similar to
those obtained with pLS20cat were obtained for pLS20Δ56–58 when GR150 was grown in the
presence of IPTG (see Table 1). These results demonstrated therefore that one or more of the
pLS20cat genes 56, 57 and 58 were required for conjugation. In addition, the results showed
that (i) pLS20Δ56–58 contains a functional oriT that hence must be located outside the deleted
DNA fragment spanning genes 56–58, and (ii) that genes 56–58 can function in trans.To determine whether gene 58 is required for conjugation we constructed strain GR206
that contains genes 56 and 57 (but not 58) under the control of the Pspank promoter at the
amyE locus and harbouring pLS20Δ56–58. No transconjugants were obtained when this strain
was used as donor in conjugation experiments, regardless of whether cells were grown in the
presence or absence of IPTG. These results showed that pLS20cat gene 58 was required for
conjugation, as would be expected for a relaxase gene.
The origin of transfer region of pLS20cat, oriTLS20, is located upstream of
gene 56 and is intrinsically bent
The results presented above show that RelLS20 can function in trans. Consequently, it was
expected that pLS20cat could mobilize a compatible plasmid containing the oriT region of
pLS20cat but lacking the relaxase gene. We used this strategy to identify the oriT region of
pLS20cat, which we name oriTLS20 henceforth. Thus, we engineered the oriT screening vector
pUCTA2501 and screened derivatives containing regions of pLS20 for their ability to be mobi-
lized by pLS20cat. As expected, the empty vector pUCTA2501 was not mobilized by pLS20cat
(see Table 2). The oriT regions are often located upstream of the relaxase gene [21,23], and the
results presented above showed that pLS20Δ56–58 contained a functional oriT. We therefore
first tested a 1.75 kb fragment, named Fragment 1, which encompasses the 3´-region of
pLS20cat gene 55 till the middle of gene 57 (see Fig 2A for a schematic representation). For
this, Fragment 1 was cloned in pUCTA2501 resulting in plasmids pGR8A and pGR8B (“A”
and “B” correspond to the orientation of the cloned fragment tested). As shown in Table 2 and
schematically in Fig 2A, pLS20cat was able to mobilize both plasmids pGR8A and B, indicating
that oriTLS20 is located on the 1.75 kb Fragment 1. In addition, these results indicated that
RelLS20 can act on oriTLS20 regardless of its orientation in the plasmid. To delineate the oriTLS20
region further several internal regions of Fragment 1 were sub-cloned into pUCTA2501 and
the resulting derivatives were tested for their ability to be mobilized by pLS20cat. Using this
approach the oriTLS20 was delineated to a region of 362 bp (Fragment 6 present in vectors
pGR16A/B) that corresponds to the intergenic region between genes 55–56 (see Table 2 and
Fig 2A).
Table 1. Conjugation efficiencies of pLS20cat and pLS20Δ56–58 in different backgrounds.
Strain Genotype Plasmid Inductor Conjugation efficiency *
PKS11 168 pLS20cat - 8.3x10-3
GR149 168 pLS20Δ56–58 - < 10−7
GR150 168, amyE::Pspank 56–58 pLS20Δ56–58 - < 10−7
+ 1.4x10-3
GR206 168, amyE::Pspank 56–57 pLS20Δ56–58 - < 10−7
+ < 10−7
*: Conjugation efficiencies are calculated as transconjugants/donor, and correspond to the mean value of at least three independent experiments. When
indicated, the inductor IPTG was added at a final concentration of 1 mM.
doi:10.1371/journal.pgen.1006586.t001
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 7 / 23
The topology of DNA, including intrinsic bents, can have major effects on the function of
proteins that bind or process DNA [31,32]. The oriT regions often contain repeated sequences,
and several oriT regions from conjugative elements of Gram-negative bacteria have been dem-
onstrated to contain an intrinsic bent which is believed to be important for optimal binding
and functionality of the relaxosome proteins [for review see, 21]. Fig 2B shows that oriTLS20
also contains several direct and inverted repeated sequences. In silico analysis of the entire
pLS20cat sequence predicted the presence of a static bent near oriTLS20 (S4 Fig). To test
whether oriTLS20 is intrinsically bent we performed circular permutation assays. Thus, eight
DNA fragments, - F1-F8, each 600 bp long and corresponding to different positions of the
oriTLS20 (see Fig 2C) -, were generated and their migration in 2% agarose or 8% native PAA
gels was tested. As expected, identical migration positions were observed when the fragments
were run in agarose gel (Fig 2D). However, when run on the native PAA gel clear differences
in migration were observed between the fragments, demonstrating that the oriTLS20 region is
intrinsically bent (Fig 2E). Taking into account that migration is most affected when the bent
is located in the middle of a DNA fragment [33], these results suggest that the bent is located
towards the 5´ region of oriTLS20.
RelLS20 forms monomers in solution
To characterize the relaxase of pLS20cat in vitro, we purified RelLS20 (Mw 50,329 Da) fused to a
His(6) tag at its C-terminus from E. coli. We first determined its oligomerization state using
two complementary analytical ultracentrifugation approaches, i.e. sedimentation velocity (SV)
and sedimentation equilibrium (SE) (Fig 3), as well as dynamic light scattering (DLS) experi-
ments using the same experimental conditions.
More than 95% of the RelLS20 absorbance in SV experiments corresponded to a narrow
peak with an experimental sedimentation coefficient of 2.8 S (s20,w = 3.3 S), matching the theo-
retical behaviour of the moderately elongated protein monomer (f/f0 = 1.5) (Fig 3A). DLS anal-
ysis gave a D value of 56.7 ± 0.8 μm2/s. Applying the obtained D and S values for RelLS20 to the
Table 2. Mobilization efficiencies of oriT-screening vector pUCTA2501 and derivatives.
Strain Plasmid § Region cloned (bp) Mobilization efficiency *
GR104 pUCTA2501 < 10−7
GR124 pGR8A 1739 3.3X10-5
GR183 pGR8B 1739 1.2X10-4
GR114 pGR10A 849 -< 10−7
GR121 pGR10B 849 < 10−7
GR115 pGR12A 949 < 10−7
GR122 pGR12B 949 < 10−7
GR184 pGR20A 1152 4.5X10-5
GR140 pGR20B 1152 1.7X10-4
GR139 pGR22A 949 6.8X10-5
GR185 pGR22B 949 1.2X10-4
GR137 pGR16A 362 2.0X10-5
GR138 pGR16B 362 1.5X10-4
§: Besides the plasmid mentioned, all strains contained pLS20cat.
*: Mobilization efficiencies are calculated as Em-resistant transconjugants/donor using strain PS110 as recipient strain. Mobilization efficiencies are the
mean value of at least three independent experiments. The “A” and “B” extensions of the pUCTA2501 derivatives reflect different orientations of the same
insert.
doi:10.1371/journal.pgen.1006586.t002
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 8 / 23
Svedberg equation resulted in an apparent molecular mass of 47,760 Da, which is close to the
theoretical monomer molecular mass (from amino acid composition). In SE experiments a cal-
culated average molecular mass of 43,200 ± 300 Da was obtained, a value slightly lower than
the monomer molecular mass. The experimental data adjust very well to the best-fit line
assuming RelLS20 being a monomer (Fig 3B). In summary, the outcome of three complemen-
tary experimental approaches showed that RelLS20 is monomeric in solution.
Biochemical analyses of RelLS20 and determination of the nic site
Relaxases introduce a site- and strand-specific cut at the so-called nic site within the oriTregion and remains covalently attached to the 5´ end via a tyrosine residue. This results in
relaxation of the covalently closed circular form of the plasmid. To provide conclusive evi-
dence that pLS20cat gene 58 encodes a relaxase we analyzed samples of the oriTLS20-containing
plasmid pGR16B on an agarose gel after being incubated in a buffer with or without purified
RelLS20. As shown in Fig 4A, a fraction of the plasmid migrated to a higher position in the gel
compared to the RelLS20-untreated sample, showing that RelLS20 had relaxed a portion of the
plasmid molecules. RelLS20 nicked pGR16B in a concentration dependent manner (S5A Fig),
but it did not or hardly nicked the control plasmid pUCTA2501 lacking oriTLS20 (S5B Fig).
Fig 2. Determination and characteristics of oriTLS20. (A) Schematic overview of pLS20cat regions analyzed for the presence of a
functional oriT region. The corresponding pLS20cat region and position of genes 55, 56 and 57 are presented on the top. Numbers
correspond to length in base pairs. The regions cloned are indicated by horizontal bars. The names of the fragments and the plasmids, and
their ability to be mobilized are indicated at the right. “A” and “B” correspond to the orientation of the cloned fragment. -, + and ++:
mobilization frequencies <10−7, in the order of 10−5, and 10−4, respectively. (B) Features of the oriTLS20 region. The sequence of the 362 bp
oriTLS20 region is presented along with identified features. Inverted repeated sequences are indicated with red arrows. Identical or nearly
identical direct repeated sequences are boxed in green, orange or purple. Note that the sequence with consensus 5´-TGTGCAT´-3 is
present three times. The palindromic sequence 5´-TGGTACCA-3´ is indicated with two converging arrowheads. The 30-bp region with a
GC-content of 73% is highlighted with a blue-lined box. Numbering of the fragment is given on the right. The determined nic site is indicated
with a lightning symbol. (C) Schematic representation of the 1,300 bp pLS20cat region containing oriTLS20 (blue box) studied for the
presence of a static bent. The lower part indicates the positions of the 600 bp DNA fragments F1 to F8 with respect to oriTLS20, and which
were subjected to electrophoresis in gels of 2% agarose (D) or 8% native PAA (E). After electrophoresis, the agarose and PAA gels were
stained with ethidium bromide.
doi:10.1371/journal.pgen.1006586.g002
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 9 / 23
Plasmids pGR16A and pGR16B gave similar nicking efficiencies, indicating that the nicking
reaction was independent of the orientation of oriTLS0 (S5C Fig). RelLS20 did not relax the plas-
mid, however, when the reaction mixture contained the chelating agent EDTA (S5D Fig), sup-
porting the view that RelLS20 requires divalent cation(s) like Mg for the nicking reaction, as has
been observed for other relaxases containing a Histidine triad [19,24, for review see, 26].
Relaxases are often multidomain proteins. Without exception the relaxase domain is
located in the N-terminal region of the protein. To test if this also applies to RelLS20 we purified
its N-terminal region (residues 1–232, which includes the predicted Histidine triad) fused to a
His(6) tag from E. coli, and used this protein, named N-RelLS20, in nicking reactions. As shown
in S5E Fig, N-RelLS20 relaxed part of the oriTLS20 containing plasmid pGR16B, but not the
parental vector pUCTA2501 lacking oriTLS20 indicating that the N-RelLS20 domain contains
nicking activity.
In all cases studied, the catalytic residue present in the relaxase domain that makes a tran-
sient 5´ phosphotyrosine bond with the nucleotide at the nick site corresponds to a tyrosine
[for review see, 26]. Based on the following observation, residue Tyr26 might correspond to
the catalytic tyrosine residue of RelLS20. The catalytic Tyr residue of MOBP family relaxases is
present in MOBP signature 5. MAST analyses showed that this MOBP signature is conserved
in 7% of the MOBL relaxases (S3 Table). RelLS20 is one of the MOBL members containing the
MOBP signature 5. The position of the MOBP signature 5 in RelLS20 overlaps with that of
MOBL signature 5 (it is a coincidence that both signatures are named signature 5). An align-
ment of these two signatures suggests that the catalytic Tyr residue of MOBP type relaxases
corresponds to RelLS20 residue Tyr26 and it shows that this tyrosine residue is almost fully con-
served in the RelLS20-related putative relaxases (see Fig 4B). To obtain evidence that Tyr26 is
the catalytic tyrosine of RelLS20 we engineered a mutant in which codon 26 encodes for a phe-
nylalanine, and used the purified mutant, RelLS20-Y26F, in nicking reactions. No substantial
nicking activity was observed for the Y26F mutant (S5F Fig) supporting the view that Y26 is
the active Tyrosine.
To locate the position of the nick site within oriTLS20, we prepared samples of pGR16B that
were treated with or without RelLS20 and subsequently used these in DNA sequencing
Fig 3. Analytical ultracentrifugation analyses showed that RelLS20 is monomeric in solution. Purified
RelLS20 (12 μM) was studied by sedimentation velocity (SV) and sedimentation equilibrium (SE). Plot (A)
represents the sedimentation coefficient distribution c(s) profile obtained by SV data analysis. Graph (B)
shows the experimental data obtained by SE assays (empty circles) and best-fit analysis considering a protein
monomer (black line) and dimer (dashed line) species model. Lower part represents the difference between
experimental data and the best fit to a single species model (residuals).
doi:10.1371/journal.pgen.1006586.g003
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 10 / 23
reactions with convergently oriented primers 28 (forward) or 29 (reverse) that hybridize to
regions flanking the oriTLS20 (Fig 4C). Very similar signal intensities were obtained in both
reactions using the reverse primer 29. However, in the case of primer 28 the intensity of the
signal dropped sharply in the RelLS20-treated DNA sample after the sequence
5´-GAATCGGGGGCCGG-3´. Most likely this drop in signal intensity was due to a nick at
this position in the template strand for primer 28.
The nic site is flanked by a 30 bp stretch of high GC content (73%) and coincides with the
beginning of the inverted repeated sequence 5´-GGCTTTTTGCtgccGCAAAAAGCC-3´ (see
Fig 2B). Thus, RelLS20 introduces a nick at a position flanking an inverted repeat in the tem-
plate strand of the conjugation genes. Implicitly, these results demonstrate that pLS20cat gene
58 encodes the relaxase RelLS20. Fig 2A shows that the mobilization efficiency of the fragments
endowing mobility to the pUCTA2501 derivative was 5 to 10 fold higher when cloned in
Fig 4. Determination of the catalytically active tyrosine residue of RelLS20, and the position of the nick site within oriTLS20. (A)
Plasmid pGR16B containing oriTLS20 was incubated in a buffer lacking (-) or containing (+) purified RelLS20. Next, both samples were treated
with proteinase K and DNA was run on a 0.8% agarose gel. The positions of supercoiled (sc) and nicked open circular DNA (oc) are
indicated. (B) The N-terminal 60 residues of RelLS20 are aligned with the consensus sequences (presented as weblogos) of signatures 5 of
the MOBL and MOBP type relaxases. The position of the catalytic Tyrosine residue of MOBP type relaxases is indicated with a vertical arrow.
(C) Samples of pGR16B with (RelLS20) or without (control) prior incubation with RelLS20 were used as template DNA in sequencing
reactions using forward primer 28 or reverse primer 29. The area of grey background, corresponding to the signal strength obtained with the
control sample, is duplicated and overlaid to the same positions of the chromatogram of the RelLS20-treated sample to highlight the drop in
signal intensity. Note that the intensities of the signals are very similar for other regions. The residual signal observed after the drop in signal
intensity is due to the presence of low amounts of non-nicked plasmid DNA in the sample.
doi:10.1371/journal.pgen.1006586.g004
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 11 / 23
orientation “B” compared to orientation “A”. The orientation of an oriT sequence determines
the DNA strand that is transferred into the recipient cell, implying that one pUCTA2501 DNA
strand is established better in the recipient than the other. As explained in the discussion, this
observation suggests that RelLS20 nicks the template strand of the conjugation genes, which is
in line with the results presented in Fig 4C.
Discussion
Here we have identified the cis-acting origin-of-transfer region oriTLS20 and the relaxase gene
encoding the trans-acting RelLS20 protein from pLS20. The relaxosome module is embedded
within the large conjugation operon that encompasses genes 28 to 74 [17]. The results obtained
have provided a better understanding of the relaxosome components present on Gram-posi-
tive mobile elements in general and particularly those belonging to the phylum Firmicutes.
oriTLS20
We have delineated oriTLS20 to a region of 362 bp, which is intrinsically bent. Interestingly,
about 10-fold higher mobilization efficiencies were obtained for pUCTA2501 derivatives in
which oriTLS20 was cloned in orientation “B” compared to orientation “A” (see Table 2). Most
probably this difference is due to the following. Relaxases cleave DNA at a site- and strand-spe-
cific position, and the nicked strand is transferred into the recipient cell. Consequently, the ori-
entation of oriT dictates which DNA strand is transferred into the recipient cell. The
replication protein of rolling-circle plasmids, which is related to relaxases, initiate a novel
round of replication by introducing a site- and strand-specific nick in the plasmid at the so
called double-strand origin (DSO). The 3´-end of the nicked site is used for DNA elongation,
which is coupled to displacement of the replicated strand. To complete one round of replica-
tion the fully displaced circular ssDNA replication intermediate has to be converted into
dsDNA. Efficient ss to dsDNA conversion starts at a specific region called single strand origin
(SSO). Several families of SSOs are known, and they all have in common that they are only
functional in only one orientation [15]. Since the orientation of oriT dictates the DNA strand
that is transferred to the recipient cell, only in one orientation of oriT the transferred ssDNA
strand will have the SSO in its functional orientation, resulting in optimal conversion of the
ssDNA to double stranded plasmid DNA. The relation between functionality of an SSO and
mobilization efficiency has been reported for other rolling-circle plasmids [34]. According to
this reasoning, the nic site would be located in the template strand of the conjugation genes.
This was indeed confirmed biochemically (Fig 4). oriT´s can be located at distinct positions
with respect to the relaxase gene. In the case of pLS20cat the oriT and the relaxase are separated
by a region of 865 bp containing two putative genes.
The relaxase RelLS20
The relaxase of pLS20cat was not identified by performing standard Blast searches. However,
we did identify a stretch of 17 residues in the protein encoded by gene 58 that encompasses a
putative HUH motif present in the superfamily of the so-called HUH endonucleases that
includes relaxases [26]. We were able to show that gene 58 is required for conjugation, that
RelLS20 can act in trans, and we determined the RelLS20-mediated strand and site-specific nick
site within the oriTLS20 region. Together, these results show that gene 58 encodes the relaxase
gene of pLS20cat, which we named relLS20. We also provided evidence that the nicking activity
resides in the N-terminal domain of RelLS20 and that tyrosine 26 constitutes the catalytic resi-
due responsible for making a transient 5´ phosphotyrosine bond with the nucleotide at the
nick site.
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 12 / 23
RelLS20 is the prototype of a novel relaxase family, MOBL, that contains a
large number of relaxases which are almost exclusively encoded in
bacteria belonging to the phylum Firmicutes
By performing psiblast searches (7 iterations) we identified 817 sequences in public databases
that may encode proteins showing significant homology with RelLS20 (cutoff 1e-15), and we
classified these relaxases into a new family named MOBL. Based on neighbor-joining analysis,
the 817 (putative) relaxases can be subdivided into 2 clades (see S3 Fig) and a large fraction of
the clade 1 mobL relaxase genes are located on a plasmid.
Interestingly, four relaxases identified in our database screens as members of the MOBL
family, had been classified before in the MOBP family. These four members concern NCBI ref-
erence accession numbers: WP_011264114.1 (encoded by plasmid pBCNF5603 of Clostridiumperfringens F5603), WP_010968268.1 (encoded by plasmid pCP13 of C. perfringens strain 13),
WP_011377372.1 (encoded by conjugative plasmid pHTβ of E. faecium) [30], and
WP_012477521.1 (encoded by the 21 kb pEF1 plasmid of E. faecium 6T1a) [35]. When these
relaxases were classified in 2009 it was mentioned that “the phylogenetic position of these pro-teins is out of the clades of the MOBP tree described in this work” [24]. Therefore, we propose
that these four relaxases belong to the MOBL family defined here.
Previously, a scheme depicting the relationship between the main relaxase families has been
presented [24]. In this scheme the MOBQ, MOBV and MOBP families form a large cluster. The
other MOB families, including protein families having nicking-closing activities that overlap
with some of these MOB families, have been placed into three other groups. We identified
RelLS20 based on the presence of a putative 3-histidine motif, and sequences encompassing this
motif (MOBL signature 1) are highly conserved among the MOBL relaxases. A 3-histidine
motif is also present in relaxases belonging to the MOBF, MOBQ, MOBV and MOBP families.
Using the Motif and Alignment tool MAST, sequences showing significant similarity to MOBL
signature 1 were identified in a number of relaxases of the MOBQ, MOBV and MOBP families
where they overlap with the corresponding 3-histidine motif. However, MOBL signature 1
does not show significant similarity with the 3-histidine motif of relaxases of the MOBF family.
This indicates that the MOBL family is more closely related to the MOBQ, MOBV and MOBP
families. Additional analyses revealed that the MOBL family is most closely related to the
MOBP family. Based on this we propose the MOBL family being a new member of the previ-
ously defined cluster including the MOBQ, MOBV and MOBP families. An updated scheme of
the relationships between the main relaxase protein families is presented in Fig 5.
Relaxases belonging to the previously defined six MOB families are generally present in
multiple bacterial phyla [24]. An important finding of this work is that the vast majority
(98.2%) of the MOBL relaxases are present in bacteria belonging to the phylum of Firmicutes.
It is therefore likely that the MOBL type relaxases play a prominent role in horizontal gene
transfer in Firmicutes bacteria. The finding that many of the MOBL relaxases identified here
are located on plasmids supports this assumption.
Firmicutes bacteria can be found in a wide variety of habitats. One of the habitats in which
Firmicutes are notably present is the gastrointestinal tract of mammals including humans, and
evidence is accumulating that a balanced gut microbiome plays important roles in the health
of these organisms [36]. Dysbiosis in the gut microbiome composition as a consequence of an
oral antibiotic treatment can have substantial consequences [37]. The effects can be worsened
when the gut microbiome contains bacteria that are resistant to the antibiotic treatment, espe-
cially when the resistant bacteria concern (opportunistic) pathogens. In the recent years evi-
dence has been presented that the number of antibiotic resistance genes in the gut microbiome
is increasing and that this pool of resistance genes may contribute to future emergence and
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 13 / 23
spread of antibiotic resistance in (human) pathogens [5,6]. Conjugation is considered the
major mechanism responsible for antibiotic resistance in general. Together, this warrants a
better understanding of the conjugative elements present in the gut microbiome. Relaxases are
essential for conjugation. Therefore, they form a target to interfere with conjugation mediated
spread of antibiotic resistance [22,38]. Moreover, once the essential relaxase gene has been
identified the related conjugative element can be relatively easily recognized and studied.
Therefore, the identification in this work of more than 800 (putative) relaxase genes present
almost exclusively in bacteria of the phylum Firmicutes is likely to have significant impacts on
the identification and analysis of mobile elements in this phylum of bacteria that is largely rep-
resented in mammalian gut microbiota.
Fig 5. Schematic presentation of the relationships between the main relaxase protein families. The MOBQ, MOBV,
MOBP, and MOBL families belong to one large cluster (shown on a grey background). The other MOB families are divided
into three independent groups. Note that these overlap with other protein families having DNA nicking-closing activities
such as the replication proteins of rolling-circle plasmids (REP), IS91-like transposases (IS91) or HD hydrolases. The
MOBF family of relaxases contain a motif III that is also present in relaxases of the MOBQ, MOBP, MOBV and MOBL
families. MOBP includes relaxases that are also classified as MOBHEN.
doi:10.1371/journal.pgen.1006586.g005
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 14 / 23
Materials and methods
Bacterial strains, plasmids, media and oligonucleotides
Escherichia coli and B. subtilis strains were grown in Luria-Bertani (LB) liquid medium or on
1.5% LB agar plates. When appropriate, media were supplemented with the following antibiot-
ics: ampicillin (100 μg/ml), erythromycin (1 and 150 μg/ml in B. subtilis and E. coli, respec-
tively), chloramphenicol (5 μg/ml), spectinomycin (100 μg/ml), and kanamycin (10 and 30 μg/
ml in B. subtilis and E. coli, respectively). B. subtilis strains used were isogenic with B. subtilisstrain 168 and are listed in S4 Table. Plasmids and oligonucleotides used are listed in S5 and S6
Tables, respectively. All oligos were purchased from Isogen Life Science, The Netherlands.
Transformation
E. coli cells were transformed using standard methods [39]. Preparation of competent B. subti-lis cells and transformation were carried out as described before [40]. Transformants were
selected on LB agar plates with appropriate antibiotics. pLS20cat encodes a protein, RokLS20,
that inhibits the development of competence by repressing comK, the key transcriptional acti-
vator of competence genes [41]. Therefore, to manipulate genes on pLS20cat we prepared
competent cells of a pLS20cat-harboring strain that contains a chromosomal Pxyl-comK fusion
(PKS56) using a standard protocol [41].
Construction of plasmids and strains
The correctness of sequences of all cloned PCR fragments was confirmed by sequence analysis.
Amplification by PCR of pLS20cat regions was performed using total DNA isolated from
pLS20cat harboring strain PKS11 as template. Plasmid pLS20Δ56–58 was constructed by
replacing genes 56, 57 and 58 with the kanamycin gene of plasmid pBEST501 as follows. The
upstream region of gene 56 (PCR-UP-56) and the downstream region of gene 58 (PCR-DN-
58) were amplified by PCR using pLS20cat as template DNA in combination with primer sets
oGR56/oGR57 and oGR58/oGR59, respectively. The resulting PCR fragments were digested
with BamHI and SalI. Plasmid pBEST501 was digested with BamHI and SalI and the resulting
800 bp fragment containing the kanamycin gene was purified. Next, the DNA fragments corre-
sponding to PCR-UP-56, PCR-DN-58, and the kanamycin gene were used to prepare a liga-
tion mixture. After ligation, this mixture was used to transform competent cells of pLS20cat-
containing B. subtilis cells of strain PKS56. Deletion of genes 56–58 of kanamycin-resistant
transformants was first confirmed by PCR using primers Ori-UP and Ori-Dn. Next, the
absence of mutations in the PCR amplified regions was confirmed by sequence analysis, which
in addition affirmed replacement of genes 56–58 by the kanamycin resistance gene. The result-
ing strain containing plasmid pLS20Δ56–58 was named GR148. Finally, pLS20Δ56–58 was iso-
lated from strain GR148 and used to transform competent cells of the appropriate strain.
The following strategy was used to clone the pLS20cat genes 56, 57 and 58 behind the
IPTG-inducible Pspank promoter in the B. subtilis amyE integration vector pDR110. The genes
were amplified by PCR using primers oGR43 and oGR60. After purification, the PCR frag-
ment was digested with NheI and SphI and then ligated with the vector pDR110 cut with the
same enzymes. Plasmid DNA of the constructed vector pGR27 was isolated from E. coli cells
and then used to transform competent B. subtilis cells. Double-crossover integration into the
chromosome was checked by the loss of amylase activity. A similar strategy was used to con-
struct a B. subtilis strain GR206 in which pLS20cat genes 56 and 57 were placed under the con-
trol of the Pspank promoter at the chromosomal amyE locus. In this case, the genes were
amplified by PCR using primers oGR27 and oGR133. The purified PCR fragment was digested
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 15 / 23
with SpeI and SphI and then ligated with the vector pDR110 cut with NheI and SphI. Plasmid
DNA of the constructed vector pGR52 was isolated from E. coli cells and then used to trans-
form competent B. subtilis cells. Double-crossover integration into the chromosome was
checked by the loss of amylase activity.
The E. coli/B. subtilis shuttle vector pUCTA2501 contains the replication functions of the
cryptic B. subtilis rolling-circle plasmid pTA1015 [15] as well as the erythromycin resistance
gene of pE194. The following strategy was used to create pUCTA2501 derivatives containing a
fragment of plasmid pLS20cat. First, the desired pLS20cat region was amplified by PCR using
appropriate primers. Next, after the purified PCR fragments had been digested with XbaI, the
fragments were ligated to the vector pUCTA2501 linearized with XbaI, and the resulting liga-
tion mixture was used to transform competent E. coli XL1-blue cells. The correctness and the
orientation of the inserts were checked by sequencing. The names of the pUCTA2501 deriva-
tives and their characteristics are listed in S5 Table. Each pUCTA2501 derivative plasmid was
then used to transform competent B. subtilis 168 cells. The presence of the plasmid in erythro-
mycin-resistant transformants was confirmed by PCR and pLS20cat was introduced into the
strain by conjugation using strain PKS11 as donor. The resulting strains are listed in S4 Table.
The pLS20cat gene 58 (relLS20) was cloned as follows in the E. coli expression vector pET28b
+ to generate a fusion gene containing a C-terminal his(6) extension. RelLS20 was amplified with
primerset oWM001-oWM002. The resulting PCR fragment was digested with NcoI and SalIand cloned in pET28b+ digested with the same restriction enzymes. The resulting pET28b
+ derivative named pAND83, was constructed using E. coli strain XL-blue1. Once verified its
correctness, the plasmid was transformed into E. coli strain BL21(DE3). Using the same strat-
egy, the N-terminal region of pLS20cat gene 58 (N-relLS20) was cloned in the pET28b+ vector.
In this case, N-relLS20 was amplified with primerset 0WM001-oWM003. The derivative of
relLS20 in which Tyrosine codon 26 was mutated to encode a Phenylalaline (relLS20-Y26F) was
constructed as follows. A DNA fragment containing the desired mutation was generated by
PCR using pAND83 as template and primerset oWM001A-oWM002. The resulting DNA frag-
ment was then extended at its 5´ end by two PCR reactions. First, the PCR product was used as
template in a PCR reaction with primerset oWM001B-oWM002; and second, this PCR prod-
uct served as template for the final PCR reaction using primerset oWM001-oWM002. Next,
the PCR fragment was cloned into pET28b+ using the same strategy as that used for cloning
the wild type relLS20.
Conjugation/mobilization assays
Conjugation was carried out in liquid medium as described previously [17]. The effect of
ectopic expression of a given gene placed under the control of the inducible Pspank promoter
on conjugation was studied by adding the inducer (1 mM IPTG) to prewarmed LB medium
used to dilute overnight cultures of the donor cells.
Analytical ultracentrifugation experiments
Sedimentation velocity assays (SV). Samples at concentrations ranging from 12 to
48 μM, in 20 mM Tris, 500 mM NaCl, 10 mM MgCl2, 1 mM EDTA, 0.1 mM β-Mercaptoetha-
nol and 1% glycerol, pH 7.4, were loaded (320 μL) into analytical ultracentrifugation cells. The
experiments were carried out at 48,000 rpm in a XL-I analytical ultracentrifuge (Beckman-
Coulter Inc.) equipped with both UV-VIS absorbance and Raleigh interference detection sys-
tems, using an An-50Ti rotor, and 12 mm Epon-charcoal standard double-sector centerpieces.
Sedimentation profiles were recorded at 230 nm. Differential sedimentation coefficient distri-
butions were calculated by least-squares boundary modelling of sedimentation velocity data
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 16 / 23
using the continuous distribution c(s) Lamm equation model as implemented by SEDFIT [42].
These s values were corrected to standard conditions (water, 20˚C, and infinite dilution) using
the program SEDNTERP [43] to obtain the corresponding standard s values (s20,w).
Sedimentation equilibrium assays (SE). SE experiments of RelLS20 samples were carried
out using short columns (100 μL) at speeds ranging from 12,000 to 15,000 rpm and at two
wavelengths (230 and 280 nm), using the same experimental conditions and instrument as in
the SV experiments. After the last equilibrium scan, a high-speed centrifugation run (48,000
rpm) was done to obtain the corresponding baseline offsets. Weight-average buoyant molecu-
lar weights of protein were determined by fitting a single species model to the experimental
data using the HeteroAnalysis program [44] and corrected for solvent composition and tem-
perature with the program SEDNTERP [43].
Dynamic light scattering assays (DLS). DLS experiments were carried out in a Protein
Solutions DynaPro MS/X instrument (Protein Solutions, Piscataway, NJ) at 20˚C using a 90˚
light scattering cuvette. Prior to measurements, samples at the same experimental conditions
used in SV and SE, were centrifuged for 10 min at 12,000g and 4˚C. Data were collected and
analyzed with Dynamics V6 Software.
Estimate of molar mass of RelLS20 from hydrodynamic measurements. The apparent
molar mass of a single sedimenting solute species (M) may be calculated using measured val-
ues of the sedimentation coefficient s and the diffusion coefficient D according to the Svedberg
equation [45],
M ¼RTs
ð1 � �nrÞD
where T, R, and ρ stand for the absolute temperature, the universal gas constant and the den-
sity of the solution, respectively. The estimate of molar mass obtained via this relation is inde-
pendent of the shape of the sedimenting/diffusing species, as frictional coefficients for
sedimentation and diffusion cancel in the derivation of the equation.
Overexpression and purification of recombinant RelLS20 and derivatives
containing a C-terminal His(6) tag
A recombinant version of RelLS20 was expressed and purified as follows. E. coli BL21(DE3)
cells containing plasmid pAND83 (relLS20His(6)) were inoculated in fresh LB media comple-
mented with 30 μg/ml kanamycin and grown at 37˚C with shaking (200 rpm). At an OD600 of
about 0.6, IPTG was added to a final concentration of 1 mM to induce the recombinant pro-
tein and growth was continued for 2 h. Next, cells were collected by centrifugation and pro-
cessed as described previously [41]. The nickel-column purified proteins (>95% pure) were
finally dialysed against buffer B (20 mM Tris-HCl pH 8.0, 1 mM EDTA, 500 mM NaCl, 10
mM MgCl2, 7mM β-mercaptoethanol, 50% v/v glycerol) and stored in aliquots at -80˚C. Brad-
ford assay and OD280 determination were used to determine the protein concentrations. The
SDS PAGE presented in S6 Fig visualizes the purification step of RelLS20. The same methodol-
ogy was used to purify the N-terminal domain of RelLS20 and the Y26F mutant protein. Appar-
ently, the Tyr26F mutation affects the stability of the protein. This conclusion is based on our
findings that the yield of purified mutant protein was much lower than that obtained for full-
length RelLS20 or its N-terminal domain. Consequently, the level of purity obtained for the
mutant protein was lower than that for the full-length RelLS20 or N-RelLS20.
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 17 / 23
Determination of the RelLS20-mediated nick site in oriTLS20
RelLS20 nicking activity was assayed in 10 μL reaction volumes containing 2 nM supercoiled
plasmid pGR16B and the indicated protein concentration in 20 mM Tris/HCl pH 7.5, 5 mM
MgCl2, 5 mM NaCl, 0.1 mM EDTA and 50 μg/mL BSA. The reaction mixture was incubated
for 15 minutes at 37˚C in a water bath. To stop the reaction, 25 μg/mL Proteinase K, 0.5% SDS
and 25mM EDTA were added followed by incubation at 37˚C for 20 minutes. Then, 6X DNA
loading buffer was added to a final concentration of 1X, and all the volume was loaded in a
0.8% agarose gel, prestained with Sybr-safe. Gels were run at 20V overnight at 4˚C and then
photographed. In addition, nicking reactions were carried out in duplicates, and after the reac-
tions were stopped the DNA was purified using a commercial kit (Wizard, Promega) after
which the purified plasmid DNAs were employed in Sanger sequencing reactions using primer
oGR28 or oGR29.
In silico analyses
Prediction of static bents in pLS20cat DNA. The entire pLS20cat sequence (accession
number NC_015148.1) was analyzed for the presence of intrinsic bents according to the dinu-
cleotide wedge model using parameters “AAWedge” and “Cacchione & De Santis” [46] imple-
mented in the program dnacurve.py (http://www.lfd.uci.edu/~gohlke/code/dnacurve.py.html).To allow analysis of the entire plasmid the default sequence length limit of the program was
raised. Two set of parameters were tested (“AAWedge” and “Cacchione & De Santis”). For
both, the curvature, and the curvature-angle were calculated for each position using window
sizes of 10 and 15 bp, respectively. These values were smoothed using a sliding window of 600
bp and normalized to mean 0 and standard deviation 1.
Identification of MobL members. RelLS20 was used as a query sequence to execute a psi-
blast search against the NCBI nr protein database (version 2.2.30+, January 17, 2015), allowing
up to 10 rounds of reiteration with and E-value threshold of 1e-15 [47–49]. This search
resulted in the detection of 962 sequences. The program “USEARCH” (version
8.0.1517_i86linux32) was then used to identify and remove redundant sequences showing
100% identity [50], resulting in 817 unique hits showing high similarity to RelLS20.
MEME and MAST. The motif-identification program MEME (Bailey and Elkan, 1994)
(http://meme-suite.org/tools/meme, version 4.10.0) was used to identify conserved motifs,
which we named “signatures” to distinguish them from “motifs” published before, in members
of the MOBL and previously described MOB families [24]. We noted that more specific motifs
were obtained when reducing the redundancy between the input sequences. The function
“cluster_fast” of the program USEARCH [50] was used to remove similar sequences, keeping
only the centroids of clusters of sequences that shared more than 50% of identity. MEME was
run against the remaining proteins, setting the search to retrieve up to a maximum of ten
motifs for each family with an E-value below 1e-15. The program MAST [51] of the MEME
suite was used to study if an identified motif of a given MOB family was present in member(s)
of any of the other MOB families using an E-value threshold of 1e-6.
Neighbor-joining analysis. The program “MEGA6” [52] was employed for constructing
a condensed phylogenetic tree by reducing the length of interior branches with low statistical
support to 0. Protein sequences longer than 300 residues were included in the analysis. The
phylogeny was built by neighbor-joining and tested by 1000 bootstraps.
Crossing MOBL members against constructed plasmid database. Plasmids deposited in
the NCBI nr database were retrieved by screening the annotations for the keywords “plasmid”
and “circular DNA”. The 10,904 plasmids retrieved at 27 May 2015 were used to build a blast
database. Next, each MOBL member was run against the constructed plasmid database using
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 18 / 23
tblastn. A MOBL member was considered to be located on a plasmid if an identity of more
than 95% was identified over more than 80% of the entire protein sequence.
Supporting information
S1 Fig. Identification of ten signatures in MOBL type relaxases and distribution of the
identified signatures in the primary sequence of RelLS20. (A) Weblogo representation of the
ten signatures identified in the MOBL type relaxases using the motif-identification program
MEME. Name of the signatures (sig) is given on the left. The position of the signature in the
primary RelLS20 sequence is presented at the right. (B) Position of ten signatures identified by
MEME for the MOBL type relaxases in the RelLS20 primary sequence. The primary sequence of
the RelLS20 protein is presented along with the position of eight MOBL signatures identified by
MEME. The colour code used for the signatures is consistent with that used for the Weblogo
presentations given in (A). The RelLS20 region showing similarity with part of the motifs III of
the MOBP MOBQ and MOBV type relaxases (see Fig 1) is highlighted in red. Note that MOBL
signatures III and IX are not present in RelLS20.
(TIF)
S2 Fig. Weblogo presentation of up to ten signatures identified for relaxases belonging to
the MOBP (A), MOBQ (B), MOBV (C), MOBF (D), MOBH (E), and MOBC (F) family using
the motif-identification program MEME. Names of the signatures (sig) is given on the left.
(PDF)
S3 Fig. Phylogenetic tree of MOBL family of relaxases. The phylogenetic tree was con-
structed using the program “MEGA6” (see Materials and methods) and phylogeny was built
by neighbor-joining and tested by 1000 bootstraps. Members of the resulting two clades are
given in red (clade 1) and blue (clade 2). RelLS20, belonging to clade 1, is highlighted in green.
(PDF)
S4 Fig. In silico analyses predicts a static bent in the pLS20cat region near oriTLS20. The
entire pLS20cat sequence (accession number NC_015148.1) was analyzed for the presence of
intrinsic bents according to the dinucleotide wedge model [46, see Materials and methods].
The predicted probability of sequences to form a static bent is presented as a function of the
pLS20cat sequence. Maximum values peak around pLS20cat positions 2,200 and 27,500. The
latter position, -27,500-, coincides with oriTLS20. Positions around 2,200 correspond to the
region containing the divergently oriented promoters Prco and Pc driving expression of regula-
tory gene rcoLS20 and the conjugation operon, respectively, which has been demonstrated to
contain a static bent [18].
(TIF)
S5 Fig. Biochemical characterization of RelLS20. (A) The efficiency of the RelLS20 nicking
reaction is concentration dependent. Plasmid pGR16B (2 nM) was incubated without (-) or
with increasing amounts of RelLS20 (4.25, 12.5, 25 and 50 nM, respectively). (B) A supercoiled
plasmid lacking oriTLS20 is not relaxed by RelLS20. The empty pUCTA2501 oriT-screening vec-
tor (2 nM), was incubated without (-) or with (+, 25 nM). (C) RelLS20-mediated plasmid relaxa-
tion is independent of oriTLS20 orientation. Plasmids pGRA16A and pGRA16B (2 nM), which
differ only in the orientation of oriTLS20, were treated without (-) or with (+, 25 nM) RelLS20.
(D) RelLS20-mediated nicking is inhibited by the chelating agent EDTA. Plasmid pGR16B (2
nM) was treated with RelLS20 (25 nM) in the absence (-) or presence (+) of 10 mM EDTA. (E)
The nicking activity of RelLS20 resides in its N-terminal domain. Plasmid pGR16B (2 nM) was
incubated without (-) or with increasing concentrations of N-RelLS20 (12.5, 25, 50, 100, 200
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 19 / 23
and 400 nM, respectively). In the last lane, labelled “C”, the empty vector pUCTA2501 was
incubated with 200 nM N-RelLS20. (F) RelLS20 residue Tyr26 is important for nicking activity.
Plasmid pGR16B (2 nM) was incubated without (-) or with RelLS20Y26F (50 nM). After incu-
bation, the samples were treated with proteinase K and the DNAs were separated on 0.8% aga-
rose gels. The positions of supercoiled (sc) and nicked open circular DNA (oc) are indicated.
(TIF)
S6 Fig. Purification of RelLS20. A Coomassie Brilliant Blue stained 12% SDS PAA gel reflect-
ing RelLS20 nickel column purification steps. RelLS20 was overexpressed in the E. coli strain
AZ43 which corresponds to strain BL21(DE3) harboring pET28b+ derivative pAND84 (see S4
Table). Aliquots of different steps were loaded. Control lane “C”, 4 μl prestained marker pro-
teins (ThermoFisher PageRuler Plus (10–250 kDa); increasing molecular weights of approxi-
mately 10, 15, 25, 35, 55, 70, 100, 130 and 250 kDa, respectively). Lane 1, supernatant fraction
of induced AZ43 cells (5 μg total protein loaded). Lane 2, flow-through of the centrifuged total
lysate adjusted to 20 mM imidazole after passing the Nickel column (5 μg total protein loaded).
Lane 3, washing step (fraction 7, corresponding to ~40 mM imidazole (6 μl)). Lane 4, fraction
40, corresponding to ~100 mM imidazole (16 μl). Lane 5, pool of eluted fractions (lateral 200
mM imidazole elution peaks corresponding to fractions 12–15 and 19–40, 5 μg loaded). Lane
6, pool of eluted fractions (central 200 mM imidazole elution peak with highest concentrations
of RelLS20 corresponding to fractions 16–18, 5 μg loaded). Each fraction corresponded to 1 ml.
The N-terminal domain of RelLS20 and the Y26F mutant were purified using the same method-
ology.
(TIF)
S1 Table. Proteins in the NCBI database showing significant similarity with RelLS20.
(PDF)
S2 Table. Occurrence (%) of MOBL signatures in members of other MOB families (%).
(DOCX)
S3 Table. Occurrence (%) of the signatures of the MOBQ, MOBV, MOBP, MOBF, MOBC
and MOBH in members of the MOBL family.
(DOCX)
S4 Table. Strains used.
(DOCX)
S5 Table. Plasmids used.
(DOCX)
S6 Table. Oligonucleotides used.
(DOCX)
Acknowledgments
We thank Jose Belio for help with preparing the Figures, and Margarita Salas and Jeff Erring-
ton for their support on our work. We also want to acknowledge helpful discussion with other
lab members.
Author contributions
Conceived and designed the experiments: DA JRLO CA LJW DRB WJJM.
Performed the experiments: GR AMA PKS IC CGC JAH JRLO WJJM.
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 20 / 23
Analyzed the data: GR AMA DA PKS IC JAH JRLO CA LJW DRB WJJM.
Contributed reagents/materials/analysis tools: DA JRLO CA LJW DRB WJJM.
Wrote the paper: DA LJW DRB WJJM.
References1. Mazel D, Davies J (1999) Antibiotic resistance in microbes. Cell Mol Life Sci 56: 742–754. PMID:
11212334
2. Waters VL (1999) Conjugative transfer in the dissemination of beta-lactam and aminoglycoside resis-
tance. Front Biosci 4: D433–D456. PMID: 10228095
3. Norman A, Hansen LH, Sorensen SJ (2009) Conjugative plasmids: vessels of the communal gene pool.
Philos Trans R Soc Lond B Biol Sci 364: 2275–2289. 364/1527/2275 [pii]. doi: 10.1098/rstb.2009.0037
PMID: 19571247
4. Davies J, Davies D (2010) Origins and evolution of antibiotic resistance. Microbiol Mol Biol Rev 74:
417–433. 74/3/417 [pii]. doi: 10.1128/MMBR.00016-10 PMID: 20805405
5. Sommer MO, Dantas G, Church GM (2009) Functional characterization of the antibiotic resistance res-
ervoir in the human microflora. Science 325: 1128–1131. 325/5944/1128 [pii]. doi: 10.1126/science.
1176950 PMID: 19713526
6. Sommer MO, Church GM, Dantas G (2010) The human microbiome harbors a diverse reservoir of anti-
biotic resistance genes. Virulence 1: 299–303. 12010 [pii]. doi: 10.4161/viru.1.4.12010 PMID:
21178459
7. Forsberg KJ, Reyes A, Wang B, Selleck EM, Sommer MO, Dantas G (2012) The shared antibiotic resis-
tome of soil bacteria and human pathogens. Science 337: 1107–1111. 337/6098/1107 [pii]. doi: 10.
1126/science.1220761 PMID: 22936781
8. Penders J, Stobberingh EE, Savelkoul PH, Wolffs PF (2013) The human microbiome as a reservoir of
antimicrobial resistance. Front Microbiol 4: 87. doi: 10.3389/fmicb.2013.00087 PMID: 23616784
9. van SW (2015) The human gut resistome. Philos Trans R Soc Lond B Biol Sci 370: 20140087.
rstb.2014.0087 [pii]. doi: 10.1098/rstb.2014.0087 PMID: 25918444
10. Sankar SA, Lagier JC, Pontarotti P, Raoult D, Fournier PE (2015) The human gut microbiome, a taxo-
nomic conundrum. Syst Appl Microbiol 38: 276–286. S0723-2020(15)00045-4 [pii]. doi: 10.1016/j.
syapm.2015.03.004 PMID: 25864640
11. Devirgiliis C, Zinno P, Perozzi G (2013) Update on antibiotic resistance in foodborne Lactobacillus and
Lactococcus species. Front Microbiol 4: 301. doi: 10.3389/fmicb.2013.00301 PMID: 24115946
12. Tanaka T, Kuroda M, Sakaguchi K (1977) Isolation and characterization of four plasmids from Bacillus
subtilis. J Bacteriol 129: 1487–1494. PMID: 403179
13. Koehler TM, Thorne CB (1987) Bacillus subtilis (natto) plasmid pLS20 mediates interspecies plasmid
transfer. J Bacteriol 169: 5271–5278. PMID: 3117774
14. Itaya M, Sakaya N, Matsunaga S, Fujita K, Kaneko S (2006) Conjugational transfer kinetics of pLS20
between Bacillus subtilis in liquid medium. Biosci Biotechnol Biochem 70: 740–742. JST.JSTAGE/bbb/
70.740 [pii]. PMID: 16556997
15. Meijer WJJ, Wisman GBA, Terpstra P, Thorsted PB, Thomas CM, Holsappel S, Venema G, Bron S
(1998) Rolling-circle plasmids from Bacillus subtilis: complete nucleotide sequences and analyses of
genes of pTA1015, pTA1040, pTA1050 and pTA1060, and comparisons with related plasmids from
Gram-positive bacteria. FEMS Microbiol Rev 21: 337–368. PMID: 9532747
16. Meijer WJJ, de Boer A, van Tongeren S, Venema G, Bron S (1995) Characterization of the replication
region of the Bacillus subtilis plasmid pLS20: a novel type of replicon. Nucleic Acids Res 23: 3214–
3223. PMID: 7667098
17. Singh PK, Ramachandran G, Ramos-Ruiz R, Peiro-Pastor R, Abia D, Wu LJ, Meijer WJ (2013) Mobility
of the Native Bacillus subtilis Conjugative Plasmid pLS20 Is Regulated by Intercellular Signaling. PLoS
Genet 9: e1003892. PGENETICS-D-13-01403 [pii]. doi: 10.1371/journal.pgen.1003892 PMID:
24204305
18. Ramachandran G, Singh PK, Luque-Ortega JR, Yuste L, Alfonso C, Rojo F, Wu LJ, Meijer WJ (2014) A
Complex Genetic Switch Involving Overlapping Divergent Promoters and DNA Looping Regulates
Expression of Conjugation Genes of a Gram-positive Plasmid. PLoS Genet 10: e1004733. PGE-
NETICS-D-14-01424 [pii]. doi: 10.1371/journal.pgen.1004733 PMID: 25340403
19. Smillie C, Garcillan-Barcia MP, Francia MV, Rocha EPC, De la Cruz F (2010) Mobility of plasmids.
Microbiol Mol Biol Rev 74: 434–452. doi: 10.1128/MMBR.00020-10 PMID: 20805406
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 21 / 23
20. Alvarez-Martinez CE, Christie PJ (2009) Biological diversity of prokaryotic type IV secretion systems.
Microbiol Mol Biol Rev 73: 775–808. doi: 10.1128/MMBR.00023-09 PMID: 19946141
21. De la Cruz F, Frost LS, Meyer RJ, Zechner EL (2010) Conjugative DNA metabolism in Gram-negative
bacteria. FEMS Microbiol Rev 34: 18–40. FMR195 [pii]. doi: 10.1111/j.1574-6976.2009.00195.x PMID:
19919603
22. Lujan SA, Guogas LM, Ragonese H, Matson SW, Redinbo MR (2007) Disrupting antibiotic resistance
propagation by inhibiting the conjugative DNA relaxase. Proc Natl Acad Sci U S A 104: 12282–12287.
0702760104 [pii]. doi: 10.1073/pnas.0702760104 PMID: 17630285
23. Francia MV, Varsaki A, Garcillan-Barcia MP, Latorre A, Drainas C, De la Cruz F (2004) A classification
scheme for mobilization regions of bacterial plasmids. FEMS Microbiol Rev 28: 79–100.
S0168644503000780 [pii]. doi: 10.1016/j.femsre.2003.09.001 PMID: 14975531
24. Garcillan-Barcia MP, Francia MV, De la Cruz F (2009) The diversity of conjugative relaxases and its
application in plasmid classification. FEMS Microbiol Rev 33: 657–687. PMID: 19396961
25. Marchler-Bauer A, Derbyshire MK, Gonzales NR, Lu S, Chitsaz F, Geer LY, Geer RC, He J, Gwadz M,
Hurwitz DI, Lanczycki CJ, Lu F, Marchler GH, Song JS, Thanki N, Wang Z, Yamashita RA, Zhang D,
Zheng C, Bryant SH (2015) CDD: NCBI’s conserved domain database. Nucleic Acids Res 43: D222–
D226. gku1221 [pii]. doi: 10.1093/nar/gku1221 PMID: 25414356
26. Chandler M, De la Cruz F, Dyda F, Hickman AB, Moncalian G, Ton-Hoang B (2013) Breaking and join-
ing single-stranded DNA: the HUH endonuclease superfamily. Nat Rev Microbiol 11: 525–538. nrmi-
cro3067 [pii]. doi: 10.1038/nrmicro3067 PMID: 23832240
27. Bailey TL, Elkan C (1994) Fitting a mixture model by expectation maximization to discover motifs in bio-
polymers. Proc Int Conf Intell Syst Mol Biol 2: 28–36. PMID: 7584402
28. Aguado-Urda M, Gibello A, Blanco MM, Lopez-Campos GH, Cutuli MT, Fernandez-Garayzabal JF
(2012) Characterization of plasmids in a human clinical strain of Lactococcus garvieae. PLoS One 7:
e40119. PONE-D-12-03972 [pii]. doi: 10.1371/journal.pone.0040119 PMID: 22768237
29. Chan GF, Gan HM, Rashid NA (2012) Genome sequence of Enterococcus sp. strain C1, an azo dye
decolorizer. J Bacteriol 194: 5716–5717. 194/20/5716 [pii]. doi: 10.1128/JB.01372-12 PMID: 23012290
30. Tomita H, Ike Y (2005) Genetic analysis of transfer-related regions of the vancomycin resistance
Enterococcus conjugative plasmid pHTbeta: identification of oriT and a putative relaxase gene. J Bac-
teriol 187: 7727–7737. 187/22/7727 [pii]. doi: 10.1128/JB.187.22.7727-7737.2005 PMID: 16267297
31. Fogg JM, Randall GL, Pettitt BM, Sumners de WL, Harris SA, Zechiedrich L (2012) Bullied no more:
when and how DNA shoves proteins around. Q Rev Biophys 45: 257–299. S0033583512000054 [pii].
doi: 10.1017/S0033583512000054 PMID: 22850561
32. Gimenes F, Takeda KI, Fiorini A, Gouveia FS, Fernandez MA (2008) Intrinsically bent DNA in replica-
tion origins and gene promoters. Genet Mol Res 7: 549–558. gmr461 [pii]. PMID: 18752180
33. Koo HS, Wu HM, Crothers DM (1986) DNA bending at adenine. thymine tracts. Nature 320: 501–506.
doi: 10.1038/320501a0 PMID: 3960133
34. Lorenzo-Diaz F, Espinosa M (2009) Lagging-strand DNA replication origins are required for conjugal
transfer of the promiscuous plasmid pMV158. J Bacteriol 191: 720–727. JB.01257-08 [pii]. doi: 10.
1128/JB.01257-08 PMID: 19028894
35. Ruiz-Barba JL, Floriano B, Maldonado-Barragan A, Jimenez-Diaz R (2007) Molecular analysis of the
21-kb bacteriocin-encoding plasmid pEF1 from Enterococcus faecium 6T1a. Plasmid 57: 175–181.
S0147-619X(06)00062-X [pii]. doi: 10.1016/j.plasmid.2006.06.003 PMID: 16893567
36. Schroeder BO, Backhed F (2016) Signals from the gut microbiota to distant organs in physiology and
disease. Nat Med 22: 1079–1089. nm.4185 [pii]. doi: 10.1038/nm.4185 PMID: 27711063
37. Jakobsson HE, Jernberg C, Andersson AF, Sjolund-Karlsson M, Jansson JK, Engstrand L (2010)
Short-term antibiotic treatment has differing long-term impacts on the human throat and gut micro-
biome. PLoS One 5: e9836. doi: 10.1371/journal.pone.0009836 PMID: 20352091
38. Nash RP, McNamara DE, Ballentine WK, III, Matson SW, Redinbo MR (2012) Investigating the impact
of bisphosphonates and structurally related compounds on bacteria containing conjugative plasmids.
Biochem Biophys Res Commun 424: 697–703. S0006-291X(12)01295-8 [pii]. doi: 10.1016/j.bbrc.
2012.07.012 PMID: 22796221
39. Sambrook J., Fritsch E. F., and Maniatis T. (1989) Molecular cloning: a laboratory manual. Cold Spring
Harbor, New York: Cold Spring Harbor Laboratory Press.
40. Bron S, Peijnenburg A, Peeters BPH, Haima P, Venema G (1989) Cloning and plasmid (in)stability in
Bacillus subtilis. In: Butler LO, Harwood CR, Moseley BEB, editors. Genetic transformation and expres-
sion. Andover,UK: Intercept Ltd. pp. 205–219.
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 22 / 23
41. Singh PK, Ramachandran G, Duran-Alcalde L, Alonso C, Wu LJ, Meijer WJ (2012) Inhibition of Bacillus
subtilis natural competence by a native, conjugative plasmid-encoded comK repressor protein. Environ
Microbiol 14: 2812–2825. doi: 10.1111/j.1462-2920.2012.02819.x PMID: 22779408
42. Schuck P (2000) Size-distribution analysis of macromolecules by sedimentation velocity ultracentrifuga-
tion and lamm equation modeling. Biophys J 78: 1606–1609. doi: 10.1016/S0006-3495(00)76713-0
PMID: 10692345
43. Laue TM, Shah BD, Ridgeway TM, Pelletier SL (1992) Interpretation of analytical sedimentation data
for proteins. In: Harding SE, Rowe AJ, Horton JC, editors. Analytical ultracentrifugation in biochemistry
and polymer science. Cambridge, UK: Royal Society of Chemistry. pp. 90–125.
44. Cole JL (2004) Analysis of heterogeneous interactions. Methods Enzymol 384: 212–232. doi: 10.1016/
S0076-6879(04)84013-8 PMID: 15081689
45. Svedberg T. and Pedersen K. O. (1940) The Ultracentrifuge. Oxford: Clarendon Press.
46. Goodsell DS, Dickerson RE (1994) Bending and curvature calculations in B-DNA. Nucleic Acids Res
22: 5497–5503. PMID: 7816643
47. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST
and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25: 3389–
3402. gka562 [pii]. PMID: 9254694
48. Schaffer AA, Aravind L, Madden TL, Shavirin S, Spouge JL, Wolf YI, Koonin EV, Altschul SF (2001)
Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics
and other refinements. Nucleic Acids Res 29: 2994–3005. PMID: 11452024
49. Altschul SF, Wootton JC, Gertz EM, Agarwala R, Morgulis A, Schaffer AA, Yu YK (2005) Protein data-
base searches using compositionally adjusted substitution matrices. FEBS J 272: 5101–5109.
EJB4945 [pii]. doi: 10.1111/j.1742-4658.2005.04945.x PMID: 16218944
50. Edgar RC (2010) Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26:
2460–2461. btq461 [pii]. doi: 10.1093/bioinformatics/btq461 PMID: 20709691
51. Bailey TL, Gribskov M (1998) Combining evidence using p-values: application to sequence homology
searches. Bioinformatics 14: 48–54. btb036 [pii]. PMID: 9520501
52. Tamura K, Stecher G, Peterson D, Filipski A, Kumar S (2013) MEGA6: Molecular Evolutionary Genetics
Analysis version 6.0. Mol Biol Evol 30: 2725–2729. mst197 [pii]. doi: 10.1093/molbev/mst197 PMID:
24132122
New family of relaxases in Firmicutes
PLOS Genetics | DOI:10.1371/journal.pgen.1006586 February 16, 2017 23 / 23