Computational analysis of human N-acetylgalactosamine-6-sulfate sulfatase enzyme: an update in...
-
Upload
luis-alejandro -
Category
Documents
-
view
217 -
download
0
Transcript of Computational analysis of human N-acetylgalactosamine-6-sulfate sulfatase enzyme: an update in...
Computational analysis of human N-acetylgalactosamine-6-sulfatesulfatase enzyme: an update in genotype–phenotype correlationfor Morquio A
Sergio Olarte-Avellaneda • Alexander Rodrıguez-Lopez •
Carlos Javier Almeciga-Dıaz • Luis Alejandro Barrera
Received: 3 March 2014 / Accepted: 21 April 2014 / Published online: 7 October 2014
� Springer Science+Business Media Dordrecht 2014
Abstract Mucopolysaccharidosis IV A (MPS IV A) is a
lysosomal storage disease produced by the deficiency of N-
acetylgalactosamine-6-sulfate sulfatase (GALNS) enzyme.
Although genotype–phenotype correlations have been
reported, these approaches have not enabled to establish a
complete genotype–phenotype correlation, and they have
not considered a ligand–enzyme interaction. In this study,
we expanded the in silico evaluation of GALNS mutations
by using several bioinformatics tools. Tertiary GALNS
structure was modeled and used for molecular docking
against galactose-6-sulfate, N-acetylgalactosamine-6-sul-
fate, keratan sulfate, chondroitin-6-sulfate, and the artificial
substrate 4-methylumbelliferyl-b-D-galactopyranoside-6-
sulfate. Furthermore, we considered the evolutionary resi-
due conservation, change conservativeness, position within
GALNS structure, and the impact of amino acid substitu-
tion on the structure and function of GALNS. Molecular
docking showed that amino acids involved in ligand
interaction correlated with those observed in other human
sulfatases, and mutations within the active cavity reduced
affinity of all evaluated ligands. Combination of several
bioinformatics approaches allowed to explaine 90 % of the
missense mutations affecting GALNS, and the prediction
of the phenotype for another 21 missense mutations. In
summary, we have shown for the first time a docking
evaluation of natural and artificial ligands for human
GALNS, and proposed an update in genotype–phenotype
correlation for Morquio A, based on the use of multiple
parameters to predict the disease severity.
Keywords MPS IV A � GALNS � Keratan sulfate �Chondroitin-6-sulfate � Molecular modeling �Computational molecular docking
Introduction
N-Acetylgalactosamine-6-sulfate sulfatase (GALNS, EC
3.1.6.4, UniProt P34059) is a lysosomal enzyme involved
in the degradation of glycosaminoglycans (GAG) keratan
sulfate (KS) and chondroitin-6-sulfate (C6S) [41]. Muta-
tions in GALNS gene lead to Mucopolysaccharidosis IV A
(MPS IV A, Morquio A disease, OMIM 253000), in which
both KS and C6S are accumulated within the lysosome
[41]. MPS IV A is a lysosomal storage disease (LSD) with
an estimated incidence of 1:200.000, and characterized by
systemic skeletal dysplasia, laxity of joints, hearing loss,
corneal clouding, valvular heart disease, and pulmonary
dysfunction [20, 41]. Currently, there are not corrective
therapies for MPS IV A patients, and only supportive
measures and surgical interventions are used to treat some
disease manifestations [20, 41]. However, enzyme
replacement therapy [6, 9, 21, 27, 39, 40] and gene therapy
are under evaluation [2, 3, 14].
The full-length GALNS cDNA has been cloned and
encodes a 522 amino acids protein including a signal
S. Olarte-Avellaneda
Clinical Bacteriology Program, School of Health Sciences,
Universidad Colegio Mayor de Cundinamarca, Bogota, D.C.,
Colombia
S. Olarte-Avellaneda � A. Rodrıguez-Lopez (&) �C. J. Almeciga-Dıaz (&) � L. A. Barrera
Proteins Expression and Purification Laboratory, Institute for the
Study of Inborn Errors of Metabolism, School of Sciences,
Pontificia Universidad Javeriana, Kra 7 No. 43-82 Building 53,
Room 303A, Bogota, D.C., Colombia
e-mail: [email protected]
C. J. Almeciga-Dıaz
e-mail: [email protected]
123
Mol Biol Rep (2014) 41:7073–7088
DOI 10.1007/s11033-014-3383-3
peptide of 26 residues [36]. Over 170 mutations have been
described in GALNS gene, including missense/nonsense
mutations (75 %), splicing mutations (8 %), small dele-
tions (10 %), small insertions (1.7 %), small indel (\1 %),
gross deletions (1.7 %), gross insertions/duplications
(1 %), and complex rearrangements (1 %) [32]. GALNS is
a homodimer of 120 kDa, with monomers of 60 kDa
formed by 40 and 15 kDa polypeptides bound by a disul-
fide bond [18], containing two N-glysosylations [26]. As
other lysosomal enzymes, GALNS has a signal peptide that
leads the nascent protein to the endoplasmic reticulum and
Golgi apparatus system, in which the protein is modified
through N-glycosylation and proteolytic processing [15].
The most important posttranslational GALNS modifica-
tion, common to other sulfatases, is the active-site activa-
tion through the conversion of cysteine 79 to formylglycine
(FGly), in a process mediated by the formylglycine-gen-
erating enzyme [5, 38]. The tertiary structure model of
GALNS showed that it has two domains: a large N-ter-
minal domain with a a/b topology with a 10-stranded b-
sheet and six a-helices, and a small C-terminal domain
comprising a 4-stranded antiparallel b-sheet perpendicular
to a long a-helix [26, 34]. The catalytic residue (FGly) lies
at the bottom of a cavity containing a metal ion (Ca2?) and
is flanked by positively charged amino acids, as observed
in other sulfatases [12, 26, 34].
Despite the high heterogeneity of Morquio A patients,
some genotype–phenotype correlations have been estab-
lished. So far, four mutations groups have been identified:
(i) mutations causing destruction of the hydrophobic core
or modification of the packing, (ii) removal of a salt bridge
to destabilize the entire conformation, (iii) modification of
the active site geometry, and (iv) location on surface of the
protein to alter lysosomal targeting, hydrogen bounds, or
N-glycosylation sites [34]. Mutations affecting the hydro-
phobic core, salt bridges, or active cavity (activation or
folding) were associated with severe phenotypes; while
mutations located on protein surface or distant from the
active cavity, as well as those affecting the N-glycosyla-
tions, were associated with attenuated phenotypes [19, 34,
38]. A phenotype–genotype correlation approach, based on
the evolutionary conservation of amino acids, conserva-
tiveness of amino acids changes (according to amino acids
properties), and structural conservation of amino acid res-
idues, was also reported [37]. The results showed that
missense mutations are mainly located in highly conserved
residues and can lead to severe phenotypes if they are the
result of a non-conservative amino acid exchange. A
structural and functional analysis of missense mutations in
GALNS enzyme by using several bioinformatics tools
showed that mutations could produce structural alterations,
changes in ligand affinity (N-acetyl-galactosamine-6-sul-
fate), and the presence of a positive correlation between
mutations in hydrophobic cores and severe phenotype [33].
Finally, mapping of missense mutations in a X-ray crystal
structure revealed that most of the mutations affect the
hydrophobic core of the structure [26]. Recently, correla-
tions between genotype, phenotype, and blood and urine
KS levels were proposed, which allowed to predict the
clinical severity more precisely [8].
To increase the knowledge in the genotype–phenotype
correlation for MPS IV A and to understand the GALNS–
ligand interactions, in this study, we used several bioin-
formatics tools to expand the in silico evaluation of
GALNS and the effect of mutations on structure and pro-
tein–ligand interactions. A docking study with natural and
artificial ligands was carried out for the wildtype and
mutated GALNS models. The results suggest the interac-
tion of the evaluated ligands with the residues Asp39,
Asp40, FGly79, Arg83, Tyr108, His142, His236, Asp288,
Asn289, and Lys310. Combination of several bioinfor-
matics approaches allowed explaining 90 % of missense
mutations, and the prediction of the phenotype for 21
missense mutations.
Methods
Protein structure modeling
GALNS protein was retrieved from Uniprot (Entry No.
P34059), and signal peptide and potential N-glycosylation
sites were predicted using SignalP 4.0 and NetNGlyc 1.0,
respectively, available at Expasy server. GALNS tertiary
structure without signal peptide was modeled by using
I-TASSER [44], selecting the tertiary structure of human
arylsulfatase A (ASA, PDB 1AUK) as a template for protein
threading. Protein structure was validated using PDBsum
[17]. Calcium ion was added to the tertiary structure using
YASARA View v11.4.18 (YASARA Biosciences GmbH,
Vienna, Austria), constrained to Asp39, Asp40, Asp288, and
Asn289, as previously reported for ASA [30]. Amino acids
interacting with Ca?2 were predicted using Protein Structure
Analysis Package (PSAP) [4]. Active site amino acids and
volume of cavity were predicted using Computed Atlas of
Surface Topography of Proteins (CASTp) [7]. Structures
were visualized with YASARA View v11.4.18 and UCSF
Chimera v1.6.2 [23], while Swiss-PdbViewer v4.1 [13] was
used for energy minimization and to calculate the root-mean-
square deviation (RMSD).
MPS IV A mutations and phenotypes
Missense mutations affecting GALNS gene were retrieved
from Human Gene Mutation Database (HGMD) [32], as
well as from previous reports [16, 26, 37, 38], where
7074 Mol Biol Rep (2014) 41:7073–7088
123
phenotype was defined for each mutation. We did not
consider deletions and nonsense and splice-site mutations,
since they have been previously associated with severe
phenotypes due to the synthesis of a truncated enzyme or
changes in the open reading frame [38].
Molecular docking
Molecular docking of the modeled GALNS structure and
the natural and artificial ligands was done using Molegro
Virtual Docker v5.5 (MVD, CLC bio, Aarhus N, Den-
mark). The search space was centered between Cys79 and
calcium ion, and radius of 7, 9, 12 and 15 A were evalu-
ated. Structures of galactose-6-sulfate (G6S) and keratan
sulfate (KS, 1KES) were retrieved from the RCSB-Protein
Data Bank, while structures of N-acetylgalactosamine-6-
sulfate (6S-GalNAc, CID 193456) and chondroitin-6-sul-
fate (C6S, CID 24766) were retrieved from PubChem.
Structure of the artificial ligand 4-methylumbelliferyl-b-D-
galactopyranoside-6-sulfate (4MUGPS), which is used for
GALNS enzyme activity assay [42], was built using Mar-
vinSketch at Marvin Suite (ChemAxon Ltd., Budapest,
Hungary). Docking for each ligand was run 20 times and
constrained to the active cavity. To evaluate the effect of
mutations on GALNS-ligand affinity fifteen mutations
affecting amino acids present within the active cavity were
studied. Mutations were manually included into GALNS
model by using Swiss-PdbViewer v4.1, and interactions of
mutated models with ligands were evaluated as described
above for the natural and artificial substrates.
Molecular dynamics
Molecular dynamics analysis were done for the docking
results obtained for wild-type GALNS against G6S and 6S-
GalNAc. Simulations were developed using GROMACS
4.5.5 [24], and topologies of ligands were generated using
PRODRG2 [31]. Simulations were carried out for 20 ns,
and trajectories were analyzed by RMSD and root mean
square fluctuation (RMSF) of the ligand. All simulations
were done at the High Performance Computing Center—
ZINE—of Pontificia Universidad Javeriana.
Genotype–phenotype correlation
To evaluate the genotype–phenotype correlation of muta-
tions outside the active cavity, the sequence of the human
GALNS enzyme was separately aligned with GALNS
sequences from other species (Macaca mulata AFE69926.1,
Bos taurus NP_001193258.1, Rattus norvergicus NP_00104
1316.1, Canis lupus familiaris NP_001041585.1, Mus mus-
culus AAH04002.1, Mustela putorius furo AER98880.1,
Cricetulus griseus EGV94073.1, Heterocephalus glaber
EHB15848.1, Gallus gallus XP_414208.1, Loxodonta afri-
cana XP_003418184.1, Meleagris gallopavo XP_003209
945.1, Xenopus (Silurana) tropicalis XP_002933701.1,
Saccoglossus kowalevskii XP_002736267.1, Oreochromis
niloticus XP_003445750.1, Amphimedon queenslandica
XP_003383435.1, Sus scrofa NP_999120.1, Cavia porcellus
XP_003460959.1, Equus caballus XP_001488119.1, Anolis
carolinensis XP_003229531.1), or 17 human sulfatases
(available at http://sulfabase.tigem.it/Hs.html). Sequences
were aligned using Muscle [10], and the phylogenetic tree
was generated by using MEGA5 [35]. For genotype–phe-
notype correlation it was considered evolutionary residue
conservation in GALNS and sulfatases sequences, residue
change conservativeness, residue position within GALNS
structure, Polyphen2 results [1], and values of Atomic
Accessible Surface Area (AASA) as previously reported
[26]. Swiss-PdbViewer v4.1 was used to include the mis-
sense mutations into the GALNS 3D model and for estima-
tion of energy minimization.
Statistical analysis
Differences between groups were tested for statistical sig-
nificance by using Student’s t test. An error level of 5 %
(p \ 0.05) was considered significant. Analyses were per-
formed using SPSS Statistics v18 (IBM Corp, Armonk, NY).
Results and discussion
In this study, we carried out a computational evaluation of
human GALNS missense mutations and their genotype–
phenotype correlation in Morquio A disease. Although
several studies have carried out this task [8, 26, 33, 34, 37],
there are some issues that have not been completely evalu-
ated. For example, the study of ligand affinity by molecular
docking of a modeled wildtype and mutated GALNS did not
consider the presence of calcium ion within the active cavity
[33]. Furthermore, the three dimensional structure of human
GALNS, determined by X-ray crystallography, was only
used for localization of the mutations, and the ligand binding
analysis with N-acetylgalactosamine was not possible since
the sugar was bound in a nonproductive orientation [26].
Thus, it is still necessary to expand the in silico evaluation of
GALNS, specially the ligand–enzyme interaction, in order to
increase the knowledge in the genotype–phenotype corre-
lation for Morquio A.
3D model analysis
The sequence of the signal peptide (26 amino acids) was
removed from GALNS sequence, and the remained amino
acids (522) were used to predict the 3D structure based on
Mol Biol Rep (2014) 41:7073–7088 7075
123
the reported structure of ASA. Although GALNS and ASA
only have a 34 % identity, the phylogenetic study (see
above results) showed that ASA is the closest human sul-
fatase to GALNS. The final model had a C-score of 0.38
and presented the 522 amino acids of the primary structure,
conforming 8 a-helices and 23 b-sheets (Fig. 1a), which
are similar to the results of crystalized GALNS (9 a-helices
and 23 b-sheets) [26]. Ramachandram plot analysis of the
model showed that 97.9 % of amino acids were in the two
most favored regions, which is similar to the results
obtained for ASA (98.7 %) and previously modeled human
GALNS (96.8 %) [33]. G-factors analysis showed an
average score of -0.34, suggesting that dihedral angles and
main-chain covalent forces are not unusual (Values below
-0.5: unusual, and below -1.0: highly unusual).
During the final analysis of this study, the crystal
structure of human GALNS was published [26]. The
modeled human GALNS has a RMSD of 1.2 A (399
atoms) against the crystal structure of human GALNS
(Fig. 1b). The main differences were observed on the
C-terminal end and four loops located at the protein surface
(Fig. 1b), as previously reported for other sulfatases [12].
The modeled GALNS has a RMSD of 0.65 A (451 atoms),
1.24 A (333 atoms) and 1.22 A (391 atoms) against ASA,
Arylsulfatase B (ARSB, PDB 1FSU), and ARSC (PDB
1P49), respectively, while crystallized GALNS has a
RMSD against the same sulfatases of 1.31 A (394 atoms),
1.37 A (315 atoms), and 1.24 (367 atoms) A, respectively.
As it has been observed for all human sulfatases,
GALNS presents the characteristic conversion of Cys to
Formylglycine (FGly) at the active site, mediated by the
formylglycine-generating enzyme [29]. This Cys-to-FGly
modification was not included in the final GALNS 3D
model since docking software used does not recognize the
FGly as an amino acid but it does as a ligand, interfering
with the docking analysis.
The active cavity on model GALNS had an area of
1,152 A2 and a volume of 1,244.6 A3 in which 58 amino
acids were involved. The residues of the primary active site
described for the X-ray GALNS model [26], Asp39,
Asp40, FGly79 (Cys79), Arg83, Tyr108, His142, His236,
Asp288, Asn289 and Lys310 are presented within the
predicted active cavity (Fig. 1c). These residues have been
also identified within the active cavity of ASA, ARSB,
ARSC, and a homologous arylsulfatase from Pseudomonas
aeruginosa [12]. Overall, the spatial arrangement of amino
acids within the active cavity was similar between modeled
and crystallized GALNS (Fig. 1c), showing the high
degree of conservation of the active cavity, as previously
described for other sulfatases [12]. Recently, it was
reported the computational analysis of the active cavity of
GALNS from human and other eight species (i.e. rhesus
macaque, bovine, mouse, rat, dog, chicken, tilapia nilotica,
and pig) [22]. The results showed that 85 % of amino acids
of the active cavity were conserved among the studied
species, and that active cavity of these enzymes has a
Fig. 1 Human GALNS tertiary
structure. a Tertiary structure of
GALNS was modeled using the
tertiary structure of the human
Arylsulfatase A (ASA, PDB
1AUK) as a template for protein
threading. Green sphere
represents calcium ion present
at the bottom of the active
cavity. b Modeled human
GALNS (red) has a RMSD of
1.2 A against the crystal
structure of human GALNS
(blue). Main differences
between both structures are
pointed within gray circules.
Spheres represent calcium ions
present at the cavity. c Amino
acids present at the active cavity
in modeled (red) and crystal
(blue) GALNS. d Active cavity
of modeled human GALNS,
with calcium ion located at the
botton of the cavity. (Color
figure online)
7076 Mol Biol Rep (2014) 41:7073–7088
123
positive partial charge, which correlates with the negative
charge of GALNS substrates [11].
Since presence of calcium ion within the active cavity
has been reported for GALNS [26, 34], and other human
sulfatases [12], this divalent ion was included within
modeled structure, and was located at the bottom of the
active cavity (Fig. 1d). PSAP analysis showed that Ca2?
formed a coordination complex with Asp39, Asp40,
Asp288, and Asp289, which correlates with the amino
acids reported for GALNS [26] and ARSC [12].
In summary, these results show the quality of the pre-
dicted 3D structure of the human GALNS and the high
conservation of the tertiary structure of sulfatases [12, 26],
and that the predicted GALNS model could be used to
study the effect of missense mutations on enzyme function
and structure.
Docking analysis
Enzyme–ligand interaction was evaluated by using struc-
tures of the natural ligands KS and C6S, monomers G6S
and 6S-GalNAc, and the artificial ligand 4MUGPS. For the
evaluated ligands, it was expected that sulfate would be
located at the bottom of the cavity close to the active
residue [30]. Among natural ligands, only C6S was able to
fit within the active cavity, interacting with Ca2?, Leu78,
Cys79, Thr100, Ala102, His103, His142, Tyr181, Lys310,
Asn407, and Trp409, through H-bonds and electrostatic
and steric interactions (Fig. 2; Table 1), with an affinity
energy of -173.337 kJ/mol. Docking of monomers (G6S
and 6S-GalNAc), and the artificial ligand are shown in
Fig. 3 and summarized in Table 1. Monomers showed
lower affinity energy than that of the artificial ligand, while
4MUGPS showed lower affinity for the enzyme than that of
C6S, suggested by a higher energy of affinity of 4MUGPS
than that of C6S (-121.685 vs. -173.337 kJ/mol), as
described by natural and artificial ligands of ASA [30]. For
the evaluated ligands, the sulfate group interacted with
Ca?2, Cys79, Arg83, His142, and Asp288/Asn289, while
Ala102 and Tyr108 or Tyr181 interacted with the carbon
chain, suggesting the importance of these residues in
enzyme–ligand interaction. It was proposed for ARSC that
Arg79, Lys134, His136, His290, and Lys368, which cor-
respond to Arg83, Lys140, His142, His236, and Lys310
within GALNS structure, participate in the catalytic
mechanism of ARSC, and could be extended to the other
sulfatases [12]. In fact, similar residues have been involved
in ligand binding of ASA [30]. Analysis G6S and 6S-
GalNAc interaction with GALNS from rhesus macaque,
bovine, mouse, rat, dog, chicken, tilapia nilotica, and pig,
showed a lower affinity than that observed for human
GALNS [22]. Futhermore, amino acids involved in inter-
action with the sulfate group of G6S and 6S-GalNAc were
highly conserved among studied structures, although in
non-human GALNS new residues interacted with carbon
chain of the ligands [22].
Molecular dynamics analysis of G6S and 6S-GalNAc
showed that ligands RMSD were 1.05 ± 0.17 A and
0.91 ± 0.16 A, respectively (Fig. 4a), suggesting that
ligands position was stable during the evaluated time
(20 ns). Furthermore, RMSF showed that deviation of
atoms for each ligand along the evaluated time was
0.63 ± 0.39 and 0.45 ± 0.27 A for G6S and 6S-GalNAc
(Fig. 4b, c), respectively. Speficically, for G6S the atoms
with the highest deviation (between 1 and 1.2 A) were
oxygens of sulfate group (O2, O3, and O4) as well as
hydrogens H6, H7, and H8 (Fig. 4b). On the other hand, for
6S-GalNAc the highest deviation (*1 A) was observed for
hydrogens H3, H5, and H6 (Fig. 4c). In summary, these
results support the docking results and suggest that this
Fig. 2 Molecular docking for chondroitin-6-sulfate. a 3D represen-
tation of residues interacting with chondroitin-6-sulfate. b 2D repre-
sentation of residues interacting with chondroitin-6-sulfate. Blue lines
hydrogen bonds; green lines electrostatic interactions; red letters
steric interactions. Green sphere represents calcium ion present at the
bottom of the cavity. (Color figure online)
Mol Biol Rep (2014) 41:7073–7088 7077
123
strategy can be used to model the effect of mutations on
ligand–GALNS interactions.
Missense mutations affecting residues located within the
active cavity were modeled, and the resulting mutated
enzymes were analyzed through docking against C6S, G6S,
6S-GalNAc, and 4MUGPS. Results of energy minimiza-
tion, energy of affinity mutations, and intra-molecular
hydrogen bonds for mutations P77R, C79Y, S80L, H142R,
H166Q, G168R, H236D, N289S, G290S, A291T, A291D,
G301C, G309R, K310N, and M318R, are summarized in
Table 2. Six out of 15 mutations affected amino acids
directly involved in the enzyme–ligand interaction
(underlined mutations). In all cases, the mutations pro-
duced an increase in the energy of affinity for the four
evaluated ligands, suggesting a reduction in the GALNS-
ligand affinity. These results differ from those of a previous
Table 1 Summary of the interactions of natural and artificial ligands for human GALNS enzyme. Chondroitin-6-sulfate (C6S), N-acet-
ylgalactosamine-6-sulfate (6S-GalNAc), galactose-6-sulfate (G6S), or 4-Methylumbelliferyl-b-D-galactopyranoside-6-sulfate (4MUGPS)
Ligand Affinity
energy
(kJ/mol)
Interactions
H-bonds Electrostatic Steric
C6S -173.337 Cys79, Thr100, Ala102, His142,
Tyr181, Lys310, and Asn407
Ca2? Leu78, His103, and Trp409
G6S -115.618 Cys79, Arg83, Ala102, Cys165,
Tyr181, Lys310
Ca2?, Asp39, Asp40, His142,
Asp288, Asn289
Ser80, Tyr108
6S-GalNAc -120.104 Cys79, Tyr181, His236, Asn289, Lys310 Ca2?, Asp39, Asp40, Asp288 Leu78, Ala 102, Trp 409
4MUGPS -121.685 Cys79, Thr100, Ala102, Asn289, Lys310 Ca2?, Asp39, Asp40, His236, Asp288 Asn407, Trp409, Glu410, Gln311
Fig. 3 Molecular docking for galactose-6-sulfate (G6S), N-acetylga-
lactosamine-6-sulfate (6S-GalNAc), and 4-Methylumbelliferyl-b-D-
galactopyranoside-6-sulfate (4MUGPS). a, c, d 3D representation of
residues interacting with G6S, 6S-GalNAc and 4MUGPS. b, c, d 2D
resentation of the residues interacting with G6S, 6S-GalNAc and
4MUGPS. Blue lines hydrogen bonds; green lines electrostatic
interactions; red letters steric interactions. Green sphere represents
calcium ion present at the bottom of the cavity. (Color figure online)
7078 Mol Biol Rep (2014) 41:7073–7088
123
GALNS docking study, in which a change in the energy of
affinity for some mutations was not observed [33]. The
difference in these results could be associated with the
presence of Ca2? within the present model, which has been
shown to have a positive effect on docking results [30], or
to the software used for the docking analysis (AutoDock
Vina vs. MVD).
A previous report suggested that mutation P77R affects
GALNS through the destruction of a hydrophobic core or a
packing modification [34]. However, we have observed
that this mutation affects GALNS enzyme by alteration of
the ligand–enzyme interaction. Meanwhile for mutations
H166Q and G168R, it has been suggested that they modify
the active cavity [34], which agrees with present results.
Sum of all affinity energies for each mutation showed
that energy levels for attenuated and severe phenotypes
were higher than those obtained for the wild type enzyme
(Table 2; Fig. 5a). Furthermore, the sum of all affinity
energie values for severe mutations trend to be higher than
those obtained for attenuated phenotype mutations.
An important observation is that both K310N and
A291D, two mutations associated with attenuated pheno-
type, produced an increase in the energy of affinity similar
to that observed for the other severe mutations within the
active cavity. Although K310 was identified as a residue
involved in enzyme–ligand interaction, K310N mutation
could be associated with an attenuated phenotype since the
ligand interaction with K310 was not lost in K310N. Ne-
verthelss, a different picture was observed for mutations
A291D (attenuated) and A291T (severe). In wildtype
enzyme, A291 has a hydrogen bond with K310, which was
conserved both in A291D and A291T mutations. However,
different atoms are involved in this hydrogen bond:
between A291 oxygen from backbone and K310 hydrogen
from a-amino in wildtype, and between A291D or A291T
hydrogen from carboxyl group and nitrogen from a-amino
(Fig. 5b), which could affect the orientation of K310.
Furthermore, mutation A291T produced a new hydrogen
bond with G301 that is not observed by mutation A291D
(Fig. 5b). This new hydrogen bond in A291T could affect
further the orientation of K310 leading to a severe phe-
notype, due to the direct role of this residue in the enzyme–
ligand interaction.
Since mutations H236D and N289S affect residues
involved in the ligand–enzyme interaction, it would be
expected that they produce a severe phenotype. However,
these mutations are associated with an attenuated pheno-
type. This phenotype could be explained by the fact that in
both cases the ligand–protein interactions present in the
wild-type enzyme were conserved in the mutated enzymes.
We did not observe a direct correlation between the
energy of minimization, loss/gain of hydrogen bonds (data
not shown), and the phenotype of the disease (Table 2),
suggesting that changes in the affinity energy could be a
better predictor for a phenotype–genotype correlation for
mutations affecting the active cavity.
Fig. 4 Molecular dynamics analysis of wild-type GALNS complexed
with G6S and 6S-GalNAc. Results were evaluated for RMSD of
ligands (a) and root mean square fluctuation (RMSF) of G6S (b) and
6S-GalNAc (c) atoms during 20 ns of molecular dynamics simulation.
Atoms with the highest deviations are shown within each RMSF
figure
Mol Biol Rep (2014) 41:7073–7088 7079
123
Additional computational analysis
Figure 6 shows the phylogenetic trees from the alignment
of GALNS sequences several species and the alignment of
sequences from human sulfatases. One hundred ninety-
eight amino acids were conserved among GALNS
enzymes. As expected, human GALNS was closer to that
of Macaca mulatta and distant from Saccoglossus kowa-
levskii (Acorn worm) and Amphimedon queenslandica
(sponge) (Fig. 6a). On the other hand, 17 amino acids were
conserved among the studied sulfatases, and GALNS was
close related to ASA (Fig. 6b), which support the selection
of this model as template for GALNS modeling. Fifteen out
of the 17 conserved residues for sulfatases were also con-
served among the evaluated GALNS enzymes: C79, P81,
S82, R83, G89, G139, K140, H236, D262, D288, G290,
P327, T348, G365, and D447 (amino acid numbers are
relative to human GALNS sequence), while D39 and G132
were conserved among sulfatases but not in GALNS from
other species.
Taking into account the above results, we analyzed 124
missense mutations reported for GALNS from which 103
have a reported phenotype, 69 and 34 associated to severe
and attenuated phenotypes, respectively (Table 3), while
the remaining 21 mutations lack of a reported phenotype
(Table 4). We considered the amino acid conservation in
human sulfatases and GALNS from different species, res-
idue position within GALNS structure, Polyphen2 results,
energy minimization, and Atomic Accessible Surface Area
(AASA) for the genotype–phenotype correlation.
Sixty-nine out of 103 mutations with assigned pheno-
type (67 %) have a reported severe phenotype. From these
69 mutations, 63 (91 %) were located in highly conserved
residues (present at 15–20 of the studied sequences), while
46 (66 %) were located in completely conserved amino
acids among GALNS sequences. Forty-three out of these
69 mutations (62 %) were produced by a non-conservative
change, with 40 of them located in highly conserved resi-
dues. Finally, 34 out of these 69 mutations were located in
residues highly conserved in both GALNS and sulfatases
sequences.
On the other hand, 34 mutations have an attenuated
reported phenotype (Table 2). Twenty-six out of these 34
(76 %) mutations were located in low conserved amino
acids (present at nine or less sulfatases sequences), 11
(32 %) were produced by a conservative change, and 20
(59 %) were located in the surface of the enzyme.
In summary, these results show that severe mutations
preferably affect higly conserved amino acids and involve
non-conservative amino acid changes; while attenuated
mutations seem to affect less conserved residues, without a
clear preference for conservative or non-conservative
amino acid changes, as previously reported [37].
Energy minimization for missense mutations associated
with severe or attenuated phenotype was obtained after
modeling them within the GALNS 3D-structure (Table 3),
Table 2 Results of energy minimization and affinity energy after
docking of chondroitin-6-sulfate (C6S), N-acetylgalactosamine-6-
sulfate (6S-GalNAc), galactose-6-sulfate (G6S), or
4-Methylumbelliferyl-b-D-galactopyranoside-6-sulfate (4MUGPS)
with wild-type and mutated human GALNS. Parenthesis numbers
represent HGMD accession numbers
Mutation Phenotype Energy
minimization
(kJ/mol)
Affinity energy (kJ/mol)
C6S G6S 6S-GalNAc 4MUGPS Sum
GALNS Normal -12,420 -173.337 -115.618 -120.104 -121.685 -530.744
P77R (CM950524) Severe -12,663 -155.047 -113.530 -117.669 -114.834 -501.080
C79Y (CM045155) Severe -11,994 -161.745 -108.831 -115.234 -109.769 -495.579
S80L (CM970577) Severe -11,370 -159.109 -114.209 -120.016 -110.918 -504.252
H142R Severe -12,624 -159.604 -109.509 -119.295 -99.620 -488.028
H166Q (CM970587) Severe -12,559 -172.787 -113.118 -118.002 -106.561 -510.468
G168R (CM970588) Severe -12,260 -160.336 -113.464 -112.341 -103.364 -489.505
H236D Attenuated -12,369 -149.569 -115.125 -117.141 -107.347 -489.182
N289S Attenuated -12,236 -162.749 -112.261 -114.195 -110.827 -500.032
G290S (CM970594) Severe -12,423 -163.532 -114.135 -119.413 -112.757 -509.837
A291T (CM950535) Severe -12,268 -162.139 -113.600 -114.513 -96.969 -487.221
A291D (CM970595) Attenuated -12,349 -172.340 -113.937 -114.181 -115.690 -516.148
G301C (CM970597) Severe -12,091 -167.525 -115.454 -119.218 -102.139 -504.336
G309R (CM970598) Severe -8,589 -152.197 -103.978 -119.095 -116.071 -491.341
K310N (CM054737) Attenuated -12,569 -167.765 -115.497 -112.741 -119.305 -515.308
M318R (CM950536) Severe -12,667 -163.070 -113.307 -117.472 -103.205 -497.054
7080 Mol Biol Rep (2014) 41:7073–7088
123
excluding those present at the signal peptide (M1V, L15M
and G23R). Seventy-three out of 100 mutations (73 %)
produced an increase in the energy minimization in com-
parison to the wildtype enzyme value, with 48 out of 73
mutations (65 %) associated with a severe phenotype.
Similar results were reported for GALNS [34] and N-
acetylgalactosamine-4-sulfatase [28], showing that a high
percentage of mutations associated with a severe pheno-
type were produced by large structural changes in the
protein structure. However, this value by itself cannot
explain the phenotype associated to the mutation.
We further characterized mutations M41L, D60N, R94C,
R94G, F97V, Q111R, F156S, N204K, H236D, N289S,
K310N, T312S, A324E, and M391V, which are associated
with an attenuated phenotype. Mutations M41L, F97V,
T312S, A324E and M391V were produced by a conservative
change of the amino acid, while D60N, N204K, M391V are
placed in the surface of the protein and are not involved in the
active cavity. In the case of mutation N204K, this affects an
N-glycosylation site which could partially affect the intra-
cellular traffic or the stability of the enzyme [19], since one of
N-glycosylation will remain in the enzyme.
Mutations S80L, S135R, S341R, S470P and T312S are
associated with severe and attenuated phenotypes, and
might be associated with predicted phosphorylation sites.
Mutation S80L (severe phenotype), affects a highly con-
served amino acid among GALNS sequences and is present
within the active cavity of the enzyme, while for S135R
and S341R the severe phenotype could be produced by a
non-conservative change from a polar to a basic amino
acid. Although the T312S (attenuated phenotype [43])
affects a highly conserved amino acid, this residue is not
present within the active cavity. In this case, the mutation
was produced by a conservative change, and the serine
could be potentionally phosphorylated.
Recently, it was reported the missense mutation C165Y,
which was associated with a severe phenotype [16]. The
computational analysis of this mutation confirmed the severe
phenotype. C165Y mutation affects a completely conserved
residue within studied GALNS sequences, C165 is part of the
active cavity interacting through a H-bond with the carbon
chain of the ligand G6S, and the mutant enzyme showed a
value of minimization energy higher (-11,458 kJ/mol) than
that observed for the wildtype enzyme.
As previously reported, combination of studied factors
provides a better association between missense variants
and clinical severity than that obtained by either factor
alone [37]. Sixty-two out of the 69 severe mutations
(89.8 %) could be explained by the combination of residue
conservation among GALNS and sulfatases sequences,
residue change conservativeness, residue position within
GALNS structure, Polyphen2 results, and the value of
minimization energy (Table 2); which is higher than the
obtained by the combination of evolutionary conservation
and conservativeness [37]. The remaining severe mutations
(seven), are located in the C-terminal end that in the crystal
structure showed a novel conformation, where the poly-
peptide reverses direction and forms a wall of the active
cavity contributing to substrate selectivity [26], which
could explain the phenotype associated to these mutations.
To explain the 34 mutations associated with attenuated
phenotype, we added the value of the Atomic Accessible
Surface Area as reported previously [26]. Combining all
studied factors, including the docking results for mutations
N289S, A291D and K310N, allowed explaining 27 out of
the 34 mutations (79.4 %). However, high heterogeneity
was observed for the studied factors, which agrees with
previous reports showing high molecular heterogeneity for
this group of mutations [19].
Finally, we used these bioinformatics tools to predict the
phenotype for 21 mutations without a clear reported phe-
notype (Table 4). Fourteen mutations could be associated
Fig. 5 a Total affinity energy (kJ/mol) for the evaluated ligands and
mutations. Phenotype was assigned according to literature reports
(**p \ 0.001). b Effect of A291D (red) and A291T (yellow) mutations.
Blue residues wild-type enzyme; Blue line hydrogen bonds. (Color
figure online)
Mol Biol Rep (2014) 41:7073–7088 7081
123
with a severe phenotype, since they affect 13 highly to
completely conserved residues and have a minimization
energy value higher than that of the wildtype enzyme.
Seven mutations could be associated with an attenuated or
intermediate phenotype, due to low evolutionary conser-
vation, conservativeness residue changes, values of mini-
mization energy close to the wildtype enzyme, and surface
localization.
Conclusions
In summary, we have shown for the first time a docking
evaluation of natural and artificial ligands for human
GALNS enzyme, suggesting the interaction of Asp39,
Asp40, FGly79 (Cys79), Arg83, Tyr108, His142, His236,
Asp288, Asn289, and Lys310. This knowledge could pro-
vide the basis for the search of pharmacological chaperones
for Morquio A disease. Furthermore, combination of sev-
eral bioinformatics approaches allowed to explain 90 % of
missense mutations with a reported phenotype, and the
prediction of the phenotype for 21 missense mutations for
which the phenotype information was not available in the
reported literature. In most cases, severe mutations were
associated with changes of higly to completely conserved
residues, non-conservativeness changes, and low values of
Atomic Accessible Surface Area. On the other hand,
attenuated mutations seem to be associated with none to
Fig. 6 Phylogenetic trees for
GALNS from several species
(a) and for human sulfatases (b).
Sequences were aligned by
using Muscle and phylogenetic
tree was generated by using
MEGA5. Parenthesis numbers
represent GenBank and Uniprot
accession numbers for a and b,
respectively
7082 Mol Biol Rep (2014) 41:7073–7088
123
Ta
ble
3A
nal
ysi
so
fm
isse
nse
mu
tati
on
saf
fect
ing
GA
LN
Sen
zym
ew
ith
rep
ort
edp
hen
oty
pe
Muta
tion
aE
volu
tionar
yco
nse
rvat
ion
Res
ide
chan
ge
Conse
rvat
iven
ess
Posi
tion
Poly
phen
2A
tom
ic
acce
ssib
le
surf
ace
area
(A2)d
Ener
gy
Min
imiz
atio
n
(kJ/
mol)
Rep
ort
ed
phen
oty
pee
GA
LN
Sb
Sulf
atas
esc
Wil
dty
pe
-12,4
20
Norm
al
M1V
(CM
042053)
Com
ple
tely
–C
onse
rvat
iven
ess
Surf
ace
Ben
ign
––
Sev
ere
L15M
(CM
054751)
––
Conse
rvat
iven
ess
Surf
ace
Ben
ign
––
Att
enuat
ed
G23R
(CM
042054)
––
Non-c
onse
rvat
iven
ess
Surf
ace
Ben
ign
––
Att
enuat
ed
L36P
(CM
054746)
Inte
rmed
iate
Low
Conse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,3
18
Att
enuat
ed
M41L
(CM
041769)
Com
ple
tely
Low
Conse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0.2
-12,4
18
Att
enuat
ed
G42E
(CM
054752)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0
-11,4
58
Sev
ere
G47R
(CM
970574)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,6
70
Sev
ere
D60N
(CM
970575)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly7.5
-12,5
78
Att
enuat
ed
R61W
(CM
054740)
Inte
rmed
iate
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly17
-12,1
82
Att
enuat
ed
G66R
(CM
061006)
Hig
hly
Hig
hly
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,2
96
Sev
ere
F69
V(C
M970576)
Hig
hly
Hig
hly
Conse
rvat
iven
ess
Surf
ace
Poss
ibly
0-
12,3
86
Sev
ere
A75G
Hig
hly
Low
Conse
rvat
iven
ess
Act
ive
cavit
yP
oss
ibly
–-
12,3
38
Sev
ere
P77R
(CM
950524)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0
-12,6
63
Sev
ere
C79Y
(CM
045155)
Com
ple
tely
Com
ple
tely
Conse
rvat
iven
ess
Cat
alyti
cP
robab
ly1.3
-11,9
94
Sev
ere
S80L
(CM
970577)
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Cat
alyti
cP
robab
ly0
-11,3
70
Sev
ere
R90W
(CM
950525)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly3.4
-10,9
83
Sev
ere
R94C
(CM
950527)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly3.2
-12,1
37
Att
enuat
ed
R94G
(CM
950526)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly3.2
-12,0
49
Att
enuat
ed
R94L
(CM
054741)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly3.2
-12,1
57
Sev
ere
G96C
(CM
970578)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0.1
-11,7
73
Sev
ere
G96V
(CM
950528)
Com
ple
tely
Hig
hly
Conse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0.1
-11,2
59
Sev
ere
F97V
(CM
960687)
Com
ple
tely
Low
Conse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0
-12,3
92
Att
enuat
ed
Q111R
(CM
970579)
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0.7
-12,5
44
Att
enuat
ed
I113F
(CM
950529)
Com
ple
tely
Low
Conse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0
-11,6
76
Sev
ere
G116S
(CM
054742)
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,4
78
Sev
ere
P125L
(CM
970581)
Com
ple
tely
Low
Conse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-11,2
31
Sev
ere
S135R
(CM
980814)
Non-c
onse
rved
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Poss
ibly
0.2
-11,9
34
Sev
ere
V138A
(CM
950530)
Hig
hly
Low
Conse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,4
14
Att
enuat
ed
G139S
(CM
970582)
Com
ple
tely
Com
ple
tely
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0.1
-11,8
51
Sev
ere
W141R
(CM
970583)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0
-12,5
69
Sev
ere
W141C
(CM
045156)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0
-12,2
72
Sev
ere
H142R
Com
ple
tely
Hig
hly
Conse
rvat
iven
ess
Cat
alyti
cP
robab
ly0.7
-12,6
24
Sev
ere
H150Y
(CM
042056)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly1.9
-11,8
47
Att
enuat
ed
P151S
(CM
950533)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0.2
-12,4
48
Sev
ere
P151L
(CM
950532)
Com
ple
tely
Hig
hly
Conse
rvat
iven
ess
Surf
ace
Pro
bab
ly0.2
-12,2
77
Sev
ere
G155R
(CM
970584)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly2.5
-12,5
54
Sev
ere
Mol Biol Rep (2014) 41:7073–7088 7083
123
Ta
ble
3co
nti
nu
ed
Muta
tion
aE
volu
tionar
yco
nse
rvat
ion
Res
ide
chan
ge
Conse
rvat
iven
ess
Posi
tion
Poly
phen
2A
tom
ic
acce
ssib
le
surf
ace
area
(A2)d
Ener
gy
Min
imiz
atio
n
(kJ/
mol)
Rep
ort
ed
phen
oty
pee
GA
LN
Sb
Sulf
atas
esc
G155E
(CM
041770)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly2.5
-12,3
45
Sev
ere
F156C
(CM
980815)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,3
48
Sev
ere
F156S
(CM
970585)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,3
65
Att
enuat
ed
S162F
(CM
970586)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0.1
-11,8
05
Sev
ere
P163H
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0.6
-12,3
77
Att
enuat
ed
C165Y
Com
ple
tely
Low
Conse
rvat
iven
ess
Cat
alyti
cP
robab
ly–
-11,4
58
Sev
ere
H166Q
(CM
970587)
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Cat
alyti
cP
robab
ly0
-12,5
59
Sev
ere
F167
V(C
M054747)
Hig
hly
Low
Conse
rvat
iven
ess
Act
ive
cavit
yP
oss
ibly
15.2
-12,3
96
Att
enuat
ed
6168R
(CM
970588)
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly1.6
-12,2
60
Sev
ere
D171A
(CM
001701)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly16.7
-12,4
16
Att
enuat
ed
P179S
(CM
023382)
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,4
32
Sev
ere
P179L
(CM
980816)
Com
ple
tely
Low
Conse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,4
41
Sev
ere
P179H
(CM
970589)
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,4
61
Sev
ere
E185G
(CM
970590)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Poss
ibly
17.9
-12,3
00
Sev
ere
N204
K(C
M920291
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Ben
ign
6-
12,2
45
Att
enuat
ed
W230G
(CM
950534)
Com
ple
tely
Low
Conse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,3
01
Sev
ere
D233N
(CM
041771)
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0.3
-12,5
91
Sev
ere
H236D
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Cat
alyti
cP
robab
ly4.7
-12,3
69
Att
enuat
ed
V239F
(CM
054743)
Hig
hly
Low
Conse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-11,4
17
Sev
ere
G247D
(CM
970591)
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Ben
ign
18
-12,5
03
Sev
ere
R253W
(CM
001702)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly6.6
-12,1
93
Att
enuat
ed
A257T
(CM
980817)
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,3
40
Sev
ere
R259Q
(CM
970592)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly9.8
-12,3
33
Att
enuat
ed
F284V
(CM
980818)
Hig
hly
Low
Conse
rvat
iven
ess
Surf
ace
Ben
ign
0.2
-12,4
01
Att
enuat
ed
S287L
(CM
970593)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0
-11,1
94
Sev
ere
N289S
Com
ple
tely
Hig
hly
Conse
rvat
iven
ess
Cat
alyti
cP
robab
ly0
-12,2
36
Att
enuat
ed
G290S
(CM
970594)
Com
ple
tely
Com
ple
tely
Conse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0
-12,4
23
Sev
ere
A291T
(CM
950535)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0.4
-12,2
68
Sev
ere
A291D
(CM
970595)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0.4
-12,3
49
Att
enuat
ed
S295F
(CM
970596)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly3.5
-12,3
13
Att
enuat
ed
G301C
(CM
970597)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly3.7
-12,0
91
Sev
ere
L307P
(CM
045154)
Com
ple
tely
Low
Conse
rvat
iven
ess
Surf
ace
Pro
bab
ly0.4
-12,4
20
Sev
ere
G309R
(CM
970598)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0.2
-8,5
89
Sev
ere
K310N
(CM
054737)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Cat
alyti
cP
robab
ly4.5
-12,5
69
Att
enuat
ed
T312S
(CM
980819)
Com
ple
tely
Low
Conse
rvat
iven
ess
Surf
ace
Ben
ign
0-
12,4
95
Att
enuat
ed
M318R
(CM
950536)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0
-12,6
67
Sev
ere
A324E
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,3
78
Att
enuat
ed
7084 Mol Biol Rep (2014) 41:7073–7088
123
Ta
ble
3co
nti
nu
ed
Muta
tion
aE
volu
tionar
yco
nse
rvat
ion
Res
ide
chan
ge
Conse
rvat
iven
ess
Posi
tion
Poly
phen
2A
tom
ic
acce
ssib
le
surf
ace
area
(A2)d
Ener
gy
Min
imiz
atio
n
(kJ/
mol)
Rep
ort
ed
phen
oty
pee
GA
LN
Sb
Sulf
atas
esc
G340D
(CM
041772)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0.4
-12,4
54
Sev
ere
S341R
(CM
045152)
Hig
hly
Hig
hly
Non-c
onse
rvat
iven
ess
Surf
ace
Poss
ibly
0.8
-12,3
23
Sev
ere
M343R
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly1.6
-12,5
13
Sev
ere
M343L
(CM
950537)
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Ben
ign
1.6
-12,4
08
Sev
ere
D344N
(CM
950538)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0.1
-12,5
87
Sev
ere
D344E
(CM
970600)
Com
ple
tely
Hig
hly
Conse
rvat
iven
ess
Surf
ace
Pro
bab
ly0.1
-12,3
82
Sev
ere
L345P
(CM
041773)
Hig
hly
Low
Conse
rvat
iven
ess
Surf
ace
Pro
bab
ly0.7
-12,1
58
Sev
ere
F346L
(CM
950539)
Hig
hly
Low
Conse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,3
83
Sev
ere
A351V
(CM
970601)
Non-c
onse
rved
Low
Conse
rvat
iven
ess
Surf
ace
Ben
ign
5.6
-12,4
02
Sev
ere
L352P
(CM
041774)
Hig
hly
Hig
hly
Conse
rvat
iven
ess
Surf
ace
Pro
bab
ly2
-11,6
55
Att
enuat
ed
P357L
(CM
042058)
Hig
hly
Low
Conse
rvat
iven
ess
Surf
ace
Ben
ign
8.1
-12,4
00
Sev
ere
R361G
(CM
970602)
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly6.6
-12,0
65
Sev
ere
L369P
(CM
054744)
Hig
hly
Low
Conse
rvat
iven
ess
Surf
ace
Pro
bab
ly3.6
-11,7
67
Sev
ere
R376Q
(CM
980820)
Non-c
onse
rved
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Ben
ign
16.6
-12,3
44
Sev
ere
R380T
(CM
054738)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly1.2
-12,1
80
Sev
ere
R380S
(CM
054739)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly1.2
-12,1
77
Att
enuat
ed
R386C
(CM
950540)
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly1.1
-12,1
64
Sev
ere
R386H
(CM
045153)
Com
ple
tely
Low
Conse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly1.1
-12,1
75
Sev
ere
D388N
(CM
042060)
Non-c
onse
rved
Low
Non-c
onse
rvat
iven
ess
Act
ive
Cav
ity
Ben
ign
5.1
-12,5
88
Att
enuat
ed
M391V
(CM
970603)
Com
ple
tely
Low
Conse
rvat
iven
ess
Surf
ace
Ben
ign
0-
12,4
33
Att
enuat
ed
L395
V(C
M980821)
Inte
rmed
iate
Low
Conse
rvat
iven
ess
Surf
ace
Ben
ign
4.1
-12,4
09
Sev
ere
H398D
(CM
041775)
Non-c
onse
rved
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Ben
ign
2.5
-12,3
57
Att
enuat
ed
N407H
(CM
970605)
Com
ple
tely
Low
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yP
robab
ly0.4
-12,2
04
Sev
ere
W409S
(CM
970606)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Act
ive
cavit
yB
enig
n7.5
-12,3
80
Att
enuat
ed
E450
V(C
M950541)
Com
ple
tely
Hig
hly
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly0
-12,3
86
Sev
ere
F452I
(CM
054753)
Inte
rmed
iate
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Ben
ign
7.7
-12,4
26
Sev
ere
F452L
Inte
rmed
iate
Low
Conse
rvat
iven
ess
Surf
ace
Ben
ign
7.7
-12,4
42
Sev
ere
P484S
(CM
054750)
Hig
hly
Low
Non-c
onse
rvat
iven
ess
Surf
ace
Pro
bab
ly4.1
-12,4
52
Att
enuat
ed
N487S
(CM
950543)
Hig
hly
Hig
hly
Conse
rvat
iven
ess
Surf
ace
Ben
ign
6.9
-12,2
49
Sev
ere
M494V
(CM
970607)
Hig
hly
Low
Conse
rvat
iven
ess
Surf
ace
Ben
ign
1.8
-12,2
32
Sev
ere
aP
aren
thes
isnum
ber
sre
pre
sent
HG
MD
acce
ssio
nnum
ber
sb
Com
ple
tely
conse
rved
:20
spec
ies;
Hig
hly
conse
rved
:15–19
spec
ies;
Inte
rmed
iate
conse
rved
:9–14
spec
ies;
and
non-c
onse
rved
:\9
spec
ies
cC
om
ple
tely
conse
rved
:17
sulf
atas
es;
Hig
hly
conse
rved
:9–16
sulf
atas
es;
low
conse
rved
:\8
sulf
atas
esd
Rep
ort
edby
Riv
era-
Colo
net
al.
[26
]e
Ref
eren
ces
[16,
26
,37
,38
]
Mol Biol Rep (2014) 41:7073–7088 7085
123
Ta
ble
4P
hen
oty
pe
pre
dic
tio
nfo
rm
isse
nse
mu
tati
on
saf
fect
ing
GA
LN
Sen
zym
ew
ith
ou
tre
po
rted
ph
eno
typ
e
Mu
tati
on
aE
vo
luti
on
ary
con
serv
atio
nR
esid
e
chan
ge
con
serv
ativ
enes
s
Po
siti
on
Po
lyp
hen
2A
tom
ic
acce
ssib
le
surf
ace
area
(A2)d
En
erg
y
Min
imiz
atio
n
(kJ/
mo
l)
Pre
dic
ted
ph
eno
typ
eG
AL
NS
bS
ulf
atas
esc
S5
3F
(CM
05
47
55
)H
igh
lyL
ow
No
n-c
on
serv
ativ
enes
sS
urf
ace
Pro
bab
ly3
.8-
12
,40
7S
ever
e
S7
4F
Inte
rmed
iate
Lo
wN
on
-co
nse
rvat
iven
ess
Su
rfac
eP
rob
ably
–-
11
,74
9S
ever
e
T8
8I
Hig
hly
Hig
hly
No
n-c
on
serv
ativ
enes
sS
urf
ace
Pro
bab
ly0
-1
2,3
65
Sev
ere
A1
07
T(C
M0
42
05
5)
Hig
hly
Lo
wN
on
-co
nse
rvat
iven
ess
Act
ive
cav
ity
Pro
bab
ly0
-1
1,9
75
Sev
ere
E1
21
DC
om
ple
tely
Hig
hly
Co
nse
rvat
iven
ess
Su
rfac
eP
rob
ably
–-
12
,38
0A
tten
uat
ed
N1
64
T(C
M0
45
15
1)
Co
mp
lete
lyL
ow
Co
nse
rvat
iven
ess
Act
ive
cav
ity
Pro
bab
ly0
.8-
12
,28
0S
ever
e
A2
03
V(C
M0
54
74
9)
Inte
rmed
iate
Lo
wC
on
serv
ativ
enes
sS
urf
ace
Ben
ign
0-
12
,40
8A
tten
uat
ed
Y2
54
CC
om
ple
tely
Hig
hly
Co
nse
rvat
iven
ess
Act
ive
cav
ity
Pro
bab
ly–
-1
2,3
57
Sev
ere
E2
60
KC
om
ple
tely
Hig
hly
No
n-c
on
serv
ativ
enes
sS
urf
ace
Pro
bab
ly–
-1
2,3
24
Sev
ere
E2
60
D(C
M0
54
74
8)
Co
mp
lete
lyH
igh
lyC
on
serv
ativ
enes
sS
urf
ace
Pro
bab
ly0
-1
2,4
19
Att
enu
ated
T3
12
AC
om
ple
tely
Lo
wN
on
-co
nse
rvat
iven
ess
Su
rfac
eP
rob
ably
0-
12
,50
2S
ever
e
G3
16
VC
om
ple
tely
Hig
hly
Co
nse
rvat
iven
ess
Act
ive
cav
ity
Pro
bab
ly0
.1-
11
,05
7S
ever
e
W3
25
C(C
M0
54
75
6)
Co
mp
lete
lyL
ow
No
n-c
on
serv
ativ
enes
sS
urf
ace
Pro
bab
ly0
-1
2,3
43
Sev
ere
L3
66
FH
igh
lyL
ow
Co
nse
rvat
iven
ess
Su
rfac
eB
enig
n7
.1-
12
,41
4A
tten
uat
ed
A3
92
V(C
M0
42
06
1)
Co
mp
lete
lyL
ow
Co
nse
rvat
iven
ess
Su
rfac
eP
rob
ably
0-
12
,39
5S
ever
e
T3
94
PIn
term
edia
teL
ow
No
n-c
on
serv
ativ
enes
sS
urf
ace
Po
ssib
ly–
-1
2,3
26
Sev
ere
L3
95
P(C
M9
70
60
4)
Inte
rmed
iate
Lo
wC
on
serv
ativ
enes
sS
urf
ace
Ben
ign
4.1
-1
2,3
49
Att
enu
ated
H4
01
Y(C
M0
54
75
4)
Co
mp
lete
lyL
ow
No
n-c
on
serv
ativ
enes
sS
urf
ace
Pro
bab
ly0
.1-
11
,43
0S
ever
e
S4
70
P(C
M0
54
74
5)
No
n-c
on
serv
edL
ow
No
n-c
on
serv
ativ
enes
sS
urf
ace
Ben
ign
24
.3-
11
,54
8A
tten
uat
ed
V4
88
M(C
M9
50
54
4)
Hig
hly
Lo
wC
on
serv
ativ
enes
sS
urf
ace
Ben
ign
17
.5-
12
,38
8A
tten
uat
ed
N4
95
YC
om
ple
tely
Lo
wC
on
serv
ativ
enes
sS
urf
ace
Pro
bab
ly–
-1
1,3
06
Sev
ere
aP
aren
thes
isn
um
ber
sre
pre
sen
tH
GM
Dac
cess
ion
nu
mb
ers
bC
om
ple
tely
con
serv
ed:
20
spec
ies;
Hig
hly
con
serv
ed:
15
–1
9sp
ecie
s;In
term
edia
teco
nse
rved
:9
–1
4sp
ecie
s;an
dn
on
-co
nse
rved
:\9
spec
ies
cC
om
ple
tely
con
serv
ed:
17
sulf
atas
es;
Hig
hly
con
serv
ed:
9–
16
sulf
atas
es;
low
con
serv
ed:\
8su
lfat
ases
dR
epo
rted
by
Riv
era-
Co
lon
etal
.[2
6]
7086 Mol Biol Rep (2014) 41:7073–7088
123
intermediate amino acid conservation, conservativeness
changes, surface position, high values of Atomic Accessi-
ble Surface Area, and energy minimization close or lower
than that of the wildtype enzyme. However, several factors
could limit the phenotype-genotype correlation, such as the
subjective classification of patients (clinical and biochem-
ical), association of GALNS with the Multienzyme Lyso-
somal Complex of b-galactosidase, cathepsin A, and
neuraminidase [25]; or epigenetic factors. Furthermore,
future anlaysis should consider the relation with the levels
of urine and blood keratin and/or chondroitin sulfate [8].
We consider that this approach can be extrapolated for the
study of phenotype–genotype correlation for other lyso-
somal storage diseases, which could have important
implications in the understanding of the disease, and the
design of new therapeutic strategies.
Acknowledgments This work was supported in part by The
Administrative Department of Science, Technology and Innovation
COLCIENCIAS (Colombia) grant 120356933205 (ID PPTA 5174) and
Pontificia Universidad Javeriana (PPTA 5596). Alexander Rodriguez-
Lopez recived a PhD scholarship from Pontificia Universidad Javeri-
ana. We thank Alexander Herrera at the High Performance Computing
Center—ZINE—of Pontificia Universidad Javeriana, for his assistance
during docking and molecular dynamics simulations.
References
1. Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova
A, Bork P, Kondrashov AS, Sunyaev SR (2010) A method and
server for predicting damaging missense mutations. Nat Methods
7:248–249
2. Almeciga-Dıaz C, Montano AM, Tomatsu S, Barrera L (2010)
Adeno-associated virus gene transfer on Morquio A: effect of pro-
moters and sulfatase-modifying Factor 1. FEBS J 277:3608–3619
3. Almeciga-Dıaz C, Rueda-Paramo M, Espejo A, Echeverri O, Mont-
ano A, Tomatsu S, Barrera L (2009) Effect of Elongation Factor 1apromoter and SUMF1 over in vitro expression of N-acet-
ylgalactosamine-6-sulfate sulfatase. Mol Biol Rep 36:1863–1870
4. Balamurugan B, Roshan B, Hameed BS, Sumathi K, Senthilku-
mar R, Udayakumar A, Babu KHV, Kalaivani M, Sowmiya G,
Sivasankari P, Saravanan S, Ranjani CV, Gopalakrishnan K,
Selvakumar KN, Jaikumar M, Brindha T, Michael D, Sekar K
(2007) PSAP: protein structure analysis package. J Appl Crys-
tallogr 40:773–777
5. Cosma M, Pepe P, Annunziata I, Newbold R, Grompe M, Parenti
G, Ballabio A (2003) The multiple sulfatase deficiency gene
encodes an essential and limiting factor for the activity of sul-
fatases. Cell 113:445–456
6. Crunkhorn S (2013) Trial watch: enzyme replacement success in
Phase III trial for rare metabolic disorder. Nat Rev Drug Discov
12:12
7. Dundas J, Ouyang Z, Tseng J, Binkowski A, Turpaz Y, Liang J
(2006) CASTp: computed atlas of surface topography of proteins
with structural and topographical mapping of functionally anno-
tated residues. Nucleic Acids Res 34:W116–W118
8. Dung VC, Tomatsu S, Montano AM, Gottesman G, Bober MB,
Mackenzie W, Maeda M, Mitchell GA, Suzuki Y, Orii T (2013)
Mucopolysaccharidosis IVA: correlation between genotype,
phenotype and keratan sulfate levels. Mol Genet Metab
110:129–138
9. Dvorak-Ewell M, Wendt D, Hague C, Christianson T, Koppaka
V, Crippen D, Kakkis E, Vellard M (2010) Enzyme replacement
in a human model of mucopolysaccharidosis IVA in vitro and its
biodistribution in the cartilage of wild type mice. PLoS One
5:e12194
10. Edgar RC (2004) MUSCLE: a multiple sequence alignment
method with reduced time and space complexity. BMC Bioinform
5:113
11. Funderburgh J (2002) Keratan sulfate biosynthesis. IUBMB Life
54:187–194
12. Ghosh D (2007) Human sulfatases: a structural perspective to
catalysis. Cell Mol Life Sci 64:2013–2022
13. Guex N, Peitsch MC (1997) SWISS-MODEL and the Swiss-
PdbViewer: an environment for comparative protein modeling.
Electrophoresis 18:2714–2723
14. Gutierrez M, Garcia-Vallejo F, Tomatsu S, Ceron F, Almeciga-
Dıaz C, Domınguez M, Barrera L (2008) Construction of an
adenoassociated virus-derived vector for the treatment of Mor-
quio A disease. Biomedica 28:448–459
15. Kornfeld S (1987) Trafficking of lysosomal enzymes. FASEB J
1:462–468
16. Kubaski F, Brusius-Facchin AC, Palhares HM, Balarin MA,
Viapiana-Camelier M, Guidobono R, Burin MG, Giugliani R,
Leistner-Segal S (2013) Identification of a novel missense
mutation in Brazilian patient with a severe form of mucopoly-
saccharidosis type IVA. Gene 517:112–115
17. Laskowski RA, Chistyakov VV, Thornton JM (2005) PDBsum
more: new summaries and analyses of the known 3D structures of
proteins and nucleic acids. Nucleic Acids Res 33:D266–D268
18. Masue M, Sukegawa K, Orii T, Hashimoto T (1991) N-Acet-
ylgalactosamine-6-sulfate sulfatase in human placenta: purifica-
tion and characteristics. J Biochem 110:965–970
19. Montano AM, Sukegawa K, Kato Z, Carrozzo R, Di Natale P,
Christensen E, Orii KO, Orii T, Kondo N, Tomatsu S (2007)
Effect of ‘attenuated’ mutations in mucopolysaccharidosis IVA
on molecular phenotypes of N-acetylgalactosamine-6-sulfate
sulfatase. J Inherit Metab Dis 30:758–767
20. Montano AM, Tomatsu S, Gottesman G, Smith M, Orii T (2007)
International Morquio A registry: clinical manifestation and nat-
ural course of Morquio A disease. J Inherit Metab Dis 30:165–174
21. Mosquera A, Rodrıguez A, Vargas C, Leonardi F, Espejo A,
Sanchez O, Almeciga-Dıaz C, Barrera L (2012) Characterization
of a recombinant N-acetylgalactosamine-6-sulfate sulfatase pro-
duced in E. coli for enzyme replacement therapy of Morquio A
disease. Process Biochem 47:2097–2102
22. Olarte-Avellaneda S, Rodrıguez-Lopez A, Almeciga-Dıaz CJ
(2014) In-silico analysis of the active cavity of N-acet-
ylgalactosamine-6-sulfate sulfatase in eight species. In: Castillo
LF, Cristancho M, Isaza G, Pinzon A, Corchado JM (eds) Advances
in computational biology. Springer, Las Vegas, pp 141–146
23. Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt
DM, Meng EC, Ferrin TE (2004) UCSF Chimera: a visualization
system for exploratory research and analysis. J Comput Chem
25:1605–1612
24. Pronk S, Pall S, Schulz R, Larsson P, Bjelkmar P, Apostolov R, Shirts
MR, Smith JC, Kasson PM, van der Spoel D, Hess B, Lindahl E
(2013) GROMACS 4.5: a high-throughput and highly parallel open
source molecular simulation toolkit. Bioinformatics 29:845–854
25. Pshezhetsky AV, Potier M (1996) Association of N-acet-
ylgalactosamine-6-sulfate sulfatase with the multienzyme lyso-
somal complex of beta-galactosidase, cathepsin A, and
neuraminidase. Possible implication for intralysosomal catabolism
of keratan sulfate. J Biol Chem 271:28359–28365
Mol Biol Rep (2014) 41:7073–7088 7087
123
26. Rivera-Colon Y, Schutsky EK, Kita AZ, Garman SC (2012) The
structure of human GALNS reveals the molecular basis for
mucopolysaccharidosis IV A. J Mol Biol 423:736–751
27. Rodriguez A, Espejo AJ, Hernandez A, Velasquez OL, Lizaraso LM,
Cordoba HA, Sanchez OF, Almeciga-Diaz CJ, Barrera LA (2010)
Enzyme replacement therapy for Morquio A: an active recombinant
N-acetylgalactosamine-6-sulfate sulfatase produced in Escherichia
coli BL21. J Ind Microbiol Biotechnol 37:1193–1201
28. Saito S, Ohno K, Sugawara K, Sakuraba H (2008) Structural and
clinical implications of amino acid substitutions in N-acet-
ylgalactosamine-4-sulfatase: insight into mucopolysaccharidosis
type VI. Mol Genet Metab 93:419–425
29. Sardiello M, Annunziata I, Roma G, Ballabio A (2005) Sulfatases
and sulfatase modifying factors: an exclusive and promiscuous
relationship. Hum Mol Genet 14:3203–3217
30. Schenk M, Koppisetty CA, Santos DC, Carmona E, Bhatia S,
Nyholm PG, Tanphaichitr N (2009) Interaction of arylsulfatase-A
(ASA) with its natural sulfoglycolipid substrates: a computational
and site-directed mutagenesis study. Glycoconj J 26:1029–1045
31. Schuttelkopf AW, van Aalten DM (2004) PRODRG: a tool for
high-throughput crystallography of protein-ligand complexes.
Acta Crystallogr Sect D 60:1355–1363
32. Stenson PD, Mort M, Ball EV, Shaw K, Phillips A, Cooper DN
(2014) The Human Gene Mutation Database: building a com-
prehensive mutation repository for clinical and molecular
genetics, diagnostic testing and personalized genomic medicine.
Hum Genet 133:1–9
33. Sudhakar SC, Mahalingam K (2011) Structural and functional
analysis of N-acetylgalactosamine-6-sulfate sulfatase using bio-
informatics tools: insight into Mucopolysaccharidosis IVA.
J Pharm Res 4:3958–3962
34. Sukegawa K, Nakamura H, Kato Z, Tomatsu S, Montano AM,
Fukao T, Toietta G, Tortora P, Orii T, Kondo N (2000) Bio-
chemical and structural analysis of missense mutations in N-
acetylgalactosamine-6-sulfate sulfatase causing mucopolysac-
charidosis IVA phenotypes. Hum Mol Genet 9:1283–1290
35. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S
(2011) MEGA5: molecular evolutionary genetics analysis using
maximum likelihood, evolutionary distance, and maximum par-
simony methods. Mol Biol Evol 28:2731–2739
36. Tomatsu S, Fukuda M, Masue K, Sukegawa T, Fukao A, Ya-
magishi T, Hori H, Iwati T, Ogawa Y (1991) Morquio disease:
isolation, characterization and expression of full-length cDNA for
human N-acetylgalactosamine-6-sulfate sulfatase. Biochem Bio-
phys Res Commun 181:677–683
37. Tomatsu S, Montano A, Lopez P, Trandafirescu G, Gutierrez M,
Oikawa H, Nishioka T, Vieira M, Orii T, Noguchi A (2006)
Determinant factors of spectrum of missense variants in muco-
polysaccharidosis IVA gene. Mol Genet Metab 89:139–149
38. Tomatsu S, Montano A, Nishioka T, Gutierrez M, Pena O,
Trandafirescu G, Lopez P, Yamaguchi S, Noguchi A, Orri T (2005)
Mutation and polymorphism spectrum of the GALNS gene in
mucopolysaccharidosis IVA (Morquio A). Hum Mutat 26:500–512
39. Tomatsu S, Montano A, Ohashi A, Oikawa H, Oguma T, Dung V,
Nishioka T, Orii T, Sly W (2008) Enzyme replacement therapy in
a murine model of Morquio A syndrome. Hum Mol Genet
17:815–824
40. Tomatsu S, Montano AM, Dung V, Ohashi A, Oikawa H, Oguma
T, Orii T, Barrera L, Sly W (2010) Enhacement of drug delivery:
enzyme-replacement therapy for murine Morquio A syndrome.
Mol Ther 18:1094–1102
41. Tomatsu S, Montano AM, Oikawa H, Smith M, Barrera L, Chi-
nen Y, Thacker MM, Mackenzie WG, Suzuki Y, Orii T (2011)
Mucopolysaccharidosis type IVA (Morquio A disease): clinical
review and current treatment. Curr Pharm Biotechnol 12:931–945
42. van Diggelen O, Zhao H, Kleijer W, Janse H, Poorthuis B, Pelt
JV, Kamerling J, Galjaard H (1990) A fluorometric enzyme assay
for the diagnosis of Morquio disease type A (MPS IVA). Clin
Chem Acta 187:131–140
43. Yamada N, Fukuda S, Tomatsu S, Muller V, Hopwood JJ, Nelson
J, Kato Z, Yamagishi A, Sukegawa K, Kondo N, Orii T (1998)
Molecular heterogeneity in mucopolysaccharidosis IVA in Aus-
tralia and Northern Ireland: nine novel mutations including
T312S, a common allele that confers a mild phenotype. Hum
Mutat 11:202–208
44. Zhang Y (2008) I-TASSER server for protein 3D structure pre-
diction. BMC Bioinform 9:40
7088 Mol Biol Rep (2014) 41:7073–7088
123