Supplementary information
Elucidation of the complete biosynthetic pathway of the main triterpene
glycosylation products of Panax notoginseng using a synthetic biology platform
Dong Wang1†, Jinhe Wang1,2†, Yusong Shi1, Rongsheng Li1,3, Feiyu Fan1, Ying Huang1, Weixian Li1,4, Ning Chen2,
Luqi Huang5, Zhubo Dai1* and Xueli Zhang1*
1Key Laboratory of Systems Microbial Biotechnology, Tianjin Institute of Industrial Biotechnology, Chinese
Academy of Sciences, China. 2 College of Biotechnology, Tianjin University of Science and Technology, China. 3
School of Pharmacy and Yunnan Key Laboratory of Natural Medicine Pharmacology, Kunming Medical
University, China. 4 College of Pharmaceutical Science, Yunnan University of Chinese Medicine, China. 5 State
Key Laboratory Breeding Base of Dao-di Herbs, National Resource Center for Chinese Materia Medica, China
Academy of Chinese Medical Sciences, China.
†These authors contributed equally to this work.
*Corresponding author at: 32 West 7th Ave, Tianjin Airport Economic Park, Tianjin, 300308, China. Tel and Fax:
86-22-84861946. Email address: [email protected].
*Corresponding author at: 32 West 7th Ave, Tianjin Airport Economic Park, Tianjin, 300308, China. Tel and Fax:
86-22-84861983. Email address: [email protected].
List:
1. Supplementary Methods
2. Supplementary Figures
3. Supplementary Tables
4. Supplementary Sequences 1
5. Supplementary Sequences 2
6. Supplementary Sequences 3
Supplementary Methods
Gene mining
Total RNA was isolated from the control and drug-treated samples using TRIzol reagent (Invitrogen,
Grand Island, USA) according to the manufacturer's protocol and used for high-throughput RNA
sequencing. Two 150-bp paired-end RNA-seq libraries were generated commercially by Novogene
Biotechnology Co. Ltd (Tianjin, China) on the Illumina HiSeq X Ten platform (Illumina, San Diego,
USA). The 7.3 and 5.3 million raw reads from the control and drug-treated samples, respectively, were
filtered with Trimmomatic (v0.35) [1] to remove adapters and with the NGS QC Toolkit (v2.3.3) [2] to
remove low-quality reads and to discard reads below the length threshold. The clean reads from the two
samples were then pooled, and a total of 12,454,362 reads were used to build a de novo transcriptome
assembly with the Trinity pipeline (v2.4.0) [3] using the default parameters, including a fixed k-mer size
of 25 nt and the min_kmer_cov 2 option. Only assembled transcripts longer than 600 bp were kept. In
addition, published datasets from the NCBI Sequence Read Archive (SRA), namely SRR5282580,
SRR5286139, SRR5288612, SRR5288616 and SRR5288620, were re-analyzed by the same procedure. To
estimate transcript abundance, the high-quality (HQ) reads were mapped back to the assembled
transcriptome using Bowtie2 (v2.3.2) [4], and abundance estimates (fragments per kilobase of transcript
length per million fragments mapped, FPKM) were calculated with RSEM [5]. The reference transcriptome
was annotated on a local server against the Swiss-Prot and Pfam databases using BLASTX with an
e-value cutoff of 1e-5.
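The two numeric steps in this paragraph, the 600-bp length filter and the FPKM definition, can be illustrated with a minimal Python sketch. It is not the pipeline used in this work: the function names are hypothetical, the FASTA reader is a simplified stand-in for the Trinity output handling, and the FPKM function implements only the textbook definition (RSEM itself estimates fragment counts and uses an effective transcript length, so its reported values will differ in detail).

```python
from typing import Dict, Iterable, Iterator, Tuple


def read_fasta(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (name, sequence) pairs from FASTA-formatted lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            header = line[1:].strip()
            # Use the first whitespace-delimited token as the record name.
            name, chunks = (header.split()[0] if header else ""), []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def keep_long_transcripts(records, min_len: int = 600) -> Dict[str, str]:
    """Keep only transcripts strictly longer than min_len bp."""
    return {name: seq for name, seq in records if len(seq) > min_len}


def fpkm(fragments: float, length_bp: float, total_mapped_fragments: float) -> float:
    """Fragments per kilobase of transcript per million mapped fragments."""
    return fragments * 1e9 / (length_bp * total_mapped_fragments)


if __name__ == "__main__":
    # Toy records only; these are not sequences from this work.
    toy = [">t1 short", "ACGT", ">t2 long", "A" * 601]
    print(list(keep_long_transcripts(read_fasta(toy))))
    print(fpkm(fragments=100, length_bp=1000, total_mapped_fragments=1_000_000))
```

In this sketch, a 1-kb transcript with 100 mapped fragments in a library of one million mapped fragments has an FPKM of 100.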
Plasmid construction
1. Construction of pRS425-LEU2-TEF1-Pn1-31
The Pn1-31 gene was amplified from the cDNA of Panax notoginseng (using primer set
SexAI-Pn1-31/Pn1-31-AscI). The TEF1 promoter (PTEF1) and CYC1 terminator (TCYC1) were amplified from the
genomic DNA of S. cerevisiae BY4742 (using primer sets SacII-TEF1/SexAI-TEF1 and
AscI-CYC1t/CYC1t-SacII). The three amplified fragments were digested with SexAI/AscI, SexAI and
AscI, respectively, ligated with T4 ligase, and used as the template for PCR amplification of the
PTEF1-Pn1-31-TCYC1 cassette (using primer set SacII-TEF1/CYC1t-SacII). The amplified product was
cloned into the SacII site of plasmid pRS425, resulting in pRS425-LEU2-TEF1-Pn1-31.
2. Construction of pRS425-LEU2-TEF1-Pn3-29
The Pn3-29 gene was amplified from the cDNA of Panax notoginseng (using primer set
PacI-Pn3-29/Pn3-29-AscI). The TEF1 promoter (PTEF1) and CYC1 terminator (TCYC1) were amplified from the
genomic DNA of S. cerevisiae BY4742 (using primer sets SacII-TEF1/PacI-TEF1 and
AscI-CYC1t/CYC1t-SacII). The three amplified fragments were digested with PacI/AscI, PacI and AscI,
respectively, ligated with T4 ligase, and used as the template for PCR amplification of the
PTEF1-Pn3-29-TCYC1 cassette (using primer set SacII-TEF1/CYC1t-SacII). The amplified product was
cloned into the SacII site of plasmid pRS425, resulting in pRS425-LEU2-TEF1-Pn3-29.
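Because the cloning steps above rely on restriction sites carried in the 5' tails of the primers, a quick check that each primer contains the intended site can catch ordering mistakes. The Python sketch below is illustrative only. The recognition sequences are the standard ones for SexAI (ACCWGGT), AscI (GGCGCGCC), PacI (TTAATTAA) and SacII (CCGCGG), and the example primers are copied from Table S2. The function and table names are hypothetical, and the W class also accepts a literal W so that degenerate primers match.

```python
import re

# IUPAC codes used by the sites below, as regex character classes.
# "W" (A or T) also accepts a literal W so degenerate primers still match.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "[ATW]"}

# Recognition sequences of the enzymes named in the text.
SITES = {
    "SexAI": "ACCWGGT",
    "AscI": "GGCGCGCC",
    "PacI": "TTAATTAA",
    "SacII": "CCGCGG",
}


def site_regex(site: str) -> "re.Pattern[str]":
    """Translate a recognition sequence into a compiled regex."""
    return re.compile("".join(IUPAC[base] for base in site))


def find_sites(primer: str) -> dict:
    """Return {enzyme: 0-based start of first match} for each site in the primer."""
    primer = primer.upper()
    hits = {}
    for enzyme, site in SITES.items():
        match = site_regex(site).search(primer)
        if match:
            hits[enzyme] = match.start()
    return hits


if __name__ == "__main__":
    # Primer sequences as listed in Table S2.
    primers = {
        "SexA1-Pn1-31": "GCGACCTGGTATGGACAGAGAAATGTTGAGCAAAAC",
        "Pn1-31-Asc1": "GCGGCGCGCCTCAGGAGGACACAAGCTTTGAAATG",
        "Pac1-Pn3-29": "GCGTTAATTAAATGAAGTCAGAATTGATATTCTTGC",
        "Sac11-TEF1": "GCGCCGCGGAGTGATCCCCCACACACCATAGCTT",
    }
    for name, seq in primers.items():
        print(name, find_sites(seq))
```

The sketch reports only the first occurrence of each site and does not check for unwanted internal sites in the amplified gene, which would need the full template sequence.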
3. Construction of pRS425-LEU2-TEF1-Pn1-2, Pn1-3, Pn1-4, Pn1-6, Pn1-12, Pn1-14,
Pn1-15, Pn1-16, Pn1-17, Pn1-20, Pn1-21, Pn1-23, Pn1-24, Pn1-25, Pn1-28, Pn1-29, Pn1-30,
Pn1-31, Pn1-34, Pn1-37, Pn1-39, Pn1-45, Pn1-46, Pn1-47, Pn1-48, Pn1-49, Pn1-50, Pn1-51,
Pn1-52, Pn1-53, Pn1-54, Pn1-55, Pn1-57, Pn1-58, Pn1-61, Pn1-63, Pn1-66, Pn2-7, Pn2-9,
Pn2-19, Pn2-20, Pn2-21, Pn2-30, Pn2-31, Pn2-33, Pn2-35, Pn2-47, Pn3-1, Pn3-4, Pn3-5,
Pn3-6, Pn3-7, Pn3-8, Pn3-9, Pn3-11, Pn3-13, Pn3-15, Pn3-21, Pn3-24, Pn3-26, Pn3-27, Pn3-28,
Pn3-31, Pn3-32, Pn3-32-i5, Pn3-33, Pn3-35, Pn3-36, Pn3-38, Pn3-39, Pn3-40, Pn3-41,
Pn3-42, Pn3-43, Pn3-44, Pn3-47, Pn3-54, Pn3-55, Pn3-60, Pn3-67, Pn3-68, Pn3-74, Pn3-75,
Pn3-76, Pn3-79, Pn3-87, Pn3-88, Pn3-89, Pn3-91, Pn3-97, Pn3-98, Pn3-103, Pn3-107
The following genes were each amplified from the cDNA of Panax notoginseng (using the primer sets
in Table S2): Pn1-2, Pn1-3, Pn1-4, Pn1-6, Pn1-12, Pn1-14, Pn1-15, Pn1-16, Pn1-17, Pn1-20,
Pn1-21, Pn1-23, Pn1-24, Pn1-25, Pn1-28, Pn1-29, Pn1-30, Pn1-31, Pn1-34, Pn1-37, Pn1-39, Pn1-45,
Pn1-46, Pn1-47, Pn1-48, Pn1-49, Pn1-50, Pn1-51, Pn1-52, Pn1-53, Pn1-54, Pn1-55, Pn1-57,
Pn1-58, Pn1-61, Pn1-63, Pn1-66, Pn2-7, Pn2-9, Pn2-19, Pn2-20, Pn2-21, Pn2-30, Pn2-31,
Pn2-33, Pn2-35, Pn2-47, Pn3-1, Pn3-4, Pn3-5, Pn3-6, Pn3-7, Pn3-8, Pn3-9, Pn3-11, Pn3-13, Pn3-15,
Pn3-21, Pn3-24, Pn3-26, Pn3-27, Pn3-28, Pn3-31, Pn3-32, Pn3-32-i5, Pn3-33, Pn3-35, Pn3-36,
Pn3-38, Pn3-39, Pn3-40, Pn3-41, Pn3-42, Pn3-43, Pn3-44, Pn3-47, Pn3-54, Pn3-55, Pn3-60,
Pn3-67, Pn3-68, Pn3-74, Pn3-75, Pn3-76, Pn3-79, Pn3-87, Pn3-88, Pn3-89, Pn3-91, Pn3-97,
Pn3-98, Pn3-103 and Pn3-107. Each gene was cloned into plasmid pRS425-LEU2-TEF1-Pn1-31-CYC1
at the SexAI and AscI sites, resulting in one plasmid per gene, named pRS425-LEU2-TEF1-Pn1-2
through pRS425-LEU2-TEF1-Pn3-107 accordingly.
4. Construction of pRS425-LEU2-TEF1-Pn2-28, Pn2-39, Pn3-19, Pn3-25, Pn3-106
The Pn2-28, Pn2-39, Pn3-19, Pn3-25 and Pn3-106 genes were each amplified from the cDNA of
Panax notoginseng (using the primer sets in Table S2) and cloned into plasmid
pRS425-LEU2-TEF1-Pn3-29-CYC1 at the PacI and AscI sites, resulting in plasmids
pRS425-LEU2-TEF1-Pn2-28, Pn2-39, Pn3-19, Pn3-25 and Pn3-106.
5. Construction of pM3-Pn1-31 and pM3-SynAtUGD1
Pn1-31 and p-SynAtUGD1 were each digested with SexAI and AscI and cloned into plasmid
pM3-tHMG1 at the SexAI and AscI sites, resulting in plasmids pM3-Pn1-31 and pM3-SynAtUGD1,
respectively.
6. Construction of pM13-Pn3-29
Pn3-29 was digested with PacI and AscI and cloned into plasmid pM13-PgPPDS [6] at the PacI
and AscI sites, resulting in plasmid pM13-Pn3-29.
7. Construction of pM9-Pn1-31 and pM9-SynAtUXS3
Pn1-31 and p-SynAtUXS3 were each digested with SexAI and AscI and cloned into plasmid
pM9-ERG12 [7] at the SexAI and AscI sites, resulting in plasmids pM9-Pn1-31 and pM9-SynAtUXS3,
respectively.
8. Construction of pM16-Pn3-31 and pM16-SynUGTPg101
Pn3-31 and p-SynUGTPg101 were each digested with SexAI and AscI and cloned into plasmid
pM16-IDI1 [7] at the SexAI and AscI sites, resulting in plasmids pM16-Pn3-31 and
pM16-SynUGTPg101, respectively.
9. Construction of pRS425-LEU2-TEF1-SynPn3-29
p-SynPn3-29 was digested with SexAI and AscI and cloned into plasmid
pRS425-LEU2-TEF1-Pn1-31 at the SexAI and AscI sites, resulting in plasmid
pRS425-LEU2-TEF1-SynPn3-29.
10. Construction of p-TRP1
TRP1 was amplified from the S. cerevisiae S288C genome using primer set BSP-TRP1-F/BSP-TRP1-R
and cloned into plasmid pEASY-Blunt Simple, resulting in plasmid p-TRP1.
11. Construction of pLPPgRNA
Plasmid p425-SNR52p-gRNA.CAN1.Y-SUP4t (purchased from Addgene) was amplified using primer set
43803-up/43803-LPP-down, and the product was digested with DpnI, resulting in pLPPgRNA.
Strain construction
Transformation of S. cerevisiae strains was performed as described previously [6-8].
1. BY-GM
Strain BY-GM was constructed by integrating the PGM1, PGM2, UGP1 and SynPgPPDS
genes into the EGH1 site of strain BY-T3 [8]. Seven DNA fragments, amplified from
pM2-PGM1 (using primer set 1-M-pEASY-PGK1-F/s-4G-1-M-ADHT-FBA1-R),
pM8-PGM2 (using primer set s-4G-4-M-ADH1t-FBA1-F/s-4G-4-M-TDH2t-TDH3-R),
pM4-UGP1 (using primer set S-4G-3M-TDH2t-TDH3-F/3G-3-M-TPI1t-TEF1-R),
pM3-SynPgPPDS (using primer set 3G-2-M-TPI1t-TEF1-F/M-CYC1t-pEASY-R),
the S. cerevisiae genome (using primer sets EGH1-UP-F/EGH1-UP-R and EGH1-DOWN-F/EGH1-DOWN-R)
and p-TRP1 (using primer set EGH1-UP-Marker-50bp-F/Only-PGK1-R), were transformed into
strain BY-T3, followed by selection on an SD-HIS-TRP plate. The strain was verified by PCR analysis.
2. PPD-PPT-GM
Strain PPD-PPT-GM was constructed by integrating the SynPgDDS, SynPgPPTS, AtCPR1
and SynPgPPDS genes into the GAL7 site of strain BY-GM. Six DNA fragments, amplified from
pM2-SynPgDDS (using primer set 1-M-pEASY-PGK1-F/s-4G-1-M-ADHT-FBA1-R),
pM8-SynPgPPTS (using primer set s-4G-4-M-ADH1t-FBA1-F/s-4G-4-M-TDH2t-TDH3-R),
pM4-AtCPR1 (using primer set S-4G-3M-TDH2t-TDH3-F/3G-3-M-TPI1t-TEF1-R),
pM3-SynPgPPDS (using primer set 3G-2-M-TPI1t-TEF1-F/M-CYC1t-pEASY-R) and
pGAL7-URA3 (using primer sets X1-M-pEASY-r-t-F/GAL7-interg-2 and GAL7-interg-1/X2-M-pEASY-r-t-R),
were transformed into strain BY-GM, followed by selection on an SD-HIS-TRP-URA plate. The
strain was verified by PCR analysis.
3. PPD-PPT-GM-UGTs
Strains PPD-PPT-GM-UGTs were constructed by transforming the plasmids
pRS425-LEU2-TEF1-UGTs individually into strain PPD-PPT-GM, followed by selection on
SD-HIS-TRP-URA-LEU plates. The strains were verified by PCR analysis.
4. Rh2-GM
Strain Rh2-GM was constructed by integrating the PGM1, PGM2, UGP1 and Pn1-31 genes
into the EGH1 site of strain BY-PPT [9]. Seven DNA fragments, amplified from
pM2-PGM1 (using primer set 1-M-pEASY-PGK1-F/s-4G-1-M-ADHT-FBA1-R),
pM8-PGM2 (using primer set s-4G-4-M-ADH1t-FBA1-F/s-4G-4-M-TDH2t-TDH3-R),
pM4-UGP1 (using primer set S-4G-3M-TDH2t-TDH3-F/3G-3-M-TPI1t-TEF1-R),
pM3-Pn1-31 (using primer set 3G-2-M-TPI1t-TEF1-F/M-CYC1t-pEASY-R),
the S. cerevisiae genome (using primer sets EGH1-UP-F/EGH1-UP-R and EGH1-DOWN-F/EGH1-DOWN-R)
and p-TRP1 (using primer set EGH1-UP-Marker-50bp-F/Only-PGK1-R), were transformed into
strain BY-PPT, followed by selection on an SD-HIS-TRP-URA plate. The strain was verified by PCR
analysis.
5. Rh2-GM-UGTs
Strains Rh2-GM-UGTs were constructed by transforming the plasmids
pRS425-LEU2-TEF1-UGTs individually into strain Rh2-GM, followed by selection on
SD-HIS-TRP-URA-LEU plates. The strains were verified by PCR analysis.
6. Rd-GM
Strain Rd-GM was constructed by integrating the PGM1, Pn1-31, Pn3-31, PGM2, UGP1 and
Pn3-29 genes into the EGH1 site of strain BY-PPT [9]. Nine DNA fragments, amplified from
pM2-PGM1 (using primer set 1-M-pEASY-PGK1-F/s-7G-1-M-ADHT-PDC1-R),
pM9-Pn1-31 (using primer set s-7G-1-M-ADH1t-PDC1-F/3G-1-M-ADH2t-ENO2-R),
pM16-Pn3-31 (using primer set 3G-2-M-ADH2t-ENO2-F/6G-2-M-PDC1t-FBA1-R),
pM8-PGM2 (using primer set 6G-2-M-PDC1t-FBA1-F/s-4G-4-M-TDH2t-TDH3-R),
pM4-UGP1 (using primer set S-4G-3M-TDH2t-TDH3-F/3G-3-M-TPI1t-TEF1-R),
pM3-Pn3-29 (using primer set 3G-2-M-TPI1t-TEF1-F/M-CYC1t-pEASY-R),
the S. cerevisiae genome (using primer sets EGH1-UP-F/EGH1-UP-R and EGH1-DOWN-F/EGH1-DOWN-R)
and p-TRP1 (using primer set EGH1-UP-Marker-50bp-F/Only-PGK1-R), were transformed into
strain BY-PPT, followed by selection on an SD-HIS-TRP-URA plate. The strain was verified by PCR
analysis.
7. Rd-GM-UGTs
Strains Rd-GM-UGTs were constructed by transforming the plasmids
pRS425-LEU2-TEF1-UGTs individually into strain Rd-GM, followed by selection on
SD-HIS-TRP-URA-LEU plates. The strains were verified by PCR analysis.
8. Rg1-XM
Strain Rg1-XM was constructed by integrating the PGM1, SynAtUXS3, SynUGTPg101,
PGM2, UGP1 and SynAtUGD1 genes into the EGH1 site of strain BY-PPT [9]. Nine DNA
fragments, amplified from
pM2-PGM1 (using primer set 1-M-pEASY-PGK1-F/s-7G-1-M-ADHT-PDC1-R),
pM9-SynAtUXS3 (using primer set s-7G-1-M-ADH1t-PDC1-F/3G-1-M-ADH2t-ENO2-R),
pM16-SynUGTPg101 (using primer set 3G-2-M-ADH2t-ENO2-F/6G-2-M-PDC1t-FBA1-R),
pM8-PGM2 (using primer set 6G-2-M-PDC1t-FBA1-F/s-4G-4-M-TDH2t-TDH3-R),
pM4-UGP1 (using primer set S-4G-3M-TDH2t-TDH3-F/3G-3-M-TPI1t-TEF1-R),
pM3-SynAtUGD1 (using primer set 3G-2-M-TPI1t-TEF1-F/M-CYC1t-pEASY-R),
the S. cerevisiae genome (using primer sets EGH1-UP-F/EGH1-UP-R and EGH1-DOWN-F/EGH1-DOWN-R)
and p-TRP1 (using primer set EGH1-UP-Marker-50bp-F/Only-PGK1-R), were transformed into
strain BY-PPT, followed by selection on an SD-HIS-TRP-URA plate. The strain was verified by PCR
analysis.
9. Rg1-XM-UGTs
Strains Rg1-XM-UGTs were constructed by transforming the plasmids
pRS425-LEU2-TEF1-UGTs individually into strain Rg1-XM, followed by selection on
SD-HIS-TRP-URA-LEU plates. The strains were verified by PCR analysis.
10. YS01-CK
Strain YS01-CK was constructed by integrating the PGM1, PGM2, UGP1 and SynPn3-29
genes into the LPP site of strain YS01 (lab collection) using the CRISPR-Cas9 method. Four DNA
fragments, amplified from
pM2-PGM1 (using primer set LPP-50-PGK1-F/s-4G-1-M-ADHT-FBA1-R),
pM8-PGM2 (using primer set s-4G-4-M-ADH1t-FBA1-F/s-4G-4-M-TDH2t-TDH3-R),
pM4-UGP1 (using primer set S-4G-3M-TDH2t-TDH3-F/3G-3-M-TPI1t-TEF1-R) and
pRS425-LEU2-TEF1-SynPn3-29 (using primer set 3G-2-M-TPI1t-TEF1-F/LPP-50-CYC1t-R),
together with the gRNA plasmid pLPPgRNA, were transformed into strain YS01, followed by
selection on an SD-TRP-URA plate. The strain was verified by PCR analysis.
11. YS01-CK2
Strain YS01-CK2 was constructed by restoring the URA3, HIS3, TRP1 and LEU2 markers. The URA3,
HIS3 and TRP1 genes, amplified from the S. cerevisiae S288C genome using primer sets
M-URA-F/M-URA-R, M-HIS3-ORF-F/M-HIS3-ORF-R and Trp-up-F/Trp-down-R, respectively, were
transformed together with pRS425-LEU2-TEF1-SynPn3-29 into strain YS01-CK (after removal of the
Cas9 and gRNA plasmids), followed by selection on an SD-TRP-URA-HIS-LEU plate. The strain was
verified by PCR analysis.
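The selection media named in this section follow a simple pattern: each nutritional marker carried by a strain adds one dropout to the SD medium name. The Python sketch below illustrates that bookkeeping only. It assumes (this is not stated in the text) that strain BY-T3 already carries HIS3, which is why BY-GM is selected on SD-HIS-TRP after the TRP1 fragment is added. The function and lookup-table names are hypothetical, and dropouts are emitted in the fixed order HIS, TRP, URA, LEU, so the result for YS01-CK2 lists the same dropouts as SD-TRP-URA-HIS-LEU in a different order.

```python
# Marker gene -> dropout abbreviation used in the medium names in the text.
MARKER_TO_DROPOUT = {"HIS3": "HIS", "TRP1": "TRP", "URA3": "URA", "LEU2": "LEU"}

# Fixed order in which dropouts are written (an illustrative convention).
DROPOUT_ORDER = ["HIS", "TRP", "URA", "LEU"]


def selection_medium(markers):
    """Name the SD dropout medium that selects for all of the given markers."""
    unknown = [m for m in markers if m.upper() not in MARKER_TO_DROPOUT]
    if unknown:
        raise ValueError(f"unknown marker(s): {unknown}")
    dropouts = {MARKER_TO_DROPOUT[m.upper()] for m in markers}
    return "SD" + "".join(f"-{d}" for d in DROPOUT_ORDER if d in dropouts)


if __name__ == "__main__":
    by_gm = ["HIS3", "TRP1"]            # HIS3 in BY-T3 is an assumption
    ppd_ppt_gm = by_gm + ["URA3"]       # pGAL7-URA3 fragments add URA3
    with_plasmid = ppd_ppt_gm + ["LEU2"]  # pRS425-LEU2 plasmid adds LEU2
    for strain in (by_gm, ppd_ppt_gm, with_plasmid):
        print(selection_medium(strain))
```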
Chemical analysis
LC-MS analysis was performed as described previously [6, 8], with the following adaptations, for
the determination of CK, F1, Rh2, Rg3, F2, Rd, Rb1, Rg1 and R1. Acetone and methanol (1:1)
extracts (20 µL) were analysed by LC-MS using an Agilent 1200 HPLC system coupled to a
Bruker micrOTOF-II with an electrospray ionisation (ESI) interface. Data acquisition and
processing were performed with the micrOTOF control version 3.0/Data Analysis version 4.0
software.
For chromatographic separation of CK, F1, Rh2, Rg3, F2, Rd and Rb1, a Waters Symmetry
C18® column (250 mm × 4.6 mm, 5 μm) was used. The mobile phase consisted of 0.1% formic
acid and 10% methanol in water (A) and 0.1% formic acid in acetonitrile (B). The column
was equilibrated with 80% A before the sample was injected. Mobile phase A then decreased to 5%
within 65 minutes, was held at 5% for 5 minutes, returned to 80% within 5 minutes and was held
at 80% for 5 minutes. The solvent flow rate was 0.8 mL/min and the column temperature was
set at 30 °C. Optimised MS operating conditions were as follows: all spectra were obtained in the
positive ion mode over an m/z range of 100-1200; dry gas flow, 6.0 L/min; dry temperature,
180 °C; nebuliser pressure, 1 bar; and probe voltage, -4.5 kV. Crystallized CK, F1, Rh2, Rg3, F2, Rd
and Rb1 (all purchased from shyuanye, China) were used as standards for analysis.
For chromatographic separation of Rg1 and R1, an Ultimate® PG-C18 column (250
mm × 4.6 mm, 5 μm) was used. The mobile phase consisted of 0.1% formic acid in acetonitrile
(A) and 0.1% formic acid in water (B). The column was equilibrated with 19% A before the
sample was injected. Mobile phase A was held at 19% for 12 minutes, increased to 26% within 20
minutes and then to 90% within 2 minutes, was held at 90% for 10 minutes, returned to 19%
within 2 minutes and was held at 19% for 4 minutes. The solvent flow rate was 1 mL/min and the
column temperature was set at 25 °C. Optimised MS operating conditions were as follows: all
spectra were obtained in the positive ion mode over an m/z range of 100-1200; dry gas flow, 6.0
L/min; dry temperature, 180 °C; nebuliser pressure, 1 bar; and probe voltage, -4.5 kV.
Crystallized Rg1 and R1 (both purchased from shyuanye, China) were used as standards for
analysis.
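The two gradient programs described in this section can be written as time/composition breakpoints and interpolated, which makes the total run times (80 and 50 minutes) and the composition at any time point easy to check. This Python sketch rests on two assumptions that are not stated in the text: the quoted durations are consecutive segments starting at injection, and each ramp is linear. The names `GRADIENT_CK_TO_RB1`, `GRADIENT_RG1_R1` and `percent_a` are hypothetical.

```python
from bisect import bisect_right

# (time after injection in min, % mobile phase A) breakpoints.
# Assumes consecutive segments and linear ramps between breakpoints.
GRADIENT_CK_TO_RB1 = [(0, 80), (65, 5), (70, 5), (75, 80), (80, 80)]
GRADIENT_RG1_R1 = [(0, 19), (12, 19), (32, 26), (34, 90), (44, 90), (46, 19), (50, 19)]


def percent_a(gradient, t):
    """Return % mobile phase A at time t (min) by linear interpolation."""
    times = [point[0] for point in gradient]
    if t < times[0] or t > times[-1]:
        raise ValueError(f"time {t} min is outside the program")
    i = bisect_right(times, t) - 1
    if i == len(gradient) - 1:
        # t is exactly the final breakpoint.
        return float(gradient[-1][1])
    (t0, a0), (t1, a1) = gradient[i], gradient[i + 1]
    return a0 + (a1 - a0) * (t - t0) / (t1 - t0)


if __name__ == "__main__":
    for label, program in (("CK to Rb1", GRADIENT_CK_TO_RB1), ("Rg1/R1", GRADIENT_RG1_R1)):
        print(label, "run time:", program[-1][0], "min")
    print(percent_a(GRADIENT_CK_TO_RB1, 32.5))  # halfway down the first ramp
    print(percent_a(GRADIENT_RG1_R1, 22))       # halfway through the 19% to 26% ramp
```

Note that A is the aqueous phase in the first program and the acetonitrile phase in the second, so the two tables run in opposite directions in terms of organic content.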
Supplementary Methods References
1. Bolger AM, Lohse M, & Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina
sequence data. Bioinformatics 30(15):2114-2120.
2. Patel RK & Jain M (2012) NGS QC Toolkit: a toolkit for quality control of next
generation sequencing data. PLoS One 7(2):e30619.
3. Grabherr MG, et al. (2011) Full-length transcriptome assembly from RNA-Seq data
without a reference genome. Nat Biotechnol 29(7):644-652.
4. Langmead B & Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat
Methods 9(4):357-359.
5. Li B & Dewey CN (2011) RSEM: accurate transcript quantification from RNA-Seq data
with or without a reference genome. BMC Bioinformatics 12:323.
6. Dai Z, et al. (2013) Metabolic engineering of Saccharomyces cerevisiae for production of
ginsenosides. Metab Eng 20:146-156.
7. Lin T, et al. (2016) Construction of cell factories for production of lupeol in Saccharomyces
cerevisiae. China Journal of Chinese Materia Medica 41(6):1008-1015.
8. Dai Z, et al. (2014) Producing aglycons of ginsenosides in bakers' yeast. Sci Rep 4:3698.
9. Wang J, et al. (2019) Optimization of UDP-glucose supply module and production of
ginsenoside F1 in Saccharomyces cerevisiae. China Journal of Chinese Materia
Medica 44(21):4596-4604.
Figure S1: A synthetic biology platform used to identify UGT enzymes that catalyze the
glycosylation of triterpenoids.
A: High throughput strategy of A:
B: High throughput strategy of B:
Figure S2: Comparison of protopanaxadiol (PPD) production using the P. ginseng CPR and the
Vitis vinifera CPR.
We transformed codon-optimized versions of the dammarenediol-II synthase (SynPgDDS), protopanaxadiol synthase (SynPgPPDS) and cytochrome P450 reductase (PgCPR, KF486915) genes from P. ginseng into S. cerevisiae strain BY-T3, resulting in strain PgCPR. For the construction of strain VvCPR, the dammarenediol-II synthase (SynPgDDS) and protopanaxadiol synthase (SynPgPPDS) genes from P. ginseng, together with the Vitis vinifera CPR gene (VvCPR, XP_002270732), were transformed into S. cerevisiae strain BY-T3. Both strains were cultivated in SD-URA-TRP medium with 20 g/L glucose for 5 days. Three replicates were performed, and the error bars represent the standard deviation.
Supplementary Table
Table S1. The therapeutic potential of ginsenosides.
Ginsenosides | Activities/applications | Reference
F1 | Skin whitening; Antioxidants | Han, J. et al. [1]; Lee, E.H. et al. [2]
Rg1 | Antioxidants; Neuroprotection | Liu, Z.Q. et al. [3]; Liu, Z.J. et al. [4]
R1 | Antioxidants | Liu, Z.Q. et al. [3]
CK | Antidiabetic effects; Inhibition of various cancer cell growth | Yoon, S.H. et al. [5]; Musende, A.G. et al. [6]
Rh2 | Antidiabetic effects; Prooxidative; Inhibition of various cancer cell growth | Lee, W.K. et al. [7]; Liu, Z.Q. et al. [3]; Musende, A.G. et al. [6]
Rg3 | Prooxidative; Inhibition of various cancer cell growth; Neuroprotection | Liu, Z.Q. et al. [3]; Musende, A.G. et al. [6]; Tian, J. et al. [8]
Rd | Antioxidants; Neuroprotection | Liu, Z.Q. et al. [3]; Ye, R.D. et al. [9]
Rb1 | Antidiabetic effects; Antioxidants; Neuroprotection | Shang, W. et al. [10]; Liu, Z.Q. et al. [3]; Chen, X.C. et al. [11]
Re | Antidiabetic effects; Antioxidants; Neuroprotection | Xie, J.T. et al. [12]; Liu, Z.Q. et al. [3]; Chen, L.M. et al. [13]
Rb2 | Antidiabetic effects; Antioxidants | Yokozawa, T. et al. [14]; Chae, S. et al. [15]
Rh1 | Inhibition of various cancer cell growth | Lyu X, et al. [16]
F2 | Inhibition of various cancer cell growth | Shin JY. et al. [17]
Supplementary Table S1 references:
1. Han, J.; Lee, E.; Kim, E.; Yeom, M.H.; Kwon, O.; Yoon, T.H.; Lee, T.R.; Kim, K. Role of epidermal γδ
T-cell-derived interleukin 13 in the skin-whitening effect of Ginsenoside F1. Exp. Dermatol. 2014, 23, 860–862.
2. Lee, E.H.; Cho, S.Y.; Kim, S.J.; Shin, E.S.; Chang, H.K.; Kim, D.H.; Yeom, M.H.; Woe, K.S.; Lee, J.; Sim,
Y.C.; et al. Ginsenoside F1 protects human HaCaT keratinocytes from ultraviolet-B-induced apoptosis by
maintaining constant levels of Bcl-2. J. Investig. Dermatol. 2003, 121, 607–613.
3. Liu, Z.Q., Luo, X.Y., Liu, G.Z., Chen, Y.P., Wang, Z.C., Sun, Y.X., 2003. In vitro study of the relationship
between the structure of ginsenoside and its antioxidative or prooxidative activity in free radical induced
hemolysis of human erythrocytes. J. Agric. Food Chem. 51, 2555–2558.
4. Liu, Z.J., Zhao, M., Zhang, Y., Xue, J.F., Chen, N.H., 2010. Ginsenoside Rg1 promotes glutamate release
via a calcium/calmodulin-dependent protein kinase II-dependent signaling pathway. Brain Res. 1333, 1–8.
5. Yoon, S.H., Han, E.J., Sung, J.H., Chung, S.H., 2007. Anti-diabetic effects of compound K versus
metformin versus compound K-metformin combination therapy in diabetic db/db mice. Biol. Pharm. Bull. 30,
2196–2200.
6. Musende, A.G., Eberding, A., Wood, C., Adomat, H., Fazli, L., Hurtado-Coll, A., Jia, W., Bally, M.B.,
Guns, E.T., 2009. Pre-clinical evaluation of Rh2 in PC-3 human xenograft model for prostate cancer in vivo:
formulation, pharmacokinetics, biodistribution and efficacy. Cancer Chemother. Pharmacol. 64, 1085–1095.
7. Lee, W.K., Kao, S.T., Liu, I.M., Cheng, J.T., 2006. Increase of insulin secretion by ginsenoside Rh2 to
lower plasma glucose in Wistar rats. Clin. Exp. Pharmacol. Physiol. 33, 27–32.
8. Tian, J., Fu, F., Geng, M., Jiang, M., Yang, J., Jiang, W., Wang, C., Liu, K., 2005. Neuroprotective effect of
20(S)-ginsenoside Rg3 on cerebral ischemia in rats. Neurosci. Lett. 374, 92–97.
9. Ye, R.D., Li, N.L., Han, J.L., Kong, X.W., Cao, R., Rao, Z.R., Zhao, G., 2009. Neuroprotective effects of
ginsenoside Rd against oxygen–glucose deprivation in cultured hippocampal neurons. Neurosci. Res. 64, 306–
310.
10. Shang, W., Yang, Y., Jiang, B., Jin, H., Zhou, L., Liu, S., Chen, M., 2007. Ginsenoside Rb1 promotes
adipogenesis in 3T3-L1 cells by enhancing PPARgamma2 and C/EBPalpha gene expression. Life Sci. 80, 618–
625.
11. Chen, X.C., Huang, T.W., Zhang, J., Song, J.Q., Chen, L.M., Zhu, Y.G., 2008. Involvement of calpain and
p25 of CDK5 pathway in ginsenoside Rb1’s attenuation of beta-amyloid peptide (25-35)-induced tau
hyperphosphorylation in cortical neurons. Brain Res. 1200, 99–106.
12. Xie, J.T., Mehendale, S.R., Li, X., Quigg, R., Wang, X., Wang, C.Z., Wu, J.A., Aung, H.H., Rue, P.A., Bell,
G.I., Yuan, C.S., 2005. Anti-diabetic effect of ginsenoside Re in ob/ob mice. Biochim. Biophys. Acta 1740, 319–
325.
13. Chen, L.M., Zhou, X.M., Cao, Y.L., Hu, W.X., 2008. Neuroprotection of ginsenoside Re in cerebral
ischemia-reperfusion injury in rats. J. Asian Nat. Prod. Res. 10, 439–445.
14. Yokozawa, T., Yasui, T., Oura, H., 1993. Stimulation of RNA-polymerase activity by ginsenoside-Rb2 in
diabetic rats. Phytother. Res. 7, 240–243.
15. Chae, S., Kang, K.A., Youn, U., Park, J.S., Hyun, J.W., 2010. A comparative study of the potential
antioxidant activities of ginsenosides. J. Food Biochem. 34, 31–43.
16. Lyu X, Xu X, Song A, Guo J, Zhang Y, Zhang Y. 2019. Ginsenoside Rh1 inhibits colorectal cancer cell
migration and invasion in vitro and tumor growth in vivo. Oncol Lett. 18(4):4160-4166.
17. Shin JY, Lee JM, Shin HS, Park SY, Yang JE, Cho SK, Yi TH. 2012. Anti-cancer effect of ginsenoside
F2 against glioblastoma multiforme in xenograft model in SD rats. J Ginseng Res. 36(1):86-92.
Table S2. Primers used in this work
Primer name    Target gene    Sequence (5' to 3')
Pac-pPDC1 pPDC1 GCGTTAATTAACATGCGACTGGGTGAGCATATGTTC
pPDC1-SexA GCGACCWGGTTTTGATTGATTTGACTGTGTTATTT
ASC-tADH2 tADH2 GCGGCGCGCCGCGGATCTCTTATGTCTTTACGATT
tADH2-Pme1 GCGGTTTAAACTAGAATTATATAACTTGATGAGATG
Pac-pENO2 pENO2 GCGTTAATTAAAATCCTACTCTTGCCGTTGCCATCC
pENO2-SexA GCGACCWGGTTATTATTGTATGTTATAGTATTAGTTG
Sac11-TEF1 pTEF1 GCGCCGCGGAGTGATCCCCCACACACCATAGCTT
TEF1-SexA1 TGGCGACCWGGTTTTGTAATTAAAACTTAGATTAGA
TEF1-Pac1 pTEF1(Pac1) TGGCGTTAATTAATTTGTAATTAAAACTTAGATTAGA
Asc1-CYC1t CYC1t GCGGCGCGCCCCGCTGATCCTAGAGGGCCGCATCA
CYC1t-Sac11 GCGCCGCGGGCGCGTTGGCCGATTCATTAATGCA
SexA1-Pn1-2 Pn1-2 GCGACCTGGTATGTTTACAACAAAAATGCCAACAC
Pn1-2-Asc1 GCGGCGCGCCTTATTGTTTAGATTTCACACCCATT
SexA1-Pn1-3 Pn1-3 GCGACCTGGTATGGATTCTCCCTTGTCATCCTCAC
Pn1-3-Asc1 GCGGCGCGCCTTATGCTTTCTCCCTTTTCTCTCTG
SexA1-Pn1-4 Pn1-4 GCGACCTGGTATGGATTCTCCCTTGTCATCCTCAC
Pn1-4-Asc1 GCGGCGCGCCTTATGCTTTCTCCCTTTTCTCTCTG
SexA1-Pn1-6 Pn1-6 GCGACCTGGTATGGAAAGAAAAGCAATGGAATATTC
Pn1-6-Asc1 GCGGCGCGCCTTACTTGATAAAATCAACTAAAGCATC
SexA1-Pn1-12 Pn1-12 GCGACCTGGTATGGAGCAAAATCAGAAAATGGCTAG
Pn1-12-Asc1 GCGGCGCGCCTTATCTCCTAGAGAAAGATAAAATGC
SexA1-Pn1-14 Pn1-14 GCGACCTGGTATGGCCGCCAATGACAAATTACAC
Pn1-14-Asc1 GCGGCGCGCCTCATTTTGTGAGGATTTTATGACTTTC
SexA1-Pn1-15 Pn1-15 GCGACCTGGTATGGAGGTTGTGGTTGATGAAG
Pn1-15-Asc1 GCGGCGCGCCCTATTTTTTCCACGACTCAACAACAC
SexA1-Pn1-16 Pn1-16 GCGACCTGGTATGGACTCACAAGTCTCATCACGTCC
Pn1-16-Asc1 GCGGCGCGCCTTACTGATCTGATCGTTCCTCTCTC
SexA1-Pn1-17 Pn1-17 GCGACCTGGTATGAAACAACTAAGAGAGTTGCTAAC
Pn1-17-Asc1 GCGGCGCGCCTTATTTCATCCAATCAGTAATATTGC
SexA1-Pn1-20 Pn1-20 GCGACCTGGTATGGCAGTAGCCGGCGCTGGTGCAC
Pn1-20-Asc1 GCGGCGCGCCTCATTTTATTATATGTTTTTTGGGG
SexA1-Pn1-21 Pn1-21 GCGACCTGGTATGAATGCACCAAGTTTTCACATAGC
Pn1-21-Asc1 GCGGCGCGCCTCATCCCAATAACTGTTGCAGTTTC
SexA1-Pn1-23 Pn1-23 GCGACCTGGTATGAATGCACCAAGTTTTCACATAG
Pn1-23-Asc1 GCGGCGCGCCTCATCCCAATAACTGTTGCAGTTTC
SexA1-Pn1-24 Pn1-24 GCGACCTGGTATGGAGGTTGTGGTTGATGAAGCC
Pn1-24-Asc1 GCGGCGCGCCCTATTTTTTCCACGACTCAACAAC
SexA1-Pn1-25 Pn1-25 GCGACCTGGTATGGATTCACAATCAAACCAACTTCAC
Pn1-25-Asc1 GCGGCGCGCCTTAATTAGAGCATTTACGTTCGCTTGG
SexA1-Pn1-28 Pn1-28 GCGACCTGGTATGGGATTAAATTCAGCTAATAAGG
Pn1-28-Asc1 GCGGCGCGCCTCACATATTGCAGTGGAGAGCTTCC
SexA1-Pn1-29 Pn1-29 GCGACCTGGTATGGGTAATCCAAAACCTCATGTAG
Pn1-29-Asc1 GCGGCGCGCCTCAACCTTCATAATGAAGAGCCTCC
SexA1-Pn1-30 Pn1-30 GCGACCTGGTATGGCGAAGAATCAGCAGAAAGTTC
Pn1-30-Asc1 GCGGCGCGCCCTATCTGCCAAGAATTTTGATTTTG
SexA1-Pn1-31 Pn1-31 GCGACCTGGTATGGACAGAGAAATGTTGAGCAAAAC
Pn1-31-Asc1 GCGGCGCGCCTCAGGAGGACACAAGCTTTGAAATG
SexA1-Pn1-34 Pn1-34 GCGACCTGGTATGTCCGGTTCCTCCGGTAAAGGCG
Pn1-34-Asc1 GCGGCGCGCCCTATCGATCATTTTTGAGTTGAAGTTG
SexA1-Pn1-37 Pn1-37 GCGACCTGGTATGGAGAAGAATCACTGTGACAACC
Pn1-37-Asc1 GCGGCGCGCCTTATCTACGAATGTGAGCAATGAAGTC
SexA1-Pn1-39 Pn1-39 GCGACCTGGTATGGAAAATAACCACGTTCTTCATG
Pn1-39-Asc1 GCGGCGCGCCTTAACTCATCAATTGGGATTTCCTCA
SexA1-Pn1-45 Pn1-45 GCGACCTGGTATGGAGAAGCAAACATTGGCAAAC
Pn1-45-Asc1 GCGGCGCGCCTCAAGTGGTGGTGGATAAAACCTTTG
SexA1-Pn1-47 Pn1-47 GCGACCTGGTATGGCAGAAACTCAGAAGCCACATG
Pn1-47-Asc1 GCGGCGCGCCTTACTTCATGGGGGAGAGAATCAC
SexA1-Pn1-48 Pn1-48 GCGACCTGGTATGTCAGATTTCGGTGACCCAAATTG
Pn1-48-Asc1 GCGGCGCGCCTCAAGTGACCCGGGTATTCTTCCAC
SexA1-Pn1-49 Pn1-49 GCGACCTGGTATGGGTACAAGGCAATCCCAGCAAA
Pn1-49-Asc1 GCGGCGCGCCTTAATTACCCATGACATCCTCAATA
SexA1-Pn1-50 Pn1-50 GCGACCTGGTATGGGAGCAGAGCTCATCTTCATCC
Pn1-50-Asc1 GCGGCGCGCCCTAACCGACCTGATTATATTTTGCG
SexA1-Pn1-51 Pn1-51 GCGACCTGGTATGGGAGCAGAGCTCATCTTCATCC
Pn1-51-Asc1 GCGGCGCGCCTCATTGGCAGCAGTGAATCGTCACAAC
SexA1-Pn1-52 Pn1-52 GCGACCTGGTATGACAACAGTTGAGGAAGTATTTGTG
Pn1-52-Asc1 GCGGCGCGCCTCATAATAACACCCCCCTAAAGGC
SexA1-Pn1-53 Pn1-53 GCGACCTGGTATGACAACAGTTGAGGAAGTATTTG
Pn1-53-Asc1 GCGGCGCGCCTCATAATAACACCCCCCTAAAGGC
SexA1-Pn1-54 Pn1-54 GCGACCTGGTATGGAGATTAACCGGCATAGGAAGC
Pn1-54-Asc1 GCGGCGCGCCTTATTTTGTATGATTTTCAAGATAC
SexA1-Pn1-55 Pn1-55 GCGACCTGGTATGGAACATGAAGATCAAAAGCC
Pn1-55-Asc1 GCGGCGCGCCTTAATTATTTGCCGTTGGGCAAAC
SexA1-Pn1-57 Pn1-57 GCGACCTGGTATGGAGTCTCCGAATAGACCTCATG
Pn1-57-Asc1 GCGGCGCGCCTTAAGGTTTGCTAATATTTTTTCCAAC
SexA1-Pn1-58 Pn1-58 GCGACCTGGTATGGCAACTGAAGACCCTAAACTCC
Pn1-58-Asc1 GCGGCGCGCCTTATCCATTTTTTGATTTCTCAAAAG
SexA1-Pn1-61 Pn1-61 GCGACCTGGTATGAATACCCTACTCTTGTTTCATA
Pn1-61-Asc1 GCGGCGCGCCTTACACTAAATCCTTGATCTTACTC
SexA1-Pn1-63 Pn1-63 GCGACCTGGTATGGAGAAATTGGAACTGATTTTCAT
Pn1-63-Asc1 GCGGCGCGCCTCACACTTTCATATCCTCAATTAAG
SexA1-Pn1-66 Pn1-66 CGACCTGGTATGGATTATCAAGAAAAAATTCTCCCTC
Pn1-66-Asc1 GCGGCGCGCCTTAGGTATGTGGTGTTGACAAGGTC
SexA1-Pn2-7 Pn2-7 GCGACCTGGTATGGCAACCCAAAAATGTTTCCGTG
Pn2-7-Asc1 GCGGCGCGCCTCATACAACAATCCGTTTTTTCTTG
SexA1-Pn2-9 Pn2-9 GCGACCTGGTATGGATACCAACTCAAATAGCTTC
Pn2-9-Asc1 GCGGCGCGCCTTAATTAATATCTCTATACTTGCCC
SexA1-Pn2-19 Pn2-19 GCGACCTGGTATGGCCACTGAGGAAGCTAGTTTG
Pn2-19-Asc1 GCGGCGCGCCTTACTCATCCAAATTCTTGTTGCATA
SexA1-Pn2-20 Pn2-20 GCGACCTGGTATGAGAGTACTTCCAATTCTTCTAC
Pn2-20-Asc1 GCGGCGCGCCCTAAGTACATGCATGCTTAAGTTCA
SexA1-Pn2-21 Pn2-21 GCGACCTGGTATGGATACCGAAAAGCTTCATGTAG
Pn2-21-Asc1 GCGGCGCGCCTTAAATTAATTTTTTTAACCTCCTC
Pac1-Pn2-28 Pn2-28 GCGTTAATTAAATGGAAAATCAAGGACTACGA
Pn2-28-Asc1 GCGGCGCGCCTCACTTTGACAATTGGAATGACGAG
SexA1-Pn2-30 Pn2-30 GCGACCTGGTATGTACGAGTCGACGACAAAGATG
Pn2-30-Asc1 GCGGCGCGCCCTATAACACATCTCTCACTAACTTGTC
SexA1-Pn2-31 Pn2-31 GCGACCTGGTATGTCAAAAATGGAACAAACACCC
Pn2-31-Asc1 GCGGCGCGCCTCATGAGGCTATAGGTGCTATTTTA
SexA1-Pn2-33 Pn2-33 GCGACCTGGTATGGAAAATCGACAAATTCTCCCTC
Pn2-33-Asc1 GCGGCGCGCCTTACGCCTGTGATGTTGACAAG
SexA1-Pn2-35 Pn2-35 GCGACCTGGTATGGCTCAACAAACAATCCCAC
Pn2-35-Asc1 GCGGCGCGCCCTAGGGTGTGATGCCACCCAAAG
SexA1-Pn2-47 Pn2-47 GCGACCTGGTATGGGAGCGGAGCTCATCTTCATC
Pn2-47-Asc1 GCGGCGCGCCTTAATTACCCATGACATCCTCAATA
SexA1-Pn3-1 Pn3-1 GCGACCTGGTATGAAGAACCCAGAGCTCATATTC
Pn3-1-Asc1 GCGGCGCGCCTCAAATAATTTCCTCCATTATACTTC
SexA1-Pn3-4 Pn3-4 GCGACCTGGTATGCTCCCCTGGTTAGCTTTTGGCC
Pn3-4-Asc1 GCGGCGCGCCCTATTTTTCAGAATTTTTTCCGGTG
SexA1-Pn3-5 Pn3-5 GCGACCTGGTATGGAGAAGCAAGGAGAAAAGAA
Pn3-5-Asc1 GCGGCGCGCCTTATAATGACATAATATAACTAACC
SexA1-Pn3-6 Pn3-6 GCGACCTGGTATGGCTGAACAAACAATCCCACCTC
Pn3-6-Asc1 GCGGCGCGCCCTAGGCTCTGATGCCACCCACAGTC
SexA1-Pn3-7 Pn3-7 GCGACCTGGTATGAAGAGGGCCAAGATAATAATGG
Pn3-7-Asc1 GCGGCGCGCCTCACGAATTAAGATTCTGTTGAGATAC
SexA1-Pn3-8 Pn3-8 GCGACCTGGTATGGAAATGAAGAGCTATGGAGCTC
Pn3-8-Asc1 GCGGCGCGCCCTATATATGTGAAGAGCTTCTTGTAC
SexA1-Pn3-9 Pn3-9 GCGACCTGGTATGGCCGCCGGCAATGATAACTTAC
Pn3-9-Asc1 GCGGCGCGCCTTATTTTGTATGATTTTCAAGATAC
SexA1-Pn3-11 Pn3-11 GCGACCTGGTATGGGGTCCTTCGGAAAAGCAGAA
Pn3-11-Asc1 GCGGCGCGCCCTACTCTGAAATATTATTAGTCAAG
SexA1-Pn3-13 Pn3-13 GCGACCTGGTATGGATATCGAGAAAGGTAGAATC
Pn3-13-Asc1 GCGGCGCGCCTTAATATTGTGCGTCTTTCTTCATC
SexA1-Pn3-15 Pn3-15 GCGACCTGGTATGGGAGCGGAGCTCATCTTCATCC
Pn3-15-Asc1 GCGGCGCGCCTTAATTACCCATGACATCCTCAATAA
Pac1-Pn3-19 Pn3-19 GCGTTAATTAAATGAGGAATATGAAGACTGCAGAAC
Pn3-19-Asc1 GCGGCGCGCCTTACATAATAATTTCCTCAATTAATC
SexA1-Pn3-21 Pn3-21 GCGACCTGGTATGTTCCCTTGGCTTGCCTTTGGTC
Pn3-21-Asc1 GCGGCGCGCCTCATATTAATTCCCTCTTGATCTTTC
SexA1-Pn3-24 Pn3-24 GCGACCTGGTATGGGTGGTCAGCTTCACGTCTTC
Pn3-24-Asc1 GCGGCGCGCCTCAAGAATGATAAAAACTCAATTC
Pac1-Pn3-25 Pn3-25 GCGTTAATTAAATGTCCCAAAGTCCAGCAATGTCC
Pn3-25-Asc1 GCGGCGCGCCTTATAACTCTTCTTGATTGTTCTTC
SexA1-Pn3-26 Pn3-26 GCGACCTGGTATGTTCCCTTGGCTTGCCTTTGGTC
Pn3-26-Asc1 GCGGCGCGCCTCATATTAATTCCCTCTTGATCTTTC
SexA1-Pn3-27 Pn3-27 GCGACCTGGTATGGAAAGAAAAGCAATGGAATAT
Pn3-27-Asc1 GCGGCGCGCCTTACTTGATAAAATCAACTAAAGC
SexA1-Pn3-28 Pn3-28 GCGACCTGGTATGGGCTCCCTTCCTAAAGTAAC
Pn3-28-Asc1 GCGGCGCGCCCTACTTTGCTAACAACACCTGATCC
Pac1-Pn3-29 Pn3-29 GCGTTAATTAAATGAAGTCAGAATTGATATTCTTGC
Pn3-29-Asc1 GCGGCGCGCCTTACATAATTTCCTCAAATAGCTTC
SexA1-Pn3-31 Pn3-31 GCGACCTGGTATGGATAACCAAAAAGGTAGAATC
Pn3-31-Asc1 GCGGCGCGCCCTATTGTGCATCTTTCTTCTTCTTAC
SexA1-Pn3-32 Pn3-32 GCGACCTGGTATGGATATCGAGAAAGGTAGAATC
Pn3-32-Asc1 GCGGCGCGCCTTAATATTGTGCGTCTTTCTTCATC
SexA1-Pn3-32-i5 Pn3-32-i5 GCGACCTGGTATGGATAACCAAGAAGCTAGAATCAG
Pn3-32-i5-Asc1 GCGGCGCGCCCTATTGTGCATCTTTCTTCTTCTTAC
SexA1-Pn3-33 Pn3-33 GCGACCTGGTATGGGTACAAATCCAGCTAAAAAG
Pn3-33-Asc1 GCGGCGCGCCTCATCCTGCAAGGAGAACTTCCTTA
SexA1-Pn3-35 Pn3-35 GCGACCTGGTATGAAAAGCAAGAATCACTTCCTC
Pn3-35-Asc1 GCGGCGCGCCTTACATTTCCAAAAACTGTTTTAC
SexA1-Pn3-36 Pn3-36 GCGACCTGGTATGGCATCCGATCTCTATGAGCAC
Pn3-36-Asc1 GCGGCGCGCCTTAAATTAACTTCTGCATCCTAGTG
SexA1-Pn3-38 Pn3-38 GCGACCTGGTATGAATCCGGTCGCCGCCGGTGAA
Pn3-38-Asc1 GCGGCGCGCCTCAGCATTGTAAAATATTCCTTGTA
SexA1-Pn3-39 Pn3-39 GCGACCTGGTATGGAAAACACACCCCATATAGCC
Pn3-39-Asc1 GCGGCGCGCCTTAGTTTTTATTCCACTTGCCAGCC
SexA1-Pn3-40 Pn3-40 GCGACCTGGTATGGATTCACCGTCAGACCAGCTTC
Pn3-40-Asc1 GCGGCGCGCCTTAGCTTTTTTTTTTCCTTTGTTTC
SexA1-Pn3-41 Pn3-41 GCGACCTGGTATGGAAGAGAGAAAAGAAAAGATG
Pn3-41-Asc1 GCGGCGCGCCTCAAGTATTCATTCTGCATTTCTGC
SexA1-Pn3-42 Pn3-42 GCGACCTGGTATGGAGAAAAAGGACTCAACTCG
Pn3-42-Asc1 GCGGCGCGCCTCATCTCTCCACACCCATCAATTTA
SexA1-Pn3-43 Pn3-43 GCGACCTGGTATGGGGAGTTCAGCGGAGACACATG
Pn3-43-Asc1 GCGGCGCGCCCTAAGTTGTAATCACTTCTAGTAGG
SexA1-Pn3-44 Pn3-44 GCGACCTGGTATGGAAGAATTTCCAATTCCAGTAG
Pn3-44-Asc1 GCGGCGCGCCTTAATCATTCCTCGGCTGGAGAGCTTC
SexA1-Pn3-47 Pn3-47 GCGACCTGGTATGATGGGCCACGCAACACAGGCC
Pn3-47-Asc1 GCGGCGCGCCTCACATTTTCAGATCCTCAATTAAG
SexA1-Pn3-54 Pn3-54 GCGACCTGGTATGAGCTCTGAAGGTCTGGTTCATG
Pn3-54-Asc1 GCGGCGCGCCTCATTGAGTAATATGCATGGTTCTC
SexA1-Pn3-55 Pn3-55 GCGACCTGGTATGGCTTCGGAAGGAAAGAAGAAAG
Pn3-55-Asc1 GCGGCGCGCCTTAAGAATATCCTTTGATCTCATCA
SexA1-Pn3-60 Pn3-60 GCGACCTGGTATGGATTATGTTCCGGCAGCTAATC
Pn3-60-Asc1 GCGGCGCGCCTTATGATTTGCGTAGTAACACCTCTTCC
SexA1-Pn3-67 Pn3-67 GCGACCTGGTATGGATCAACATACACTCCCACCTC
Pn3-67-Asc1 GCGGCGCGCCTTATTTATTCATTGATGGTGGAGG
SexA1-Pn3-68 Pn3-68 GCGACCTGGTATGGAAAGAAAAGCAATGGAATAT
Pn3-68-Asc1 GCGGCGCGCCTTACTTGATAAAATCAACTAAAGCA
SexA1-Pn3-74 Pn3-74 GCGACCTGGTATGGAAGAGAGAAAAGAAAAGATGAT
Pn3-74-Asc1 GCGGCGCGCCCTATTCATCCTCCCCTCTTACCTTG
SexA1-Pn3-75 Pn3-75 GCGACCTGGTATGGATACCAATGAAAAAACCAGAA
Pn3-75-Asc1 GCGGCGCGCCTTAGGACCATTCCTTCCCCTGGC
SexA1-Pn3-76 Pn3-76 GCGACCTGGTATGGAGAATGAGAAAACTTATAAAGC
Pn3-76-Asc1 GCGGCGCGCCTTAGAGTGCCAAAATCCGAGAAATA
SexA1-Pn3-79 Pn3-79 GCGACCTGGTATGGAAAATCGACAAATTCTCCCTCCTC
Pn3-79-Asc1 GCGGCGCGCCTTACGCCTGTGATGTTGACAAGTTTATC
SexA1-Pn3-87 Pn3-87 GCGACCTGGTATGGCTTCCCAAATTGAAAAGCTC
Pn3-87-Asc1 GCGGCGCGCCTCATTGACCAGTTATTTGTTCAATG
SexA1-Pn3-88 Pn3-88 GCGACCTGGTATGGCTATTCTCCAAACCCAAGAC
Pn3-88-Asc1 GCGGCGCGCCTCATTTCAATTTCAGTTGTTCCACGAC
SexA1-Pn3-89 Pn3-89 GCGACCTGGTATGTTACCCATTCTTGACTTGACG
Pn3-89-Asc1 GCGGCGCGCCTCAATCATTTTTGAGTTGAAGTTGAGC
SexA1-Pn3-91 Pn3-91 GCGACCTGGTATGAAGAAATCTGCAGAGGTTGTAT
Pn3-91-Asc1 GCGGCGCGCCTCAAATATCTAACAAAAGTTGTTG
SexA1-Pn3-97 Pn3-97 GCGACCTGGTATGGAGCAAAATCAGAAAATGGCTAG
Pn3-97-Asc1 GCGGCGCGCCTTATCTCCTAGAGAAAGATAAAATGC
SexA1-Pn3-98 Pn3-98 GCGACCTGGTATGGGTGGTCAGCTTCACGTCTTC
Pn3-98-Asc1 GCGGCGCGCCATGGGTGGTCAGCTTCACGTCTTC
SexA1-Pn3-103 Pn3-103 GCGACCTGGTATGGATTCTCCCTTGTCATCCTCAC
Pn3-103-Asc1 GCGGCGCGCCTTATGCTTTCTCCCTTTTCTCTCTGAG
Pac1-Pn3-106 Pn3-106 GCGTTAATTAAATGCTGCTGCTGCTACTACTACTACT
Pn3-106-Asc1 GCGGCGCGCCTTATCTAGATATGTGTACAATAAATG
SexA1-Pn3-107 Pn3-107 GCGACCTGGTATGTCTACCTCCATCTCTATATCTTCC
Pn3-107-Asc1 GCGGCGCGCCATGTCTACCTCCATCTCTATATCTTCC
BSP-TRP1-F TRP1 TGGCGTCCGGATACAATCTTGATCCGGAGCT
BSP-TRP1-R TGGCGTCCGGACACAAACAATACTTAAATAAATAC
43803-up LPP gRNA GATCATTTATCTTTCACTGC
43803-LPP1-down CGCAGTGAAAGATAAATGATCAGCATAAATGCTGCCCTAACGTTTTAGAGCTAGAAATAGCAAG
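The cloning primers listed above follow a visible pattern: a short 5' tail carrying a restriction site, followed by the start of the coding sequence (forward primers) or the reverse complement of a stop codon (reverse primers). The sketch below is one way to screen such a list for entries that break the pattern. It assumes that the labels SexA1, Asc1 and Pac1 in the primer names refer to the enzymes SexAI (ACCWGGT), AscI (GGCGCGCC) and PacI (TTAATTAA); the function names `find_site` and `check_primer` are hypothetical and not part of the original work.

```python
import re

# Recognition sequences in IUPAC notation (W = A or T).
# Assumption: SexA1/Asc1/Pac1 in the primer names denote these enzymes.
SITES = {"SexAI": "ACCWGGT", "AscI": "GGCGCGCC", "PacI": "TTAATTAA"}
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "[AT]"}
# Reverse complements of the stop codons TAA, TAG and TGA.
STOP_RC = {"TTA", "CTA", "TCA"}


def find_site(primer, window=12):
    """Return (site name, index just past the site) for the first site in the 5' window."""
    head = primer.upper()[:window]
    for name, site in SITES.items():
        hit = re.search("".join(IUPAC[base] for base in site), head)
        if hit:
            return name, hit.end()
    return None, 0


def check_primer(seq):
    """Return (site, three bases after the site, whether they fit the expected pattern)."""
    site, end = find_site(seq)
    codon = seq.upper()[end:end + 3]
    if site == "AscI":                # reverse primers: expect a reverse-complemented stop codon
        ok = codon in STOP_RC
    elif site in ("SexAI", "PacI"):   # forward primers: expect the ATG start codon
        ok = codon == "ATG"
    else:
        ok = False
    return site, codon, ok


# Four entries copied from the primer table above.
primers = {
    "SexA1-Pn2-35": "GCGACCTGGTATGGCTCAACAAACAATCCCAC",
    "Pn2-35-Asc1": "GCGGCGCGCCCTAGGGTGTGATGCCACCCAAAG",
    "Pac1-Pn3-25": "GCGTTAATTAAATGTCCCAAAGTCCAGCAATGTCC",
    "Pn3-98-Asc1": "GCGGCGCGCCATGGGTGGTCAGCTTCACGTCTTC",
}
for name, seq in primers.items():
    print(name, check_primer(seq))
```

A `False` flag only means that the bases after the site do not match the assumed pattern (here, Pn3-98-Asc1 reads ATG after the AscI site); it does not by itself establish whether the listed sequence is in error.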
Table S3. Plasmids used in this work
Name Description Source
pRS425 2 micron, LEU2 Sikorski et al.1
pEASY-Blunt Simple Cloning vector for blunt ligation, Amp, Km TransGen Biotech
pUC57 Cloning vector with multiple cloning sites, Amp GenScript
p-SynAtUGD1 Cloning SynAtUGD1 gene into pUC57 This study
p-SynAtUXS3 Cloning SynAtUXS3 gene into pUC57 This study
p-SynPn3-29 Cloning SynPn3-29 gene into pUC57 This study
p-SynUGTPg101 Cloning SynUGTPg101 gene into pUC57 This study
pM3-SynAtUGD1 Containing PTEF1-SynAtUGD1-TCYC1 cassette This study
pM9-SynAtUXS3 Containing PPDC1-SynAtUXS3-TADH2 cassette This study
pRS425-LEU2-TEF1-SynPn3-29 Containing PTEF1-SynPn3-29-TCYC1 cassette This study
p-TRP1 Cloning TRP1 gene into pEASY-Blunt Simple This study
pGAL7-URA3 Cloning GAL7 and URA3 genes into pEASY-Blunt Simple This study
pM2-SynPgDDS Containing PPGK1-SynPgDDS-TADH1 cassette Dai et al.2
pM8-SynPgPPTS Containing PFBA1-SynPgPPTS-TTDH2 cassette Dai et al.2
pM4-AtCPR1 Containing PTDH3-AtCPR1-TTPI1 cassette Dai et al.2
pM3-SynPgPPDS Containing PTEF1-SynPgPPDS-TCYC1 cassette Dai et al.2
pM2-PGM1 Containing PPGK1-PGM1-TADH1 cassette Dai et al.2
pM8-PGM2 Containing PFBA1-PGM2-TTDH2 cassette Dai et al.2
pM4-UGP1 Containing PTDH3-UGP1-TTPI1 cassette Dai et al.2
pM3-Pn1-31 Containing PTEF1-Pn1-31-TCYC1 cassette This study
pM9-Pn1-31 Containing PPDC1-Pn1-31-TADH2 cassette This study
pM13-Pn3-29 Containing PTEF1-Pn3-29-TCYC1 cassette This study
pM16-SynUGTPg101 Containing PENO2-SynUGTPg101-TPDC1 cassette This study
pM16-Pn3-31 Containing PENO2-Pn3-31-TPDC1 cassette This study
pRS425-LEU2-TEF1-UGTs (The UGT gene library) Containing PTEF1-Pn1-2, Pn1-3, Pn1-4, Pn1-6, Pn1-12, Pn1-14, Pn1-15, Pn1-16, Pn1-17,
Pn1-20, Pn1-21, Pn1-23, Pn1-24, Pn1-25, Pn1-28, Pn1-29, Pn1-30, Pn1-31, Pn1-34, Pn1-37,
Pn1-39, Pn1-45, Pn1-46, Pn1-47, Pn1-48, Pn1-49, Pn1-50, Pn1-51, Pn1-52, Pn1-53,
Pn1-54, Pn1-55, Pn1-57, Pn1-58, Pn1-61, Pn1-63, Pn1-66, Pn2-7, Pn2-9, Pn2-19, Pn2-20,
Pn2-21, Pn2-30, Pn2-31, Pn2-33, Pn2-35, Pn2-47, Pn3-1, Pn3-4, Pn3-5, Pn3-6, Pn3-7, Pn3-8,
Pn3-9, Pn3-11, Pn3-13, Pn3-15, Pn3-21, Pn3-24, Pn3-26, Pn3-27, Pn3-28, Pn3-31, Pn3-32,
Pn3-32-i5, Pn3-33, Pn3-35, Pn3-36, Pn3-38, Pn3-39, Pn3-40, Pn3-41, Pn3-42, Pn3-43,
Pn3-44, Pn3-47, Pn3-54, Pn3-55, Pn3-60, Pn3-67, Pn3-68, Pn3-74, Pn3-75, Pn3-76, Pn3-79,
Pn3-87, Pn3-88, Pn3-89, Pn3-91, Pn3-97, Pn3-98, Pn3-103, Pn3-107, Pn2-28, Pn2-39,
Pn3-19, Pn3-25, Pn3-29, Pn3-106-TCYC1 cassette, respectively, 2 micron, LEU2 marker This study
pLPPgRNA Containing LPP site gRNA, 2 micron, URA3 marker This study
Table S4. Strains used in this work
Name Description Source
BY-T3 BY4742-TRP, δDNA::PPGK1-tHMG1-TADH1-PTEF1-LYS2-TCYC1, TRP1::HIS3-PPGK1-ERG20-TADH2-PTDH3-ERG1-TTPI1-PTEF1-ERG9-TCYC1 Dai et al.3
BY-PPT BY-T3, GAL7::URA3:PPGK1-SynPgDDS-TADH1-PFBA1-SynPgPPTS-TTDH2-PTDH3-AtCPR1-TTPL1-PTEF1-SynPgPPDS-TCYC1 Wang et al.5
BY-GM BY-T3, EGH1::TRP1:PPGK1-PGM1-TADH1-PFBA1-PGM2-TTDH2-PTDH3-UGP1-TTPL1-PTEF1-SynPgPPDS-TCYC1 This study
PPD-PPT-GM BY-GM, GAL7::URA3:PPGK1-SynPgDDS-TADH1-PFBA1-SynPgPPTS-TTDH2-PTDH3-AtCPR1-TTPL1-PTEF1-SynPgPPDS-TCYC1 This study
PPD-PPT-GM-UGTs PPD-PPT-GM, harboring pRS425-LEU2-TEF1-UGTs, respectively. This study
Rh2-GM BY-PPT, EGH1::TRP1:PPGK1-PGM1-TADH1-PFBA1-PGM2-TTDH2-PTDH3-UGP1-TTPL1-PTEF1-Pn1-31-TCYC1 This study
Rh2-GM-UGTs Rh2-GM, harboring pRS425-LEU2-TEF1-UGTs, respectively. This study
Rd-GM BY-PPT, EGH1::TRP1:PPGK1-PGM1-TADH1-PPDC1-Pn1-31-TADH2-PENO2-Pn3-31-TPDC1-PFBA1-PGM2-TTDH2-PTDH3-UGP1-TTPL1-PTEF1-Pn3-29-TCYC1 This study
Rd-GM-UGTs Rd-GM, harboring pRS425-LEU2-TEF1-UGTs, respectively. This study
Rg1-XM BY-PPT, EGH1::TRP1:PPGK1-PGM1-TADH1-PPDC1-SynAtUXS3-TADH2-PENO2-SynUGTPg101-TPDC1-PFBA1-PGM2-TTDH2-PTDH3-UGP1-TTPL1-PTEF1-SynAtUGD1-TCYC1 This study
Rg1-XM-UGTs Rg1-XM, harboring pRS425-LEU2-TEF1-UGTs, respectively. This study
YS01 CEN.PK2-1D,
YJL064W::PPGK1-tHMG1-TADH1-PPDC1-ERG12-TADH2-PPYK1-ERG12-TPGI1-PTEF2-SynHMGR-N-TENO2-PENO2-IDI1-TPDC1-PFBA1-ERG13-TTDH2-PTDH3-ERG8-TTPL1-PTEF1-ERG10-TCYC1,
NDT80::PPGK1-tHMG1-TADH1-PTDH3-SynHMGR-N-TTPL1-PTEF1-IDI1-TCYC1,
RHR2::PPGK1-ADH2-TADH1-PFBA1-SynPTA-TTDH2-PTDH3-SynPK-TTPL1-PTEF1-SynADA-TCYC1,
GAL7::PPGK1-VvCPR-TADH1-PFBA1-SynPgDDS-TTDH2-PTDH3-SynPgPPDS-TTPL1-PTEF1-PgPPDS-TCYC1,
EGH1::PPGK1-VvCPR-TADH1-PFBA1-SynPgDDS-TTDH2-PTDH3-SynPgPPDS-TTPL1-PTEF1-PgPPDS-TCYC1,
YPL062W::PPGK1-AtSQS2-TADH1-PTDH3-AtSQE2-TTPL1-PTEF1-SmFPS-TCYC1,
ALD3::PCCW12-SynPgPPDS-TCYC1-PCWP2-SynPgDDS-TCWP2, p414-TEF1p-Cas9-CYC1t This study
YS01-CK YS01, LPP1::PPGK1-PGM1-TADH1-PFBA1-PGM2-TTDH2-PTDH3-UGP1-TTPL1-PTEF1-SynPn3-29-TCYC1, p414-TEF1p-Cas9-CYC1t, pLPPgRNA
YS01-CK2 YS01-CK, containing the pRS425-LEU2-TEF1-SynPn3-29 plasmid, recovering the URA3, HIS3 and TRP1 markers
Table S5. Primers used in DNA assembly
Primer name Sequence (5' to 3')
1-M-pEASY-PGK1-F CTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACGAGCCTTAATTAAACGCACAGATATTATAAC
s-7G-1-M-ADHT-PDC1-R CCTTGCTTGTTTATCTTGCACATCACATCAGCGGAACATATGCTCACCCAGTCGCATGTCGGCATGCCGGTAGAGGTGTGGTCAATAAG
s-7G-1-M-ADH1t-PDC1-F GGTCAGGTTGCTTTCTCAGGTATAGCATGAGGTCGCTCTTATTGACCACACCTCTACCGGCATGCCGACATGCGACTGGGTGAGCATATGTTCCG
3G-1-M-ADH2t-ENO2-R TGTTAATCCACCTTCTAGCTCATTTTGGATGGCAACGGCAAGAGTAGGATTTAGAATTATATAACTTGATGAGATGAG
3G-2-M-ADH2t-ENO2-F GGTAATTCTTCTGTCATTTACTCATCTCATCTCATCAAGTTATATAATTCTAAATCCTACTCTTGCCGTTGCCATCC
6G-2-M-PDC1t-FBA1-R GTTATTTACAGAAGTTGGAAGGCTGGTATTGTTGTTCAAGCCAGCGGTGCCAGTTGGATCGGCAGTTTTGAATTGAGTAACCA
6G-2-M-PDC1t-FBA1-F AGTGTAATAGAAGACTAGACACCTCGATACAAATAATGGTTACTCAATTCAAAACTGCCGATCCAACTGGCACCGCTGGCTTGA
s-4G-4-M-TDH2t-TDH3-R GGCCTCCGCGTCATTAAACTTCTTGTTGTTGACGCTAACATTCAACGCTAGTATGGCGAAAAGCCAATTAGTGTGATAC
s-4G-1-M-ADHT-FBA1-R CCTCCGCGTCATTAAACTTCTTGTTGTTGACGCTAACATCAACGCTAGTATTCGGCATGCCGGTAGAGGTGTGG
s-4G-4-M-ADH1t-FBA1-F CTCAGGTATAGCATGAGGTCGCTCTTATTGACCACACCTCTACCGGCATGCCGAGATCCAACTGGCACCGCTGGCTTGA
S-4G-3-M-TDH2t-TDH3-F GGCATCACGGATTTTCGATAAAGCACTTAGTATCACACTAATTGGCTTTTCGCCATACTAGCGTTGAATGTTAGCGTCAAC
3G-3-M-TPL1t-TEF1-R AGGAGTAGAAACATTTTGAAGCTATGGTGTGTGGGGGATCACTTTAATTAATCTATATAACAGTTGAAATTTGGA
3G-2-M-TPLt-TEF1-F GTCATTTTCGCGTTGAGAAGATGTTCTTATCCAAATTTCAACTGTTATATAGATTAATTAAAGTGATCCCCCACAC
2-M-CYC1t-pEASY-R CGTATTACAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGCGTTGGCCGATTCATTAATGC
X1-M-pEASY-r-t-F CTTGCAAATGCCTATTGTGCAGATGTTATAATATCTGTGCGTTTAATTAAGGCTCGTATGTTGTGTGGAATTGT
GAL7-interg-1 ATATTTGAAAGGCTTATGATTTTCTCTTGC
GAL7-interg-2 GTCCAAATATTTGAAAACAAAGGTACAGC
X2-M-pEASY-r-t-R CGAAGGCTTTAATTTGCAAGCTGCGGCCCTGCATTAATGAATCGGCCAACGCGCCAGGGTTTTCCCAGTCACGACGTTG
X1-Xp-pEASY-M13R-F CCGACTGGAAAGCGGGCAGTGAGCG
2-Xp-pEASY-M13F-R CCATTCAGGCTGCGCAACTGTTGGGA
Xp-M-pEASY-M13R-F GTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGTCACACAGGAAACAGCTATGACC
Xp-M-pEASY-M13F-R CTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGACGACGTTGTAAAACGACGGCCAGT
EGH1-UP-F GGCAACGAGATCCAATTGCGTGGTGTC
EGH1-UP-R CTTAGGGTAGGACTCTTCATCTTTGCC
EGH1-Down-F GCATTAGAAGGAAGTAATCTTTCGTAC
EGH1-Down-R GCAAATTTCAATGAACTGATGGCCAGG
EGH1-up-Marker-50bp-F GACCCCAAAACGGGCAGGGAAGTTGGCAAAGATGAAGAGTCCTACCCTAAGCACACAGGAAACAGCTATGACC
Only-pPGK1-R CACGAGGTTCTACTAAACTAAACCACC
LPP-50-PGK1-F GAGAAACTCCTACATCAACGCCTAAGGAAACTCGTCATATTCTACCAAGGACGCACAGATATTATAACATC
LPP-50-CYC1t-R TGAAGCTTGATCGGCTTTTATTCTTTCCTGATAGGACTCTGTAAGTGTTAGCGCGTTGGCCGATTCATTAATG
M-URA-F ATGTCGAAAGCTACATATAAGGAACGTG
M-URA-R TTAGTTTTGCTGGCCGCATCTT
M-HIS3-ORF-F AGGCAAGATAAACGAAGGCAAAG
M-HIS3-ORF-R CTACATAAGAACACCTTTGGTGG
LEU2-up-F TGCTTGCATCACAATACTTG
LEU2-down-R CTTACCTGTATTCCTTTACA
TRP1-up-F GCTTGTCACCTTACGTACAA
TRP1-down-R ACCCCCTGCGATGTATATTT
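Several of the long assembly primers in Table S5 appear to come in F/R pairs whose sequences overlap as reverse complements, which is the usual arrangement when adjacent fragments are joined through shared homology. A minimal sketch of how that overlap could be measured is given below; the helper names `revcomp` and `longest_shared` are hypothetical, and the script only reports the shared region without asserting what overlap length the design was meant to have.

```python
from difflib import SequenceMatcher

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def revcomp(seq):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def longest_shared(fwd, rev):
    """Longest stretch of fwd that also occurs in the reverse complement of rev."""
    fwd = fwd.upper()
    other = revcomp(rev)
    matcher = SequenceMatcher(None, fwd, other, autojunk=False)
    match = matcher.find_longest_match(0, len(fwd), 0, len(other))
    return fwd[match.a:match.a + match.size]


# One primer pair copied from Table S5.
adh1t_pdc1_r = ("CCTTGCTTGTTTATCTTGCACATCACATCAGCGGAACATATGCTCACCCAGTCGCATGTCG"
                "GCATGCCGGTAGAGGTGTGGTCAATAAG")
adh1t_pdc1_f = ("GGTCAGGTTGCTTTCTCAGGTATAGCATGAGGTCGCTCTTATTGACCACACCTCTACCGGC"
                "ATGCCGACATGCGACTGGGTGAGCATATGTTCCG")

shared = longest_shared(adh1t_pdc1_f, adh1t_pdc1_r)
print(len(shared), shared)
```

`SequenceMatcher` is used here only as a convenient standard-library way to find the longest common substring; a dedicated alignment tool would be a reasonable alternative for larger primer sets.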
Table S6. The pair-wise amino acid identities of the important UGTs.
Percent identity (%) Pn1-31 Pn3-29 Pn3-31 Pn3-32 Pn3-32-i5 PgUGT74AE2 PgUGT94Q2 PgUGT71A29 PgUGTPg100 PgUGTPg101 BsYjiC
Pn1-31 100 23.2 22.9 23.1 22.1 96.2 22.9 21.6 21.4 23.0 20.1
Pn3-29 - 100 27.3 26.4 25.9 23.4 28.1 95.4 85.9 98.9 15.3
Pn3-31 - - 100 90.7 93.4 23.1 97.5 27.8 28.4 28.1 19.5
Pn3-32 - - - 100 88.6 23.5 89.4 27.4 27.9 27.4 20.1
Pn3-32-i5 - - - - 100 22.3 92.3 26.4 26.8 26.7 19.5
PgUGT74AE2 - - - - - 100 23.1 21.8 21.7 23.2 19.6
PgUGT94Q2 - - - - - - 100 27.8 28.2 28.1 19.8
PgUGT71A29 - - - - - - - 100 83.0 95.8 15.0
PgUGTPg100 - - - - - - - - 100 86.0 14.8
PgUGTPg101 - - - - - - - - - 100 15.8
BsYjiC - - - - - - - - - - 100
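The table reports pair-wise amino acid identities but does not state here which alignment tool or scoring scheme produced them. As an illustration only, the sketch below computes percent identity from a simple global (Needleman-Wunsch) alignment with unit match, mismatch and gap scores, dividing identical positions by the number of alignment columns. These scoring values and the function name `global_identity` are assumptions, so the numbers it gives should not be expected to reproduce Table S6 exactly.

```python
def global_identity(a, b, match=1, mismatch=-1, gap=-1):
    """Percent identity over a Needleman-Wunsch global alignment (linear gap cost)."""
    n, m = len(a), len(b)
    # score[i][j] = best score for aligning a[:i] with b[:j]
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back one optimal path, preferring aligned pairs over gaps on ties.
    i, j = n, m
    identical = columns = 0
    while i > 0 or j > 0:
        columns += 1
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                if a[i - 1] == b[j - 1]:
                    identical += 1
                i -= 1
                j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            i -= 1   # gap in b
        else:
            j -= 1   # gap in a
    return 100.0 * identical / columns if columns else 0.0


# N-terminal 20 residues of Pn3-31 and Pn3-32 as listed in Supplementary Sequences 1.
pn3_31_nterm = "MDNQKGRISIALLPFLAHGH"
pn3_32_nterm = "MDIEKGRISIVMLPFLAHGH"
print(round(global_identity(pn3_31_nterm, pn3_32_nterm), 1))
```

For full-length proteins a substitution matrix and affine gap penalties would normally be used; the unit scores here keep the example short and dependency-free.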
Supplementary Tables S3-S6 References:
1. Sikorski RS, Hieter P (1989) A system of shuttle vectors and yeast host strains designed for efficient manipulation of DNA in Saccharomyces cerevisiae. Genetics 122:19-27.
2. Dai Z, et al. (2013) Metabolic engineering of Saccharomyces cerevisiae for production of ginsenosides. Metab Eng 20:146-156.
3. Dai Z, et al. (2014) Producing aglycons of ginsenosides in bakers' yeast. Sci Rep 4:3698.
4. Lin T, et al. (2016) Construction of cell factories for production of lupeol in Saccharomyces cerevisiae. China Journal of Chinese Materia Medica 41(6):1008-1015.
5. Wang J, et al. (2019) Optimization of UDP-glucose supply module and production of ginsenoside F1 in Saccharomyces cerevisiae. China Journal of Chinese Materia Medica 44(21):4596-4604.
Supplementary Sequences 1
>Pn1-31
MDREMLSKTHIMFIPFPAQGHMSPMMQFVKRLAWKGVRITIVLPAEIRDSMQINNSLINT
ECISFDFDKDDEMPYSMRAYMGVVKLKVTNKLSDLLEKQKTNGYPVNLLVVDSLYPSRV
EMCHQLGVKGAPFFTHSCAVGAIYYNARLGKLKIPPEEGLTSVSLPSIPLLGRNDLPIIRTG
TFPDLFEHLGNQFSDLDKADWIFFNTFDKLENEEAKWLSSQWPITSIGPLIPSMYLDKQLP
NDKDNDINFYKADVGSCIKWLDAKDPGSVVYASFGSVKHNLGDDYMDEVAWGLLHSK
YHFIWVVIESERTKLSSDFLAEAEEKGLIVSWCPQLEVLSHKSIGSFMTHCGWNSTVEALS
LGVPMVAVPQQFDQPVNAKYIVDVWRIGVQVPIGENGVLLRGEVANCIKDVMEGEIGDE
LRGNALKWKGLAVEAMEKGGSSDKNIDEFISKLVSS.
>Pn1-50
MGAELIFIPFPGAGHLISVVEIAKLLISRAEWLSITILVMKLSLDTGVAGYTQSLQKDAHNR
IVFADLPPDESVISMIKTIKTSDRLSIFRVFIESQKTQVRDAVKVIVSQSESNKLAGFVIDMF
CTPMIDVANEFGVPSYVFFTSGAAYLGLQFYHLSLDDEHKQDVIQYKDSDVELSVPYFVN
PVPAKVLPSVMLIKEGSTMIQSISRRFKETKAILVNTFLELEPHAIKCLADNGKIPPVYHVG
PIVNLKSKEGTTQNHNSEDGIISWLDNQPPLSVVFLCFGSLGSFDKEQVREIAYGLEHSGQ
RFLWSLRPPPQPTELFGLPKEYENHNEVLPEGFIKRTTGIGKVIGWAPQVVVLSHPAVGGF
VSHCGWNSTLESIWCGVPMATWPIHAEQQLNAFELVKELGMAVDIKMDYRKDSSIATEP
LVVTADEIEKGIKCLMNAASEMRKKVKEMKEKSRMAMVDGGSSHISLGHFIEDVMGNIQ
ERAKYNQVG.
>Pn3-13
MDIEKGRISIVMLPFLAHGHISPFFELAKHLSKRNCNIFLCSTPINLSSIKNRVSDKDSSASIK
LVELHLPSSPDPPPQYHTTNGLPSHLMVPLKNAFETVGPTFSEILKTLDPDLLIYDFNPSWA
PEIALSHNIPAVYFLTSAAATSSVALRALKNPGEKYPFPDFYDNSNITPEPPSADKMKLFHD
FVACFKRSCDIILIKSFRELEGKYIDLLSTLSKKTLVPVGPLVQDPLGHDEDPKTGHLINWL
DKRAESTVVFVCFGSEYFPSNEELEEVAIGLEISMVNFILAVRFLEGEKKGVLPEGFVQRV
GDRGLVVEGWAPQARILGHSSTGGFVSHCGWSSIMESVKFGVPVIAMARHLDQPLNAKL
AAEVGVGMEVVRDENGKYKREAIAEVIRKVVMEKNGEVIRRKARELSEKMKETGEQEI
GRAVEELVQICKMKKDAQY.
>Pn3-29
MKSELIFLPAPAIGHLVGMVEMAKLFISRHENLSVTVLIAKFYMDTGVDNYNKSLLTKPTP
RLTIVNLPESDPQNYMLKPRHAIFPSVIETQKTHVRDIILGMTQSESTRVVGLLADLLFINI
MDIANEFNVPTYVYSPAGAGHLGLAFHLQTLNDKKQDVTEFRHSDTELLVPSFANPVPAE
VLPSMYVDKEGGYDYLFSLFRRCRESKAIIINTFEELEPYAINSLRMDSMIPPIYPVGPILNL
NGDGQNSDEAAVILGWLDDQPPSSVVFLCFGSYGTFQENQVKEIAMGLERSGHRFLWSL
RPSIPKGETKLQLKYSNLEEILPVGFLDRTSCVGKVIGWAPQVAVLGHEAVGGFLSHCGW
NSTLESVWCGVPVATWPMYGEQQLNAFEMVKELGIAVEIEVDYKNEYFNMTNDFIVRAE
EIETKIKKLMMDEKNSEIRKKVKEMKEKSRLAMSENGSSYNSLAKLFEEIM.
>Pn3-31
MDNQKGRISIALLPFLAHGHISPFFELAKQLAKRNCNVFLCSTPINLSSIKNKDSSASVKLV
ELHLPSSPDLPPHYHTTNGLPSHLMVPLRNAFETVGPTFSEILKTLNPDLLIYDFNPSWAPE
IASSHNIPAVYFLTTAAASSSIGLHAFKNPGEKYPFPDFYDNSNITPEPPSADNMKLLHDFIA
CFERSCDIILIKSFRELEGKYIDLLSTLSDKTLVPVGPLVQDPMGHNEDPKTEQIINWLDKR
AESTVVFVCFGSEYFLSNEELEEVAIGLEISMVNFIWAVRLIEGEKKGVLPEGFVQRVGDR
GLVVEGWAPQARILGHSSTGGFVSHCGWSSIAESMKFGVPVIAMARHLDQPLNAKLAAE
VGVGMEVVRDDNGKYKREGIAEVIRKVVVEKSGEVIRRKARELSEKMKEKGEQEIDRA
VEELVQICKKKKDAQ.
>Pn3-32
MDIEKGRISIVMLPFLAHGHISPFFELAKHLSKRNCNIFLCSTPINLSSIKNRVSDKDSSASIK
LVELHLPSSPDLPPQYHTTNGLPSHLMVPLKNAFETVGPTFSEILKTLDPDLLIYDFNPSWA
PEIALSHNIPAVYFLTSAAATSSVALRALKNPGEKYPFPDFYDNSNITPEPPSADKMKLFHD
FVACFKRSCDIILIKSFRELEGKYIDLLSTLSKKTLVPVGPLVQDPLGHDEDPKTGHLINWL
DKRAESTVVFVCFGSEYFPSNEELEEVAIGLEISMVNFILAVRFLEGEKKGVLPEGFVQRV
GDRGLVVEGWAPQARILGHSSTGGFVSHCGWSSIMESVKFGVPVIAMARHLDQPLNAKL
AAEVGVGMEVVRDENGKYKREAIAEVIRKVVMEKNGEVIRRKARELSEKMKETGEQEI
GRAVEELVQICKMKKDAQY.
>Pn3-32-i5(Xyl)
MDNQEARISIVMLPFLAHGHISPFFELAKHLSKRNCNIFLCSTPINLSSIKNRVSDKDSSASI
KLVELHLPSSPDLPPHYHTTNGLPSHLMVPLRNAFETAGPTFSEILKTLNPDLLIYDFNPSW
APEIASSHNIPAVCFIIGGAASSSMSLHSFKNPGEKYPFLDFNENSNITPEPPSADNMKLFLD
FMTCFERSCDIILIKSFRELEGKYFDFFSTLSDKTVVPVGPLVQDPMGHNEDPKTEQFINW
LDKRAESTVVFVCFGSEYFLSNEELEEVAIGLEISMVNFIWAVRLIEGEKKGVLPEGFVQR
VGDRGLVVEGWAPQARILGHSSTGGFVSHCGWSSIAESMKFGVPVIAMARHLDQPLNAK
LAAEVGVGMEVVRDDNGKYKREGIAEVIRKVVVEKSGEVIRRKARELSEKMKEKGEQEI
DRAVEELVQICKKKKDAQ.
>Pn1-2
MFTTKMPTQKYSTQPPHVLIFPLPLQGAINSMLKLAELLCLSDINITFLITDNNHTRLLRYT
DIPSRFNKYPGFRLEKIPDGLPEDHPRGSRYIEMLESMKTNTKPIFKEMLTSGRLGDKLRP
VTCIIADGVMGFTCDVANEVGIPIIYTRTLSASCLWAFFCLPNLIETGELPLNGDDMDTPIK
SVPGTENFLRRRDLPNFCRSGDLSNPSIQLFKSEYKENSRADGLILNTFEDLEGPILSNIRTV
CPNLYTIGPLHSHLKTKLAAKSISSAISTNSLRKEDRSCIKWLDSQPPKSVIYVSFGSIAVMT
RNQLMEFWYGLVNSGCKFLWVIRPDSVAGDGEIPAEVLKGTKERGYIVGWAPQEEVLGH
SSVGGFLTHSGWNSTLESVMEGVPMICWPYFMDQQVNSRFVEKFWKIGLDMKDTCDRV
IVEKMVKDLMVERRDEFMKSADRMAKMGRKCLSEGGSSYCNLDRLIEDIKLMGVKSKQ
.
>Pn1-3
MDSPLSSSRPLEIYVLPFLSPGHMIPLSEIARLFAARGEHVTIITTPSNVALTPDKEQNASIRI
HTIPFPAKEVGLPDGLENFFSVKDIDTAAKVFTGMQLLRNSIEEYVTEHRPDCLVSDMFFP
WTADLAIRLDIPRLVFNATCMFSQVLKDAVRRPDSPHLTVKSDYDPFVIAGLPHPITMTRA
ELPDYVRTPNGYTKMMEEWKEAELKSYGVLVNNYYEFDSAYTDFYQKMVAPTQKIINV
GPAALIHRSGNEKVKRGHKTVVGEHECLSWLKSKDPNSVLYVCFGSGCIFPDAQLMEIAC
GLMAAGHDFMWVVLGKDDEKKEDEVNWLPLDFDEKMIKTNKGTIVKGWAPQVLILDH
PSVGGFLSHCGWNSVIEAVSAGIPMATWPLYAEHFYNEKLLTQVLGIGVEVGAEEWNLW
VDAGKKVIKREKIERAVNKIMDGGDGSKDMRRKTKEMGERAKKAVGEGGSSQRNVTVL
IEDLRKLREKREKA.
>Pn1-4
MDSPLSSSRPLEIYVLPFLSPGHMIPLSEIARLFAARGEHVTIITTPSNVALTPDKEQNASIRI
HTIPFPAKEVGLPDGLENFFSVKDIDTAAKVFTGMQLLRNSIEEYVTEHRPDCLVSDMFFP
WTADLAIRLDIPRLVFNATCMFSQVSDMPPFHLFFLSQVLKDAVRRPDSPHLTVKSDYDPF
VIAGLPHPITMTRAELPDYVRTPNGYTKMMEEWKEAELKSYGVLVNNYYEFDSAYTDFY
QKMVAPTQKIINVGPAALIHRSGNEKVKRGHKTVVGEHECLSWLKSKDPNSVLYVCFGS
GCIFPDAQLMEIACGLMAAGHDFMWVVLGKDDEKKEDEVNWLPLDFDEKMIKTNKGTI
VKGWAPQVLILDHPSVGGFLSHCGWNSVIEAVSAGIPMATWPLYAEHFYNEKLLTQVLGI
GVEVGAEEWNLWVDAGKKVIKREKIERAVNKIMDGGDGSKDMRRKTKEMGERAKKAV
GEGGSSQRNVTVLIEDLRKLREKREKA.
>Pn1-6
MERKAMEYSGSEILVLPFYGQGHLFPSMELCIHFSSRNYSPTLIIPSYLSSSIPDSLRHHPFIQ
ILQIAVDKSESPPPAEKAAGGRFQHQHQQLGQGIERLLSARSGDTNQTRPVCVVFDVMMS
WSKEIFAKFGFPAISFFTSGACSAAMVYAGWKARVDEMKPGDIRVLPGLPQDMALAYSD
LDRRPPRHGGGGGNGSPLGPPNGDRLNGPPSRGMGFGPPGPGQKPRWLDEVEGSVALLI
NTCDDLEKSFINYVATQIGKPVWGVGPLLPEKYWKSAGSLLRDGEIRSSGRQSSYTEDEV
TQWLDSKSCGSVIYISFGSEVGPTVEEYEQLANALGELNQPFIWVLQPNSGKPGPPSTLFG
VQTGPDVVAEGGYYPHSLDKKVGNRGLIIRGWAPQLLILSHPSTGGFLSHCGWNSTVEAI
GRGVPILAWPIRGDQYYNAKLVVSHLKVGCMVSSGGDSAKGMKKEDIIQGIERLMGDEE
VHKRARILQGKLESGFPASSVAALDALVDFIK.
>Pn1-12
MEQNQKMASPQEHIIMLPFFAHGHLFPFLALAKQIQERTGFTITLVSTPLNILYLKSTNTQN
PQIHLVPLPFNSSHHNLPPNTETTESLPLSQVITLFHASSSLESPFRRFISDVTIRDGKSPICIIS
DVFMGWANEVAKSLDIVNVSFSTCGAYGSAAYVSVWQKLPHRFLENDNDEFCLPWFPER
CRVTRSHLHQFVRVADGNDEWSKFFQQQTTFSLGSFGWLCNTVQEIEPLGLEVLKNCTK
LPIWCIGPLLPQRMLESSSNPGTFGKRAGKEPGLSPEECLKWLDLFPKSSVLYISFGSQNTI
RPTQMMELAKGLEESGQPFIWAIRPPIGFNLKENFRDEWLPPGFEEQMIHSKLGLLVHKW
APQLEILSHKSTGAFLSHCGWNSTLESLSQGVPIIGWPLAAEQVYNSKMMEEEMGVGIEL
TRGLESSIVKEDVKRVIEIVMGKNGKGEEMRKKACEIGELIRVAAREENDVKGSSLQAMD
DFVASILSFSRR.
>Pn1-14
MAANDKLHIVMFPWLAFGHILPYLKLAKLIAKKGHKISFISTPRNIDRLPKIPPNLAPHIDL
VKFPLPSIPNLPENAEATIDVPFNKVKYLKIAYDQLQQPLTQFLVSNSPNWILFDIIAYWVG
PIASKLNVRSSFFSIFSASIMGYFGPPSVLMHSDEDRRKPEDYSIKPKWVRFETTVAMSLH
QMQSTFVNFTEDDTENVPDQYRFAASIRDCEMVAVRSSSVFEPEWLELLDEIYQKPVVQV
GLLPTDTTDTDTGGKSNDSWRDIKDWLDKQEKVSVIYVAFGAETKPNQDELTELALGLE
LSGLPFIWALRKQRGSADPEPTELPEGFEERTRGHGLFYTSWVPQTKILSHDSVGGLLIHS
GWSSVIEAVQFGRALVLLPLLADQELIAKLVEEKKMGYLIPRNERDGWFSRDSVAESVRL
VMVDEEGKIYRDNVKEMKGVFGDMDKQDEYVDNLLSYLESHKILTK.
>Pn1-15
MEVVVDEAIKPHVAILPSPGMGHVIPLLEFAKRLVIIHNFHVSFLVITTEASTAQNNLLHSS
ALPADLHVIHLPPVDVSNVISDDMPIVTRISLICQESLKPLRSILIDLNRLKALIIDIFVTQTF
MVCQELSIPVYTFFTSTTQVLALSLYLPILDQEVKGEYIDLPEPIQLPGCKPIRIDDLLDQIR
DRKINEYNWFLLHVSRLPMAAGILVNTWEHLESTSTWLNALKQDPFVTNIPTPPIYPIGPL
VKHNDLVAQSDAYIMSWLDNQPRDSVLFVALGSGGTLRSEQLSELAWGLEASKQRFILVA
QIPTDLCAFATFFNVGSDGNDPVAYLPEGFVRRTEGVGLVVPSWAPQVAVLCHEATGGFLS
HCGWNSALESMVHGVPMIAWPLYAEQKMNATMLTEEVGVAVKPAARDGERVIRREEIER
VVRLVMESEEGKILRRRAKELQESAEKALVSGGSSYESLTRVVESWKK.
>Pn1-16
MDSQVSSRPLEIFVIPYLTPGHLIPLSELACLFAVRGEHVTIITTTTKAAVIQKTINKFSSSGH
PVSLHPIPFPSKEAGLPEGLELEQANDVETAGKFILGLNLMQPIIEDFVVMRRPDCIIADKF
YPWTSDLAARLSIPRLVFDPYSIFAKSLHEALLNPNSPHLTVESDYDPFVIPDFPHRITMTRS
QLPDSRFAQLFKVFREAEVNSYGLLANSVSELDSVYTEYYSKKMGHKVFHIGPASLIHQS
ANDKVERSHKSAVSERQCLSWLDSKKPDSVVYICFGSGCVLPDAQLMEIGYALELAGSD
FIWVVAGKNKDDEDEEKWLPRGFNERVVKKGKGLVVKGWAPQLLILDHPSTGGFLTHC
GWNSAIEAMIAGVPLISWPFICDNFFNEKFFTQVLGIGVEVGALDWKLFSEVGTKVINRE
KIEKAVRKLMDGAGEDEGKEMRKKTRELGEKATRAVKEAGSSHTNLTLLIEELKQLRIER
EERSDQ.
>Pn1-17
MKQLRELLTSNTSKNCKKSSEEQLEMSIPHVLAVPFPAQGHVIPLMELVQHFVKKGFKVT
FVNTDFNHKRVINALLEKDRNVGDMIHMVSLPDGLDPGEDRNDIGKLSEAVPRVMPEKL
EKLIKNINATDNNKITCLVADENMGWALKVAEKMGIRRVAFWPASAAVLGSIFSIPKLIKD
GIINNDGTIIKNQMIDVSPNMPARNTHFAWACFGDLKTQKMFFELTVNNNEFVKLADRIIC
NSAYELEPAAFTLIPNMLPIGPLLASNRHGNQAGNFWPEDSTCLTWLDQQPVCSVIYVAF
GSFTVFDQSQFQELALGLELTNKLFLWVVRPDMTNEMNEAYPKGFTDRIGTRGRMVGW
APQEKVLRHPSVACFLSHCGWNSTMEGVSNGVPFLCWPYFTDQFLNQSYICDVWKVGL
GFNKDESGIIRQGEIKNKVEHLLGDRKFRARALDLKEKVVNSVKGHGCSHKNLSNITDW
MK.
>Pn1-20
MAVAGAGAHILVFPYPAQGHMISLLDLTHQLAVRGLTITILVTPKNLSFLDPLLSKHPSIKPL
VLPFPVSPSIPAGVENLKDLPVGSFSVMMAALGELYNPMLNWFESHPSPPVAIISDMFLGW
THRLACQLSIRRFVFCPSGALAMSVIFALWRDMRQRNDPGDENELISFPEIPNSPVYPWSE
LSPVYRSFVAGGPRSEFLKDAFLGNIASWGIVINSFRELERVYLDYLKESLGHDRVWSVGP
LLPPEIDRVSRGGSSSVLASEITSWLDKFDDQTVVYVCFGSLAVLTNKQMEELALGLEKS
GVNFLWSSKIPTEGHVEGEYGMVPLGFQERVAGRGLVIKGWVPQVSILSHRAVGAFLTHC
GWNSVLESIVAGVPMLAWPMGADQFVNTDLLDELKVGIRVCKGANMIPDSDELARLVA
KGVNNEERGERIARAKELSKAALVSTKMGGSSYKSLDELVRHLSQDQDFLACDPKKHIIK
.
>Pn1-21
MWSFGKAENSRPHAVCVPFPTQGHINPTLKLAKLFHHRGFHITFVNTEFNHQVLLKTRGP
HALDRLPTFRFETIPDGLPPSNARATEDIPLLFDSTSKNCLAPFRELLLKLNNPLSSPDIPPV
TCIVSDFVMGFTLKAAEELGIPKVLFWTASACSFMCFLHYGQLIEKGFTPLKDSSYLTNG
YLDTVVDWIPGMKGIRLKDLPSFIRTTNPNDPMINFGLADTDKYYKSSPQIILNTYDTLEH
EVLDALSSFLPSNIYSIGPIHHQLNNIHKSSTNSIELNLWKDEQQCLAWLDSKEPNSVVYV
NFGSTTIMSPDQLVEFSWGLANSNQTFLWIIRPDLVNGDTTVLPPDSVEAIKERGLFASWC
SQEQVLRHPSIGGFLTHNGWNSTLESITCGVPMLCWPFFADQQTNCWYCCTRWGIGMEI
NSDAKRDEVESLVKELMVGVKGKKMKEKAMEWKKLAEGAITPPAGSSYINFEKMVND
VLLTNNISE.
>Pn1-23
MNAPSFHIAMFPWFALGHLTPFLHLSNKLAKQGHRVSFLIPTKTQPKLQSFNLHPDLITFIP
ITVPHVDGLPRGSETTSDVPFPLQTLLVTAMDYTEDHVECLLYDLKVDVVLFDFAHWIPG
LARRLGIKSIHYCIISPATIGYTLSPERHLNVDKITEADLMKPPANYPGSNITLHAHEARAFA
SRRVMKFGNNTLFNDRQFISLSQCDALGFRTCREIEGPYCDYLESQFGKPVLLSGPVIPEPP
TSPLEEKWAKWLSKFASGSVIYCAFGSECILKMYQFQELLYGLELTGMPFLAALKPPAGA
ESIEEALPDKFEERVKGRGVVHEGWVQQQLILGHPSVGCFITHCGSGSLAEALVNKCQLV
LLPQVGDQFINARMMSQNLKVGVEVEKGEEDGVFTRESVCKAVGNVAQEDNQVGKEVR
TNHARLRDFLLDKDLESSYIHSFSKKLQQLLG.
>Pn1-24
MDPPTFHIAMYPWFALGHLTPFLQLSNKLAKKGHKISFILPNRTQLKLQTFNLLPDLITFVP
ITVPHVDGLPHGTETTSDVPAPLHSLVMTAMDRTQKNIEILLHDLNVDIVFFDFTHWMPG
LARRLGIKSLFYSITCPGTTAFALSLAPRAHENNAKEADDFPLLSIKLHLHEARGFAAGSTE
NFGRGIKIFDRLLISLKECDALGFRSYRELDGPSCDFLESYFQKPVLLSGPLIPEPPNSTLDE
KWVKWLAKFGPGSVIYCAFGSECTLKKDQFQELLLGFMQMGMPFLAVLRPPLGVESVE
AAIPQGFNEGVGERGVVHEGWIQQQLILEHPSVGCFVTHCGVASLVEALMNKCELVLLP
QAAGDQIIQARILSTILKVGIEVEKGEEDGLFTRDSLCKAVRTVMEKDNEVGKEVRSNRS
KIREFLSNKELESSYINSFCEKIENLVLGRSS.
>Pn1-25
MDSQSNQLHFVMIPLMSPGHIIPMVDMAKLVAQRFVTVTIVTTPLNATRFGAVIARAIESG
LPIKFLQLRFPAVEAGLPEGCESIDNLPSFNLSRNFFAAVYMLQEPLQKMLEEIKPSPSCILS
DKHVTWTADTAKKFQIPWIIFDGMSCFTQICTHNLYTSRVHENVADSERFAVPNLPHHIEF
TKAQLPGLFNPRSPSAAAFHGRIRAAEVGAYGVVINSFEELERDYVNEFKKMRGDKVWC
VGPLSLCNKDKLDKAQRGNKASIDQNKILQWLDSWEPGSVVYACLGSLSSLAPPQLIELA
LGLEASEHPFVWVVREGNKQKEIENWILKEGFEERTKGRYLLIRGWAPQVLILSHPAIGGF
LTHCGWNSTIEGICAGVPMITWPLFAEQFFNEKLVVQVLDTGVSVGARAVVDLAEEDKN
GVTVRSEEVKKAIKRIMFGGEEWVERRERARELAKLAVRVVEEGGSSYLNMTLLIQDIM
QQTNTRLPSERKCSN.
>Pn1-28
MGLNSANKAHAVCVPFPAQGHVNPVMQLAKLLHSRGFYITFVNTEFNHRRLIRSKGYEW
VNGFDDFKFETIPEGLPPSDHDATQDPSALFDSIPKNCLAPFRDLLAKLNSSAEVPPVTYIV
ADGIMSFAVEAAKEIGVPEVQFWTATACGFMGYLHYPELMKRGIVPFKDENFMIDGTLDT
KVNWIPGMIRNLRLKDLPAFMRTTDPNNVLFTYARDEALNCFKASAMIFNTFEPLEHEVL
QALSSMFPHIYPIGPLSLLLKDLPNSPLYSFRSSFWKEDTKCLEWLDKREPNSVVYVNYGS
LTLMTDQHLKEFAWGLANSKHPFLWIVRPDIVMGDSAILSEDFLEDIKDRGLLASWCPQD
EVLSHESICVFLTHCGWNSTLESICEGLPIICWPFFAEQQTNCLYSCSDDQWGIGVEVNQD
VKRGEIEALVKEMMEGDKGKKLKKNALEWKKKAEEATSVGGSSYSSFDRFIKEALHCN
M.
>Pn1-29
MGNPKPHVVCIPYPSQGHVTPMMRLAKLLHSRGFHITFVNTEFNHKRLVRAKGPQSVKG
FSDFRFEAIPDGLPPSDRDATQDIPKLCDSTRKNCLVPFKELLGKLNSSPELPRVTCIISDGV
MSFGIKAAEEMGIPEVQFWTSPACAMMCFLHYREFINRGICPFKDDNYLTDGTLDKPIDW
IAGMKNIRYKDVPSFIRTTDPNDIMFDFIGEEAQNCLKAPAIIFNTFDAFEHQVLQAYASKF
NYRNIYTIGPLPLLGRHVPESPVNSLNSSLWKPDSKCLEWLDKKEKDSVLYINYGSITTMT
EQHLIEFAWGIANSKHPFLWIVRPDIVTGDSDSASLPREFLEETKEIGMLATWCAQDQVLA
HPAVGAFLTHCGWNSMMETVCEGVPVIGWPFFSDQQTNCRYSCTEWGIGMEVNEDVRR
EEVELLVKEMMGGEKGKELRRKAKEWKMLAEEATNVGGSSYQNFDRFIKEALHYEG.
>Pn1-30
MAKNQQKVHAIMIPYPFQGHVIPFVHLAIKLASRGFTITFVNTHSIHQQISKAQSTDAGDIF
AGARESGLDIRYATVSDGLPLEFDRSLNHDQHQESLLQVISAHVDELVGKLVCSGNPPVSC
LIADTFFVWPSSIASKYNLVNVSFWTEPALVFTLYYHLDLLRQNGHFASSDNRLDTIGYIP
GVQAIEPRDLMSYLQASDITTVVHRIIYEAFEDVKRADFVICNTVQELEHETLSALNQKQP
IYAIGPIFPNEFKKTKIATSLWSESDCSQWLNSKANGSVLYVSFGSYAHVSKHDIVEIAHGL
SVSEVDFVWVLRPDIVSSDDTDILPVGFEDSVKERGLIVSWCHQIGVISHPAIGGFLTHCG
WNSTMESMWCGVPLICFPLLTDQFTNRKLVVDDWKLGINLCDQKVITRKEVSEKIDSLM
RGKDLNIKLRNEMKKVRITLESACVTDGSSEKNFENFISDVKIKIKILGR.
>Pn1-34
MSGSSGKGAHVLVFPYPAQGHMLPILDLTHQLALHGLTITILVTPKNLPILNPLLSAHPSSSI
QTLLFPFPHRPSLLETENIKDIGNSGNVLIINALSALSGPIIHWFNTHPSPPVALLSDFFLGW
THHLAHQIGIPRICFYSSGAFLTSILACLWDDDIDTLVSLDVVDFPDLPHAPSFPKEHLPTVF
LTYRKSNPMWELVKDGMIANSASWGCVFNTFDAIEGEYLEYSGKKMGHERVFAVGPLSL
LGGPDHTTRGSTSRSSGSDNTNILAWLDDVYPDGSVLYVCFGSQKLLKRVQMEALSSAL
EQCGIKFIWVVKSPTAQQVADGYGFVSDEFEKRVSGRGLIIKEWAPQVSILNHRAVGGFLS
HCGWNSVLEAIVSGVTILGWPMEADQFVNAKLLVDYMGVAIRVCEGPDIVPNSFELAKK
VAESMNAEITEKMRAKELKVKALEGVKHCGSSSRDLDGLVKELAQLQLKNDR.
>Pn1-37
MEKNHCDNQSSGLKLAPPVTVIMVPFPAQGHLNQLLHLSRLISSYNIPVHFAGTTAHSRQ
AKLRVHGWDPLSIANIHFHEFSTPAIPSPPPKPNNSGKLPPHLEPLFDGLLLLREPVAQLLFA
LSYTTQRIVVIHDLAMASVVQDVVLIPYAESYSFQPVSAFFISSYKGENMREKFGIEDDIV
KDLPPFESILSSEVMEFSKKLQVHHKFNSGNLYNSSNVIEAQFLDILKKVHVSSDTDIKQW
AIGPFNPVVIYDSDPNQAHKCLKWLDKQAPNSVMFLSFGTTTTLSDEQILELAIGLENSEQ
KFIWVLRDADRVDIFAGDVRRAELPKGYEERLEGKGMVVRDWAPQLEILAHPSTGGFMS
HCGWNSCLESITMGVPIAAWPMHSDQPANAVLITKVLKIGVVVKDWALGDHELVKSSTV
ENAVRRLMASIEGDEIRRRAAQMGAAVRSSVAEGGVARVEMDDFIAHIRR.
>Pn1-39
MENNHVLHVVMFPWLAIGHLIPFLQLSKILAQKGHRVSFISTPRNIERLPKIPPDLSPMINL
VTFPFPKVHNLPDEAESSMDIPHHKAQFLKIAFDLLKSPLVDFLKNTRLKPDWIVYDYAS
HWLPSIAGELGILRGYFSLFSAACMAFLGPPSLLVSGEDGRLKGEDFTIVPKWIPFQSDVV
YRLHEVIKYVDGSAGNESGASDSVRFGASVGGSDLVLFRTCVEFEPEWFNLVCQLYRKPV
VPVGVLPPDNDDFGNEEKWVFIKEWLDKQMVNSVVYVALGSEAILSRDELSELALGLEQ
CGLPFFWVLRMPPGSTQDESQLLLPDGFVERVKSRGVVHQGWVPQVKILSHSAVGGFLT
HCGWNSVVEALGYGRVLILFPVMNDQGLNARLLKGKKVGVEIPRNERDGAFTSDSVAD
SVRIAMVSQEGESVRATAREMRGLFGHGIRNDNCIDNFVSHLVKMRKSQLMS.
>Pn1-45
MEKQTLANKPHFMTIPYPYQGHMNPMLQFSKRLASKGVQITILATNDVKTSKLAHTSSIN
IEYISYEIEQGDEIPKSVAAGLGYINHKVLKSVPGIIEKHKASGSPIKVIVYDSLIHGALELA
HKLGLRGASLFTQTCAVCSVYYHVQRGSLALPLEGHTVSLPSIPVLEIGDLPSLVQAHDRL
YPGLLEVIKKQFVDLEKADWIFFNVFDKLEEEISKWLSSQWPIKTIGPSIPSMFLDKRLPD
DRDYGLNLFKPDAEACIKWLNAKATGSVVYVSFGSVANLAQNHMDELAWGLLNSNINF
LWVVRETEQNKLPGNFNADASEKGLVVSWCPQLEVLAHKAVGSFMTHCGWNSTLEALS
LGVPMLVMPQWGDQTTNAKYVVDVWQTGVWVKARDKEIFTREDIANRIKDVMDGAK
GEELRANAIKWKELAVEAMSEGGSSDKNIDEFISKVLSTTT.
>Pn1-46
MEKQTLANKPHFMTIPYPYQGHMNPMLQFSKRLASKGVQITILATNDVKTSKLAHTSSIN
IEYISYEIEQGDEIPKSVAAGLGYINHKVLKSVPGIIEKHKASGSPIKVIVYDSLIHGALELA
HKLGLRGASLFTQTCAVCSVYYHVQRGSLALPLEGHTVSLPSIPVLEIGDLPSLVQAHDRL
YPGLLEVIKKQFVDLEKADWIFFNVFDKLEEEISKWLSSQWPIKTIGPSIPSMFLDKRLPD
DRDYGLNLFKPDAEACIKWLNAKATGSVVYVSFGSIANLGQDHMDEVAWGLLNSSSNFL
WVVRETEQNKLPGNFKAEASEKGLVVSWCPQLEVLAHKAVRSFMTHCGWNSTLESLSL
GVPMLVIPQWTDQTTNAKYVVDVWRTGVWVKARDKEIFTREDIVNCIKAVMDGAKGV
ELRANAIKWKELTVEAMSEGGSSDKNIDQFISKVISTTTCSFQADISCI.
>Pn1-47
MAETQKPHVVCMPYPAQGHITPMLKLAMLLHHRGFYITFVHTEFNYNRLLKSGGPKSIS
GSPTFRFETIPDGLPPPENPDATQNIIELCISTSKNCSIPFRELLNKLNNSSDVPPVSCIVSDAI
MAFSVEVSEELGIPNVFFWTVNAFTLMTYLHYSRLRELHKDVLNGSENGYLNYVIDWIP
GTGSIRMRDSSSLIWSPDLPDSFVLYCIQEISRTYKASAIILNTFDELECEVLKPLSSMLNRV
YSIGPIHNLSKSVIPDNHTKYLRSNLWKEDSGCIQWLDSKQPGSVVYINFGSITVMSPQHL
VEFAWGLANSMQNFLWIIRPDLVMGDVAVLPPEFEMQTKQRGLLASWCDQEQVLNHAS
VGGFLTHCGWNSTLESLSAGVPMICWPFFADQLTNCYCICKLWGVGLEIDSDVKRDGVE
SVVKELIEGEKGKEMKKRVVEWKNKAKSATSPYMGSSYLDIDKMVNEVILSPMK.
>Pn1-48
MSDFGDPNCIAHVALIPSSGMGHLTPFLRLAVSLMSHDVRVTFITAHSTVSLAESQTLSRFF
STYPKITPAQLPLIHFDDPSEDPFYIQYEAIRRSSHLLLPILSSILPHLSALITDMSLASTVIPV
TKALSLPNYVLFTSSAQMSTLFLSYHTIFGSRDSEENNLFTIPGLQPLPKSWIPPPLLSDKD
SLFRTQLMDNGKKMLQADGILINTFESFEQEPLAALNDGKVVTGLPPVTPIGPFAPCDFER
GQPLEWLEGQPMGSVVFVSFGSRTAISREQIRELGNGLLRSECRFVWVVKENKVDREDD
KELGELVGKDFLAKVKDKGLVVNHWVNQGEILRHPAVGGFLSHCGWNSVTEAMWHGV
PMLAWPQHGDQKMNAYVVSRAGLGIWVENWGWGGGEMVVKGEEISKKVREVMDNEL
LRVQVAQIRERARAAVADGGSSNKRVVELIQMWKNTRVT.
>Pn1-49
MGTRQSQQMYDISFLALHTQVKTMGAELIFIPSPGVGHLVSTVEIAKLLISRDERLSITVLV
MKFPHDTGVTAYTRSLQKDAPNRIVFVDLPQNESLISTPKSLFTSFIESQKSPVRDSVRQIV
SRSDSNKLAGFVIDMFCTPMIDVANEFGVPTYVFFTSGAAFLGLQFYHLSLSDEHNQDLA
EYKDTDVELSIPSFINPVPAKVLPSVILNKEGSTMLQSISRRFKEAKAILVNTFAELEPHAIK
SLGDNCKIPPIYHVGPIINLKNKEGTTQNHSSEDGIISWLDNQPPSSVVFLCFGSFGSFDEG
QVREIAHGLEQSGQRFLWSLRRRPEKMELPKDYENPEQVLPEGFIERTSGMGKVIGWAPQ
TAILSHPAVGGFVSHCGWNSTLESIWCGVPMATWPIYAEQQINAFELVKELGMAVEIKME
YREDYFFAPENNVVVTADQIEKGMRCLMMDGETEMRKKVEEMKEKSRMAMVEGGSSY
ISLRHFIEDVMGN.
>Pn1-51
MGAELIFIPFPGAGHLISVVEIAKLLISRAEWLSITILVMKLSLDTGVAGYTQSLQKDAHNR
IVFADLPPDESVISMIKTIKTSDRLSIFRVFIESQKTQVRDAVKVIVSQSESNKLAGFVIDMF
CTPMIDVANEFGVPSYVFFTSGAAYLGLQFYHLSLDDEHKQDVIQYKDSDVELSVPYFVN
PVPAKVLPSVMLIKEGSTMIQSISRRFKEAKAILVNTFLELEPHAIKCLADNGKIPPVYHVG
PIVNLKSKEGTTQNHNSEDGIISWLDNQPPLSVVFLCFGSLGSFDKEQVREIAYGLEHSGQ
RFLWSLRPPPQPTELFGLPKEYENHNEVLPEGFIKRTTGIGKVIGWAPQVVVLSHPAVGGF
VSHCGWNSTLESIWCGVPMATWPIHAEQQLNAFELVKELGMAVDIKMDYRKDSSIATEP
LVVTIHCCQ.
>Pn1-52
MTTVEEVFVVTSCGSGHLFPCIELCKHLSSRNFKTTLVLPSNLSSSLPSSFLHHHPLLSTAEI
AAASGGPPMLGSDPARHQMGLDLEAHLTRRSTESDSARPICAIVDFQMGWTKHVFWKFS
VPVISFFTFGACAAAIEWGAWKADAGKLNVGEVLNIPNLPNEMTVTFSDLKGRATGPSQ
AGRPGPSNTSLARLGGGGPPKPGDRPPWVPLIEDSIGLMFNTCDDLERPFIDYMANQMA
MPVWGVGPLLPDKYWQSSDRLISDRQIRGPKHPVNYNEDEVIQWLDSKPRGSVLYVSFG
SEVGPTSEECSQLASALEETTHPFIWVIQQKPGLPKSNANANANANENVGVGVGGGLIIK
GWAPQLLILSHPSTGGFLSHCGWNSTVEAIGRGVPFLAWPIRGDQIYNAKLVVNYLKVGY
MAMAEGGDDTLGISKKELILQGIDKLMSDEDVHKRAMAIRAKFHASFPKSSGEALDAFR
GVLL.
>Pn1-53
MTTVEEVFVVTSCGSGHLFPCIELCKHLSSRNFKTTLVLPSNLSSSLPSSFLHHHPLLSTAEI
AAASGGPPMLGSDPARHQMGLDLEAHLTRRSTESDSARPICAIVDFQMGWTKHVFWKFS
VPVISFFTFGACAAAIEWGAWKADAGKLNVGEVLNIPNLPNEMTVTFSDLKGRATGPSQ
AGRPGPSNTSLARLGGGGPPKPGDRPPWVPLIEDSIGLMFNTCDDLERPFIDYMANQMA
MPVWGVGPLLPDKYWQSSDRLISDRQIRGPKHPVNYNEDEVIQWLDSKPRGSVLYVSFG
SEVGPTSEECSQLASALEETTHPFIWVIQQKPGLPKSNANANANANANDNVGVGVGVGG
GLIIKGWAPQLLILSHPSTGGFLSHCGWNSTVEAIGRGVPFLAWPIRGDQIYNAKLVVNYL
KVGYMAMAEGGDDTLGISKKELILQGIDKLMSDEDVHKRAMAIRAKFHASFPKSSGEAL
DAFRGVLL.
>Pn1-54
MEINRHRKLTPMAAGNDNLHVVMFPWLAFGHLIPFLELSKSIAQKGHKVSFISTPRNIDR
LPAVPTNLAPLITFVKLKLPRVHKLRQDAEATTDIRTDDIPYLKTAYDGLEPGLTRFLETESP
DWIIYDFSPHWLPPVAARLGVSRAFFSIFNAWFIAFSGSSSEAMINGSDQRIRPEDFTVPPS
WIPFPNNVAYRLHEINWVMGSSTDNESGVSDFYRAGSVISGSEVVFVRYCNEFEPEWIKL
LEELHKKPVIPLGLMPPSAQDRVGDQNEAEWLTIKEWLDSQDKGSVVYVALGSEATLSQ
NELTELGIGLELSGLPFFWVLRKPPGSEESNPIELPDGFLERIRDRGIVWMSWVPQLKILGH
ESVGGFLSHLGWSSIIEGLMFGRPLVMLPFLVDQGLNARVLVDNKVGIEVPRNETDGLFT
RDSVAESLRLVMVEEEGRIYRERTKELSGIFGDKELHNRYMDTSIKYLENHTK.
>Pn1-55
MEHEDQKPHAIFVTYPLQGHVNPSVHLAIKLAQRGFTITFINTHYVHLKRSEARSTTGGG
DEDEDIFAGARNSGLDIRYVTISDGLPVGFDRSLNHDQFMACLLHVFSAHVEEAVEKLIM
KSGPPVNCLIADTFFVWPSKLAKKFGILYISYWTETALVFTLYYHVDLLRLNSHFGCIDAR
EDAITYVPGVQYIEPKDMTSYLQDIDTNTVVHQIIFNAFKDVRGADFVLCNTVHELEPDTI
SALQSKIPFLAIGPIIPFGLNKNPVPTSLWAETDCSQWLDSRPHGSVLYVSFGSYAHLTKND
LVEIGNGLLDSKLSFVWVIRPDIVSSDDPEPLPDGFREEMGERGIIITWCNQIQVLGHPAIG
GFLSHCGWNSVLESIWSEVPVLCFPLNTDQFTNRKLVVDEWKVGINLCDRKPVAKFEVS
EKINRLMSGKSGDDFRKKVRQVKKLLENALGSTDGSSEKNMKQFISELKVKIYSKFRNA
RSIVCPTANN.
>Pn1-57
MESPNRPHAVCIPLPAQGHINPMLKLAKLLHSKGFYITFVHTEYNYNRVVKSRGPDSVHG
PNDFGFETVSDGLPWTNQRRIIDLPDLCVSMPIHLLQSFRGLITKLNANSSSNIIPPVSCIVS
DGVMSFTMEVAEEFGIPEIFLFTPSACAMLGYLHFDELIERGYFPLKDESCLSNGYLDTEID
WIPAMPGIKLKHLPTFIRTTDKNDIMFNYNLESIQNALKTKTLILNTFDELEQQVLDAIKIK
FPSLYTVGPLSLLHQQHCQTKLAKSENESNLWEEDMSCLEWLDKKAPKSVVYVNYGSFV
IMTREQLSEFAWGLANSNYTFLWVIRPDLVLDGGDDIIISKDFMEKIGDRGMLVGWCPQE
EILRHPSVGGFLTHCGWNSVLESICEGVPMICWPFFADQQLNCVYLCREWGIGLEIDSDV
RREKVERLVKELMEGEKGNVKREKALEWKERAENATKNGGSSYINLDLLVMHLKQESV
GKNISKP.
>Pn1-58
MATEDPKLHVLILPYVTPSHMMPLVEIGRLIAARGVNITIIATPHNANLFRSSVDQDINSGH
QISIHELKFPSAEVGLPEGIENLSAITSTDMSAKVFEGIMRLRKPMEDLIRNLSPDCIFSDMF
YPWTVELAEELKIPRLMFYPSSFFYYCLSHSLKLYAPHDQKVQSDAESFLIPHLPDTIEMK
RSQLQDHVKGKSRLGVFMDAIKNSELKTYGIVHMTFYELEPAYADHYIKIKPAKFWGILP
LFQFFKGLKAPRSNDSQHNCLSWLDTQKPNSVVLLSFGSLVRFPDAQLTEIALALEASTHS
FIWVVRKSEANRENQEKSWLPAGFEERMMEGNKGMMVRGWAPQVKILAHPATGAFMT
HCGWNSVLEAVAAGVPLITWPLFAEQFYNEKAINEVLKIGVGVGAEVWNPTFEITCPPVG
RDKIEKALSKLMGGSEESQKIRQKAKEMAAMAKGAVEEGGSSYNNITALIEELKACAFE
KSKNG.
>Pn1-61
MNTLLLFHISFEQISKVLYLTFVSKASMAAKPFHIMMFPWLAFGHMLPYLELTKKFAQNG
IKVSFVSTLRNLQRLPPIPPNLANEIKLVEIPLPSVEGLPENCEATMDLQKEQVQHLKKAYD
MLPAAFEKLLQKDLPDLLFVDFSPYWIPEVAEKFGVQSAFFSVYTAATLAYLGPPNVLKS
GENRRSPQEWFTFPSLVTHISDYEHSRLKKAHFPDASGISTGQRLGRIVERCDFVAIRSCKE
FEGEYIDLVQQLYQKPVLPIGVLPPVLTDNKALDAINSNWSNNFKWLDEQRHKSVIFVGF
GSEYKMPIEQVHELAFGLELSNLPFLWILRKPDAIHSNALLPSGFDIRTSNRGQVCLGWAP
QSKILAHPSIGGCLFHSGWGTIIESLSFGHPLILMPMVADQGLNAKLLVEKGLGYEVPRNE
DNSFSRDMVAKSLKIVMTTQEGNPLRFKAGQTQTVFGNQHLHENYINKLIQYLGKFKRS
EDGVHKLLKKKSKIKDLV.
>Pn1-63
MEKLELIFIPTPMMGHATQALHLANLMINRFDHLSITVLIMKLPVDPISTNFMESLGSPAAT
SSSNVNSEVDRIRFHHLPPSKITSDCCYRTPGVLLDLVIEDHKDHVRNYLVTRKSTPAAAL
VVDMFCTTMMDVGNQFGIPTYVFFTSGAAFLQLLFHLQILHDVDNGDVSELINSDTELVI
SGFVNPIPPGVLPFVLIDKYQWSTRFLKYARACRKANGIIVNTFVELESHALNSLHGDGAT
KSPPIYPVGPIINHAQMNLADDRDHDIMRWLDDQPRSSVVFLCFGSMGGFDMEQVREIA
NGIELSGYRFLWSLRHPAEKGKSLFPTDYSCIEQIFLPDKFFERTAHVGKVIGWGPQLKILA
HEAVGGFVSHCGWNSILESLWHGVPIATWPIYSEQQLNAFEMVRELGLSVEIKLDYHNY
MDNDMNKVLVRAEEIERGIRSVMDGGNEVREKVKRMRDKSRMSMEEGGSSYKSLELLI
EDMKV.
>Pn1-66
MDYQEKILPPHVLLFPLPLQGPVNCMLKLAEFFSLAGLHVTFLNTHHIHHRLLKHAHIQS
RFAPYPKFRFETISDGLPEDHQRSGDRFFDIVDGLQLVTEPLFKEMLASGSLSSKSGTPITCI
IADGFFTFALDVAKEVGIPLFYFQTISPCALWVHLCLPKLIELGEVPFKGNDLDAKITCVPG
MEGFLRRRDLPSFCRNNDLNDPIMQHVLKEDQELPRAQGLILNTFEDLEKPILTHIRTICPN
LYPIGPIHTLHKSILSANKSPPQVNSSNSLWEDDRSCMTWLDAQPPNSVIYVSVGSLAVMT
RDQLMEIWHGLVNSGKRFLWVRRLGSIAREDDQKYETPTELSDATKERGCIVSWVQQEE
VLGHPSIGAFLTHSGWNSTLESIVEGVPMICWPYFVDQQVNSRFVGEVWRIGVDMKDSC
DRGIIERMIRDVMEVRKDEFQESAKVMAKLARQSVCEGESSYYSLDRLIKDIRKMTLSTP
HT.
>Pn2-7
MATQKCFRVLMFPWLAHGHISPFLELARKLIEKGNFFIYFCSTPINLISIKKKLSGDDHQNY
TKSIQLVEHNLPTLPQLPPHYHTTDGLPPDLNSTLRKAFEMSKLSFPNTLNTLKPDLLICD
DLFQWPEIVASSHDVPVVRFQTCSVTAGSFIAHTFRNPDVTYPFPSIYLHEYEIDQIRRCVD
AVFESGKEESGNLLVVNTSKAIEEKYFDYYSLLRGNTKIMPVGPLIQQAPNGDEDMKVIE
WLDKKDPCSTVFVSFGSEYFMQKEEVEEMAHGLELSNVNFIWVFRAPVGAEKVKLPLG
FVERVGGRGIVMEGWAPQARILGHSSIGGFVSHCGWNSVLESINFGVPIIGMPIKFEQPMN
ARLVSELGVCVEIVGDETRRFGREEVVNVIKKVVGGKIGDDLRRKVKELGATIKEKQEEE
MDDVLDELVQICNKKKRIVV.
>Pn2-9
MDTNSNSFRVVMFPWLAYGHVSPFLELAKKLSTKNFSVYFCSTPITLKPIKNKISNYKSIEL
VEYPLESTPEFPPHLHTSNGLPPHLMPTLKKYFENASHNFSQIIKTLSPHLVIYDYLMPSW
VPKFASSHQIPAVHFHIFGVANLAYFTCLVRDIPGFSFQSKTVCLKPSEIMKLVQAPRDNVE
AEENELSDCIIGSTEMFLIKSNREIEGKYLDFAADLFKKKIVPVGPLFQEISVNNQENDEEIF
RWLNKKEEFSTVYVSFGTESYLSKKGMEELANGLELSKVNFIWVIKFPEGEKINAAEALP
EGFLERVGEKGMIVERWVPQAKILGHKSIGGFVSHCGWSSVMESASVGVPIIALPMHHD
QPVNARLVVEVGFGLEVEKDENVEFWREEVARVVKEVVIEKSGVELRKKAKELSEQMK
AKGEEEVDLAIKELKTLCENNLGKYRDIN.
>Pn2-19
MATEEASLRVMMFPWLGYGHVSPFLELAKKLSAKNFHIYFCSTPINLNSIKNRIDSFSSSIE
LVEFHLPSSPELPPHYHTTNGLPHHLHKTLLQAFNMSKPNFSDILNNLKPNLLIYDTYQPW
VPEIASSHHIPAVNFHCTGAASSCFFYSNFKLQGKGLQFNFPAIYLRESEIRKMIASAPYDT
EAAEDPIYTCVEKSLDFVLVKSCRAIEDKYINFFSQLLNKKMVTVGPLAQSGEEEEEKDS
VIMEWLSQKGKSSTVFVSFGSENYLCNKKMEELAYGLELSNVNFIWVIRFPSGGEKIKIG
DVLPEGYLERVKERGLVVEGWAPQAKILGHSSTGGFVSHCGWSSVTESLSYGVPVIAIPM
NFDQPLNARLMVELGAGLEVLKDEDLEFDREEVARVIKEVVVEKSGEGIRCKAKELSEEI
RMKEEGEMCGVVEELEKIICNKNLDE.
>Pn2-20
MRVLPILLPAQTTTTTTMDSSSQLQQLHFVLIPLMSPGHLMPIVDMARLFAQHGVIVTIVS
TPLNTKRFKTIVDRAIDSGLQIRIIDLYFPAAEACLPQGCENMDSISRNLIKNFFMASSMLQ
QPFDQLFDQLSPRPSCIISGKNQAWTVETARKFNIPRLFFDGMGCFSFSCTHNLKMSEEFQ
RVTSKFETFLVPGLPHEIELTKAQLPEALNPGGSGDLIDVRNKMTAAESIADGIIVNSFEEL
EPEYVEMYTRAKGGNIWCIGPVSASNKLILDKAERGSFAPTENEIQCLEWLDLQEPNSVV
YACLGSISGLTASQLVELGLGLEASKRPFIWVIRGGERSKELERWIKQERFEERTKGRGLLV
RGWAPQLLILSHSSTGGFLTHCGWNSTLEGVSAGKPIIACPLFAEQFYNEKLVVKVLGTG
ASVGVEAAVTWGMEDQFGLVMKRENVEKAIQEVMDKGVEAEERRKRAREFGDMAKR
AIEEGGSSYLNIRSLIQHVKEKNELKHACT.
>Pn2-21
MDTEKLHVAIVASPGLGHLTPVLLLGNRLATHHNIHVTVIVVTTQLSTAESQILHPSTAPKL
LNIVQVPHVDISGLIDANTKVVTQLSIMMRETRPRIRSAISAMNHRPDALIVDLFGSELLPI
AEEFCMPRYIFVTSTAWFVALTTYCQVLDKEIQGEYVDQKEPLKIPGCRPVHPEDVVDPM
LDRSDQQYREYIRHGVEYSMFDGILMNTWEDLEPTTIQALRNDEILQTVVKVPVYPIGPL
STPVKPASQKSELIEWLDTQPSDSVMYVSFGSGGTLPAEQVNELARGLEQSQQRFIWVVR
PPIESHADGTFFTSGNGPDGTPAYLPERFLSRTHKLGKVVPLWAPQMEILSHPSVGVYLSH
CGWNSALESISNGVPMVAWPLYAEQRLNATLLTEDLGVAWRPTVLPGKKVVEREEIEKM
VRNVMQYREEGKGAWERVKEVKCSGDRALSKGGSSYNSLCEVIKDCERRLKKLI.
>Pn2-28
MENQGLRKCLHVVLVPAPLQGHISPMLQLGDVLHSMGFSIIVAHTIFNSPNPSNHPEFIFLP
ISDNLSDHDTSPGNRLALFKSINKNCEEPLRQSLAHIMHQQELGDRVVCIIYDQIMYFSEA
VASHLKLPCMNFRTISASFALAYKEIPRLLAEGYIPRQNSMLQDLVPGLFPLRFTDMPTDIG
SLEDVIENMELSINTRNYSAIIWNTVDHLEQTALAQFLQPRPVPLFSVGPLHKMASPSSTS
LLEVDTSCISWLDKQAPRSVIYVSIGSLAFMEEKELTEMAWGLASSGQPFLWVVRTGSEC
DSEWKLLPKGFKETIGERGCIVKWAPQRKVLAHDAVGGFWSHCGWNSTLESLSEGVPMI
CWPCLGDQKVNARYISHEWKVGQQFEHKLERNAIKQAIRGLMVNKEGENTRQNAMDI
MEKIWFSMNKGGSSYNSLNGLAEFISSFQLSK.
>Pn2-30
MYESTTKMGFAASDQKPHAVCIPFPAQGHINPMLKLSMVLHYRGFHITFVNTDFNHNRLL
KSRGSNSLDGLPDFRFHSIPDGLPPTEADVTQDIPTLCFSTARNCLVPFRELLHKLDQNSSS
LNSVDPPVTCIISDAVMTFTLIAAEEIGVPCVSFRTTTACCFMLNKQYSHLKEKVLLPLKD
ARYLTNEHLNMTVDWIPGIKNICLKDFPSFVQTTNLNDKMVEFTIGEAERASTASAVIFNT
FHELECDVLQALSSMCSPIYSIGPVRLLINKLPLSKLKPIGSNLWKEDTECLKWLDSKQLN
SVIYINYGSITVMTKEKLVEFAWGLANSKHNFLWVIRPDLIVGETAILPPEFYEEIKERGLIA
GWCPQEEVLDHVAVGGFLTHCGWNSMIESLSCGVPMICWPFFADQPTNCKLACSDWGV
GMEIGKNVRRDELEMHVRELMGGEKGKAMRKKAMEWKKKAEEATGEFGSSSLNLDKL
VRDVL.
>Pn2-31
MSKMEQTPHIAVVPSPGMGHLIPLIEFAKRLVNFHNVSVTVIVPTIGTPPKVETAVLESLPK
AINYVHLAPVSFEDLPRDSKVETTIFLAVTRSLSSLRGTIKLLVARTALVALISDPFGTDAFS
VAKEFDISPYLFFTSNAMNFAFSYMLPKFDETMSCEFRELPDPVIIPGCIPVHGGDLMDPV
QDRTNEVYKLLLHHTKQFSFAEGVLLNSFIELEEGAIKALQGKEPGKLPVYPIGPLIQTGSS
DEADPSECMKWLDNQPSGSVLFISFGSGGTLSYDQLIELALGLETSGQKFLWVARAPNDN
SSNAAFFSVESQNDPLSFLPKGFLNRTKGQGIVVSSWAPQTKILAHGSTGGFLSHCGWNS
TLESVVHGVPLIAWPLYAEQKMNAIMLAKGLKVALRPKVNQNGIVERDEIGKVVKSLIEG
EEGKKVRSRMKEVKDAATKVLREDGSSTKTLSELVKKWKNKIAPIAS.
>Pn2-33
MENRQILPPHVLIFPLPLQGPVNCMLKLAELFSLAELRVTFLNTHHIHHRLLKHAHVESRF
GRYPEFRFETISDGLPADHQRSGDRFIEVINGIEAVTKPLLKEMLASGCLSSKSGRPVTCIIA
DGFLSFVLDIAKEVGVPLLNFDTISPCALWTYLCIPKLIEVDELPFKGNDLDAKITSVPGME
NFLRRRDLPSFCRNNDLNDPIMQHVLKEDQELPRAQGLILNTFEDLEKPILTHIRTICPNLY
PIGPIHTLHKSILSANKSPPQVNSSNSLWEDDRSCMTWLDAQPPNSVIYVSVGSLAVMTRD
QLMEIWHGLVNSGKRFLWVRRLGSIAREDDQKYETPTELSDATKERGCIVSWVQQEEVL
GHPSIGAFLTHSGWNSTLESIVEGVPMICWPYFVDQQVNSRFVGEVWRIGVDMKDSCDR
GIIERMIRDVMEVRKDEFQESANVMGKLARQSVCEGGSSYYGLDRLIKNIKKISLSTSQA.
>Pn2-35
MAQQTIPPHVLIFPFPLQGHVTTMLKLSELFCLAGIHVTYLISTQNYIRLVRCTNVLSRLSP
YPGFRFQVLPESVSDGNAQSMDLVMNLFESLKTAKTFVKDMLIGEAQKPVTCIITDGVMS
FTLDLGEEIGIPIIYFRTISACSFWAYFCMPKLVEAGELPLKGDDMDVLIKNVPGMEGFLRQ
RDLPSLFRVSELNNSAFKMWSTETHQTIRAQALILNTFEDLEGPILSQIRTQCPNTYTIGPL
HAHLKARLDGKTASQLASSNSLWEEDKSCIKWLDNQPLKSVVYVSFGSIAMVTREQLVE
FWHGLVNSEIRFLWVIRPDSVTDEEMGSEIQAELEKATKGRGYMVEWVPQEAVLAHQAV
GGFLTHSGWNSTLESIFEGVPMICWPYFADQQINSRFVGEVWKVGLDIKDTCDRVVIEKA
VRDMMEVRKEEFVESADQIAEMARKSVGEGGSSYCNLNRLIEDIMLMTLGGITP.
>Pn2-39
MSMSNKHGPGQVIVVMVPFPAQGHLNQLLHLSRLISAYDIPVHYVGTTIHNRQAKLRVH
GWDPLSVSNIHFHDFQTPSFPSPPPNPNSPLGHLLPSFELASSHLRKPVASLLQQLSSAAKT
RLVIIHDSLMSSVVQDLVLVPNAEAYTFHSVSAFTVFLHLWENLGRPFAVQSDILKDLPSL
DGCFTSDFANFISSQHDCMKFNSGKIYNTCKLIETPYLDLLAKEQISKNKKQWPLGPFNPV
SVSQNQTPNLQHQCLKWLDKQATSSVIFVSFGTTTSLRDQQIQELALGLEKGAQNFIWVL
RDADKGDVFGGEVRRAKLPKGYEERMKGKGMIVRDWAPQLEILAHPSTAGFLSHCGWN
SCMESITMGVPILAWPMHSDQPRNTVLITKLLRIGLVVKNWSLRDELVVAETVGDVIKKL
MTSQEGGEIRRRAEKMGGDVRMSVAGGGVSRLELDSFIVHISR.
>Pn2-47
MGAELIFIPSPGVGHLVSTVEIAKLLISRDERLSITVLVMKFPHDTGVTAYTRSLQKDAPNR
IVFVDLPQNESLISTPKSLFTSFIESQKSPVRDSVRQIVSRSDSNKLAGFVIDMFCTPMIDVA
NEFGVPTYVFFTSGAAFLGLQFYHLSLSDEHNQDLAEYKDTDVELSIPSFINPVPAKVLPS
VILNKEGSTMLQSISRRFKEAKAILVNTFAELEPHAIKSLGDNCKIPPIYHVGPIINLKNKEG
TTQNHSSEDGIISWLDNQPPSSVVFLCFGSFGSFDEGQVREIAHGLEQSGQRFLWSLRRRP
EKMELPKDYENPEQVLPEGFIERTSGMGKVIGWAPQTAILSHPAVGGFVSHCGWNSTLESI
WCGVPMATWPIYAEQQINAFELVKELGMAVEIKMEYREDYFFAPENNVVVTADQIEKGM
RCLMMDGETEMRKKVEEMKEKSRMAMVEGGSSYISLRHFIEDVMGN.
>Pn3-1
MKNPELIFMPAPFMGHLPSMLEMAKILVTRYETLSITVLLIKLPFDTGLSAYIQSLSSTPIPR
LTLVELPQSDPSTYMSKPPNTILYSLIEMQKIYVRDIVQDIFTSESTRLAGFFIDMFCVTMID
LAAEFKVPTYVFFAASAAFLGLMFHVQTLSDEYGKDVTELKDSDAELLVPSFRNPFPCKL
LPSMVLDNQGGCDVLLSMAKRFRETKGIIVNTFMELESYAIESLIASDRKIPPVYPVGPVL
NSAGVGYNSDETAVIIQWLDDQPVSSVVFLCFGSMGSISNDQVKEIAFALEHSNHRFLWS
LKPPLSKGAAKPSQDTSNLKDILPVGFLERTAGIGKVIGWAPQVAVLSHVAVGGFVSHCG
WNSILESVWFGVPLATWPLYAEQQLNAFEMVIEYGMALDIKMDYRYDGNMKNCVIVKA
EEIESKIRELMMDINRNELRKTVKDVQEKSRNAILEKGSSYVSIGSIMEEII.
>Pn3-4
MLPWLAFGHMIPFFHLSLALAKQGIRISFVSTPKNITRLPKVPAELAHLINLVKLPLPNVAT
SNYILPNDAEATVDLLPENVQYLKIAYDLLAQPFKQFVSGQLPDWIIVDLIPHWAVDVAQE
CGVPLIFFSAYNAAALAFVGPPEFLTGDAQKRVRPSPESLTISPEWISFRSSVAFRMYEAIGF
HPGLYGENASEIKDAQRVASVIQGCYAMAIRSCPEVEGEYLNLLGNIFGKPVIPVGLLPPAP
PGERENVVPWNLMFNWLDKQKARSVVFVGFGSECKLSKDQVNEIAYGLEMSELFFIWV
LQKPDWTSHDVTTATPPGFTQRTAEKGVVHVGWAPQKEILAHPSIGGSLFHCGWGSVIET
LQYGHCLVALPFVFDQGLTARMLVDKGLAIEVERREDGSFSRDDIAKSLRQAMVSNEGE
NLRNNAKEVALVFGDQKLHQEHYIGEFAQFLKNGIVDIDGNTGKNSEK.
>Pn3-5
MEKQGEKKASEQSTGGRLLLFPPPLQGHISPMLQLANILHSKGFKITIIHTNFNSPNKSNYP
HFTFEPISESLVENEFEGSSTADFGNVLILIKLLNNTCVDPFRDCIVRLLSEEPILCLITDALL
VFTQAVADKLKVPRIVLRTGSLSSFHVFHAHPLLREKGYFSMEDYKSEAPVAEIPPLKVKD
IPMVFTHEEESLHEIIEAMMKATKAASGLIWNSFEELEQSSLIGIRQDFPIPNFTIGPFHKYF
VASSSSLFTQDQTSISWLDLQKPESVLYVSFGSILTIDKSDFLEMAWGLANSMQKFLWVVR
PGVIRGSEWLEPLPDGLLEAVHERGHIVKWAPQQQVLAHPATACFWTHNGWNSTLESICE
GVPMICSPCFGDQPVNARYVEGVWKVGVVLENGLERREIESGIRRLMVDEEGKEMKRRT
KCLKEKVNLCLKQGGSSNQSLQNLVSYIMSL.
>Pn3-6
MAEQTIPPHVLIFPFPLQGHVTTMLKLSELFCLAGIHVTYLISTQNYIRLVRCTNVLSRLSP
YPGFRFQVLPESVSDGNAQSMDLVMNLFESLKTAKTFVKDMLIGEAQKPVTCIITDGVMS
FTLDLGEEIGIPIIYFRTISACSFWAYFCMPKLVEAGELPLKGDDMDVLIKNVPGMECFLRQ
RDLPSLCRVSDLNNSAFTMISSETHQTVRARALILNTFEDLEGPILSQIRTEFPNTYTIGPLH
AHLKARLDGKTASQLASSNSLWEEDKSCIKWLDNQPLKSVVYVSFGSIAMVTREQLVEF
WHGLVNSGIRFLWVIRPDSVTDEEMGSEIQAELEKATKARGYMVEWAPQEEVLAHQAVG
GFLTHSGWNSTLESIFEGVPMICWPYFADQQINSRFAGEVWKVGLDIKDTCDRVVIEKAV
RDMMEVRKEEFVESADQIAEMARKSVGEGGSSYCNLNRLIEDIMLMTVGGIRA.
>Pn3-7
MKRAKIIMVPYPAQGHVTPMLKLASGLASRGFEPVLVTPEFIHRSIAPQIGGRSEISCTKIP
DELDEGIRRDFFAIEMAMENNMPVHLERIVQKLVDEDGGRVACFVVDLLASWAIKVGCD
RGIPVAGFWPGMLEAYHLITAIPDMIQTGIISETGIPQYQGPVFFKSLKPMLSTEDLPWLIGT
SVERKSRFKFWTKTLDRSKALQWLLVNSYPDHDEDDDDDEIKTQQMIMQVTNYDSQGD
SPHILPVGPLSNDYARMKNASFWEEDVSCLDWLGKHKDGCVVYISFGSWVSPIEEGKVK
SLALALEASMRPFIWVLGCNWRQGLPSGYMERVWKRGKIVSWAPQMEVLQHKAVGCY
LTHCGWNSTMEAIQCRKRMLCYPVAGDQFLNCAYIVKVWRIGVKLSVFGQRDLEDGIER
VMEDDEMSNRLMRLNERVMGKEANSRMMDNLATFTDFVSQQNLNS.
>Pn3-8
MEMKSYGAHVLALSYPSQGHINPMLQFCKRLVSKGLKATLLITNFISTSMNPKSTSVRLD
TISDGYDEGGFMQAANVEAYLETLQSAGSKSLSELLNKYKKSGIPIDCVIYDAFCPWALD
VTKEAGLVGAAFFTQPCAVNYIYYYAYHGLLTLPVDSGSLPVSIPGLPLLELQDLPSFIYVH
GSYPAYFKLVLNQFLKVEKADYIFVNTFYKLEAEMVDTMSKLIPLITIGPTIPSFFLDNEDE
NDKEYGLNLFHLGSSNPIITWLNTKPKGSVVYVGFGSMANLSEDQMEELAWGLKESNY
NFLWVVRDSEEAAKLPQNFAQKMAGKGLLVTWSPQLEVLSNEAVGCFFSHSGWNSTIEA
LSLGVPMVVMPQWTDQTTNAKLVQDVWKVGIRVRVDEKGVVGREEIEYCIREVMEGNR
GKEMKDNAMKWKGLAKEAVSEGGTTDKNIDQFVSKLSTRSSSHI.
>Pn3-9
MAAGNDNLHVVMFPWLAFGHLIPFLELSKSIAQKGHKVSFISTPRNIDRLPAVPTNLAPLI
TFVKLKLPRVHKLRQDAEATTDIRTDDIPYLKTAYDGLEPGLTRFLETESPDWIIYDFSPHW
LPPVAARLGVSRAFFSIFNAWFIAFSGSSSEAMINGSDQRIRPEDFTVPPSWIPFPNNVAYRL
HEINWVMGSSTDNESGVSDFYRAGSVISGSEVVFVRYCNEFEPEWIKLLEELHKKPVIPL
GLMPPSAQDRVGDQNEAEWLTIKEWLDSQDKGSVVYVALGSEATLSQNELTELGIGLELS
GLPFFWVLRKPPGSEESNPIELPDGFLERIRDRGIVWMSWVPQLKILGHESVGGFLSHLG
WSSIIEGLMFGRPLVMLPFLVDQGLNARVLVDNKVGIEVPRNETDGLFTRDSVAESLRLV
MVEEEGRIYRERTKELSGIFGDKELHNRYMDTSIKYLENHTK.
>Pn3-11
MGSFGKAENSRPHAVCVPFPTQGHINPTLKLAKLFHHRGFHITFVNTEFNHQVLLKTRGP
HALDRLPTFRFETIPDGLPPSNARATEDIPLLFDSTSKNCLAPFRELLLKLNNPLSSPDIPPV
TCIVSDFVMGFTLKAAEELGIPKVLFWTASACSFMCFLHYGQLIEKGFTPLKDSSYLTNG
YLDTVVDWIPGMKGIRLKDLPSFIRTTNPNDPMINFGLADTDKYYKSSPQIILNTYDTLEH
EVLDALSSFLPSNIYSIGPIHHQLNNIHKSSTNSIELNLWKDEQQCLAWLDSKEPNSVVYV
NFGSTTIMSPDQLVEFSWGLANSNQTFLWIIRPDLVNGDTTVLPPDSVEAIKERGLFASWC
SQEQVLRHPSIGGFLTHNGWNSTLESITCGVPMLCWPFFADQQTNCWYCCTRWGIGMEI
NSDAKRDEVESLVKELMVGVKGKKMKEKAMEWKKLAEGAITPPAGSSYINFEKMVND
VLLTNNISE.
>Pn3-15
MGAELIFIPSPGVGHLVSTVEIAKLLISRDERLSITVLVMKFPHDTGVTAYTRSLQKDAPNR
IVFVDLPQNESLISTPKSLFTSFIESQKSPVRDSVRQIVSRSDSNKLAGFVIDMFCTPMIDVA
NEFGVPTYVFFTSGAAFLGLQFYHLSLSDEHNQDLAEYKDTDVELSIPSFINPVPAKVLPS
VILNKEGSTMLQSISRRFKETKAILVNTFAELEPHAIKSLGDNCKIPPIYHVGPIINLKNKEG
TTQNHSSEDGIISWLDNQPPSSVVFLCFGSFGSFDEGQVREIAHGLEQSGQRFLWSLRRRP
EKMELPKDYENPEQVLPEGFIERTSGMGKVIGWAPQTAILSHPAVGGFVSHCGWNSTLESI
WCGVPMATWPIYAEQQINAFELVKELGMAVEIKMEYREDYFFAPENNVVVTADQIEKGM
RCLMMDGETEMRKKVEEMKEKSRMAMVEGGSSYISLRHFIEDVMGN.
>Pn3-19
MRNMKTAELVFIPAWSISHFVPMMEMAKLFVARYEQLSVTILFMKYPVEIGLGSYIESLLS
TTIPHFRFLEVPQTDPNTYMSKPPHTRFSAFIETQKAHVRDIVFDIARSETTQFKGCIIDLFC
ITLADIADELNVPSYVFSAPSASFLGLMFHLHSLVDEHNIDITEYKDSDAELFVPSYTNPVP
AKVLPSVLLDKEGGSALFVSIARRLRETKGIVVNTFMELESHALKSLAVDSKIPVFYPVGP
VLNLTGEGKNSEEKTIILRWLDEQPPSSVVFMCFGSFGSFQEDQVKEIALALERSKHHFLW
SLRPPRPKDKTKVPLEYSNLEKVIPPGFLERTAGIGKVIGWAPQVSVLSHQAVGGFVSHCG
WNSILESLWFGVPMGTWPLYAEQQMNAFKMVNELGIAVDIKIDYRNDHNMKTEVIVRAE
EIESKIMEMMMDENRSERMKVKEMKEKSRLAVMEKGSSYAAIGRLIEEIIM.
>Pn3-21
MFPWLAFGHMIPFLELAKLIAHKGHKISFISTPKNIDRLPKQLPPNLSSLITFVKLPLPHVQ
NLPENAEATSDVPLDKVKYLKIAYDGLQQSITEFIQNSAPDLILFDFASYWLGPLASKLNIC
SAFFSIFTAAALGFIGSAESMISGDYRKKPEDFTVLPKWVTFESTVKFKLFEVLRNRDNEN
VSDLYRMGATIKGCDFVAIRSCYELEPEWLRLLEDLNEKRFIPVGQLPTTSFENGDDVKDV
AWREMKGWLDMQPKGTVVYVAFGSEAKPSQTELTEIALGLELSELPFFWVLRTQRGKV
DTEVIELPQGFEERTKGRGVVCTSWAPQLKILSHDSVGGFLTHSGWTSVVEAIQFEKALIL
LTFLADQGINARVLEKKKMAYSIPRDDGDGSFTRDSVAESLKLVIIEKEGQIYRDKVKEMK
KVFVDKNRQEEYVNNLLGYLQTTRKIKRELI.
>Pn3-24
MGGQLHVFFFPMLAQGHMIPMLDIAKLFASRGVNATIITTPLNAPVFSKAVEKTKQLGFE
LSLRIIKFPAVPGLPEGCENADQLTSPEMFPTFFNATILLKEQVEQILQEYRPNCLVADMFFP
WATDVAAKLDIPILVFHGTSFFALCATEQVRLHKPFNNLLSDSEPFVLPNLPHDVKLTRSQL
SPHVRAAVETEFSKMHIQIRESEQRSYGVIVNSFYKLEPDYADHYRDVLKRKAWHIGPVS
LCNRGIEDKAQRGKKAAIDEDECLEWLDSKKPNSVIYVCFGSVCNFPSSQLYELAMGLE
NSGQEFIWAVRKGKNEEENEKWMPEGFEERIKGKGLIIRGWAPQVLILDHEAIGGFVTHC
GWNSSLEGITAGVPLVTWPSFADQFFNEKLVTEILRIGVAVGAKEWAIGGGRGGVKRDAIE
TAVKRLMVGEEAEERRNRAKALKEMAIKAVEEGGSSYLDLDALIKELSFYHS.
>Pn3-25
MSQSPAMSKIQNQLHFVLVPLLAQGHMIPMIDMARLLAQHGVVVSLVTTPHNASRFASTI
HRARDSGLKIQLIQIPFPWQEVGLPPGCENLDSVPSRDLIGNFFSALNKLQQPLEQHLQEL
MPPPSCVISDKYLSWTTKTAEKFHVPRLVFHGMCCFSLLSSHNIRLYNAHLSVTSDSQPFV
VPGMPQRVEITKAQLPGAFVTLPDLDDIRDQMREAESSAYGVVVNSFCELEQGCFEEYK
KAIAKKVWCIGPVSLCNKDNLDKFERGNKASIDETHCTEWLDSMKPKSVIYACLGSQCR
LVPAQLMELGLALESSKHPFIWVIKEGERFQELEKWLVEEEFEERNKRRGLLIKGWAPQV
LILSHPAIKAFLTHCGWNSTIEGVCSGVPMITWPMFAEQFFNEKLIVDILRIGIKVGVQVSV
RWGEEEKIGVLVKREQIQKAIETIMNGGGEEGRIRKRVTKLSKVGARAMEDGGSSHFNIS
LLIQDIWKQKNNQEEL.
>Pn3-26
MFPWLAFGHMIPFLELAKLIAHKGHKISFISTPKNIDRLPKQLPPNLSSLITFVKLPLPHVQ
NLPENAEATSDVPLDKVKYLKIAYDGLQQSITEFIQNSAPDLILFDFASYWLGPLASKLNIC
SAFFSIFTAAALGFIGSAESMISGDYRKKPEDFTVLPKWVTFESTVKFKLFEVLRNRDNEN
VSDLYRMGATIKGCDFVAIRSCYELEPEWLRLLEDLNEKRFIPVGQLPPTLFESGDDVKDV
AWREMKGWLDMQVKGRVVYVAFGSEAKLSQTELTEIALGLELSELPFFWVLRKQRGKE
DTEVIELPEGFEERTKGRGVVCTSWAPQLKILSHDSVGGFLSHAGWGSVVEAIQFEKALIL
LTFLADQGLNARFLEEKRMAYSIPRDDGDGSFRRDSVADSLKLVIIEKEGQIYRDKVKEM
KNVFVDKNRQEEYIDNLLGYLEANRKIKRELI.
>Pn3-27
MERKAMEYSGGEILVLPFYGQGHLFPSMELCIHFSSRNYSPTLIIPSYLSSSIPDSLRHHPFI
QILQIAVDKSESPPPAEQAAGGRFQHQHQQLGQGIERLLSARSGDTNQTRPVCVVFDVM
MSWSKEIFAKFGFPAISFFTSGACSAAMVYAGWKARVDEMKPGDIRVLPGLPQDMALAY
SDLDRRPPRHGGGGGNGSPLGPPNGDRLNGPPSRGMGFGPPGPGQKPRWLDEVEGSVAL
LINTCDDLEKSFINYVATQIGKPVWGVGPLLPEKYWKSAGSLLRDGEIRSSGRQSSYTEDE
VTQWLDSKSCGSVIYISFGSEVGPTVEEYEQLANALGELNQPFIWVLQPNSGKPGPPSTLF
GVQTGPDVVAEGGYYPHSLDKKVGNRGLIIRGWAPQLLILSHPSTGGFLSHCGWNSTVE
AIGRGVPILAWPIRGDQYYNAKLVVSHLKVGCMVSSGGDSAKGMKKEDIIQGIERLMGD
EEVHKRARILQGKLESGFPASSVAALDALVDFIK.
>Pn3-28
MGSLPKVTKPHAVCIPYPAQGHINPMLKLAKLLHQKGFHISFVNTEFNHNRLLRSRGPAV
LDGLPDFRFYAIPDGLPPSDPDATQDIPSLCKYTPTHSLLPFCSLISKLNDTSVSDVPPVTCI
VSDGVMSFTLKAAEKFGLPEVLFWTTSACGLLGYLQYRHLVERGYTPLKDMSYVTNGY
LDTKIEWVPGMKDIKLRDFPSFIRTTDPNDTMLNFLQTESAALPRARALILNTFDALEQDV
LDAISAIQPRIYTVGPLQLMVNQIQDHRLKSMSSNLWKEEAGCIDWLDTKEANSVVYVN
FGSITVMTAKQLTEFAWGLANSKKHFLWIVRPDIVVGDSAMLPAEFITDTENRGMLASWC
PQEEVLKHPAIGGFLTHNGWNSTVESISGGVPVICWPFFAEQQTNCRYSCVEWGIGMEID
NEVKRDEVEVLVRELMDGEKGKEMRRNALGWKKKAEEAAGHADGSSRVNLDKLVDQ
VLLAK.
>Pn3-33
MGTNPAKKAHAVCVPFPAQGHVSPVMQLAKLLHSRGFYITFVNTEFNHRRLLRSKGSAW
VNGFDHFMFETIPEGLPPSNRDATQDPAALFDSIPKNCLAPFRDLLAKLNSSPEVPPVTCIV
ADGIMSFVIEAAKEIGVPEVQFWTASACGFLGYFHYPELMKRGIVPLKDENFMIDGTLDT
QVDWIPGMIRNLRLKDLPTFLRTTDPNNVLFTYLRDEVQNCFKASATIFNIFEPLEHEALQ
ALSSMFPHIYPIGPLSMLLKNLPNSPLYSFRSNLWIEDTKCLEWLDKWEPNSVVYVNYGS
LTIMSDQHLKEFAWGLANSKHPFLWIVRPDIVMGDSAILSEDFFEEIKGRGLLASWCPQDE
VLSRESVGVFLTHCGWNSTLESICEGVPTICWPFFAEQQTNCRYSCSDDQWGIGVEVNQD
VKRDEIDALVREMMEGDKGKKLKEKALAWKKKAEEATRIGGSSYNSFDRFIKEVLLAG.
>Pn3-35
MKSKNHFLLLCLPVQSHINPTLQLAKILTRSGANVTYATTTTGLGRLGALPTIDGLSYATFS
DGKEHNGTLILDDYLAVLRRVAPQSVTKLLQDLSTKGTPVTFIVYTVVLPWVAEVARDM
HLPSAFLFIQCATAFSIFHRFFNSQDGLHDGVHDISPDVSVQLPGLPLFRCKDIPDFVFRHN
QYYSSMVPVLQEHIQTLEKDPNPFVLVNTFDALEGDIIKSFPNMKLLAIGPLLPSAFSDGN
DLDDKSFGGTLFQNPNNYLTWLDSKPDQSVIYASFGSIMQLKETQKEEILQGLMTTNRPF
LWVIRDINEGEVKSMKLNNGIADELGFIVPWCSQVEVLCHRSIGCFVTHCGWNSTVESIT
GGVPVVGCPHFSEQQTNIKMVEEVWGNGIRVRENDDGVFGREEIRRCLDIVMGEEEKGK
EIRRNAMKWKCLAMEAVKAGGSSHNNVKQFLEM.
>Pn3-36
MASDLYEHQTNDRKKSQVVVVMVPFPAQGHLNQLLHLSRLISAYNIPVHYVGTTTHSCQ
AKSRIQGWDPLSITNIHFHEFPTPPFFTPSPNPNTSTKFPAHLQPSFEASSHLRDPVASLVHA
LAPTAHKIVVIHDAFMSSIIQDVVTISNAELYCFQSGSAFNFLALKWESIGRPSSLKLETLK
NLPSMDGCFTQEFKEFATCQIRLMDKFSSGTIHDVSKVIEGEYFDLLTKEEFSGNIKQWAL
GPFNPVSISDHKPHNCLNWLDKQDPNSVIYVSFGTTTSLTDEQIHELAIGLEESEQKFIWV
LREADKGDIFEGNYARRAELPKGYEQRIQGKGMILRDWVPQLEILEHASTGGFMSHCGW
NSCMESMSMGVAMAAWPMHSEQPRNAMLITDVLKIGTLVRDWERRDELVTSLSVEKAV
RRLMVSKEGDEMRMRAAEIGGTVRRSVAEGGVTRAEFDSFISRITRMQKLI.
>Pn3-38
MENTPHIAIFPSPGMGHLIPLVQFAKRFISQHKFTATLIVPTDGPLLKAQKTFLDALPAGIAY
VVLPPVNLDDLPSDVNIETQIAITVARSLPSLRDVFKSLLATKKLVALVVDLFGTDAFDVA
NEFKVSPYIFFPSTAMSLSFFLHLPKLDEAESCEYRDLSGPIQLPGCIPIHGRDLLDPVQDR
KNDAYKWILHHTKRYRLAEGIMVNSFKDLEDGAIKALQEESDIPPVYPIGPLIQIDSSSAA
DGSECLRWLDNQPRGSVLFISFGSGGTHSSDQINELALGLEMSEQRFLWVVRSPNNETAN
ANFFNVESQKDPFDFLPKGFLERTKGNGLVVPSWAPQAQILRHHSTGGFLTHCGWNSILE
TVVNGIPLIAWPLYAEQKMNAVMLSEGLKVALRPQVDENGLVGRAKIAKVVKCLMEGEE
GKRLRNRMRDLKDAAAKVLSDDGSSTKSMAELAGKWNKN.
>Pn3-39
MENTPHIAIFPSPGMGHLIPLVQFAKRFISQHKFTATLIVPTDGPLLKAQKTFLDALPAGIAY
VVLPPVNLDDLPSDVNIETQIAITVARSLPSLRDVFKSLLATKKLVALVVDLFGTDAFDVA
NEFKVSPYIFFPSTAMSLSFFLHLPKLDEAESCEYRDLSGPIQLPGCIPIHGRDLLDPVQDR
KNDAYKWILHHTKRYRLAEGIMVNSFKDLEDGAIKALQEESDIPPVYPIGPLIQIDSSSAA
DGSECLRWLDNQPRGSVLFISFGSGGTHSSDQINELALGLEMSEQRFLWVVRSPNNETAN
ANFFNVESQKDPFDFLPKGFLERTKGNGLVVPSWAPQAQILRHHSTGGFLTHCGWNSILE
TVVNGIPLIAWPLYAEQKMNAVMLSEGLKVALRPQVDENGLVGRAKIAKVVKCLMEGEE
GKRLRNRMRDLKDAAAKVLSDDGSSTKSMAELAGKWNKN.
>Pn3-40
MEKKDSTRPHVLIFPLPLQGPVNSMFKLAELLCLAGIHITFLVTDYIHTRLLRYTDIHSRFN
RYPGFRLETISDGLREDHPRGDRIMELFDSLKTNTKPLFKELLTTGSLGSDKGRPITCIIAD
GIMGFTCDVANEIGIPIIYVRTISACCLWVFFCLPKLIEAGELPFNGDDLDTPIKSVPGTENS
LRIRDLPSFCRSGDICHPSIQLYKTERQELRRAYGLILNTFEDLEGSILSQMRTVCPNLYPIG
PLHTHLKTKLAAESTSPAISSNSLWKEDKSCITWLDSQPPKSVIYVSFGSLAIMTRDQLME
FWHGLVNSGCKFLWVIRPDSIAGDSEIPAEVAEGTKERGYIVGWAPQEEVLAHPSVGGFLT
HSGWNSTLESVIEGKPMICWPYFMDQQVNSRFVEEFWKLGLDMKDTCDRVIFEKMVKD
LMVERRDEFMKSADQMARLGKKCLSEGGSSYCNLVRLIEDIKLMGVER.
>Pn3-41
MEERKEKMIVIFPYMAQGHIIPFLSLALQIEKKGYQITFVNTPLNIKNLKQSLPLNSSIRLLE
IPFNSSDHRLPPETENTDSIPFSLTLTLLEASVSLKPAFRNLISDLVRGGAPPLAVIADIFFGW
TAEVAHEFGIFHTIFSSTGGFGMACYYSVWMNLPHNYTDSVEFTLPDFPEAGLIHRTQLSA
NVLAADGTDPSSKIIQLLLSSWVDSDGILFNTIEEIDKIGLYYFRRKLSLPVWPIGPILLSVD
SRARSNKVCGISSESCINWLDSKPQNSVLYISFGSQHTISASQMMQLAKALDSIDINFIWV
VRPPLGFDMNLEFDAVEWLPEGFLKRIEEQNRGLIIVKWAPQVEILLHKAVAAFLSHCGW
NSVLESISAGVPLIGWPMGAEQFYNVKYLEEEVGVCMEVARGTNFEVRNEDIVKKIGIV
MGENGKGKEIREKACEVKKMIENGGRDEEGYKGSSVRAMEEFLNVAAFFGKEKKVRGQ
KCRMNT.
>Pn3-42
MEKKDSTRPHVLIFPLPLQGPVNSMFKLAELLCLAGIHITFLVTDYIHTRLLRYTDIHSRFN
RYPGFRLETISDGLREDHPRGDRIMELFDSLKTNTKPLFKELLTTGSLGSDKGRPITCIIAD
GIMGFTCDVANEIGIPIIYVRTISACCLWVFFCLPKLIEAGELPFNGDDLDTPIKSVPGTENS
LRIRDLPSFCRSGDICHPSIQLYKTERQELRRAYGLILNTFEDLEGSILSQMRTVCPNLYPIG
PLHTHLKTKLAAESTSPAISSNSLWKEDKSCITWLDSQPPKSVIYVSFGSLAIMTRDQLME
FWHGLVNSGCKFLWVIRPDSIAGDSEIPAEVAEGTKERGYIVGWAPQEEVLAHPSVGGFLT
HSGWNSTLESVIEGKPMICWPYFMDQQVNSRFVEEFWKLGLDMKDTCDRVIFEKMVKD
LMVERRDEFMKSADQMARLGKKCLSEGGSSYCNLVRLIEDIKLMGVER.
>Pn3-43
MGSSAETHVGVLAFPFATHAGLLLGLVCRLAAAAPNVNFSFFSTAVSNRSLFSDPNSPYSN
VIPYDVSDGVPEGYVFSGKPQEDINLFLTVASDEFRRGLEKASVDSGRKITCLVADAFLWF
SGDLAEQIGVPWVPLWTSGACSLSIHVYTDLIRQTVGLGAGIEGRMDEILTFIPGFSELRLG
DLPGGVLFGNLESPFSIMLHKMGQTLPRAAAVPINSFEELDPDLMKDIKSKFKKILNVGPF
NLTSPPQSSKSDEHGCIPWLDNKNPKSVAYIAFGTVATPPPNELVSLAEALEESGTPFLWSL
KENLKNHLPKGFLERNSKSGKIVAWAPQIQVLSHDAVGVVITHGGWNSVVESLAAGVPVI
CRPFFGDHHINTWMVENVWKIGVRIEGGVFTRNGTMNALEQVLSREKGKKLKEQITVFK
ELALKAVGPNGTSTLNFKRLLEVITT.
>Pn3-44
MEEFPIPVVMNSNSAQKPHIVCVPFPAQGHIIPMMQLAKLLHSRGFHITFVNSEHNQRRLI
RSKGPEWVAGFTDFRFETIPEGLPRIDPSKSHFPVAVFESVPKTCLAPFRDLLIKLKSTAEVP
AITSIIADGVMGFTIKAGIEFGIPVVQFWTASACGFMGYFQYPELIKRGIFPFKDQSFRSDG
SLDAKVDWIPGMRNIRLKDIPSFVQTTDPNDISFNFLREEVQNCLKASALIFNTFDAFEHE
VLQAISSMFPHIYTVGPIPLLSKGIPESQLQSFRPSLWIEDTKCLEWLDKREPNSVVYVNYG
SITKMTDHHLKEFAWGLANSKHPFLWIVRDNIIMGDSAILSEDFLEATKDRSLLTSWCQQ
DQVLSHSSIGVYLTHCGWNSTLETICAGVPIICWPFFAEQQTNCRYSCTDQWGIGVEVNH
DVKRDEIEALVKEMMEGDKGKQLKKKALEWKKMAEEATSVGGSSYSNFDRLIEEALQP
RND.
>Pn3-47
MMGHATQALHLANLMINRFDHLSITVLIMKLPVDPISTNFMESLGSPAATSSSNVNSEVD
RIRFHHLPPSKITSDCCYRTPGVLLDLVIEDHKDHVRNYLVTRKSTPAAALVVDMFCTTM
MDVGNQFGIPTYVFFTSGAAFLQLLFHLQILHDVDNGDVSELINSDTELVISGFVNPIPPGV
LPFVLIDKYQWSTRFLKYARACRKANGIIVNTFVELESHALNSLHGDGATKSPPIYPVGPII
NHAQMNLADDRDHDIMRWLDDQPRSSVVFLCFGSMGGFDMEQVREIANGIELSGYRFL
WSLRHPAEKGKSLFPTDYSCIEQIFLPDKFFERTAHVGKVIGWGPQLKILAHEAVGGFVSH
CGWNSILESLWHGVPIATWPIYSEQQLNAFEMVRELGLSVEIKLDYHNYMDNDMNKVLV
RAEEIERGIRSVMDGGNEVREKVKRMRDKSRMSMEEGGSSYKSLELLIEDLKM.
>Pn3-54
MSSEGLVHVLLVSFPGQGHVNPLLRLGNLLAAKGLLVTFSTPQNVGKVLRKANKMIDKL
TPIGEYGGMIRFEFFEDGLDEDDLRRNDLNFYMPQLQEFGSKAVVSIIKKHEQQGRPISCII
NNLFIPWVSDLAETLNIPSAVLWVQSCACFSAYYHFHHKLLPFPTDTDLKINVQLPCMPLL
KYDEIPSFLHPSDPFLVLGKLILGQFKNLSKPFCVLMETFQELEDELLTYMSNYCNIKPIGP
LFINPITDPTSTVRADFIKADDCVDWLDSKEANSSVVYISFGSVIYLKQEQLDEIANGLLNS
GVCFLWVLRPAHDSSVEPAVLPDGFLEKAGDKGKVVQWCSQEKVLSHRAVACFVTHCG
WNSSLEALSCGVPVVAFPAWGDQVTNAKYLVDVLNVGVRLSRGDAEKRIIGREEVEKRI
REVTSGPKAVEMKENALKWKKAAKEAVAEGGSSDRNLQDFVDEIKRKRTMHITQ.
>Pn3-55
MASEGKKKETHVLMVAFAAHGHMNPMLRLGKRLVSKGINVTFAVNEYGRDLIVNSPAFA
GIHLEFYSDGLSLDADRKTSMDLFMESICKYGPVSISALIQTIGVEKFSCIINNPFVPWVAD
VAAELEIPCAMLWIQPCMLYAIYYRFYNNLNLFPTSTNPNMSVELPGFPLLASEDLPSFILP
SNTSGPFPKLLAELFQNMNKIKWVLGNSFYELEKDIIDSMKNVHPIRPVGPLVPSTLLLGE
NDQNLHKDKCIEWLDHQKNSSVIYISFGTIIEFSTKEMEVVAKCLKKTKLKFLWAVKKGE
ELLPLGFPEEIKDQGLIVKWSPQVEVLGHPSVACFLTHCGWNSLIETITAGVPVIAYPKWT
DQPTNAKLIDILHVGVRLRPGQDGELSGEELEKCIEEIMVGPSSEEYKKNAAELKQEARE
TMAHGGSSDRNIQWFVDEIKGYS.
>Pn3-60
MDYVPAANHHKPHAVFVAYPAQGHINPLLKLAILIHHKGFHITFVNTEDTHNRLLQSSGPE
SLDGLPDFRFETIPDGLPPTEADATQDTKQVRLSTKQNCLAPFRKLVQKLNENCSENVPPV
CCIIFDALMSFAITAAEEIGVPSVCVRTSAACCFMLYKHYHLLTQKDEISSTSGHLDAEIDYI
PGIKSMRLKDLPTFIQTVSQNDEMLNFILGEVKRASEASAIIFNTYNALEFDVLRALSSMS
PPIYSIGPLPMLVNQENNKFKKSIGSNLWKEETTNCIQWLNSKKPNSVIYVNYGSLTVMTR
EILVEFAWGLANSKHNFLWIIRPDLVKGETEILPLEFVEDTKERGMIAGWCSQEEVLGHSS
VGGFLTHCGWNSMIESLSSGVPMICWPFFADQPTNCKLACKDWGVGMEIDKNVSRDEV
QMRVSELMEGGKGVEMRKKAMEWKKKAEEATGVDGSSSWNLDKLVEEVLLRKS.
>Pn3-67
MDQHTLPPHVLIFPLPAQGHVNSMLKLAELLCSAGIHVTFLVSAQNHTRLLRYTDVQSRF
SPYSGFHFETLPEGIYDGQVNYNGLIMNLFDSLKEIAEPFLKDFLSARKSSSPVTCIISDGLF
SYVLDLAEELAIPVIYFRTISACAFWAYFCIPQLIEAGELPLKGDDMDVPISSVPGMESYLR
RRDLPSFCRADLTDGSFQMVLNQTQQTPRAQALILNTFEDLEGAILSQVGAKCPNIYTVG
PLHAHLKARLPTKRTSSNSLREEDESCLTWLDDQPLNSVIYVSFGSITTMTRDQHVEFWH
GLVNSGKRFLWVVRPDSLASKEVKREIWGELEEATKARGYIVGWAAQEEVLAHRAVGGF
LTHCGWNSTLESIVAGVPMLCWPFLADQQINSRFVGEVWKIGMDMKDSCDRVVIEKMV
RDLMEVRRDEFVGRASEMANLARDSLRVGGTSYCNLDRLVKDIKLMVVPPPSMNK.
>Pn3-68
MERKAMEYSGSEILVLPFYGQGHLFPSMELCIHFSSRNYSPTLIIPSYLSSSIPDSLRHHPFIQ
ILQIAVDKSESPPPAEQAAGGRFQHQHQQLGQGIERLLSARSGDTNQTRPVCVVFDVMMS
WSKEIFAKFGFPAISFFTSGACSAAMVYAGWKARVDEMKPGDIRVLPGLPQDMALAYSD
LDRRPPRHGGGGGNGSPLGPPNGDRLNGPPSRGMGFGPPGPGQKPRWLDEVEGSVALLI
NTCDDLEKSFINYVATQIGKPVWGVGPLLPEKYWKSAGSLLRDGEIRSSGRQSSYTEDEV
TQWLDSKSCGSVIYISFGSEVGPTVEEYEQLANALGELNQPFIWVLQPNSGKPGPPSTLFG
VQTGPDVVAEGGYYPHSLDKKVGNRGLIIRGWAPQLLILSHPSTGGFLSHCGWNSTVEAI
GRGVPILAWPIRGDQYYNAKLVVSHLKVGCMVSSGGDSAKGMKKEDIIQGIERLMGDEE
VHKRARILQGKLESGFPASSVAALDALVDFIK.
>Pn3-74
MEERKEKMIVIFPYMAQGHIIPFLSLALQIEKKGYQITFVNTPLNIKNLKQSLPLNSSIRLLE
IPFNSSDHRLPPETENTDSIPFSLTLTLLEASVSLKPAFRNLISDLVRGGAPPLAVIADIFFGW
TAEVAHEFGIFHTIFSSTGGFGMACYYSVWMNLPHNYTDSVEFTLPDFPEAGLIHRTQLSA
NVLAADGTDPSSKIIQLLLSSWVDSDGILFNTIEEIDKIGLYYFRRKLSLPVWPIGPILLSVD
SRARSNKVCGISSESCINWLDSKPQNSVLYISFGSQHTISASQMMQLAKALDSIEINFIWVV
RPPLEFDINLGFDTEEWLPEGFIKRVEDQNKGLIIVKWAPQVEILSHRAVAAFLSHCGWNS
VLESLSRGVPLIGWPMGAEQFYNVKYLEEEVGVCVEVARGTKFEVRSEDIVEKIGIVMRE
NGKGKEMREKADQLKKMIENGGRDEEGYKGSSVKAMEEFLSVAAFFGKDKVRGEDE.
>Pn3-75
MDTNEKTRIKVVMLPWLAYGHISPYLELAKKLSKRDFYVYFCSTSINLSSIRKKLAVDDH
DAIQLIEFQLTSQTELPPHHHTTKGLPPHLIPDLIKALGMSGPNVINILNTVNPDLIIYDVFQ
LWVPAFAASLQIPAVHFQVVGALSTAAVYRFKVDSSIPVPCSRIFLDDTNIRKSPDYDSSSG
ENSGIVDLTFGTAIQSSDIILIKSSREFDEKNIEYYSLLMDKKIVPTGPLVQVNTCVAVHTEN
EKDDIMDWLSKKEESSTVYVSFGSECYLSEPRIRELAHGLELSNVNFIWVISFPEGDEEMC
NTSIEDVLPEGFLDRVKDRGVIVSWAPQERILGHGGLGGFVSHCGWGSVVEAMSYGVPII
AMPAQYEQPLHAMFVEEVGVGVEVLKDESGEFRRDEIAKAIKKVVVEKNGEGVRKKAR
EMGKAIKKRGEEEVECVVEELTKLCKKYQKVAAGQGKEWS.
>Pn3-76
MENEKTYKAHIMVLAYHGQGHINPMVQFSKRLASKGMKITVTTTLSNIKAMKKASSSVI
FESVYDDAAEGGVGAPGGFQGFLDRFEASGSRNLAQLIKKQENSGYPIKCLVYDANIHW
ASNIAKQFAIPGAAFFTQSCAAIASYYPMHCDLSDKSLPFPVFSMPGLPPPKLPYLPSLGAV
TGQYSPIIRFICKQFDNIENAEWVLFNSFDKLEEEVVKWMSNLWTVRNIGPTVPSVYLDN
RVENDNDYGFNLFKPSTEVCMQWLNTKETGSVVYVSFGSAASLSAEQMAEMAEALKQS
RHSFLWVVKPTEIKLPTNFVEETSEKGLVVAWCPQLEVLAHHAVGCFISHCGWNSTVEAI
SFGVPVVAMPQFLDQMTNAYFVEKVWGIGIQPKENEENVTSAEEIGRCINGVMNGKEIK
KKAKQWKELAKEAIDENGSSDKSIDEIISRILAL.
>Pn3-79
MENRQILPPHVLIFPLPLQGPVNCMLKLAELFSLAELRVTFLNTHHIHHRLLKHAHVESRF
GRYPEFRFETISDGLPADHQRSGDRFIEVINGIEAVTKPLLKEMLASGCLSSKSGRPVTCIIA
DGFLSFVLDIAKEVGVPLLNFDTISPCALWTYLCIPKLIEVDELPFKGNDLDAKITSVPGME
NFLRRRDLPSFCRNNDLNDPIMQHVLKEDQELPRAQGLILNTFEDLEKPILTHIRTICPNLY
PIGPIHTLHKSILSANKSPPQVNSSNSLWEDDRSCMTWLDAQPPNSVIYVSVGSLAVMTRD
QLMEIWHGLVNSGKRFLWVRRLGSIAREDDQKYETPTELSDATKERGCIVSWVQQEEVL
GHPSIGAFLTHSGWNSTLESIVEGVPMICWPYFVDQQVNSRFVGEVWRIGVDMKDSCDR
GIIERMIRDVMEVRKDEFQESANVMGKLARQSVCEGGSSYYGLDRLIKNIKKINLSTSQA.
>Pn3-87
MASQIEKLHFILIPFMSQSHILPLMDFAKVLAQRHLTVTIITTPLNAISVKSIIDHAQKTHLKI
GLDTVPFPSQEAGLPQGCESMETLPSHDMAKEFFVGCKMLQEPIQKLLAELQPRPSCIIST
NSLPWTGKVASNLGIPRYAFQTIACFSLLMSHNVGRMTLVQTAISDTEPFVLQGMPDKIVL
TKSQIPQIVTRTGEEDRKGIIDQMIEAERLTRGMVVNSFEEMEPKYVEAFKNMGKKIWCI
GPVSLCNKEMSDKLERGGNKDSIDQSLCLKWLDSMESNSVIYACFGSMGIIPSSQIIETGL
GLEASNCPFIWIIRKRDLSAKVEKWLEDENFEERVKGRGLIIRGWAPQVLILSHPSVGGFIT
HCGWNSTLEAVCAGVPMITWPMFSEQFYNEKLVVNVLKIGVRVGVEVAMKTEEEDKVY
VRREQVKEAIEQLMDEEEKGERIKRAKELSEMATKATEEGGSSFLNITMLIQDIIEQITGQ.
>Pn3-88
MAILQTQDLHFILIPFMSPSHIIPLMDLAKLLTQRPQLTVTVITTPLNAITIQSIIDRAQKSTH
VKIGLSTVPFPAQEAGLPQGLESMEALPSPDTWKPFFIACTMLQQPIQDLLEELQPRPSCIIS
SNVLPWTVKVASNLKIPRYDFQTVCCFSLLLGQTVMRMNVDETVGSDSEPFVVPNMPDE
IVITKSQIPQETERTGGGDKKGIVDQMREAELLTRGMVVNSFYEMESKYVDAHMNMGR
KVWCIGPVSLCNKDMWDKLERGKKASIDDEDVCMKWLDSMEPSSVIYACFGSLGLGRIS
EAQIVEIGLGLEASNRPFIWTIKKKDLSSRVEKWLEEEKFEERVKGRGLIMRGWAPQVLIL
SHPSVGGFITHCGWNSTLEAVSSGVPMITWPMFAEQFYNEKLIVNVLKIGVRTGVEAAM
KMGDVGYNEVYVSTDQLKKAIKEVMDSEEKGKRIRRRARELRDMAKKATEEGGSSYLN
TTMLIQDVVEQLKLK.
>Pn3-89
MLPILDLTHQLAVHGLTITILVTPKNLPILNPLLSAHPSSSIQTLLFPFPHRPSLLETENIKDIG
NSGNVLIINALSALSGPIIHWFNTHPSPPVALLSDFFLGWTHHLAHQIGIPRICFYSSGAFLT
SILACLWDDDIDTLVSLDVVDFPDLPHAPSFPKEHLPTVFLTYRKSNPMWELVKDGMIAN
SASWGCVFNTFDAIEGEYLEYSRKKMGHERVFAVGPLSLLGGPDHTTRGSTSRSSGSHNT
NILAWLDDVYPDGSVLYVCFGSQKLLKRVQMEALSSALEQCGIKFIWVVKSPTAQQVAD
GYGFVSDEFEKRVSGRGLIIKEWAPQVSILNHRAVGGFLSHCGWNSVLEAIVSGVTILGW
PMEADQFVNAKLLVDYMGVAIRVCEGPDIVPNSFELAKKVAESMNAEITEKMRAKELKV
KALEGVKHCGSSSRDLDGLVKELAQLQLKND.
>Pn3-91
MKKSAEVVFISAPAIGNLVPAVEFATHLTKTDPRLSATILVITTSHRPLVTTYIESRAAVNAG
KVRFIHLPPVEPPSPDQYQSSIAFMCLLIEKHKPHVKDALANLMQTQSESENSVVPVAGIF
VDMFCMPIMDVAAELGVPCYLFFASPATFLGFMLHLPSLHGQLASEFDDSATVLTIPAFTN
PVPIPVLPTFFLRRNDEDGCSCFLRNACRYKETKGIVVNTFKELEPCALDSLSADFADLPQ
VYPVGPVVDHCGPAGWNSDQSHENITKWLDKQPPSSVVFLCFGSMGSLSPAQVKNIAVG
LERSGYRFVWQLREPARAKSCLPSNYTNLEGVLPDGFLDRTAEFGLVCGWVPQVTILAH
KAIGGFVSHCGWNSILESLWYGVPIATWPIYAEQQLNAFEMVRELELSVEIRLDYRDGVS
DLVCAEEVEKGVRILMGNEGGEVREKVKKMREMSKNALIENGSSFVGLGLLIQQLLLDI.
>Pn3-97
MEQNQKMASPQEHIIMLPFFAHGHLTPFLALAKQIQERTGFTITLVSTPLNILYLKSTNTQN
PQIHLVPLPFNSSHHNLPPNTETTESLPLSQVITLFHASSSLESPFRRFISDVTIRDGKSPICIIS
DVFMGWANEVAKSLDIVNVSFSTCGAYGSAAYVSVWQKLPHRFLENDNDEFCLPWFPER
CRISRSHLHQFVRVADGNDEWSKFFQQQTTFSLGSFGWLCNTVQEIEPLGLEVLKNCTKL
PIWCIGPLLPQRMLESSSNPGTFGKRAGKEPGLSPEECLKWLDLFPKSSVLYISFGSQNTIR
PTQMMELAKGLEESGQPFIWAIRPPIGFNLKENFRDEWLPPGFEEQMIHSKLGLLVHKWA
PQLEILSHKSTGAFLSHCGWNSTLESLSQGVPIIGWPLAAEQVYNSKMMEEEMGVGIELT
RGLESSIVKEDVKRVIEIVMGKNGKGEEMRKKACEIGELIRVAAREENDVKGSSLQAMDD
FVASILSFSRR.
>Pn3-98
MGGQLHVFFFPMLAQGHMIPMLDIAKLFASRGVNATIITTPLNAPVFSKAVEKTKQLGFE
LSLRIIKFPAVPGLPEGCENADQLTSPEMFPTFFNATILLKEQVEQILQEYRPNCLVADMFFP
WATDVAAKLDIPILVFHGTSFFALCATEQVRLHKPFNNLLSDSEPFVLPNLPHDVKLTRSQL
SPHVRAAVETEFSKMHIQIRESEQRSYGVIVNSFYKLEPDYADHYRDVLKRKAWHIGPVS
LCNRGIEDKAQRGKKAAIDEDECLEWLDSKKPNSVIYVCFGSICNFPSSQLYELAMGLVD
SGQEFIWVVSKGKNEEENEKWMPERFEERMKGNGLIIRGWAPQVLILDHEAIGGFVTRC
GWNSTLEEISAGVPLVTWPLYADQFYNEKLVIEILRIGVAVGAKVWVAGDGSGSVKRDAV
EMAVKRLMAGEESEERRNRAKALKEMAIKAVEEGGSSYLDLDALIKELSFYHS.
>Pn3-103
MDSPLSSSRPLEIYVLPFLSPGHMIPLSEIARLFAARGEHVTIITTPSNVALTPDKEQNASIRI
HTIPFPAKEVGLPDGLENFFSVKDIDTAAKVFTGMQLLRNSIEEYVTEHRPDCLVSDMFFP
WTADLAIRLDIPRLVFNATCMFSQVLKDAVRRPDSPHLTVKSDYDPFVIAGLPHPITMTRA
ELPDYVRTPNGYTKMMEEWKEAELKSYGVLVNNYYEFDSAYTDFYQKMVAPTQKIINV
GPAALIHRSGNEKVKRGHKTVVGEHECLSWLKSKDPNSVLYVCFGSGCIFPDAQLMEIAC
GLMAAGHHFMWVVLGKDDEKKEDEVNWLPLDFDEKMIKTNKGMIVKGWAPQVLILD
HPSVGGFLSHCGWNSVIEAVSAGIPMATWPLYAEHFYNEKLLTQVLGIGVEVGAEEWNL
WVDAGKKVIKREKIERAVNKIMDGGDGSKDMRRKTKEMGERAKKAVGEGGSSQRNVT
VLIEDLRKLREKREKA.
>Pn3-106
MLLLLLLLLITIAHALITPLTLNPFTMSMSNKHGPGQVIVVMVPFPAQGHLNQLLHLSRLIS
AYDIPVHYVGTTIHNRQAKLRVHGWDPLSVSNIHFHDFQTPSFPSPPPNPNSPLGHLLPSFE
LASSHLRKPVASLLQQLSSAAKTRLVIIHDSLMSSVVQDLVLVPNAEAYTFHSVSAFTVFL
HLWENLGRPFAVQSDILKDLPSLDGCFTSDFANFISSQHDCMKFNSGKIYNTCKLIETPYL
DLLAKEQISKNKKQWPLGPFNPVSVSQNQTPNLQHQCLKWLDKQATSSVIFVSFGTTTSL
RDQQIQELALGLEKGAQNFIWVLRDADKGDVFGGEVRRAKLPKGYEERMKGKGMIVR
DWAPQLEILAHPSTAGFLSHCGWNSCMESITMGVPILAWPMHSDQPRNTVLITKLLRIGL
VVKNWSLRDELVVAETVGDVIKKLMTSQEGGEIRRRAEKMGGDVRMSVAGGGVSRLEL
DSFIVHISR.
>Pn3-107
MSTSISISSKRNMQINNDTIVLYPSPGIGHLVSMVELAKLILTHHPDSFSRIIILITTAPNFNTG
STGPYISRVSAITPSITFHHLPEPATSLPPNPGASVEAISYEFLRLNNPNVHQALQSISSTKSL
KIKAFIIDFFCTPAFEVSSSLNIPTYYFFTSSASALSFLLYLPIIHETTTTSLKDLNAFLDIPGSP
PIFSSELAKPLQDRNKVEYKYFIQTALEMAKSFGIIVNTFQSLEPRAITAISDGLCIPNAPTPP
IYCIGPLIATNNSGSCDAHECLIWLNSQPSRTVVFLCFGSLGLFKEEQLKEMARGLEKSGH
RFLWVVRSPAAADNQDQPQEPDLNALLPQGFLDRTKGRGLVVKSWAPQIAVLNHESVGG
FVTHCGWNSILEGVSAGVPMIGWPLYAEQGMNKVMLVEELKLGLGLGESGEGRFVSGA
ELEKQVRELMESKSGEMVRRRVMEMSDAAKTALGKDDGSSRIALANLIAKWK.
Supplementary Sequences 2
>Pn1-31
ATGGACAGAGAAATGTTGAGCAAAACTCACATTATGTTCATCCCATTCCCAGCTCAAGGCCACATGAGCCCAATGATGCAATTCGTCAAGCGTTTAGCCTGGAAAGGCGTGCGAATCACGATAGTTCTTCCGGCTGAGATTCGAGATTCTATGCAAATAAACAACTCATTGATCAACACTGAGTGCATCTCCTTTGATTTTGATAAAGATGATGAGATGCCATACAGCATGCGGGCTTATATGGGAGTTGTAAAGCTCAAGGTCACAAATAAACTGAGTGACCTACTCGAGAAGCAAAAAACAAATGGCTACCCTGTTAATTTGCTAGTGGTCGATTCATTATATCCATCTCGGGTAGAAATGTGCCACCAACTTGGGGTAAAAGGAGCTCCATTTTTCACTCACTCTTGTGCTGTTGGTGCCATTTATTATAATGCTCGCTTAGGGAAATTGAAGATACCTCCTGAGGAAGGGTTGACTTCTGTTTCATTGCCTTCAATTCCATTGTTGGGGAGAAATGATTTGCCAATTATTCGGACTGGCACCTTTCCTGATCTCTTTGAGCATTTGGGGAATCAGTTTTCAGATCTTGATAAAGCGGATTGGATCTTTTTCAATACTTTTGATAAGCTTGAAAATGAGGAAGCAAAATGGCTATCTAGCCAATGGCCAATTACATCCATCGGACCATTAATCCCTTCAATGTACTTAGACAAACAATTACCAAATGACAAAGACAATGACATTAATTTCTACAAGGCAGACGTCGGATCGTGCATCAAGTGGCTAGACGCCAAAGACCCTGGCTCGGTAGTCTACGCCTCATTCGGGAGCGTGAAGCACAACCTCGGCGATGACTACATGGACGAAGTAGCATGGGGCTTGTTACACAGCAAATATCACTTCATATGGGTTGTTATAGAATCCGAACGTACAAAGCTCTCTAGCGATTTCTTGGCAGAGGCAGAGGAAAAAGGCCTAATAGTGAGTTGGTGCCCTCAACTCGAAGTTTTGTCACATAAATCTATAGGTAGTTTTATGACTCATTGTGGTTGGAACTCGACGGTTGAGGCATTGAGTTTGGGCGTGCCAATGGTGGCAGTGCCACAACAGTTTGATCAGCCTGTTAATGCCAAGTATATCGTGGATGTATGGCGAATTGGGGTTCAGGTTCCGATTGGTGAAAATGGGGTTCTTTTGAGGGGAGAAGTTGCTAACTGTATAAAGGATGTTATGGAGGGGGAAATAGGGGATGAGCTTAGAGGGAATGCTTTGAAATGGAAGGGGTTGGCTGTGGAGGCAATGGAGAAAGGGGGTAGCTCTGATAAGAATATTGATGAGTTCATTTCAAAGCTTGTGTCCTCCTGA
>Pn3-29
ATGAAGTCAGAATTGATATTCTTGCCCGCCCCGGCCATCGGACACCTCGTGGGAATGGTGGAGATGGCTAAACTCTTCATCAGTCGACATGAAAACCTCTCGGTCACCGTCCTCATCGCGAAATTCTACATGGATACGGGGGTAGACAACTACAATAAATCACTCTTAACAAAGCCTACCCCGCGTCTCACAATTGTAAATCTCCCGGAAAGCGACCCCCAAAACTATATGCTCAAACCACGCCACGCCATCTTTCCTAGCGTCATCGAGACTCAGAAGACACACGTGCGAGACATAATATTAGGCATGACTCAGTCCGAGTCGACTCGGGTCGTTGGTTTGCTGGCTGACCTTTTGTTCATCAACATTATGGACATTGCCAATGAGTTCAATGTTCCAACTTATGTATACTCCCCTGCCGGAGCAGGTCATCTTGGCCTCGCGTTCCATCTCCAGACACTCAACGACAAAAAGCAAGATGTGACCGAGTTCAGGCACTCGGACACTGAGTTATTGGTACCGAGTTTTGCAAACCCGGTTCCCGCCGAGGTCTTGCCGTCGATGTATGTGGATAAAGAAGGTGGGTATGATTATTTGTTTTCATTGTTCCGGAGGTGC
AGAGAGTCAAAGGCAATTATTATTAACACGTTTGAGGAGCTGGAACCCTATGCGATCAATTCCCTCCGGATGGATAGTATGATCCCTCCGATCTACCCGGTGGGACCCATACTAAATCTCAACGGTGATGGCCAAAACTCCGATGAGGCTGCTGTGATCCTTGGTTGGTTAGACGATCAACCACCTTCATCTGTGGTGTTTTtGTG
CTTTGGTAGCTATGGAACCTTTCAaGAAAaCCAgGTGAAGGAGATTGCAATGGGTCTAGAGCGCAGTGGGCATCGCTTCTTGTGGTCCTTGCGTCCGTCTATCCCTAAAGGCGAGACAAAGCTTCAGCTTAAATACTCAAATTTGGAAGAAATTCTCCCAGTCGGATTCTTGGACAGGACATCATGCGTCGGAAAAGTTATTGGATGGGCCCCGCAAGTGGCGGTGCTCGGACACGAGGCAGTCGGAGGGTTCCTGTCTCATTGTGGTTGGAATTCGACATTAGAGAGTGTGTGGTGTGGCGTGCCCGTCGCAACATGGCCAATGTACGGCGAGCAACAACTCAATGCTTTTGAGATGGTTAAGGAGTTGGGTATTGCGGTGGAAATTGAGGTGGACTATAAGAATGAATATTTTAACATGACGAATGATTTTATTGTTAGGGCAGAAGAAATTGAGACGAAAATAAAGAAGTTGATGATGGATGAAAAGAATAGTGAAATAAGGAAGAAGGTAAAGGAAATGAAAGAAAAGAGTAGGCTTGCAATGTCTGAGAATGGATCATCTTATAATTCCTTGGCGAAGCTATTTGAGGAAATTATGTAA
>Pn3-31
ATGGATAACCAAAAAGGTAGAATCAGTATAGCGTTGCTACCATTTTTAGCCCATGGTCACATATCTCCCTTCTTTGAGCTAGCCAAACAACTAGCAAAAAGAAATTGCAATGTTTTCCTCTGTTCTACCCCAATCAATCTTAGCTCCATCAAGAATAAGGATTCCTCTGCTTCTGTAAAACTAGTTGAGCTTCATCTTCCATCTTCCCCTGATCTTCCTCCTCACTATCACACCACAAATGGCCTCCCTTCCCATCTCATGGTCCCACTCAGAAACGCCTTTGAAACAGTAGGCCCCACCTTCTCTGAAATCCTTAAAACCTTAAACCCTGATTTGCTTATTTATGATTTCAATCCCTCATGGGCACCGGAGATCGCTTCGTCTCACAATATTCCGGCAGTTTATTTCCTAACCACGGCAGCAGCCAGCTCTTCCATTGGCCTACATGCTTTCAAAAACCCAGGTGAAAAATACCCATTTCCAGATTTTTATGATAACAGTAATATTACCCCTGAACCACCTTCTGCAGATAACATGAAGCTACTTCATGATTTTATCGCTTGTTTCGAACGATCTTGCGATATTATTTTGATTAAGAGTTTTAGAGAACTAGAAGGGAAATATATTGATTTGCTTTCCACTTTATCTGATAAAACTTTGGTTCCTGTTGGTCCACTCGTTCAAGATCCTATGGGCCATAATGAAGATCCAAAAaCAGAGCAGATTATAAACTGGCTTGACaAAAGGGCTGAATCTACAGTGGTGTTTGTCTGCTTTGGAAGTGAGTATTTTCTCTCCAATGAGGAATTGGAAGAAGTAGCAATTGGGCTAGAGATTAGCATGGTTAATTTCATATGGGCTGTGAGATTAATTGAAGGAGAGAAAAAAGGGGTTTTACCAGAGGGGTTTGTTCAAAGGGTAGGAGACAGAGGATTGGTTGTGGAGGGGTGGGCTCCACAGGCAAGAATTTTAGGACATTCAAGCACCGGTGGGTTTGTGAGCCATTGTGGGTGGAGTTCTATTGCGGAGAGTATGAAGTTTGGGGTTCCAGTAATTGCCATGGCTAGGCATCTTGATCAGCCTTTGAATGCTAAGCTGGCGGCGGAGGTTGGTGTGGGCATGGAGGTTGTGAGAGATGATAATGGGAAATATAAGAGGGAAGGGATTGCAGAGGTAATAAGAAAAGTCGTTGTGGAGAAAAGTGGGGAGGTTATCAGGAGGAAAGCAAGGGAGTTGAGTGAGAAAATGAAAGAGAAAGGAGAGCAAGAGATTGATAGGGCAGTGGAGGAGCTAGTACAAATTTGTAAGAAGAAGAAAGATGCACAATAG
>Pn3-32
ATGGATATCGAGAAAGGTAGAATCAGTATAGTTATGCTACCATTTTTAGCCCA
TGGTCACATATCTCCATTCTTTGAGCTAGCCAAGCATCTCTCAAAAAGAAATTGCAATATATTCCTCTGTTCTACCCCAATCAATCTTAGCTCCATCAAGAACAGAGTATCTGATAAGGATTCCTCTGCTTCAATAAAACTAGTAGAGCTTCATCTTCCATCTTCCCCTGATCCTCCTCCTCAGTACCACACCACAAATGGCCTCCCTTCCCATCTCATGGTCCCACTCAAAAACGCCTTTGAAACAGTAGGCCCCACCTTCTCTGAAATCCTTAAAACCTTAGACCCTGATTTGCTTATTTATGATTTCAATCCCTCATGGGCACCGGAGATCGCTTTGTCTCACAATATTCCGGCAGTTTATTTCCTAACCTCGGCAGCAGCCACCTCTTCCGTGGCCCTACGTGCTTTGAAAAACCCAGG
TGAAAAATACCCATTTCCAGATTTTTATGATAACAGTAATATTACCCCTGAACCACCTTCTGCAGATAAAATGAAGCTATTTCATGATTTTGTTGCTTGTTTCAAACGATCTTGCGACATTATTTTGATTAAGAGTTTTAGAGAACTAGAAGGGAAATATATTGATTTGCTTTCCACTTTATCTAAGAAAACTTTGGTTCCTGTTGGTCCACTCGTTCAAGATCCTTTGGGACATGATGAAGATCCAAAAACAGGGCATCTTATAAACTGGCTTGACAAAAGGGCTGAATCTACAGTGGTGTTTGTCTGCTTTGGAAGTGAGTATTTTCCCTCCAATGAGGAATTGGAAGAAGTAGCAATTGGGCTAGAGATTAGCATGGTTAATTTCATATTGGCTGTGAGATTTCTTGAAGGAGAGAAAAAAGGGGTTTTACCAGAGGGGTTTGTTCAAAGGGTAGGAGACAGAGGATTGGTTGTGGAGGGGTGGGCTCCACAGGCAAGAATTTTAGGACATTCAAGCACCGGTGGGTTTGTGAGCCATTGTGGGTGGAGTTCTATTATGGAGAGTGTGAAGTTTGGGGTTCCAGTAATTGCCATGGCCAGGCATCTTGATCAGCCTTTGAATGCTAAGCTGGCGGCGGAGGTCGGTGTGGGCATGGAGGTTGTGAGAGATGAAAATGGGAAGTATAAGAGAGAAGCGATTGCAGAGGTAATAAGAAAAGTCGTGATGGAGAAAAATGGGGAGGTTATCAGGAGGAAAGCAAGGGAATTGAGTGAGAAAATGAAAGAGACAGGAGAGCAAGAGATTGGTAGGGCAGTGGAGGAGCTAGTACAAATTTGTAAGATGAAGAAAGACGCACAATATTAA>Pn3-32-i5ATGGATAACCAAGAAGCTAGAATCAGTATAGTTATGCTGCCATTTTTAGCCCATGGCCACATTTCTCCATTCTTTGAGCTAGCCAAGCATCTCTCAAAAAGAAATTGCAATATATTCCTCTGTTCTACCCCAATCAATCTTAGCTCCATCAAGAACAGAGTATCTGATAAGGATTCCTCTGCTTCTATAAAACTAGTAGAGCTTCATCTTCCATCTTCCCCTGATCTTCCTCCTCACTACCACACCACAAATGGCCTCCCTTCCCATCTCATGGTCCCACTCAGAAACGCCTTTGAAACAGCAGGCCCCACCTTCTCTGAAATCCTTAAAACCTTAAACCCCGATTTGCTTATTTATGATTTCAATCCCTCATGGGCACCGGAGATCGCTTCGTCTCACAATATTCCGGCAGTTTGTTTCATAATTGGGGGAGCAGCCTCCTCTTCCATGAGCCTACATAGTTTCAAAAACCCAGGTGAAAAATACCCATTTCTAGATTTTAATGAGAACAGTAATATTACCCCTGAACCACCTTCAGCAGATAACATGAAGCTATTTCTTGATTTTATGACTTGTTTCGAACGATCTTGCGACATTATTTTGATTAAGAGTTTTAGAGAACTAGAAGGGAAATATTTTGATTTTTTTTCCACTTTATCTGATAAAACTGTGGTTCCTGTTGGTCCACTCGTTCAAGATCCTATGGGCCATAATGAAGATCCAAAAACAGAGCAGTTTATAAACTGGCTTGACAAAAGGGCTGAATCTACAGTGGTGTTTGTCTGCTTTGGAAGTGAGTATTTTCTCTCCAATGAGGAATTGGAAGAAGTAGCAATTGGGCTAGAGATTAGCATGGTTAATTTCATATGGGCTGTGAGATTAATTGAAGGAGAGAAAAAAGGGGTTTTACCAGAGGGGTTTGTTCAAAGGGTAGGAGACAGAGGATTGGTTGTGGAGGGGTGGGCTCCACAGGCAAGAATTTTAGGACATTCAAGCACCGGTGGGTTTGTGAGCCATTGTGGGTGGAGTTCTATTGCGGAGAGTATGAAGTTTGGGGTTCCAGTAATTGCCATGGCTAGGCATCTTGATCAGCCTTTGAATGCTAAGCTGGCGGCGGAGGTT
GGTGTGGGCATGGAGGTTGTGAGAGATGATAATGGGAAATATAAGAGGGAAGGGATTGCAGAGGTAATAAGAAAAGTCGTTGTGGAGAAAAGTGGGGAGGTTATCAGGAGGAAAGCAAGGGAGTTGAGTGAGAAAATGAAAGAGAAAGGAGAGCAAGAGATTGATAGGGCAGTGGAGGAGCTAGTACAAATTTGTAAGAAGAAGAAAGATGCACAATAG
>PGM1ATGTCACTTCTAATAGATTCTGTACCAACAGTTGCTTATAAGGACCAAAAACCGGGTACTTCAGGTTTACGTAAGAAGACCAAGGTTTTCATGGATGAGCCTCATTATACTGAGAACTTCATTCAAGCAACAATGCAATCTATCCCTAATGGCTCAGAGGGAACCACTTTAGTTGTTGGAGGAGATGGTCGTTTCTACAACGATGTTATCATGAACAAGATTGCCGCAGTAGGTGCTGCAAACGGTGTCAGAAAGTTAGTCATTGGTCAAGGCGGTTTACTTTCAACACCAGCTGCTTCTCATATAATTAGAACATACGAGGAAAAGTGTACCGGTGGTGGTATCATATTAACTGCCTCACACAACCCAGGCGGTCCAGAGAATGATTTAGGTATCAAGTATAATTTACCTAATGGTGGGCCAGCTCCAGAGAGTGTCACTAACGCTATCTGGGAAGCGTCTAAAAAATTAACTCACTATAAAATTATAAAGAACTTCCCCAAGTTGAATTTGAACAAGCTTGGTAAAAACCAAAAATATGGCCCATTGTTAGTGGACATAATTGATCCTGCCAAAGCATACGTTCAATTTCTGAAGGAAATTTTTGATTTTGACTTAATTAAAAGCTTCTTAGCGAAACAGCGCAAAGACAAAGGGTGGAAGTTGTTGTTTGACTCCTTAAATGGTATTACAGGACCATATGGTAAGGCTATATTTGTTGATGAATTTGGTTTACCGGCAGAGGAAGTTCTTCAAAATTGGCACCCTTTACCTGATTTCGGCGGTTTACATCCCGATCCGAATCTAACCTATGCACGAACTCTTGTTGACAGGGTTGACCGCGAAAAAATTGCCTTTGGAGCAGCCTCCGATGGTGATGGTGATAGGAATATGATTTACGGTTATGGCCCTGCTTTCGTTTCGCCAGGTGATTCTGTTGCCATTATTGCCGAATATGCACCCGAAATTCCATACTTCGCCAAACAAGGTATTTATGGCTTGGCACGTTCATTTCCTACATCCTCAGCCATTGATCGTGTTGCAGCAAAAAAGGGATTAAGATGTTACGAAGTTCCAACCGGCTGGAAATTCTTCTGTGCCTTATTTGATGCTAAAAAGCTATCAATCTGTGGTGAAGAATCCTTCGGTACAGGTTCCAATCATATCAGAGAAAAGGACGGTCTATGGGCCATTATTGCTTGGTTAAATATCTTGGCTATCTACCATAGGCGTAACCCTGAAAAGGAAGCTTCGATCAAAACTATTCAGGACGAATTTTGGAACGAGTATGGCCGTACTTTCTTCACAAGATACGATTACGAACATATCGAATGCGAGCAGGCCGAAAAAGTTGTAGCTCTTTTGAGTGAATTTGTATCAAGGCCAAACGTTTGTGGCTCCCACTTCCCAGCTGATGAGTCTTTAACCGTTATCGATTGTGGTGATTTTTCGTATAGAGATCTAGATGGCTCCATCTCTGAAAATCAAGGCCTTTTCGTAAAGTTTTCGAATGGGACTAAATTTGTTTTGAGGTTATCCGGCACAGGCAGTTCTGGTGCAACAATAAGATTATACGTAGAAAAGTATACTGATAAAAAGGAGAACTATGGCCAAACAGCTGACGTCTTCTTGAAACCCGTCATCAACTCCATTGTAAAATTCTTAAGATTTAAAGAAATTTTAGGAACAGACGAACCAACAGTCCGCACATAG>PGM2ATGTCATTTCAAATTGAAACGGTTCCCACCAAACCATATGAAGACCAAAAGCCTGGTACCTCTGGTTTGCGTAAGAAGACAAAGGTGTTTAAAGACGAACCTAACTACACAGAAAATTTCATTCAATCGATCATGGAAGCTATTCCAGAGGGTTCTAAAGGTGCCACTCTTGTTGTCGGTGGTGATGGGCGTTACTACAATGATGTCATTCTTCATAAGATTGCCGCTATCGGTGCTGCCAACGGTATTAAAAAGTTAGTTATTGGCCAGCATGGTC
TTCTGTCTACGCCAGCCGCTTCTCACATCATGAGAACCTACGAGGAAAAATGTACTGGTGGTATTATCTTAACCGCCTCACATAATCCAGGTGGTCCAGAAAATGACATGGGTATTAAGTATAACTTATCCAATGGGGGTCCTGCTCCTGAATCCGTCACAAATGCTATTTGGGAGATTTCCAAAAAGCTTACCAGCTATAAGATTATCAAAGACTTCCCAGAACTAGACTTGGGTACGATAGGCAAGAACAAGAAATACGGTCCATTACTCGTTGACATTATCGATATTACAAAAGATTATGTCAACTTCTTGAAGGAAATCTTCGATTTCGACTTAATCAAGAAATTCATCGATAATCAACGTTCTACTAAGAATTGGAAGTTACTGTTTGACAGTATGAACGGTGTAAC
TGGACCATACGGTAAGGCTATTTTCGTTGATGAATTTGGTTTACCGGCGGATGAGGTTTTACAAAACTGGCATCCTTCTCCGGATTTTGGTGGTATGCATCCAGATCCAAACTTAACTTATGCCAGTTCGTTAGTGAAAAGAGTAGATCGTGAAAAGATTGAGTTTGGTGCTGCATCCGATGGTGATGGTGATAGAAATATGATTTACGGTTACGGCCCATCTTTCGTTTCTCCAGGTGACTCCGTCGCAATTATTGCCGAATATGCAGCTGAAATCCCATATTTCGCCAAGCAAGGTATATATGGTCTGGCCCGTTCATTCCCTACCTCAGGAGCCATAGACCGTGTTGCCAAGGCCCATGGTCTAAACTGTTATGAGGTCCCAACTGGCTGGAAATTTTTCTGTGCTTTGTTCGACGCTAAAAAATTATCTATTTGTGGTGAAGAATCGTTTGGTACTGGTTCCAACCACGTAAGGGAAAAGGACGGTGTTTGGGCCATTATGGCGTGGTTGAACATCTTGGCCATTTACAACAAGCATCATCCGGAGAACGAAGCTTCTATTAAGACGATACAGAATGAATTCTGGGCAAAGTACGGCCGTACTTTCTTCACTCGTTATGATTTTGAAAAAGTTGAAACAGAAAAAGCTAACAAGATTGTCGATCAATTGAGAGCATATGTTACCAAATCGGGTGTTGTTAATTCCGCCTTCCCAGCCGATGAGTCTCTTAAGGTCACCGATTGTGGTGATTTTTCATACACAGATTTGGACGGTTCTGTTTCTGACCATCAAGGTTTATATGTCAAGCTTTCCAATGGTGCAAGATTCGTTCTAAGATTGTCAGGTACAGGTTCTTCAGGTGCTACCATTAGATTGTACATTGAAAAATACTGCGATGATAAATCACAATACCAAAAGACAGCTGAAGAATACTTGAAGCCAATTATTAACTCGGTCATCAAGTTCTTGAACTTTAAACAAGTTTTAGGAACTGAAGAACCAACGGTTCGTACTTAA>UGP1ATGTCCACTAAGAAGCACACCAAAACACATTCCACTTATGCATTCGAGAGCAACACAAACAGCGTTGCTGCCTCACAAATGAGAAACGCCTTAAACAAGTTGGCGGACTCTAGTAAACTTGACGATGCTGCTCGCGCTAAGTTTGAGAACGAACTGGATTCGTTTTTCACGCTTTTCAGGAGATATTTGGTAGAGAAGTCTTCTAGAACCACCTTGGAATGGGACAAGATCAAGTCTCCCAACCCGGATGAAGTGGTTAAGTATGAAATTATTTCTCAGCAGCCCGAGAATGTCTCAAACCTTTCCAAATTGGCTGTTTTGAAGTTGAACGGTGGGCTGGGTACCTCCATGGGCTGCGTTGGCCCTAAATCTGTTATTGAAGTGAGAGAGGGAAACACCTTTTTGGATTTGTCTGTTCGTCAAATTGAATACTTGAACAGACAGTACGATAGCGACGTGCCATTGTTATTGATGAATTCTTTCAACACTGACAAGGATACGGAACACTTGATTAAGAAGTATTCCGCTAACAGAATCAGAATCAGATCTTTCAATCAATCCAGGTTCCCAAGAGTCTACAAGGATTCTTTATTGCCTGTCCCCACCGAATACGATTCTCCACTGGATGCTTGGTATCCACCAGGTCACGGTGATTTGTTTGAATCTTTACACGTATCTGGTGAACTGGATGCCTTAATTGCCCAAGGAAGAGAAATATTATTTGTTTCTAACGGTGACAACTTGGGTGCTACCGTCGACTTAAAAATTTTAAACCACATGATCGAGACTGGTGCCGAATATATAATGGAATTGACTGATAAGACCAGAGCCGATGTTAAAGGTGGTACTTTGATTTCTTACGATGGTCAAGTCCGTTTATTGGAAGTCGCCCAAGTTCCAAAAGAACACATTGACGAATTCAAAAATATCAGAAAGTTTACCAACTTCAACACGAATAACTTATGGATCAATCT
GAAAGCAGTAAAGAGGTTGATCGAATCGAGCAATTTGGAGATGGAAATCATTCCAAACCAAAAAACTATAACAAGAGACGGTCATGAAATTAATGTCTTACAATTAGAAACCGCTTGTGGTGCTGCTATCAGGCATTTTGATGGTGCTCACGGTGTTGTCGTTCCAAGATCAAGATTCTTGCCTGTCAAGACCTGTTCCGATTTGTTGCTGGTTAAATCAGATCTATTCCGTCTGGAACACGGTTCTTTGAAGTTAGACCCATCCCGTTTTGGTCCAAACCCATTAATCAAGTTGGGCTCGCATTTCAAAAAGGTTTCTGGTTTTAACGCAAGAATCCCTCACATCCCAAAAATCGTCGAGCTAGATCATTTGACCATCACTGGTAACGTCTTTTTAGGTAAAGATGTCACTTTGAGGGGTACTGTCATCATCGTTTGCTCCGACGGTCATAAAATCGATATTCCAAACGGCTCCATATTGGAAAAT
GTTGTCGTTACTGGTAATTTGCAAATCTTGGAACATTGA>SynAtUXS3 (a codon-optimized version of the AtUXS3 gene)ATGGCTGCAACTTCTGAAAAGCAAAACACTACAAAACCACCACCATCTCCATCACCATTGAGAAACTCAAAGTTCTGTCAACCAAACATGAGAATTTTAATTTCTGGTGGTGCTGGTTTTATTGGTTCACATTTGGTTGATAAATTGATGGAAAACGAAAAGAATGAAGTTGTTGTTGCAGATAACTACTTCACTGGTTCTAAGGAAAATTTGAAGAAATGGATCGGTCATCCAAGATTCGAATTGATCAGACATGATGTTACAGAACCATTGTTGATCGAAGTTGATAGAATCTATCATTTGGCTTGTCCAGCATCACCAATTTTCTATAAGTACAACCCAGTTAAGACTATTAAAACAAATGTTATTGGTACATTGAACATGTTGGGTTTGGCTAAGAGAGTTGGTGCAAGAATTTTGTTGACTTCTACATCAGAAGTTTATGGTGACCCATTGATTCATCCACAACCAGAATCTTACTGGGGTAATGTTAATCCAATTGGTGTTAGATCATGTTATGATGAAGGTAAAAGAGTTGCTGAAACTTTGATGTTCGATTACCATAGACAACATGGTATCGAAATCAGAATCGCAAGAATTTTCAATACATACGGTCCAAGAATGAACATCGATGATGGTAGAGTTGTTTCTAACTTCATCGCTCAAGCATTGAGAGGTGAAGCATTGACTGTTCAAAAGCCAGGTACTCAAACAAGATCTTTTTGTTACGTTTCAGATATGGTTGATGGTTTGATCAGATTGATGGAGGGTAACGATACAGGTCCAATTAATATCGGTAATCCTGGTGAATTCACTATGGTTGAATTGGCTGAAACAGTTAAGGAATTGATTAATCCATCTATCGAAATTAAAATGGTTGAAAATACTCCAGATGATCCAAGACAAAGAAAGCCAGATATCTCAAAGGCAAAGGAAGTTTTGGGTTGGGAACCAAAAGTTAAATTGAGAGAAGGTTTGCCATTGATGGAAGAAGATTTCAGATTGAGATTAAATGTTCCAAGAAATTAA>SynAtUGD1 (a codon-optimized version of the AtUGD1 
gene)ATGGTTAAAATTTGTTGTATTGGTGCTGGTTATGTTGGTGGTCCAACTATGGCTGTTATGGCATTGAAATGTCCAGAAATCGAAGTTGTTGTTGTTGATATCTCTGAACCAAGAATTAATGCTTGGAACTCAGATAGATTGCCAATCTATGAACCAGGATTAGAAGATGTTGTTAAGCAATGTAGAGGTAAAAATTTGTTTTTCTCTACTGATGTTGAAAAGCATGTTTTCGAATCTGATATCGTTTTTGTTTCAGTTAATACTCCAACAAAAACTCAAGGTTTGGGTGCTGGTAAAGCTGCAGATTTGACATATTGGGAATCTGCTGCAAGAATGATTGCAGATGTTTCAAAGTCTTCAAAGATCGTTGTTGAAAAATCAACTGTTCCAGTTAGAACAGCTGAAGCAATTGAAAAGATTTTGACTCATAACTCTAAGGGTATCGAATTCCAAATCTTGTCAAATCCAGAATTTTTAGCTGAAGGTACTGCAATTAAAGATTTGTACAACCCAGATAGAGTTTTAATTGGTGGTAGAGATACAGCTGCAGGTCAAAAGGCTATTAAAGCATTGAGAGATGTTTACGCTCATTGGGTTCCAGTTGAACAAATCATCTGTACAAATTTGTGGTCTGCAGAATTGTCAAAGTTGGCTGCAAACGCATTTTTGGCACAAAGAATCTCTTCAGTTAATGCTATGTCTGCATTATGTGAAGCTACTGGTGCAGATGTTACACAAGTTGCTCATGCAGTTGGTACAGATACTAGAATCGGTCCAAAGTTCTTGAATGCTTCTGTTGGTTTCGGTGGTTCATGTTTCCAAAAGGATATCTTGAATTTGATCTATATCTGTGAATGTAACGGTTTGCCAGAAGCTGCAAACTACTGGAAGCAAGTTGTTAAGGTTAACGATTACCAAAAGATTAGATTCGCTAACAGAGTTGTTTCTTCAATGTTCAACACAGTTTCTGGTAAAAAGATTGCTATCTTGGGTTTCGCTTTTAAGAAAGATACAGGTGACACTAGAGAAACACCAGCTATTGATGTTTGTAACAGATTGGTTGCTGATAAGGCAAAGTTGTCTATCTATGATCCACAAGTTTTGGAAGAACAAATCAGAAGAGATTTGTCAATGGCTAGATTTGATTGGGATCATCCAGTTCCATTGCAACAAATTAAAGCAGAAGGTATCTCTGAACAAGTTAACGTTGTTTCAGATGCTTACGAAGCAACTAAAGATGCTCATGGTTTGTGTGTTTTGACAGAATGGGATGAATTCAAATCTTTGGATTTCAAGAAAATTTTCGATAACATGCAAAAACCAGCTTTCGTTTTCGATGGTAGAAACGT
TGTTGATGCTGTTAAGTTGAGAGAAATCGGTTTTATTGTTTACTCTATCGGTAAACCATTGGATTCATGGTTGAAGGATATGCCAGCTGTTGCATAA>SynUGTpg101 (a codon-optimized version of the UGTpg101 gene)ATGAAGTCCGAATTGATTTTTTTGCCAGCTCCAGCTATTGGTCATTTGGTTGGTATGGTTGAAATGGCCAAGTTGTTCATTTCTAGGCACGAAAACTTGTCCGTTACCGTTTTGATTGCTAAGTTCTACATGGATACCGGTGTTGACAATTACAACAAGTCCTTGTTGACTAACCCAACTCCAAGATTGACTATCGTTAACTTGCCAGAAACTGACCCACAAAACTATATGTTGAAACCTAGACATGCCATCTTTCCATCCGTTATTGAAACTCAAAAGACCCACGTTAGAGACATCATTTCTGGTATGACTCAATCCGAATCTACCAGAGTTGTTGGTTTGTTGGCTGATTTGTTGTTCATCAACATTATGGATATCGCCAACGAATTCAACGTTCCAACTTATGTTTATTCTCCAGCTGGTGCTGGTCACTTGGGTTTAGCTTTTCACTTGCAAACTCTGAACGATAAGAAGCAAGACGTTACCGAATTCAGAAACTCTGATACCGAATTATTGGTTCCCTCATTTGCTAATCCAGTTCCAGCTGAAGTTTTGCCATCTATGTATGTTGACAAAGAAGGTGGTTACGACTACCTGTTTTCTTTGTTCAGAAGATGCAGAGAATCCAAGGCCATTATTATCAACACCTTCGAAGAATTGGAACCCTACGCTATTAACTCCTTGAGAATGGATTCTATGATCCCACCAATCTATCCAGTTGGTCCAATTTTGAATTTGAACGGTGATGGTCAAAACTCCGATGAAGCTGCTGTTATTTTAGGTTGGTTGGATGATCAACCACCATCCTCTGTTGTTTTTTTGTGTTTTGGTTCCTACGGCACCTTCCAAGAAAATCAAGTAAAAGAAATCGCCATGGGTCTAGAAAGATCTGGTCATAGATTTTTGTGGTCTTTGAGGCCATCTATTCCAAAGGGTGAAACTAAGTTGCAGTTGAAGTACTCTAACCTGGAAGAAATTTTGCCAGTTGGTTTCTTGGATAGAACCTCTTGTGTTGGTAAAGTTATTGGTTGGGCTCCACAAGTTGCTGTTTTGGGTCATGAAGCTGTTGGTGGTTTTTTGTCTCATTGTGGTTGGAACTCTACCTTGGAATCTGTTTGGTGTGGTGTTCCAGTTGCTACTTGGCCAATGTATGGTGAACAGCAATTGAATGCTTTCGAAATGGTCAAAGAATTGGGTATCGCCGTTGAAATCGAAGTTGATTACAAGAACGAATACTTCAACATGAACAACGACTTCATCGTTAGAGCCGAAGAAATCGAAACGAAGATCAAAAAGTTGATGATGGACGAGAAGAACTCCGAGATTCGTAAGAAAGTCAAAGAGATGAAGGAAAAGTCCAGATTGGCTATGTCTGAAAACGGTTCTTCTTACAACTCTTTGGCCAAGCTGTTTGAAGAGATCATGTGA>SynPn3-29 (a codon-optimized version of the Pn3-29 
gene)ATGAAAAGTGAGCTAATTTTTCTACCAGCTCCAGCTATTGGTCATCTAGTTGGTATGGTTGAAATGGCAAAGTTATTCATTTCTAGGCACGAGAATTTGAGTGTTACTGTTCTAATTGCTAAGTTTTATATGGACACTGGTGTTGATAATTATAACAAGAGTCTATTGACTAAACCAACTCCAAGATTAACTATAGTTAACTTGCCAGAGTCTGATCCACAGAATTACATGTTAAAGCCTAGACATGCTATTTTCCCAAGTGTTATTGAAACACAAAAGACTCATGTTCGTGATATTATTTTGGGTATGACACAATCTGAATCAACACGTGTTGTAGGATTACTAGCAGATCTTCTATTCATTAATATAATGGATATAGCTAACGAATTCAACGTACCTACATACGTTTATAGTCCAGCTGGTGCTGGACACCTAGGTTTAGCTTTTCACTTGCAAACTCTAAATGATAAGAAACAGGACGTTACTGAATTCAGACATTCAGATACAGAACTTTTAGTTCCATCATTCGCTAATCCAGTACCAGCTGAAGTTCTTCCATCTATGTACGTTGACAAGGAAGGTGGTTACGACTACTTATTCAGTCTTTTTAGACGTTGTAGGGAAAGTAAAGCTATAATAATCAATACTTTCGAAGAATTGGAGCCATACGCTATTAACAGTTTGCGTATGGACTCTATGATTCCACCAATATACCCAGTTGGTCCAATTTTGAACCTAAATGGAGACGGTCAGAATAGTGACGAAGCAGCAGTTATTTTAGGATGGCTAGATGACCAGCCTCCAAGTTCAGTTGTTTTCCTTTGTTTCGGATCATACGGTACTTTCCAGGAGAATCAAGTTAAAGAAATAGCTATGGGATTAGA
AAGGTCCGGTCACAGGTTTTTATGGTCACTAAGACCATCCATTCCAAAGGGTGAAACTAAATTACAATTGAAGTATAGTAACTTAGAGGAGATATTGCCTGTTGGTTTCTTAGATCGAACTTCTTGTGTTGGTAAAGTAATAGGTTGGGCTCCACAGGTTGCTGTTCTAGGTCATGAAGCTGTTGGTGGTTTCTTATCACACTGCGGATGGAACTCTACTTTGGAATCCGTTTGGTGCGGTGTTCCAGTTGCTACTTGGCCTATGTATGGTGAACAGCAGTTAAACGCATTCGAAATGGTAAAAGAATTAGGAATAGCTGTTGAGATAGAAGTTGATTACAAAAACGAGTACTTCAATATGACTAACGACTTCATAGTAAGAGCTGAGGAGATAGAAACTAAGATTAAAAAGTTGATGATGGACGAGAAAAACTCAGAGATTCGAAAGAAAGTTAAAGAGATGAAGGAGAAATCTCGTCTAGCTATGTCAGAAAACGGTTCCAGTTACAACTCTCTAGCTAAATTGTTCGAAGAGATAATGTGA
Supplementary Sequences 3
>SlGAME1(Gal)
MAGENLHVLFLPFLSAGHFIPLVNAARLFASRGVKATILTTPHNALLFRSTIEDDVRISGFPI
SIVTIKFPSAEVGLPEGIESFNSATSPEMPHKVFYALYLLQKPMEDKIRELHPDCIFSDMYY
PWTVDLAEELHIPRILYNLSAYMCYSIMHNLKLYRPHKQPDLDESQSFVVPGLPDEIKFKL
SQLTEDLRKPEDQKSVFDELLEKVRVSEERSYGIVHDTFYELEPAYINYYQKLKKPKWWH
FGPLSHFASKIRSKELITEHNNNDIVVDWLNAQKPKSVLYVSFGSMARFPENQLNEIAQAL
HASNVPFIFVLRPNEETASWLPVGNFEDKTKKGLFIVGWVPQLTIMEHPATGGFMTHCGT
NSVLEANTFGVPMITWPLYADQFYNEKVVEVNGLGIKIGIDVWNDGIEITGPVIGSAKIRE
AIERLMSSNDSEEIMNIRDRVMAMSKMAQDATDEGGSSWNNLTALIQHIKNFNLN
>SlGAME2(Xyl)
MAMEENEQSAIAHVVFIPYAMTSHITPLVHIARLLAFHGLKVTIIAPPHNALLFQSSVDRD
CLFWGSNISVRTIQFPSEEIGLPVGIENFIASPSMEIVGKVHYGFLLLQKPMEQMIRELNPN
CIISDMFFPWTVDLAEELQIPRFSFQPGTFVHQCAWVFIRELKPYENHVSFSIPGLPLDIQM
KVSEIEDFLKGETEYRKTVEDVLQAEIRSHGIIHNTCSELEPGFAQLYEKARGVKGWHIGP
VALFINNYEAENSCCDPWKGYGDCFDWLENQQSKSVLFVCFGSMIRFSDDQLKEMAVGL
KAANCPTIWVFKEQDKNGFCSKRLKEMKGENMFIIEGWAPQVSILKHGAIGGFLTHCGW
NSILESLSVGVPLITWPLFSDNFYTDKLLEKLGLAIGIGADVWNPGFILSCPSLSGEKIELAV
KRLINNSEESRNIRENAKLMAKKLKVATEEGGSSHAQLMGLIHEIKRCALKKSSL
>SlGAME18
MGNMRIVMLPYLAYGHITPFLELAKKLSNRGFSIHICSSPINLSFIKAKIPEKYSSSIHLVEL
HLPNLPELPPHHHTTNGLPNHLKQTLFKTLKMTKPQLHQILSDLKPDFFIYDIMLLWSAVV
ASSLNIPSLRFYTVNAAIFSYFFHFYFNPGEEFPFPALYMRDYELAKMTHEVADDAEVEVD
RDKVTESDKFVLVHSTKSIDGKYMDYLCGTGQAKVVPIGTESPEDGVGDVDKIDIELVK
WLEKKTEHSTVYVSFGSEYFLSKEEMEEVAYGLEVSGVDFIWVVRYQKGEQLELPQGFK
ERIGDRGRIIEGWAPQQRILKHSSIGGFVTHCGWNSTLESIEFGVPIIAMPMLYDQPLNARL
MVENGVAVEVPRDEKGNLDRVNIAEKIKHVIRDETGENLRKKMNNLGENVRSQREEEM
DGVVKVIQLLIDEKKGTL
>SlGAME17
MDNKDDVAVVIAPLHLQSHLRQLLHFACRISSYGLPVYYLGLAASNREARQHSTTLNPCD
IEKIHFHDLQIPNEDPPPDIPMHIWHASMRTREPIASFLGDISSKARRIVVVHDLLMSYNVQ
DISSYPNGESYVFHCLPIFDMYCSHYAPAAGFPIPLEEALLKRLPSNDGCYNPEDVEHHAK
YLNHCMGKNAGDIFNTSQVIDGTAIVDSMANLATMQNKKLWALGPILLTQDHDKVQTK
HFCLDWLNKQPPKSVIYVSFGTSTSFSAEQIKELAIGLELSKQKFIWVLRNADVGDPVNN
KCEENRSGTLELPEGFEERVKGVGLVVREWAPQQEIFAHPSTGGFMSHCGWTSCLESIIEG
VPIAAWPIQFDQPKNAFLVAEVLKIGLHVREWEQRNELVTASTVQNIVCKLMASEEGDMI
RKRAQELRTAVKRSTEEGGVSRMELDSFIAHITR
>StSGT3(Rha)
MAMEQNEETAMPHVVFIPYAMTSHITPLVHIARLFALHGLKVTIIAPQHNALLFQSSVDRD
RLFSGSNITVRTIQFPSEEVGLPVGIENFIASPSMEIVGKVHYGFILLQKIMEQLIREINPNCI
VSDMFFPWTVDLAEEMQIPRFSFQPATSIHQCAWVLIREFKPYKNVASDSERFLIPGLPLDI
KMKVSEIEDFLKEETEYTKTVDDVLQAEVRSHGIIHNTCSELEPGVAQLYEKARGVKGW
HIGPLALFINKYEAEISSKQISNSNINSCSDPWKGYGDCFNWLENQQPNSVLFVCFGSMIR
FSDDQLKEMAVGLKAANCPTIWVFREQDKNEVDEKDEHSDWSRNGFKEMIGEKMFIIQG
WAPQQLILKHRAIGGFLTHCGWNSILESLAIGVPLITWPLFSDNFYTDKLLETLGLAIGIGA
DVWNPGFILSCPPLSGEKIELAVKRLMNNSEESRKIRENAKLMAKKLKSATEEGGSSHSQ
LIGLIEEIKRCAFKKSS
>StSGT1(Gal)
MVATCNNGEILHVLFLPFLSAGHFIPLVNAARLFASRGVKATILTTPHNALLFRSTIDDDVR
ISGFPISIVTIKFPSAEVGLPEGIESFNSATSPEMPHKIFYALSLLQKPMEDKIRELRPDCIFSD
MYFPWTVDIADELHIPRILYNLSAYMCYSIMHNLKVYRPHKQPNLDESQSFVVPGLPDEI
KFKLSQLTDDLRKSDDQKTVFDELLEQVEDSEERSYGIVHDTFYELEPAYVDYYQKLKKP
KCWHFGPLSHFASKIRSKELISEHNNNEIVIDWLNAQKPKSVLYVSFGSMARFPESQLNEI
AQALDASNVPFIFVLRPNEETASWLPVGNLEDKTKKGLYIKGWVPQLTIMEHSATGGFMT
HCGTNSVLEAITFGVPMITWPLYADQFYNEKVVEVRGLGIKIGIDVWNEGIEITGPVIESA
KIREAIERLMISNGSEEIINIRDRVMAMSKMAQNATNEGGSSWNNLTALIQHIKNYNLN
>GmUGT73F4(Xyl)
MDLQQRPLKLHFIPYLSPGHVIPLCGIATLFASRGQHVTVITTPYYAQILRKSSPSLQLHVV
DFPAKDVGLPDGVEIKSAVTDLADTAKFYQAAMLLRRPISHFMDQHPPDCIVADTMYSW
ADDVANNLRIPRLAFNGYPLFSGAAMKCVISHPELHSDTGPFVIPDFPHRVTMPSRPPKMA
TAFMDHLLKIELKSHGLIVNSFAELDGEECIQHYEKSTGHKAWHLGPACLVGKRDQERGE
KSVVSQNECLTWLDPKPTNSVVYVSFGSVCHFPDKQLYEIACALEQSGKSFIWIVPEKKG
KEYENESEEEKEKWLPKGFEERNREKGMIVKGWAPQLLILAHPAVGGFLSHCGWNSSLE
AVTAGVPMITWPVMADQFYNEKLITEVRGIGVEVGATEWRLVGYGEREKLVTRDTIETAI
KRLMGGGDEAQNIRRRSEELAEKAKQSLQEGGSSHNRLTTLIADLMRLRDSKSAT
>BvUGT73C10
MVSEITHKSYPLHFVLFPFMAQGHMIPMVDIARLLAQRGVKITIVTTPHNAARFENVLSR
AIESGLPISIVQVKLPSQEAGLPEGNETFDSLVSTKLLVPFFKAVNMLEEPVQKLFEEMSPQ
PSCIISDFCLPYTSKIAKKFNIPKILFHGMCCFCLLCMHVLRKNREILENLKSDKEHFVVPY
FPDRVEFTRPQVPLATYVPGEWHEIKEDMVEADKTSYGVIVNTYQELEPAYANGYKEARS
GKAWTIGPVSLCNKVGADKAERGNKADIDQDECLKWLDSKEEGSVLYVCLGSICSLPLS
QLKELGLGLEESQRPFIWVVRGWEKNKELLEWFSESGFEERVKDRGLLIKGWSPQMLIL
AHHSVGGFLTHCGWNSTLEGITSGVPLLTWPLFGDQFCNQKLVVQVLKVGVSAGVEEVT
NWGEEEKIGVLVDKEGVKKAVEELMGESDDAKEIRKRVKELGQLAHKAVEEGGSSHSNI
TSLLEDIMQLAQPNN
>BvUGT73C11
MVSEITHKSYPLHFVLFPFMAQGHMIPMVDIARLLAQRGVKITIVTTPHNAARFENVLSR
AIESGLPISIVQVKLPSQEAGLPEGNETFDSLVSMELLVPFFKAVNMLEEPVQKLFEEMSPQ
PSCIISDFCLPYTSKIAKKFNIPKILFHGMCCFCLLCMHVLRKNREILENLKSDKEHFVVPY
FPDRVEFTRPQVPMATYVPGEWHEIKEDIVEADKTSYGVIVNTYQELEPAYANDYKEARS
GKAWTIGPVSLCNKVGADKAERGNKADIDQDECLKWLDSKEEGSVLYVCLGSICSLPLS
QLKELGLGLEESQRPFIWVVRGWEKNKELLEWFSESGFEERVKDRGLLIKGWSPQMLIL
AHHSVGGFLTHCGWNSTLEGITSGIPLLTWPLFGDQFCNQKLVVQVLKVGVSAGVEEVT
NWGEEEKIGVLVDKEGVKKAVEELMGESDDAKERRKRVKELGQLAQKAVEEGGSSHSN
ITSLLEDIMQLAQSNN
>BvUGT73C12
MVSEITHKSYPLHFVLFPFMAQGHMIPMVDIARLLAQRGVKITIVTTPHNAARFKNVLSR
AIESGLPISIVQVKLPSQEAGLPEGNETLDSLVSMELMIHFLKAVNMLEEPVQKLFEEMSP
QPSCIISDFCLPYTSKIAKKFNIPKILFHGMCCFCLLCMHILRKNREIVENLKSDKEHFVVP
YFPDRVEFTRPQVPVATYVPGDWHEITEDMVEADKTSYGVIVNTYQELEPAYANDYKEA
RSGKAWTIGPVSLCNKVGADKAERGNKADIDQDECLKWLNSKEEGSVLYVCLGSICNLP
LSQLKELGLGLEESQRPFIWVIRGWEKNKELHEWFSESGFEERIKDRGLLIKGWAPQMLIL
SHHSVGGFLTHCGWNSTLEGLTAGLPLLTWPLFADQFCNEKLAVQVLKAGVSAGVDQPM
KWGEEEKIGVLVDKEGVKKAVEELMGESDDAKEIRRRAKELGELAHKAVEEGGSSHSNI
TSLLEDIMQLAQSNN
>BvUGT73C13
MVSEITHKSYPLHFVLFPFMAQGHMIPMVDIARLLAQRGVKITIVTTPHNAARFENVLNR
AIESGLPISIVQVKLPSQEAGLPEGNETFDSLVSMELLVPFFKSVNMLEEPVQKLFEEMSPQ
PSCIISDFCLPYTSKIAKKFNIPKILFHGMCCFCLLCMHVLRKNHEIVENLKSDKEHFVVPY
FPDRVEFTRPQVPVATYVPGDWHEITGDMVEADKTSYGVIVNTCQELEPAYANDYKEARS
GKAWTIGPVSLCNKVGADKAERGNKADIDQDECLKWLNSKEEGSVLYVCLGSICNLPLS
QLKELGLGLEESQRPFIWVIRGWEKNKELLEWFSESGFEERIKDRGLLIKGWAPQMLILSH
HSVGGFLTHCGWNSTLEGLTAGLPLLTWPLFADQFCNEKLAVQVLKAGVSAGVDQPMK
WGEEEKIGVLVDKEGVKKAVEELMGESDDAKEIRRRAKELGELAHKAVEEGGSSHSNIT
SLLEDIMQLAQSNN
>PgUGT74AE2
MLSKTHIMFIPFPAQGHMSPMMQFAKRLAWKGVRITIVLPAQIRDSMQITNSLINTECISFD
FDKDDGMPYSMQAYMGVVKLKVTNKLSDLLEKQKTNGYPVNLLVVDSLYPSRVEMCH
QLGVKGAPFFTHSCAVGAIYYNAHLGKLKIPPEEGLTSVSLPSIPLLGRDDLPIIRTGTFPDL
FEHLGNQFSDLDKADWIFFNTFDKLENEEAKWLSSQWPITSIGPLIPSMYLDKQLPNDKG
NGINLYKADVGSCIKWLDAKDPGSVVYASFGSVKHNFGDDYMDEVAWGLLHSKYNFIW
VVIEPERTKLSSDFLAEAEEKGLIVSWCPQLEVLSHKSIGSFMTHCGWNSTVEALSLGVP
MVAVPQQFDQPVNAKYIVDVWQIGVRVPIGEDGVVLRGEVANCIKDVMEGEIGDELRGN
ALKWKGLAVEAMEKGGSSDKNIDEFISKLVSS
>PgUGT94Q2
MDNQNGRISIALLPFLAHGHISPFFELAKQLAKRNCNVFLCSTPINLSSIKDKDSSASIKLV
ELHLPSSPDLPPHYHTTNGLPSHLMLPLRNAFETAGPTFSEILKTLNPDLLIYDFNPSWAPEI
ASSHNIPAVYFLTTAAASSSIGLHAFKNPGEKYPFPDFYDNSNITPEPPSADNMKLLHDFIA
CFERSCDIILIKSFRELEGKYIDLLSTLSDKTLVPVGPLVQDPMGHNEDPKTEQIINWLDKR
AESTVVFVCFGSEYFLSNEELEEVAIGLEISTVNFIWAVRLIEGEKKGILPEGFVQRVGDRG
LVVEGWAPQARILGHSSTGGFVSHCGWSSIAESMKFGVPVIAMARHLDQPLNGKLAAEV
GVGMEVVRDENGKYKREGIAEVIRKVVVEKSGEVIRRKARELSEKMKEKGEQEIDRALE
ELVQICKKKKDEQ
>PgUGT71A29
MKSELISLAA.AIGRLVGMVEMAKLFISRHENLSVTVLIAKFYMDTGVDNYNKSLLTNPTP
RLTIVNLPESDPQNYMLKPRHAIFPSVIETQKTHVRDIISGMTQSESTRVVGLLADLLFINI
MDIANEFNVPTYVYSPAGAGHLGLAFHLQTVNDKKQDVTEFRNSDTELLVPSFANPVPAE
VLPSMYVDKEGGYDYLFSLFRKCRESKAIIINTFEELEPYAINSLRMDSMIPPIYPVGPILNL
NGDGQNADEAAVILGWLDDQPPSSVVFLCFGSYGTFQENNQVKEIAMGLERSGHRFLWS
LRSSIPKGETKLQLKYSNLKEILPVGFLDRTSCVGKVIGWAPQVAVLGHESVGGFLSHCG
WVSTLESVWCGPVATWPMYGEQQLNAFEMVKELGIAVEIEVDYKKDYFNMKNDFIVRA
KEIETKIKKLMMDENNSEIRKKVLEMKEKSRAASSENGSSYNSLAKLFEEIM
>SgUGT74AC1
MEKGDTHILVFPFPSQGHINPLLQLSKRLIAKGIKVSLVTTLHVSNHLQLQGAYSNSVKIE
VISDGSEDRLETDTMRQTLDRFRQKMTKNLEDFLQKAMVSSNPPKFILYDSTMPWVLEV
AKEFGLDRAPFYTQSCALNSINYHVLHGQLKLPPETPTISLPSMPLLRPSDLPAYDFDPAST
DTIIDLLTSQYSNIQDANLLFCNTFDKLEGEIIQWMETLGRPVKTVGPTVPSAYLDKRVEN
DKHYGLSLFKPNEDVCLKWLDSKPSGSVLYVSYGSLVEMGEEQLKELALGIKETGKFFL
WVVRDTEAEKLPPNFVESVAEKGLVVSWCSQLEVLAHPSVGCFFTHCGWNSTLEALCLG
VPVVAFPQWADQVTNAKFLEDVWKVGKRVKRNEQRLASKEEVRSCIWEVMEGERASEF
KSNSMEWKKWAKEAVDEGGSSDKNIEEFVAMLKQT
>BsYjiC
MKKYHISMINIPAYGHVNPTLALVEKLCEKGHRVTYATTEEFAPAVQQAGGEALIYHTSLN
IDPKQIREMMEKNDAPLSLLKESLSILPQLEELYKGDQPDLIIYDFVALAGKLFAEKLNVPV
IKLCSSYAQNESFQLGNEDMLKKIKEAEAEFKAYLEQEKLPAVSFEQLAVPEALNIVFMPK
SFQIQHETFDDRFCFVGPSLGERKEQEGLLIDKDDRPLMLISLGTAFNAWPEFYKMCIKAF
RDSSWQVIMSVGKTIDPESLEDIPANFTIRQSVPQLEVLEKADLFISHGGMNSTMEAMNA
GVPLVVIPQMYEQELTANRVDELGLGVYLPKEEVTVSSLQEAVQAVSSDQELLTRVKNMQ
KDVKEAGGAERAAAEIEAFMKKSAVPQ
>PgUGTpg100
MKSELIFLPVPAFGHLVGMVEMAKLFISRHENLSVTVLISKFFIDTGIDNYNKSLLAKPTPR
LTIINLPEIDPQKYLLKPRCAIFPSLIENQKTHVRDVMSRMTQSESTRVVGLLADILFVDIFD
IADEFNVPTYVYSPAGAGFLGLAFHLQTLNDDKKQDVTEFRNSDTELLVPSFANPVPAEFL
PSIFLEKDGRHDVLLSLYWRCREAKGIIVNTFEELEPYAINSLRMDSMIPPIYPVGPILNLN
GEGQNSDEAAVILGWLDDQPPSSVVFLCFGSFGSFPENQVKEIAMGLERSGHRFLWSLRP
CISEGETTLQLKYSNLELPAGFLDRTSCVGKVIGWAPQMAILAHEAVGGFVSHCGWNSVL
ESVWYGMPVATWPMYGEQQLNAFEMVKELGLAVEIEVDYRNEYNKSDFIVKADEIETKI
KKLMMDGKNSKIRKKVKEMKEKSRVAMSENGSSYTSLAKLFEEIM
>PgUGTpg101
MKSELIFLPAPAIGHLVGMVEMAKLFISRHENLSVTVLIAKFYMDTGVDNYNKSLLTNPTP
RLTIVNLPETDPQNYMLKPRHAIFPSVIETQKTHVRDIISGMTQSESTRVVGLLADLLFINI
MDIANEFNVPTYVYSPAGAGHLGLAFHLQTLNDKKQDVTEFRNSDTELLVPSFANPVPAE
VLPSMYVDKEGGYDYLFSLFRRCRESKAIIINTFEELEPYAINSLRMDSMIPPIYPVGPILNL
NGDGQNSDEAAVILGWLDDQPPSSVVFLCFGSYGTFQENQVKEIAMGLERSGHRFLWSL
RPSIPKGETKLQLKYSNLEEILPVGFLDRTSCVGKVIGWAPQVAVLGHEAVGGFLSHCGW
NSTLESVWCGVPVATWPMYGEQQLNAFEMVKELGIAVEIEVDYKNEYFNMNNDFIVRA
EEIETKIKKLMMDEKNSEIRKKVKEMKEKSRLAMSENGSSYNSLAKLFEEIM
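The codon-optimized sequences listed in this supplement should encode the same proteins as their native counterparts. As a minimal sketch of how a reader might spot-check this, the Python snippet below translates the first 60 nucleotides of SynUGTpg101 and SynPn3-29 with the standard genetic code and compares the result with the N-terminus of the PgUGTpg101 protein sequence above. The function `translate` and the variable names are illustrative assumptions and are not part of the original work; only short excerpts of the sequences are used here.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Codons containing characters other than A/C/G/T are rendered as 'X'.
    """
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nt of the two codon-optimized genes, excerpted from the listing.
syn_ugtpg101_start = (
    "ATGAAGTCCGAATTGATTTTTTTGCCAGCTCCAGCTATTGGTCATTTGGTTGGTATGGTT"
)
syn_pn3_29_start = (
    "ATGAAAAGTGAGCTAATTTTTCTACCAGCTCCAGCTATTGGTCATCTAGTTGGTATGGTT"
)

# First 20 residues of the PgUGTpg101 protein sequence.
pgugtpg101_start = "MKSELIFLPAPAIGHLVGMV"

print(translate(syn_ugtpg101_start) == pgugtpg101_start)
print(translate(syn_pn3_29_start) == pgugtpg101_start)
```

The same function can be applied to a full-length coding sequence once its header and page-break fragments have been rejoined; a mismatch would then point to either an extraction error or a genuine amino-acid difference between variants.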