Download - (2525) Structural Characterization Of Glycopeptides: Integration Of ...

Transcript
Page 1: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

l h f Structural characterization of glycopeptides: Integration of g y p p g

complementary CID and ETD based information.information.

K.F. Medzihradszky1,2, A.J. Lynn1, P. Baker1, R.J. Chalkley1, Z. Darula2 and A.L. Burlingame1.

1Mass Spectrometry Facility, University of California San Franciscop y y y2Proteomics Research Group, Biology Research Center of the Hungarian

Academy of Sciences, Szeged, Hungary.

58th ASMS

Page 2: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

O-glycosylationg y y

Two functionally different types:1. Long-chain modifications of extracellular proteins2. Single residue regulatory modification of

intracellular proteins – O GlcNAcylation (analogous intracellular proteins – O-GlcNAcylation (analogous to phosphorylation)

O-GlcNAcylation is increasingly studied for its regulatory role and interaction with phosphorylation.

Secreted O-glycosylation is a somewhat neglected research area, even though aberrant glycosylationhas been linked to cancer and Alzheimer’s diseasehas been linked to cancer and Alzheimer s disease.

Page 3: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

The study of both types of O-glycosylation present similar analytical challenges by mass spectrometry.

All O-linked glycopeptides display characteristic fragmentation in CID:

• The glycosydic bond is weaker than the peptide bonds • The glycosydic bond is weaker than the peptide bonds, thus the spectrum is usually dominated by ions present due to carbohydrate losses and non-reducing end oxonium i ions.

• Gas-phase deglycosylaton occurs - the sugar is eliminated without leaving any telltale sign on the originally modified without leaving any telltale sign on the originally modified amino acid, making site assignments impossible in most instances. (Figure 1).

MS/MS analysis utilizing ETD permits both the identification of the modified sequence and the identification of the modified sequence and the unambiguous determination of the modification site(s) (Figures 2 & 3) [2,3].

Page 4: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

P i / 714 8520(4 )

Figure 1: CID of SAGalGalNAc-Modified Peptide

T8110318 #2241 RT: 30.40 AV: 1 NL: 2.36E4T: ITMS + c NSI d Full ms2 [email protected] [185.00-2000.00]

100855.96

Precursor ion = m/z 714.8520(4+).

(3+)MH+-SA

80

90801.95

(3+)MH+-SAGal MH+-SAGalGalNAc

50

60

70

e A

bund

ance

(4+)

30

40

Rel

ativ

e

911.53642.10

766.03

601.76

( )

(4+)

(3+)

300 400 500 600 700 800 900 1000 11000

10

20 518.40509.48 734.27

274.14684.79

1035.39 1100.91828.06500.51 1165.57866.37 975.34415.08375.05 553.74

(2+)

300 400 500 600 700 800 900 1000 1100m/z

•Sugar structure can be identified, but neither peptide sequence nor modification site can be determined.

Page 5: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

Figure 2: ETD of Same SAGalGalNAc-Modified Peptide as Figure 1.

T8110318 #2242 RT: 30.40 AV: 1 NL: 1.48E4T: ITMS + c NSI d Full ms2 [email protected] [50.00-2000.00]

90

100953.29

c10/c15(2+)

60

70

80

nce 715.10

(2+) c14(2+) c17

40

50

60

Rel

ativ

e A

bund

an

1430.22359 z3

(3+) -SA

(2+) c17

(2+) z18

766.3 c6

783.3 z7

10

20

30

1106.82939.17 1250.311046.44 1149 68 1307 09854 94394 38 646 13

c2 c3 z4 z6z10 z12

(2+) z14

400 600 800 1000 1200 1400 1600 1800m/z

0

1149.68 1307.09854.94394.38 646.13472.15247.17 1709.66 1934.591521.97

z10 z12

924.7 c8

636KTFMLQASQPAPT(GalNAcGalSA)HSSLDIK655 from Inter-alpha-trypsin inhibitor heavy chain H1 precursor was confidently identified.

Page 6: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

Sample Preparation & Mass Spectrometry

* GlcNAc-modified glycopeptides were enriched on a wheat germ agglutinin column as published earlier [1]. * Secreted glycopeptides bearing mucin core-1 type Secreted glycopeptides bearing mucin core 1 type structures were isolated by Jacalin-affinity chromatography, and were analyzed intact or after di ti t l l th G lNA [2] digestion to leave only the GalNAc core [2].

MS analyses were performed on an LTQ-Orbitrap mass y p pspectrometer in LC/MS mode. The instrument was operated in a data-dependent fashion: MS acquisitions were followed by CID and ETD analyses of 3 software-selected multiply by CID and ETD analyses of 3 software selected multiply charged ions. Precursor masses were measured in the Orbitrap, while MS/MS experiments were performed and th f m nts m s d in th lin t pthe fragments were measured in the linear trap.

Page 7: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

Data-processing I• Raw data were converted into peaklists using in-house

software, PAVA. Database searching was performed against the UniProt database supplemented with a random sequence pp qfor each entry, and species specified as Bos taurus (31074 entries searched).

• Trypsin was specified as the enzyme, 1 missed cleavage and Trypsin was specified as the enzyme, 1 missed cleavage and non-specific cleavage at one of the peptide termini were permitted.

• Mass accuracy: 15 ppm for precursor ions and 0 6 Da for Mass accuracy: 15 ppm for precursor ions and 0.6 Da for fragment ions.

• Carbamidomethylation of Cys = fixed modification; variable modifications: acetylation of protein N termini; Met modifications: acetylation of protein N-termini; Met oxidation; cyclization of N-terminal Gln residues; and HexHexNAc or SAHexHexNAc (for intact glycopeptides) and HexNAc (for partially deglycosylated or GlcNAc-and HexNAc (for partially deglycosylated or GlcNAcmodified peptides) modification on Thr and Ser residues. 3 modifications per peptide.

Page 8: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

Data-processing II• The same peaklists were used by 2 versions of the search

engine, Protein Prospector: v.5.3 and v5.4:• Scoring in v5.3 was optimized for tryptic peptides• Scoring in v5.4 uses different weighting depending on

precursor charge and basic residue locationprecursor charge and basic residue location.

• Acceptance criteria: Protein score: 22, E>0.05; peptide Acceptance criteria Protein score 22, E 0.05; peptide score: 15, E>0.1.

P i P ( 5 5) i bli l il bl• Protein Prospector (v5.5) is publicly available:

http://prospector ucsf eduhttp://prospector.ucsf.edu

Page 9: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

New features in Protein Prospector v5.4 t d T d (P t 2781)presented on Tuesday (Poster 2781)

1) Neutral loss peaks from charge reduced species are d i t d t b hi removed prior to database searching.

2)Multiply-charged fragment ions are now considered in ETD data (for precursors of charge state 3+ or higher). ( p g g )

3)New scoring was introduced that gives different fragment ion type weighting depending on the precursor ion charge state and presence of basic residues at the ion charge state and presence of basic residues at the peptide termini.

Page 10: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

Re-interrogation of Published [2] Datasets

Modified with SA1 0GalGalNAc: 16 LC/MS/MS files

Table 1. Comparison of the performance of different versions of ProteinProspector BatchTag. Version 5 3 3 Version 5 4 2 improvement

Modified with SA1-0GalGalNAc 16 LC/MS/MS files

Comparison of Number of Glycopeptides Identified at Different Acceptance Thresholds Version 5.3.3 Version 5.4.2 improvement

E<0.10 21 61 2.9 E<0.05 16 42 2.63

Intact glycopeptides

E<0.01 10 28 2.8 E<0.10 23 35 1.52Partially E 0.10 23 35 1.52E<0.05 21 30 1.42

Partially deglycosylated peptides E<0.01 17 23 1.35

Only the core GalNAc retained: 1 LC/MS/MS file

Page 11: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

Figure 3: One of the new glycosylation sites identified: Thr-605 of Kininogen-1

T8110314 #1483 RT: 25.77 AV: 1 NL: 1.98E3T: ITMS + c NSI d Full ms2 [email protected] [50.00-2000.00]

100718.23

x5

717 8379 (4 )

70

80

90 717.8379 (4+)

c11(2+)

c17(2+)c111296 6

50

60

tive

Abu

ndan

ce

c10(2+)

c11(2+)649

-58

-17(3+)

1296.6

20

30

40

Rel

at 1435.62

957.08

937.60407.20

1075.98620.31 1405 83921 74

z2278 z3

c9(2+)

c10(2+)(3+)

(2+)z13

c16(2+)

400 600 800 1000 1200 1400 1600 1800 2000m/z

0

10

1405.83921.74 1231.90563.45 1035.21 1381.83505.79362.27 1136.05897.77801.41 1472.70 1760.85 1843.101572.93

c3 z4c6

z13z11

y10y14(2+)

m/z

C(Carbamidomethyl)PSRPWKPVNGVNPT(HexNAcHexSA)VEM(Oxidation)K

Page 12: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

Filtering for glycopeptides• CID data provides useful information to support ETD identification:p pp

1. Characteristic sugar losses indicate the presence of the carbohydrate

2. Limited peptide fragmentation may aid ID confirmation.

• Identifying the characteristic neutral losses could be automated:• Input: separated CID and ETD peaklists in mgf format.

P l s i t “fi d t l l ss l” ( si 0 2 1)• Perl script “find_neutral_loss.pl” (version 0.2.1):1. Identifies CID spectra in which peaks corresponding to neutral

losses of sugar moieties (for example, -203.1 or -291.1 Da, corresponding to HexNAc or sialic acid, respectively) from the p g , p y)precursor ion are observed. The script looks for 1 or 2 losses at all the possible charge states (1+ to zn-1, where n is the precursor charge).

2 Identifies corresponding ETD spectra of the same precursor mass 2. Identifies corresponding ETD spectra of the same precursor mass in the ETD MGF file to create a subset MGF file that only contains ETD spectra of potential glycopeptides.

• Mass of the neutral loss, the threshold for how many of the most intense peaks are searched for the neutral loss peak and the intense peaks are searched for the neutral loss peak and the precursor retention time window for correlating ETD and CID spectra can all be altered.

Page 13: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

Testing the filtering scriptTesting the filtering script

• CID data from LC/MS/MS experiments of fractions f p f fenriched in GalNAc- or GlcNAc-modified peptides were screened for glycopeptides

• ~90% of the unmodified peptides were eliminated• ~5% of the glycopeptides may be affected: either

eliminated or identified from a different charge eliminated or identified from a different charge state

Page 14: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

m/z 203 07

Figure 4: False-Positive Glycopeptide “IDd” by the Script

90

95

100539.0

(3+)

SDPDQGVEVTGHFETAKb2

m/z 203.07

y15

70

75

80

85

90

Precursor – 203 Da?

b2 y15

45

50

55

60

65

ativ

e A

bund

ance

(2+)

25

30

35

40

45

Rel

a

807.9

739.3

(2+)

0

5

10

15

20597.0

784.4 834.0707.21001.6667.1490.3 1083.9452.4 843.3258.2 287.1 344.3191.1 928.6 1174.5

200 300 400 500 600 700 800 900 1000 1100m/z

0

Unmodified peptides may produce glycopeptide-like CID fragmentation !

Page 15: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

778.750% z6* z8*/c9*1556.9

Figure 5: False-Positive Glycopeptide “ousted” by the scripte

inte

nsity

983.5

1279 6z9

8 9

z111539.7ETD

Rel

ativ

e

912.51192.5

1279.6

529 3

z4

z8

1426.6

z101295.8

y9

1184.4

c8

1167.7

b8z7*

z10y11 z12

1398.4

c12

1600140012001000800600m/z

1095.7

529.3

100%

e in

tens

ity

y9

y6

b4-H2O

b2-NH3

CID

Rel

ativ

e

799.4

244.3330.1 443.2

535.3 548.4

y92+

b3-H2O

2 3

140012001000800600400m/z

LFSLPYS(GlcNAc)RTRL vs. LFSLPAQPLWNNR

Page 16: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

SummarySummary• The altered ETD searches with Protein

Prospector yielded superior results from the very same data“I ti ” CID d t ith th h l f • “Incorporating” CID data with the help of a glycopeptide-identifying script may boost the confidence level of glycopeptide confidence level of glycopeptide identifications

• Linking different charge states of the same g gcomponent will be beneficial, since different charge states may produce the best CID and ETD dataETD data

Page 17: (2525) Structural Characterization Of Glycopeptides: Integration Of ...

References

1. O-linked N-acetylglucosamine proteomics of postsynaptic density preparations using lectin weak affinity chromatography and mass spectrometry. Vosseller K, Trinidad JC, Chalkley RJ, Specht CG, Thalhammer A, Lynn AJ, Snedecor JO, Guan S, Medzihradszky KF, M ltb DA S h f R B li AL M l C ll P t i 5 923 34 Maltby DA, Schoepfer R, Burlingame AL. Mol Cell Proteomics. 5, 923-34 (2006).

2 Affinity enrichment and characterization of mucin core 1 type 2. Affinity enrichment and characterization of mucin core-1 type glycopeptides from bovine serum. Darula Z, Medzihradszky KF. Mol Cell Proteomics. 8, 2515-26 (2009).

3. Identification of protein O-GlcNAcylation sites using electron transfer dissociation mass spectrometry on native peptides. Chalkley RJ, Thalhammer A, Schoepfer R, Burlingame AL. Proc Natl Acad Sci U S A., p , g106, 8894-9 (2009).

Acknowledgement

This work was supported by NIH NCRR grant P41RR001614 (to ALB) and Hungarian Science Foundation grants OTKA T60283 (to KFM)