Yana Safonova - Институт...
Transcript of Yana Safonova - Институт...
![Page 1: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/1.jpg)
Immunoinformatics: application of algorithmic approaches to
solving immunological problems
Yana Safonova
Center for Algorithmic BiotechnologySt. Petersburg State University
![Page 2: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/2.jpg)
Outline
● Introduction● Repertoire construction problem● Evolutionary analysis of antibodies● Analysis of immune response dynamics● Analysis of paired antibody repertoires & new biological
insights from analysis of paired repertoires
![Page 3: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/3.jpg)
Innate & adaptive immune system
cell-mediated immune response
humoral immune
response
3
![Page 4: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/4.jpg)
Antibody & antigen
Antigen recognition
![Page 5: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/5.jpg)
Antibody & antigen
Antigen recognition Antibody - antigen binding
![Page 6: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/6.jpg)
Antibody & antigen
Antigen recognition Antibody - antigen binding
1. Antigen neutralization
![Page 7: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/7.jpg)
Antibody & antigen
Antigen recognition
2. Destroying antigen by immune cells
Antibody - antigen binding
1. Antigen neutralization
![Page 8: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/8.jpg)
Once you’ve met an antigen,
your adaptive immune system never forgets it!
![Page 9: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/9.jpg)
This principle is used for vaccine design:
Real antigens
Once you’ve met an antigen,
your adaptive immune system never forgets it!
![Page 10: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/10.jpg)
This principle is used for vaccine design:
Real antigens Vaccine
Once you’ve met an antigen,
your adaptive immune system never forgets it!
![Page 11: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/11.jpg)
![Page 12: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/12.jpg)
Where do antibody live?
![Page 13: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/13.jpg)
Antibody repertoiresThere is a billion of B-cells circulating in human blood at any given moment (out of 1018 estimated antibodies)
Analysis of concentrations of all antibodies in the organism (antibody repertoire) is a fundamental problem in immunology
13
While generation of antibody repertoires provides a new avenue for antibody drug development, it remains unclear how to construct antibody repertoires from NGS data
![Page 14: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/14.jpg)
V(D)J recombination
14
Antibodies are produced by B-cells, each with unique genome:
IGH locus in human genome (1 MB length)
![Page 15: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/15.jpg)
Antibodies are produced by B-cells, each with unique genome:
15
Antibody somatic recombination
![Page 16: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/16.jpg)
Antibodies are produced by B-cells, each with unique genome:
16
Antibody somatic recombination
![Page 17: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/17.jpg)
17
Antibodies are produced by B-cells, each with unique genome:
Antibody somatic recombination
![Page 18: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/18.jpg)
Antibodies are produced by B-cells, each with unique genome:
18
Antibody somatic recombination
![Page 19: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/19.jpg)
Antibodies are produced by B-cells, each with unique genome:
19
Antibody somatic recombination
Random insertions/deletions
![Page 20: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/20.jpg)
Antibodies are produced by B-cells, each with unique genome:
20
Antibody somatic recombination
![Page 21: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/21.jpg)
Antibodies are produced by B-cells, each with unique genome:
21
Antibody somatic recombination
![Page 22: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/22.jpg)
Antibodies are produced by B-cells, each with unique genome:
22
Antibody somatic recombination
![Page 23: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/23.jpg)
Antibodies are produced by B-cells, each with unique genome:
23
Antibody somatic recombination
![Page 24: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/24.jpg)
Antibodies are produced by B-cells, each with unique genome:
24
Antibody somatic recombination
![Page 25: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/25.jpg)
Antibodies are produced by B-cells,each with unique genome:
25
Antibody somatic recombination
Random insertions/deletions
![Page 26: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/26.jpg)
Somatic recombination results in unique immunoglobulins genes encoding amino acid sequence of antibodies
26
Antibody somatic recombination
![Page 27: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/27.jpg)
An antibody recognizes a foreign agent (antigen) using its antigen-binding site
27
Antibody versus antigen
![Page 28: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/28.jpg)
The most diverged part of antigen-binding site is complementarity determining region 3 (CDR3)
28
CDR3
Antigen binding site in antibody
![Page 29: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/29.jpg)
29
Further optimization of antibody affinity is achieved through somatic hypermutations
CDR3
Somatic hypermutations
![Page 30: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/30.jpg)
30
CDR3
...many somatic hypermutations
![Page 31: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/31.jpg)
Architecture of antibodies
![Page 32: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/32.jpg)
From biological problems to computational challenges
VDJ classification problem. Given an antibody generated from a known set of V, D, and J segments, identify what specific V, D, and J segments generated this antibody
![Page 33: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/33.jpg)
From biological problems to computational challenges
VDJ classification problem. Given an antibody generated from a known set of V, D, and J segments, identify what specific V, D, and J segments generated this antibody
![Page 34: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/34.jpg)
From biological problems to computational challenges
VDJ classification problem. Given an antibody generated from a known set of V, D, and J segments, identify what specific V, D, and J segments generated this antibody
Important model organisms in immunology with still unknown sets of V, D, and J segments
![Page 35: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/35.jpg)
From biological problems to computational challenges
VDJ classification problem. Given an antibody generated from a known set of V, D, and J segments, identify what specific V, D, and J segments generated this antibody
VDJ reconstruction problem. Given a set (millions) of antibodies generated from an unknown set of V, D, and J segments, reconstruct these sets
![Page 36: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/36.jpg)
Outline
● Introduction● Repertoire construction problem● Evolutionary analysis of antibodies● Analysis of immune response dynamics● Analysis of paired antibody repertoires & new biological
insights from analysis of paired repertoires
![Page 37: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/37.jpg)
Roche 454(2005)
low coverage
low accuracy
long reads
VDJ classification
37
Sequencing of antibody repertoire
VDJ classification
![Page 38: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/38.jpg)
Roche 454(2005)
low coverage
low accuracy
long reads
VDJ classification
Sequencing of antibody repertoire
Illumina HiSeq 2000
(2001)
high coverage
high accuracy
short reads
+ CDR3 classification
CDR3 classification
VDJ classification
![Page 39: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/39.jpg)
Roche 454(2005)
low coverage
low accuracy
long reads
VDJ classification
Sequencing of antibody repertoire
Illumina MiSeq
(2013)
med. coverage
high accuracy
long reads
+ full-length classification
Illumina HiSeq 2000
(2001)
high coverage
high accuracy
short reads
+ CDR3 classification
full-length classification
CDR3 classification
VDJ classification
![Page 40: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/40.jpg)
Illumina MiSeq
(2013)
med. coverage
high accuracy
long reads
+ full-length classification
Illumina HiSeq 2000
(2001)
high coverage
high accuracy
short reads
+ CDR3 classification
Roche 454(2005)
low coverage
low accuracy
long reads
VDJ classification
40
HiSeq Rapid SBS Kit v2
(2015)
high coverage
high accuracy
long reads
+ high throughput
Sequencing of antibody repertoire
high throughput
full-length classification
CDR3 classification
VDJ classification
![Page 41: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/41.jpg)
41
MiGEC: Shugay et al., Nat Methods, 2014MiXCR: Bolotin et al., Nat Methods, 2015
IMSEQ: Kuchenbecker et al., Bioinformatics, 2015IgRepertoireConstructor: Safonova et al., Bioinformatics, 2015
Full-length antibody classification(repertoire construction)
In contrast to well-studied VDJ and CDR3 classification, full-length antibody classification takes into account the entire variable region of antibody
![Page 42: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/42.jpg)
Repertoire construction problem
● Giant read clustering problem● Giant error correction problem
![Page 43: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/43.jpg)
What makes this clustering problem difficult?
x 1018
High repetitiveness High mutation rate
Huge repertoire size Uneven distribution of abundances
● Global coverage threshold cannot be used for error correction● Sequencing errors often look like natural variations
![Page 44: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/44.jpg)
Outline
● Introduction● Repertoire construction problem● Evolutionary analysis of antibodies● Analysis of immune response dynamics● Analysis of paired antibody repertoires & new biological
insights from analysis of paired repertoires
![Page 45: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/45.jpg)
Secondary diversification of antibodies
![Page 46: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/46.jpg)
● B-cell lineages reflect evolutionary development of antibodies
46
Clonal analysis of antibody repertoire
![Page 47: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/47.jpg)
47
Clonal analysis of antibody repertoire ● B-cell lineages reflect
evolutionary development of antibodies
● Lineage can be represented as a clonal tree
![Page 48: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/48.jpg)
48
Clonal analysis of antibody repertoire ● B-cell lineages reflect
evolutionary development of antibodies
● Lineage can be represented as a clonal tree
● Some intermediate clones may be missing in the repertoire
![Page 49: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/49.jpg)
Standard phylogenetic algorithms assume that all species are represented by leaves and should be adapted for clonal trees
49
Clonal analysis of antibody repertoire
![Page 50: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/50.jpg)
Who is the ancestor here?
50
germline segments
![Page 51: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/51.jpg)
Who is the ancestor here?
51
1
2
New hypermutaions
![Page 52: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/52.jpg)
Who is the ancestor here?
52
1
2
Shared hypermutations
New hypermutaions
![Page 53: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/53.jpg)
53
Another example: who is the ancestor here?
![Page 54: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/54.jpg)
54
Another example: who is the ancestor here?
Individual hypermutations 1
Individual hypermutations 2
![Page 55: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/55.jpg)
55
Ancestral antibody may be missing…
Shared hypermutaions
1
2
Ancestral antibody is not present in the repertoire
Individual hypermutations 1
Individual hypermutations 2
![Page 56: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/56.jpg)
What is the evolutionary tree?
9 antibody sequences share CDR3 and differ by SHMs in V segments
Hypermutations (SHMs) in V segment
![Page 57: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/57.jpg)
Any tree reconstruction approach will work
+1 +3 +1 +2
+2 +4
+3+1
Nested SHMs define directions of edges between antibodies in the clonal tree
![Page 58: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/58.jpg)
Repertoire construction step is very important for clonal analysis!
![Page 59: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/59.jpg)
Repertoire construction step is very important for clonal analysis!
![Page 60: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/60.jpg)
SHMs in V segments are easy to find
D J
• One can easily identify mutations in the V segment using alignment against the template (germline V segment)
![Page 61: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/61.jpg)
SHMs in CDR3 are difficult to identify
• One can easily identify mutations in the V segment using alignment against the template (germline V segment)
• But there is no template for CDR3!
![Page 62: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/62.jpg)
SHMs in CDR3 are difficult to identify
• One can easily identify mutations in the V segment using alignment against the template (germline V segment)
• But there is no template for CDR3!○ deletions in gene segments○ non-genomic VD and DJ insertions○ addition of palindromes
![Page 63: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/63.jpg)
63
A more complex case: who is the ancestor?
CDR3
![Page 64: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/64.jpg)
64
A more complex case: who is the ancestor?
CDR3
?
![Page 65: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/65.jpg)
65
A more complex case: who is the ancestor?
1
2
Information about VDJ scenarios allows us to make the a choice:● Antibodies 1 and 2 belong to the same lineage
CDR3
?
![Page 66: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/66.jpg)
66
A more complex case: who is the ancestor?
Information about VDJ scenarios allows us to make the right choice:● Antibodies 1 and 2 belong to the same lineage ● Antibodies 1 and 2 are not related
CDR3
?
1
2
![Page 67: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/67.jpg)
Another puzzle
4 antibodies share SHMs in V segments but differ in CDR3s
![Page 68: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/68.jpg)
Another puzzle
● It is unclear how to select direction between two similar CDR3s● It is unclear whether two similar CDR3s belong to a single clonal
tree or not
![Page 69: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/69.jpg)
Murugan et al., PNAS, 2012
Why do we need a VDJ probabilistic model?
V D J
To compute VDJ scenario, we need to:
● perform VDJ classification to find germline segments (well-studied problem)
● specify deletions in gene segments● specify non-genomic insertions● specify addition of palindromes
![Page 70: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/70.jpg)
Murugan et al., PNAS, 2012
Why do we need a VDJ probabilistic model?
V D J
To compute VDJ scenario, we need to:
● perform VDJ classification to find germline segments (well-studied problem)
● specify deletions in gene segments● specify non-genomic insertions● specify addition of palindromes
Recombination events are not distributed uniformly
![Page 71: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/71.jpg)
Murugan et al., PNAS, 2012
Why do we need a VDJ probabilistic model?
V D J
To compute VDJ scenario, we need to:
● perform VDJ classification to find germline segments (well-studied problem)
● specify deletions in gene segments● specify non-genomic insertions● specify addition of palindromes
We need a probabilistic VDJ recombination model for a realistic description of these events
Recombination events are not distributed uniformly
![Page 72: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/72.jpg)
Why do we need an SHM probabilistic model?
SHM hotspots such as the degenerative 4-mers:
trigger mutations in antibodies Somatic hypermutagenesis engages AID enzyme that changes immunoglobulin genes to improve antibody affinity
Rogozin and Kolchanov, Biochimica et Biophysica Acta, 1992
AT
AG C C
T
![Page 73: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/73.jpg)
Building probabilistic SHM model
● The SHM model takes into account both the mutated nucleotide and its neighbours
● Detect new hot spots and compares SHMs in IG chains
Yaari et al., Front Immunol, 2013
5-mer Freq A C G T
ACAAC 83 − 0.24 0.48 0.28
GGCGT 1742 0.22 − 0.12 0.66
CCGTC 12 0.35 0.52 − 0.13
TCTCC 516 0.32 0.54 0.14 −
![Page 74: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/74.jpg)
Building probabilistic SHM model
● The SHM model takes into account both the mutated nucleotide and its neighbours
● Detect new hot spots and compares SHMs in IG chains
Yaari et al., Front Immunol, 2013
TCTCC 5-mer profiles for IGL, IGH, and IGK chains aggregated over 60 datasets
5-mer Freq A C G T
ACAAC 83 − 0.24 0.48 0.28
GGCGT 1742 0.22 − 0.12 0.66
CCGTC 12 0.35 0.52 − 0.13
TCTCC 516 0.32 0.54 0.14 −
![Page 75: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/75.jpg)
Outline
● Introduction● Repertoire construction problem● Evolutionary analysis of antibodies● Analysis of immune response dynamics● Analysis of paired antibody repertoires & new biological
insights from analysis of paired repertoires
![Page 76: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/76.jpg)
Time series
Laserson et al, PNAS, 2014
![Page 77: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/77.jpg)
Clonal analysis in time
Clonal analysis of time series of antibody repertoire allows one to estimate efficiency of immune response
Sequencing data provided by
before immunization
right after immunization
highestimmune response
![Page 78: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/78.jpg)
Outline
● Introduction● Repertoire construction problem● Evolutionary analysis of antibodies● Analysis of immune response dynamics● Analysis of paired antibody repertoires & new
biological insights from analysis of paired repertoires
![Page 79: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/79.jpg)
Clonal analysis for antibody repertoire
Sequencing data provided by
![Page 80: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/80.jpg)
Sequencing data provided by
Clonal analysis for paired antibody repertoire
![Page 81: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/81.jpg)
● utilizes information about chain pairing to construct paired clonal tree● reveals that, contrary to previous views, B-cells often co-express
multiple heavy and light chains.
Sequencing data provided by
Clonal analysis for antibody repertoire
![Page 82: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/82.jpg)
Light chain duality
co-expression of both kappa and lambda chains by a single B-cell
Pelanda et al., Cur Opin Immunol, 2014Giachino et al., J Exp Med, 1995
![Page 83: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/83.jpg)
Allelic inclusion
production of chains from both haplomes by B-cells
Casellas et al., J Exp Med, 2007Beck-Engeser et al., PNAS, 1987
![Page 84: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/84.jpg)
Duality + allelic inclusion
A single B-cell may express multiple chains due to allelic inclusions and/or light chain duality
![Page 85: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/85.jpg)
Multi-chain effect
Multi-chain effect: B-cell can express up to 6 different chains:
A single B-cell may express multiple chains due to allelic inclusions and/or light chain duality
![Page 86: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/86.jpg)
Multi-chain effect
A single B-cell may express multiple chains due to allelic inclusions and/or light chain duality
?
?
which ones
participate in the real pairing?
Multi-chain effect: B-cell can express up to 6 different chains:
![Page 87: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/87.jpg)
Multi-chain effect is common in healthy B-cells!
one heavy chain (IGM) IGK + IGL
one heavy chain (IGM +IGD) IGK + IGL
one heavy chain (IGA) IGK + IGL
25% (!) of B-cells with known pairing have allelic inclusions and/or light chain duality
two heavy chains and single light chain
two heavy chains and multiple light chains
![Page 88: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/88.jpg)
Clonal analysis reveals true chain pairing
K1 H1 L1
Cells 1, 2, and 3 express identical heavy, kappa and lambda chains. Thus, 1, 2, and 3 are clones of the same B-cell
Which light chain contributes to the antibody: kappa or lambda?
1 2 3
Example from AbVitro sequencing data
![Page 89: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/89.jpg)
Clonal analysis reveals true chain pairing
K1 H1 L1
L2
1 2 3
Cell 4 shares heavy and kappa chains with cells 1, 2 and 3, but has different lambda chain (L2)
4
![Page 90: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/90.jpg)
K1 H1 L1
L2
1 2 3
Alignment of L1 and L2 reveals that L1 is an ancestor of L2
Thus, cell 4 is a descendant of cells 1, 2, and 3
4
Clonal analysis reveals true chain pairing
Cell 4 shares heavy and kappa chains with cells 1, 2 and 3, but has different lambda chain (L2)
![Page 91: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/91.jpg)
K1 H1 L1
L2
1 2 3
4
Clonal analysis reveals true chain pairingClonal analysis reveals true chain pairing
Alignment of L1 and L2 reveals that L1 is an ancestor of L2
Thus, cell 4 is a descendant of cells 1, 2, and 3
Evolution of L1 into L2 provides evidence that cells 1, 2, 3, and 4 generate functional antibodies
![Page 92: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/92.jpg)
K1 H1 L1
L2
1 2 3
4But it contradicts with a fact that H1 is non-productive
Clonal analysis reveals true chain pairing
Evolution of L1 into L2 provides evidence that cells 1, 2, 3, and 4 generate functional antibodies
Alignment of L1 and L2 reveals that L1 is an ancestor of L2
Thus, cell 4 is a descendant of cells 1, 2, and 3
![Page 93: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/93.jpg)
K1 H1 L1
L2
K2H2
1 2 3
4
Cell 5 expresses heavy and kappa chains
5
There are more B-cells to analyze!
![Page 94: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/94.jpg)
K1 H1 L1
L2
K2
K3
H2
1 2 3
4
5
K2 and K1 have originated from a an unknown kappa chain K3 that is missing in the repertoire
There are more B-cells to analyze!
![Page 95: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/95.jpg)
We are not done yet…
K1 H1 L1
L2
K2
K3
H2
H3K4
L3
1 2 3
4
5
Cell 6 expresses heavy, kappa and lambda chains
6
![Page 96: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/96.jpg)
K1 H1 L1
L2
K2
K3
H2
H3K4
L3
1 2 3
4
5
Alignment reveals that H3 is an ancestor of H2
6
We are not done yet…
![Page 97: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/97.jpg)
K1 H1 L1
L2
K2
K3
H2
H3K4
L3
1 2 3
4
5
K4 is an ancestor of a virtual chain K3
6
We are not done yet…
![Page 98: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/98.jpg)
K1 H1 L1
L2
K2
K3
H2
H3K4
L3
1 2 3
4
5
L3 is an ancestor of L16
We are not done yet…
![Page 99: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/99.jpg)
Evolutionary analysis helps to understand true chain pairing
H1 lineage is non-productive, so it does not participate in pairing
Lineage H3 → H2 is more likely to participate in chain pairing
![Page 100: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/100.jpg)
Evolutionary analysis helps to understand true chain pairing
● Lambda lineage contain synonymous mutations
● Mutations in lambda lineage are grouped into CDRs
● Mutations in kappa chain are distributed randomly along variable region
Lambda lineage undergoes selection, thus it more likely participates in chain pairing
![Page 101: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/101.jpg)
Evolutionary analysis helps to understand true chain pairing
Using information about clonal lineages for H, K and L chains and the SHM model, we can select the most likely chain pairing
H3 → H2L3 → L1 → L2
![Page 102: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/102.jpg)
AlexanderShlemov
AndreyBzikadze
Sergey Bankevich
Alla Lapidus
Pavel A.Pevzner
TimofeyProdanov
AndreySlabodkin
Yana Safonova
![Page 103: Yana Safonova - Институт биоинформатикиbioinformaticsinstitute.ru/sites/default/files/2807yanasafonova.pdf · IgRepertoireConstructor: Safonova et al., Bioinformatics,](https://reader035.fdocuments.in/reader035/viewer/2022071020/5fd415a579ff91782318bf7d/html5/thumbnails/103.jpg)
Thank you!
103