Relatedness part 2 - Familias

18
Magnus Dehli Vigeland Statistical methods in genetic relatedness and pedigree analysis NORBIS course, 6 th – 10 th of January 2020, Oslo Relatedness part 2: Why are some siblings more alike than others?

Transcript of Relatedness part 2 - Familias

Page 1: Relatedness part 2 - Familias

Magnus Dehli Vigeland

Statistical methods in genetic relatedness and pedigree analysis

NORBIS course, 6th – 10th of January 2020, Oslo

Relatedness part 2: Why are some siblings more alike than others?

Page 2: Relatedness part 2 - Familias

Plan

• Revisiting the meiosis

– Recombination and crossovers

– Morgan and centiMorgan

• Back to the coefficients: Variation!

• Summary

Statistical methods in genetic relatedness and pedigree analysis

Page 3: Relatedness part 2 - Familias

Recombination

• Crossovers:

• The genetic distance between two loci:

= average number of crossovers between them per meiosis

• Units:

– 1 Morgan (M) = 1 crossover per meiosis (on average)

– 1 centiMorgan (cM) = 0.01 M

• The human genome: Ca 30 Morgan

Statistical methods in genetic relatedness and pedigree analysis

Page 4: Relatedness part 2 - Familias

Rule of thumb: One crossover per chromosome arm

Statistical methods in genetic relatedness and pedigree analysis

Page 5: Relatedness part 2 - Familias

How does this work in practice?

Statistical methods in genetic relatedness and pedigree analysis

Page 6: Relatedness part 2 - Familias

How does this work in practice?

Take home message: • IBD is not a pointwise concept!• IBD in a particular locus → IBD in the surrounding region (to closest crossovers)

Statistical methods in genetic relatedness and pedigree analysis

1

0

1

2

1

IBD status

Page 7: Relatedness part 2 - Familias

Simulation of IBD sharing

between full siblings

IBD from father

IBD from mother

Statistical methods in genetic relatedness and pedigree analysis

Realised IBD coefficients: Proportions of genome with IBD = 0, 1, 2

Page 8: Relatedness part 2 - Familias

Realised inbreeding coefficient

• Recall the definition:f = P( autozygosity in random locus )

= expected fraction that is autozygous

• Empirical/realised/observed inbreeding:freal = fraction which is actually autozygous

• Considerable variation – between chromosomes

– between individuals

– between species (depends on genome length!)

Statistical methods in genetic relatedness and pedigree analysis

Page 9: Relatedness part 2 - Familias

Realised inbreeding: natural variation

1000 simulations • R-package: ibdsim2• Decode recombination map• All 22 human autosomes

Statistical methods in genetic relatedness and pedigree analysis

Page 10: Relatedness part 2 - Familias

Distribution of realised IBD coefficients

1000 simulations

Some siblings are more alike than others!

library(ibdsim2)

x = nuclearPed(2)

s = ibdsim(x, sims = 1000)

k = realised_kappa(s, id.pair = 3:4)

forrel::showInTriangle(k, ...)

Page 11: Relatedness part 2 - Familias

Variation depends on the genome

Statistical methods in genetic relatedness and pedigree analysis

Human: • 22 autosomes• 3000 cM

Opossum: • 8 autosomes• 800 cM

Shorter genome = more variation!

Page 12: Relatedness part 2 - Familias

Indistinguishable relationships?

𝜅0 = 0.5𝜅1 = 0.5𝜅2 = 0

Page 13: Relatedness part 2 - Familias

Simulated IBD distributions

ϕ = 1/8

ConclusionIn theory these are distinguishable!

In practice this requires accurate estimation of IBD segments.

Page 14: Relatedness part 2 - Familias

Realised IBD coefficients: Half siblings

Statistical methods in genetic relatedness and pedigree analysis

Page 15: Relatedness part 2 - Familias

Zooming in on the κ0-axis

Statistical methods in genetic relatedness and pedigree analysis

Page 16: Relatedness part 2 - Familias

The probability of zero IBD

Third cousins: Expected fraction of the genome with IBD = 1:

𝑘1 =1

64

Theoretically possible to have no IBD sharing!

N'th cousins P(zero IBD)

first 0.0 %

second 0.0 %

third 1.5 %

fourth 28 %

fifth 67 %

Two individuals can have a common ancestor without being genetically related

Statistical methods in genetic relatedness and pedigree analysis

Page 17: Relatedness part 2 - Familias

Relatedness: Summary

• Measuring relatedness with increasing precision:

– the kinship/inbreeding coefficient φ

– the IBD coefficients κ = (κ0, κ1, κ2)

– Jacquard's 9 coefficients Δ

• Each coefficient is

– the probability of observing a certain IBD pattern in a random locus

– the expected proportion of the genome in this state

• IBD is not a pointwise phenomenon: Always in segments

– determined by meiotic crossovers

– consequence: Variation in the realised IBD!

• Family relation genetic relation⇏

Statistical methods in genetic relatedness and pedigree analysis

Page 18: Relatedness part 2 - Familias

So...what does it mean to be related?

• Pedigree based definition: φ > 0

potentially having alleles IBD

• Genomic definition (realised relatedness):

actually having alleles IBD

Statistical methods in genetic relatedness and pedigree analysis