Post on 12-Feb-2016
description
1
Introduction to Genetics and Genomics
51:123
Terry Braun
2
Outline• Basic Mendelian Genetics
– Mendel’s laws• independent assortment • independent segregation
– mitosis and meiosis– PCR and markers– dominant/recessive and pedigrees
• genotype and phenotype– alleles
• Basic molecular genetics– DNA– RNA– proteins– Central Dogma
• genes and gene structure– cells and chromosomes
Principles of Genetics, Tamarin, Human Molecular Genetics 2, Strachan and Read
3
KeyTerms• marker – a region of the genome that may often be uniquely identified and
distinguished between individuals• minisatellite – a type of marker that varies in length from 14 to 100
nucleotides• microsatelite – a type of marker that is very short (2, 3, 4, 5, 6
nucleotides) -- aka STRP's (short tandem repeat polymorphisms)• polymorphism – a sequence variation• SNP -- single nucleotide polymorphism
• polymerase chain reaction (PCR) – a reaction that mimics DNA duplication in meiosis (aka DNA amplification) (Kary Mullis)
• DNA polymerase – a molecule that is essential for DNA duplication (and PCR)
• primer – a piece of DNA that is essential for starting DNA replication (and PCR)
• genotype – the genetic state of an individual (typically represented by a marker)
4
Genetic Marker– A genetic marker allows for the observation of the
genetic state at a particular genomic location (locus).• A genotype is the measured state of a genetic marker.• A tool for observing inheritance patterns (Mendel's rules and
meiosis)• May never be feasible to sequence cases directly, however the
current cost is decreasing
– An “informative” marker is often “heterogeneous, or “polymorphic” and enables the observation of the inheritance of genetic material.
5
Example -- genotypes1 1 1 1
1 1 1 1
Pedigreemalefemaleparentsoffspring
1 2 3 4
1 4 2 4
These labels (markers) are a measure of the genetic state of each individual.Recall from "Rule of Segregation", offspring get one gene from each parent.Markers are not genes, but they are regions on chromosomes (meiosis).
uninformative heterogeneous
6
What a marker looks like in the Genome
Geneticists assign numerical values to different versions of markers
7
Sources of Markers in the Genome
• duplications• unequal homologous recombination• slippage and errors during DNA
duplication
8
Duplicating DNA – to Use Markers to "Probe" Genomes of Individuals
• mitosis is process that copies DNA in biology• the first step is to "unzip" the 2 strands of the
double helix (DNA)• an enzyme called DNA polymerase makes a
copy by using each strand as a template• two other components
– nucleotides (A, G, T, C) (A-T, G-C, etc)– a short stretch of DNA called a "primer" (to prime the
process)
9
PCR – Polymerase Chain Reaction
• PCR is a process that copies DNA exponentially• mimics the process by organisms, but in vitro (in
a test tube) • relies on the ability of DNA-copying enzymes to
remain stable at high temperatures• Necessary components (in a vial)
– piece of DNA to be copied– large quantities of four nucleotides– large quantities of primer sequence– DNA polymerase (Taq – named for Thermus
aquaticus, a bacterium that lives in hot springs)
10
PCR Reaction• The reaction can be carried out entirely in a vial
simply by changing the temperature– separate the 2 strands (in DNA)
• heat to 75-90 C (165 F) for 30 seconds• this "melts" the DNA apart – the base pairing comes undone
– "anneal" the primers• primers cannot bind to the template strands at such high
temp – cooled to 55 C for 20 seconds– make complete copy of template (and thus new
templates for the next cycle)• Taq polymerase works best at 75 C (hot springs)• nucleotides are added (complement – if template has A, T is
added, etc)
11
PCR Reaction• Three steps
– separation of strands– annealing of primers to template– synthesis of new strands
• Takes approx. 2 minutes• Each reaction is carried out in the same vial, and after every cycle, each
piece of DNA is duplicated (exponential copying)• Cycle can be repeated 30 times (2^30 = 1,073,741,824)• 1 million copies can be made in approximately 3 hours from a single copy of
DNA– this is why very minute samples can be used to identify individuals in crime
scene investigations• Valuable tool to multiply unique regions of DNA so they can be detected in
LARGE genomes• Note, we need to know the flanking sequence to be able to design primers• Also, this flanking sequence needs to be unique otherwise the reaction
could amplify sequence from multiple regions of the genome
12
Exponential Nature of Reaction
13
Sequencing Reaction
14
Automated
15
Components of the Reaction
16
DNA polymerase (Taq) and Synthesis
17
18
Animations
• http://allserv.rug.ac.be/~avierstr/principles/pcrani.html
19
Markers – the early days
• Prior to the HGP, markers were (and still are) valuable tools for observing inheritance patterns
• Investigators consumed considerable time and resources identifying markers
• Some markers were observed in a test group of individuals to asses quality, and heterogeneity.– CEPH (Centr d'Etude du Polymorphisme Humain)
• Affymetrix SNP Chip -- 500,000 SNPs (~$450 -- 2007)
20
Marker GATA50G06/D15S643, Genotypes, and primers –
133101: 215, 197133102: 219, 215
Genomic chr15 :
ttctgctctt ttgtctaaaa tgtcagtcta aatccttact tgtaattgtg 57501064ccctactttg ccgttgctgc ctggctatac cttgtattta ttgctggcct 57501114ATACCTGGAG TCCTTGGTCC ttcttgggaa aaagtattga ggttttaaag 57501164ctcttatcct tggggacaga ttaaaccctt aaactatcta tctgtctgtc 57501214tgtctgtcta tctatctatc tgtctatcta tctatctatc tatctatcta 57501264tctatctatc tatctatcta cctacctaac tacctaccaa aaaaGCATTG 57501314AGGTTTTAAA GCTGTTatcc ttggggacag attaaaccct caaccctcta 57501364tctatctatc tatctatcta tctatctatc tatctatcta tctatctatc 57501414atctgtcacc tattta
http://genome.ucsc.edu/cgi-bin/hgc?hgsid=76756345&o=57501058&t=57501337&g=stsMap&i=GATA50G06&c=chr15&l=57401058&r=57601337&db=hg18&pix=800http://research.marshfieldclinic.org/genetics/genotypingData_Statistics/genotypes_referenceIndividuals.asp
21
Marker GATA50G06/D15S643, Genotypes, and primers –
133101: 215, 197133102: 219, 215
Genomic chr15 :
ttctgctctt ttgtctaaaa tgtcagtcta aatccttact tgtaattgtg 57501064ccctactttg ccgttgctgc ctggctatac cttgtattta ttgctggcct 57501114ATACCTGGAG TCCTTGGTCC ttcttgggaa aaagtattga ggttttaaag 57501164ctcttatcct tggggacaga ttaaaccctt aaactatcta tctgtctgtc 57501214tgtctgtcta tctatctatc tgtctatcta tctatctatc tatctatcta 57501264tctatctatc tatctatcta cctacctaac tacctaccaa aaaaGCATTG 57501314AGGTTTTAAA GCTGTTatcc ttggggacag attaaaccct caaccctcta 57501364tctatctatc tatctatcta tctatctatc tatctatcta tctatctatc 57501414atctgtcacc tattta
22
Genome to Gene Sequence
Markers are typically NOT genes, however they may reside in the genome relatively close to a gene.
23
Basis for Inheritance of Disease: Examples
Aa Aa
A a
A
a
AA Aa
Aa aa
1/2 1/2
1/2
1/2
1/4 1/4
1/4 1/4
P(AA) = 1/4P(Aa) = 1/2P(aa) = 1/4
Aa Aa AA AA Aa
A from mom/dad?a from mom/dad?
Pedigreemalefemaleparentsoffspring
24
Examples234 238 232 238
234, 238 238, 238 238, 232 234, 232234, 238
1 2322 2343 2364 2385 2406 242
2 4 1 4
Note that the lawyers for OJ Simpson argued that "recoded" allele numbers increased the likelihood of contamination and false identification.
If you "genotype" an individual at enough markers, you can calculate the probability of uniquely identifying an individual.
1 2342 2363 2384 2405 242
3
25
Examples
Affected individuals
26
ExamplesDominant model
Geneticists then look for genes that mimic this pattern of inheritance
27
ExampleRecessive model.Very unlikely, because "founders"marrying in also carry the disease,which by definition is a raregenetic disorder.
28
BBS4 Pedigree
29
Monogenic and Polygenic Diseases
– monogenic (Mendelian) -- one gene• “simple” (dominant and recessive) Mendelian inheritance• direct correspondence between one gene mutation and one
disorder• majority of disease genes found are monogenic
– polygenic -- (complex) multiple genes• heterogeneity – disease caused by multiple genes• epistasis – disease caused by multiple interacting genes• obviously finding these is harder -- but why???
30
...Mongenic and Polygenic Diseases
• phenocopy• reduced penetrance
– Example -- sickle cell anemia• “classic” recessive disorder• defect in red blood cells (hemoglobin)• but… infant hemoglobin gene can “leak”• wide range of phenotypes
31
Bardet-Biedl Syndrome (BBS)• Obesity
– Diabetes/ hypertension
• Retinopathy• Hypogenitalism• Polydactyly• Mental Retardation• Renal Anomalies• Heart defectsRare disorder, but common phenotypes
32
Molecular Analysis of BBS• BBS1 - 11q13 Novel*• BBS2 - 16q22 Novel*• BBS3 - 3p13• BBS4 - 15q21 Novel†, TPR Repeats• BBS5 - 2q31• BBS6 - 20p12 Type II Chaperonins• BBS7 - 4q27 Novel*• BBS8 - 14q31 Novel†, TPR Repeats
*,† - Some Similarity
33
34
Some Useful Properties of DNA• fragments of DNA have a minute negative charge
– if you apply an electric field to DNA in a matrix, it will migrate to the positive pole
• DNA is a linear molecule, but it tends to fold up (similar to a knot)– this bound up molecule of DNA will have a unique cross-
sectional area profile that is dependent on its sequence• Gel electrophoresis – DNA is placed in a polyacrylamide
gel and a voltage is applied– polyacrylamide gel and pool analogy– applied charge will cause DNA to migrate dependent on its size,
and its sequence
35
BBS4 Deletion (by PCR)Example of Usage
exons 3 4
36
Molecular Genetics
• Not covered– molecular details of DNA duplication
• continuous replication, discontinuous, Okazaki fragments, etc.
37
Genome – so now we know where it comes from biologically – at least
most of it• mitochondria
– organelle of eukaryotes– number varies per cell – 10 to 10K– human mitochondria is 16,569 nts– mostly coding (no introns???)– duplex strand and circular– inherited maternally only
• consequences– mito thought to be originally free-living bacteria– origins (one or multiple events?)
38
Leber Optic Atrophy
• LHON– mid-life, central vision loss– caused by missense mutations in mtDNA– generally familial
39
• Evolution of the mitochondrial genome and origin of eukaryotic cells
40
END
41
Another Marker?BRCA1-A good predictive marker of drug sensitivity in breast cancer treatment?
* Mullan PB, * Gorski JJ, * Harkin DP.
Centre for Cancer Research and Cell Biology, Queen's University Belfast, Belfast, Northern Ireland, BT9 7AB, United Kingdom.
There are currently only two predictive markers of response to chemotherapy for breast cancer in routine clinical use, namely the Estrogen receptor-alpha and the HER2 receptor. The breast and ovarian cancer susceptibility gene BRCA1 is an important genetic factor in hereditary breast and ovarian cancer and there is increasing evidence of an important role for BRCA1 in the sporadic forms of both cancer types. Our group and numerous others have shown in both preclinical and clinical studies that BRCA1 is an important determinant of chemotherapy responses in breast cancer. In this review we will outline the current understanding of the role of BRCA1 as a determinant of response to DNA damaging and microtubule damaging chemotherapy. We will then discuss how the known functions of this multifaceted protein may provide mechanistic explanations for its role in chemotherapy responses.
42
Hardy-Weinberg Equilibrium• Rule that relates allelic and genotypic frequencies in a population
of diploid, sexually reproducing individuals if that population has random mating, large size, no mutation or migration, and no selection
• Assumptions– allelic frequencies will not change in a population from one
generation to the next– genotypic frequencies are determined in a predictable way by
allelic frequencies– the equilibrium is neutral -- if perturbed, it will reestablish within
one generation of random mating at the new allelic frequency• Ideal case
43
Expected allele frequenciesDeviations from distributionmay indicate special cases.
44
H-W
• f(AA) = p2
• f(Aa) = 2pq• f(aa) = q2
• (p+q)2
• (p2 + q2 + r2 + 2pq + 2pr + 2qr)= (p+q+r)2
45
Use of H-W• All other things being equal, we can "expect" that the distribution of genes in a
subset of a population would be represented by the distribution of genes in the population
• Deviations from this expected distribution is evidence of selection or enrichment• Association – when a specific variation of a gene (allele) is correlated with a
phenotype (or disease, or trait) more frequently than you would expect by H-W – also called Linkage Disequilibrium (since genes are normally in equilibrium)
Often used to evaluate validity of an assay. For example, let us say that I genotype 400 people at a marker with 2 alleles (A and B). I observe the following genotypes:
marker1: AA: 36 AB 168 BB 196marker2: AA 2 AB 37 BB 360marker3: AA 64 AB 144 BB 192
Which maker is suspicious?
46
Will return to Linkage in Later Lectures
47
48