Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly...

43
Next Generation Phenotyping and Breeding Univ of MN Plant Breeding Symposium 15 March 2013 Patrick S. Schnable Iowa State University China Agriculture University Data2Bio, LLC 1

Transcript of Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly...

Page 1: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Next Generation Phenotyping and Breeding

Univ of MN

Plant Breeding

Symposium

15 March 2013

Patrick S. Schnable

Iowa State University

China Agriculture University

Data2Bio, LLC 1

Page 2: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

2

Page 3: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Outline

• Next Generation Sequencing

• Next Generation Genotyping

• Next Generation Phenotyping

3

Page 4: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Plant breeders, agronomists and farmers have been successful

Page 5: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Corn Grain Yields

0

1000

2000

3000

4000

5000

6000

7000

8000

9000

10000

11000

1865 1875 1885 1895 1905 1915 1925 1935 1945 1955 1965 1975 1985 1995 2005

Year

Av

erag

e C

orn

Yie

lds

(kg

/h

a)

0

20

40

60

80

100

120

140

160

Troyer (2006) Crop Sci. 46:528-543

Sustained Yield Increases

Page 6: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Private-Sector R&D Investment in Plant

Breeding is Increasing (inflation-adjusted

constant dollars)

USDA Report: The Seed Industry in U.S. Agriculture

Currently (2011) the big three (Monsanto, DuPont/Pioneer, & Syngenta) have a combined seed-based R&D budget of an estimated ~$2B

Page 7: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Challenges Require Novel Approaches for Crop Improvement

• Agricultural inputs:

– Increasing costs (e.g., nitrogen)

– Reduced availability (e.g., water)

– Undesirable ecological impacts

• Environmental stress

– Droughts/floods

– Temperature extremes

– New pests & diseases

Page 8: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

How to Translate Genomic Data

into Crop/Livestock

Improvement?

The Maize Genome

Sequencing Project,

Rick Wilson, PI

8

Maize contains ~50,000 genes

Schnable, Ware

et al. (2009)

Science

Page 9: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

• Quantitative Trait Loci (QTL) Mapping & Genome-wide association studies (GWAS) link SNPs (markers) and genes to qualitative (single gene) and quantitative (multi-genic) traits

• Genomic Selection

Genomic Approaches

to Increase Genetic Gain

Page 10: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Outline

• Next Generation Sequencing

• Next Generation Genotyping

• Next Generation Phenotyping

10

Page 11: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

11

Page 12: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Next Generation Sequencing (NGS)

Technologies are Game Changing

• In 1980s PCR-based technologies revolutionized how

we detect and quantify nucleic acids

• Currently, NGS technologies are changing how we:

– Examine gene expression (RNA-Seq)

– Identify polymorphisms and conduct genotyping

(GBS)

– Map and clone genes (BSR-Seq, GWAS)

– etc.

12

Page 13: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Requirements for QTL & GWAS

• Genetic Diversity

• Genotyping Data

• Phenotypic (Trait) Data

Page 14: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Outline

• Next Generation Sequencing

• Next Generation Genotyping

• Next Generation Phenotyping

14

Page 15: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

SNP Discovery (Includes base substitutions, Insertions, and deletions)

Raw Data

原始数据

1

Acquire data from internal data sets

and public databases

Reads Trimming 序列修剪

2

Performed using in-house short

reads trimming or using lucy (--error

0.01 0.01 parameter)

Repeat Filtering 循环过滤

3

Run in-house repeatmasking

script and discard reads w/ >40%

repeat content for 454 reads only

AGPv2 Alignment 比对

4

Align to AGPv2 and Maize

mitochondria (AY506529.1) and

chloroplast (X86563.2) genomes

Remove Read Stacking 移除冗杂

5

Remove potential PCR artifacts from

stacked reads having the same

coordinates

SNP Calling

SNP检测

6

Call SNPs using in-house SNP calling

pipeline

SNP DISCOVERY WORKFLOW

15

Page 16: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Bulked Segregant Analysis (BSA)

16

Objective:

Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Rationale:

Linkage between genetic markers and the target gene is detected via comparative

allelic quantification of genetic markers in pools of mutant and wild-type siblings.

Mutants Non-mutants

Page 17: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Non-mutant pool Mutant pool

Green chr

Red chr

. . . . . .

Recessive mutant

Red:Green

The ratio of Red:Green is proportional to the genetic

distance between the mutant gene and the genetic marker.

Genetic

Marker

17

1 : 3 F2

BSA

Page 18: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

BSR-Seq: RNA-seq based

BSA

• RNA-seq:

• Mutants vs. Non-mutants

• ~30 individuals

• Obtain SNP markers based on RNA-seq reads

• Map mutant gene location based on allele counts

18 Liu et al., 2012 PLoS ONE

Page 19: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

PP = Posterior Probability ≈ Probability of linkage

Location identified via BSR-Seq matches known

physical location of cloned gl3 gene

19

Page 20: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Advantages of BSR-Seq

20

• Maps mutant gene to a small chromosomal

interval

• Identifies SNPs in this chromosomal interval,

which can be used for fine-scale mapping

• Defines genes in this chromosomal interval

that are differentially expressed, and which are

therefore candidate genes

• Yields genome-wide transcription profiles of

mutants vs. non-mutants -> clues to mutant

function

• Works for any species with a sequenced (or not

yet sequenced) genome

Page 21: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Data2Bio, LLC

•Founded in 2010, Data2Bio helps scientists design, conduct and analyze next generation sequencing projects

•Core strengths are bioinformatics and genomics

•Offices in the US and China; Academic and private-sector customers in North American, Europe and Asia

•Proprietary bioinformatic pipelines and genomic technologies associated with DNA barcoding and genotyping-by-sequencing (GBS)

21

Page 22: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Conventional GBS

22

Line 1

Line 2

~1M sites, i.e., more than is required for most applications SNP calls based on few reads (error rate) Can not call heterozygotes Limited overlap of sites among lines -> missing data across lines Requires imputation (heavy-duty bioinformatics)

Page 23: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

tGBS: Value of Read Selection

23

tGBS more stringently controls fraction of genome that is sequenced and genotyped (“read selection”). Hence, given the same number of reads/line, reads are clustered at fewer sites: • Less missing data across lines • 1,000-60,000 SNPs (“tunable”) -> adequate for most

applications • High confidence SNP calls • Can confidently genotype heterozygous loci

Line 1

Line 2

Page 24: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Outline

• Next Generation Sequencing

• Next Generation Genotyping

• Next Generation Phenotyping

24

Page 25: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Contributions to Phenotype (traits)

• Phenotype (P) = Genotype (G) +

Environment (E) + GxE

• E and GxE complicate predictions of

hybrid performance (P) from genotype (G)

• Two strategies for dealing with “E” and

“GxE”

25

Page 26: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

1st Strategy: Controlled

Environments

• Conduct phenotyping under controlled

environments (“E”), e.g., growth chambers

or greenhouses

• This can remove E (and GxE) from the

equation, allowing accurate prediction of

the performance of a given hybrid in a

given growth chamber or greenhouse

26

Page 27: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

2nd Strategy: Field Conditions

• Conduct phenotyping under relevant field

conditions, but this requires growing genotypes

in many characterized environments (and

appropriate experimental design and statistical

analyses)

• But if we want to understand and model the

influences E and GxE on hybrid performance (P)

we need to conduct experiments under

conditions that match production conditions as

closely as possible

27

Page 28: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Predictive Models for Hybrid

Performance

• Once we understand the relationships and interactions among genotype, environment and phenotype (P= G + E + GxE) we will be able to:

– More efficiently develop hybrids that exhibit yield stability across environments and/or that are engineered for particularly challenging environments

– Much more accurately predict the performance of a given hybrid in a given field in a given year

28

Page 29: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

What is required to define the effects of

G, E and GxE on phenotype?

• Measure traits from many genotypes, in

many diverse (and commercially relevant)

environments

• Quantity of data to be collected requires

new phenotyping technologies and

strategies

29

Page 30: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Number of locations matters (a lot):

Page 31: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Vision #1 for High-Throughput,

Field-based Phenotyping

– Conduct a coordinated phenotyping project

involving MANY regional stations across the

U.S. (e.g., at Land Grant Universities) to study

how corn performs under diverse

environmental conditions

– Possibility for public-private partnership

– Develop novel high-throughput field-based

phenotyping technologies

31

Page 32: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Lie Tang

Robotics Specialist

Ag & Biosystems

Engineering

Maria Salas

Fernandez Sorghum Breeder

Agronomy

Patrick Schnable

Maize Genomics

Agronomy

Grant #: 2012-67009-19713

Page 33: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Mobile Carrier with RTK-GPS

John Deere Sub-Compact Utility Tractor Equipped

with Topcon Universal Auto-Steer System

Multiple Light 3D Time-of-

Flight Cameras

PMD Nano

camera

Bright-field

image

Amplitude image Distance image

ASC TigerEye 3D

Flash LIDAR

Topcon IP-S2

Mobile 3D

Mapping Unit 33

Design of the automated phenotyping robot

for biomass yield-related architecture traits

Camera View Angle 2

Row 3

60-inch row spacing

Row 2Row 1 Row 4

GPS

Camera View Angle 1

GPS

Top-View

Back-View

4 m

Lead Screw Drive

3D TOF Cameras

Page 34: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Diversity Panel Yu Panel 387 photoperiod insensitive lines •Converted tropical sorghums •Elite materials •Historically important elite lines

300 photoperiod sensitive lines •A subset lines selected based on biomass potential.

Identify sorghum biomass yield traits via

genome-wide association studies (GWAS)

34

Page 35: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

• Yelp: cell phone app that provides restaurant reviews

• Tens of thousands of individual users create value for all users by contributing their individual data (i.e., restaurant reviews) to a shared database

• Tremendously successful (71M unique visitors/month); advertising-driven business model

35

Vision #2 for HTP, Field-based

Phenotyping is based on validated

social networking/local review

paradigms

Page 36: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

• Farmer Participatory Phenotyping (FP2)

• Many farmers already have GPS and yield

monitors on their combines

• Collect consistent data at high spatial

resolution from commercial production

fields; upload to ISU servers

36

Vision #2 for High-Throughput,

Field-based Phenotyping

Page 37: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Farmer Participatory

Phenotyping • Combine yield data with soil maps, weather data etc.

• With thousands of potential participants it will be possible to collect unprecedented amounts of data

• Information technology resources will be vital to store and interpret the vast amounts of collected data

• Develop statistical models that accurately predict phenotypes (hybrid performance) under specific environmental conditions

• Determine which hybrids are suited for particular environments (soil types, moisture levels, fertilization rates, and other agronomic practices)

37

Page 38: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Resulting Predictive Models will:

• Be relevant to production fields

• Enhance our ability to efficiently produce

superior hybrids

• Improve ability of farmers to select

appropriate hybrids for local conditions

• Improve ability to predict yields prior to

planting and during growing season,

reducing volatility of commodity prices

38

Page 39: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Longer-term Vision for FP2

• Develop novel sensors that can detect

more subtle phenotypes that could be

placed on tillage equipment or center pivot

irrigation booms to collect stress

phenotypes (water/nutrient/disease/insect)

early in season

39

Page 40: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

• Because farmer-participants in FP2 will be able to access phenotypic data for each commercially available corn variety from thousands of fields with known soil types and that are being managed with different (but defined) agronomic practices, the process by which farmers select varieties for planting and the agronomic practices they employ will be driven by a rich set of unbiased data, thereby maximizing farmer profits

40

Value of FP2 to Farmers

Page 41: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Bottom Line

• Advances in Next Generation Sequencing technologies have allowed for the development of Next Generation Genotyping technologies (e.g., multiple flavors of GBS)

• In turn, Next Generation Genotyping technologies offer the potential to greatly increase the efficiency of plant breeding and the development of hybrids with enhanced stress resistant and/or yield stability

• But to realize the benefits of Next Generation Genotyping we now need to develop “Next Generation Phenotyping” strategies: – Controlled conditions (e.g., greenhouses)

– Field based • Coordinated regional phenotyping stations (N=hundreds)

• Farmer Participatory Phenotyping, FP2 (N=thousands)

– New data collection technologies/sensors

41

Page 42: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

Collaborators

Jinsheng Lai

Nathan Springer

Jeffrey Jeddeloh

Mike Scanlon (PI, Cornell)

Jianming Yu, M. Timmermans,

G. Muehlbauer,

D. Jannick-Buckner

Brad Barbazuk

The Maize Genome

Sequencing Project

Srinivas Aluru

Hui-Hsien Chou

Carolyn Lawrence

Dan Nettleton

42

Page 43: Next Generation Phenotyping and Breedingschnablelab.plantgenomics.iastate.edu/docs/...Rapidly genetically map mutant genes or QTLs MICHELMORE, RW et al., 1991, PNAS 88: 9828-9832.

43

PSS has substantial IP and equity interests in Data2Bio LLC