Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load...

66
Association analysis and more with TASSEL M. Cinta Romay 05/10/2012

Transcript of Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load...

Page 1: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

Association analysis and more with TASSEL

M. Cinta Romay

05/10/2012

Page 2: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

TASSEL: 1.What is Tassel? 2.Where do I get Tassel? 3.How do I use Tassel?

a. Graphical User Interface (GUI) b. Standalone version c. Source code

Page 3: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

1. What is Tassel?

• Java application that runs in your computer (any operating system)

• Trait Analysis by aSSociation, Evolution and Linkage: provides tools to investigate the relationships between genotypes and phenotypes a. Association study b. Analysis of Linkage Disequilibrium c. Principal Component Analysis d. Cluster Analysis e. Overview of genotypic data f. …

• Design to analyze large datasets

Page 4: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

2. Where do I get Tassel?

www.maizegenetics.net/tassel

Page 5: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

2. Where do I get Tassel?

www.maizegenetics.net/tassel

Page 6: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

2. Where do I get Tassel?

www.maizegenetics.net/tassel

Page 7: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

2. Where do I get Tassel?

www.maizegenetics.net/tassel

Page 8: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

2. Where do I get Tassel?

www.maizegenetics.net/tassel

Page 9: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

2. Where do I get Tassel?

www.maizegenetics.net/tassel

Page 10: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

2. Where do I get Tassel?

www.maizegenetics.net/tassel

Page 11: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

2. Where do I get Tassel?

www.maizegenetics.net/tassel

Page 12: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

2. Where do I get Tassel?

www.maizegenetics.net/tassel

Page 13: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

2. Where do I get Tassel?

www.maizegenetics.net/tassel

Page 14: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

2. Where do I get Tassel?

www.maizegenetics.net/tassel

Page 15: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

www.maizegenetics.net/tassel

Page 16: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

www.maizegenetics.net/tassel

Page 17: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

www.maizegenetics.net/tassel

Some incompatibilities might not allow your java virtual machine to start

Do not panic!!

Page 18: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

www.maizegenetics.net/tassel

Page 19: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

www.maizegenetics.net/tassel

Page 20: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

DATA

Page 21: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Datasets and reports- Stored in RAM

Task progress

Comments about data and reports

Page 22: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Functions are organized in 3 modes

Page 23: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 24: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 25: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 26: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 27: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 28: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 29: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 30: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 31: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 32: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

50%

1407

Page 33: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

5% MAF

0.05

Page 34: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 35: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 36: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 37: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 38: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 39: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 40: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 41: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 42: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 43: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 44: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 45: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 46: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 47: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 48: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Page 49: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Zoom

Position

Save plot

Selection upper and lower triangle

Page 50: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI) TASSEL v 4.0

Page 51: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI) TASSEL v 4.0

Page 52: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI) TASSEL v 4.0

Page 53: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI) TASSEL v 4.0

Page 54: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI) TASSEL v 4.0

Page 55: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI) TASSEL v 4.0

Page 56: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI) TASSEL v 4.0

Page 57: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI) TASSEL v 4.0

Page 58: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

a. Graphical User Interface (GUI)

Steps to perform Association analysis:

1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis 4. Merge G+P+Q 5. Run GLM (simple trait) 6. Load Kinship matrix/ calculate kinship matrix 7. Select G+P+Q and K and run MLM 8. Visualize the results with QQ plot and Manhatan plot

Page 59: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

b. Standalone version

• Consists on modules (i.e. Plugins) • Output from one module can be input to another module.

Determined by order specified.

* Terry Casstevens ([email protected])

Input.txt HapMap file

Load file LD analysis Output.png LD plot

run_pipeline.bat –fork1 –h input.txt –ld –ldd png –o output.png –runfork1

GBS analysis pipeline is an extension to Tassel, and, as such, GBS commands are run as TASSEL plugins via the command line

Page 60: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

b. Standalone version

• Install: • Unzip the Tassel standalone build onto your system

• Change into the root directory: • tasselx.0.standalone

• Execute: • Windows: use run_pipeline.bat • UNIX: run_pipeline.pl

* Terry Casstevens ([email protected])

Page 61: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

b. Standalone version

• Modify memory from bat file:

Similarly for the perl file: edit and find the –Xmx comand!

* Terry Casstevens ([email protected])

Page 62: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

b. Standalone version

* Terry Casstevens ([email protected])

• Pipelines consist of pipeline segments. Each segment has a name (1, A…) that follows: • -fork: start of a pipeline segment that should be executed

sequentially • -combine: starts a pipeline segment that combines datasets

from multiple pipeline segments • Output from a pipeline segment can be used as input to another

by referencing with -input

run_pipeline.bat –fork1 …–fork2… -fork3… –combineA –input1 –input2 …

Page 63: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

b. Standalone version

* Terry Casstevens ([email protected])

• Pipeline segments can initiate execution in two ways: • -runforkX • By receiving input from another segment

–run_pipeline.bat -fork1 -h "path/File1.txt" -fork2 -includeTaxaInFile "path/TaxaFile1.txt" –input1 –export "path/OutputFile1.txt" -exportType Hapmap -fork3 –includeTaxaInFile "path/TaxaFile2.txt" –input1 –export "path/OutputFile2.txt" -exportType Hapmap –runfork1

File1.txt HapMap file

Load file

OutputFile1 HapMap file

Subset Taxa2

Subset Taxa1

OutputFile2 HapMap file

TaxaFile1

TaxaFile2

Page 64: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

b. Standalone version

List of all available pipeline controls

www.maizegenetics.net/tassel

Page 65: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

c. Source code

• Allows manipulation of the code to adapt it to your own projects (requires some java/coding knowledge)

• Sourcecode is publicly available: • https://tassel.svn.sourceforge.net/svnroot/tassel

• Instructions for setting up Tassel in NetBeans can be found at:

• http://www.maizegenetics.net/tassel-in-netbeans

Page 66: Association analysis and more with TASSEL · Steps to perform Association analysis: 1. Load genotypic data 2. Load phenotypic data 3. Load population structure data or run a PC analysis

3. How do I use tassel?

c. Source code