PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History...

26
TreeGenes A Comprehensive Resource for Forest Tree Genomics Emily Grau Department of Plant Sciences University of California, Davis dendrome.ucdavis.edu

Transcript of PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History...

Page 1: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

TreeGenes

A Comprehensive Resource for Forest Tree Genomics

Emily Grau Department of Plant Sciences University of California, Davis

dendrome.ucdavis.edu

Page 2: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

dendrome.ucdavis.edu

TreeGenes Database: History

–!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic maps and associated markers

Page 3: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

dendrome.ucdavis.edu

TreeGenes Database: History

–!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic maps and associated markers

–!Expanded to other data types •! Sequence

–! Reseqeuncing, Large-Scale Genotyping, Transcriptomics/Expression

–! Full Genome Sequences

•! Analysis and Visualization Tools –! Ability for users to mine the data

•! Resources for the user community –! Literature, Colleagues

Page 4: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

!"###$

%"###$

&"###$

''"###$

()*+',$ -./+',$ 0.1+',$ 234+'5$ ()*+'5$

TreeGenes Database: Users

Unique Web Visitors to TreeGenes Database per month, June 2013-June 2014

6,000

dendrome.ucdavis.edu

9,000

2,060 users from 849 organizations in 94 countries

Page 5: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

•! 1,290 species from 101 genera –!At least one genetic artifact from each species –!Conifers but is also inclusive of all forest trees

•! Full genome sequence: 13 species •! Transcriptome/Expression resources:

3,920,817 sequences from 263 species •! 106 genetic maps from 35 species

dendrome.ucdavis.edu

TreeGenes Database: Species

Page 6: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

dendrome.ucdavis.edu

TreeGenes Database: Data Sources

Automated User submissions

Page 7: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

Automated –!NCBI (primary repositories)

•! Protein, EST, cDNA, TSA, Unigene databases •! Introduced to TreeGenes with added value •! Information should be sent to primary dbs first

–!Literature •! Web of Science, PubMed

dendrome.ucdavis.edu

TreeGenes Database: Data Sources

Page 8: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

User submissions –  Internal projects or collaborations (day one) – Submissions of data post-analysis at publication

time

dendrome.ucdavis.edu

TreeGenes Database: Data Sources

Page 9: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

User submissions (Software with full front-end and back-end support) Laboratory Information Management System Sequence, Genotype, Phenotype, Environmental Information

dendrome.ucdavis.edu

TreeGenes Database: Data Sources

Track barcoded samples from collection through sequencing

Upload phenotype /environmental data

Data can be integrated into TreeGenes in real time or at project end

Page 10: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

User submissions: external Most submissions from TGG

dendrome.ucdavis.edu

TreeGenes Database: Data Sources

Submit genetic maps or population study data

Obtain TGDR accession number!

Page 11: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

Interfaces – Existing viewers – Custom development

dendrome.ucdavis.edu

TreeGenes Database: Data Access

Page 12: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

Comparative mapping

dendrome.ucdavis.edu

TreeGenes Database: Interfaces

Page 13: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

Genome browsing & annotation

dendrome.ucdavis.edu

TreeGenes Database: Interfaces

Page 14: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

–!Bulk retrieval of resequencing data, genotypes, and phenotypes

–!Describe search options?

dendrome.ucdavis.edu

TreeGenes Database: Interfaces

– Describe search options?

Page 15: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

Download results

dendrome.ucdavis.edu

TreeGenes Database: Interfaces

Page 16: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

Download results or pipe to CartograTree via SSWAP (Simple Semantic Web Architecture Protocol)

dendrome.ucdavis.edu

TreeGenes Database: Interfaces

Page 17: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

dendrome.ucdavis.edu

TreeGenes Database: Interfaces

–! Providing context to geo-referenced data –!Originated from Tree Biology Working Group through

iPlant

Page 18: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

dendrome.ucdavis.edu

TreeGenes Database: Interfaces

–!Data from TreeGenes, WorldClim, Ameriflux, TRY-db –!Google fusion tables & Google maps

Page 19: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

dendrome.ucdavis.edu

TreeGenes Database: Interfaces

–!Retrieve genotype, phenotype, environmental, and sequence data

–!Further analysis (TASSEL, MUSCLE) via SSWAP

Retrieve genotype, phenotype, environmental, and

Page 20: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

Genome Sequence Annotation Server –!Can handle large, complex genomes

dendrome.ucdavis.edu

TreeGenes Database:

Current Development

Page 21: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

–!Save work, upload modifications for approval

P1153

dendrome.ucdavis.edu

TreeGenes Database:

Current Development

Page 22: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

Tripal Galaxy dendrome.ucdavis.edu

TreeGenes Database:

Future Development

Page 23: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

Tripal Galaxy –!Tripal

•! Frontend & backend open source database solution

•! CHADO: database schema from GMOD •! Drupal: open source web development

platform •! TreeGenes will transition into using Tripal

to ease data transfer

dendrome.ucdavis.edu

TreeGenes Database:

Future Development

Page 24: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

Tripal Galaxy –!Galaxy

•! Workflow & data analysis platform •! Build multi-step analysis pipeline •! Tripal Galaxy will develop modules for

analysis with Galaxy

dendrome.ucdavis.edu

TreeGenes Database:

Future Development

Page 25: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

Tripal Galaxy –!Work with other databases –! Improve data integration, data transfer –!Pull datasets easily from other datasets

& sources on the web for analysis

dendrome.ucdavis.edu

TreeGenes Database:

Future Development

Page 26: PineRefSeq - TreeGenes · 2020. 5. 8. · dendrome.ucdavis.edu TreeGenes Database: History –!Began as the Dendrome project (USDA funded initiative) in 1993 to hold forest tree genetic

dendrome.ucdavis.edu

TreeGenes Database: Team

Project Leads David Neale Jill Wegrzyn

University of Connecticut

Development Team Jacob Zieve Hans Vasquez-Gross Andrew Brown

Advising Damian Gessler

Semantic Options/University of Arizona

Lead Database Administrator Emily Grau

[email protected]

@TreeGenes TreeGenes Database