NetBioSIG2012 chaozhang-mosaic

Post on 10-May-2015

2.125 views 0 download

Tags:

description

With the improvement of high-throughput technology, the dramatic increase of large-scale data in both biomolecular concentration and biomolecular interactions has resulted in many biological networks, such as protein interaction networks, gene regulatory networks, and metabolic networks. Although functional analysis is the fundamental step of better understanding biological networks, utilizing vast wealth of data and huge amount of knowledge to annotate and analyze the function of biological networks is still challenging in nowadays bioinformatics. Many software tools are available to visualize and analyze function-derived biological networks, but most of them are isolated with simple functions. One challenge faced by these visualization tools is how to make sense of such networks often represented as massive “hairballs.” Many network analysis algorithms filter or partition networks based on topological features, or mathematically model networks rely on their statistical properties, sidestepping the issue of making sense of the network itself altogether. On other hand, traditional functional enrichment analysis methods regard a network as a list of genes, and annotate networks with gene set enrichment methods. However, it does not consider the topological dynamics of network which might lead to the different functions under different conditions. Therefore, it is necessary to consider molecular interactions to correctly and specifically annotate biological networks. As one of the most successful open source frameworks in bioinformatics, Cytoscape is a powerful network visualization platform that actively supports independent plugin development. By integrating model-view-controller design pattern and Cytoscape techniques, it makes possible an integrated ontology-annotated biological network visualization and analysis platform. In the first stage of the project, we successfully developed two interactive plugins -- Mosaic (http://nrnb.org/tools/mosaic) and NOA (http://nrnb.org/tools/noa) -- to address both visualization and analysis respectively. Mosaic supports interactive network annotation and visualization that includes partitioning, layout and coloring based on biologically-relevant ontologies. It shows slices of a given network in the visual language of biological pathways, which are familiar to any biologist and are ideal frameworks for integrating knowledge, and also provides researchers with an interactive tool to evaluate biological interactions within the context of well-defined processes, functions and cellular localization while retaining all original network information. NOA first introduced link ontology that assigns functions to interactions based on the known annotations of joint genes via optimizing two novel indexes ‘Coverage’ and ‘Diversity’. Then, NOA generates two alternative reference sets to statistically rank the enriched functional terms for a given biological network. It has been proved to be more efficient not only in...

Transcript of NetBioSIG2012 chaozhang-mosaic

1

Network-Ontology Visualization and Analysis

Chao ZhangComputer Science Department

2

GO annotation

Background

Up to October 26, 2010, there have been more than 2,753,338 annotations covering 48 species in GO database

annotation

3

Enrichment analysis

Background

4

Importance and common drawbacks

• Function enrichment analysis– DAVID (2003) Genome biol 3,364

(3,293)– DAVID (2008) Nature protcols 1,908

(1,802)– Bingo (2005) Bioinformatics 657 (609)

• Limitations– Lack of good visualization tool– All for gene set, not network

Background

5

Workflow

Background

Visualization+

Analysis

•Including as more as possible functions of other software

•Relatively independent modules with interactions

•Very flexible, 4 individual plugins or 1 bundle

6

Visualization (Mosaic)

• What’s the difference?

Mosaic

7

Visualization (Mosaic)

• Mosaic shows slices of a given network in the visual language of biological pathways, which are familiar to any biologist and are ideal frameworks for integrating knowledge

Mosaic

Biological Process

Molecular functionCellular

components

8

Visualization (Mosaic)

• Partitioning with BP• Layout with CC• Coloring with MF• Automatic annotation• ID mapping• Overview network

.

Mosaic

9

1st Partitioning/decomposition

• Decompose a huge biological network to sub-networks (Biological process)

Mosaic

10

2nd Layout/Pathway

• Currently only support PathVisio and WikiPathway formats (Cellular components)

Mosaic

11

3rd Coloring/Highlighting

• Molecular function

Other projects>AVALON

12

3rd Coloring/Highlighting

• Molecular function

Other projects>AVALON

13

Overview network

Mosaic

14

Analysis (NOA)

NOA

Molecular systems biology, 2009

Normal DiseaseDisease

The same gene list

Different phenotype

15

Edge-based algorithm

NOA: a novel Network Ontology Analysis method

Nucleic Acids Research, 2009

NOA

Blast2GO, GoAnnotator, Goanna, etc

FatiGO, DAVID, g:profiler,BiNGO, etc

NOA

Gene annotation

Gene list annotation

Network annotation

N e t w o r k

Network

Single genes

Single genes

G ene set

Gene set

16

Analysis (NOA)

• Edge-base algorithm• Batch mode• Heatmap• Overview network• Interaction with Mosaic

NOA

17

Node/edge-based algorithms

Other projects>AVALON

18

Node/edge-based algorithms

Other projects>AVALON

19

Batch mode

Other projects>AVALON

20

Batch mode

Other projects>AVALON

21

Acknowledgement

Project advisor:Alexander R. Pico (UCSF)Dong Xu (Uni. Of Missouri)

Mosaic:Kristina Hanspers (UCSF)Allan Kuchinsky (Agilent)Nathan Salomonis (UCSF)

NOA:Jiguang Wang (Columbia Uni.)Luonan Chen (CAS)Xiangsun Zhang (CAS)

Google’ Summer of Code Program

Acknowledgement

22

Question?

Mosaic:http://nrnb.org/tools/mosaic/

NOA:http://nrnb.org/tools/noa/

Chao Zhang:chaozhang@mizzou.edu

Acknowledgement