Aleksi Kallio CSC – IT Center for Science [email protected] Chipster and collaboration with other...
-
Upload
britney-lee -
Category
Documents
-
view
213 -
download
0
Transcript of Aleksi Kallio CSC – IT Center for Science [email protected] Chipster and collaboration with other...
Aleksi KallioCSC – IT Center for Science
Chipster and collaboration with other bioinformatics platforms
Chipster introduction
Free, open source software for analyzing high-throughput data such as NGS
Available as a ready-to-run VM with a large collection of analysis tools and reference data• Use directly or via Chipster GUI
Chipster GUI enables users to• Visualize data efficiently• Share analysis sessions• Document what they have done • Save and share automatic workflows
Chipster in a nutshell
Analysis tools for different kinds of data
140 NGS tools for• RNA-seq• miRNA-seq• exome/genome-seq• ChIP-seq• FAIRE/DNase-seq• MeDIP-seq• CNA-seq• Metagenomics (16S rRNA)
140 microarray tools for• gene expression• miRNA expression• protein expression• aCGH• SNP• integration of different data
60 tools for sequence analysis• BLAST, EMBOSS, MAFFT• Phylip
Technical features Client-server user interface, loosely coupled distributed backend
• Can be spread over different clouds, elasticity Data on client or server side
• No duplication of data Workflows – reusing and sharing your analysis pipeline
• You can save your analysis steps as a reusable automatic ”macro”
Web based interface for system administration and tool development• Tool scripts can be R,
Python or Java Integrated user support
functionality• Easy to see what the user
has done
Chipster admin GUI
Chipster compared to Galaxy There is no obvious way to compare two complex systems…
• Windows vs. Linux, vi vs. Emacs, Python vs. Java… Many technical differences, but maybe the core difference is in
how tools, workflows etc. are presented to user• Chipster’s approach is more integrated: focus on usability,
consistent biologist friendly terminology, single complete virtual machine distribution, automated updates for the whole system
• Galaxy’s approach is more modular: focus on tool distribution, tool developer community, workflow driven work, several customised versions available
• YMMV… Typical feedback we hear: in Chipster people like the GUI and in
particular being able to visualise the session, in Galaxy people like the breadth of available tools and integrations
Collaboration opportunities
Tool evaluation and selection Selecting best tools takes effort Finding and testing example datasets takes effort Wiki for shared best practices?
Should include basic justification for selection e.g. benchmarks, references to review articles…
Lightweight alternative to full comparison or review article
Cloud integration Combined efforts to bring different bioinformatics platforms to
major generic and scientific clouds Not only about software infrastructure parts (easy), but tools and
databases (hard) and keeping them up-to-date (really hard) Tools to achieve elasticity not only within platforms, but across
platforms Running several platforms and scaling resources across the platform
according to changing workloads
Chipster in EGI FedCloud Chipster VM available in FedCloud Applications Database Chipster Virtual Organization
Tool platform Why every platform needs to integrate the same tools and
databases, but in a different way? There are many highly sophisticated solutions out there, but
typically with low coverage of the day-to-day tools Virtual machine images and containers (Docker) are practical tools
for software packaging Supporting technologies
Bio-Linux, Debian Med BioImg.org CernVM-FS
Tool platform idea
Factory that generates VMI’s or containers 24/7• Always tested • Latest software versions
High-quality and widely used VMI’s/containers• One good software bundle is better than dozens of
poorly baked ones Vision: you can just assume that software and
databases are there. “Spotify” of bioinformatics software.
For cloud: automatically updated VM with hooks for update events
Could there be NeIC project around this?
16
Finally
Thanks to users and contibutors!
More info
[email protected] http://chipster.csc.fi http://chipster.github.io/chipster/