KNIME Enterprise server usage and global deployment at NIBR
![Page 1: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/1.jpg)
Gregory Landrum, Ph.D. NIBR Informatics Novartis Institutes for BioMedical Research, Basel
8th KNIME Users Group Meeting
Berlin, 26 February 2015
KNIME Enterprise server usage and global deployment at NIBR
![Page 2: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/2.jpg)
Novartis Institutes for BioMedical Research (NIBR)
A global network of >6,000 scientists, physicians, and business professionals.
![Page 3: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/3.jpg)
R&D at Novartis: Bringing innovative medicines to patients
The Drug Development Process
Source: http://www.nibr.com/cs/groups/public/@nibr_com/documents/document/n_prod_200363.pdf
![Page 4: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/4.jpg)
R&D at Novartis: Bringing innovative medicines to patients
Source: http://www.nibr.com/cs/groups/public/@nibr_com/documents/document/n_prod_200363.pdf
![Page 5: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/5.jpg)
Timelines and economics
Paul, S. M., Mytelka, D. S., Dunwiddie, C. T., Persinger, C. C., Munos, B. H., Lindborg, S. R., & Schacht, A. L. (2010). How to improve R&D productivity: the pharmaceutical industry's grand challenge. Nature Reviews Drug Discovery, 9(3), 203-214.
![Page 6: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/6.jpg)
NIBR: Making it work
§ Our model of research – connecting the laboratory to the clinic, and pursuing molecular pathways across a landscape of multiple diseases – means that we have to be a highly collaborative organization.
§ Every project is made up of cross-functional teams, drawn from pathways scientists, chemists, disease area specialists, informaticians, clinicians and more.
§ We’re doing scientific research, not making widgets
§ Lots of collaboration, lots of technology, lots of data
![Page 7: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/7.jpg)
Lots of data
![Page 8: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/8.jpg)
Lots of data: Shape of the data generated for a project
Hit finding: 10^6 rows, 1-2 columns
Hit-to-lead: 10^3 rows, 5-10 columns
Lead optimization: 10^2 rows, 10^2 columns
Clinic: 1 row, 10^4 columns
![Page 9: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/9.jpg)
The role of NIBR Informatics (NX)
§ Identifying and driving new opportunities to accelerate science with leading-edge computing and informatics solutions.
§ Traditional IT stuff: service desk, hardware support, network, etc.
§ Designing, building, deploying, and supporting tools/systems for:
• portfolio management; document management; compliance and reporting
• lab informatics; sample management and logistics; electronic lab notebooks
• high-performance computing; large-scale data warehousing and mining; machine learning
• scientific data analysis; visualization; reporting
§ Pushing the frontier: research and exploration
§ Combination of purchased and in-house developed systems, lots of different technologies, lots of integration work
![Page 10: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/10.jpg)
NIBR and KNIME
§ We believe KNIME can be really useful, so we want to make it available to all of our scientists
§ We’re supporting both people who are using KNIME to solve problems in their own labs/groups and people who want to make tools available to others.
§ Need to support exchange of workflows and information across all our sites
§ Need to be integrated into our data and software environment
![Page 11: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/11.jpg)
Infrastructure
§ Internal node development
§ Enterprise servers + cluster integration
§ Standardized desktop releases for Windows, Linux, Mac
§ Nightly builds for users comfortable on the bleeding/leading edge
§ Dev and test servers to support our node development
![Page 12: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/12.jpg)
NIBR’s KNIME servers
![Page 13: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/13.jpg)
KNIME for NIBR: internal distribution
Standardized set of nodes and extensions
Customized preferences
![Page 14: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/14.jpg)
KNIME for NIBR: make it supportable
Allow a reset to the default configuration without requiring a new install.
![Page 15: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/15.jpg)
In-house node development: make it useful
§ Connections to internal data sources and applications
§ Wrappers around in-house developed algorithms
§ Connection to our web service framework for cheminformatics services
![Page 16: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/16.jpg)
Open-source node development
§ Chemistry nodes based on the RDKit
• open-source cheminformatics toolkit
• usable from C++, Python, Java, C#
• NIBR scientists/developers actively participate
• www.rdkit.org
§ Standard cheminformatics tasks + some nice extras
§ Developed both in-house and together with knime.com
![Page 17: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/17.jpg)
Sponsored node development
§ Modifications to naïve Bayes nodes to support fingerprints
§ Fingerprint naïve Bayes supporting unbalanced datasets
§ Database schema browser
§ Improvements to database connector, readers
§ Ensemble tree classifier
§ New Python integration
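To make the fingerprint naïve Bayes items above more concrete, here is a minimal sketch of a naïve Bayes classifier over sparse binary fingerprints. It is an illustration only, not the implementation in the KNIME nodes: the function names `train` and `predict`, the representation of a fingerprint as a set of on-bit indices, and the use of Laplace smoothing with class priors taken from the training data are all assumptions made for this example.

```python
import math
from collections import Counter, defaultdict


def train(fingerprints, labels):
    """Count classes and per-class bit occurrences.

    fingerprints: list of sets of on-bit indices (hypothetical representation)
    labels: list of class labels, one per fingerprint
    """
    class_counts = Counter(labels)
    bit_counts = defaultdict(Counter)  # label -> bit -> rows with that bit set
    for fp, label in zip(fingerprints, labels):
        bit_counts[label].update(fp)
    return class_counts, bit_counts


def predict(model, fp):
    """Return the label with the highest log-posterior for fingerprint fp.

    Only the on-bits of fp contribute, which keeps the sum sparse.
    """
    class_counts, bit_counts = model
    total = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label, n in class_counts.items():
        # The log prior reflects how common the class is in the training set.
        score = math.log(n / total)
        for bit in fp:
            # Laplace smoothing avoids log(0) for bits unseen in this class.
            score += math.log((bit_counts[label][bit] + 1) / (n + 2))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy, made-up data: two "active" and four "inactive" fingerprints.
fps = [{1, 2, 3}, {1, 2, 4}, {5, 6}, {5, 7}, {6, 7}, {5, 6, 7}]
ys = ["active", "active", "inactive", "inactive", "inactive", "inactive"]
model = train(fps, ys)
print(predict(model, {1, 2}))
print(predict(model, {5, 6}))
```

With strongly unbalanced data the prior term can dominate the score, which is one reason a dedicated treatment of unbalanced datasets is useful.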
![Page 18: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/18.jpg)
Integration example 1: Descriptor calculation
![Page 19: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/19.jpg)
Integration example 2: DART
Internal web-based tool used by project teams to do querying and reporting from our data warehouse
![Page 20: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/20.jpg)
Integration example 2: DART
Access to saved queries and views
URL contains full state of query/view
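The idea of a URL that carries the full state of a query or view can be sketched as below. This is not DART's actual URL scheme: the base URL, the parameter names (`project`, `columns`, `view`), and the helper names `state_to_url` and `url_to_state` are hypothetical and chosen only to illustrate the round trip.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def state_to_url(base_url, state):
    """Serialize a query/view state dict into a URL query string.

    Keys are sorted so the same state always yields the same URL;
    list values become repeated parameters (doseq=True).
    """
    query = urlencode(sorted(state.items()), doseq=True)
    return f"{base_url}?{query}"


def url_to_state(url):
    """Recover the state from a URL; every value comes back as a list."""
    return parse_qs(urlsplit(url).query)


# Hypothetical state for a saved query.
state = {"project": "P123", "columns": ["id", "ic50"], "view": "table"}
url = state_to_url("https://dart.example.invalid/query", state)
print(url)
print(url_to_state(url))
```

Because everything needed to reproduce the view is in the URL, another tool only has to be handed that string to fetch the same data.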
![Page 21: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/21.jpg)
Integration example 2: DART + KNIME
![Page 22: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/22.jpg)
Integration example 2: DART + KNIME
Access to saved queries and views
![Page 23: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/23.jpg)
Usage snapshot
Unique users per month
Users by site
Notes: 1) stats only include KNIME client 2) December data incomplete
Overall: 240 unique users, mostly scientists
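Counts like "unique users per month" and "users by site" can be derived from client start-up records roughly as follows. The record layout `(user_id, site, date)`, the function names, and the sample data are assumptions for illustration and do not describe how the statistics on this slide were collected.

```python
from collections import defaultdict
from datetime import date


def unique_users_per_month(events):
    """Map (year, month) to the number of distinct users seen in that month."""
    users = defaultdict(set)
    for user, _site, day in events:
        users[(day.year, day.month)].add(user)
    return {month: len(names) for month, names in sorted(users.items())}


def unique_users_by_site(events):
    """Map site to the number of distinct users seen at that site."""
    users = defaultdict(set)
    for user, site, _day in events:
        users[site].add(user)
    return {site: len(names) for site, names in sorted(users.items())}


# Made-up usage records: (user_id, site, date of a client start).
events = [
    ("u1", "Basel", date(2014, 10, 3)),
    ("u1", "Basel", date(2014, 10, 17)),
    ("u2", "Cambridge", date(2014, 10, 20)),
    ("u2", "Cambridge", date(2014, 11, 2)),
    ("u3", "Basel", date(2014, 11, 5)),
]
print(unique_users_per_month(events))
print(unique_users_by_site(events))
```

Using sets means repeated starts by the same user in one month count only once.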
![Page 24: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/24.jpg)
What are those users doing with KNIME?
§ Querying and reporting from our warehouse
§ virtual chemistry
§ processing usage statistics
§ mining medchem project data
§ processing and analyzing experimental data
§ machine learning
§ triaging high-throughput screening results
§ looking up chemical catalog numbers
§ …
in other words: a bit of everything
![Page 25: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/25.jpg)
KNIME Server usage
§ Primarily used to share workflows
§ Increasingly used as a quick and easy deployment platform for small applications/services built in KNIME
§ This is mainly driven by the scientists themselves
§ Areas for improvement:
• Would be nice if it were easier to sync between servers
• Would be great if the server could do “RESTful” web services
§ Still: enabling scientists to share workflows and make (hopefully) simple applications available to each other is great
![Page 26: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/26.jpg)
Wrapping up
§ KNIME in heavy use to solve many different problems
§ Enterprise server used to exchange workflows globally
§ Web portal provides a way for scientists to deploy tools to each other
§ KNIME is a great platform for us to build upon
![Page 27: KNIME Enterprise server usage and global deployment at NIBR · Nature reviews Drug discovery, 9(3), 203-214. NIBR: Making it work ! Our model of research – connecting the laboratory](https://reader035.fdocuments.in/reader035/viewer/2022070906/5f7aada3e2ed9121d073cd3e/html5/thumbnails/27.jpg)
Acknowledgements
§ NIBR
• Manuel Schwarze (NX)
• Mark Duffield (NX)
• David Nick (NX)
• Marc Litherland (NX)
• John Davies (CPC)
• Richard Lewis (GDC)
• Remy Evard (NX)
§ knime.com