The future of scientific information & communication

125
The future of scientific information & communication Antony Williams SUNY Potsdam, April 12 th 2013

description

Our access to scientific information has changed in ways that were hardly imagined even by the early pioneers of the internet. The immense quantities of data and the array of tools available to search and analyze online content continues to expand while the pace of change does not appear to be slowing. While scientists now have access to the enormous capacities and capability of the internet the vast majority of scientific communication continues to be through peer-reviewed scientific journals. The measure of a scientist’s contribution is primarily represented by their publication profile and the citations to their published works and offers an incomplete view of their activities. However, we are at the beginning of a new revolution where the ability to communicate offers the opportunity to embrace new forms of publishing and where scientific participation and influence will be measured in new ways. This presentation will provide an overview of our new generation of “openness” in which open source, open standards, open access and open data are proliferating. The future of scientific information and communication will be underpinned by these efforts, influenced by increasing participation from the scientific community and facilitated collaboration and ultimately accelerate scientific progress.

Transcript of The future of scientific information & communication

Page 1: The future of scientific information & communication

The future of scientific information & communication

Antony Williams

SUNY Potsdam, April 12th 2013

Page 2: The future of scientific information & communication

How does the internet influence you?• How many of you visit the internet/check your

email less than a dozen times per day?• Where do you go for fact-checking?• How many on Facebook? How many on Twitter?• You know you have an online profile right?• Scientists…how many of you are working on

building a scientific profile online?• How many of you online now???

Page 3: The future of scientific information & communication

Me….and my vanity!

Page 4: The future of scientific information & communication

Searching Antony Williams

Page 5: The future of scientific information & communication

Searching ChemConnector…

Page 6: The future of scientific information & communication

http://re.vu/AntonyWilliams

Page 7: The future of scientific information & communication

Wikipediahttp://en.wikipedia.org/wiki/Antony_John_Williams

Page 8: The future of scientific information & communication

LinkedInhttp://www.linkedin.com/in/AntonyWilliams

Page 9: The future of scientific information & communication

Academia.edu

Page 10: The future of scientific information & communication

And Mendeleyhttp://www.mendeley.com/profiles/antony-williams/

Page 11: The future of scientific information & communication

And My Co-author Graph

Page 12: The future of scientific information & communication

And Videos

–YouTube–SciVee–Vimeo–Slideshare

Page 13: The future of scientific information & communication

I am Quantified…

Page 14: The future of scientific information & communication

ResearchGate

Page 15: The future of scientific information & communication

Google Scholar Citations

Page 16: The future of scientific information & communication

LinkedIn

Page 17: The future of scientific information & communication

AltMetrics

Page 18: The future of scientific information & communication

Usage, Citations, Social Media…

Page 19: The future of scientific information & communication

Scientists are “Quantified”• Stats are gathered and analyzed • Employers can find them, tenure will depend

on them, funding are affected by them• Scientists Impact Factors, H-index and many

other variants• Science is both competitive and collaborative

Page 20: The future of scientific information & communication

If it was not just about me…

• Together we might:– build an encyclopedia– …and rate restaurants– …share book reviews – …and movie reviews– …and reviews of service providers– …organize sit-ins and social action– …and more data might just be Open

Page 21: The future of scientific information & communication

If it was not just about me…• Together we might:

– build an encyclopedia– …and rate restaurants– …provide book reviews to each other– …or movie reviews– …or reviews of service providers– …organize sit-ins and social action– …and more data might just be Open– …more scientists might collaborate and share

Page 22: The future of scientific information & communication

It is so difficult to navigate…

What’s the structure?What’s the structure?

Are they in our file?

Are they in our file?

What’s similar?What’s similar?

What’s the target?

What’s the target?Pharmacology

data?Pharmacology

data?

Known Pathways?

Known Pathways?

Working On Now?

Working On Now?Connections to

disease?Connections to

disease?

Expressed in right cell type?

Expressed in right cell type?

Competitors?Competitors?

IP?IP?

Page 23: The future of scientific information & communication

Let’s Change the World

• Let’s map together all historical chemistry data and build systems to integrate new data

• Heck, let’s integrate chemistry and biology data and add in disease data too

• Lets model the data and see if we can extract new relationships – quantitative and qualitative

• Let’s make it all available on the web

Page 24: The future of scientific information & communication

That’s a BIG Request

Page 25: The future of scientific information & communication

What About Something Smaller?

• We’re going to map the world• We’re going to take photos of as many places

as we can and link them together• We’ll let people annotate and curate the map• Then let’s make it available free on the web• We’ll make it available for decision making • Put it on Mobile Devices, Give it Away

Page 26: The future of scientific information & communication

Where am I from?

Page 27: The future of scientific information & communication

Wikipedia

Page 28: The future of scientific information & communication

Wikipedia

Page 29: The future of scientific information & communication
Page 30: The future of scientific information & communication
Page 31: The future of scientific information & communication
Page 32: The future of scientific information & communication

I care…I want to contribute…

Page 33: The future of scientific information & communication

The Power of Contribution

Page 34: The future of scientific information & communication

How do you spell Afonwen?

Page 35: The future of scientific information & communication

Whoa…

• So the world can be mapped…• We can enter a 3D environment within the map• We can add annotations• We can use the data, we can reference it, we

can extract it, we can make decisions with it• And we can do it on our lap, in our hands• Let’s crowdsource chemistry and biology!!!

Page 36: The future of scientific information & communication

Science is being Crowdsourced

• Crowdsourcing science is happening…– Contribution of data

• Our data, About us• Our data, generated in labs• Open Data, data validation and curation

– Contribution of software• Open Source, Open Standards

– Contribution of funding

Page 37: The future of scientific information & communication

If we can map the planet…

• …then we should map the Galaxy!

Page 38: The future of scientific information & communication

GalaxyZoo

Page 39: The future of scientific information & communication
Page 40: The future of scientific information & communication

Various ways to contribute

Page 41: The future of scientific information & communication
Page 42: The future of scientific information & communication

Where Am I From?

Page 43: The future of scientific information & communication

Where Am I From?

Page 44: The future of scientific information & communication

What can be done with Big Data

Page 45: The future of scientific information & communication

Patients Like Me

Page 46: The future of scientific information & communication

Patients Like Me

Page 47: The future of scientific information & communication

I am Chemist

Page 48: The future of scientific information & communication

Back to this….

• Let’s map together all historical chemistry data and build systems to integrate new data

• Heck, let’s integrate chemistry and biology data and add in disease data too

• Lets model the data and see if we can extract new relationships – quantitative and qualitative

• Let’s make it all available on the web

Page 49: The future of scientific information & communication

How can I contribute to chemistry?

• Publish data, share data, validate and curate data• Publish chemicals, syntheses and data• “Publish” – Papers, Blogs, Reports, Tweets,

Presentations, Videos • Contribute to Wikipedia • Participate in chemistry communities• Contribute to the Big Data

Page 50: The future of scientific information & communication

• I’ve performed a few dozen chemical syntheses• I’ve run thousands of analytical spectra• I’ve generated thousands of NMR assignments• I’ve probably published <5% of all work • Most of it has been lost• But things can be different today….

About Me…as a Chemist

Page 51: The future of scientific information & communication

Blog• Opinions, procedures, observations, experiences

Page 52: The future of scientific information & communication

Presentations

Presentations, Videos, Report, Pre-publications

Page 53: The future of scientific information & communication

YouTube/Vimeo/SciVee

• Presentations are easy to turn into movies and publish to these services

• Literally “gives you a voice”

Page 54: The future of scientific information & communication

Data as a Publication

Page 55: The future of scientific information & communication

Data as a Publication?

Page 56: The future of scientific information & communication

http://figshare.com/articles/Prevalence_and_use_of_Twitter_among_scholars/104629

Page 57: The future of scientific information & communication

Contributing to the “Big Data” Maps

Page 58: The future of scientific information & communication

My Data Contributions…

Page 59: The future of scientific information & communication

Data & Curations to ChemSpider

• The Royal Society of Chemistry free database• 28.5 million chemicals and growing daily• Software interfaces to integrate to• Amenable to community contribution

– Deposit structures, property data, spectral data– Data annotation, validation and curation

Page 60: The future of scientific information & communication
Page 61: The future of scientific information & communication

• 3-year Innovative Medicines Initiative project

• Integrating chemistry and biology data using semantic web technologies

• Open source code, open data and open standards

• Academics, Pharma companies, Publishers….

Page 62: The future of scientific information & communication

The Publishers!?

Page 63: The future of scientific information & communication

(Some) Publishers are Changing?

• Data cannot be copyrighted and we have lots• Scientists contribute data in document form • Most publishers are open to Open Access

• Scientific publications are built on data so what can be done to release the data? Much data is not published? Many scientists will not share…

Page 64: The future of scientific information & communication

Publications - a summary of work

• Scientific publications are a summary of work– Is all work reported?– How much science is lost to pruning?– What of value sits in notebooks and is lost?

• How much data is lost?– How many compounds never reported?– How many syntheses fail or succeed?– How many characterization measurements?

Page 65: The future of scientific information & communication

Community Repository for Data• Funding agencies encourage sharing of data• Increasing availability of “Open Data”• Institutional repositories have no specific domain

support • Why not develop a community repository for

chemistry data – private, public, embargoed?• Provides data to develop models/algorithms?

Page 66: The future of scientific information & communication

Chemical Database Service• National Chemical Database

Service for UK Academics

• Integrating Commercial Databases and Services

• Chemicals, analytical data, prediction algorithms

• Development of data repository

Page 67: The future of scientific information & communication

Model Building with Community Data

• Community data as a basis of model building– Consume data from available databases, community

data, new publications and build predictive algorithms for the community

– How many algorithms are reported and lost? How much repeat work is done in the domain of algorithmic development?

Page 68: The future of scientific information & communication

Pulling Data from our Archive

• Our contribution to the world of chemistry data• DERA – digitally enabling the RSC archive

– Text mining• Find chemicals, reactions, analytical data, properties

– Algorithmic checking• Validate algorithmically what we can - robots

– “Web 2.0 interfaces” for curating and validating

Page 69: The future of scientific information & communication

What if we could capture it all?Digitally Enhancing the RSC Archive

Page 70: The future of scientific information & communication

Human Validation and Curation

Page 71: The future of scientific information & communication

Web 2.0 Contribution

• We have been contributing to the web for a along time already – but how much in chemistry?

• A few blogs, an increasing amount of tweeting but what about data sharing in chemistry?

Page 72: The future of scientific information & communication

The Old Way of Challenging

Page 73: The future of scientific information & communication

Challenging Science…

Page 74: The future of scientific information & communication

Collaboration towards completion

Page 75: The future of scientific information & communication

Detailed constructive dialog

Page 76: The future of scientific information & communication

Oxidation by Sodium Hydride?

Page 77: The future of scientific information & communication

The Blogosphere Analyzes…

Page 78: The future of scientific information & communication

The Blogosphere Analyzes…

Page 79: The future of scientific information & communication

How much is in the archives?

Page 80: The future of scientific information & communication

Open Notebook Science Analysis

Page 81: The future of scientific information & communication

Oxidation by Sodium Hydride?

Page 82: The future of scientific information & communication

What is Hexacyclinol?

Page 83: The future of scientific information & communication

The Blogosphere “Discusses”…

Page 84: The future of scientific information & communication

What is real, what is fake?

http://www.youtube.com/watch?v=hMpAoC-h5SA

Page 85: The future of scientific information & communication

Chemistry is Dangerous!

http://tinyurl.com/cl2awnj

Page 86: The future of scientific information & communication

Chemistry is Dangerous

• Florida DJs May Face Felony for April Fools' Water Joke Worse Than Rubio's

“… told their listeners that "dihydrogen monoxide" was coming out of the taps

throughout the Fort Myers area.”

Page 87: The future of scientific information & communication

www.dhmo.org

Page 88: The future of scientific information & communication

How do you recognize good vs bad?

Page 89: The future of scientific information & communication

Is this real?

Page 90: The future of scientific information & communication

Junk vs Real

“We then established a collaboration with professor Sum Ting Wong, a fugitive from the North Korean University Hu Yu Hai Ding”

“..identified as the new protein Wai So Dim”

Page 91: The future of scientific information & communication

What is real, what is fake?

Page 92: The future of scientific information & communication

Helping to change science

• Participation and contribution • Immediacy of action• Platforms for contribution• Openness…whatever that is

Page 93: The future of scientific information & communication

Openness – Carries Licensing

• Openness may be hard..

• Open Access flavors• Open Source licenses• Open Data licenses• Open Notebook Science

Page 94: The future of scientific information & communication

Getting Called Out in Public…Rules for Licensing Data

Page 95: The future of scientific information & communication

Challenged in the Twittersphere

Page 96: The future of scientific information & communication

Annotating Articles Today…

Page 97: The future of scientific information & communication

Attribution to me…

Page 98: The future of scientific information & communication

Remember Quantifying Scientists• Scientists Impact Factors. Science is both

competitive and collaborative• Can we measure ALL contributions to science?

Page 99: The future of scientific information & communication

Article-Level metrics are here

Page 100: The future of scientific information & communication

The Alt-Metrics Manifesto• http://altmetrics.org/manifesto/

Page 101: The future of scientific information & communication

ImpactStory

Page 102: The future of scientific information & communication

ImpactStory

Page 103: The future of scientific information & communication

Scientists AltMetrics

Page 104: The future of scientific information & communication

Detailed Usage Statistics

Page 105: The future of scientific information & communication

Usage, Citations, Social Media, Etc

Page 106: The future of scientific information & communication

• Persistent unique digital identifier • Integrates to workflows such as manuscript

and grant submission• Supports automated linkages with your

professional activities

Enabled by

Page 107: The future of scientific information & communication

Micropublishing How much data is lost?

• How many reactions never get published?• How much data could be shared?• How many properties are measured and lost?• What stands in the way of sharing?

– Is it technology? – Permissions? “The Boss”, Licensing?

Page 108: The future of scientific information & communication

Micropublishing Syntheses

Page 109: The future of scientific information & communication

ChemSpider SyntheticPages

Page 110: The future of scientific information & communication

What is real, what is fake?

Page 111: The future of scientific information & communication

Profile

Page 112: The future of scientific information & communication
Page 113: The future of scientific information & communication
Page 114: The future of scientific information & communication

Interactive Data

Page 115: The future of scientific information & communication

Rewards and Recognition

• The badgesonomy culture of recognition is growing.

• Badges are commonplace– FourSquare – Klout

Page 116: The future of scientific information & communication

Rewards and Recognition

• Rewards and Recognition starting with CSSP then expands to other platforms

• Including paths to expose such recognition on AltMetrics platforms – in discussion…

Page 117: The future of scientific information & communication

Impact by Data Set onData

IC50 Measurements for 62 substituted benzoxazolesChemSpider Data Repository: DOI: 10.1356/CSID784.4

Page 118: The future of scientific information & communication

What Does the Future Hold?

Page 119: The future of scientific information & communication

The Data Deluge Will Not Go Away

Page 120: The future of scientific information & communication

The Linked Network Will Grow

Page 121: The future of scientific information & communication

We DON’T want this world..

Page 122: The future of scientific information & communication

Thanks Martin!

Page 123: The future of scientific information & communication
Page 124: The future of scientific information & communication

We’re not there yetYou can’t get there from here

Page 125: The future of scientific information & communication

Thank you

Email: [email protected] Twitter: ChemConnectorPersonal Blog: www.chemconnector.com SLIDES: www.slideshare.net/AntonyWilliams