A Journey in Data Discovery Wendy Watkins TSES 3001 30 October, 2007.
-
Upload
elfrieda-merritt -
Category
Documents
-
view
217 -
download
0
Transcript of A Journey in Data Discovery Wendy Watkins TSES 3001 30 October, 2007.
<odesi>A Journey in Data Discovery
Wendy WatkinsTSES 300130 October, 2007
<odesi> (Ontario Data Documentation, Extraction Service and
Infrastructure)
• A project to provide data on a browser-based platform
• Uses the DDI (Data Documentation Initiative) international standard for metadata
• Written in XML so it can be read across multiple platforms– Text-based representation that can be preserved (ascii)
• Allows searching across datasets and servers
• Provides an easy-to-use interface for beginning researchers
<odesi> • A $1.04 million project funded by OntarioBuys (province) and OCUL
(University Libraries)• Labour-intensive project requiring the input of extensive metadata to
provide better access and preservation• A collaborative effort between Carleton and Guelph• Work is being done by co-op students from both universities• Will include:
– 55 years of Gallup Canada datasets (1945-2000)– About 250 Statistics Canada surveys– Canadian election surveys from 1965-2006– Other polling data– Data from the Inter-university Consortium for Political and Social
Research at Ann Arbor• Data will be mounted on Scholars’ Portal• XML files will be shared across the country
<odesi>
• Will build on previous work • CRIC files are already in this format• Exposes undergraduates to the research
enterprise at an early stage in their careers • Is important in developing numeracy skills• Focus is on understanding what the data show,
not on the formulae• More will be available at Carleton by the next
term
<odesi>
• Uses the Nesstar software• Metadata is first put into the publisher• Can be as rich as you want to make it• Can be exported as an XML file
– If something better comes along, the new software will read the file
• Collaboration through sharing the work
click
• click
click
• click
click
• click
click
• click
click
• click
click
• click
click
• click
<odesi>
• Will allow data to be shared by all with access to the web
• Because of its format, evaporation will be avoided
<odesi> (Ontario Data Documentation, Extraction Service and
Infrastructure)
• A project to provide data on a browser-based platform
• Uses the DDI (Data Documentation Initiative) international standard for metadata
• Written in XML so it can be read across multiple platforms– Text-based representation that can be preserved (ascii)
• Allows searching across datasets and servers
• Provides an easy-to-use interface for beginning researchers
<odesi> • A $1.04 million project funded by OntarioBuys (province) and OCUL
(University Libraries)• Labour-intensive project requiring the input of extensive metadata to
provide better access and preservation• A collaborative effort between Carleton and Guelph• Work is being done by co-op students from both universities• Will include:
– 55 years of Gallup Canada datasets (1945-2000)– About 250 Statistics Canada surveys– Canadian election surveys from 1965-2006– Other polling data– Data from the Inter-university Consortium for Political and Social
Research at Ann Arbor• Data will be mounted on Scholars’ Portal• XML files will be shared across the country
<odesi>
• Will build on previous work • CRIC files are already in this format• Exposes undergraduates to the research
enterprise at an early stage in their careers • Is important in developing numeracy skills• Focus is on understanding what the data show,
not on the formulae• More will be available at Carleton by the next
term
<odesi>
• Uses the Nesstar software• Metadata is first put into the publisher• Can be as rich as you want to make it• Can be exported as an XML file
– If something better comes along, the new software will read the file
• Collaboration through sharing the work
click
• click
click
• click
click
• click
click
• click
click
• click
click
• click
click
• click
<odesi>
• Will allow data to be shared by all with access to the web
• Because of its format, evaporation will be avoided