CANSIM A look at 3 interfaces Ontario DLI Training University of Guelph April 12, 2006 Suzette Giles...


Page 1: CANSIM A look at 3 interfaces Ontario DLI Training University of Guelph April 12, 2006 Suzette Giles Data, Map and GIS Librarian Ryerson University Library.

CANSIM
A look at 3 interfaces

Ontario DLI Training
University of Guelph

April 12, 2006

Suzette Giles
Data, Map and GIS Librarian
Ryerson University Library

Page 2:

A look at Stat. Can. website, E-STAT and CHASS

Where?
What?
Access
Content
Searching
Results
Visualization
Manipulation
Output formats
Which to use?

Page 3:

“Imitation is the sincerest flattery”

Sources:

Statistics Canada: About CANSIM:
http://www.statcan.ca/english/ads/cansimII/index.htm

Statistics Canada E-STAT:
http://estat.statcan.ca/content/English/over.shtml

University of Toronto CHASS CANSIM information:
http://00dc1.chass.utoronto.ca/cansim2/English/index.html

University of Toronto Data Library Services:
http://www.chass.utoronto.ca/datalib/codebooks/cstdli/cansim.htm

Page 4:

Where is CANSIM??

Statistics Canada home page:
  Click on Advanced search; Search CANSIM is in the left-hand menu
  OR click on Our Products and Services; CANSIM is under “Access our Online databases”

E-STAT:
  Left-hand menu of the Table of Contents page

CHASS:
  Google, or go via the University of Toronto’s Data Library Service page “CHASS interface to selected databases”

Page 5:

What is CANSIM?

“CANSIM is Statistics Canada's key socio-economic database.” (Stat Can website)

“CANSIM: Canadian Socio-Economic Information Management System.” (CHASS)

“CANSIM is a multidimensional database containing more than 26 million time series regrouped in over 2,400 tables” (E-STAT, April 4, 2006)

Page 6:

CANSIM I and CANSIM II

CANSIM I: Original CANSIM database consisting of 908,879 time series in 9,380 matrices. Contains matrices and time series not in CANSIM II. Series start with a letter followed by numbers (called a label). Last updated June 1, 2002. (CHASS)

CANSIM II (CANSIM): Reorganized database. Matrices called Tables. Time series all start with a V, sometimes called a vector or a label. (CHASS)
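The naming rule above can be sketched in a few lines of Python. This is only an illustration of the slide's description, not a validator used by any of the three interfaces: the function name and the exact patterns are assumptions, and a V-prefixed identifier is treated as CANSIM II even though a CANSIM I label could in principle also begin with V.

```python
import re

# Assumed patterns, based only on the slide's wording:
# CANSIM II vectors start with "V"; CANSIM I labels are one letter
# followed by digits. Real identifiers may follow further rules.
CANSIM_II_VECTOR = re.compile(r"^V\d+$", re.IGNORECASE)
CANSIM_I_LABEL = re.compile(r"^[A-Z]\d+$", re.IGNORECASE)


def classify_series_id(series_id: str) -> str:
    """Guess which CANSIM generation an identifier belongs to (hypothetical helper)."""
    cleaned = series_id.strip()
    # Check the V prefix first, because it also fits the CANSIM I pattern.
    if CANSIM_II_VECTOR.match(cleaned):
        return "CANSIM II vector"
    if CANSIM_I_LABEL.match(cleaned):
        return "CANSIM I label"
    return "unknown"


if __name__ == "__main__":
    # Made-up identifiers, used only to exercise the function.
    for example in ["V12345", "D100", "12345"]:
        print(example, "->", classify_series_id(example))
```

A check like this is mainly useful for deciding which CHASS database (CANSIM I or CANSIM II) to search when a user arrives with only a series number.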

Page 7:

Timing!!

NOTICE: The CANSIM service will be unavailable most of this coming weekend, from 7PM (Eastern time) Friday April 7 to approximately 7PM Sunday April 9, because of a major database reconfiguration.

Page 8:

Access

Product    Supplier           Use               Cost
CANSIM     Statistics Canada  Unrestricted      Fee ($3.00 to $5,000)
CANSIM     E-STAT             Restricted – DSP  “Free” via IP Address
CANSIM I   CHASS              Restricted – DLI  “Free” via IP Address
CANSIM II  CHASS              Restricted – DLI  “Free” via IP Address

Page 9:

Content

                   Stat. Can    E-STAT       CHASS
Number of Tables   2,400+       2,400+       2,541
Number of Series   25 million+  26 million+  28 million+
Terminated series  Yes          Yes          Yes
Updates            Daily        Yearly       Weekly
CANSIM I data      No           No           Yes
CANSIM II data     Yes          Yes          Yes
Concordances       No           No           Yes

Page 10:

NOTES

When a method of measurement or definition or an attribute or concept changes, the old series is terminated, and a new series with a new series identifier is begun. (CANSIM – the many faces, UT/DLS)

When SIC 1980 was changed to NAICS 1997, series were terminated and new ones begun. This explains the limited time line of the NAICS series.

Page 11:

Content (CANSIM II)

                               Stat. Can  E-STAT  CHASS
User Guide                     Yes        Yes     Limited
Table directory                Yes        Yes     No
Terminated Series              Yes        Yes     Yes
IMDB/Survey lists              Yes        Yes     Yes
Numerical list of Series       No         No      Yes
Vector (series) listing        Yes        No      (Yes)
Link to publications & tables  Yes        No      No

Page 12:

Searching

                            Stat. Can  E-STAT  CHASS
By Keyword / Text           Yes        Yes     Yes
By Subject (Browse)         Yes        Yes     Yes
By Table number             Yes        Yes     Yes
By Series number            Yes        Yes     Yes
Survey number - get Tables  Yes        Yes     No

Page 13:

Searching

                             Stat. Can  E-STAT  CHASS
Advanced / Boolean search    Yes        Yes     No
By Dimension member desc.    Yes        Yes     No
IMDB (surveys) by keyword    No         No      Yes
Frequently requested series  No         No      Yes

Page 14:

NOTES – Searching/Results

CHASS - get listing of series unless search by Table number
Stat Can - get listing of Tables unless search by Series number
Therefore difficult to compare retrieval
PETS – CHASS got 60 series (82 with carPETS)
PETS – Stat Can got 5 tables – did not include carpets
Important to check “Match full keyword” in CHASS
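The PETS versus carPETS result is what one would expect if a plain keyword search matches substrings while “Match full keyword” matches whole words only. The Python sketch below illustrates that difference under that assumption. It is not the CHASS implementation, and the two titles are made up for the example.

```python
import re


def substring_match(keyword: str, title: str) -> bool:
    """True if the keyword appears anywhere in the title, even inside a longer word."""
    return keyword.lower() in title.lower()


def full_keyword_match(keyword: str, title: str) -> bool:
    """True only if the keyword appears as a whole word."""
    pattern = r"\b" + re.escape(keyword) + r"\b"
    return re.search(pattern, title, flags=re.IGNORECASE) is not None


# Hypothetical series titles, for illustration only.
titles = [
    "Household spending on pets",
    "Shipments of carpets and rugs",
]

if __name__ == "__main__":
    print([t for t in titles if substring_match("PETS", t)])     # both titles
    print([t for t in titles if full_keyword_match("PETS", t)])  # first title only
```

The substring search picks up the carpets title as well, which mirrors the gap between 82 and 60 series noted above.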

Page 15:

Results

Text / Keyword search  Stat. Can  E-STAT  CHASS
1st level - Tables     Yes        Yes     No
1st level - Series     No         No      Yes

Subject (browse)
1st level - Tables     Yes        Yes     Yes

Survey (browse)
1st level – Tables     Yes        Yes     -

Page 16:

Results

Text / Keyword search         Stat. Can  E-STAT  CHASS
2nd level get:
Link to Survey inform.        Yes        Yes     Yes
Related subjects, categories  Yes        Yes     No
Vector directory              Yes        No      (Yes)
Link to publicat. & tables    Yes        No      No

Page 17:

Results

                                       Stat. Can  E-STAT    CHASS
Selection of series-pick list          Yes        Yes       No
Date selection - series                Multiple   Multiple  Single
Retrieve as individ. series            Yes        Yes       Yes
Retrieve as a table                    Yes        Yes       No
Retrieve series from different tables  Yes        Yes       Yes

Page 18:

NOTES

Notes from Chris Leowski’s presentation in 2002:

CANSIM II: vector numbers not recycled when a series terminated. In CANSIM I they were.

No frequency conversion in the CHASS CANSIM II, this is not a CHASS priority.

Badly need a way of pointing users to series that replace terminated series and vice versa.
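One way to picture the pointer asked for in the last note is a two-way lookup between terminated series and their replacements. The Python sketch below assumes such a concordance exists as a simple mapping. The identifiers and the function name are hypothetical, and nothing here reflects how CHASS or Statistics Canada actually store this information.

```python
from typing import Dict, Optional

# Hypothetical concordance: terminated series -> series that replaced it.
# These identifiers are invented for the example.
REPLACED_BY: Dict[str, str] = {
    "V100": "V200",
    "V101": "V201",
}

# The reverse direction ("vice versa"): replacement -> terminated series.
REPLACES: Dict[str, str] = {new: old for old, new in REPLACED_BY.items()}


def related_series(series_id: str) -> Dict[str, Optional[str]]:
    """Return the successor and predecessor of a series, if either is known."""
    return {
        "replaced_by": REPLACED_BY.get(series_id),
        "replaces": REPLACES.get(series_id),
    }


if __name__ == "__main__":
    print(related_series("V100"))
    print(related_series("V200"))
```

Because CANSIM II vector numbers are not recycled, a one-to-one mapping like this stays unambiguous. Under CANSIM I, where numbers were reused, each entry would also need a date range.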

Page 19:

Visualisation of results

Individual Time Series            E-STAT  CHASS
Line(s) graph                     Yes     Yes
Bar(s) graph                      Yes     Yes
Lines graph with regression line  No      ?
Pie chart                         Yes     No
Scatter chart                     Yes     No
Histogram                         Yes     No
Box and whisker                   Yes     No

Page 20:

Manipulation of Results

                              E-STAT    CHASS
Change of frequency           Multiple  CANSIM I
Convert to annual - sum       Yes       CANSIM I
Convert to annual - average   Yes       CANSIM I
Percent changes               Yes       No
Year to date sums & averages  Yes       No
Moving averages               Yes       No
Centred moving averages       Yes       No
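For users who download raw series from an interface that lacks these options, the manipulations listed above are simple to reproduce. The Python sketch below uses textbook definitions and made-up monthly values for a single year. E-STAT's exact formulas (for example, how it handles missing periods) are not documented here, so treat this as an approximation.

```python
from itertools import accumulate
from statistics import mean
from typing import List

# Made-up monthly values for one year, for illustration only.
monthly = [10.0, 12.0, 11.0, 13.0, 14.0, 15.0, 16.0, 15.0, 14.0, 13.0, 12.0, 11.0]


def annual_sum(values: List[float]) -> float:
    """Convert one year of sub-annual data to annual by summing."""
    return sum(values)


def annual_average(values: List[float]) -> float:
    """Convert one year of sub-annual data to annual by averaging."""
    return mean(values)


def percent_changes(values: List[float]) -> List[float]:
    """Period-over-period percent change; one element shorter than the input."""
    return [100.0 * (cur - prev) / prev for prev, cur in zip(values, values[1:])]


def year_to_date_sums(values: List[float]) -> List[float]:
    """Running total from the first period of the year."""
    return list(accumulate(values))


def year_to_date_averages(values: List[float]) -> List[float]:
    """Running average from the first period of the year."""
    return [total / (i + 1) for i, total in enumerate(accumulate(values))]


def moving_average(values: List[float], window: int) -> List[float]:
    """Trailing moving average over the last `window` periods."""
    return [mean(values[i - window + 1 : i + 1]) for i in range(window - 1, len(values))]


def centred_moving_average(values: List[float], window: int) -> List[float]:
    """Moving average centred on each period; this sketch supports odd windows only."""
    if window % 2 == 0:
        raise ValueError("this sketch only supports odd window sizes")
    half = window // 2
    return [mean(values[i - half : i + half + 1]) for i in range(half, len(values) - half)]


if __name__ == "__main__":
    print(annual_sum(monthly), annual_average(monthly))
    print(percent_changes(monthly)[:3])
    print(moving_average(monthly, 3)[:3])
```

The trailing and centred averages contain the same numbers for a given window. The difference is which period each value is attributed to, and that matters when lining results up against dates.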

Page 21:

Output formats

                       E-STAT  CHASS
HTML table             Yes     No
Comma separated (CSV)  Yes*    Yes
Spreadsheet            Yes*    Yes
RATS, SAS, Shazam      No      Yes
SPSS, TSP, TSPterse    No      Yes
PRN (tab separated)    Yes*    No

* Choice of time as columns or rows
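If a file was downloaded with time in the wrong orientation, it can be flipped afterwards rather than retrieved again. The Python sketch below transposes a small CSV with the standard library. The sample data and the column names V1 and V2 are invented, and a real E-STAT download may carry title or footnote lines that would need to be stripped first.

```python
import csv
import io

# Invented sample: time periods as rows, series as columns.
TIME_AS_ROWS = "period,V1,V2\n2004,100,200\n2005,110,210\n"


def transpose_csv(text: str) -> str:
    """Swap rows and columns of a rectangular CSV held in a string."""
    rows = list(csv.reader(io.StringIO(text)))
    transposed = zip(*rows)  # assumes every row has the same number of fields
    out = io.StringIO()
    csv.writer(out, lineterminator="\n").writerows(transposed)
    return out.getvalue()


if __name__ == "__main__":
    # Time periods become columns; each series becomes a row.
    print(transpose_csv(TIME_AS_ROWS))
```

Applying the function twice returns the original layout, which is a quick way to confirm that nothing was lost in the conversion.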

Page 22:

Choosing which to use

Currency – Daily vs. weekly vs. yearly
Ease of searching – pick lists in Stat. Can. helpful
Sophistication of user – list of series can make finding data difficult with CHASS interface
Frequently used series are fast – in CHASS
Could use Stat Can interface to find series # and then go to CHASS to get most recent data
Output required – CHASS has more formats for statistical packages
Data manipulation required
Data visualisation required

Page 23:

Statistics Canada: Search results

Page 24:

Statistics Canada: Series selection

Page 25:

Statistics Canada: selecting Dimension members and dates

Page 26:

CHASS: Selection options

Page 27:

CHASS: Keyword search

Page 28:

CHASS: Results page

Page 29:

CHASS: Series information

Page 30:

CHASS: Retrieval, date and output selection

Page 31:

CHASS: Display of data

Page 32:

E-STAT: Search CANSIM

Page 33:

E-STAT: Text search

Page 34:

E-STAT: Advanced search

Page 35:

E-STAT: Search results

Page 36:

E-STAT: Series selection

Page 37:

E-STAT: selecting Dimension members and dates

Page 38:

E-STAT: output options

Page 39:

E-STAT: HTML table, time as rows

Page 40:

Page 41:

Search done on topic Pets in CANSIM II, CHASS interface