COMIC: A tool for composite indices evaluation
P. Massoli, M. Mazziotta, A. Pareto, C. Rinaldelli
Padua, 17th-18th September 2015
Contents
1. Objectives
2. Software requisites
3. What can the software do?
4. Implemented methods
5. Directory tree
6. Input data
Graphical user interface
7. Output data
The “Table of Contents” file
Plots & Tables
8. Further developments
Objectives
COMIC = COMposite Indices Creator
It has been developed entirely by the authors, who are members of the
BES Committee at the Italian National Institute of Statistics (ISTAT).
This software enables the user to:
calculate composite indices and their corresponding ranks
compare different calculation methods in order to easily choose the most
appropriate one
write efficient reports on the results obtained
Software requisites
COMIC was required to be written entirely in SAS so that experts within the BES Committee can participate actively in implementing and integrating the composition methods.
COMIC is relatively easy to use and requires no installation: one simply copies the software onto one's computer (independently of the operating system: Windows, Linux, Unix, ...).
No particular skills are needed to run the software.
COMIC has been developed using a Base SAS setup, so no additional SAS packages are required, as these could hinder its use.
What can the software do?
It handles data on any type of statistical unit. Calculations are currently
optimized for statistical units such as regions and groups thereof
(macroregions)
It performs exploratory analysis of the input data (correlation matrix, PCA,
factor analysis, communalities, scree plots, matrix plots, …)
It evaluates the rank for each synthesis method or composite index selected
It creates geographical maps of the values of the composite indices (only for
those statistical units for which it makes sense)
It creates time series of the composite indices (only when the input
data cover two or more time periods)
It creates “speed-acceleration” plots (if both a static and a dynamic method are
selected)
It compares the selected methods by means of cograduation matrices and
scatterplots
It performs an influence analysis (reports and plots)
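The ranking and method-comparison steps listed above can be illustrated with a minimal Python sketch. It is not COMIC's SAS code: the scores are invented, ties are ignored, and Spearman's rho is used here as one common cograduation measure, which is an assumption, since the slides do not say which coefficient COMIC reports.

```python
def ranks(scores):
    """Rank units by score, 1 = highest. Ties are not handled in this sketch."""
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    result = [0] * len(scores)
    for position, unit in enumerate(order, start=1):
        result[unit] = position
    return result


def spearman_rho(rank_a, rank_b):
    """Spearman rank correlation for two rankings without ties."""
    n = len(rank_a)
    d2 = sum((a - b) ** 2 for a, b in zip(rank_a, rank_b))
    return 1 - 6 * d2 / (n * (n * n - 1))


# Hypothetical composite index values for four units under two methods.
method_a = [98.5, 100.0, 91.0, 105.2]
method_b = [0.40, 0.55, 0.35, 0.50]

rank_a = ranks(method_a)
rank_b = ranks(method_b)
rho = spearman_rho(rank_a, rank_b)
```

A value of rho close to 1 means the two methods order the units almost identically; with more than two methods, the pairwise values can be arranged in a matrix.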
Implemented methods (1/6)
Arithmetic mean of 0-1 indices (Media indici 0-1)
Given the matrix X = {x_ijt} with n rows (statistical units), m columns (indicators) and p layers (years), the transformed matrix R = {r_ijt} is computed as:
\[
r_{ijt} =
\begin{cases}
\dfrac{x_{ijt} - \mathrm{Min}_{x_j}}{\mathrm{Max}_{x_j} - \mathrm{Min}_{x_j}} & \text{if indicator } j \text{ has positive polarity;} \\[2ex]
\dfrac{\mathrm{Max}_{x_j} - x_{ijt}}{\mathrm{Max}_{x_j} - \mathrm{Min}_{x_j}} & \text{if indicator } j \text{ has negative polarity,}
\end{cases}
\]
where \mathrm{Min}_{x_j} and \mathrm{Max}_{x_j} are, respectively, the minimum and the maximum of indicator j across the n statistical units in the p years considered, or values supplied externally.
The composite index for unit i at time t is given by:
\[
M_{0\text{-}1,\,it} = \frac{\sum_{j=1}^{m} r_{ijt}}{m}.
\]
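As an illustration of the 0-1 method, here is a minimal Python sketch, not COMIC's SAS implementation. It assumes a single time period, takes Min and Max from the data unless they are passed in, and uses invented indicator values; the function names are hypothetical.

```python
def normalize_01(values, positive=True, lo=None, hi=None):
    """Rescale one indicator to [0, 1]; the scale is flipped for negative polarity."""
    lo = min(values) if lo is None else lo
    hi = max(values) if hi is None else hi
    span = hi - lo
    if span == 0:
        raise ValueError("indicator is constant: Max equals Min")
    if positive:
        return [(x - lo) / span for x in values]
    return [(hi - x) / span for x in values]


def mean_01(columns, polarities):
    """Arithmetic mean of the normalized indicators, one value per unit."""
    normalized = [normalize_01(col, pos) for col, pos in zip(columns, polarities)]
    m = len(normalized)
    return [sum(unit) / m for unit in zip(*normalized)]


# Hypothetical data: 3 units, 2 indicators (the second has negative polarity).
ind1 = [10.0, 20.0, 30.0]
ind2 = [5.0, 3.0, 1.0]
scores = mean_01([ind1, ind2], [True, False])
```

The optional `lo` and `hi` arguments correspond to the externally supplied goalposts mentioned in the definition.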
Implemented methods (2/6)
Arithmetic mean of z-scores (Media z-scores)
Given the matrix X = {x_ijt} with n rows (statistical units), m columns (indicators) and p layers (years), the transformed matrix Z = {z_ijt} is computed as:
\[
z_{ijt} =
\begin{cases}
\dfrac{x_{ijt} - M_{x_{jt}}}{S_{x_{jt}}} & \text{if indicator } j \text{ has positive polarity;} \\[2ex]
-\dfrac{x_{ijt} - M_{x_{jt}}}{S_{x_{jt}}} & \text{if indicator } j \text{ has negative polarity,}
\end{cases}
\]
where M_{x_{jt}} and S_{x_{jt}} are, respectively, the mean and the standard deviation of indicator j at time t.
The composite index for unit i at time t is given by:
\[
M_{z_{it}} = \frac{\sum_{j=1}^{m} z_{ijt}}{m}.
\]
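The z-score method can be sketched in Python as follows. This is an illustration rather than COMIC's code: it handles one time period, the data are invented, and the standard deviation is computed with divisor n (population form), which is an assumption about the intended definition.

```python
import math


def z_scores(values, positive=True):
    """Standardize one indicator; the sign is flipped for negative polarity."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((x - mean) ** 2 for x in values) / n)  # divisor n (assumed)
    sign = 1.0 if positive else -1.0
    return [sign * (x - mean) / sd for x in values]


def mean_z(columns, polarities):
    """Arithmetic mean of the z-scores, one value per unit."""
    standardized = [z_scores(col, pos) for col, pos in zip(columns, polarities)]
    m = len(standardized)
    return [sum(unit) / m for unit in zip(*standardized)]


# Hypothetical data: 3 units, 2 indicators (the second has negative polarity).
ind1 = [1.0, 2.0, 3.0]
ind2 = [6.0, 4.0, 2.0]
scores = mean_z([ind1, ind2], [True, False])
```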
Implemented methods (3/6)
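MPI (Mazziotta-Pareto Index)
Given the matrix X = {x_ijt} with n rows (statistical units), m columns (indicators) and p layers (years), the transformed matrix Z = {z_ijt} is computed as:
\[
z_{ijt} =
\begin{cases}
100 + \dfrac{x_{ijt} - M_{x_{jt}}}{S_{x_{jt}}} \cdot 10 & \text{if indicator } j \text{ has positive polarity;} \\[2ex]
100 - \dfrac{x_{ijt} - M_{x_{jt}}}{S_{x_{jt}}} \cdot 10 & \text{if indicator } j \text{ has negative polarity,}
\end{cases}
\]
where M_{x_{jt}} and S_{x_{jt}} are, respectively, the mean and the standard deviation of indicator j at time t. The composite index for unit i at time t is given by:
\[
\mathrm{MPI}^{+/-}_{it} = M_{z_{it}} \pm S_{z_{it}} \, cv_{z_{it}}
\]
where
\[
M_{z_{it}} = \frac{\sum_{j=1}^{m} z_{ijt}}{m}; \qquad
S_{z_{it}} = \sqrt{\frac{\sum_{j=1}^{m} \left( z_{ijt} - M_{z_{it}} \right)^{2}}{m}}; \qquad
cv_{z_{it}} = \frac{S_{z_{it}}}{M_{z_{it}}}.
\]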
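A minimal Python sketch of the MPI calculation follows, under stated assumptions: one time period, invented data, standard deviations computed with divisor n, and the sign of the penalty left as a parameter (`sign=-1` gives the variant with a negative penalty). The function names are hypothetical and this is not COMIC's SAS code.

```python
import math


def standardize_100_10(values, positive=True):
    """Standardize one indicator to mean 100 and standard deviation 10."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((x - mean) ** 2 for x in values) / n)  # divisor n (assumed)
    sign = 1.0 if positive else -1.0
    return [100 + sign * 10 * (x - mean) / sd for x in values]


def mpi(columns, polarities, sign=-1):
    """Mean of standardized values, adjusted by the penalty S * cv for each unit."""
    z = [standardize_100_10(col, pos) for col, pos in zip(columns, polarities)]
    m = len(z)
    result = []
    for unit in zip(*z):
        mean = sum(unit) / m
        sd = math.sqrt(sum((v - mean) ** 2 for v in unit) / m)
        cv = sd / mean
        result.append(mean + sign * sd * cv)
    return result


# Hypothetical data: units 0 and 2 are unbalanced, unit 1 is balanced.
ind1 = [1.0, 2.0, 3.0]
ind2 = [3.0, 2.0, 1.0]
scores = mpi([ind1, ind2], [True, True])
```

All three units have the same mean of standardized values (100), but the two unbalanced units are penalized and score about 98.5, which is the effect the penalty term is designed to produce.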
Implemented methods (4/6)
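AMPI (Adjusted MPI)
Given the matrix X = {x_ijt} with n rows (statistical units), m columns (indicators) and p layers (years), the transformed matrix R = {r_ijt} is computed as:
\[
r_{ijt} =
\begin{cases}
\dfrac{x_{ijt} - \mathrm{Min}_{x_{jt}}}{\mathrm{Max}_{x_{jt}} - \mathrm{Min}_{x_{jt}}} \cdot 60 + 70 & \text{if indicator } j \text{ has positive polarity;} \\[2ex]
\dfrac{\mathrm{Max}_{x_{jt}} - x_{ijt}}{\mathrm{Max}_{x_{jt}} - \mathrm{Min}_{x_{jt}}} \cdot 60 + 70 & \text{if indicator } j \text{ has negative polarity,}
\end{cases}
\]
where \mathrm{Min}_{x_{jt}} and \mathrm{Max}_{x_{jt}} are, respectively, the minimum and the maximum of indicator j across the n statistical units in the p years considered, or values calculated so that a reference value (e.g., the national average in the base year) equals 100.
The composite index for unit i at time t is given by:
\[
\mathrm{MPI}^{+/-}_{it} = M_{r_{it}} \pm S_{r_{it}} \, cv_{r_{it}},
\]
where M_{r_{it}}, S_{r_{it}} and cv_{r_{it}} are the mean, the standard deviation and the coefficient of variation of the normalized values r_ijt for unit i at time t.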
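The AMPI steps can be sketched in Python as below. This is an illustration under assumptions, not COMIC's code: the goalposts `lo` and `hi` are passed in explicitly, the data are invented, the standard deviation uses divisor m, and the penalty sign is a parameter.

```python
import math


def rescale_70_130(values, lo, hi, positive=True):
    """Min-max rescaling of one indicator to the range [70, 130]."""
    span = hi - lo
    if positive:
        return [(x - lo) / span * 60 + 70 for x in values]
    return [(hi - x) / span * 60 + 70 for x in values]


def penalized_mean(unit_values, sign=-1):
    """Mean of a unit's normalized values, adjusted by the penalty S * cv."""
    m = len(unit_values)
    mean = sum(unit_values) / m
    sd = math.sqrt(sum((v - mean) ** 2 for v in unit_values) / m)
    return mean + sign * sd * (sd / mean)


# Hypothetical data: 3 units, 2 indicators, goalposts 0 and 10 for both.
ind1 = rescale_70_130([0.0, 5.0, 10.0], lo=0.0, hi=10.0)
ind2 = rescale_70_130([10.0, 5.0, 0.0], lo=0.0, hi=10.0)
ampi = [penalized_mean(unit) for unit in zip(ind1, ind2)]
```

Because the goalposts are fixed across periods rather than recomputed for each one, values obtained this way remain comparable over time, which the per-period standardization of the MPI does not allow.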
Implemented methods (5/6)
Jevons index: static and dynamic
Given the matrix X = {x_ijt} with n rows (statistical units), m columns (indicators) and p layers (years), the 'static' composite index for unit i at time t is given by:
\[
\mathrm{JS}_{it} = \left( \prod_{j=1}^{m} \frac{x_{ijt}}{x_{bjt}} \right)^{1/m} \cdot 100
\]
where x_{bjt} is the base value of indicator j at time t (e.g., the national average in the reference year). The 'dynamic' composite index for unit i at time t is given by:
\[
\mathrm{JD}_{it} = \left( \prod_{j=1}^{m} \frac{x_{ijt}}{x_{ij(t-1)}} \right)^{1/m} \cdot 100
\]
where x_{ij(t-1)} is the value of indicator j for unit i at time t-1.
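Both Jevons indices are geometric means of ratios and differ only in the denominator, so one helper covers them in the Python sketch below. The values are invented and the function name is hypothetical; `math.prod` requires Python 3.8 or later.

```python
import math


def jevons(values, bases):
    """Geometric mean of the ratios values[j] / bases[j], times 100."""
    m = len(values)
    ratio_product = math.prod(x / b for x, b in zip(values, bases))
    return 100 * ratio_product ** (1 / m)


# Hypothetical unit with two indicators.
current = [110.0, 90.0]      # x_ijt
reference = [100.0, 100.0]   # x_bjt, e.g. national averages (assumed values)
previous = [100.0, 95.0]     # x_ij(t-1)

js = jevons(current, reference)  # static index
jd = jevons(current, previous)   # dynamic index
```

A unit that equals the base on every indicator scores exactly 100; here the static index is slightly below 100 because the 10% shortfall on one indicator outweighs the 10% gain on the other under a geometric mean.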
Implemented methods (6/6)
IMG (Indice Media Geometrica, geometric mean index)
Given the matrix X = {x_ijt} with n rows (statistical units), m columns (indicators) and p layers (years), the transformed matrix R = {r_ijt} is computed as:
\[
r_{ijt} =
\begin{cases}
\dfrac{x_{ijt} - \mathrm{Min}_{x_{jt}}}{\mathrm{Max}_{x_{jt}} - \mathrm{Min}_{x_{jt}}} \cdot 198 + 1 & \text{if indicator } j \text{ has positive polarity;} \\[2ex]
\dfrac{\mathrm{Max}_{x_{jt}} - x_{ijt}}{\mathrm{Max}_{x_{jt}} - \mathrm{Min}_{x_{jt}}} \cdot 198 + 1 & \text{if indicator } j \text{ has negative polarity,}
\end{cases}
\]
where \mathrm{Min}_{x_{jt}} and \mathrm{Max}_{x_{jt}} are, respectively, the minimum and the maximum of indicator j across the n statistical units in the p years considered, or values calculated so that a reference value (e.g., the national average in the base year) equals 100.
The composite index for unit i at time t is given by:
\[
\mathrm{IMG}_{it} = \left( \prod_{j=1}^{m} r_{ijt} \right)^{1/m}.
\]
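A short Python sketch of the IMG method, under the same caveats as the other examples: invented data, explicit goalposts `lo` and `hi`, hypothetical function names, and no claim to reproduce COMIC's SAS code. Rescaling to [1, 199] keeps every normalized value strictly positive, so the geometric mean is always defined.

```python
import math


def rescale_1_199(values, lo, hi, positive=True):
    """Min-max rescaling of one indicator to the range [1, 199]."""
    span = hi - lo
    if positive:
        return [(x - lo) / span * 198 + 1 for x in values]
    return [(hi - x) / span * 198 + 1 for x in values]


def geometric_mean(unit_values):
    """Geometric mean of a unit's normalized values."""
    m = len(unit_values)
    return math.prod(unit_values) ** (1 / m)


# Hypothetical data: 3 units, 2 indicators, goalposts 0 and 10 for both.
ind1 = rescale_1_199([0.0, 5.0, 10.0], lo=0.0, hi=10.0)
ind2 = rescale_1_199([10.0, 5.0, 0.0], lo=0.0, hi=10.0)
img = [geometric_mean(unit) for unit in zip(ind1, ind2)]
```

The balanced unit scores 100, while the two unbalanced units score about 14.1, showing how strongly the geometric mean penalizes a very low value on a single indicator.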
Directory tree
COMIC is distributed as a compressed archive. Once it is
uncompressed, the directory tree is as follows:
Directory DATI, which contains the following predefined sub-directories:
  Input: input data can be stored here (or in any other directory
  selected by the user);
  Maps: where COMIC searches for the data needed
  to create a geographical map;
  Output: output data can be stored here (or in any other directory
  selected by the user);
Directory PGM (where all the SAS macros are stored)
Directory SETTINGS (where all execution parameters are stored in
order to facilitate repeated runs)
Input data (1/2)
Four input data set types can be selected:
Microsoft Excel Worksheet (.xls not .xlsx)
ASCII - Comma Separated Values (.csv)
ASCII - Tab Separated Values (.txt)
SAS data set
Input data (2/2)
Region         year  type  wgt  ind1  ind2  ind3  ind4
Piemonte       2006  2     1    68,6  7,1   27    10,4
Valle d'Aosta  2006  2     1    70,8  5,4   27    24,2
Liguria        2006  2     1    65,9  8,8   34,1  18,3
Lombardia      2006  2     1    70,4  6,9   25,3  16,7
Bolzano/Bo.    2006  2     1    73,6  3,8   33,5  25,9
Trento         2006  2     1    69,4  6     19    17,6
Veneto         2006  2     1    69,3  7,2   22,5  13,5
Friuli-V.G.    2006  2     1    68,3  7,1   14,9  16,6
Emilia-Rom.    2006  2     1    73,1  6,2   26,4  19,8
Toscana        2006  2     1    68,7  8,7   17,1  16,6
Umbria         2006  2     1    66,9  10,6  25,2  19,3
Marche         2006  2     1    68,2  8,8   30,9  8,5
Lazio          2006  2     1    63,5  14,4  16    23,6
Abruzzo        2006  2     1    62    13,1  17,7  18
Molise         2006  2     1    56,6  20,9  13,4  17,8
Campania       2006  2     1    48,3  29,6  17,2  25,7
Puglia         2006  2     1    49,6  25,8  22,9  22,2
Basilicata     2006  2     1    54,7  23,3  19,6  26,2
Calabria       2006  2     1    49,9  30,1  13,3  36,5
Sicilia        2006  2     1    49,1  29,8  16,2  29,3
Sardegna       2006  2     1    56    21,6  10,2  18,4
Italia         2006  2          62,5  14,5  21,1  20,3
Piemonte       2007  2     1    68,8  7,9   17,5  11,5
Valle d'Aosta  2007  2     1    72,1  5,4   21,8  26,9
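The sample above uses decimal commas, which matters when the data are read outside SAS. The following Python sketch parses a tab-separated excerpt of it. The tab-separated layout corresponds to the .txt input type listed earlier; treating the empty field in the Italia row as a missing weight is an assumption, and the function names are hypothetical.

```python
import csv
import io

# Three rows copied from the sample input; fields are tab-separated.
SAMPLE = (
    "Region\tyear\ttype\twgt\tind1\tind2\tind3\tind4\n"
    "Piemonte\t2006\t2\t1\t68,6\t7,1\t27\t10,4\n"
    "Valle d'Aosta\t2006\t2\t1\t70,8\t5,4\t27\t24,2\n"
    "Italia\t2006\t2\t\t62,5\t14,5\t21,1\t20,3\n"  # empty wgt field (assumed)
)


def to_float(text):
    """Convert a decimal-comma string to float; empty fields become None."""
    text = text.strip()
    return float(text.replace(",", ".")) if text else None


def read_units(handle):
    """Read tab-separated records into a list of dictionaries."""
    rows = []
    for record in csv.DictReader(handle, delimiter="\t"):
        rows.append({
            "region": record["Region"],
            "year": int(record["year"]),
            "wgt": to_float(record["wgt"]),
            "indicators": [to_float(record[k]) for k in ("ind1", "ind2", "ind3", "ind4")],
        })
    return rows


rows = read_units(io.StringIO(SAMPLE))
```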
Graphical User Interface (1/6)
So that COMIC can be used even by users with limited software skills,
a simple but effective graphical user interface has been created:
The Base SAS %WINDOW/%DISPLAY macro facility is used
to capture all the variables and parameters necessary
for a correct COMIC run.
Users communicate with COMIC only through the GUI (no SAS code
needs to be written).
Many calculations take place automatically, and all the data captured
through the GUI are saved in ASCII files in the SETTINGS directory.
Graphical User Interface: menu (2/6)
Graphical User Interface: base indicators (3/6)
Graphical User Interface: weights (4/6)
Graphical User Interface: min/max values (5/6)
Graphical User Interface: data time periods (6/6)
Output data
While the software is running, an output sub-directory with the same name
as the input data file is created automatically (or overwritten if it
already exists). This directory is created within the parent
directory DATI\output or any other directory selected by the user. The
output directory tree is as follows:
• Dati: where all the SAS output datasets are saved (ranks, time
series, …);
• Word: where all the reports in rich text format (rtf) are saved;
• Html: where all the reports written in HTML are saved.
To make it easier to navigate among the reports, a hyperlinked “table of
contents” file written in HTML (sommario.html) is automatically created in
the output directory.
Table of contents (1/4)
Table of contents (2/4)
Table of contents (3/4)
Table of contents (4/4)
Plots and tables (1/11)
Plots and tables (2/11)
Plots and tables (3/11)
Plots and tables (4/11)
Plots and tables (5/11)
Plots and tables (6/11)
Plots and tables (7/11)
Plots and tables (8/11)
Plots and tables (9/11)
Plots and tables (10/11)
Plots and tables (11/11)
Further developments
Additional SAS procedures can be implemented to
enhance the exploratory analysis of input data;
World and Europe geographical mapping features will
be added to those already existing;
A further robustness analysis will be added to the
existing influence analysis in order to better
investigate the robustness of the implemented methods.