Post on 26-Dec-2015
Some Grand Traditions
• Scholarly and professional communications
• Elucidating underlying relationships
• Visual metaphors to enhance understanding
Data Visualization
• Mining new ground
• A return to old themes
Scholarly and Professional Communications
• Players (people & institutions)• Topics • Works
– Journals– Authors (oeuvres)– Documents
• Bibliographic elements• Content-bearing or concept-symbols
Choice of Elements to Study
• Authors, Title words
• Journal / source
• Institutions / affiliations
• Subject augmenting assignments
• Other content-bearing text words
• Citations, Acknowledgements
• Stopwords
NeglectedIslands of Analysis:
Visualizing Stopwords in Visualizing Stopwords in ASIST Conference Papers
SIG-CON
2004 ASIST Annual Conference
Providence, RI - November 16, 2004Ted Morris, Kent State University
School of Library and Information Science
Domain of Study
• Titles of ASIST 2004 Annual Conference presentations– Drawn from Preliminary Program
• Locations of authors’ institutions– Explicitly stated or inferred– Googled
Results
• 1329 “words”
• 378 “stopword” occurrences
• 39 “stopwords”– Variously distributed as
• Conjunctions• Prepositions• Articles• “Multipurpose” (e.g., adverb + preposition)
and 65
of 64
the 61
a 31
in 31
for 25
an 17
to 14
on 11
across 4
from 4
new 4
as 5
at 5
with 5
more 3
what 3
between 2
how 2
Texas 2
their 2
by, forever, I, inside, now, out, past, still, than, that, they, through, together, we, when, why, within, within, you 1
ASIST 2004 Presentation Stopword Distribution
0
10
20
30
40
50
60
70
an
d of
the a in for
an to on as at
with
acr
oss
fro
m
ne
w
mo
re
wh
at
be
twe
en
ho
w
Te
xas
the
ir
by
fore
ver I
insi
de
no
w
ou
t
pa
st
still
tha
n
tha
t
the
y
thro
ug
h
tog
eth
er
we
wh
en
wh
y
with
in
you
r
Results Discussion
And 65Of 64With 5
Why 1How 2
For 25Why 1
Results Discussion 2
In 31Inside 1Within 1
Out 1Outside 0
The 61That 1
68
A 31An 17
48
Results Discussions 3
Across 4Between 2Out 1Through 1
8
New 4Forever 1Now 1Past 1Still 1When 1
9
Results Discussions 4
More 3Than 1
Their 2We 1They 1Who 0Our 0
ASIST 2004 Presentation Institutional Location Distribution
0
5
10
15
20
25
Ca
na
da
De
nm
ark
Sw
ed
en
Au
str
alia
Tu
rke
y
Fin
lan
d
En
gla
nd
Ko
rea
Ne
the
rla
nd
s
Sco
tla
nd
Ta
iwa
n
Be
lgiu
m
Ch
ina
Fra
nce
Isra
el
Arg
en
tin
a
Bra
zil
Ind
ia
Ind
on
esia
Mé
xic
o
No
rwa
y
Ru
ssia
Sw
itze
rla
nd
Canada* 20
Denmark 7
Sweden 7
Australia 6
Turkey 6
Finland 4
England 3
Korea 3
Netherlands 3
Scotland 3
Taiwan 3
Belgium 2
China 2
France 2
Israel 2
Argentina, Brazil,India, Indonesia, México, Norway, Russia, Switzerland 1
* Ontario 13Nova Scotia 4Quebec 3
ASIST 2004 Presentation Institutional Location Distribution
TexasTexas 3535
New York 22
Pennsylvania 18
New Jersey 17
North Carolina 16
Maryland 15
Indiana 14
Florida 13
MassachusettsMassachusetts 1313
Tennessee 13
Washington 13
Michigan 12
Illinois 11
Wisconsin 10
California 8
Hawaii 7
New Mexico 4
Ohio 4
Washington DC 4
Georgia 3
Louisiana 3
Missouri 3
Oklahoma 3
Virginia 3
Arizona 2
Connecticut 2
Colorado, Delaware, Kansas, Kentucky, Oregon, South Carolina 1
0
5
10
15
20
25
30
35
40
Te
xa
s
Ne
wY
ork
Pe
nn
sylv
an
ia
Ne
w J
ers
ey
No
rth
Ca
rolin
a
Ma
ryla
nd
Ind
ian
a
Flo
rid
a
Ma
ssa
ch
use
tts
Te
nn
esse
e
Wa
sh
ing
ton
Mic
hig
an
Illin
ois
Wis
co
nsin
Ca
lifo
rnia
Ha
wa
ii
Ne
w M
exic
o
Oh
io
Wa
sh
ing
ton
Ge
org
ia
Lo
uis
ian
a
Mis
so
uri
Okla
ho
ma
Vir
gin
ia
Ari
zo
na
Co
nn
ecticu
t
Co
lora
do
De
law
are
Ka
nsa
s
Ke
ntu
cky
Ore
go
n
So
uth
Ca
rolin
a
U.S. 2004 Electoral College Vote Distribution
Bush Kerry
3535
22221818
1717
16161515
1414
1313
1313
1313
1313
1212
1111
1010
88
77
44
88
44
3333
33
33
33
22
22
1111
11 11
11
11
109 169
ASIST 2004 Presenter Affiliations by Red/Blue State
Further Research
• Co-occurrence analysis of stopwords
• Correlation of title length to stopword frequency
• Correlation of punctuation frequency to stopword frequency
• Co-occurrence of -- punctuation??!!Co-occurrence of -- punctuation??!!