AutoCardSorter - Designing the Information Architecture of a web site using Latent Semantic Analysis

Christos Katsanos | ckatsanos@ece.upatras.gr

Nikolaos Tselios | nitse@ece.upatras.gr

Nikolaos Avouris | avouris@ece.upatras.gr

AutoCardSorter: Designing the

Information Architecture of a Web Site

Using Latent Semantic Analysis

ACM SIGCHI | Florence, Italy | 5-10 April, 2008

Purpose & Motivation

Automate Structural Design of Information Spaces

Increase efficiency and flexibility for practitioners

Why it is important?

Structural design greatly affects user

experience

Current approaches (e.g. Card Sorting)

often neglected:

Time constraints

Cost to recruit users and run the studies

Increased complexity for data analysis

Challenging for large sites (>100 pages)

Our tool-based Methodology

Page Text

Descriptions

Semantic Similarity

Measure (e.g. LSA)Hierarchical Clustering

Algorithms

Interactive Tree

Structure

Additional Support

1. Number of Groups

2. Cross-Hierarchy Links

Semantic

Similarity Matrix

The tool Interface (1/2)

The tool Interface (2/2)

Validation Study Design

Card SortingAutoCardSorter

Investigate quality of results & efficiency

Health & Nutrition Site

Same content item descriptions

18 representative users

Measures & Analysis

P1 P2 P3 P4 P5

P2 0.94 -

P3 0.11 0.33 -

P4 0.33 0.28 0.11 -

P5 0.50 0.83 0.06 0.06 -

P1 P2 P3 P4 P5

P2 0.62 -

P3 0.21 0.14 -

P4 0.49 0.51 0.83 -

P5 0.61 0.11 0.21 0.92 -

Validity

Similarity-Matrices Correlation

AutoCardSorter Card Sorting

LSA (P5,P1)Frequency Users

placed in Same Pile

P1 and P5

Validity

% Agreement of Design

1) Hierarchical Cluster Analysis of Card Sorting Data

2) AutoCardSorter vs User-Data Dendrogram

a) Eigenvalue Analysis to ‘cut’ objectively

b) User structure => Ideal

c) In Agreement => Longer sequence of pages

grouped together in the same category as Ideal

Efficiency

Total Time Required

AutoCardSorter

Card Sorting

Study Results

Study Results – Validity (1/2)

AutoCardSorter produced results of

comparative quality with Card Sorting:

Similarity-Matrices Correlation = 0.80 (p<0.01)

% Agreement of Design = 100%

Study Results – Validity (2/2)

15AutoCardSorter Card Sorting

Study Results - Efficiency

Discussion - Advantages

Increased efficiency (x27)

Reduces resources required

Explore alternative solutions early

Simple to learn and apply

Easy to apply for large sites (>100)

Possibility for

wider adoption

Discussion – Current Limitations

Lack of qualitative feedback

No insight to category-labels

Future Research

More validation studies in different domains

Additional constraints (e.g. group size)

Improvements to algorithm

Dynamic semantic similarity algos (e.g. LSA IR)

Alternatives to Hierarchical Clustering (e.g.

Factor Analysis)

A Demo - Sit back and enjoy

Summary & Questions

Proposed an approach that automates structural

design of an information space.

Validation study depicted substantial effectiveness

gain, with similar results to a user-based technique

Cheap + Fast + Easy = Possibility for wider adoption

Complementary to user-based methods

Christos Katsanos | ckatsanos@ece.upatras.gr

Extra Slides

More Validation Studies

Summary of Results

Health &

Nutrition

Educational

Portal

Travel &

Tourism Site

Similarity-Matrices

r (p<0.01)0.80 0.52 0.59

% Agreement of

Design100% 93% 87%

Efficiency

(X Times Faster)27 11 14

Efficiency

Number of Proposed Categories

Avg. Items/Proposed Category

Correlation against No of items

Statistical Semantic Similarity

Measures - Overview

LSA: Latent Semantic Analysis (Landauer &

Dumais, 1997)

LSA-IR (Falconer et al, 2006)

PLSA (Hofmann, 1999)

PMI: Point-wise Mutual Information (Manning &

Schutze, 1999)

PMI-IR (Turney, 2001)

GLSA (Matveeva et al, 2005)

HAL: Hyperspace Analogue to Language (Lund &

Burgess, 1996)

COALS (Rhode et al, 2004) 28

Latent Semantic Analysis

AutoCardSorter - Designing the Information Architecture of a web site using Latent Semantic Analysis

Technology

Transcript of AutoCardSorter - Designing the Information Architecture of a web site using Latent Semantic Analysis

An Introduction to Latent Semantic Analysis

Lecture 15: Latent Semantic Indexing

Latent Semantic Indexing

Lecture 14: Latent Semantic Indexing +

Pairwise Latent Semantic Association for Similarity Computation …adni.loni.usc.edu/adni-publications/Pairwise Latent... · 2019-06-04 · Pairwise Latent Semantic Association for

Tracing semantic change with Latent Semantic Analysis · 2 Tracing semantic change with Latent Semantic Analysis 1 Introduction The widespread availability of affordable and powerful

Latent Semantic Analysisberlin.csie.ntnu.edu.tw/Courses/Information Retrieval and...Latent Semantic Analysis: Schematic • Dimension Reduction and Feature Extraction – PCA – SVD

Latent Semantic Indexing SI650: Information Retrieva l

Latent Concepts and the Number Orthogonal Factors in Latent Semantic Analysis

Multi-Relational Latent Semantic Analysis .

Latent Semantic Analysis: Five methodological …/67531/metadc...Latent Semantic Analysis Latent Semantic Analysis (LSA) originated in the late 1980s (Deerwester et al. 1990) as an

Polarity Inducing Latent Semantic Analysis

Latent Semantic Indexing and Beyond

Latent Semantic Indexing For Information Retrieval

POTENTIAL APPLICATIONS OF LATENT SEMANTIC ANAL YSIS TO WRITING …peterfoltz.me/ewExternalFiles/ApplicationsofLSAtoWriting... · 2020-02-08 · POTENTIAL APPLICATIONS OF LATENT SEMANTIC

Address standardization with latent semantic association

Latent Semantic Analysis - sfs.uni-tuebingen.decebert/teaching/11GeometrieBedeutung/lsa.pdf · Latent Semantic Analysis Christian Ebert & Fritz Hamm Lineare Algebra IV: Diagonalisie-rungen

Matrix Factorization and Latent Semantic Indexing 1 Lecture 13: Matrix Factorization and Latent Semantic Indexing Web Search and Mining.

Phishing Detection Using Probabilistic Latent Semantic Analysis

Probabilistic Latent Semantic Analysis (pLSA)