Interpretting Embeddings with Comparison

123
Interpretting Embeddings with Comparison Michael Gleicher Department of Computer Sciences University of Wisconsin - Madison

Transcript of Interpretting Embeddings with Comparison

Page 1: Interpretting Embeddings with Comparison

Interpretting Embeddingswith Comparison

Michael GleicherDepartment of Computer SciencesUniversity of Wisconsin - Madison

Page 2: Interpretting Embeddings with Comparison

Two talks (or five) in one!How do we deal with embeddings of text data?Alexander, et. al. Serendip: Topic Model-Driven Visual Exploration of Text Corpora. VAST ‘14.

Alexander & Gleicher. Task-Driven Comparison of Topic Models. VAST ‘15

Heimerl & Gleicher. Interactive Analysis of Word Vector Embeddings. EuroVis ’18.

Heimerl, et al. Interactive Visual Comparison of Object Embeddings. (TVCG ‘21)

How do we use comparison as a tool for hard data problems?Gleicher. Considerations for Visualizing Comparison. InfoVis 2017.

Page 3: Interpretting Embeddings with Comparison

How do we deal with embeddings of text data?Alexander, et. al. Serendip: Topic Model-Driven Visual Exploration of Text Corpora. VAST ‘14.

Alexander & Gleicher. Task-Driven Comparison of Topic Models. VAST ‘15

Heimerl & Gleicher. Interactive Analysis of Word Vector Embeddings. EuroVis ’18.

Heimerl, et al. Interactive Visual Comparison of Object Embeddings. TVCG ‘21

Eric AlexanderAssistant Prof.

Carleton College

Florian HeimerlPost-Doc

University of Wisconsin

Page 4: Interpretting Embeddings with Comparison
Page 5: Interpretting Embeddings with Comparison

Michael GleicherUniversity of Wisconsin Visual Computing Group

Human Graphics Interactionauthoring pictures, videos, animations

Human Robot Interactionrobots!

Human Data Interactionvisualization, visual analytics, interactive learning

`

Page 6: Interpretting Embeddings with Comparison

Data Visualization

Visual Analytics

Human Data Interaction

Page 7: Interpretting Embeddings with Comparison

Interpretting Embeddingswith Comparison

Michael GleicherDepartment of Computer SciencesUniversity of Wisconsin - Madison

Page 8: Interpretting Embeddings with Comparison

Two different stories…Embeddings are a great way to analyze text collections!

But they are hard to interpret!

Use comparison as a strategy

Comparison is a great way to think about analysis problems!

But it’s abstract – need examples!

Use embeddings as a case study

Page 9: Interpretting Embeddings with Comparison

Interpretting Embeddingswith Comparison

Michael GleicherDepartment of Computer SciencesUniversity of Wisconsin - Madison

Page 10: Interpretting Embeddings with Comparison

What is an embedding?

General mathematics:Place a smaller structure into a larger structure

Computer science:Place a discrete set of objects into a vector spaceEncode relationships between objects

Paris

4,8,1,3,… Atlanta

5,2,1,7,…New York

4,8,1,3,…

London

5,2,1,7,…

Boston

3,2,5,1,…

Tokyo

9,2,6,4,…Beijing

7,3,2,7,…San Jose

5,2,1,7,…

Jakarta

3,2,5,1,…

Sydney

9,2,6,4,…

Munich

7,3,2,7,…

High Dimensional DataObjects have associated Vectors

Page 11: Interpretting Embeddings with Comparison

Kinds of relationships in embeddings

Distance

A is closer to B than to C

BC

D

AE

Linear Structure

A is to B as C is to D

Semantic Directions

A is more X than C

B

C

D

AF

F

E

BA

CD

Relationships are interesting even if global positions are not

Page 12: Interpretting Embeddings with Comparison

Embeddings

We care about relationships not values

The coordinates (axes) may have no meaning

A B1

Page 13: Interpretting Embeddings with Comparison

Very different shapes?

Page 14: Interpretting Embeddings with Comparison

Similar neighbors(points close in one are close in the other)

Page 15: Interpretting Embeddings with Comparison

Embeddings for analyzing Text Corpora

Embed DocumentsSimilar documents are close

Estimate similar meaning by word usage statistics in a corpus

Embed WordsSimilar words are close

Estimate similar meaning by word usage statistics in a corpus

Page 16: Interpretting Embeddings with Comparison

Document Similarity

Documents that use similar words (probably) have similar content

Idea from the 1960s

https://www.nytimes.com/2019/01/02/obituaries/karen-sparck-jones-overlooked.html

Page 17: Interpretting Embeddings with Comparison

Similarity based on word counts?

Need to deal with similar words (synonyms)Need to reduce the number of words (too high dimensional)

Find groups of wordsPre-defined dictionaries (word types)

word usage analysisStatistical groupings (words that tend to go together)

Topic Modeling

Page 18: Interpretting Embeddings with Comparison

Topic modeling

cursekill’d

revengedeed

lovewoofair

oaths

leadfarewell

fastmeet

Page 19: Interpretting Embeddings with Comparison

Word vector embeddings

Place words in a high-dimensional vector space

Words similar in meaning should be close in space

Infer similarity by distributional semantics:similar context implies similar meaning

Construct embeddings by processing a corpus of text

my pet cat is brownmy pet dog is brownmy big car is brown

Page 20: Interpretting Embeddings with Comparison

Why use Word Vector Embeddings?Learn about Language or Corpora (Texts)Find similar words/synonymsTrack changes of word usageExploring polysemy Creating lexical resources Evidence of bias…

Natural Language ApplicationsPre-ProcessingTranslationSentiment AnalysisInterpretation…

Page 21: Interpretting Embeddings with Comparison

Several ways to build word embeddingsWord2VecSkip-gram modelNeural embedding

GLoVECo-occurrence modelFactor matrix by optimization

Several ways to build topic models(document embeddings)LDA (Latent Dirichlet Allocation)

Standard algorithmIterative and probabilistic

NMF (Non-Negative Matrix Factors)

Fast AlgorithmLess widely adopted

Page 22: Interpretting Embeddings with Comparison

Why Interpret Embeddings?

Gain insight into a modelDid you build a good model?

Gain insight into the modeling processWhich methods are better?

Gain insight about the underlying dataWhat does the model tell us?

Page 23: Interpretting Embeddings with Comparison

Challenges of Word Vector EmbeddingsChallenges of Document EmbeddingsLarge numbers of Words (or documents)High-dimensional spaces Complex relationships – meaningless positions (or questionable)

Complex processes for building embeddingsComplex downstream applications

No ground truth - subjective aspects

Page 24: Interpretting Embeddings with Comparison

Interactive Analysis of Word Vector Embeddings

Florian Heimerl and Michael GleicherDepartment of Computer SciencesUniversity of Wisconsin – MadisonEuroVis 2018

Page 25: Interpretting Embeddings with Comparison

Summary:Word vector embeddings offer unique challenges

Task analysis of needs 3 Designs for unmet needs

Buddy Plots

Concept Axis Plots

Co-occurance Matrices

Page 26: Interpretting Embeddings with Comparison

Task Analysis:What do people do with Word Embeddings?Literature survey

111 papers from diverse communities

Consider use cases in Linguistics, HCI, Digitial Humanities, etc.

Augment this list by extrapolationwhat tasks would users want – but aren’t doing yet

Page 27: Interpretting Embeddings with Comparison

Use cases suggest tasksLearn about Language or Corpora (Texts)Find similar words/synonymsTrack changes of word usageExploring polysemy Creating lexical resources

Natural Language ApplicationsPre-ProcessingTranslationSentiment AnalysisInterpretation

EvaluationIntrinsic (good embedding?)Extrinsic (applications success?)

InterpretationIdentify items of interestProbe values of interest

Page 28: Interpretting Embeddings with Comparison

Linguistic Tasks and CharacteristicsWe identified 7 distinct linguistic tasks within the literature

4 characteristics pertinent to those tasks: similarity, arithmetic structures, concept axis, and co-occurrences

28

Page 29: Interpretting Embeddings with Comparison

Prior Work: Word Vector Embedding Visualizations

Liu et al., InfoVis 2017 Analogy Relationships

Smilkov et al., 2016 Distance Relationships

Rong and Adar, 2016 Training Process

Page 30: Interpretting Embeddings with Comparison

Tasks and Characteristics vs. Visualizations

Rank word pairs similarityView neighbors similaritySelect synonyms similarityCompare concepts average, similarityFind analogies offset, similarityProject on Concept concept axisPredict Contexts co-oc. probability

Smilkov, et al 2016

Liu, et al 2017

Page 31: Interpretting Embeddings with Comparison

Tasks and Characteristics vs. Visualizations

Rank word pairs similarityView neighbors similaritySelect synonyms similarityCompare concepts average, similarityFind analogies offset, similarityProject on Concept concept axisPredict Contexts co-oc. probability

Smilkov, et al 2016

Liu, et al 2017

Tasks we seek designs for in this paper

Page 32: Interpretting Embeddings with Comparison

Buddy Plots

Concept Axis Plots

Co-occurance Matrices

1. Similarities (local distances)

2. Co-Occurrances

3. Concept Axes

Page 33: Interpretting Embeddings with Comparison

Buddy Plots

Concept Axis Plots

Co-occurance Matrices

1. Similarities (local distances)

2. Co-Occurrances

3. Concept Axes

Page 34: Interpretting Embeddings with Comparison

Concept Axes:Understanding Semantic DirectionsOpposing concepts make an axis

Food

Pet

Page 35: Interpretting Embeddings with Comparison

Concept Axes:Understanding Concept AxesDefine an axis from one concept to another

Food

Pet

Dog

Cat

Goldfish

Chicken

Cow

Trout

Page 36: Interpretting Embeddings with Comparison

Ways to define axes

Vector between two conceptsInteraxis - Kim et al., 2015

Classifier between two groupsExplainers – Gleicher, 2013

Food

Pet

Dog

Cat

Goldfish

Chicken

Cow

Trout

Page 37: Interpretting Embeddings with Comparison

Multiple Concept Axes

Use multi-variate plots

2D = Scatterplot

Pet Food

Smal

l

Big

Page 38: Interpretting Embeddings with Comparison

man woman

academic

worker

Page 39: Interpretting Embeddings with Comparison

Buddy Plots

Concept Axis Plots

Co-occurance Matrices

1. Similarities (local distances)

2. Co-Occurrances

3. Concept Axes

Page 40: Interpretting Embeddings with Comparison

Similarities:Understanding local distancesDistances are meaningful

even if absolute values are notWhat is close to a word?Are there groups of words that are similar?

Ordered lists are usefulDensity (how many can you show)Sense of relative distancesComparison between words

Embedding ProjectorSmilkov, et al. 2016

Page 41: Interpretting Embeddings with Comparison

Buddy Plots (1D lists)

Alexander and Gleicher, 2016 – for Topic Models

Map distance (to selected reference) to horizontal axis

Referenceobject

Closestto ref

Page 42: Interpretting Embeddings with Comparison

Not dimensionality reduction?

Map distance to word to horizontal axisFocus on a single point – other relations not preserved

?

Page 43: Interpretting Embeddings with Comparison

Embedding ProjectorSmilkov, et al. 2016

Page 44: Interpretting Embeddings with Comparison

Stacked/Chained buddy plots

Use color to encode distance

Page 45: Interpretting Embeddings with Comparison

Stacked/Chained buddy plots

Use color to encode distance in the reference row

A

AB

B

Page 46: Interpretting Embeddings with Comparison

Stacked/Chained buddy plots

Use color to encode distance in the reference

Page 47: Interpretting Embeddings with Comparison
Page 48: Interpretting Embeddings with Comparison

Same word… different embedding

Broadcast

Page 49: Interpretting Embeddings with Comparison

Comparison?

For interpretationDo the differences show something?

Word meaning change(between corpora)

Correlations / Biases(within corpora)

For modeling (selection)How are these models different?

Which is better?

Page 50: Interpretting Embeddings with Comparison

Comparison?Comparison is an important task in Data AnalysisComparison is special! (since it involves multiple things)

It’s an important special case

It deserves special attention

Almost all Data Analysis can be viewed as comparisonComparison is a lens (to look at problems)

It’s a generally useful tool

It deserves special attention

Page 51: Interpretting Embeddings with Comparison

How do I think about comparison?to help me develop tools to help people do it

Page 52: Interpretting Embeddings with Comparison

My CS838 (Data Visualization) classMarch 11, 2010

Page 53: Interpretting Embeddings with Comparison

Maybe I shouldn’t publish this… It’s my secret weapon to devise new tools…

Page 54: Interpretting Embeddings with Comparison

Considerations for Visualizing Comparison

Michael GleicherDepartment of Computer SciencesUniversity of Wisconsin – MadisonInfoVis 2017

Page 55: Interpretting Embeddings with Comparison

What is this paper?

Considerations for Visualizing Comparison

4 questions to ask when designing a visualization or tool ?

Page 56: Interpretting Embeddings with Comparison

To examine (two or more objects, ideas, people, etc.) in order to note similaritiesand differences

To mark or point out the similarities and differencesof (two or more things)

Page 57: Interpretting Embeddings with Comparison

What is the comparison? Why is it hard?

How to address the challenges? Which visual design to use?

Page 58: Interpretting Embeddings with Comparison

Comparative ElementsTargetsActions

Comparative ChallengesNumber of TargetsLarge or Complex TargetsComplex Relationships

Scalability StrategiesScan SequentiallySelect SubsetSummarize Somehow

Comparative DesignsJuxtaposeSuperposeExplicit Encoding

What is the comparison? Why is it hard?

How to address the challenges? Which visual design to use?

Page 59: Interpretting Embeddings with Comparison

Wrong Question:Is my problem Comparison?

Just about anything can be viewed as comparison

Not everything benefits from being viewed this way

Page 60: Interpretting Embeddings with Comparison

Serendip, VAST ‘14

Alexander, E., Kohlmann, J., Valenza, R., Witmore, M., & Gleicher, M. (2014). Serendip: Topic model-driven visual exploration of text corpora. In 2014 IEEE Conference on Visual Analytics Science and Technology (VAST)

Page 61: Interpretting Embeddings with Comparison

Serendip: Topic model-driven visual exploration of text corpora Use a topic model to guide exploration of a text corpusFind patterns and connect back to specifics

Documents Model Documents/Passages

Page 62: Interpretting Embeddings with Comparison

Goals

Support inquiry across levels of abstraction

Corpus Document Passage Word

Page 63: Interpretting Embeddings with Comparison

Goals

Support inquiry across levels of abstractionCombat issues of scale in the data

Many documents Long documents

Page 64: Interpretting Embeddings with Comparison

Goals

Support inquiry across levels of abstractionCombat issues of scalePromote serendipitous discovery:

Multiple entry pointsHighlight adjacencyFlexible exploration

A. Thudt, U. Hinrichs, S. Carpendale 2012

Page 65: Interpretting Embeddings with Comparison

Basics of Serendip

Three different (interlinked views)

Corpus Document Passage Word

CorpusViewer TextViewer RankViewer

Page 66: Interpretting Embeddings with Comparison
Page 67: Interpretting Embeddings with Comparison

Is this comparison?No!A tool for exploring a topic model!

We didn’t describe it as comparison

Tool for looking at one topic model

Unclear how users think about it

Yes!Comparison thinking really helped!

We did think about comparison

Tool for using topic models

Our users had comparison tasks

Page 68: Interpretting Embeddings with Comparison

Is this comparison? I don’t care!No!A tool for exploring a topic model!

We didn’t describe it as comparison

Tool for looking at one topic model

Unclear how users think about it

Yes!Comparison thinking really helped!

We did think about comparison

Tool for using topic models

Our users had comparison tasks

A survey of comparison would have missed this.

It’s a great example of comparison ideas

Page 69: Interpretting Embeddings with Comparison

Comparative ElementsTargetsActions

Comparative ChallengesNumber of TargetsLarge or Complex TargetsComplex Relationships

Scalability StrategiesScan SequentiallySelect SubsetSummarize Somehow

Comparative DesignsJuxtaposeSuperposeExplicit Encoding

What is the comparison? Why is it hard?

How to address the challenges? Which visual design to use?

Page 70: Interpretting Embeddings with Comparison

Question 1:What is the comparison?

What are the elements of the comparison?

Page 71: Interpretting Embeddings with Comparison

The Elements of a Comparison . . .

Targets — Set of things being compared

Action — What to do with the relationship among them

To mark or point out the similarities and differences of (two or more things)

To examine(two or more objects, ideas, etc.) in order to note similarities and differences

Page 72: Interpretting Embeddings with Comparison

Question 1A: The Elements: TargetsDo you know what you are comparing?

Explicit Comparisons – the system has the set of targets

Implicit Comparisons – the system may not know all the targetscompare against an implicit baselinecompare against the user’s knowledgecompare with targets only the user knows

Page 73: Interpretting Embeddings with Comparison

Question 1A:The Elements: TargetsWhat is being compared? – Comparison Targets

Does the model match my expectations?What documents are similar?How do groups of documents differ?What words indicate these differences?How are words used differently?Where in texts are these differences?Do the patterns match other things I know?

Page 74: Interpretting Embeddings with Comparison

Question 1B:The Elements: ActionsVerbs on relationships

Try to be more specific than “examine” or “compare”

Truth in Advertising: I didn’t have this worked out in 2010

Page 75: Interpretting Embeddings with Comparison

Question 1B:The Elements: ActionsWhat to do with the relationship? Comparison Actions (Verbs)

Does the model match my expectations? Measure/Quantify relationshipWhat documents are similar? Identify similar thingsHow do groups of documents differ? Measure/Quantify relationshipWhat words indicate these differences? Dissect a differenceHow are words used differently? Identify meaningful differencesWhere in texts are these differences? Contextualize the relationshipsDo the patterns match other things I know? Identify similar things

Page 76: Interpretting Embeddings with Comparison

Comparative ElementsTargetsActions

Comparative ChallengesNumber of TargetsLarge or Complex TargetsComplex Relationships

Scalability StrategiesScan SequentiallySelect SubsetSummarize Somehow

Comparative DesignsJuxtaposeSuperposeExplicit Encoding

What is the comparison? Why is it hard?

How to address the challenges? Which visual design to use?

Page 77: Interpretting Embeddings with Comparison

Question 2:Why is this comparison hard?If it isn’t hard, you probably don’t need to think about it (much)

AbstractlyToo many targets to compare

Large or Complex Targets

Complex Relationships

Serendip

Challenges of Scale!

Lots of documents

Long / complex documents

Complicated models

Page 78: Interpretting Embeddings with Comparison

Many different comparisons

Challenges from the kind (not scale)• Hard target types (implicit)• Hard action types (dissection)• Hard combinations (dissect implicit)

Tasks influence scalability challengesSolutions must respond to both!

Only Scalability Challenges?

Page 79: Interpretting Embeddings with Comparison

Serendip Comparison Example:Where does this happen? (contextualize)Task Challenge:Contextualize – fit user knowledgeStrategy: show in contextDesign: use text as scaffold

Scalability Challenge:Long DocumentsStrategy: summarizeDesign: overview + detail

Page 80: Interpretting Embeddings with Comparison

Line-graph OverviewCombating scale

Showing trendsAffording navigation

Document

Page 81: Interpretting Embeddings with Comparison

Comparative ElementsTargetsActions

Comparative ChallengesNumber of TargetsLarge or Complex TargetsComplex Relationships

Scalability StrategiesScan SequentiallySelect SubsetSummarize Somehow

Comparative DesignsJuxtaposeSuperposeExplicit Encoding

What is the comparison? Why is it hard?

How to address the challenges? Which visual design to use?

Page 82: Interpretting Embeddings with Comparison

Question 3:What is your strategy for those challenges?AbstractlyScan Sequentially

Select Subset

Summarize SomehowSarikaya, Gleicher & Szafir. (2018). Design Factors for Summary Visualization in Visual Analytics. Computer Graphics Forum, 37(3), 145–156. EuroVis 2018.Scalability Strategies!

Page 83: Interpretting Embeddings with Comparison

Serendip Comparison Example:Compare GroupsTask Challenge:Implicit targets – what groups?Strategy: make explicitDesign: user specifies groups

Scalability Challenge:Lots of documentsStrategy: summarizeDesign: how to present statistics?

Page 84: Interpretting Embeddings with Comparison

Comparative ElementsTargetsActions

Comparative ChallengesNumber of TargetsLarge or Complex TargetsComplex Relationships

Scalability StrategiesScan SequentiallySelect SubsetSummarize Somehow

Comparative DesignsJuxtaposeSuperposeExplicit Encoding

What is the comparison? Why is it hard?

How to address the challenges? Which visual design to use?

Page 85: Interpretting Embeddings with Comparison

Question 4:What Visual Design for Comparison?Abstractly

Juxtaposition

Superposition

Explicit Encoding

Gleicher, M., Albers, D., Walker, R., Jusufi, I., Hansen, C. D., & Roberts, J. C. (2011). Visual comparison for information visualization. Information Visualization, 10(4), 289–309.

Page 86: Interpretting Embeddings with Comparison

Comparative ElementsTargetsActions

Comparative ChallengesNumber of TargetsLarge or Complex TargetsComplex Relationships

Scalability StrategiesScan SequentiallySelect SubsetSummarize Somehow

Comparative DesignsJuxtaposeSuperposeExplicit Encoding

What is the comparison? Why is it hard?

How to address the challenges? Which visual design to use?

Page 87: Interpretting Embeddings with Comparison

Should I think about comparison?

Maybe… If it helps

Page 88: Interpretting Embeddings with Comparison

If you’re so good at comparison…We should be able to compare complex things

Page 89: Interpretting Embeddings with Comparison

Interactive Visual Comparison of Object Embeddings

Florian Heimerl, Christoph Kralj,Torsten Möller and Michael GleicherUniversity of Wisconsin – Madison, University of ViennaTVCG 2020 (presented at VIS ‘21)

Page 90: Interpretting Embeddings with Comparison

The Problem:Compare two embeddingsWe are interested in the relationships between objects

local structure

Not their positions in spaceglobal structure

Page 91: Interpretting Embeddings with Comparison

back to the introductory example…

10 dimensional data mapped to 2 dimensions867 objects2 runs of TSNE (different random seeds)Perplexity 30, Learning Rate 200

Local structure (near neighbors) should be preserved(that’s what the algorithm does)

Page 92: Interpretting Embeddings with Comparison

Very different shapes?

Page 93: Interpretting Embeddings with Comparison

Similar neighbors(points close in one are close in the other)

Page 94: Interpretting Embeddings with Comparison

Is the local structure really similar?

measure difference/similarity?

assess and localize it?

interpret and diagnose it?

identify exemplars?

understand context?

Metrics (multiple)

Summary views

Link between views

Connect to detail views

Connect back to global views

Page 95: Interpretting Embeddings with Comparison

Similar neighbors(points close in one are close in the other)

Page 96: Interpretting Embeddings with Comparison

How many objects have the same nearest neighbor?

651

216

Page 97: Interpretting Embeddings with Comparison

Color Encoding

Page 98: Interpretting Embeddings with Comparison

7 Items

327 Items

Page 99: Interpretting Embeddings with Comparison

Where are those 7?

Page 100: Interpretting Embeddings with Comparison

What are those 7?

Page 101: Interpretting Embeddings with Comparison

Spread view:Where do they go? (outside of the top N)

Page 102: Interpretting Embeddings with Comparison

7 items2 items

651 items

Page 103: Interpretting Embeddings with Comparison

Select 2 groups

7 items

2 items

Page 104: Interpretting Embeddings with Comparison

Distance view (highlight those 9)

Page 105: Interpretting Embeddings with Comparison

Clusters (k-Means)

Page 106: Interpretting Embeddings with Comparison
Page 107: Interpretting Embeddings with Comparison

Select by region

Page 108: Interpretting Embeddings with Comparison

A real example:Word Vector Embeddings, 2 CoproraWikipedia (modern) vs EEBO (1470-1700)

GloVE (embedding algorithm)300 dims (key parameter of algorithm)5000 most common words (can’t show all words)

Page 109: Interpretting Embeddings with Comparison
Page 110: Interpretting Embeddings with Comparison
Page 111: Interpretting Embeddings with Comparison

Select groups with lots of overlap

Page 112: Interpretting Embeddings with Comparison

Sort by most frequent words (from 41)numbers, days, proper names, …

Page 113: Interpretting Embeddings with Comparison

Select no overlap in top 50

Page 114: Interpretting Embeddings with Comparison

Cup?

Page 115: Interpretting Embeddings with Comparison

Another example: Build topic models with abstracts?Vispub data (871 papers)Abstracts vs. Full TextNMF, 10 topics

Hypothesis: Abstracts are different than papers

Page 116: Interpretting Embeddings with Comparison
Page 117: Interpretting Embeddings with Comparison

Dimensionality Reduction of Topic Models(abstracts vs. texts)

Red dots are the “topics” – 1 hot vectors of each topic (unit dimensions in each vector)

Page 118: Interpretting Embeddings with Comparison

Pick a cluster – it does correspond

Page 119: Interpretting Embeddings with Comparison

Most of these have lots of overlap

Page 120: Interpretting Embeddings with Comparison

Cluster doesn’t matchleft: abstracts about GPU/renderingright: papers more about domain

Page 121: Interpretting Embeddings with Comparison

Limitations

Binary comparison – less good for parameter tuningneed new designs?

Many complex views that need to be combinedaddress usability – through task-driven pre-arrangements?

Scalability of implementation and visual designsValidation on real problems with real users

Page 122: Interpretting Embeddings with Comparison

SummaryEmbeddings for Text AnalysisIf we can design interpretation tools

Compare document and word embeddings to interpret them

Specialized tools for comparison of embeddings

Task-centric design process

Comparison as Analysis ApproachBecause we have a design process

An approach to thinking about design for comparison

Examples using text analysis with embedding comparison

4 considerations of comparison

Page 123: Interpretting Embeddings with Comparison

Thanks!

To you for listeningTo my students and collaborators

(too many to list)To our sponsors

NSF, NIH, DARPA, Mellon Foundation

Michael Gleicherhttp://pages.cs.wisc.edu/~gleicher