The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland...

27
The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University

Transcript of The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland...

Page 1: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

The Emergentist Approach To Language

As Embodied in Connectionist Networks

James L. McClelland

Stanford University

Page 2: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Some Simple Sentences

• The man liked the book.• The boy loved the sun.• The woman hated the rain.

Page 3: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

The Standard Approach:Units and Rules

• Sentences• Clauses and phrases• Words• Morphemes• Phonemes

• S -> NP VP• VP-> Tense V (NP)• Tense V -> V+{past}• V -> like, love, hate…• N -> man, boy, sun…• man -> /m/ /ae/ /n/• {past} -> ed• ed -> /t/ or /d/ or /^d/‡

‡depends on context

Page 4: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

What happens with exceptions?

Page 5: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Standard Approach to the Past Tense

• We form the past tense by using a (simple) rule.• If an item is an exception, the rule is blocked.

– So we say ‘took’ instead of ‘taked’

• If you’ve never seen an item before, you use the rule• If an item is an exception, but you forget the exceptional

past tense, you apply the rule• Predictions:

– Regular inflection of ‘nonce forms’• This man is blinging. Yesterday he …• This girl is tupping. Yesterday she …

– Over-regularization errors:• Goed, taked, bringed

Page 6: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

The Emergentist Approach

• Language (like perception, etc) arises from the interactions of neurons, each of which operates according to a common set of simple principles of processing, representation and learning.

• Units and rules are useful to approximately describe what emerges from these interactions but have no mechanistic or explanatory role in language processing, language change, or language learning.

Page 7: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

An Emergentist Theory:Natural Selection

• No grand design• Organisms produce offspring with random

differences (mating helps with this)• Forces of nature favor those best suited to

survive• Survivors leave more offspring, so their traits are

passed on• The full range of the animal kingdom including

all the capabilities of the human mind emerge from these very basic principles

Page 8: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

An Emergentist/Connectionist Approach to the Past Tense

• Knowledge is in connections• Experience causes connections to change• Sensitivity to regularities emerges

Page 9: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

The RM Model• Learns from verb [root, past tense] pairs

– [Like, liked]; [love, loved]; [carry, carried]; [take, took]

• Present and past are represented as patterns of activation over units that stand for phonological features.

Page 10: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Examples of ‘wickelfeatures’ in the verb ‘baked’

• Starts with a nasal followed by a vowel

• Has a long vowel preceded by a nasal and followed by a stop

• Ends with a dental stop preceded by a velar stop

• Ends with an unvoiced sound preceded by another unvoiced sound

Page 11: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.
Page 12: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

A Pattern Associator NetworkPattern representing sound of the verb’s past tense

Pattern representing the sound of the verb root

Matrix ofconnections

Summed input

p(a=

1)

Page 13: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Learning rule for the Pattern Associator network

• For each output unit:– Determine activity of the unit based on its input.– If the unit is active when target is not:

• Reduce each weight coming into the unit from each active input unit.

– If the unit is inactive when the target is active:• Increase the weight coming into the unit from each active

input unit.

• Each connection weight adjustment is very small– Learning is gradual and cumulative

Page 14: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Some Learning Experiments

• Learn a single item– Test for generalization

• Learn from a set of regular items– Test for generalization

Page 15: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.
Page 16: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Most frequent past tenses in English:

– Felt– Had– Made– Got– Gave– Took– Came– Went– Looked– Needed

Trained with top ten words only.

Here’s where 400 more words were introduced

Over-regularization errors in the RM network

Page 17: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Over-regularization simulation in the 78 net

• First learn one exception

• Then continue training the exception with all other forms

• What happens to the exception?

Page 18: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Questions?

Page 19: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Some features of the model

• Regulars co-exist with exceptions.

• The model produces the regular past for most unfamiliar test items.

• The model captures the different subtypes among the regulars:– like-liked– love-loved– hate-hated

• The model is sensitive to the no-change pattern among irregulars:– hit-hit– cut-cut– hide-hid

Page 20: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Additional characteristics

• The model exploits gangs of related exceptions.– dig-dug– cling-clung– swing-swung

• The ‘regular pattern’ infuses exceptions as well as regulars:– say-said, do-did– have-had– keep-kept, sleep-slept– Burn-burnt– Teach-taught

Page 21: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Key Features of the Past Tense model

• No lexical entries and no rules

• No problem of rule induction or grammar selection

Page 22: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Strengths and Weaknesses of the Models

Page 23: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Elman’s Simple Recurrent Network

• Task is to predict the next element of a sequence on the output, given the current element on the input units.

• Each element is represented by a pattern of activation.

• Each box represents a set of units.

• Each dotted arrow represents all-to-all connections.

• The solid arrow indicates that the previous pattern on the hidden units is copied back to provide context for the next prediction.

• Learning occurs through connection weight adjustment using an extended version of the error correcting learning rule.

Page 24: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Results for Elman net trained with letter sequences

Page 25: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.
Page 26: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Hidden Unit Patterns for Elman Net Trained on Word Sequences

Page 27: The Emergentist Approach To Language As Embodied in Connectionist Networks James L. McClelland Stanford University.

Key Features of the Both Models

• No lexical entries and no rules

• No problem of rule induction or grammar selection