WH-DEPENDENCIES WITHOUT MOVEMENTling.umd.edu/~bradl/publications/Wh sans movements.pdf · mation on...
Transcript of WH-DEPENDENCIES WITHOUT MOVEMENTling.umd.edu/~bradl/publications/Wh sans movements.pdf · mation on...
WH-DEPENDENCIES WITHOUT MOVEMENT
BRADLEY LARSON
[email protected] UNIVERSITY OF MARYLAND
1. Introduction
In this paper I argue that some wh-question constructions in which wh-phrase find
themselves in left-peripheral positions are not in fact derived via movement trans-
formations. As is well known, wh-question dependencies (and a-bar dependencies in
general) display certain constraints and abilities. If we take these constraints and
abilities to be indicative of wh-movement dependencies, then wh-questions that fail
to display these ought not be considered dependencies of the same type.
In particular, I argue that these non-movement wh-dependencies are found in
coordinated wh-questions (CWH) questions like in (1).
(1) What and when did Iris eat?
In (1), the first word (‘what’) does not seem to show the signature of having moved
from the complement position of the verb. I argue for a what-you-see-is-what-you-get
analysis of this initial wh-word: it is only syntactically present in its overt position.
Instead of by means of a syntactic representation of the wh-word in the pre-
movement position, I argue that the interpretation of the relevant wh-words is guided
through semantic reconstruction.
1
This will be made clearer and more explicit later. First some background infor-
mation on how such non-movement wh-questions are to be derived.
2. History
In this section I rehash a few of the differences between current and former mod-
els of generative syntax, focussing on interpretation and structure building. These
differences will provide the necessary background to deriving wh-questions without
relying on movement.
2.1. Interpretation. In the extended standard theory of generative grammar (EST)
(Chomsky 1965) wh-movement transformations were of a quite different type than
what is the state of the art today. First, there was a representational level of ‘deep
structute’ off of which thematic interpretation was read (2).
(2) [Ivy [fixed what]]
In the above representation, ‘what’ is interpreted as the internal argument of ‘fixed’
(the fixee) and ‘Ivy’ the external (the fixer). This representation would feed a trans-
formation into that would move the wh-word from its deeply embedded position to
a less embedded position (3).
(3) [what [Ivy fixed]
At this level of representation (‘surface structure’) such things as topic-comment
interpretations were determined. Note that since the thematic interpretation was by
this point already set in stone, there was no need to represent the wh-word in its
base position.
2
This model of wh-movement is of course no longer in fashion. Nowadays it is
common practice to work under a model in which all semantic information is read
off the final derived representation, not just topic-comment information but also
thematic relations (Chomsky 1993,1995).1 In order for this to possible, it is necessary
that wh-words be somehow represented in their base positions. That is, roughly the
same representation as found in (2) feeds a wh-movement operation that results in
(4).2
(4) [what [Ivy [fixed what]]]
From this representation, the thematic relationship between verb and internal argu-
ment is preserved throughout the derivation and any information relevant to focus or
presupposition is also extant. Further, the lower position will not be pronounced and
as such the overt word order will be something like (5) (See Nunes 2001 for details).
(5) ‘what’ > ‘Ivy’ > ‘fixed’.
The descriptive results of the two approaches are identical in this constrained case.
The takeaway point is that in the current theory the correct interpretation of the
moved wh-word seems to require some sort of maintenance of its pre-movement
representation. The EST did not require this.
What I will show is that the current theory can accommodate the correct the-
matic interpretation of a wh-word without it having a syntactic representation in
the position where it would acquire that thematic interpretation.1 Of course not everyone believes only this. Some (Uriagereka 1999 and others) take semanticinformation to be read off the syntactic derivation multiple times throughout the derivation, notjust at the end.2I will assume the copy theory of movement (Chomsky 1995) here (though others would surelysuffice).
3
2.2. Structure Building. Prior to the early nineties, structure building operations
necessarily preceded movement operations. Either phrase structure rules (Chomsky
1957) or later X-bar formulae (Chomsky 1970, Jackendoff 1977) would create a deep
structure like in (6)
(6)
Ivyfixed what
Transformations would then apply to the fully formed phrase structure, resulting
in a representation like that in (7) which was in turn transferred to the levels PF
and LF.
(7)
what ...
Ivyfixed what
The differences currently is that there are no longer the representational levels
of deep and surface structure. As such, structure building and movement trans-
formations can intermingle derivationally. In order to capture the parity between
structure building and movement transformations, the concept of merge was intro-
duced (Chomsky 1995). Merge, represented in (8), takes two syntactic elements and
combines them such that they create a new element that is also subject to merge.
(8) Merge(x,y) ⇒ {x,y}x
4
Merge can take two entirely unrelated objects and combine them as shown above,
external merge. It can also merge an element that is embedded within another to that
self same element. That is, given a structure like that in (9), it is possible to merge
the embedded ‘y’ to the ‘q’-phrase that encompasses it, thus creating something like
(10).
(9) {q,{z,{x,yi}x}x}q
(10) {yi,{q,{z,{x,yi}x}x}q}q
This gives us exactly the effect of movement, yet still in terms of generalized structure
building.
The important point here is that the introduction of the a wh-word into its surface
position is now no longer a different type of operation from introducing it as in the
internal argument of a verb. So long as the more deeply embedded structure of the
derivation is such that it licenses the merger of the wh-word, the derivation ought to
work.
3. Deriving Syntactic Gaps
Imagine a language in which wh-question formation works like this: A wh-word
moves to the front of a clause and leaves behind an X (11). All other aspects of this
language are identical to English.
(11) a. Ivy ate something
b. What did Ivy eat X?
5
In this reality, deriving (11b) could involve a movement transformation that leaves
behind the X, much like English wh-movement transformations leave behind gaps.
It would be prudent, however, to determine whether there were any other way to
derive the question construction. With the X there is novel entity that needs to be
accounted for. Perhaps the derivation does not involve movement. For example, by
merging the X in low as the complement to the verb and merging the wh-word higher
up in the structure, one could derive the same word order and interpretation. This is
of course logically possible and it is either a licit derivation or not. So much should
be obvious.
But back to reality and wh-movement does not leave behind an X but rather
negative space, a gap. This being this case, and given the history of the field, it is
perhaps more understandable why the non-movement possibility was not explored
as a non-mutually exclusive alternative for wh-movement. But given the state of the
field now with respect to structure building, there is no excuse not to explore such
an analysis.
In this paper, such an exploration is carried out with respect to wh-question forma-
tion. In the subsection below, I offer control constructions as an analogous instance
of gap derivation in which both tacks have been taken.
3.1. Control. With respect to control constructions (12), two means of gap creation
exist in the literature.One way is to code into the grammar a means to avoid the
overt expression of the base position of a moved element (Bowers 1973, OÕNeil 1995,
and Hornstein 1999 among others) (13).
(12) Ivan wanted [gap] to leave.
6
(13) Ivani wanted Ivani to leave.
Another way is to posit the introduction of null elements into the derivation. This
is followed by the introduction of the relevant antecedent in the form of the matrix
subject in this case (See Chomsky 1981, Landau 2001 and others) (14).
(14) Ivani wanted PROi to leave
Ignoring the arguments for adopting either of the above proposals, the distinction
is important with an eye towards wh-movement. Again in wh-movement, there is a
gap that is interpreted as related to an antecedent (15).
(15) What did Ivan fix [gap]
Most theorizing of such constructions has taken the movement-thoery tack. That is,
the wh-word moves from the lower position into the higher one (16).
(16) Whati did Ivan fix whati?
But an equally logically possible approach is that of the PRO theory: The antecedent
to is base generated in a different position from the gap. This is essentially the
approach taken here (sans the postulation of a PRO analogue). The derivation of
wh-questions may be possible via direct base generation of the wh-phrase into the
spec,CP position (17).3
3There is another logical possibility for deriving the gap in (): there is simply nothing there syn-tactically. There is no subject represented in the embedded clauses (17).
(17) Ivan wanted to leave
There are to be sure many reasons to doubt this as a means of deriving control sentences, butit remains a logical, if deeply flawed, possibility. When explored with respect to wh-movementhowever, the derivation of a gap without movement does not seem so far fetched.
7
(18) Whati did Ivan fix?
In the next section I show how this might be possible without falling afoul of syntactic
restrictions.
3.2. Syntactic Leeway. As noted earlier, there is no longer any difference in current
theory between merging a wh-word into a spec,CP position as its first entry into the
derivation and merging a wh-word into that position after having merge a copy of
it elsewhere. They are both instances of merge, well-formedness is a concern for
another part of the grammar.
This being the case, it ought to be possible to merge a wh-word for the first time
into the spec,CP position of the following structure:
(19) CP
what ⇒merge C
C TP
Ivy T
T V
fix
Given a few other operations, the structure above could easily produce the sentence
in (20) below:
(20) What did Ivy fix?
The structure may be ruled out at the interfaces or earlier, but the merge itself is
surely licit. It only differs from the traditional conception of wh-movement in that8
there is no unpronounced copy of the wh-word elsewhere in the structure. Further,
the presumed Markovian nature of the derivation should make this information in-
accessible when merging in the wh-phrase.
However, there is something that prior to the introduction of the wh-word that
potentially marks the sentence in (20) as unacceptable independent of the wh-word.
The verb ‘fix’ in English obligatorily requires an internal argument. That is, (21) is
unacceptable:
(21) *Ivy fixed.
Some verbs in English work differently and do not require an internal argument, like
‘eat’:
(22) Ivy ate.
This being the case, it should be possible to derive a sentence whereby the wh-word
enters late so long as the verb optionally takes an internal argument. In other words,
a sentence like (23) ought to be derivable in the way that (24) suggests. In this
sentence, the subcategorization frame of the verb is respected.
(23) What did Ivy eat?
9
(24) CP
what ⇒merge C
C TP
Ivy T
T V
eat
Another potential roadblock arises that might preclude the above representation
from being syntactically well-formed. Under most theories of phrase structure, the
wh-word ‘what’ would be taken to be a noun phrase of sorts and thus require abstract
case. Case assigners in contemporary theory are finite T-heads as well as most verbs.
The wh-word in question is not in a relevant position in the structure above to receive
case from either of these sources. In the following subsection I will argue that these
concerns turn out not to be relevant. ex-situ wh-words do not require case.
3.3. What case? As noted above, even if a wh-word can be first-merged into the
spec,CP position in such a way that might be in many ways licit syntactically, it
would still be missing abstract case. As such, the case-less wh-word ought to mark
the derivation as being ill-formed.
In this subsection I argue that this qualm is misguided and that wh-words that do
not reside in-situ do not require abstract case.4
4Note that not all wh-words ought to be considered noun phrases. Wh-words like ‘why’, ‘where’,and ‘when’ are hard to justify as being noun phrases. So if it turned out to be the case that somewh-words did in fact require abstract case, non-movement wh-questions should still be derivable aslong as the wh-word in question is one of these
10
It seems fairly clear that in-situ wh-words (by which I mean wh-words that are not
found in spec,CP) do indeed require case. They pattern exactly like regular nouns
in their distribution with respect to case:
(25) a. Which teacher said {who/Iris} seemed to be the best?
b. *Which teacher said it seemed {who/Iris} to be the best?
It is also almost unavoidable that licit ex-situ wh-words (wh-words in spec,CP) stem
from a position in which abstract case was acquirable. The wh-word in (26) could
have gotten case from being in the spec,TP position, the wh-word in (26) could have
gotten case from being the complement to the VP.
(26) a. Whoi [TP whoi likes Autumn]?
b. Whati does Ivy [V P like whati]?
It is not possible to determine whether these wh-words required case. It could have
been the case that they were in case positions and that is what in part makes the
derivations legitimate. It is equally possible given the facts that the wh-words simply
never needed case to begin with and being in a case position at some stage of the
derivation was irrelevant.
11
So it seems that to be rather difficult to determine whether ex-situ wh-words
require abstract case. If they remain in-situ, they seem to require it, but if they
move it is hard to say.5
To test whether the spec,CP wh-words require case would call for an instance
where case is not assigned to the position from which wh-movement stems from.
Case is not assigned by tenseless T-heads to their specifiers, as evinced in (27).
(27) It is certain Ivan to win
Perhaps if wh-movement stemmed from the spec,TP position in the above sentence,
the sentence would be acceptable due to wh-words not requiring case. This turns out
not to be the case, as seen in (28). This does not necessarily show that the moving
wh-word requires case as I will argue below.
(28) *Who is it certain to be at the party?
First, Lasnik and Frieden 1981 argue that it is crucially wh-traces that require case,
not the wh-word itself. They note that in the case theory developed in Chomsky
1980 (following inspiration from Vergnaud) lexically realized NPs require case. They
take the overt wh-operator in (28) to work identically to an overt relative operator
like in (29). The sentences are both unacceptable, presumably due to the fact that
the wh-word does not receive case.5It may seem more elegant to say that if a wh-word in one position requires case then wh-words inany position require case. But it could be the case the in-situ wh-words in multiple wh-questionsin English deal with abstract case in a manner different than that of spec,CP wh-words, Along thelines of Marantz 1991. In that system accusative case is assigned due to the fact that a higher nounphrase received nominative case. The lower noun phrase is ‘dependent’ on the case of the uppernoun phrase and it is assigned in a different way. it could be that the in-situ wh-words case isdependent on there being a wh-word in spec,CP, one that doesn’t get case itself.
12
(29) *The man who it is certain to be at the party...
However, when the relative operator is made covert, the sentence is still bad (30). It
cannot be the case that there is a constraint on the overt operator, relative or wh-.
Rather, they argue that the trace of the movement is what requires case.
(30) The man it is certain to be at the party...
The approach to wh-questions suggested here eschews all traces and their modern
analogues. Since there are no wh-traces to acquire or fail to acquire case, the fact
that the wh-word does not acquire case is orthogonal to the issue.
The case against the wh-word requiring case can also be made in a different way.
First, the sentence above is quite reminiscent of super raising constructions in Eng-
lish, which are ungrammatical for reasons not solely stemming from case considera-
tions:
(31) *Ivan seems that it is certain to be at the party.
Setting aside this for a moment, there is another construction in which case is not
assigned to the position from which wh-movement stems from. Take (32). It seems
that the embedded subject, Ivan, receives case neither from the tenseless head nor
from the matrix verb.
(32) *Ivy assured Iris Ivan to be at the party.
If there is wh-movement from Ivan’s position we could avoid the resemblance to super
raising. When this is done, the sentence is surprisingly acceptable (Postal, 1974).
(33) Who did Ivy assure Iris to be at the party?
13
Kayne (1980) takes the above sentence to suggest that the wh-word can receive case
from the matrix verb when it moves into the embedded spec,CP. But there is just
as much evidence suggesting that the wh-word does not require case at all. Matrix
verb to subordinate spec,CP is a rather unexpected relationship in which to assign
case.
Recall our super raising-like construction from above in () (repeated here as (34)).
If its ungrammaticality stems solely from a lack of case, it ought to be rescuable by
getting case from a verb like ‘assure’. This turns out not to be so as seen in (35).
(34) *Who is it certain to be at the party?
(35) *Who did Ivy assure Iris that it is certain to be at the party?
This suggests that the ungrammaticality of (35) is not due to lack of case on the overt
wh-word or one of its traces, but rather something else entirely. For some reason it
is not licit to wh-move a subject across an expletive in a position the subject could
have moved to.6
The results of this study allow for a conception of the grammar in which ex-
situ wh-words do not require case.7 This is an admittedly unintuitive result, but
consistent with the data. Assuming case to be irrelevant to these wh-words allows
us to insert a wh-word late into a non-case position without pain of Case-related
ungrammaticality.
6The construction in () is also not an island to all wh-movement. As seen below in (i), extractionof a complement from the most embedded position is licit.
() What did Jake assure Jim that Joey is certain to win?7Another, seemingly indistinguishable, logical possibility is that all moving wh-words get case inthe spec,CP position.
14
3.4. Summary. What we have seen in this section is that inserting a wh-word into
a derivation late does not run afoul of anything syntactically. It is possible to derive
the lower gap from which the wh-word only apparently moves from. Further, it seems
that moving wh-words do not require abstract case in the relevant ways to mar the
derivation. That is, inserting the wh-word initially into spec,CP will not cause the
derivation to be ruled out due to lack of case on the wh-word. Case is simply not
required in that instance.
If this conception of wh-‘movement’ is not ruled out syntactically, it ought to
be considered whether such representations are ruled out semantically. This is the
subject of the following section.
4. Deriving Gaps Semantically
Assuming it to be possible to derive a wh-question both without movement and
without violating any narrowly syntactic strictures, it behooves us to determine
whether the derived representations are legitimate semantically.
4.1. Presupposition and event variable restriction. It is certainly the case that
certain apparent gaps are licit semantically. Take for instance optionally transitive
verbs like ‘eat’. This verb can take an internal argument (36) and it can also be
followed by a gap (37).
(36) Ivy ate something
(37) Ivy ate
15
The example in (37) suggests that the lack of internal argument is licit semantically.
This contrasts with obligatorily transitive verbs like ‘fix’. Seen below, such verbs
require internal arguments and are unacceptable without them:
(38) Ivy fixed something
(39) *Ivy fixed
It seems to be the case that verbs like ‘eat’ allow for the presupposition of an internal
argument. The sentence ‘Ivy ate’ presupposes that Ivy ate something. In other
words, sentence () entails sentence ().8 This is something that words like ‘fix’ do not
seem to do.
How then to account for the varying acceptability ratings between to the two
sentences lacking internal arguments? One potential solution is to adopt a neo-
Davidsonian approach to semantic composition and attribute the difference to the
lexical semantics or pragmatics of the given words. That is, both sentence () and ()
are grammatical with respect the semantics yet () is licit because of some proprietary
aspect of the verb.
Under a neo-Davidsonian approach, the sentences in () and () would have the
rough logical forms in (40) and (41) respectively:
(40) [[Ivy ate]] = ∃e{eat(e) & Agent(Ivy, e)}
‘there exists an event of eating whose agent was Ivy’
8There may be some slight difference between the two sentences. Some speakers take ‘Ivy ate’ toonly suggest that she ate a meal or its equivalent instead of any given thing.
16
(41) [[Ivy fixed]] = ∃e{fixed(e) & Agent(Ivy, e)}
‘there exists an event of fixing whose agent was Ivy’
This approach holds that both of the above logical forms are perfectly licit. It is
simply some extra-semantic notion that makes it such that a fixing event without an
explicit patient is odd, whereas an eating event without an explicit theme is fine.
This seems like the wrong idea however. It is not the case that the sentence
represented in (40) is not interpreted as having an internal argument. It is rather
the case that the internal argument is implicit or presupposed. We then would
like to somehow code this distinction into the semantics. Many researchers have
previously done exactly that: code presuppositional material into the restrictor of
quantifiers (see Berman 1991, Krifka 1993, and Herburger 2000 as proponents of such
an approach). Following these authors, I posit that instead of the logical form in
(40), a sentence like ‘Ivy ate’ has a logical form like that in (42).
(42) [[Ivy ate]] = ∃x[∃e: Theme(x,e)]{eat(e) & Agent(Ivy, e)}
‘there exists an x and there exists an event e whose theme is x such that the
event was an eating and its agent was Ivy.
Following Heim (1982) and Kamp (1984), I take indefinites to be variables that are
unselectively bound by existential operators. This existential operator binds into
the restrictor of the event variable in this case. The event variable is then restricted
to only those events that have something as their theme, which is what we want
with respect to verbs like ‘eat’. This is not the case with ‘fix’. There is no lexically
determined variable in the restrictor of the event variable in that case. This is to say
that the gap after ‘fixed’ is not licensed semantically, but the gap after ‘eat’ is.
17
4.2. Wh-questions and event variable restriction. We saw in the previous sub-
section that the optional gap after verbs like ‘eat’ can be licensed by existentially
bound variable in the restrictor of an event quantifier. In this subsection I will adapt
that idea to a representation of wh-movement and show that wh-questions can be
formed without movement and yet be licit semantically.
The representation involves an existential quantifier that binds a variable. This
variable could presumably be bound by some other sort of operator as well, including
a wh-operator.9 As we saw in the section, wh-words can be introduced late and high
into the structure and as such it should be possible for that this wh-word can function
as the wh-operator that binds into the restrictor.
That is, the sentence in (43) can be represented at logical form as in (44).
(43) What did Ivy eat?
(44) [[What did Ivy eat]] = ?x[∃e: Theme(x,e)]{eat(e) & Agent(Ivy, e)}
‘for what x does there exist an event e whose theme is x such that the event
was an eating and its agent was Ivy.
The same cannot be said about ‘fix’. Seen below, without movement of the wh-word
(thus leaving behind a variable to bind) the wh-operation has nothing to bind. I
take such vacuous binding to be illicit semantically.
9Note that Chomsky 1982 argues for a similar configuration in parasitic gap constructions. Thewh-phrase that moves in the matrix clause can serve as the operator to a variable that the parasiticgap consists of. This is in principle no different that what I propose here: a wh-operator can binda variable that is not derived from movement from that position.
18
(45) [[What did Ivy fix]] = ?x∃e{fix(e) & Agent(Ivy, e)}
‘for what x does there exist an event e such that the event was an eating
and its agent was Ivy.
This contrasts with full wh-movement which would create the licit representation
in (46). The trace of the wh-word would be represented as the relevant variable,
though not necessarily in the restrictor, but in the nuclear scope.
(46) [[What did Ivy fix]] = ?x∃e{fix(e) & Agent(Ivy, e) & Theme(x,e)}
‘for what x does there exist an event e such that the event was an eating
and its agent was Ivy and its theme was x.
Thus is it not the case that sentences of the form ‘What did Ivy fix?’ are necessarily
going to be illicit. In fact, there is no discernible difference between the movement-
derived version of any wh-question and the non-movement version. The mean the
same things and are phonetically realized identically. In the next section, I explore
a potential means of contrasting the two.
4.3. Summary. In the previous two sections we have seen that it is in principle
possible to derive wh-questions without resorting to movement. The only constraint
seems to be that for this to be possible, it must be that the wh-word that is undergo-
ing the movement comes from a position where it would not necessarily be required
to arise overtly in a declarative sentence. That is, wh-questions without movement
are possible when the wh-word could have stemmed from the object position of a
verb like ‘eat’, but not a verb like ‘fix’.
19
5. Empirical differences
In this section, I will address questions as to whether or not this logically and
theoretically possible novel derivation of wh-questions has any merit empirically. It
will be seen that answering this question is not as simple might be expected.
As we have already seen, there is little to no difference between the posited non-
movment derivations of wh-questions and the traditionally type. They have very
similar semantics (so long as the verb is of the right type) and they are string iden-
tical. Any differences that arise are going to stem from syntax internal signatures of
movement.
5.1. A false lead. An obvious signature of wh-movement is that of island sensitivity.
If it is truly the case that object-wh dependencies for verbs like ‘eat’ do not require
movement, it should be the case that we fail to see the effects of island constraints
in those instances. That is, a sentence like (47) ought to be more acceptable than a
sentence like (48).
(47) *What did Ivy meet a man who ate?
(48) *What did Ivy meet a man who fixed?
It should be apparent to the reader that there is in fact no asymmetry in the accept-
ability of the above sentences. They are both bad.10
10It is interesting to note that this approach to islands and null arguments makes the correctpredictions for single-wh questions though not for English. Italian (and other null subject languages)not only allows full null subjects but also seems not to be subject to island constraints when thegap is in a subject position of a finite clause (Rizzi, 1982). The analysis here seems applicable tosuch cases, but in a way that is not clear to me yet.
20
This lack of differential acceptability however does not necessarily rule out the
possibility of movementless wh-questions. From a parsing perspective, it would be
incredibly odd for the sentence in (47) to be judged acceptable.
When the parser comes across the initial wh-word in (47), it posits a gap to follow.
I assume that the default dependency that the parser predicts is one of movement.
It is surely the the more common dependency for wh-words. At this point in the
parse, it is unknown whether the wh-word ‘what’ will form a dependency with a gap
in the subject or object position. Since it is the case that in English non-movement
wh-questions can only form as dependencies with object gaps of specific verbs, it
makes sense for the parse to assume that such a circumscribed set of conditions is
less likely and thus to be ignored. Positing movement is going to be the choice that
leaves more options open for successful parses in English.
So the parser assumes a movement dependency between the wh-word in (47) and
the upcoming gap. When the gap is reached, it turns out that the movement predic-
tion cannot have been correct because movement out of the island was not possible.11
But what to do at this point. It has turned out by the time the gap is parsed out
that movement could not have worked. The intended meaning of the utterance has
surely been discerned and there there is no motivation to carry out a costly re-parse.
As such, the sentences are simply judged unacceptable.
11It is likely not to be the case that the movement prediction will be shown to be false any earlierthan the gap site. Any island for movement that is parsed does not necessarily need to be thesource of the potential movement. That is, the string in () does not necessarily have to end withan island violation.
(i) What did Ivy make the claim that Iris...
It could instead end up being entirely licit:
(ii) What did Ivy make [the claim that Iris ate] for?21
In order to truly test whether non-movment wh-questions are empirically viable,
it would be necessary to test sentences in which the impossibility of a movement
dependency is made clear very early on in the parse. In the next section, I investigate
such sentences.
5.2. What and When? At the outset of the this article I claimed that sentences
of a the sort in (49) involve non-movement wh-question formation.
(49) What and when did Ivy eat?
For such sentence the initial wh-word ‘what’ does not arrive at the left periphery via
movement, but rather base generation. The second wh-word ‘when’ arrives there via
normal wh-movement.
The impetus for this analysis comes from many places. First, movement to the
sentence initial position would require wh-movement in English to be freer that
previously expected. Second, the parsing issues raised earlier suggest that this con-
figuration is one that could allow a non-movement parse. Third, the first wh-word
does not display the empirical signature of wh-movement.
5.3. Strange Movement. English is not a multiple wh-movement language. Unlike
some languages, English does not allow sentences like in (50).
(50) What when did Ivy eat?
It would then be quite unexpected that both of the wh-words in the sentences at
hand (49) would involve both of the wh-words having moved into that position. Were
that the case it would be a novelty in English.
22
Further, the type of movement that would be required would be novel as well.
Movement is taken necessarily to target the root of trees (Chomsky 1995).12 As
such the derived position of the wh-word should c-command its trace. Yet in these
instances, the wh-words do not seem to c-command their traces (51).
(51)
&did
Ivytj
eat ti
whati &
& whenj
There are presumably other problems with positing this sort of movement. This
is not supposed to be an exhaustive demonstration. Rather, it should be sufficient
to show that assuming movement for both of the wh-words in the relevant sentences
requires one to posit novel types of movement for English.
5.4. Parsing hints. As suggested above, the parser needs to be tipped off as to
the applicability of a non-movement dependency early on so as not to default to
a movement prediction. In these sentences such a tip-off exists. When parsing,
the initial wh-word is reached and the default movement prediction is activated.
However, once the immediately adjacent word ‘and’ is met it becomes clear that a
movement prediction cannot work. This wh-word is coordinated and thus cannot
c-command anything but the other coordinand (52).
12Other options such as ‘tucking-in’ Richards 1999 notwithstandings.
23
(52) CP
& ...
what &
& ...
It cannot c-command any potential gap. Given this deduction, the parser should not
posit a movement dependency. As we have seen though, the parser has the option
of positing a non-movement dependency. Revising this initial prediction should not
be that costly as only two words have yet been parsed.
A prediction is thus made. At the point when the string ‘what and’ is parsed,
it should only be possible for optionally transitive verbs like ‘eat’ to host the gap.
We determined earlier that only those verbs could allow non-movment wh-questions.
Since those verbs are the only ones that can handle such a dependency, it follows
that only they should be able to licit in a sentence that begins with ‘what and’. This
prediction is borne out (see Gracanin-Yuksek 2007 among others):
(53) What and when did Ivy eat?
(54) *What and when did Ivy fix?
This is a stark contrast compared to which verbs can usually host wh dependency
gaps. That is, any verb that takes a internal argument can host such a gap generally:
(55) What did you eat?
(56) What did you fix?
(57) What did you want?
24
(58) *What did you sleep?
But only optionally transitive verbs can host an internal argument gap in ‘what
and’ questions. This is suggestive of a odd constraint on wh-dependecies if not a
different sort of dependency all together.
5.5. Empirical signature of movement. I take syntactic movement to display
a signature driven by two things: island constraints and syntactic reconstruction
effects. In this subsection, I investigat whether the initial wh-word displays any of
these effects.
5.5.1. Islands. We saw above that the relevant island effects are impossible to discern
in regular, singular wh-movement cases. Unfortunately it turns out that a different
problem arises here.
It should be possible to construct a sentence like (59) and determine whether the
first wh-word is subject to the island constraint that threatens from below. If it has
not actually moved into that position, it ought not be subject to that island and
thus not be judged as bad as (60).
(59) *What and when did Ivy meet the man that ate?
(60) *What and when did Ivy meet the man that fixed?
The sentences are equally bad. It is near impossible to determine whether the island
violation was effected by the first wh-word or the second. Even when there is no
potential island violation, the coordinated wh-words must be interpreted as stemming
from the same clause:
25
(61) What and when did Ivy say that Ivan ate?
The above question can only be interpreted as inquiring about what Ivan and when
Ivan ate. It does not have an interpretation concerning what Ivan ate and when Ivy
said that Ivan ate. Because of this, it will rarely be possible to test a sentence where
the first wh-word is interpreted into an island and the second wh-word not.
The only islands that we are allowed to test are left-branch islands like the one
exemplified in (62).
(62) *Whose did Ivan steal book at the party?
(63) *How many did Iris meet people in Rome?
(64) *How tall did Ivy kiss a soldier on the cheek?
If the wh-phrases in the three sentences above we the first conjunct of a fronted
coordinated wh-question. The island effects ought to be ameliorated because, by
hypothesis, there has been no movement out of the island.13 This turns out to
indeed be the case, the sentences below show ameliorated island effects:
(65) Whose and when did Ivan steal books at the party?
(66) How many and when did Iris meet people in Rome?
(67) ??How tall and when did Ivy kiss a soldier on the cheek?
13The sentences must be altered such that the first coordinated wh-phrase is necessary forgrammaticality. That is, the example in () should not count against the hypothesis because () isnot acceptable either.
() *Whose and when did Ivan shake hand?
() *When did Ivan shake hand?
26
Since the island effects seem to be obviated. There is one fewer reason to assume
that the first wh-word of these sentences arrives in its overt position via movement.
5.5.2. Reconstruction. Another effect of movement is that it is often possible to re-
construct the moved element in to its base position. That is, for a sentence like in
(68). It seems that the wh-phrase can be syntactically realized in the direct object
position so as to license the reciprocal.
(68) Which pictures of each other did the women like <which picture of each
other>?
Reconstruction is commonly take to arise because the wh-phrase in the left periphery
left a copy of itself behind when it moved from the verb-complement position. If the
wh-phase had simply been base-generated high, there would be no copy of it in the
lower position and reconstruction should not be possible.
If the first wh-phrase of the sentences we have been investigating shows reconstruc-
tion effects, that would be evidence of it having moved from a lower base position.
If not, then it is evidence against such an analysis.
Contrasting with the sentence in (68) above, the sentences below are less accept-
able. The lack of abject unacceptability is presumably due to the fact that meaning
is relatively clear. This suggests that the first wh-phrase is unable to reconstruct.14
(69) *?Which books about each other and when did the women read?
(70) *?Which book about himself and when did Ivan read?
14Sentences with wh-phrases of similar complexity are still acceptable:
() Which books about cats and when did the women read?
27
When reconstruction makes for an ungrammatical sentence like in (71), we predict
that the analogous ‘what and’ sentence to be more acceptable. This seems to be the
case in (72).
(71) *Which book about Ivani did hei read?
(72) Which book about Ivani and when did hei read?
5.6. Summary. I have argued for three things in this section. First, given current
theory and basic facts about the number of wh-words that can move per clause in
English, movement does not seem to be possible for both of the wh-words in the
relevant sentences like in (73).
(73) What and when did Ivan sing?
Second, given that non-movement wh-questions have already be argued to be deriv-
able, the particular type of sentence discussed here uniquely guide the parser to posit
a non-movement dependency.
Third, the first wh-word in these sentences fails to show the signature of traditional
movement: island-sensitivity and reconstruction effects.
All three of these taken together (movement is not likely, non-movement is possible,
no movement is detected) I posit a (preliminary) representation for sentence (73)
below in (74).
28
(74)
&did
Ivytj eatwhati &
& whenj
In the following section I will summarize pervious accounts of this type of sentence
and discuss their shortcomings. In the section following that, I will offer a detailed
derivation of the above type of sentence and give a semantic representation for them.
6. Previous Accounts
In this section I outline two strains of previous accounts and argue that they are
both empirically inadequate. There has already been a fair amount of ilk spilled on
this type of sentence and the criticisms here are not exhaustive. In fact, the criticisms
here hinge mostly upon the novel island and reconstruction data presented herein.
The main approaches fall neatly into two groups: Bi-clausal analyses and mono-
clausal analyses. There are of course sub-varieties of such accounts.15 These are
different in interesting ways, but not in ways that are relevant to the criticisms
presented here.
6.1. Mono-clausal Accounts. Mono-clausal analyses posit that the relevant sen-
tence types are derived by movement of two wh-words into the left periphery of a
single clause (Zoerner 1995, Kazenin 2001, Skrabalova 2006, Gribanova 2008, and
Zhang 2007 among others). The result of this movement is either a coordinated
15See Gracanin-Yuksek 2007 and Citko and Gracanin-Yuksek 2010 for a hybrid, bi- and mono-clausal approach.)
29
wh-phrase (75) or an instance of two specifiers of C separated by a spurious ‘and’
(76).16
(75) [CP [XP whati and whenj ] [C did Ivy eat ti tj ]]
(76) [CP whati [CP and [CP whenj [C did Ivy eat ti tj ]]]]
The relevant fact here is that both wh-movements are entirely typical. There is
nothing elided in either case nor any other reason to suspect that any of the usual
movement constraints ought not still apply.
The most obvious criticism is that languages like English do not allow multiple
overt wh-movement. Whatever disallows such dual movement generally would have
to be obviated here somehow. This would require that this obviation be proprietary
to the particular construction at hand: something to be avoided if at all possible.
The second problem with this type of approach is that, as we have seen in the
previous section, the first wh-phrase of coordinated wh-phrase sentence does not
show any island effects as shown in the sentence repeated in (77).
(77) How many and when did Iris meet people?
(cf. *How many did Iris meet people?)
This is unexpected under any mono-clausal account. The movement of the string-
initial wh-phrase is not theorized to be any different in and of itself than movement
generally. Again, there could be some proprietary property to these constructions
16It should be noted that the representation in () is very similar to what is argued for here. Thecrucial difference is that there string-initial wh-phrase has not movement into its surface position.
30
that obviates the island constraints, but nothing of the sort has yet be proposed and
if the lack of constraints can be captured in a generalizable way, so much the better.
There are other problems with a mono-clausal account. See (People) for details.
6.2. Bi-clausal Accounts. Bi-clausal accounts posit that there are in fact two sep-
arate clauses, each involving wh-movement (Browne 1972, Banreti 1992, Whitman
2002 and others). The leftward clause subsequently undergoes IP-deletion. In other
words, such sentence are an example of backward sluicing. Such an account is
sketched in (78) below.
(78) [whati [IP did Ivan eat ti ]] and [whenj [did Ivan eat tj ]]?
This approach happens to make the opposite prediction with respect to the island
observance of the first wh-phrase. IP-ellipsis has long been know to obviate island
effects (Ross, 1969 among others) including the ones relevant for our discussion here.
As seen below, left branch islands are seemingly obviated by traditional sluicing:17
(79) I know that Ivan stole a book, but I can’t remember [CP whosei [IP Ivan
stole ti book]]
(80) I know that Iris met some people, but I can’t remember [CP [how many]i [IP
Iris meet ti people]]
17It does not seem that sluicing obviates all left branch islands. Shown below in (), certain degreequestion phrases cannot seem to avoid island constraints (See Kennedy and Merchant 2000 fordetails):
() ?*I know that Ivy kissed a soldier, but I don’t know [CP [how tall]i [IP Ivy kissed a ti soldier]]
This counts as an asymmetry between the facts of traditional sluicing and backward sluicing. Butthe backward sluicing example is somewhat degraded:
() ?How tall and when did Ivy kiss a soldier.
31
However, this approach has other problems. Kazanin 2001 and others point out
that the un-elided versions of the examples would have different interpretations from
the elided versions. That is, if (81) is the elided version of (82), it is a mystery why
they have different interpretations.
(81) What and when did Ivan eat?
(82) What did Ivan eat and when did Ivan eat?
The sentence in (81) has an interpretation that more closely resembles that of (83)
where there is an overt pronoun in the second clause that is co-indexed with the first
clause’s wh-phrase:
(83) Whati did Ivan eat and when did Ivan eat iti?
However, if backwards sluicing is forced on the above sentence, the result is ungram-
matical:
(84) *What and when did Ivan eat it?
(viz. *Whati did Ivan eat ti and when did Ivan eat it?)
There are, again, other criticisms of such accounts but the one above is sufficient for
serious doubts. Traditional sluicing does not run into the same problems as backward
sluicing does here.
Yet whether or not the above argument holds is beside the point with respect to one
final problem that both approaches succumb to. In both types of previous accounts
there is movement of the intial wh-phrase. As such we expect to find reconstruction
32
effects of the sort discussed earlier. Regular movement allows for reconstruction
effects and sluicing does not obviate them (85).
(85) I know that the women read some books about each other, I just can’t
remember which books about each other.
Both types of previous accounts predict reconstruction effects, but there do not seem
to be any reconstruction effects in the sentences at hand. The approach posited here
is the only one that predicts this.
In the next section, I offer a derivation of the relevant sentences that avoids the
shortcoming of the previous accounts.
7. Deriving Gaps
In the previous sections I have argued that certain long-distance wh-depedencies
are dot derived via movement but rather base generation (first merge) of a wh-phrase
into the left-periphery. In addition to this, I have presented argumentation that
current accounts of the construction are inadequate. In this section I got through
an example derivation of the type I am discussing. Following this I offer a logical
form of the resulting syntactic representation. I conclude with a discussion of the
cross-linguistics viability of this approach.
7.1. Sample Derivation. In this section I offer a simplified derivation of the sen-
tence in (86).
(86) What and when did Ivy eat?
33
The initial step in the derivation is the merger of a suitable verb into the derivation.
That is, an optionally transitive verb is merged not with an internal argument, but
rather with a small v (87).
(87) Merge(read,v) ⇒ v
v read
There is no subcategorization clash that arises from failing to merge an internal
argument with the verb and the derivation can continue as it would otherwise.
The next crucial step in the derivation is the introduction of the first wh-word in
its low, base-generated position (88).18
(88) v
whenj v
v read
The derivation will continue in unexciting fashion until the C-head is merged and
the impetus for wh-movement of ‘when’ is introduced. The only extant wh-word in
the derivation will then move to the Spec,CP position (89).19
18The precise structural location of this wh-word is irrelevant for our purposes.19The particular featural motivation for this movement is also irrelevant here.
34
(89) CP+wh
C+wh
C+wh ...
... v
whenj v
v read
At this point the derivation has obeyed all syntactic strictures and ought to be
well formed, all else being equal. For such a structure, as has been argued earlier, it
is possible for the late (though not post-spellout) introduction of another wh-word.
7.2. Introduction of the second wh-word. Following Hornstein (2009) and Horn-
stein and Nunes (2008) in particular, there is a point in the derivation of a wh-
question in which the moved wh-word has not fully Merged with the matrix C-head.
These researchers break down Merge into two sub-operations: concatenate and label.
Concatenate first combines two syntactic objects into an unlabeled set. This set
does not act as (nor is it really) a constituent. As such the members of the set are
liable to undergo further concatenation operations. After concatentation, the set can
(adjuncts may, arguments must) undergo Label which takes one of the members of the
set and labels it with respect to another object that has been concatenated with. This
decomposition allows for a principled means of accounting for adjunctival behavior in
a bare phrase structure system (see Hornstein 2009 with respect to Chomsky 1995)
as well as free relative constructions (Hornstein and Nunes 2008). For want of space,
35
assume this to be a possible means of construing merge (though there are other
means of accounting for the following steps in the derivation).
Taking this to be a possibility, there is a stage in the derivation of the above
sentence at which the wh-C concatenation has not yet been labeled and wh-word is
still open to further concatenation (90).
(90)
whenj C+wh
C+wh ...
... v
tj v
v read
At this point, another wh-word can concatenate with the moved wh-word.20 The
result, before labeling, would be something like that in (91).
20Larson (2010) argues that this sort of merger of like categories is what underlies coordination.The lexical item ‘and’ is an epiphenomenal grammatical formative under this view.
36
(91)
whati whenj C+wh
C+wh ...
... v
tj v
v read
As argued by Larson (2011) the the extension condition is irrelevant to the labeling
operation. As such, the concatenated relations sketched above can label in any
order. For concreteness let us say that the two wh-words undergo labeling first. The
arguement wh-word ‘what’ has adjoined to the adjunct wh-word ‘when’ and thus the
latter should project under labeling. This is shown in (92).
(92)whenj C+wh
C+wh ...
... v
tj v
v read
whati whenj
Following this, the head of the coordinated wh-phrase ‘when’ labels with the C-
head, as shown in (93).21
21The same effect can be achieved by adopting a sideward movement (Nunes 2001) analysis of themoving wh-word whereby the moving wh-word moves first to a different derivational workspace.There, the second wh-word adjoins to it before this new object merges into spec,CP.
37
(93) CP+wh
whenj C+wh
C+wh ...
... v
tj v
v read
whati whenj
The above is the final representation of the sentence in (). IN what follows I offer
a simple logical form for the above representation.
7.3. Logical Form. As noted earlier, verbs like ‘eat’ presuppose internal objects
that are need not be represented explicitly in the syntax. Such presuppositions can
be semantically captured by introducing the presupposed elements into the restrictor
of the event variable. If the event is restricted to only those event in which there was
a relevant internal argument, we can capture the inference that there is a theme in
sentences like (94). The logical form of such a sentence would then look something
like (95).
(94) Ivan read.
(95) [[Ivan read]] =
∃x[∃e: Theme(x,e)]{read(e) & Agent(Ivan, e)}
‘there exists an x and there exists an event e whose theme is x such that the
event was an reading and its agent was Ivan.’
38
A similar thing can be done with respect to the sentence type at hand. A sentence
like (96) can have the logical form in (97).
(96) What and when did Ivan read?
(97) [[What and when did Ivan read]] =
?x?t[∃e: Theme(x,e)]{read(e) & Agent(Ivan, e) & Time(t,e)}
‘for what x and for what t does there exist an event e whose theme is x such
that the event was an reading and its agent was Ivan and it took place at
time t.’
The argument question operator binds into the restrictor of the event quantifier and
the time question operator binds into the nuclear scope of the event quantifier. The
specifics of such an account are again not very important. The crucial idea is that
the argument question operator can bind a position that it did not move from in the
syntactic derivation. Instead, it can bind into the event quantifier’s restrictor.
7.4. Cross-linguistic Predictions. The data in this article so far has come entirely
from English. This was a fitting starting point given the difference in English between
obligatorily transitive verbs and optionally transitive ones. It seems to be the case
that in English it is possible to infer covert direct objects of particular verbs (like
‘eat’) but not others (like ‘fix’). This is by no means a deep fact about English.
Some exceedingly similar verbs have different transitivity requirements. With ‘eat’
it is possible to infer the direct object, with ‘devour’ it is not (98).
(98) *Iris devoured
39
There are many other pairs of the same sort (‘finish’ and ‘bake’ versus ‘complete’ and
’fry’). The point is that we do not this different to be reflective of a deep repercussion
of the lexical semantics of the verb. That is, there should not be something about
the act of baking that differs from the act of frying that allows one to presume the
theme.
English could have been different in such a way that all the optionally transitive
verbs were obligatorily transitive and vice verse. Equally possible would be that
English verbs were all optionally transitive. Were this the case, we would expect
all direct objects to be able to be represented by the first wh-phrase in the relevant
sentences. We are able to test this hypothesis with Hungarian, a language in which
all verbs are optionally transitive.
As shown in (99) below, the verb repair does not require an internal argument
(data from Liptak 2011 and an informant).
(99) Javítottál
repaired.2sgmeg
PV(You repaired.)
(100) Hol
wherejavítottál
repaired.2sgmeg
PV(Where did you repair?)
Given that the internal argument of the ‘repair’ in Hungarian seems to be be pre-
supposed, it ought to be possible to create a ‘what and’ from such a verb, contrary
to the facts in English. This is the case (101).
(101) Mit
whatjavítottál
repaired.2sgés
andhol
wherejavítottál
repaired.2sgmeg?
PV(What and when did you repair?)
40
The same ought to hold in language where the subject of finite clauses is not
obligatorily overt. In English, this is obviously not possible (102). Furthermore, it is
not possible for the relevant types of sentences to have the first wh-phrase be related
to the subject position (103).
(102) *Will sleep
(103) *Who and when will sleep?
However, in a language like Russian, it is possible (given the correct context) for a
finite clause to be missing a subject (data from Gribanova 2008) (104).22
(104) Budet
will.3sgspat.
sleep.infHe will sleep.
(105) Gde
wherebudet
will.3SGspat?
sleep.infWhere will he sleep?
I then predict that Russian ought to allow subjects of finite verbs to serve as the
first wh-phrase of the relevant sentence types. This turns out the be the case as seen
below in example (106).
(106) Kto
whoi
andgde
wherebudet
will.3sgspat?
sleep.inf(who and where will sleep?)
22I am not adopting the traditional little pro coding of subject drop. Instead I follow Alexiadou andAnagnostopoulou 1999 and posit that the agreement marking on the finite verb suffices to satisfythe EPP in these cases.
41
Given the cross-linguistic suggestions above, it seems that there is a generalization to
be made. As long as the argument is optional generally, the first wh-word in can be
related to that missing argument. The presentation in the previous sections predicts
this. So long as the inferred argument can be seen as a presupposition that can be
coded in the restrictor of an event variable.23
7.5. Summary. In this section I have shown that the particular approach to wh-
questions presented here are plausible. They can be derived with minimal extra
assumptions and they can also be given a relatively straightforward semantic repre-
sentation.
8. Conclusion
In this article I have argued essentially two things. First, I have argued that is
theoretically possible to derive a wh-question that has the appearance of movement
but did not involve any actual movement from a base-position in the syntax. Wh-
phrases can optionally be introduced into derivations at the spec,CP position. Ex-
situ wh-phrases were argued to not require abstract case and argued to have the
ability to bind variables that arise only in the semantics.
Second, I argued that a certain type of sentence allows us to unearth derivations in
which ex-situ wh-phrases do not move into their positions. Such sentences (beginning
with something like ‘what and’) have been difficult to capture in the past and I
suggest that a non-movement account works for reasons theoretical and empirical.
The first wh-word in such sentences does not seem to be able to arrive in its overt23It is possible that not all covert arguments can or should be captured in this way. Thus, it couldbe the case that a language has a null argument possibility that does not correlate with allowing thecorresponding wh-phrase as the first conjunct. The converse however ought to hold. Any wh-phrasein the first conjunct ought to correspond to a position that is optionally left empty.
42
position by traditional means of movement. Further, it does not show any of the
telltale signs of having moved at all.
All of this is made possible by the radical re-conceptualization of the syntax that
the minimalist program carries with it.
9. Works Cited
43