MSG Syntax Section
MSG Syntax Section
1
b. A man kicked the ball.
c. The ball kicked a man.
d. A ball kicked the man.
e. The ball, a man kicked.
f. The man, a ball kicked.
All the other 114 combinations, a few of which are given in (2), are unacceptable to native
speakers of English. We use the notation * to indicate that a hypothesized example is ungram-
matical.
(2) a. *Kicked the man the ball.
b. *Man the ball kicked the.
c. *The man a ball kicked.
It is clear that there are certain rules in English for combining words. These rules constrain
which words can be combined together or how they may be ordered, sometimes in groups, with
respect to each other.
Such combinatory rules also play important roles in our understanding of the syntax of an
example like (3a).2 Whatever these rules are, they should give a different status to (3b), an
example which is judged ungrammatical by native speakers even though the intended meaning
of the speaker is relatively clear and understandable.
(3) a. Kim lives in the house Lee sold to her.
b. *Kim lives in the house Lee sold it to her.
The requirement of such combinatory knowledge also provides an argument for the assumption
that we use just a finite set of resources in producing grammatical sentences, and that we do not
just rely on the meaning of words involved. Consider the examples in (4):
(4) a. *Kim fond of Lee.
b. Kim is fond of Lee.
Even though it is not difficult to understand the meaning of (4a), English has a structural re-
quirement for the verb is as in (4b).
More natural evidence of the ‘finite set of rules and principles’ idea can be found in cognitive,
creative abilities. Speakers are unconscious of the rules which they use all the time, and have no
difficulties in producing or understanding sentences which they have never heard, seen, or talked
about before. For example, even though we may well not have seen the following sentence
before, we can understand its meaning if we have a linguistic competence in English:
(5) In January 2002, a dull star in an obscure constellation suddenly became 600,000
times more luminous than our Sun, temporarily making it the brightest star in our
galaxy.
2 Starting in Chapter 2, we will see these combinatory rules.
2
A related part of this competence is that a language speaker can produce an infinite number
of grammatical sentences. For example, given the simple sentence (6a), we can make a more
complex one like (6b) by adding the adjective tall. To this sentence, we can again add another
adjective handsome as in (6c). We could continue adding adjectives, theoretically enabling us
to generate an infinitive number of sentences:
(6) a. The man kicked the ball.
b. The tall man kicked the ball.
c. The handsome, tall man kicked the ball.
d. The handsome, tall, nice man kicked the ball.
e. . . .
One might argue that since the number of English adjectives could be limited, there would be a
dead-end to this process. However, no one would find themselves lost for another way to keep
the process going (cf. Sag et al. 2003):
(7) a. Some sentences can go on.
b. Some sentences can go on and on.
c. Some sentences can go on and on and on.
d. Some sentences can go on and on and on and on.
e. . . .
To (7a), we add the string and on, producing a longer one (7b). To this resulting sentence (7c),
we once again add and on. We could in principle go on adding without stopping: this is enough
to prove that we could make an infinite number of well-formed English sentences.3
Given these observations, how then can we explain the fact that we can produce or under-
stand an infinite number of grammatical sentences that we have never heard or seen before? It
seems implausible to consider that we somehow memorize every example, and in fact we do not
(Pullum and Scholz 2002). We know that this could not be true, in particular when we consider
that native speakers can generate an infinite number of infinitely long sentences, in principle. In
addition, there is limit to the amount of information our brain can keep track of, and it would
be implausible to think that we store an infinite number of sentences and retrieve whenever we
need to do so.
These considerations imply that a more appropriate hypothesis would be something like (8):4
(8) All native speakers have a grammatical competence which can generate an infinite
set of grammatical sentences from a finite set of resources.
3 Think of a simple analogy: what is the longest number? Yet, how many numbers do you know? The second question
to speakers’ internalized knowledge of their language, whereas performance refers to actual usage of this abstract
knowledge of language.
3
This hypothesis has been generally accepted by most linguists, and has been taken as the subject
matter of syntactic theory. In terms of grammar, this grammatical competence is hypothesized
to characterize a generative grammar, which we then can define as follows (for English, in
this instance):
(9) Generative Grammar:
An English generative grammar is the one that can generate an infinite set of well-
formed English sentences from a finite set of rules or principles.
The job of syntax is thus to discover and formulate these rules or principles.5 These rules tell us
how words are put together to form grammatical phrases and sentences. Generative grammar,
or generative syntax, thus aims to define these rules which will characterize all of the sentences
which native speakers will accept as well-formed and grammatical.
[Step I: Observing Data] To discover a grammar rule, the first thing we need to do is to
check out grammatical and ungrammatical variants of the expression in question. For example,
5 In generative syntax, ‘rules’ refers not to ‘prescriptive rules’ but to ‘descriptive rules’. Prescriptive rules are those
which disfavor or even discredit certain usages; these prescribe forms which are generally in use, as in (i). Meanwhile,
descriptive rules are meant to characterize whatever forms speakers actually use, with any social, moral, or intellectual
judgement.
(i) a. Do not end a sentence with a preposition.
b. Avoid double negatives.
c. Avoid split infinitives.
The spoken performance of most English speakers will often contain examples which violate such prescriptive rules.
6 Much of the discussion and data in this section are adopted from Baker, C.L. (1995).
4
let us look at the usage of the word evidence:
(10) Data Set 1: evidence
a. *The professor found some strong evidences of water on Mars.
b. *The professor was hoping for a strong evidence.
c. *The evidence that John found was more helpful than the one that Smith found.
What can you tell from these examples? We can make the following observations:
(11) Observation 1:
a. evidence cannot be used in the plural.
b. evidence cannot be used with the indefinite article a(n).
c. evidence cannot be referred to by the pronoun one.
In any scientific research one example is not enough to draw any conclusion. However, we
can easily find more words that behave like evidence:
(12) Data Set 2: equipment
a. *We had hoped to get three new equipments every month, but we only had enough
money to get an equipment every two weeks.
b. *The equipment we bought last year was more expensive than the one we bought
this year.
We thus extend Observation 1 a little bit further:
(13) Observation 2:
a. evidence/equipment cannot be used in the plural.
b. evidence/equipment cannot be used with the indefinite article a(n).
c. evidence/equipment cannot be referred to by the pronoun one.
It is usually necessary to find contrastive examples to understand the range of a given observa-
tion. For instance, words like clue and tool act differently:
(14) Data Set 3: clue
a. The professor gave John some good clues for the question.
b. The student was hoping for a good clue.
c. The clue that John got was more helpful than the one that Smith got.
(15) Data Set 4: tool
a. The teacher gave John some good tools for the purpose.
b. The student was hoping for a tool.
c. The tool that Jones got was more helpful than the one that Smith got.
Unlike equipment and evidence, the nouns clue and tool can be used in the test linguistic con-
texts we set up. We thus can add Observation 3, different from Observation 2:
(16) Observation 3:
5
a. clue/tool can be used in the plural.
b. clue/tool can be used with the indefinite article a(n).
c. clue/tool can be referred to by the pronoun one.
[Step II: Forming a Hypothesis] From the data and observations we have made so far, can
we make any hypothesis about the English grammar rule in question? One hypothesis that we
can make is something like the following:
(17) First Hypothesis:
English has at least two groups of nouns, Group I (count nouns) and Group II
(non-count nouns), diagnosed by tests of plurality, the indefinite article, and the
pronoun one.
[Step III: Checking the Hypothesis] Once we have formed such a hypothesis, we need to
check out if it is true of other data, and also see if it can bring other analytical consequences.
A little further thought allows us to find support for the two-way distinction for nouns. For
example, consider the usage of much and many:
(18) a. much evidence, much equipment, information, much furniture, much advice
b. *much clue, *much tool, *much armchair, *much bags
(19) a. *many evidence, *many equipment, *many information, *many furniture, *many
advice
b. many clues, many tools, many suggestions, many armchairs
As observed here, count nouns can occur only with many, whereas non-count nouns can com-
bine with much. Similar support can be found from the usage of little and few:
(20) a. little evidence, little equipment, little advice, little information
b. *little clue, *little tool, *little suggestion, *little armchair
(21) a. *few evidence, *few equipment, *few furniture, *few advice, *few information
b. few clues, few tools, suggestions, few armchairs
The word little can occur with non-count nouns like evidence, yet few cannot. Meanwhile, few
occurs only with count nouns.
Given these data, it appears that the two-way distinction is quite plausible and persuasive.
We can now ask if this distinction into just two groups is really enough for the classification of
nouns. Consider the following examples with cake:
(22) a. The mayor gave John some good cakes.
b. The president was hoping for a good cake.
c. The cake that Jones got was more delicious than the one that Smith got.
Similar behavior can be observed with a noun like beer, too:
6
(23) a. The bartender gave John some good beers.
b. No one knows how to tell from a good beer to a bad one.
These data show us that cake and beer may be classified as count nouns. However, observe
the following:
(24) a. My pastor says I ate too much cake.
b. The students drank too much beer last night.
(25) a. We recommend to eat less cake and pastry.
b. People now drink less beer.
The data mean that cake and beer can also be used as non-count nouns since that can be used
with less or much.
[Step IV: Revising the Hypothesis] The examples in (24) and (25) imply that there is an-
other group of nouns that can be used as both count and non-count nouns. This leads us to revise
the hypothesis in (17) as following:
(26) Revised Hypothesis:
There are at least three groups of nouns: Group 1 (count nouns), Group 2 (non-count
nouns), and Group 3 (count and non-count).
We can expect that context will determine whether a Group 3 noun is used as count or as non-
count.
As we have observed so far, the process of finding finite grammar rules crucially hinges on
finding data, drawing generalizations, making a hypothesis, and revising this hypothesis with
more data.
7
b. *The average age at which people begin to need eyeglasses vary considerably.
Once we have structural knowledge of such sentences, it is easy to see that the essential element
of the subject in (28a) is not pilots but strike. This is why the main verb should be has but not
have to observe the basic agreement rule in (27). Meanwhile, in (28b), the head is the noun age,
and thus the main verb vary needs to agree with this singular noun. It would not do to simply
talk about ‘the noun’ in the subject in the examples in (28), as there is more than one. We need
to be able to talk about the one which gives its character to the phrase, and this is the head. If
the head is singular, so is the whole phrase, and similarly for plural. The head of the subject and
the verb (in the incorrect form) are indicated in (29):
(29) a. *[The recent strike by pilots] have cost the country a great deal of money from
tourism and so on.
b. *[The average age at which people begin to need eyeglasses] vary considerably.
Either example can be made into a grammatical version by pluralizing the head noun of the
subject.
Now let us look at some slightly different cases. Can you explain why the following examples
are unacceptable?
(30) a. *Despite of his limited educational opportunities, Abraham Lincoln became one of
the greatest intellectuals in the world.
b. *A pastor was executed, notwithstanding on many applications in favor of him.
To understand these examples, we first need to recognize that the words despite and notwith-
standing are prepositions, and further that canonical English prepositions combine only with
noun phrases. In (30), these prepositions combine with prepositional phrases again (headed by
of and on respectively), violating this rule.
A more subtle instance can be found in the following:
(31) a. Visiting relatives can be boring.
b. I saw that gas can explode.
These examples each have more than one interpretation. The first one can mean either that the
event of seeing our relatives is a boring activity, or that the relatives visiting us are themselves
boring. The second example can either mean that a specific can containing gas exploded, which
I saw, or it can mean that I observed that gas has a possibility of exploding. If one knows English
syntax, that is, if one understands the syntactic structure of these English sentences, it is easy to
identify these different meanings.
Here is another example which requires certain syntactic knowledge:
(32) He said that that ‘that’ that that man used was wrong.
This is the kind of sentence one can play with when starting to learn English grammar. Can you
analyze it? What are the differences among these five thats? Structural (or syntactic) knowledge
8
can be used to diagnose the differences. Part of our study of syntax involves making clear exactly
how each word is categorized, and how it contributes to a whole sentence.
When it comes to understanding a rather complex sentence, knowledge of English syntax
can be a great help. Syntactic or structural knowledge helps us to understand simple as well as
complex English sentences in a systematic way. There is no difference in principle between the
kinds of examples we have presented above and (33):
(33) The government’s plan, which was elaborated in a document released by the Trea-
sury yesterday, is the formal outcome of the Government commitment at the Madrid
summit last year to put forward its ideas about integration.
Apart from having more words than the examples we have introduced above, nothing in this
example is particularly complex.
1.4 Exercises
1. For each of the following nouns, decide if it can be used as a count or as a non-count
(mass) noun. In doing so, construct acceptable and unacceptable examples using the tests
(plurality, indefinite article, pronoun one, few/little, many/much tests) we have discussed
in this chapter.
(i) activity, art, cheese, discussion, baggage, luggage, suitcase, religion, sculpture,
paper, difficulty, cheese, water, experience, progress, research, life
2. Check or find out whether each of the following examples is grammatical or ungrammat-
ical. For each ungrammatical one, provide at least one (informal) reason for its ungram-
maticality, according to your intuitions or ideas.
(i) a. Kim and Sandy is looking for a new bicycle.
b. I have never put the book.
c. The boat floated down the river sank.
d. Chris must liking syntax.
e. There is eager to be fifty students in this class.
f. What is John eager to do?
g. What is John easy to do?
h. Is the boy who holding the plate can see the girl?
i. Which chemical did you mix the hydrogen peroxide and?
j. There seem to be a good feeling developing among the students.
k. Strings have been pulled many times to get students into that university.
3. Consider the following set of data, focusing on the usage of ‘self’ reflexive pronouns and
personal pronouns:
(i) a. He washed himself.
9
b. *He washed herself.
c. *He washed myself.
d. *He washed ourselves.
(ii) a. *He washed him. (‘he’ and ‘him’ referring to the same person)
b. He washed me.
c. He washed her.
d. He washed us.
Can you make a generalization about the usage of ‘self’ pronouns and personal pronouns
like he here? In answering this question, pay attention to what the pronouns can refer to.
Also consider the following imperative examples:
(iii) a. Wash yourself.
b. Wash yourselves.
c. *Wash myself.
d. *Wash himself.
(iv) a. *Wash you!
b. Wash me!
c. Wash him!
Can you explain why we can use yourself and yourselves but not you as the object of
the imperatives here? In answering this, try to put pronouns in the unrealized subject
position.
4. Read the following passage and identify all the grammatical errors. If you can, discuss
the relevant grammar rules that you can think of.
(i) Grammar is important because it is the language that make it possible for
us to talk about language. Grammar naming the types of words and word
groups that make up sentences not only in English but in any language. As
human beings, we can putting sentences together even as children–we can
all do grammar. People associate grammar for errors and correctness. But
knowing about grammar also helps us understood what makes sentences and
paragraphs clearly and interesting and precise. Grammar can be part of lit-
erature discussions, when we and our students closely reading the sentences
in poetry and stories. And knowing about grammar means finding out that
all language and all dialect follow grammatical patterns.8
8 Adapted from “Why is Grammar Important?” by The Assembly for the Teaching of English Grammar
10
2
2.1 Introduction
In Chapter 1, we observed that the study of English syntax is the study of rules which generate
an infinite number of grammatical sentences. These rules can be inferred from observations
about the English data. One simple mechanism we recognize is that in forming grammatical
sentences, we start from words, or ‘lexical’ categories. These lexical categories then form a
larger constituent ‘phrase’; and phrases go together to form a ‘clause’. A clause either is, or is
part of, a well-formed sentence:
(1) sentence
L
rr LLL
rrr LL
rr L
. . . clause
L . ..
rrr LLL
rr LL
rr L
. . . phrase
L . ..
rrr LLL
rr LL
rr L
. . . word ...
Typically we use the term ‘clause’ to refer to a complete sentence-like unit, but which may be
part of another clause, as a subordinate or adverbial clause. Each of the sentences in (2b)–(2d)
contains more than one clause, in particular, with one clause embedded inside another:
(2) a. The weather is lovely today.
b. I am hoping that [the weather is lovely today].
c. If [the weather is lovely today] then we will go out.
d. The birds are singing because [the weather is lovely today].
This chapter deals with what kind of combinatorial rules English employs in forming these
phrases, clauses, and sentences.
11
2.2 Lexical Categories
2.2.1 Determining the Lexical Categories
The basic units of syntax are words. The first question is then what kinds of words (also known
as parts of speech, or lexical categories, or grammatical categories) does English have? Are
they simply noun, verb, adjective, adverb, preposition, and maybe a few others? Most of us
would not be able to come up with simple definitions to explain the categorization of words.
For instance, why do we categorize book as a noun, but kick as a verb? To make it more difficult,
how do we know that virtue is a noun, that without is a preposition, and that well is an adverb
(in one meaning)?
Words can be classified into different lexical categories according to three criteria: meaning,
morphological form, and syntactic function. Let us check what each of these criteria means,
and how reliable each one is.
At first glance, it seems that words can be classified depending on their meaning. For exam-
ple, we could have the following rough semantic criteria for N (noun), V (verb), A (adjective),
and Adv (adverb):
(3) a. N: referring to an individual or entity
b. V: referring to an action
c. A: referring to a property
d. Adv: referring to the manner, location, time or frequency of an action
Though such semantic bases can be used for many words, these notional definitions leave a
great number of words unaccounted for. For example, words like sincerity, happiness, and pain
do not simply denote any individual or entity. Absence and loss are even harder cases.
There are also many words whose semantic properties do not match the lexical category that
they belong to. For example, words like assassination and construction may refer to an action
rather than an individual, but they are always nouns. Words like remain, bother, appear, and
exist are verbs, but do not involve any action.
A more reliable approach is to characterize words in terms of their forms and functions. The
‘form-based’ criteria look at the morphological form of the word in question:
(4) a. N: + plural morpheme -(e)s
b. N: + possessive ’s
c. V: + past tense -ed or 3rd singular -(e)s
d. V: + 3rd singular -(e)s
e. A: + -er/est (or more/most)
f. A: + -ly (to create an adverb)
According to these frames, where the word in question goes in the place indicated by , nouns
allow the plural marking suffix -(e)s to be attached, or the possessive ’s, whereas verbs can have
the past tense -ed or the 3rd singular form -(e)s. Adjectives can take comparative and superlative
12
endings -er or -est, or combine with the suffix -ly. (5) shows some examples derived from these
frames:
(5) a. N: trains, actors, rooms, man’s, sister’s, etc.
b. V: devoured, laughed, devours, laughs, etc.
c. A: fuller, fullest, more careful, most careful, etc.
d. Adv: fully, carefully, diligently, clearly, etc.
The morphological properties of each lexical category cannot be overridden; verbs cannot have
plural marking, nor can adjectives have tense marking. It turns out, however, that these morpho-
logical criteria are also only of limited value. In addition to nouns like information and furniture
that we presented in Chapter 1, there are many nouns such as love and pain that do not have
a plural form. There are adjectives (such as absent and circular) that do not have comparative
-er or superlative -est forms, due to their meanings. The morphological (form-based) criterion,
though reliable in many cases, is not a necessary and sufficient condition for determining the
type of lexical categories.
The most reliable criterion in judging the lexical category of a word is based on its syntactic
function or distributional possibilities. Let us try to determine what kind of lexical categories
can occur in the following environments:
(6) a. They have no .
b. They can .
c. They read the book.
d. He treats John very .
e. He walked right the wall.
The categories that can go in the blanks are N, V, A, Adv, and P (preposition). As can be seen
in the data in (7), roughly only one lexical category can appear in each position:
(7) a. They have no TV/car/information/friend.
b. They have no *went/*in/*old/*very/*and.
(8) a. They can sing/run/smile/stay/cry.
b. They can *happy/*down/*door/*very.
(9) a. They read the big/new/interesting/scientific book.
b. They read the *sing/*under/*very book.
(10) a. He treats John very nicely/badly/kindly.
b. He treats John very *kind/*shame/*under.
(11) a. He walked right into/on the wall.
b. He walked right *very/*happy/*the wall.
As shown here, only a restricted set of lexical categories can occur in each position; we can then
assign a specific lexical category to these elements:
13
(12) a. N: TV, car, information, friend, . . .
b. V: sing, run, smile, stay, cry, . . .
c. A: big, new, interesting, scientific, . . .
d. Adv: nicely, badly, kindly, . . .
e. P: in, into, on, under, over, . . .
In addition to these basic lexical categories, does English have other lexical categories? There
are a few more. Consider the following syntactic environments:
(13) a. student hits the ball.
b. John sang a song, Mary played the piano.
c. John thinks Bill is honest.
The only words that can occur in the open slot in (13a) are words like the, a, this, that, and so
forth, which are determiner (Det). (13b) provides a frame for conjunctions (Conj) such as and,
but, so, for, or, yet.1 In (13c), we can have the category we call ‘complementizer’, here the word
that – we return to these in (17) below.
Can we find any supporting evidence for such lexical categorizations? It is not so difficult to
construct environments in which only these lexical elements appear. Consider the following:
(14) We found out that very lucrative jobs were in jeopardy.
Here we see that only words like the, my, his, some, these, those, and so forth can occur here.
These articles, possessives, quantifiers, and demonstratives all ‘determine’ the referential prop-
erties of jobs here, and for this reason, they are called determiners. One clear piece of evidence
for grouping these elements as the same category comes from the fact that they cannot occupy
the same position at the same time:
(15) a. *[My these jobs] are in jeopardy.
b. *[Some my jobs] are in jeopardy.
c. *[The his jobs] are in jeopardy.
Words like my and these or some and my cannot occur together, indicating that they compete
with each other for just one structural position.
Now consider the following examples:
(16) a. I think learning English is not easy at all.
b. I doubt you can help me in understanding this.
c. I am anxious you to study English grammar hard.
Once again, the possible words that can occur in the specific slot in (17) are strictly limited.
(17) a. I think that [learning English is not all that easy].
1 These conjunctions are ‘coordinating conjunctions’ different from ‘subordinating conjunctions’ like when, if, since,
though, and so forth. The former conjoins two identical phrasal elements whereas the latter introduces a subordinating
clause as in [Though students wanted to study English syntax], the department decided not to open that course this year.
14
b. I doubt if [you can help me in understanding this].
c. I am anxious for [you to study English grammar hard].
The italicized words here are different from the other lexical categories that we have seen so
far. They introduce a complement clause, marked above by the square brackets, and may be
sensitive to the tense of that clause. A tensed clause is known as a ‘finite’ clause, as opposed to
an infinitive. For example, that and if introduce or combine with a tensed sentence (present or
past tense), whereas for requires an infinitival clause marked with to. We cannot disturb these
relationships:
(18) a. *I think that [learning English to be not all that easy].
b. *I doubt if [you to help me in understanding this].
c. *I am anxious for [you should study English grammar hard].
The term ‘complement’ refers to an obligatory dependent clause or phrase relative to a head.2
The italicized elements in (18) introduce a clausal complement and are consequently known as
‘complementizers’ (abbreviated as ‘C’). There are only a few complementizers in English (that,
for, if , and whether), but nevertheless they have their own lexical category.
Now consider the following environments:
(19) a. John not leave.
b. John drink beer last night.
c. John leave for Seoul tomorrow?
d. John will study syntax, and Mary , too.
The words that can appear in the blanks are neither main verbs nor adjectives, but rather words
like will, can, shall and must. In English, there is clear evidence that these verbs are different
from main verbs, and we call them auxiliary verbs (Aux). The auxiliary verb appears in front
of the main verb, which is typically in its citation form, which we call the ‘base’ form. Note the
change in the main verb form in (20b) when the negation is added:
(20) a. He left.
b. He did not leave.
There is also one type of to which is auxiliary-like. Consider the examples in (21) and (22):
(21) a. Students wanted to write a letter.
b. Students intended to surprise the teacher.
(22) a. Students objected to the teacher.
b. Students sent letters to the teacher.
It is easy to see that in (22), to is a preposition. But how about the infinitival marker to in (21),
followed by a base verb form? What lexical category does it belong to? Though the detailed
properties of auxiliary verbs will not be discussed until Chapter 8, we treat the infinitival marker
2 See Chapter 4 for a fuller discussion of ‘head’ and ‘complement’.
15
to as an auxiliary verb. For example, we can observe that to behaves like an auxiliary verb
should:
(23) a. It is crucial for John to show an interest.
b. It is crucial that John should show an interest.
(24) a. I know I should [go to the dentist’s], but I just don’t want to.
b. I don’t really want to [go to the dentist’s], but I know I should.
In (23), to and should introduce the clause and determines the tenseness of the clause. In (24),
they both can license the ellipsis of its VP complement.3
Another property to shares with other auxiliary verbs like will is that it requires a base verb
to follow. Most auxiliary verbs are actually finite (tensed) forms which therefore pattern with
that in a finite clause, while the infinitival clause introduced by for is only compatible with to:
(25) a. She thought it was likely [that everyone *to/might/would fit into the car].
b. She thought it was easy [for everyone to/*might/*would fit into the car].
Finally, there is one remaining category we need to consider, the ‘particles’ (Part), illustrated
in (26):
(26) a. The umpire called off the game.
b. The two boys looked up the word.
Words like off and up here behave differently from prepositions, in that they can occur after the
object.
(27) a. The umpire called the game off .
b. The two boys looked the word up.
Such distributional possibilities cannot be observed with true prepositions:
(28) a. The umpire fell off the deck.
b. The two boys looked up the high stairs (from the floor).
(29) a. *The umpire fell the deck off .
b. *The students looked the high stairs up (from the floor).
We can also find differences between particles and prepositions in combination with an object
pronoun:
(30) a. The umpire called it off . (particle)
b. *The umpire called off it.
(31) a. *The umpire fell it off .
b. The umpire fell off it. (preposition)
3 See Chapter 8 for detailed discussion on the ellipsis.
16
The pronoun it can naturally follow the preposition as in (31b), but not the particle in (30b).
Such contrasts between prepositions and particles give us ample reason to introduce another
lexical category Part (particle) which is differentiated from P (preposition). In the next section,
we will see more tests to differentiate these two types of word.
18
c. We need more intelligent leaders.
These sentences have different meanings depending on how we group the words. For example,
(42a) will have the following two different constituent structures:
(43) a. John saw [the man with a telescope].
(the man had the telescope)
b. John [[saw the man] with a telescope].
(John used the telescope)
Even these very cursory observations indicate that a grammar with only lexical categories is
not adequate for describing syntax. In addition, we need a notion of ‘constituent’, and need to
consider how phrases may be formed, grouping certain words together.
Cleft: The cleft construction, which places an emphasized or focused element in the X posi-
tion in the pattern ‘It is/was X that . . . ’, can provide us with simple evidence for the existence
of phrasal units. For instance, think about how many different cleft sentences we can form from
(46).
(46) The policeman met several young students in the park last night.
With no difficulty, we can cleft almost all the constituents we can get from the above sentence:
(47) a. It was [the policeman] that met several young students in the park last night.
b. It was [several young students] that the policeman met in the park last night.
c. It was [in the park] that the policeman met several young students last night.
d. It was [last night] that the policeman met several young students in the park.
19
However, we cannot cleft sequences that not form constituents:6
(48) a. *It was [the policeman met] that several young students in the park last night.
b. *It was [several young students in] that the policeman met the park last night.
c. *It was [in the park last night] that the policeman met several young students.
Constituent Questions and Stand-Alone Test: Further support for the existence of phrasal
categories can be found in the answers to ‘constituent questions’, which involve a wh-word such
as who, where, when, how. For any given wh-question, the answer can either be a full sentence
or a fragment. This stand-alone fragment is a constituent:
(49) A: Where did the policeman meet several young students?
B: In the park.
(50) A: Who(m) did the policeman meet in the park?
B: Several young students.
This kind of test can be of use in determining constituents; we will illustrate with example (51):
(51) John put old books in the box.
Are either old books in the box or put old books in the box a constituent? Are there smaller
constituents? The wh-question tests can provide some answers:
(52) A: What did you put in your box?
B: Old books.
B: *Old books in the box.
(53) A: Where did you put the book?
B: In the box.
B: *Old books in the box.
(54) A: What did you do?
B: *Put old books.
B: *Put in the box.
B: Put old books in the box.
Overall, the tests here will show that old books and in the box are constituents, and that put old
books in the box is also a (larger) constituent.
The test is also sensitive to the difference between particles and prepositions. Consider the
similar-looking examples in (55), including looked and up:
(55) a. John looked up the inside of the chimney.
b. John looked up the meaning of ‘chanson’.
6 The verb phrase constituent met . . . night here cannot be clefted for independent reasons (see Chapter 12).
20
The examples differ, however, as to whether up forms a constituent with the following material
or not. We can again apply the wh-question test:
(56) A: What did he look up?
B: The inside of the chimney.
B: The meaning of ‘chanson’.
(57) A: Where did he look?
B: Up the inside of the chimney.
B: *Up the meaning of ‘chanson’.
(58) A: Up what did he look?
B: The inside of the chimney.
B: *The meaning of ‘chanson’.
What the contrasts here show is that up forms a constituent with the inside of the chimney in
(55a) whereas it does not with the meaning of ‘chanson’ in (55b).
Substitution by a Pronoun: English, like most languages, has a system for referring back to
individuals or entities mentioned by the use of pronouns. For instance, the man who is standing
by the door in (59a) can be ‘substituted’ by the pronoun he in (59b).
(59) a. What do you think the man who is standing by the door is doing now?
b. What do you think he is doing now?
There are other pronouns such as there, so, as, and which, which also refer back to other con-
stituents.
(60) a. Have you been [to Seoul]? I have never been there.
b. John might [go home], so might Bill.
c. John might [pass the exam], and as might Bill.
d. If John can [speak French fluently] – which we all know he can – we will have no
problems.
A pronoun cannot be used to refer back to something that is not a constituent:
(61) a. John asked me to put the clothes in the cupboard, and to annoy him I really stuffed
them there [there=in the cupboard].
b. John asked me to put the clothes in the cupboard, and to annoy him I stuffed them
there [them=the clothes].
c. *John asked me to put the clothes in the cupboard, but I did so [=put the clothes] in
the suitcase.
Both the pronoun there and them refer to a constituent. However, so in (61c), referring to a VP,
refers only part of a constituent put the clothes, making it unacceptable.
21
Coordination: Another commonly-used test is coordination. Words and phrases can be co-
ordinated by conjunctions, and each conjunct is typically the same kind of constituent as the
other conjuncts:
(62) a. The girls [played in the water] and [swam under the bridge].
b. The children were neither [in their rooms] nor [on the porch].
c. She was [poor] but [quite happy].
d. Many people drink [beer] or [wine].
If we try to coordinate unlike constituents, the results are typically ungrammatical.
(63) a. *Mary waited [for the bus] and [to go home].
b. *Lee went [to the store] and [crazy].
Even though such syntactic constituent tests are limited in certain cases, they are often
adopted in determining the constituent of given expressions.
22
(67) NP
lllylyEEE
lll yy EE
l llll yyy EE
E
l y
(Det) A* N (PP/S)
24
2.5.3 AP: Adjective Phrase
The most common environment where an adjective phrase (AP) occurs is in ‘linking verb’
constructions as in (83):
(83) John feels .
Expressions like those in (84) can occur in the blank space here:
(84) happy, uncomfortable, terrified, sad, proud of her, proud to be his student, proud that
he passed the exam, etc.
Since these all include an adjective (A), we can safely conclude that they all form an AP. Look-
ing into the constituents of these, we can formulate the following simple PS rule for the AP:
(85) AP → A (PP/VP/S)
This simple AP rule can easily explain the following:
(86) a. John sounded happy/uncomfortable/terrified/proud of her.
b. John felt proud that his son won the game.
c. John sounded *happily/*very/*the student/*in the park.
The verb sounded requires an AP to be followed, but in (86c) we have no AP. In addition,
observe the contrasts in the following examples:
(87) a. *The monkeys seem [want to leave the meeting].
b. The monkeys seem [eager to leave the meeting].
(88) a. *John seems [know about the bananas].
b. John seems [certain about the bananas].
These examples tell us that the verb seem combines with an AP, but not with a VP.
25
2.5.5 PP: Preposition Phrase
Another major phrasal category is preposition phrase (PP). PPs like those in (92), generally
consist of a preposition plus an NP.
(92) from Seoul, in the box, in the hotel, into the soup, with John and his dog, under the
table, etc.
These PPs can appear in a wide range of environments:
(93) a. John came from Seoul.
b. They put the book in the box.
c. They stayed in the hotel.
d. The fly fell into the soup.
One clear case in which only a PP can appear is the following:
(94) The squirrel ran straight/right .
The intensifiers straight and right can occur neither with an AP nor with an AdvP:
(95) a. The squirrel ran straight/right up the tree.
b. *The squirrel is straight/right angry.
c. *The squirrel ran straight/right quickly.
From the examples in (92), we can deduce the following general rule for forming a PP:11
(96) PP → P NP
The rule states that a PP consists of a P followed by an NP. We cannot construct unacceptable
PPs like the following:
(97) *in angry, *into sing a song, *with happily, . . .
any time its environment is satisfied, regardless of any other contextual restrictions.
26
e. AdvP → (AdvP) Adv
f. PP → P NP
The rules say that a sentence is the combination of NP and VP, and an NP can be made up of
a Det, any number of As, an obligatory N, and any number of PPs, and so on.. Of the possible
tree structures that these rules can generate, the following is one example:
(99) ST
jj jjjjj TTTTTTT
j TTTT
jjjj
NP VP
tJJ XX
jjjj XXXXXXXXX
ttt JJJJ jjjj XXXXX
t j
tt J jjj X
Det A N V NP
J PP
tt JJJJ tJJ
tt J ttt JJJJ
t J t J
tt tt
... ... ... . . . Det N P NP
J
tt JJJJ
ttt JJ
tt
... ... . . . Det N
... ...
With the structural possibilities shown here, let us assume that we have the following lexical
entries:
(100) a. Det: a, an, this, that, any, some, which, his, her, no, etc.
b. A: handsome, tall, fat, large, dirty, big, yellow, etc.
c. N: book, ball, hat, friend, dog, cat, man, woman, John, etc.
d. V: kicked, chased, sang, met, believed, thinks, imagines, assumes etc.
Inserting these elements in the appropriate pre-terminal nodes (the places with dots) in (99), we
are able to generate various sentences like those in (101):13
(101) a. This handsome man chased a dog.
b. A man kicked that ball.
c. That tall woman chased a cat.
d. His friend kicked a ball.
There are several ways to generate an infinite number of sentences with this kind of grammar.
As we have seen before, one simple way is to repeat a category (e.g., adjective) infinitely. There
are also other ways of generating an infinite number of grammatical sentences. Look at the
following two PS rules from (98) again:
13 The grammar still generates semantically anomalous examples like#The desk believed a man or#A man sang her
hat. For such semantically distorted examples, we need to refer to the notion of ‘selectional restrictions’ (see Chapter
7).
27
(102) a. S → NP VP
b. VP → V S
As we show in the following tree structure, we can ‘recursively’ apply the two rules, in the sense
that one can feed the other, and then vice versa:
(103) 7654
0123
S
r rLLL
r LL
rr
89:;
?>=<
LL
rr
NP VP
L
rrr LLL
r LL
0123
7654
r
rr L
N V SL
rrr LLL
rr LL
?>=<
89:;
rr L
John believes NP VP
rLL
rrr LLL
r LL
rr
N V SJ
ttt JJJJ
tt JJ
tt
Mary thinks Tom is honest
It is not difficult to expand this sentence by applying the two rules again and again:
(104) a. Bill claims John believes Mary thinks Tom is honest.
b. Jane imagines Bill claims John believes Mary thinks Tom is honest.
There is no limit to this kind of recursive application of PS rules: it proves that this kind of
grammar can generate an infinite number of grammatical sentences.
One structure which can be also recursive involves sentences involving auxiliary verbs. As
noted before in (79), an auxiliary verb forms a larger VP after combining with a VP:
(105) S
rLL
rrr LLL
0123
7654
r LL
rr
NP VP
r rLLL
rr LL
0123
7654
r LL
rr
N V[AUX +] VP
r rLLL
rr LL
r LL
rr
They will V NP
This means that we will also have a recursive structure like the following:14
14 Due to the limited number of auxiliary verbs, and restrictions on their cooccurrence, the maximum number of
28
(106) S
nnnPPPP
nnn PPP
89:;
?>=<
nnn PP
NP VP
nnnPPPP
nn PPP
0123
7654
nn PP
nn
N V [AUX +] VP
nn nPPPP
nn PPP
89:;
?>=<
nn PP
nn
They will V[AUX +] VP
nnnPPPP
nnn PPP
nn n PP
have V[AUX +] VP
S
kkkkkk SSSSSS
kkk SSSS
kk
been studying English syntax
Another important property that PS rules bring us is the ability to make reference to hierar-
chical structures within given sentences, where parts are assembled into sub-structures of the
whole. One merit of such hierarchical structural properties is that they enable us to represent the
structural ambiguities of sentences we have seen earlier in (42). Let us look at more examples:
(107) a. The little boy hit the child with a toy.
b. Chocolate cakes and pies are my favorite desserts.
Depending on which PS rules we apply, for the sentences here, we will have different hierar-
chical tree structures. Consider the possible partial structures of (107a) which the grammar can
generate:15
(108) a. VP
nn nnWWWWWWWWW
nnn WWWW
VP PP
nnnnPPPPP rrrLLLL
n nn PP rr r LL
V NP with the toy
zDD
zzz DDD
z
hit the child
b. VP
jjjjjjTTTTTTT
jjj TT
V NP
jjjjjTTTTTT
jjjj TTT
hit Det N PP
rrrLLLL
rrr LL
the child with the toy
15 One can draw a slight different structure for (108b) with the introduction of the rule ‘NP → NP PP’.
29
The structures clearly indicate what with the toy modifies: in (108a), it modifies the whole VP
phrase whereas (108b) modifies just the noun child. The structural differences induced by the
PS rules directly represent these meaning differences.
In addition, we can easily show why examples like the following are not grammatical:
(109) a. *The children were in their rooms or happily.
b. *Lee went to the store and crazy.
We have noted that English allows two alike categories to be coordinated. This can be written
as a PS rule, for phrasal conjunction, where XP is any phrase in the grammar.16
(110) XP → XP+ Conj XP
The ‘coordination’ rule says two identical XP categories can be coordinated and form the same
category XP. Applying this PS rule, we will then allow (111a) but not (111b):
(111) a. PP
ggggggggWWWWWWWWW
ggggg WWWW
PP Conj PP
oooOOOO qqqMMMM
o o OOO qq MM
oo q
in their rooms or on the porch
b. *PP
W
g gggggggg WWWWWWWWW
gggg WWW
PP
J Conj AP7
tttt JJJJ 777
tt J
to the store and crazy
30
This in turn means that in addition to the PP formation rule, the grammar needs to introduce the
following VP rules:
(114) a. VP → V Part NP
b. VP → V NP Part
c. VP → V PP
Equipped with these rules, we then can easily represent the differences of these grammatical
sentences (112a), (112b) and (113b) in tree structures:
(115) a. VP
jjjjjTTTTTT
jjjj TTT
V PP
jjjjjTTTTTT
jjjj TTT
get P NP
??
???
off the bus
b. VP
iUU
iiiiiii UUUUUUUU
iii U
V Part NP
pNN
ppp NNNNN
ppp
put off the customers
c. VP
U
iiiiiii UUUUUUUU
i iii UU
V NP Part
ppppNNNNN
ppp NN
put the customers off
As represented here, the particle does not form a constituent with the following or preceding
NP whereas the preposition does form a constituent with it.
In summary, we have seen that a grammar with lexical categories can not only generate
an infinite number of grammatical English sentences, but also account for some fundamental
properties, such as agreement and constituency.17 This motivates the introduction of phrases
into the grammar.
17 In this chapter, we have not discussed the treatment of agreement with PS rules. Chapter 6 discusses the subject-
31
2.7 Exercises
1. Determine the lexical category of the italicized words in the following. In doing so, use
the three criteria (morphological, semantic, and syntactic) to provide the evidence for
your answer and state which one is the most reliable one.
(i) a. His second book came out earlier this year and became an instant best-
seller.
b. When you book something such as a hotel room, you arrange to have it.
c. Price quotes on selected categories will be sent out upon request.
d. No doubt that he was forced to leave his family against his will.
e. He intended to will the large amount of money to Frank.
f. Jane stood aside to let her pass.
g. He has a rail pass that’s right for you.
h. It is important for us to spend time with children.
i. He was arrested for being drunk.
j. I think that person we met last week is insane.
k. We believe that he is quite reasonable.
l. I forgot to return the book that I borrowed from the teacher.
2. Consider the following data carefully and describe the similarities and differences among
that, for, if and whether. In so doing, first compare that and for and then see how these
two are different from if and whether.
(i) a. I am anxious that you should arrive on time.
b. *I am anxious that you to arrive on time.
(ii) a. I am anxious for you to arrive on time.
b. *I am anxious for you should arrive on time.
(iii) a. I don’t know whether/if I should agree.
b. I wonder whether/if you’d be kind enough to give us information.
(vi) a. If students study hard, teachers will be happy.
b. Whether they say it or not, most teachers expect their students to study
hard.
3. Check if the italic parts form a constituent or not, using at least two constituenthood tests
(e.g., cleft, pronoun substitution, stand-alone, etc.). Also provide tree structures for each
of the following examples.
(i) a. John bought a book on the table.
b. John put a book on the table.
(ii) a. She turned down the side street
b. She turned down his offer.
(iii) a. He looked at a book about swimming.
b. He talked to a girl about swimming.
32
c. He talked with a girl about swimming.
(iv) a. I don’t know the people present.
b. John called the president a fool.
4. Explain why the examples in (i) are ungrammatical. As part of the exercise, first draw
structure for each example and try to determine the applicability of the the PS rules such
as the coordination rule in (110), presented earlier in this chapter.
(i) a. *Could you turn off the fire and on the light?
b. *A nuclear explosion would wipe out plant life and out animal life.
c. *He ran down the road and down the President.
d. *I know the truth and that you are innocent.
e. *Lee went to the store and crazy.
5. Provide a tree structure for each of the following sentences and suggest what kind of VP
rules are necessary. In doing so, pay attention to the position of modifiers like proudly,
by the park, and so forth.
(i) a. John refused the offer proudly.
b. I consider Telma the best candidate.
c. I saw him leaving the main building.
d. He took Masako to the school by the park.
e. John sang a song and danced to the music.
f. John wants to study linguistics in near future.
g. They told Angelica to arrive early for the award.
h. That Louise had abandoned the project surprised everyone.
6. Each of the following sentences is structurally ambiguous – it has at least two structures.
Represent the structural ambiguities by providing different tree structures for each string
of words.18
(i) a. I know you like the back of my hand.
b. I forgot how good beer tastes.
c. I saw that gas can explode.
d. Time flies like an arrow.
e. I need to have that report on our webpage by tomorrow.
7. Provide tree structures for each of the following sentences and see if there are any new
PS rules that we need to add, to supplement those we covered in this chapter. If there are
any places you cannot assign structures, please use triangles.
(i) Different languages may have different lexical categories, or they might
associate different properties to the same one. For example, Spanish uses
adjectives almost interchangeably as nouns while English cannot. Japanese
has two classes of adjectives whereas English has one; Korean, Japanese,
18 Fori-e, to help tease out the ambiguity, consider related potential interpretations like Please put that book on my
desk. and That report on our webpage alleges that it does not function well..
33
and Chinese have measure words while European languages have nothing
resembling them; many languages don’t have a distinction between adjec-
tives and adverbs, or adjectives and nouns, etc. Many linguists argue that
the formal distinctions between parts of speech must be made within the
framework of a specific language or language family, and should not be
carried over to other languages or language families.19
34