A CROSS-LINGUISTIC STUDY OF EARLY LEXICAL DEVELOPMENT

Maria Cristina Caselli

National Council of Research, Institute of Psychology, Rome

Elizabeth Bates

University of California, San Diego

Paola Casadio

National Council of Research, Institute of Psychology, Rome Clinica e Centro di Ricerca Santa Lucia

Judi Fenson

Children's Hospital and Research Center, San Diego

Larry Fenson

San Diego State University

Lisa Sanderl

National Council of Research, Institute of Psychology, Rome

Judy Weir

Children's Hospital and Research Center, San Diego



This research was supported by the John D. and Catherine T. MacArthur Foundation Research Network on Early Childhood Transitions, by NIDCD grants DC01289 "Origins of Communication Disorders", and DC00216 "Cross-linguistic studies of aphasia". The manuscript was completed while Bates was a visiting researcher at the Institute of Psychology, National Council of Research, at the Via Nomentana site directed by Virginia Volterra. We are grateful to Virginia Volterra, Elena Pizzuto and Alison Gopnik for comments on an earlier version of the manuscript. Please address all inquiries to Elizabeth Bates, Center for Research in Language, Dept. 0526, University of California at San Diego, La Jolla, CA 92093-0526.

ABSTRACT

Cross-linguistic studies have shown that children can vary markedly in rate, style and sequence of grammatical development, within and across natural languages. It is less clear whether there are robust cross-linguistic differences in early lexical development, with particular reference to the onset and rate of growth in major lexical categories (e.g., nouns, verbs, adjectives and grammatical function words). In this study, we present parental report data on the first stages of expressive and receptive lexical development, for 659 English infants and 195 Italian infants between 8 - 16 months of age. Although there are powerful structural differences between English and Italian that could affect the order in which nouns and verbs are acquired, no differences were observed between these languages in the emergence and growth of lexical categories. In both languages, children begin with words that are difficult to classify in adult part-of-speech categories (i.e., "routines"). This is followed by a period of sustained growth in the proportion of vocabulary contributed by common nouns. Verbs, adjectives and grammatical function words are extremely rare until children have vocabularies of at least 100 words. The same sequences are observed in production and comprehension, although verbs are reported earlier for receptive vocabulary. Our results are compared with other reports in the literature, with special reference to recent claims regarding the early emergence of verbs in Korean.



It is now clear that patterns of language learning can vary markedly across different language com-munities. Thanks in large measure to a generation of pioneering studies by Dan Slobin and his colleagues, striking variations in the patterning of early grammar have been documented (Slobin, 1985, 1992; see also Bates & Marchman, 1988; Braine, 1976; MacWhinney & Bates, 1989). Although children do go beyond their data and produce creative errors in every language studied to date, their first sentences are always tailored to and shaped by the structural properties of their native language-from the rich inflectional systems of Turkish and Greenlandic Eskimo to the austere structure of Chinese, which provides no grammatical inflections of any kind. There is, for example, no "universal tele-graphic stage", and no single order in which gram-matical structures of a particular type are acquired.

Much less is known about cross-linguistic variation in early lexical development, from first words to the onset of word combinations. However, studies of children exposed to English lead us to expect sub-stantial variability in this stage of development as well. There are now many studies documenting large individual differences in rate of language learning (Bloom, Lightbown, & Hood, 1975; McCarthy, 1954; Miller & Chapman, 1981). In the largest study to date, Fenson, Dale, Bates, Reznick and Thal (1994) report the first stages of lexical development in 1,803 normally developing children between 8 and 30 months of age, demonstrating massive variation in onset time and rate of growth in all aspects of early language and communication (see also Fenson et al., 1993; Bates, Marchman et al., 1994). Although many children in their sample offered no systematic evidence for word comprehension before 10 months of age, the parents of some 10-month-old infants report receptive vocabu-laries of more than 100 words. In the same vein, some children are reported to produce little or no meaningful speech before 16 months of age, while others displayed expressive vocabularies of more than 100 words at or before 16 months. Such large individual differences in rate of language development present serious problems for the idea that all normal children develop on a single maturational timetable.

Another line of evidence against this universal timetable comes from studies documenting individual differences in "style" of language learning, qualitative variations that are difficult to explain in terms of linguistic input and/or rate of maturation (Bloom, Light-bown, & Hood, 1975; Nelson, 1973; for reviews, see Bates, Bretherton, & Snyder, 1988; Bates, Dale, & Thal, 1994; Nelson 1981; Goldfield & Snow, 1985; Pine & Lieven, 1990; Shore, 1995). One stylistic extreme is characterized by an "analytic" approach to language, with emphasis on words for common nouns early in the one-word stage (with little or no use of multiword "formulas"), and an abrupt transition into combinatorial speech marked by "telegraphic" combi-nations of nouns and other content words. The other extreme is characterized by an "holistic" or "formulaic" approach to early language, with hetero-geneous voca-bularies from the very beginning of meaningful speech (including production of multiword formulae), and use of pronouns and other grammatical function words (albeit in relatively "frozen" forms) from the beginning of multiword speech. Bates et al. (1988) have argued that these stylistic variations are based upon at least two partially dissociable mechanisms for language learning: a segmenting "parts before the whole" mechanism, and a suprasegmental "whole before the parts" mechanism. Both these learning mechanisms are necessary for normal language, but for a variety of reasons (matura-tional; environmental; perhaps temperamental) some children rely more on one than the other in the early stages of language learning.

In this paper, we will present evidence on the composition of early vocabulary in 195 Italian children between 8 and 16 months of age, compared with 659 English children in the same age range from the Fenson et al. study described above. These two languages vary along some critical dimensions, with particular empha-sis on the relative salience of nouns and verbs. Hence they provide a fair test of the idea that children can learn nouns or verbs in any order, depending on their language input (Gopnik & Choi, 1990, in press). Our results will show that this hypothesis is incorrect, at least for English and Italian. We will demonstrate a strikingly similar distribution of word types in young children exposed to these two languages, characterized by the dominance of nouns and the virtual absence of verbs and adjectives up to the point where total vocabulary exceeds 100 words. Based on these results, we will propose as a working hypothesis for future cross-linguistic research that there may be a universal sequence in development from nouns to verbs, a sequence that cannot be reversed by variations in the relative salience of nouns and verbs in the input lan-guage.

There are several theoretical and empirical argu-ments in favor of the proposed noun-verb sequence in its current form, including the following.

Theoretical arguments for the early appearance of nouns include a proposed constraint on early word learning that Markman (1989) calls the "Whole Object Constraint," referring to a (presumably innate) tendency for young children to assume that new words will refer to whole objects, and not to subparts of that object (e.g., the bunny's ears), or to the actions or changes of state in which the objects participates (e.g., hopping, holding still, or nibbling a carrot). For obvious reasons, the Whole Object constraint would favor the early acquisition of nouns over other lexical items.

Theoretical arguments for the late appearance of verbs were first outlined in detail by Gentner (1982), who claimed that the semantic structures underlying verbs and other predicates are inherently more complex and open-ended than the structures which define noun meaning (see also Clark, 1983). O'Grady (1987) pro-vides some related arguments based on the logical underpinnings of nouns and verbs. He points out that nouns are typically used as arguments or "primaries", referring to some entity or class of entities. Verbs and adjectives are most often used as predicates or "secondaries", to indicate an action, state or relationship involving one or more arguments or "primaries". If a child is capable of using verbs and adjectives in a meaningful way, then s/he must have some kind of nominal argument in mind, i.e., "the thing that is pretty", "the thing that is gone", "the thing that is possessed". For this reason, acquisition of verbs and adjectives cannot proceed until the child has mastered enough nouns to support predication.

These theoretical arguments presuppose a trans-parent mapping from Objects --> Arguments --> Nouns, complemented by an equally transparent mapping from Actions/States --> Predicates --> Verbs. In fact, the correlations presupposed by this analysis are far from perfect (e.g. nouns can be used to set up a relationship, as in "Mary is an engineer", and verbs can be used to name an activity or event, as in "Mary loves to dance"), and may be even weaker in the language produced by young children (e.g., "Hot" may be used a name for all hot objects-see Methods). Furthermore, even if it could be shown that children first use verbs to express predication, it does not follow that the noun arguments of those verbs have already been expressed in speech. Such arguments could be indicated through pointing, or presupposed with no explicit signal of any kind (cf. Goldin-Meadow & Mylander, 1985; Greenfield & Smith, 1976; Iverson, Capirci, & Caselli, 1994; Volterra & Caselli, 1986).

Although there are flaws in the logical argument for a noun-verb sequence, empirical support is readily available for children acquiring English as their native language. For example, Bates, Marchman et al. (1994) have reported that lexical verbs comprise less than 5% of total vocabulary for English-speaking children with vocabularies under 50 words, compared with approxi-mately 14% for children with vocabularies of 500 words or more (see also Bates et al., 1988). In other words, lexical verbs are relatively late, at least in this language. However, the relative lateness of verbs does not necessarily imply an early bias towards nouns. As noted earlier, some children do produce a very high proportion of names for common objects during the first phase of word learning, but others start out with a heterogeneous set of "non-nominals" that includes proper names, routine expressions (e.g., "Hi" and "Night-night"), and functors that serve a specific social purpose (e.g., the word "up", uttered while extending the arms to be picked up, or the words "more", "mine!" and "no" to accompany requests, demands and re-fusals). Because there is so much variation in the ratio of nouns to non-nominal expressions during the first stages of lexical development, the evidence to date for English-speaking children may be best interpreted as "early verb avoidance" rather than "early noun pre-ference."

This brings us to a consideration of cross-linguistic evidence against the hypothesized noun-verb sequence. The proposal that verbs are universally late has come under fire in some interesting and provocative papers by Gopnik and Choi (1990; in press). The thrust of this work is well illustrated by the title of their most recent paper: "Names, relational words and cognitive develop-ment in English- and Korean-speakers: Nouns are not always learned before verbs." In both papers, Gopnik and Choi present some detailed logical and empirical arguments against the notion that all children neces-sarily acquire nouns before verbs. They suggest that this proposed universal may be an epiphenomenon of two methodological facts: (1) the fact that most studies have been based on English, and (2) the fact that so many studies have been based on parental report and/or on data collection in cultural contexts that emphasize object naming. They present data from free speech and parental interviews for a small sample of children ac-quiring Korean as their native language, showing that most of the words that these children use in the early stages of lexical development are non-nominals (broad-ly defined), including a higher proportion of verbs (narrowly defined) than one sees in English-speaking children observed under similar conditions. This differ-ence between English and Korean is explained by a combination of linguistic and cultural differences, with particular emphasis on the fact that Korean is an SOV language (placing the verb in a salient position), with extensive use of subject and object omission (which means that verbs are often the only content word in sentences spoken to small children). In other words, they have proposed that cross-linguistic differences in the salience of verbs can influence the order in which these categories emerge during the first stages of language learning (although it is worth noting that their evidence for early verbs in children is actually based on results for non-nominals, broadly defined to include lexical verbs together with other non-nominal cate-gories; we will return to this point later, comparing verbs with the broader non-nominal category in our own data).

We ourselves are quite sympathetic to cross-lin-guistic research, and to the argument that linguistic input can have a powerful impact on language and (perhaps) cognitive development (see MacWhinney & Bates, 1989). Indeed, in an earlier study we used parental-report instruments to investigate the growth of nouns, verbs and other categories in 14-month-old Italian- and English-speaking children, and found what looked like evidence in favor of a cross-linguistic difference in the age at which verbs begin to appear (Bates, Caselli, & Casadio, 1990). However, that study had methodological limitations that led to our present comparative analysis, with rather different results.

We have persisted in this comparison of English and Italian, because we believe that these languages display striking contrasts in the relative salience of nouns and verbs (albeit somewhat different contrasts from those that Gopnik and Choi report for English and Korean). As such, English and Italian represent a strong test of the hypothesis that cross-linguistic differ-ences in the input to children can affect the order in which nouns and verbs are acquired. This claim rests on several interacting contrasts between English and Italian, briefly summarized as follows.

(1) Word order variation. The order Subject-Verb-Object (SVO) is preserved quite rigidly in Eng-lish, compared to most of the world's languages. To be sure, there are some alternative word orders in English, including object-fronting in Wh-questions (e.g. "Who did you see?"), and auxiliary fronting in yes-no questions (e.g. "Are you coming?"). Since questions are plentiful in the speech addressed to children, this means that English children are exposed to some variation in basic SVO. Nevertheless, deviations from SVO are far more common in Italian. In fact, all possible orders of subject, verb and object can and do occur in informal Italian speech, in all sentence types (i.e., declaratives, interrogatives and imperatives). In conversations among Italian adults, SVO is still the predominant order (i.e., the most frequent order, and the one used in pragmatically neutral conditions). How-ever, Bates (1976) has shown that the speech addressed to young Italian children is evenly divided between subject-initial and subject-final forms (when subjects are expressed at all-see below). Hence Italian infants receive even more word order variation than we would expect based on analyses of adult speech, due perhaps to the fact that word order variation is used for empha-sis and clarification, two very common functions in child-directed speech.

(2) Subject omission. English is a language in which subjects are obligatory in free-standing declara-tive sentences, including empty subjects like the "it" in "It is raining." This property (which English shares with some of the other Indo-European languages, e.g., German and French) contrasts with so-called "pro-drop" or null-subject languages like Spanish, Chinese or Italian, where subjects are only mentioned if they are new and/or emphatic information (and not mentioned at all in constructions like "Piove" - "(It) is raining"). Bates (1976) reports that subjects are omitted approxi-mately 70% of the time in informal conversation among adults, and in speech to very young children. This means that verbs are the first element in the sentence a high proportion of the time. It also means that verbs constitute a much higher proportion of the content words that Italian children hear, compared with their English counterparts.1

(3) Clitic pronouns. English uses full pronouns, in all sentence positions. By contrast, Italian distinguishes between phonologically strong full pro-nouns (which are always used for contrast or emphasis) and phonologically weak clitic pronouns (the default way of referring to an element that is already established in discourse). Clitic object pronouns must occur immediately before the verb in a declarative or interrogative form (as in "Lo voglio" - "It (I) want"), forming an SOV or (when subjects are omitted) an OV structure. Because such sentences are very common in the speech addressed to young children, this means that the input to Italian children often approximates the input to children who are learning an SOV language (e.g., Korean).

(4) Agreement marking. Verbs carry a relatively small number of agreement contrasts in English com-pared with other Indo-European languages. For example, the English present indicative conjugation contains only contrast (eat vs. eats), compared with six contrasts in Italian, one for each person and number. As a result, the English verbs carries very little information about the identity of the subject (i.e., "Who did what to whom"); by contrast, Italian listeners can and do rely on agreement contrasts to assign sentence roles (Devescovi, D'Amico, Smith, & Bates, 1993; MacWhinney, Bates, & Kliegl, 1984).

Because of the first three properties (i.e., rigid word order, obligatory subjects and full pronouns), the verb is almost always located somewhere in the middle of the sentence in English, where it is hardest to hear and hardest to remember. And because of the fourth property (minimal agreement marking), the verb also carries less information in English than it does in many other languages. Putting these facts together, we should not be surprised to find that verbs are a relatively late acquisition for English-speaking children. By contrast, verbs are very salient in Italian, and especially salient in the speech addressed to small Italian children. Hence Italian ought to provide a good test of the robustness and cross-linguistic generality of the noun-to-verb shift.

To confirm or disconfirm the proposed noun-verb sequence in English and Italian, we need to quantify the categories "noun" and "verb". Can we assume that adult definitions of "noun" and "verb" are appropriate for use with small children? Bates, Camaioni and Volterra (1975) have noted that children often start their linguistic careers with words like "Vroom" that are difficult to classify into adult part-of-speech categories (see also Ninio, 1993, 1994). Indeed, it could be argued that none of the adult part-of-speech categories are present in the first stage of language learning (cf. Tomasello, 1992; Ninio, 1994). One-year-olds often use words in ways that deviate from adult patterns of usage for the very same word, e.g., the "Hot" example described above (Volterra, 1979; Volterra & Caselli, 1986). Because there is no necessary one-to-one relationship between adult part-of-speech categories and the way that words are used by individual children, it is fair to ask whether anything can be learned from a study of vocabulary composition that relies on adult classifications.

Our solution to this problem is to analyze the child's reported vocabulary from the point of view of adult part-of-speech categories. An item is classified as a "noun", "verb" or "function word" if that item func-tions as a noun, verb or function word in the adult language. However, we do not assume that children in this age range necessarily represent or recognize categories like noun, verb, or grammatical function word. Instead, we treat adult part-of-speech categories as independent variables. That is, we treat these categories as a summary of the child's linguistic input, including differences between Italian and English in the phonological, semantic, morphological and syntactic properties of nouns and verbs. To the extent that children treat nouns differently from verbs, content words differently from grammatical function words, and so forth, we can assume that they have been affected by these differences. Hence developmental sequences in the acquisition of part-of-speech types may be taken to reflect changes in the child's ability to deal with these types. They do not necessarily reflect the emergence of explicit or implicit categories like noun or verb from the child's point of view. In the same vein, cross-linguistic differences in vocabulary compo-sition (with vocabulary size held constant) can be taken to reflect the child's sensitivity to variations in the nature of the input language. They do not necessarily reflect cross-linguistic differences in the "moment" (if there is a moment) at which such categories emerge in the mind of the child.

METHOD

Subjects

English-speaking subjects were 659 infants be-tween 8-16 months of age whose parents participated in a large norming study of the Macarthur Communicative Development Inventories, conducted in San Diego, Seattle and New Haven (Fenson et al. 1993, 1994). As described in more detail in Fenson et al. (1993), the sample represents families across the socioeconomic spectrum within the United States, but is dispro-portionately weighted toward families with a high-school education or better (i.e., 79% of the mothers report at least 12 years of formal education). The sample was screened (based on a parental questionnaire about family and medical history) to exclude children with serious medical problems, neurological deficits and/or mental retardation. Children with significant exposure to languages other than English were also excluded from the sample. There were at least 30 boys and 30 girls in each one-month age group. For a detailed breakdown by sex and age, see Table 1.

TABLE 1:
AGE BY GENDER BREAKDOWN FOR EACH LANGUAGE GROUP

                  ENGLISH           ITALIAN

Age in months  Male    Female    Male    Female

     8          32       33       11        8

     9          37       32        4       10

     10         35       32       10       11

     11         38       46        7       13

     12         45       41       16       18

     13         40       36        7        8

     14         42       42        7       10

     15         32       33       15       12

     16         33       30       13       15


   TOTAL       334      325       90      105

There were 195 Italian subjects spread across the 8 - 16-month age range, with approximately equal distri-bution of males and females at each age level (i.e., a chi-square test for distribution of males vs. females across the 9 age levels failed to reach significance-see Table 1 for a breakdown of the Italian sample by age and sex). With one exception, the children and their families were all residents of cities in the central to northern regions of Italy (52% from central regions, including Rome; 48% from the northern areas of Italy). Parents filled out a basic information sheet based on the questionnaire used in the MacArthur norming sample for English, including information on sex and birth order, the child's medical history, exposure to lan-guages other than Italian, and parental education and occupation. The same exclusionary criteria were adopted for both the English and the Italian samples. Similar to the U.S. sample, a full spectrum of socio-economic groups were represented among the 195 families in the study, but the sample was dispro-portionately weighted toward families in which the parents had a high-school education or better.

Materials

The English-language version of the CDI:Infants is composed of two sections. Part I is a checklist of 396 words that are among the first to appear in the vocabularies of young English-speaking children. Next to each word, the parent is asked to indicate if the child (a) understands that word, and (b) understands and produces that word. The checklist is divided into 19 semantic categories: sound effects (e.g., "moo", "vroom"), animal names, vehicle names, toys, food items, articles of clothing, body parts, furniture, house-hold objects, outside things and place to go, people (including proper nouns), routines and games (e.g., "peekaboo"), verbs (called "action words" on the CDI form), words for time, adjectives (called "descriptive words" on the CDI form), pronouns, question words, prepositions, and quantifiers. All forms are presented in their "citation form" (e. g., verbs are listed as stems); Part II is a checklist of 63 communicative and/or symbolic gestures that also develop in this age range. Data for Part II of the Infant Scale will not be discussed in this paper.

As described in more detail by Fenson et al. (1994), the English and Italian versions of the MacArthur Communicative Development Inventories have been developed in tandem across a twenty-year period, from open-ended interviews used in both countries (e.g., Bates, Benigni, Bretherton, Camaioni & Volterra, 1979; Bates, Camaioni & Volterra, 1975) through various intermediate versions of increasing specificity, with more and more reliance on recognition memory in a checklist format. The content of the Italian MacArthur CDI reflects many years of parent report, home obser-vation and laboratory studies on the acquisition of Italian, but the current version is modelled as closely as possible on the English version in terms of overall format, number and type of lexical categories, and number of items. The Italian word checklist includes 401 words divided into the same 19 semantic categories listed above for English. For all content word categories (from sound effects through adjectives), the number of items within each category is identical in English and Italian. The small difference in number comes entirely from function word categories that are necessarily longer in Italian (e.g., four forms of the definite article instead of one).

Procedure

Parents of the children in the U.S. norming study were contacted through local doctors' offices and day care programs, and were asked to complete the inven-tory and accompanying family history questionnaire, and return both in a postage-paid envelope. Approxi-mately 30% of the families that were contacted completed and returned a questionnaire. (Unfortu-nately, we have no way of knowing whether there are demographic differences between this sample and those families who declined to participate in the project.) For English, where the scale is printed on machine-scan-nable forms, all returned questionnaires were first subjected to visual analysis (to assure that parents had followed directions and forms were appropriate for machine scanning), and then scored mechanically.

In the Italian study, families were contacted through area pediatricians and staff members within the national health care system for each region. The questionnaire was accompanied by a letter to the families and the family and medical history question-naire. Approximately 50% of the families who were contacted completed and returned the questionnaire (we have no information on the demographic characteristics of families who declined to participate). The Italian data were entered into the computer manually, within a scoring program developed for this purpose.

In both English and Italian, the forms provided an opportunity for parents to write in comments or add additional words, but these word additions were not used in any of the counts described below, for either language. This was also true for the proper noun category. For example, the child would be given a score of 1 item if the parents checked the word for "grandfather", but no additional credit was given if the parent wrote in names for two different grandfathers.

Data Reduction

Following the procedures outlined by Bates, Marchman et al. (1994), we calculated a series of proportion scores to examine the composition of production and comprehension vocabulary, as follows:

(1) Percent nominals, broadly defined to include common nouns, proper nouns, places to go, and sound effects that are often used by young children to refer to animals and objects;2

(2)Percent common nouns, more stringently de-fined to include only words from the adult language that stand for concrete objects (animal names, vehicles, toys, food and drink, clothing, body parts, furniture and rooms, small household objects); the nominal categories excluded from this count are names for people, sound effects, and "places to go";3

(3)Percent proper nouns and terms for people;4

(4)Percent sound effects (e.g., "vroom", "meow");

(5)Percent routines, a heterogeneous category that includes social words like "hi" and "bye", daily routines or events like "breakfast" or "nap", and familiar com-mands like "don't";

(6)Percent verbs;

(7)Percent adjectives;

(8)Percent predicates (verbs and adjectives com-bined);5

(9)Percent grammatical function words, which includes pronouns, question words, prepositions and quantifiers.6

In calculating these scores (and in the original design of the Italian and English scales), every effort was made to use comparable criteria for word classi-fication in each language. We also tried to stick closely to adult-based classifications, consistent with our claim that adult part-of-speech categories should be treated as independent variables in the analysis of child language. Nevertheless, because the adult categories themselves are idealizations, we were forced to make some arbi-trary (and arguable) classifications in many cases, especially for routines, sound effects and other marginal categories (i.e., items that one cannot simply look up in a dictionary). There are a few items that ended up in different categories for English vs. Italian, even though they have a similar function in each language. For example, the word "uh-oh" is classified under sound effects in English, while its nearest equivalent "bum" is classified as a routine in Italian. In the same vein, the word "allgone" was classified as an adjective in English (because of the sentence contexts in which it is typically used by adults), while its equivalent in Italian (a frozen phrase "Non c'è più"-"There is no more") was classified among the routines.7 All of these classi-fications are probably best viewed as approximations, so that our quantitative results (based on category proportion scores) must be considered together with the qualitative item analyses presented later.

RESULTS AND DISCUSSION

We will begin with a global comparison of age-related changes in vocabulary comprehension and production in the two language groups. Then we will examine similarities and differences between English and Italian in the composition of vocabulary, with children grouped according to vocabulary size rather than age. This is the section that will provide our most important test of the hypothesis that verbs can develop before nouns in a language that renders verbs particu-larly salient. Finally, we will undertake a qualitative comparison of the first 100 words acquired by English vs. Italian children, in production and in compre-hension. This analysis will not change our conclusions about the late onset of verbs, but it will clarify and illustrate some item-specific effects that can emerge as a consequence of linguistic and cultural differences.

Age-related changes in vocabulary size

To determine whether the two groups differ in rate of vocabulary development from 8 - 16 months, we carried out two separate 2 (language) by 9 (age in months) between-group analyses of variance, for total comprehension and total production, respectively.

In the analysis of word comprehension, there was a significant main effect of age (F(8,853) = 65.05, p .0001), but no main effect of language group (F(1,853) = 0.485, n.s.) and no interaction between language group and age (F(8,853) = 1.86, n.s.). There is indeed substantial growth in receptive vocabulary size across this age range, from a mean of 36 words at 8 months, to 86 words at 12 months, to 191 at 16 months. This developmental increase in word comprehension is illus-trated in Figure 1 (although there was no significant effect of language, we have graphed data for each group separately, for comparison with the data on production).


Figure 1

There were also large individual differences in rate of growth for word comprehension, in both English and Italian. Across the two groups, the mean number of words comprehended was 105, but the standard devia-tion was 86.5, and we observed a range from 0 to 396-in other words, from no words at all to all of the words on the list! The mean was quite similar for both language groups (105 for English vs. 107 for Italian), as was the standard deviation (86.5 for English, 83.3 for Italian). Hence these two populations are quite comparable in size, rate of growth, and amount of variation observed for word comprehension.

By contrast, the analysis of word production did reveal a consistent difference in rate of growth for English vs. Italian. There was a significant main effect of age (F(8,853) = 33.12, p .0001), but also a signi-ficant effect of group (F(1,853) = 15.75, p .0001), and a significant age by group interaction (F(8,853) = 3.37, p .001). Figure 1 shows that Italian children lag behind the English group in total vocabulary size at most ages, although the difference is most evident at the highest levels. At 8 months of age, the average score for English children is 1.8 words, compared with 1.1 for Italian. By 12 months, the English average is 10 words, compared with an Italian mean of 9. The groups move further apart at 15 months, when the mean for English is 48 words compared with 22 for Italian, and further still at 16 months, where the corresponding numbers are 64 and 27. There are substantial individual differences in rate of growth within each language group, but the variance is larger in the English sample (English standard deviation = 39.39; Italian standard deviation = 18.12).

This apparent cross-linguistic difference in rate of growth and range of variation for vocabulary produc-tion may be an artifact of differences in sample size. As Figure 1 clearly indicates, the data for vocabulary production are characterized by floor effects across the period from 8 - 16 months, with many children who produce almost no speech at all. In a situation like this, where the most of the sample is still "hugging the floor", there is nowhere for the variance to go but up. As sample size increases, we increase the probability of picking up children with exceptionally high scores, but we cannot pick up additional children with exception-ally low scores because one cannot go any lower than zero (and zero is not an unusual score). Our sample includes 195 Italians-a very large number within the literature on child language, but a much smaller number than the 659 subjects that we have for English. Thus, our probability of picking up exceptionally high scores is approximately 3.4 times greater in English than it is in Italian; our probability of picking up exceptionally low scores is also 3.4 greater in English, but 3.4 times zero is still zero. As a result, there is an inevitable upward skewing of the mean as sample size goes up. In view of this statistical confound, we suspect that our data do not support a true "English advantage" in productive vocabulary. However, because there is a consistent group difference in vocabulary size, all the analyses that follow will be based on language level rather than age, so that English and Italian children are matched for total vocabulary.

Cross-linguistic and developmental differences in vocabulary composition

For analyses of word production, children were divided into six groups based on their total vocabulary size: (1) zero words, (2) 1 - 5 words, (3) 6 - 10 words, (4) 11 - 20 words, (5) 21 - 50 words, and (6) more than 50 words. A chi square test confirmed that the two language groups are evenly balanced across these six vocabulary levels. For analyses of word compre-hension, children were also divided into six groups based on vocabulary size, as follows: (1) 0 - 20 words, (2) 21 - 50 words, (3) 51 - 100 words, (4) 101 - 150 words, (5) 151 - 200 words, and (6) more than 200 words. Again, a chi square test confirmed that the English and Italian groups are evenly balanced across vocabulary levels.

Table 2 reports mean proportion scores in produc-tion for each language and vocabulary group; Table 3 contains the corresponding subgroup means in comprehension. For each modality (production and comprehension), each of these proportion scores was analyzed within a Language by Vocabulary Level analysis of variance (2 ¥ 6 for analyses of compre-hension; 2 ¥ 5 for analyses of production, excluding children with zero words for obvious reasons). We will start below with analyses of word production (following Bates, Marchman et al., 1994, for English), and then move on to analyses of word comprehension (which have not been presented for English or Italian in any of our previous work).

TABLE 2: COMPOSITION OF PRODUCTION VOCABULARY
AS A FUNCTION OF LANGUAGE AND VOCABULARY SIZE

Variable          Language     Number of Words in Production Vocabulary:

                               1-5     6-10    11-20    21-50    >50      Total

% Nominals        English      80.4    75.4     70.7     72.8    73.6      75.5
                  Italian      91.0    84.5     74.7     70.4    72.6      82.0

% Common Nouns    English      16.4    22.9     32.9     44.5    54.1      30.2
                  Italian      20.6    28.1     31.4     36.5    46.2      28.8

% People          English      32.3    24.8     16.5     10.7     6.5      21.1
                  Italian      35.1    35.4     19.6     16.9    11.1      27.6

% Sound Effects   English      30.7    27.1     20.4     15.5     7.7      22.7
                  Italian      35.3    21.0     23.6     16.5    11.8      25.3

% Routines        English      14.2    19.5     18.7     15.0     9.1      15.5
                  Italian       7.8    13.3     20.7     20.1    16.4      13.8

% Verbs           English       0.5     1.1      2.1      2.7      6.8      2.1
                  Italian       1.3     0.7      2.8      4.1      4.5      2.3

% Adjectives      English       1.7     0.5      2.7      4.3      4.8      2.5
                  Italian       0.0     0.4      0.5      1.0      2.1      0.5

% Predicates      English       2.2     1.5      4.8      7.0     11.6      4.6
  (Verb + Adj)    Italian       1.3     1.0      3.3      6.0      6.6      2.8

% Function Words  English       3.2     3.6      5.8      5.2      5.4      4.4
                  Italian       0.0     1.1      1.3      3.5      4.0      1.4


Changes in the composition of expressive vocabulary

As Table 2 shows, the category Nominals (broadly defined) accounts for 75.5% of total production voca-bulary in the English sample, compared with 82% in Italian. Under the broad definition of non-nominals discussed by Gopnik and Choi, this means that 24.5% of vocabulary falls into the non-nominal category for English children from 8 - 16 months, compared with 18% for Italian. So, contrary to predictions based on verb salience, there is actually a slight advantage for nominals in the Italian group, reflected in a significant main effect of language (F(1,674) = 10.49, p .001). There was also a main effect of vocabulary size (F(4,674) = 9.54, p .0001), with no significant inter-action (F(4,674) = 1.97, p .10). The main effect of vocabulary size actually reflects a slight drop in percent nominals, as other classes start to grow (e.g., verbs, adjectives, and closed-class words-see below). The source of the small Italian advantage on nominals is made clear by analyses looking at subtypes within the Nominal category.

When analyses were carried out over the more restricted category of Common Nouns, we found a large main effect of vocabulary level (F(5,674) = 54.20, p .0001), but no main effect of language (F(1,674) = 0.02, n.s.), and no interaction (F(5,674) = 1.80, n.s.). Note first that common nouns occupy a much smaller pro-portion of total production vocabulary than nominals more broadly defined: an average of 30.2% for English (compared with 75.5% for all nominals), and 28.8% for Italian (compared with 82% for all nominals). How-ever, in line with results described by Bates, Marchman et al. (1994), common nouns also occupy an increasing proportion of total vocabulary as the lexicon expands, from 17.4% in children with only 1 - 5 words, to 52.7% in children with more than 50 words. In this respect, our results for Italian exactly mirror previous reports for English on the disproportionate representation and growth of object names in the first phases of lexical development. Figure 2 illustrates these developmental changes in the common-noun category, for each lan-guage. The flat dotted line around 45% is included to illustrate the proportion of the entire checklist composed of common nouns. As discussed by Bates, Marchman et al.(1994), this line indicates the "checklist baseline". If acquisition of nouns were a constant or random process across this period of development, with no change in noun bias, then common-noun proportion scores should hover around 45% at every level of development, for both languages. Obviously this is not the case. In both groups, common nouns are under-represented in the early stages but increase in importance as vocabulary expands.


Figure 2

We also looked at individual variation around the mean for common-noun scores, to see whether Italian children display the same "referential-to-expressive" dimension of variation that has been described pre-viously for English (Bates et al., 1988; Bates, March-man et al., 1994; Nelson, 1973). Figure 3a presents the mean percent common-noun scores at each vocabulary level for English, together with scores for children who are 1.28 standard deviations above and below the mean (parametric values that approximate the top and bottom 10th percentiles-Bates, Marchman et al., 1994); Figure 3b presents the corresponding values for Italian. It is clear from these figures that the same develop-mental and stylistic factors are at work in each lan-guage. That is, nouns increase in importance across this developmental range for both groups, but there is also considerable variation in "nouniness" within each group.


Figure 3a

Figure 3b

So where does the slight Italian advantage on nominals come from? Table 2 includes all language by vocabulary level scores for two further subsets from the global nominal category, proper nouns and sound effects. Analysis of variance revealed a significant decrease in proper noun proportion scores as a function of vocabulary level (F(4,674) = 26.49, p .0001), from an average of 33% in children with 1 - 5 words, to 7% in children with more than 50 words. The interaction was not reliable (F(4,674) = 0.45, n.s.), but there was a small main effect of language (F(1,674) = 5.31, p .022), reflecting slightly more production of names for people in the Italian sample. As we shall see later in the item analyses, this finding may reflect cultural differ-ences in the number and proximity of grandparents and other family members.

By contrast, there was no group difference in the analysis of sound effects (F(1,674) = 0.50, n.s.), nor was there a significant interaction (F(4,674) = 0.79, n.s.), but there was a significant main effect of voca-bulary level (F(4,674) = 17.17). As we just saw for proper nouns, sound effects are another category that decreases in relative size as vocabulary grows. For children with 1 - 5 words, the sound effect category constitutes 32% of total vocabulary on average, compared with 8.5% in children with more than 50 words. This confirms the oft-reported finding that sound effects are (like proper names) among the first nominals produced by one-year-old children. However, these items decrease in importance as "true" common nouns take off-a finding that holds for both language groups.

Routines like "bye" and "pattycake" form another category that decreases in relative size as vocabulary expands. Indeed, just as proper nouns and sound effects constitute the "starter set" within the nominal category, these routines can be viewed as a "starter set" in the category of non-nominals. Analysis of variance re-vealed a significant main effect of vocabulary level (F(4,674) = 5.58, p .0001), no main effect of language (F(1,674) = 1.28, n.s.), but a reliable albeit small interaction between language and vocabulary size (F(4,674) = 3.13, p .02). In both language groups, the development of routines is characterized by an inverted U-shaped function, with an initial increase in relative size followed by a decline. However, the shape of this function is slightly different for English and Italian children. The English children reach a peak proportion score of 19.5% for children with 6 - 10 words, falling to 9.1% for children with more than 50 words. The Italian children reach a peak of 20.7% for children with 11 - 20 words, falling to 16.4% for children with more than 50 words. We may speculate that the protracted emphasis on verbal routines in Italian is one more reflection of cultural differences in adult-child interaction-a tend-ency to "show off" to the relatively large number of aunts, uncles and cousins that fill the lives of small Italian children. But it may also reflect decisions about where to put early acquired items like "allgone" and "uh-oh". Cross-linguistic differences as small as the ones reported here can be caused by a very small number of items, a point that will be clear in the qualitative analysis of first words presented later on.

This brings us to the critical analysis for competing claims regarding the emergence of nouns and verbs. As Table 2 shows, verbs occupy a very small proportion of total expressive vocabulary in both the English and the Italian sample. The overall mean is 2.06% for English and 2.27% for Italian. Analysis of variance revealed no significant main effect of language (F(1,674) = 1.28, n.s.), and no significant interaction (F(4,674) = 1.91, n.s.). But there was a large main effect of vocabulary size (F(4,674) = 24.08, p .0001), reflecting a gradual increase in the contribution of verbs to total vocabulary size, from less than 1% in children with 1 - 5 words to 6.4% in children with more than 50 words. This developmental change is illustrated in Figure 2 for both languages, on the same figure with the scores described above for common nouns (to facilitate comparison). Note also that all the verb proportion scores fall well short of the percentage that verbs contribute to the checklist as a whole. That is, because verbs represent approximately 14% of the whole checklist, a hypo-thetical child who develops by selecting randomly across categories would average somewhere around 14% across the first stages of lexical development. Instead, verbs constitute far less than 14% of voca-bulary throughout this period.

Before we accept the conclusion that nouns de-velop before verbs in both English and Italian, we need to consider a possible confound: The list as a whole contains far fewer verbs than nouns. Perhaps children are developing equally fast in both categories, but verb development is undersampled by the checklist. As Bates, Marchman et al. (1994) point out, the dispro-portionate numbers of nouns and verbs on this list is no accident. The final form of the MacArthur Communicative Development Inventory was developed over a 20-year period, with several revisions of the word checklist based on parental feedback and laboratory observations. There are approximately three times as many nouns as verbs on the final checklist because (1) this ratio is representative of the adult language, and (2) this ratio is also reflected in diary studies, laboratory observations, and in previous studies using versions of the word checklist. Nevertheless, following Bates, Marchman et al. (1994), we also carried out an analysis of the percentage of all noun and verb opportunities that were checked by parents of children at each vocabulary level. That is, we cal-culated a "noun opportunity score" (dividing the com-mon nouns reported for each child by 182, the total number of common nouns provided on the list) and a "verb opportunity score" (dividing the verbs reported for each child by 55, the total number of verbs provided on the list). These scores were analyzed in a multi-variate analysis of variance comparing noun vs. verb opportunity scores (hereafter Word Type) as a function of language and vocabulary level. Word type was treated as a within-subjects variable; all other factors were between groups.

Results of this analysis are illustrated in Figure 4. All main effects and interactions were statistically reliable: Language (F(4,665) = 21.80, p .0001), Vocabulary Size (F(4,665) = 82.03, p .0001), Word Type (F(4,665) = 381.95), Language by Vocabulary Size (F(1,665) = 13.65, p .0001), Word Type by Vocabulary Size (F(4,665) = 89.54, p .0001), and a three-way interaction (F(4,665) = 4.15, p .003). Several conclusions can be drawn from these complex results. First, we should note that children produce a very small percentage of the available items across this range of development, for both nouns and verbs. Hence there is little danger that ceiling effects on the checklist have skewed our results in some way. Second, some English children are filling up the various noun and verb opportunities more quickly than their Italian counterparts, although this is only true for children with more than 50 words. This finding reflects the upward skewing due to larger sample size that we discussed in our analyses of raw scores for word production. That is, the sample of 659 English-speaking children contains more children with extremely high scores than the sample of 195 Italians. Despite these differences, it is clear that the common-noun category is "filling up" faster than verb category in both language groups, even though this analysis has equated the two word types for the number of items available on the checklist.


Figure 4

We submit that the underrepresentation of verbs in very young children reflects a fundamental develop-mental fact: Lexical verbs do not develop until common nouns are a well-established component of the emerg-ing lexicon. Our data suggest that this sequence is not affected by crucial cross-linguistic differences in the salience and availability of verbs.

A similar conclusion holds for adjectives, although we do find some small cross-linguistic differences here. As Table 2 shows, adjectives are very rare in the expressive vocabulary of 8- to 16-month-old children, constituting 2.5% of total vocabulary for the whole English sample, and only 0.5% of total vocabulary for the Italian group. Analysis of variance revealed a main effect of language (F(1,674) = 11.82, p .001), a main effect of vocabulary size (F(4,674) = 7.85, p .001), with no significant interaction (F(4,674) = 0.92, n.s.). As we shall see later in our qualitative analysis of first words, the American advantage on adjectives is due almost entirely to 2 - 3 specific words.

Following Bates, Marchman et al. (1994), we also combined verbs and adjectives into a single predicate category. As Table 2 shows, this category constitutes 4.6% of total vocabulary for the full English sample, compared with 2.8% for Italian. Analysis of variance revealed a large and reliable main effect of vocabulary size (F(4,674) = 27.71, p .0001), a small but reliable main effect of language (F(1,674) = 3.96, p .05), with no significant interaction. It is already clear from the analyses above that this small cross-linguistic difference comes from a few adjectives that are acquired early by English-speaking children.

We end these analyses of word production with the heterogeneous category of grammatical function words, known to be relatively rare in the first stages of lan-guage learning. As Table 2 shows, grammatical function words constitute 4.4% of total vocabulary for 8- to 16-month-old children acquiring English, compared with 1.37% for children acquiring Italian. Since function words constitute approximately 9% of the items on each checklist, this means that function words are indeed underrepresented in the vocabularies of 8 - 16-month-old English and Italian children (i.e., this result cannot be viewed as an artifact of the list itself). Analysis of variance revealed a significant main effect of language (F(1,674) = 11.17, p .001), a significant main effect of vocabulary size (F(4,674) = 2.47, p .044), with no significant interaction (F(4,674) = 0.33, n.s.). In fact, there is very little growth in the function word category across this period of develop-ment. On average, scores increase slightly from 2.4% in children with only 1 - 5 words, up to 5.2% in children with more than 50 words. As Bates, Marchman et al. (1994) have pointed out for English, the early use of function words bears no predictive relationship to the subsequent emergence of grammar. And as we shall point out here for Italian and English children, the first function words are restricted almost entirely to words that can be used alone in single-word commands and routines (e.g., "Mine!"). The small cross-linguistic difference favoring English comes from a subset of these items.

To summarize so far, there are far more similarities than differences in the composition of early productive vocabulary in these English- and Italian-speaking children. Above all, there is no evidence whatsoever for an Italian advantage in verbs (narrowly defined) or non-nominals (broadly defined). Instead, both groups provide evidence for a developmental sequence from reference to predication, reflected in the early onset and growth of nouns and the paucity of verbs and other predicates. Let us now turn to the composition of receptive vocabulary, to see whether the Gopnik and Choi hypothesis of cross-linguistic differences in the sequencing of nouns and verbs receives more support from what these children know about language, as opposed to what they actively do with the words they know.

Changes in the composition of receptive vocabulary

Previous studies comparing the early stages of comprehension and production have concluded that verbs play a much larger role in early receptive language (Bates et al., 1979; Benedict, 1979; Goldin-Meadow, Seligman, & Gelman, 1976). However, as Margaret Harris (1993) has also pointed out, this conclusion is subject to some methodological criti-cisms. In particular, adults tend to collect evidence for early word comprehension from the child's response to commands (e.g., "Spit it out!" or "Get the book!'"), a fact which may weight their estimates heavily toward the acquisition of action verbs. It is also clear from Harris' review that the proportion of verbs reported for early receptive vocabulary tends to be greater in studies that rely on the adult's recall memory (i.e., informal diaries or open-ended questions), and smaller in studies that rely on adult recognition memory (e.g., checklists like the one employed here). In the present study, parents used the same checklist to report word comprehension and production. This means that they can rely heavily on recognition memory. The side-by-side comparison of comprehension and production may bias them to use similar criteria in the assessment of each-a fact which may be an advantage or a dis-advantage, depending on one's view. In any case, if there is a verb advantage in reports of word com-prehension, we may be in a position to see cross-linguistic differences that did not emerge in our data on word production.

Table 3 presents the nine proportion scores for word comprehension, for each of the language and vocabulary-level groups in this 2 ¥ 6 design. Starting with the class of Nominals (broadly defined), Table 3 reports an overall mean of 61.8% for English and 63.8% for Italian. Although this is smaller than the overall means of 75.5% and 82% described above for word production, it is clear that nominals constitute the largest category of word types in both modalities. However, the developmental patterns uncovered for comprehension and production do differ in some respects. Analysis of variance revealed a small but significant main effect of language (F(1,843) = 4.08, p .05), but no main effect of vocabulary size (F(5,843) = 0.94, n.s.), and no interaction (F(5,943) = 1.30, n.s.). In other words, we don't find the same developmental decrease in the nominal category that we uncovered in our analyses of word production.

TABLE 3: COMPOSITION OF RECEPTIVE VOCABULARY
AS A FUNCTION OF LANGUAGE AND VOCABULARY SIZE

Variable          Language     Number of Words in Receptive Vocabulary:

                               1-20    21-50   51-100  101-150  151-200  >200    Total

% Nominals        English      60.4    62.6     61.2     63.2     61.4    61.8    61.8
                  Italian      66.8    67.0     63.7     64.2     60.4    60.7    63.8

% Common Nouns    English      22.0    37.2     44.2     48.6     47.5    48.4    41.5
                  Italian      31.7    44.5     47.0     49.2     47.8    48.2    45.2

% People          English      31.5    14.0      8.2      5.4      4.7     4.1    11.1
                  Italian      29.4    14.8      8.6      6.4      4.8     4.0    10.9

% Sound Effects   English       6.2     8.8      5.2      4.8      4.3     3.7     5.6
                  Italian       5.1     7.0      5.4      4.5      3.7     3.5     4.9

% Routines        English      27.3    18.6     13.1      9.8      8.6     6.6    14.1
                  Italian      25.2    18.7     12.5      9.9      8.4     6.4    13.3

% Verbs           English       6.8    10.0     14.6     15.0     16.6    16.0    13.2
                  Italian       6.9    10.8     17.6     17.7     20.0    17.5    15.3

% Adjectives      English       2.8     5.2      6.7      6.9      7.2     7.8     6.1
                  Italian       0.5     1.6      3.0      4.4      5.5     6.5     3.6

% Predicates      English       9.7    15.2     21.3     21.9     23.9    23.8    19.3
  (Verb + Adj)    Italian       7.4    12.4     20.6     22.1     25.5    24.1    19.0

% Function Words  English       2.5     3.6      4.1      4.9      5.8     7.1     4.5
                  Italian       0.6     1.9      3.2      3.5      5.4     8.0     3.7

Turning to the more restricted category of common nouns, scores were actually larger for comprehension than they were for production, averaging 41.5% for English (compared with 30.2% in production) and 45.2% for Italian (compared with 28.2% in production). Analysis of variance on the common-noun scores in comprehension revealed a significant main effect of language (F(1,843) = 10.23, p .001), reflecting slightly larger common-noun proportion scores for Italians (again, the opposite of predictions based on verb salience). There was also a significant main effect of vocabulary size (F(5,843) = 70.86, p .0001), and a small but reliable interaction (F(5,843) = 2.43, p .04). Figure 5 shows that common nouns make up an increas-ing proportion of total receptive vocabulary in this period of development, from 24.2% in children with fewer than 20 words to 48.3% in children with between 100 - 150 words (common-noun proportion scores hold steady after this point). The initial rise in common nouns starts somewhat earlier for children acquiring Italian, but the basic pattern is the same. Note also that the proportion of common nouns in comprehension actually "overshoots" the checklist ceiling (i.e., the dotted line indicated the 45% contribution of common nouns to the checklist as a whole). In other words, common nouns are overrepresented in children with receptive vocabularies between 50 and 200 words.


Figure 5

We also examined growth within two small subsets of the broad nominal category, proper nouns and sound effects. In contrast with our analysis of word pro-duction, we found no evidence for a cross-linguistic difference in comprehension of proper nouns (F(1,843) = 0.04, n.s.), nor did language interact with vocabulary size (F(5,843) = 0.39, n.s.). However, we did find the expected developmental decrease in proper-noun proportion scores (F(5,843) = 186.04, p .0001), with scores dropping from an average of 31% in children with under 20 words to 4% in children with more than 200 words. In line with our analyses of word produc-tion, we also found no cross-linguistic difference in the proportional representation of sound effects in word comprehension (F1,843) = 2.07, n.s.), and no significant interaction (F(5,843) = 0.56, n.s.), but there was (as expected) a significant effect of vocabulary size (F(5,843) = 15.99, p .0001), reflecting an overall drop in the contribution of sound effects from around 26% in children with fewer than 20 words to 3.6% in children with more than 200 words.

In our earlier analyses of word production, routines constituted another category that decreases sharply with development, leading us to propose that routines are the "starter set" for non-nominals, while proper names and sound effects serve as the "starter set" for nominals. The same finding appeared in our analyses of word comprehension. There was a significant main effect of vocabulary size (F(5,843) = 114.2, p .0001), reflect-ing the fact that routines comprise around 27% of receptive vocabularies in children with 20 words or less, compared with an average of 6.5% in children with more than 200 words. In contrast with the small Italian advantage that we found for word production, there was no main effect here for language (F(1,843) = 0.45, n.s.), and no interaction (F(5,843) = 0.26, n.s.).

The main question here revolves around the pro-portional representation of verbs in early word compre-hension. Table 3 shows that verbs are still a relatively small category, compared with nominals (broadly defined) or common nouns (narrowly defined). Over-all, verbs constitute 13.2% of the items reported for English, compared with 15.3% for Italian. Analysis of variance revealed a significant main effect of language (F(1,843) = 11.73, p .001), and a significant main effect of vocabulary size (F(5,843) = 44.36, p .0001), with no interaction (F(5,843) = 0.78, n.s.). The developmental data for verbs in comprehension are illustrated in Figure 5 (plotted separately for each language), where we also plotted the data for comprehension of common nouns. It is clear that verbs do increase markedly across this range of development, from approximately 6.8% of total receptive vocabulary in children with 20 words or less, to 16.3% in children with more than 200 words. In fact (in contrast with our findings for production), verb proportion scores rise above the checklist baseline for children with more than 50 words in their receptive vocabulary. It is also clear from Figure 5 that Italians are ahead by 1 - 2 percentage points in verb comprehension across this period of development. Hence we do find a small version of the predicted Italian advantage on verbs in our data for comprehension, although a difference this small could be explained by two or three salient words (see item analyses below). It is also clear that the verb class is always much smaller than the common-noun class, for both English and Italian children, even in comprehension.

Similar results are obtained for the adjective category, except that this time the advantage goes to English-speaking children. Overall, adjectives account for 6.1% of total vocabulary in English, vs. 3.6% in Italian. This is higher than the averages reported for production, but it is still very low. Analysis of variance revealed a significant main effect of language favoring the Americans (F(1,843) = 73.05, p .0001), a signi-ficant main effect of vocabulary size (F(5,843) = 33.07, p .0001), with no significant interaction (F(5,843) = 1.64, n.s.). The vocabulary size result reflects an increase from 2.8% in children with under 20 words to 7.8% in children with more than 200 words. The American advantage of approximately 2 percentage points at each development level is contributed (as we shall see below) by a handful of specific words.

When the verb and adjective categories are added together, we find that the combined predicate category accounts for 19.2% of comprehension vocabulary, on average, compared with 43% for common nouns and 62% for all nominals. Analysis of variance on predicate scores revealed no significant main effect of language (F(1,843) = 0.84, n.s.), and no significant interaction (F(5,843) = 0.96, n.s.). But there was a significant main effect of vocabulary size (F(5,843) = 68.18), reflecting a clear increase in the relative size of the predicate class as the receptive lexicon expands from under 20 words (mean = 9.1%) to over 200 words (mean = 23.8%).

Finally, Table 3 summarizes results for the category of grammatical function words in compre-hension. In line with our findings for production, these words constitute only 4.3% of receptive vocabulary across this age range, from a low of 2% in children with fewer than 20 words to a high of 7.3% in children with more than 200 words. The overall average was 4.5% for English, 3.7% for Italian. Analysis of variance revealed a small but reliable effect of language (F(1,843) = 5.55, p .02), a significant main effect of vocabulary size (F(5,843) = 17.53, p .0001), with no significant interaction. Hence there is a small advan-tage in this category for children acquiring English, and a small but still reliable increase in the proportional representation of function words as a function of receptive vocabulary size.

We conclude that the data for comprehension and production both provide evidence for a universal se-quence from nouns to verbs and adjectives. This pattern holds up in both languages, and it holds up across modalities (even though verbs and adjectives do emerge earlier and grow faster in comprehension than production). The few cross-linguistic differences that we do find in vocabulary composition always involve differences of 1 - 3 percentage points-a difference that could be contributed by a few salient items. This brings us to a qualitative analysis of the first 100 words acquired within each language, in production and in comprehension.

Item analyses

Table 4 presents the first 50 words in the expressive vocabularies of American and Italian children, where "first acquired" is defined in terms of the percentage of children in the sample who were reported to produce that word.8 These percentages are collapsed over age levels, but it is worth pointing out that Fenson et al. (1994) report extremely high correlations between this criterion for the assessment of age of acquisition, and a different criterion based on the age at which 50% of the sample is reported to use the word. In Table 4, the words are listed in order of acquisition within each language group. Within each list, words that would qualify as non-nominatives according to the broad criteria used by Gopnik and Choi (in press) are marked with an asterisk (regardless of how those words were classified on the MacArthur list). Those non-nominatives that qualify as lexical verbs in the adult language are also marked with an asterisk, and presented in capital letters.

TABLE 4: FIRST 50 WORDS IN PRODUCTION
FOR ENGLISH VS. ITALIAN INFANTS

                 ENGLISH                         ITALIAN

Rank   Word       % sample       Word            Translation      % sample

1.     Daddy       54.9          mamma           mommy             49.7
2.     mommy       52.9          papa            daddy             46.7
3.   * bye         43.1          bau-bau         (dog sound)       41.5
4.   * hi          39.3        * pappa           (food/mealtime)   36.9
5.   * uh-oh       35.5          nonna           grandma           32.8
_____________________________________________________________________________

6.     Baa-baa     31.9          brum-brum       (vehicle sound)   28.7
7.     Ball        30.9          acqua           water             27.2
8.     dog         30.6          nonno           grandpa           23.1
9.   * no          28.5        * nanna           (sleep/bedtime)   21.5
10.    bottle      25.2        * no              no                21.5
_____________________________________________________________________________

11.    woof        24.9          miao            (cat sound)       21.0 
12.    baby        24.6        * grazie          thanks            20.5
13.  * yum-yum     24.1        * ciao            hi/bye            17.9
14.    grr         23.5        * cuccu-settete   (hiding game)     16.9
15.    kitty       21.8          palla           ball              16.4
16.    vroom       20.2          muuh            (cow sound)       15.9
17.    book        19.9        * non c'è più     (is no more)      14.9
18.    bird        19.6          scarpe          shoes             14.4
19.    duck        18.8          coccode         (rooster sound)   13.8
20.    balloon     18.4          beh-beh         (sheep sound)     12.8
21.    cat         18.2          (child's own name)  --            12.8
22.  * night-night 17.1          ih-oh           (donkey sound)    12.8
23.    quack       17.0          bimbo           child             12.3
24.    shoe        17.0        * pronto          (hello on phone)  11.8
25.    meow        16.6        * bum             boom              11.3
26.    banana      16.3          grr             (lion sound)      10.8
27.  * hot         16.0          qua-qua         (duck sound)      10.8
28.    Juice       15.4          (babysitter's name)  --           10.3
29.    Eye         14.8          cip-cip         (bird sound)      10.3
30.    Grandma     14.3        * si              yes               10.3
31.    moo         14.2          tuttu           (train sound)     10.3
32.  * thank-you   14.0        * zitto           hush/quiet        10.3
33.  * up          14.0        * (fare) popo/pipi (make) pee/poo   9.7
34.    cookie      13.5          clop-clop       (horse sound)     9.2
35.    Nose        13.5        * bua             hurt/owie         8.7 
36.  * ouch        13.4        * (dare) toto     (give) spanking   8.2
37.    Cracker     12.3        * mio             my/mine           8.2
38.    grandpa     12.3          pane            bread             8.2
39.  * shh         12.0          biscotto        cookie            7.7
40.    bath        11.8          cane            dog               7.2
41.    keys        11.8          ciuccio         pacifier          7.2
42.    Bubbles     11.4          zio             uncle             7.2
43.  * down        11.4          latte           milk              6.7
44.    car         11.2          orologio        watch/clock       6.7
45.  * yes         11.0          zia             aunt              6.2
46.    cheese      10.9          banana          banana            5.6
47.    bear        10.7        * basta           enough/stop       5.6
48.  * hello       10.6          bambola         doll              5.1
49.    Fish        10.4        * DARE            to give           5.1
50.  * allgone     10.3          gatto           cat               5.1
51.    Hat         10.3          mela            appole            5.1
For English, the first five words reported in produc-tion are "daddy" (54.9%), "mommy" (52.9%), "*bye" (43.1%), "*hi" (39.3%) and "*uh-oh" (a sound effect that is difficult to classify but typically occurs when things fall down or break - 35.5%). None of these are common nouns by our criteria, and none of them are verbs, although three (especially "*uh-oh") would qualify as non-nominals with a "verb-like" status using the criteria described by Gopnik and Choi (in press). For Italian, the first five words are "mamma" (49.7%), "papa" (46.7%), "bau-bau" (a sound effect for dogs - 41.5%), "*pappa" (an all-purpose word for food or mealtime - 36.9%) and "nonna" (for grandmother - 32.8%). None of these are common nouns by our criteria, and none are verbs. All five of these words were classified as nominals on the MacArthur list (and in the analyses over children), because "*pappa" was listed with food words. If we use the Gopnik and Choi criteria and reclassify "*pappa" as a non-nominal, it is still the case that nominals dominate among the first five words in Italian. Note also that the first five words already indicate a greater use of words for people in the Italian sample, since "grandma" ranks up at the top with "mommy" and "daddy" on the Italian list.

Moving down to the next five words, the English favorites are "baa-baa" (a sound effect for sheep, although some families reportedly use this sound for dogs - 31.9%), "ball" (30.9%), "dog" (30.6%), "*no" (28.5%) and "bottle" (25.2%). If we follow Gopnik and Choi and take "*uh-oh" out of the sound effect category (where it was counted broadly as a nominal) and put it into the category of non-nominal routines, then we have an expressive lexicon that breaks down into 50% nominals (20% names for people, 10% sound effects and 30% common nouns), and 50% non-nominals (with no lexical verbs). Moving down the corresponding list for Italian, the second set of five words includes "brum-brum" (a sound effect for vehicles - 28.7%), "acqua" (the word for water - 27.2%), "nonno" (for grandfather - 23.1%), "*nanna" (a general term for sleeping and for naps - 21.5%) and "*no" (which means exactly what it means in English - 21.5%). Hence the first ten words in Italian break down into 70% nominals (broadly defined using the Gopnik and Choi criteria), including 40% words for people, 20% sound effects and 10% common nouns. There are only three non-nominals ("*no", "*pappa" and "*nanna"), constituting only 30% of the list, and no lexical verbs.

To summarize so far, it is clear lexical verbs do not appear among the first ten words produced by English or Italian children. And if we adopt the very broad criteria proposed by Gopnik and Choi, treating all non-nominals as verbs, then we actually find a small disadvantage in the non-nominal category for Italians-although this difference probably has more to do with the salience of words for people in the Italian culture (i.e., the first 10 words for Italian include "mommy", "daddy", "grandma" and "grandpa", while the first 10 words for English are restricted to "mommy" and "daddy"). However, this does not mean that children begin their lexical careers by naming objects. Indeed, most of the items in the Top Ten for English and/or Italian are not common nouns. We will return to this point in the final discussion.

We can obtain a more stable picture of vocabulary composition by considering the first 50 words in each language (Table 4).8 In English, this list includes no lexical verbs whatsoever. In Italian, the corresponding list contains exactly one lexical verb "*DARE" ("to give"). This word was used by only 5.1% of the sample, which means a total of 10 children out of 195. Hence we may conclude that lexical verbs are sorely lacking in the first stages of language development, in both English and Italian. If we follow Gopnik and Choi (1990) and enlarge our definition to include all non-nominals, then we find a total of 15 non-nominal words for the English sample (29.4% of the list). This includes two adjectives ("*hot" - 16%; "*allgone" - 10.3%), and two grammatical function words ("*up" - 14%; "*down" - 11.4%). On the Italian side, we find a total of 17 non-nominals among the first 50 words (33% of the list). This includes the one verb mentioned above, no adjectives, 13 items that fall under the heterogeneous category of routines, and one gram-matical function word ("*mio", or "mine", used by 8.2% of the sample).

We went on to examine the next 50 words on the list for English and Italian, to see whether conclusions based on the first 50 words would change markedly in any direction. In English, the first lexical verbs finally do appear when the list is expanded to 100 items, including "*GO" (7.5% of the sample), "*SEE" (5.8%), "*EAT" (5.6%) and "*BITE" (5.3%). Still, this means that verbs constitute only 4% of the first 100 words acquired by English-speaking children. In Italian, four additional verbs appear when the list is expanded to 100, including "*PIANGERE" ("to cry" - 2.6%), "*DONDOLARE" ("to rock" - 2.1%), "*MANGIARE" ("to eat" - 2.1%) and "*APRIRE" ("to open" - 2.1%). This brings the total number of verbs in Italian to five, constituted 5% of the first 100 words. In other words, the emergence of lexical verbs continues to follow a parallel course in English and Italian.

It should be clear from these qualitative analyses that the very small cross-linguistic differences reported earlier for adjectives and function words must be treated with caution. In every case, they are due to fewer than three items, a result that could shift in either direction if items were moved from one category to another. The main results to emerge from these item analyses come from the larger and more robust categories (i.e., common nouns and lexical verbs). In this regard, the item analyses support our conclusions in the earlier analyses over children: Verbs are relatively late in both English and Italian, common nouns outnumber verbs and other predicates at every point in this age range, and common nouns increase in importance at the expense of those hard-to-classify items that dominate in the very first stages of lexical development (e.g., routines, sound effects, names for people).

In the same spirit, Table 5 presents the first 50 words reported for comprehension, in English and Italian.8 The numbers are considerably larger and more stable in this case, since comprehension gets off the ground much earlier than production (another universal that withstands a cross-linguistic test.....). But the item analysis complements and confirms what we have already learned from our analyses over subjects, for both comprehension and production.

TABLE 5: FIRST 50 WORDS IN COMPREHENSION
FOR ENGLISH VS. ITALIAN INFANTS

                 ENGLISH                         ITALIAN

Rank   Word       % sample       Word            Translation      % sample

1.     mommy       95.0          mamma           mommy             91.3
2.     daddy       93.5          papa            daddy             88.2
3.   * bye         88.6          (child's own name)   ---          82.6
4.   * no          86.3        * ciao            hi/bye            82.6
5.   * peekaboo    84.3        * pappa           (food/mealtime)   81.1
_____________________________________________________________________________

6.     bath        76.2        * cuccu-settete   (hiding game)     81.0
7.     ball        75.0          acqua           water             79.0
8.     bottle      75.0        * no              no                77.9
9.   * hi          74.0          palla           ball              75.9
10.  * allgone     71.9          bau-bau         (dog sound)       75.4
_____________________________________________________________________________

11.    dog         70.8          nonna           grandma           75.4
12.    book        68.7          cane            dog               74.9
13.  * night-night 68.5          biberon         bottle            71.8
14.    diaper      67.4          telefono        telephone         70.3
15.  * KISS        66.2        * bravo           good              67.7
16.  * uh-oh       65.1          nonno           grandpa           66.7
17.  * pattycake   62.6          scarpe          shoes             66.2
18.    juice       61.9          biscotto        cookie            65.6
19.    shoe        61.9        * BERE            to drink          65.1
20.    baby        61.6          miao            (cat sound)       64.6
21.    grandma     61.3          latte           milk              64.1
22.    outside     61.0        * nanna           (sleep/bedtime)   63.6
23.    car         60.1          mano            hand              63.1
24.  * EAT         59.7        * basta           (enough/stop)     62.6
25.    kitty       58.8        * pane            bread             62.6
26.  * DRINK       58.1        * (fare) bagno    (have/do) bath    62.1
27.    keys        56.3        * gatto           cat               62.1
28.  * DON'T       55.8          bimbo           child             60.5
29.    comb        55.4        * DARE            to give           59.5
30.    nose        55.4        * MANGIARE        to eat            59.5
31.  * HUG         54.9          piede           foot              59.0
32.    banana      54.4        * BACIARE         to kiss           59.0
33.    cookie      54.2        * BALLARE         to dance          59.0
34.    bathtub     53.2          automobile      car               57.9
35.    balloon     52.9        * non c'è più     (is no more)      56.9
36.    milk        52.9          panolino        diaper            56.9
37.    cat         52.7        * si              yes               56.9
38.    Cracker     52.7          bavaglino       bib               56.4
39.    telephone   52.6          capelli         hair              56.4
40.  * yes         52.6          bocca           mouth             55.9
41.    cheerios    51.4          bicchiere       glass             54.9
42.    bird        50.4          uccellino       bird              54.4
43.  * yum-yum     50.4          passegino       stroller          53.8
44.    grandpa     50.1        * pronto          (hello on phone)  53.3
45.    woof        49.5          ciuccio         pacifier          51.3
46.  * DANCE       49.3          letto           bed               51.3
47.    baa-baa     49.0          naso            nose              50.3
48.    meow        48.3          televisione     television        49.7
49.  * LOOK        48.2        * ANDARE          to go             49.2
50.    mouth       48.2          cucchiaio       spoon             49.2
51.                            * PETTINARE       to comb           49.2
52.                            * SALUTARE        to greet          49.2

Starting with the Top Ten Hits in comprehension for each language group, the breakdown for English is as follows: "mommy" (95%), "daddy" (93.5%), "*bye" (88.6%), "*peekaboo" (84.3%), "*no" (86.3%), "bath" (76.2%), "bottle" (75%), "ball" (75%), "*hi" (74%) and "*allgone" (71.9%). Following the criteria proposed by Gopnik and Choi, this corresponds to 60% non-nominals, although none of these items are lexical verbs. The corresponding list for Italian is "mamma" (91.3%), "papa" (88.2%), the child's own name (82.6%), "*ciao" (which means both "hi" and "goodbye" - 82.6%), "*pappa" (a general word for food or mealtime - 81.1%), "*cuccu-settete" (a word used in hide-and-seek - 81%), "acqua" ("water" - 79%), "*no" (77.9%), "palla" ("ball" - 75.9%) and "bau-bau" (a sound effect for dogs - 75.4%). According to the Gopnik and Choi criteria, this corresponds to 40% non-nominals. Again, none of these items are lexical verbs. Hence the data for comprehension in English and Italian are consonant with our findings for production: no lexical verbs, very few common nouns, with a preponderance of routines, sound effects and names for people.

Surveying the top 50 words in comprehension for each language, we find a total of six lexical verbs for English ("*KISS" - 66.2%; "*EAT" - 59.7%; "*DRINK" - 58.1%; "*HUG" - 54.9%; "*DANCE" - 49.3%; "*LOOK" - 48.2%). If we also include the word "*DON'T" (which was classified among routine expressions on the MacArthur list), this brings the total to seven verbs, or 14% of the list. Hence lexical verbs are still a minority in compre-hension, but (in line with results by other investigators) there are more verbs in comprehension than we found in production. Applying the criteria proposed by Gopnik and Choi, we can classify 17 of the first 50 words in comprehension as non-nominals (i.e., 34%, compared with 29.4% in production).

Looking at the corresponding list for Italian, we find a total of eight lexical verbs in Italian (15.4% of the list). These include "*BERE" ("to drink" - 65.1%), "*DARE" ("to give" - 59.5%), "*MANGIARE" ("to eat" - 59.5%), "*BACIARE" ("to kiss" - 59%), "*BALLARE" ("to dance" - 59%), "*ANDARE" ("to go" - 49.2%), "*PETTINARE" ("to comb" - 49.2%) and "*SALUTARE" ("to greet" - 49.2%). Using broad criteria to define the class of non-nominals, the total for Italian is 20 items, or 38.5% of the list (compared with 33% for production).

If we move on to examine the next 50 words in comprehension, the verb category does show some proportional increase. Briefly, we find a total of 18 verbs among the first 100 items for English (18% of the list), compared with 24 verbs among the first 100 items for Italian (24% of the list). Hence, as we saw in our analyses over children, there are indeed more verbs in early comprehension than we find in early production (cf. Benedict, 1979; Harris, 1993). But nominals are still the dominant class for both comprehension and production as soon as children move beyond their first 5 - 10 words, and common nouns expand more rapidly than verbs as the list grows. There does appear to be an Italian advantage for verbs in comprehension when we consider the first 100 words, but the difference is still very small.

SUMMARY AND CONCLUSION

In this cross-linguistic study of early lexical devel-opment, the following conclusions emerge.

(1) If we restrict our attention entirely to the first 5 - 10 words in each language, we find a preponderance of sound effects, routines, and names for people, with a few common nouns scattered in. These are the very items that are hardest to classify in adult part-of-speech categories. In other words, the very first words produced by English and Italian children are neither nouns nor verbs-a point to which we shall return below.

(2) When we expand our developmental window to include the first 50 - 100 words, the overwhelming majority are nominals (broadly defined); even when we restrict the definition of nominals to common nouns (i.e., names for classes of concrete objects), we cannot escape the conclusion that nouns predominate and grow sharply (in proportion to other items) across the first stages of lexical development, for both English and Italian.

(3) Verbs are very rare among the first 50 - 100 words produced by young children, even in a language like Italian in which verbs are particularly salient and informative.

(4) Verbs emerge earlier and grow faster in comprehension than they do in production, but they are still outnumbered by common nouns at every point across this period, in both English and Italian.

(5) Adjectives and grammatical function words are also very rare among the first 50 - 100 words acquired by English and Italian children, in both comprehension and production.

As we pointed out in the introduction, all of these analyses have been carried out from the point of view of adult part-of-speech categories. We do not assume (and should not assume) that these categories are somewhere in the mind of the child. Rather, we treat adult cate-gories like noun and verb as independent variables, a short-hand way of referring to semantic, grammatical and/or phonological differences in the input to which small children are exposed. Verbs differ from nouns in all languages, along many different semantic, gramma-tical and phonological dimensions (including prosody). To the extent that children treat nouns differently from verbs in the first stages of lexical development, we may infer that they have been influenced by these differ-ences in their linguistic input. That is, there is some-thing "in the child" that responds differently to nouns and verbs, something which must be due to the characteristics that distinguish nouns from verbs (and from adjectives and function words) in the adult language.

At the same time, verbs ought to be far more salient for Italian children. Had we seen any differences between English and Italian children in the emergence and growth of major lexical categories, we would be justified in concluding that children are influenced by differences in the relative salience of these categories in their respective input languages. We did find differ-ences in order of emergence (suggesting that verbs are somehow more difficult than nouns), but we did not find any differences between the two languages in the onset and growth of major lexical categories. The robustness of this noun-verb sequence despite striking cross-linguistic differences in the input lead us to propose that this sequence may be a developmental universal.

However, we should also stress that common nouns (i.e., object names) are relatively rare in the first 5 - 10 words produced by English and Italian children. It would, therefore, be incorrect to argue that all children begin their lexical careers with object naming. We agree with Gopnik and Choi (and many other investigators in the field-e.g., Bloom, Hafitz, & Lifter, 1980; Nelson, 1981; Tomasello, 1992) that non-nominals play an important role in early lexical development. But how should that role be defined? We think it is a mistake to confuse the broad category of non-nominals used by 8 - 16-month-old children with true predication, i.e., with the explicit postulation of a state that exists in one entity (i.e., single-argument predicates) and/or a relation that exists between two or more entities (i.e., n-place predicates). The hetero-geneous set of non-nominals that appear in Tables 4 and 5 are hard to place in any clear-cut part-of-speech category-which is why we have been forced to use non-technical categories like "sound effects" and "routines" that do not appear in any grammar or dictionary of the adult language. In earlier work by members of our research team (e.g., Bates, Camaioni, & Volterra, 1975; Bates, Benigni, Bretherton, Camaioni, & Volterra, 1979), we proposed that the referential function of words may emerge gradually from verbal routines that are best viewed as speech acts or performatives, vocal conventions that children use in particular situations to achieve some social function. Some of these vocal routines happen to be nouns in the adult language, others (rarely) might be verbs or adjectives, while others (like "hi" and "uh-oh") defy classification. The important point is that these words are best construed as actions, language games, things that children do with speech in a familiar and well-structured context (see also Barrett, 1982; Dromi, 1987; Lieven & Pine, 1990; Nelson & Lucariello, 1985; Ninio, 1993 & 1994; Tomasello, 1992).

From this point of view, we should not be at all surprised to find that children pick up the highest-frequency games and routines that are proposed to them by adults in this period of development. Indeed, some of the item-specific differences between English and Italian that we have uncovered here seem to fall into this category. For example, the first 50 words produced by English-speaking children include an adjective "hot", reported for 16% of the sample. The adjective "caldo" (which means "hot") does not appear among the first 100 words for Italian. American mothers often use the single word utterance "HOT!" to warn infant away from stoves, lightbulbs and other dangerous items (and, as we have already noted, the word "hot" is often used by English-speaking children as a name for bright and/or hot objects). In the same context, Italian mothers are more likely to respond with "SCOTTA!" ("Burns!"), or simply "NO NO!". Hence these two "equivalent" items are differentially available to children learning English vs. Italian. In the same vein, it is interesting that "nonna" ("grandmother") and "nonno" ("grandfather") are among the first 10 words for Italian children, while the corresponding items rank 30th and 38th, respectively, for children acquiring English. This "linguistic" phenomenon finds a ready explanation in the fact that Italian families are more likely to live in the same city with grandparents and other relatives, resulting in regular visits that foster the early acquisition of names for these family members. Without dwelling further on details of this sort, we will simply conclude that cultural differences do influence the content of infant vocabulary. In this regard, we agree with Gopnik and Choi. Nevertheless, these cultural differences are apparently not sufficient to overcome the developmental progression from nouns to verbs, a shift which is (we assume) motivated by critical differences in the input characteristics of nouns and verbs, and developmental differences in children's sensitivity to these characteristics.

The proposed noun-verb shift must be amended in two respects. First, as we have just noted, children do not necessarily begin with common nouns, but with a heterogeneous class of items that are perhaps best viewed as routines or language games. Second, verbs are relatively late, but so are adjectives and function words. Following O'Grady (1987) and Bates, Marchman et al. (1994), we would like to suggest that verbs and other relational terms are rare within the first 100 words because it is difficult for children to understand the meaning and purpose of "secondaries" (i.e., terms that express predication) before they have a firm grasp of "primaries". (i.e., terms that set up the arguments of a predicate). Hence the proposed universal sequence in the acquisition of lexical cate-gories may reflect changes at the level of underlying meanings, a developmental sequence that may be summarized as Routines -> Reference -> Predication.

This proposed universal is a hypothesis, not a con-clusion, for two reasons: (1) the limited utility of parental report data for the study of semantic change, and (2) other cross-linguistic findings that lead to a different conclusion.

With regard to the first point, a test of the proposed referent-to-predication shift will require detailed infor-mation on the way that children use their words in context. Previous studies of this sort have yielded mixed conclusions regarding semantic change in the second year of life. Some investigators have concluded that there is an important shift late in the one-word stage, from naming and language games to the use of single words to express relational meanings (e.g., DeLaguna, 1927; Howe, 1976). Others have argued that predication is present early in one-word speech, in the way non-nominals and some nominal expressions are used (e.g., Gopnik, 1981; Rescorla, 1981). Based on our findings here, we would predict that further observational evidence will come down on the side of a referent-to-predication shift during the second year. But our own data are silent on this point.

With regard to the second problem, our findings for English and Italian differ from those of Gopnik and Choi for English and Korean (see also Cheng, 1994, whose findings for Chinese parallel those of Gopnik and Choi for Korean). However, these authors also used a different methodology (i.e., free-speech samples complemented by a parental interview). What would happen if our checklist methodology were applied to large samples of Korean children in the same age range? Relevant findings come from an unpublished dissertation by Pae (1993), who adapted the MacArthur Infant Scale for Korean, and administered it to the mothers of 90 Korean infants between 12 and 23 years of age, all residents in middle-class apartment com-plexes in Seoul, Korea. Pae's results for both com-prehension and production were remarkably similar to ours, suggesting that there are few differences among Korean, Italian and English children in the onset and growth of nouns vs. verbs when comparable parental report methods are employed. She concludes that "Contrary to the verb bias argument, the Korean-speaking children in this sample did not show any sign of relative ease in learning verbs" (Pae, 1993, p. ii).

The contrast between Gopnik and Choi's findings based on free speech and those of Pae using parental report lead to the suggestion that cross-linguistic differ-ences in the emergence of non-nominal expressions (especially verbs) may reflect differences in the forms that children prefer to use, and not to differences in the words they are able to use (see also Bates et al., 1988). If this is the case, then we might expect to find differences between English and Italian children in early verb use in a free play situation-a possibility that we are currently pursuing.

Another possible explanation for these conflicting findings revolves around the issue of homophony, i.e., the use of a single sound for two different meanings or functions. The English language is notorious for the flexibility with which words can move back and forth across the noun-verb boundary (Clark & Clark, 1979). In particular, virtually any noun can be used as a verb if the context is set up just right (e.g., "He Richard-Nixoned the tapes before appearing at the Senate hearing...."). Because so many English nouns have corresponding verb homophones (and vice-versa), the form class assignments that we adopted in the MacArthur CDI are perhaps best viewed as a classification into "most typical or frequent word class", as opposed to an absolute classification based on inherent properties of each word. Such denominalized verbs are far less common in Italian, and when they do occur, they involve so much morphological alteration in the shape of the word that the respective noun and verb forms are not homophones at all. Indeed, the line between nouns and verbs on the Italian MacArthur CDI is clearly indicated not only by class membership (i.e., verbs are listed under "action words"), but also by the inflectional form in which the verbs are listed (i.e., in the infinitive, the citation form for Italian verbs). Despite these differences between English and Italian, we have observed the same sequence of acquisition for nouns and verbs. However, it is possible that the results observed to date for Korean and Chinese may revolve (at least in part) around the homophony issue. Like Italian, Korean and Chinese are languages that permit extensive noun omissions and word order variation. However, like English, Korean and Chinese are lan-guages with a high degree of noun-verb homophony. This leads to some interesting possibilities, including the following:

(1) Korean and Chinese children do not acquire verbs earlier than English or Italian children. Rather, they appear to have acquired verbs earlier because they have misclassified adult verbs as nouns. Hence they are using more verbs as nouns (to name objects, or perhaps to name events), and/or they are using more verbs in rote routines.9

(2) Korean and Chinese children really do acquire verbs earlier than English children do. Italian children would do the same if they could, but they are "blocked" from picking up those enticing and salient verbs due to the complex morphology that distinguishes verbs from nouns in their language.

These issues can only be settled by converging methodologies, complementing maternal diary and parental report studies with detailed analyses of the way that infants use their early words in context.

FOOTNOTES

1 Of course the pro-drop parameter is an idealization. Subject omission is obligatory in English imperatives, and subjects are frequently omitted in response to questions (e.g., "What did you do?" -- "Went to the movies"), in some informal conversations (e.g., "Really blew it this time!), and in most of the interviews granted by former President Bush. Never-theless, subject omission in English never comes close to the 70% ratio observed in informal Italian, in either adult- or child-directed speech.

2 The formula [1 - percent nominals] yields a score corre-sponding approximately to the proportion of total vocabulary composed of non-nominals as defined by Gopnik and Choi (in press).

3 The category "places to go" was excluded from the count of common nouns because many of its items function more like adverbials in the adult language.

4 Words like "aunt" that could describe more than one individual were included in "nominals" but excluded from "common nouns"

5 We are using the term "predicate" in as a theory-neutral descriptor at this point, and will return later to the more controversial issue of the meanings that children intend with such words

6 The ambiguous class of "words about time" was excluded from the function word count, although it includes some highly frequent terms like "now".

7 Another cultural difference is reflected in the composition of the two word lists. Words for eliminatory functions (i.e., "pee" and "poo") and words for genitalia were excluded from the English list even though these words are reported for some children in the 8 - 16-month age range. The decision was made because so many parents (and some investigators) expressed embarrassment or discomfort in reporting these items. The Italian team reached a different decision for these particular items, a fact which may be related to cultural differences in comfort level regarding sexuality and bodily functions.

8 The list of words in Table 4 actually includes 51 items for each language, instead of 50, due to the fact that the last two items on each list were tied in rank ("allgone" and "hat" were both produced by 10.3% of the Americans; "dare", "gatto" and "mela" were all produced by 5.1% of the Italians). In the same vein, Table 5 contains 50 words for English but 52 words for Italian, due to a tie in rank for the last four words ("andare", "cucchiaio", "pettinare" and "salutare", all compre-hended by 49.2% of the Italian sample).

9 Some investigators have argued that lexical verbs do predominate in the first signs produced by children exposed to American Sign Language (Launer, 1982), and in the signing systems that emerge among deaf children of hearing parents (Goldin-Meadow & Mylander, 1985). However, this apparent difference between signed and spoken languages may have more to do with the ambiguity of nouns and verbs within the adult language, where names for common objects are some-times derived from the actions that are carried out on those objects. From a slightly different point of view, Volterra (personal communication) has proposed that verbs may be rare among the first words produced by hearing children because they are able to express verb-like meanings in the gestural modality (e.g., drinking from cups, putting a tele-phone receiver to the ear). However, this proposal is also controversial, since many investigators view such gestures as a form of gestural recognition or naming (e.g., Acredolo & Goodwyn, 1988). This is a matter that merits further investigation.

REFERENCES

Acredolo, L.P., & Goodwyn, S.W. (1988). Symbolic gesturing in normal infants. Child Development, 59, 450-456.

Barrett, M. (1982). The holophrastic hypothesis: conceptual and empirical issues. Cognition, 11, 47 - 76.

Bates, E. (1976). Language and context: Studies in the acquisition of pragmatics. New York: Academic Press.

Bates, E., Benigni, L., Bretherton, I., Camaioni, L., & Volterra, V. (1979). The emergence of symbols: Cognition and communication in infancy. New York: Academic Press.

Bates, E., Bretherton, I., & Snyder, L. (1988). From first words to grammar: Individual differences and dissociable mechanisms. New York: Cambridge University Press.

Bates, E., Camaioni, L., & Volterra, V. (1975). The acquisition of performatives prior to speech. Merrill-Palmer Quarterly, 21:3, 205-226. (Reprinted in E. Ochs & B. Schieffelin, Eds., Developmental pragmatics (pp. 111-128). New York: Academic Press, 1979.

Bates, E., Caselli, M.C., & Casadio, P. (1990). A cross-linguistic study of early lexical development using parental report. In C. Rovee-Collier (Ed.), Abstracts of the Seventh International Conference on Infant Studies. Special issue of Infant Behavior and Development , 13, 258. Norwood, NJ: Ablex.

Bates, E., Dale, P.S., & Thal, D. (1994). Individual differences and their implications for theories of language development. In Paul Fletcher & Brian MacWhinney (Eds.), Handbook of child language (pp. 96-151). Oxford: Basil Blackwell.

Bates, E., & Marchman, V. (1988). What is and is not universal in language acquisition. In F. Plum (Ed.), Language, communication and the brain. New York: Raven Press, 19-38.

Bates, E., Marchman, V., Thal, D., Fenson, L., Dale, P., Reznick, S., Reilly, J., & Hartung, J. (1994). Developmental and stylistic variation in the composition of early vocabulary. Journal of Child Language, 21:1, 85-124. (Reprinted in K. Perera, Ed., Growing points in child language. Cambridge: Cambridge University Press, in press).

Benedict, H. (1979). Early lexical development: Comprehension and production. Journal of Child Language, 6, 183-200.

Bloom, L., Hafitz, E., & Lifter, K. (1980). Schematic organization of verbs in child language and the acquisition of grammatical morphemes. Language, 6, 386-412.

Bloom, L., Lightbown, P., & Hood, L. (1975). Structure and variation in child language. Mono-graphs of the Society for Research in Child Development.

Braine, M.D.S. (1976). Children's first word combinations. With commentary by Melissa Bowerman. Monographs of the Society for Research in Child Development, 41, Serial # 164.

Cheng, S.-W. (1994). Beginning words of three children acquiring Mandarin-Chinese. Manuscript in preparation, National Chung Cheng University, Taiwan.

Clark, E. (1983). Meanings and concepts. In J. Flavell and E. Markman (Eds.), Cognitive Development, Volume III of P. Mussen (Ed.), Handbook of Child Psychology. New York: John Wiley, 787 - 840.

Clark, E., & Clark, H. (1979). When nouns surface as verbs. Language, 55, 767 - 811.

DeLaguna, G. (1927). Speech: Its function and development. Bloomington: Indiana University Press.

Devescovi, A., D'Amico, S., Smith, S., Mimica, I., & Bates, E. (1993). The development of sentence comprehension in Italian and Serbo-Croatian: Local versus distributed cues. Manuscript, University of Rome "La Sapienza". (submitted for review).

Dromi, E. (1987). Early lexical development. New York: Cambridge University Press.

Fenson, L., Dale, P.A., Bates, E., Reznick, J.S., & Thal, D. (1994). Variability in early communicative development. Monographs of the Society for Research in Child Development, 59, Serial No. 242, No. 5.

Fenson, L., Dale, P., Reznick, J.S., Thal, D., Bates, E., Hartung, J., Pethick, S., & Reilly, J. (1993). The MacArthur Communicative Development Inven-tories: User's guide and technical manual. San Diego: Singular Publishing Group.

Gentner, D. (1982). Why are nouns learned before verbs: Linguistic relativity versus natural partitioning. In S.A. Kuczaj II (Ed.), Language development: Vol. 2. Language, thought and culture. Hillsdale, NJ: Erlbaum.

Goldfield, B.A., & Snow, C. (1985). Individual differences in language acquisition. In J. Gleason (Ed.), Language development. Columbus: Merrill Publishing Co.

Goldin-Meadow, S., & Mylander, C. (1985). Gestural communication in deaf children: The effects and non-effects of parental input on early language development. Monographs of the Society for Research in Child Development.

Goldin-Meadow, S., Seligman, M.E.P., & Gelman, R. (1976). Language in the two-year-old. Cognition, 4, 189-202.

Gopnik, A. (1981). Development of non-nominal expressions in 1-2 year olds: Why the first words aren't about things. In P. S. Dale & D. Ingram (Eds)., Child language: An international perspective. Baltimore, Maryland: University Park Press.

Gopnik, A., & Choi, S. (1990). Do linguistic differences lead to cognitive differences? A cross-linguistic study of semantic and cognitive develop-ment. First Language, 10, 199-215.

Gopnik, A., & Choi, S. (in press). Names, relational words, and cognitive development in English- and Korean-speakers: Nouns are not always learned before verbs. In M. Tomasello & W. Merriman (Eds.), Beyond names for things: Young children's acquisition of verbs. Hillsdale, NJ: Lawrence Erlbaum.

Greenfield, P.M., & Smith, J.H. (1976). The structure of communication in early language development. New York: Academic Press.

Harris, M. (1993, July). The origins of early lexical comprehension. Paper presented at the Sixth International Congress for the Study of Child Language, Trieste, Italy.

Howe, C. (1976). The meanings of two-word utterances in the speech of young children. Journal of Child Language, 13, 29-47.

Iverson, J., Capirci, O., & Caselli, M.C. (1994). From communication to language in two modalities. Cognitive Development, 9(1), 23-43.

Launer, P.B. (1982). "A plane" is not "to fly": Acquiring the distinction between related nouns and verbs in American Sign Language. Unpublished doctoral dissertation, City University of New York.

Lieven, E., & Pine, J.M. (1990). [Review of From first words to grammar: Individual differences and dissociable mechanisms - Bates, E., Bretherton, I., & Snyder, L.]. Journal of Child Language ,17, 495-501.

MacWhinney, B., & Bates, E. (Eds.) (1989). The crosslinguistic study of sentence processing. New York: Cambridge University Press.

MacWhinney, B., Bates, E., & Kliegl, R. (1984). Cue validity and sentence interpretation in English, German and Italian. Journal of Verbal Learning and Verbal Behavior, 23, 127-150.

Markman, E.M. (1989). Categorization and naming in children: Problems of induction. Cambridge, MA: MIT Press/Bradford Books.

McCarthy, D. (1954). Language development in children. In L. Carmichael (Ed.), Manual of Child Psychology. New York: John Wiley.

Miller, J.F., & Chapman, R. (1981). The relation between age and mean length of utterances in morphemes. Journal of Speech and Hearing Research, 24, 154-161.

Nelson, K. (1973). Structure and strategy in learning to talk. Monographs of the Society for Research in Child Development, 38, (1 & 2, Serial #149).

Nelson, K. (1981). Individual differences in language development: Implications for development and language. Developmental Psychology, 17, 170-187.

Nelson, K., & Lucariello, J. (1985). The development of meaning in first words. In M. D. Barrett (Ed.), Children's single-word speech. New York: Wiley.

Ninio, A. (1993). On the fringes of the system: children's acquisition of syntactically isolated forms at the onset of speech. First Language, 13, 291-313.

Ninio, A. (1994). Words with holes: the acquisition of the predicateness of predicates. Emory Conference on Cognitive and Functional Approaches to Grammatical Development, Emory University, Atlanta, Georgia, January 1994.

Pae, Soyeong (1993). Early vocabulary in Korean: Are nouns easier to learn than verbs? Unpublished doctoral dissertation, University of Kansas.

Pine, J.M., & Lieven, E. (1990). Referential style at 13 months: Why age-defined cross-sectional measures are inappropriate for the study of strategy differ-ences in early language development. Journal of Child Language, 17, 625-631.

Rescorla, L. (1981). Category development in early child language. Journal of Child Language, 8, 225 - 238.

Scollon, R. (1976). Conversations with a one-year-old: A case study of the developmental foundation of syntax. Honolulu: University Press of Hawaii.

Shore, C. (1995). Pathways into language: Individual differences in language development. Thousand Oaks, CA: Sage Publications.

Slobin, D. (Ed.). (1992). The crosslinguistic study of language acquisition (Vol. 3). Hillsdale, NJ: Erlbaum.

Slobin, D. (Ed.). (1985). The crosslinguistic study of language acquisition (Vols. 1 & 2). Hillsdale, NJ: Erlbaum.

Tomasello, M. (1992). First verbs: A case study of early grammatical development. Cambridge [England] ; New York: Cambridge University Press.

Volterra, V. (1979). First words in language and action: a qualitative look. In Bates, E., Benigni, L., Bretherton, I., Camaioni, L., & Volterra, V. (1979). The emergence of symbols: Cognition and communication in infancy. New York: Academic Press, 141 - 182.

Volterra, V., & Caselli, M.C. (1986). First stage of language acquisition through two modalities in deaf and hearing children. Italian Journal of Neurological Science, Supplement 5, Maturation and Learning, 109-115.