Introduction to Phonology 2 (LIN 637)

This course is about phonology, but it is valuable to begin with a broader sense of both the science of language and cognitive science to see how phonology fits into this enterprise.

The scientific study of language is an important part of understanding human cognition. As Chomsky (1965) explains, there are three fundamental goals of cognitive science.

Chapter 1 of Chomsky (1965) explains what these goals mean for the science of language. Following Chomsky and Halle (1968, 3), it is useful to “think of a language as a set of sentences, each with an ideal phonetic form and an associated intrinsic semantic interpretation. The grammar of the language is the system of rules that specifies this sound-meaning correspondence.”

Consequently, the grammars of languages constitute a theory of competence because they encapsulate the knowledge that humans have regarding their language (construed as a set of phonetic forms paired with semantic interpretations). Chomsky (1965, 8–9) elaborates further.

Where do these grammars come from? To some extent, they must be learned because humans learn the language spoken in the community they are raised in. The learning theory for language science must explain how humans come to acquire a particular grammar based on their experience. Chomsky (1965, 7) considers the learning theory to be a centerpiece of language science.

A correct theory of learning would provide a deep explanation for theories of competence. It would provide a single explanation of how humans acquire the various specific “system[s] of rules” that relate pronounced forms to semantic interpretations.

Theories of performance are another important part of language science. These theories seek to explain systematic facts about language use: its production, comprehension, and the behavior we observe every day.

Chomsky is firmly stating that any correct theory of performance will depend on a correct theory of competence. A theory of performance is ultimately a theory of how many different factors interact to produce linguistic behavior. The grammar is but one of these factors. Chomsky and Halle (1968, 3) explain the distinction between competence and performance this way.

The distinction between competence and performance is a fundamental one in language science. It isolates the grammatical factors from non-grammatical ones.

Phonological Knowledge

The aspect of language that phonology addresses is the knowledge speakers have regarding the pronunciation of words and phrases in their language. Systematic facts about the pronunciation of words and phrases in a language are often referred to as the sound pattern of that language.

Aside on Signed Languages

The patterns involved in the production of words and phrases in spoken languages are typically called sound patterns. This has the unfortunate consequence of overlooking signed languages, which also have ‘phonology’ (Brentari 2019). It is more accurate to say that phonology is a theory regarding a systematic set of facts about the way words and phrases are produced and perceived, independent of the modality of the language. Richard Larson likes to think of this as part of the externalization of language. Personally, I believe the term “morphology” is most appropriate (the study of form), but that’s already taken by another subdiscipline of linguistics!

This terminology touches on broader issues affecting the sign language community. Words like ‘speak’, ‘talk’, and ‘speech’ are typically thought of as involving vocalizations. Is it correct to say that users of signed languages speak, talk, or engage in other acts of speech?

In linguistics, it is generally understood that these kinds of words have a more general meaning beyond the one limited to vocalized communication. These more general meanings have yet to diffuse through broader society, but there is an ongoing effort. Having said that, there is also a (gentle) push from sign language users to employ words that are not specific to a particular modality. For example, they may prefer people to say “Mr. Jones uses ASL” instead of “Mr. Jones speaks ASL”.

Linguistic terms like ‘phonetics’, ‘phonology’, ‘phoneme’, and so on also have a narrow meaning and a broader meaning. These words, with the root ‘phon’, can literally be thought of as having to do with sound. Even the word ‘pronunciation’ is narrowly interpreted to refer to sounds, even though it derives from the Latin pro ‘forth, out, in public’ and nuntiare ‘announce’, from nuntius ‘messenger’. These terms arose historically from the primacy of spoken languages in linguistic research.

Nowadays, linguists recognize that signed languages, just like spoken languages, have a phonology and a phonetics too. The modality is different but the basic scientific questions are the same. In sign phonology, we are interested in the mental organization of the units that make up words. In phonetics, we are interested in how signs are perceived and produced.

In this class, when we use terms like ‘speak’ and ‘phonology’, we are using them in their broader sense, and are not just referring to the modality of vocalized communication.

Aspects of phonological knowledge

It is a striking fact that natural languages have sound patterns. I will give examples of sound patterns in a moment in an effort to convince you that there are systematic aspects to your pronunciation, and that every human language exhibits sound patterns. But for now, assume that there is something systematic about the way you pronounce words and phrases in your language. What then are the goals of phonology? It is precisely to find an explanation for this fact.

Not surprisingly, there are three dimensions to this explanation, which match the goals expressed above. Understanding the sound patterns of languages means answering the following three questions.

In service to the above questions, the study of phonology tries to answer the following:

In this course, we mainly address the competence questions above. Answers to these questions help us better understand the competence humans bring to bear on the sound patterns of their language.

With these questions in mind, I would like to now present a brief survey of different kinds of phonological knowledge. Speaking generally, there are three kinds of knowledge regarding the pronunciation of words and phrases in natural language: phonotactic knowledge, knowledge of processes, and knowledge of contrasts. These are all examples of sound patterns that humans have internalized. A fourth type of sound pattern comes from typology: are there recurring tendencies or patterns across languages? In this section, a brief introduction to each type of sound pattern is presented.

Phonotactics

Phonotactic patterns refer to the possible words in a language (Chomsky and Halle 1965; Halle 1978).

English speakers can coin new words. However, they are much more willing to coin the words on the left below than the words on the right.

Possible and impossible English words.
possible new words	impossible new words
flump	flunp
blick	bnick
bist	bizt
slem	srem

It is striking that many native English speakers agree with the division in the table above even though they have zero experience with any of those words. This is an example of systematic, uniform linguistic behavior that requires an explanation.

Assuming agreement among native English speakers, how did they learn to discriminate words they have never heard before in the same way? The answer, of course, is that there is a system of rules or constraints (a grammar) that speakers have internalized and that allows them to classify logically possible words into well-formed and ill-formed groups. This is knowledge that English speakers acquired, since, for example, words like [srem] are perfectly acceptable in other languages. But it is also knowledge which English speakers were never explicitly taught.

With respect to the above case, it would appear that this system of rules is sensitive to the sub-parts of words. All the sub-parts of the words on the left in the table above are well-formed, but that is not the case for the words on the right. English words cannot end with [np] or [zt] sequences, nor can they begin with [bn] or [sr] sequences.

This last sentence is an example of a linguistic generalization, which is a hypothesis about the character of our linguistic knowledge and competence. This hypothesis makes at least two predictions. First, existing English words should not begin or end with the aforementioned sequences. We can examine existing English words to see whether this prediction is true. Second, it predicts that English speakers should avoid coining new words with the aforementioned sequences. This is testable in principle with a behavioral experiment. Native speakers can also conduct this behavioral experiment on themselves with a little bit of introspection.
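The generalization above can be written down as a tiny classifier. The following is a minimal sketch, assuming that the four sequences named in the text are the only constraints; the names BANNED_INITIAL, BANNED_FINAL, and is_possible_english_word are hypothetical, and the words are treated as simple letter strings rather than full phonetic transcriptions.

```python
# Hypothetical constraint inventory: only the four sequences named in the text.
BANNED_INITIAL = ("bn", "sr")   # words may not begin with these
BANNED_FINAL = ("np", "zt")     # words may not end with these


def is_possible_english_word(word: str) -> bool:
    """Return True unless the word begins or ends with a banned sequence."""
    return not (word.startswith(BANNED_INITIAL) or word.endswith(BANNED_FINAL))


possible = ["flump", "blick", "bist", "slem"]
impossible = ["flunp", "bnick", "bizt", "srem"]

# Print each nonce word with the classification the sketch assigns to it.
for word in possible + impossible:
    print(word, is_possible_english_word(word))
```

A real grammar of English phonotactics would need many more constraints than these four. The point of the sketch is only that a finite statement about sub-parts of words classifies words the speaker has never encountered.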

Here is another example of a sound pattern. Below are actual and hypothetical words from Navajo (Sapir and Hoijer 1967).

Possible Navajo Words		Impossible Navajo Words
ʃiːteːʒ	‘we (dual) are lying’	ʃiːteːz
dasdoːlis	‘he (4th) has his foot raised’	dasdoːliʃ
sokos	(hypothetical)	sokoʃ
ʃokoʃ	(hypothetical)	ʃokos
kiːteːp	(hypothetical)
piːteːk	(hypothetical)

Note that [ʃ] is like sh in shoe and [ʒ] is like ge in beige. Can you determine what grammar speakers of Navajo have internalized that allows them to distinguish between these two groups of words?

It clearly has to do with the sounds [s,ʃ,z,ʒ], which are examples of sibilant sounds. In Navajo, words can contain either [s,z] sounds or [ʃ,ʒ] sounds, but not both. As the examples indicate, it is not necessary that the sibilants be adjacent to each other. In fact, they can be separated by many other speech sounds as evidenced by the word [dasdoːlis] ‘he (4th) has his foot raised.’
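The Navajo restriction can be checked mechanically. Here is a minimal sketch, assuming each character of the transcription is one segment (the length mark is ignored because it is not a sibilant); the names S_CLASS, SH_CLASS, and obeys_sibilant_restriction are hypothetical labels chosen for this illustration.

```python
S_CLASS = set("sz")     # the [s, z] sounds
SH_CLASS = set("ʃʒ")    # the [ʃ, ʒ] sounds


def obeys_sibilant_restriction(word: str) -> bool:
    """True if the word does not mix [s, z] with [ʃ, ʒ], at any distance."""
    segments = set(word)
    return not (segments & S_CLASS and segments & SH_CLASS)


possible = ["ʃiːteːʒ", "dasdoːlis", "sokos", "ʃokoʃ", "kiːteːp", "piːteːk"]
impossible = ["ʃiːteːz", "dasdoːliʃ", "sokoʃ", "ʃokos"]

for word in possible + impossible:
    print(word, obeys_sibilant_restriction(word))
```

Because the check looks at the whole word rather than at adjacent pairs of sounds, it captures the observation that the sibilants need not be next to each other.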

The above examples have established one kind of sound pattern: phonotactic knowledge. There are rules and constraints which govern the possible words in languages. Speakers can coin new words, but they cannot coin any arbitrary sequence of speech sounds as a new word. Speakers distinguish logically possible words with which they have had no prior experience. This is the expected behavior of individuals who have internalized a productive and generative system of rules and constraints.

The above examples made a binary distinction between “possible” and “impossible” words. Some question whether this is just a convenient abstraction (Albright and Hayes 2003). It has been argued that well-formedness is gradient. For example, it has been argued that since native English speakers rate [kɪp] as more well-formed than [θwiːks], which they rate as more well-formed than [bzɑrʃk], the phonotactic grammar must likewise be gradient (Hayes and Wilson 2008). This argument has been criticized for mistakenly conflating competence with performance (Gorman 2013; Heinz and Idsardi 2017; Durvasula 2025).
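To make the idea of a gradient grammar concrete, here is a toy sketch in the spirit of a maximum entropy grammar, where each constraint has a weight and a word's value is the exponential of its negated weighted violation count. The constraints and weights below are invented for illustration only. They are not the constraints or weights of any published model, and the value is left unnormalized.

```python
import math

# Hypothetical constraints: penalized two-segment sequences with made-up weights.
WEIGHTS = {"θw": 1.5, "bz": 4.0, "rʃ": 1.0, "ʃk": 2.0}


def penalty(word: str) -> float:
    """Weighted sum of constraint violations for a word."""
    return sum(weight * word.count(seq) for seq, weight in WEIGHTS.items())


def maxent_value(word: str) -> float:
    """Unnormalized maxent value: exp of the negated penalty."""
    return math.exp(-penalty(word))


for word in ["kɪp", "θwiːks", "bzɑrʃk"]:
    print(word, penalty(word), maxent_value(word))
```

With these invented weights, the three words receive strictly decreasing values, which mirrors the ordering of ratings described above. A binary grammar, by contrast, would assign each word only one of two values.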

Whatever the details may be, the central fact of phonotactic knowledge is that speaker-hearers of the same language community uniformly distinguish logically possible words in the same way, at least more or less. What is the nature of this knowledge—what are the rules and constraints that govern this system—and how do children come to learn it?

Processes

Another aspect of phonological knowledge comes from phonological processes. Evidence for phonological processes comes from morphological alternations. Morphemes are the smallest sequences of speech sounds with a particular meaning. Therefore, unlike with phonotactic patterns, the semantic meanings of morphemes play an important role in understanding the evidence for phonological processes.

A morphological alternation is the observation that the same morpheme is pronounced differently in different contexts. The English plural provides a familiar example.

	singular	plural
cat	kʰæt	kʰæts
sack	sæk	sæks
dog	dɔg	dɔgz
grub	gɹʌb	gɹʌbz
dish	dɪʃ	dɪʃəz
lodge	lɔd͡ʒ	lɔd͡ʒəz
pea	pʰi	pʰiz
cow	kʰaʊ	kʰaʊz
man	mæn	mɛn
foot	fʊt	fit
wife	waɪf	waɪvz
whiff	wɪf	wɪfs
…

Ignoring irregular forms like men and feet, it is clear that the regular plural morpheme has three forms: [-s, -z, -əz]. These do not appear to be assigned arbitrarily to nouns. One way to see this is to conduct the following experiment. Which pronunciation goes with which of the following made-up words?

As with the phonotactic patterns above, the fact that English speakers answer this question uniformly is strong evidence that they have internalized a system of rules and/or constraints.

There are several ways these facts can be analyzed. Below I provide two analyses to illustrate.

Both analyses can account for the facts of regular plural formation in English. In the first one, which I will refer to as the morphological analysis, a choice is made among variants based on the pronunciation of the noun. In the second one, which I will refer to as the phonological analysis, the plural morpheme is fixed as [-z] and the resulting word may undergo transformations if the proper conditions are met. These transformations are insertion of schwa (called schwa-epenthesis) or devoicing of the [z]. Such transformations are called phonological processes.
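The two analyses described above can be sketched side by side. This is a rough illustration under stated assumptions: the segment classes SIBILANTS and VOICELESS are my own simplified inventories, the conditions on the two processes (schwa after a sibilant, devoicing after a voiceless sound) are the usual textbook ones rather than something spelled out in this section, and all function names are hypothetical.

```python
# Assumed segment classes for this sketch (simplified, not from the text).
SIBILANTS = ("s", "z", "ʃ", "ʒ")        # affricates such as d͡ʒ also end in ʃ or ʒ
VOICELESS = ("p", "t", "k", "f", "θ")   # voiceless non-sibilants


def plural_morphological(stem: str) -> str:
    """Analysis 1: choose among three listed variants by the stem's final sound."""
    if stem.endswith(SIBILANTS):
        return stem + "əz"
    if stem.endswith(VOICELESS):
        return stem + "s"
    return stem + "z"


def schwa_epenthesis(form: str) -> str:
    """Insert a schwa between a sibilant and a following word-final z."""
    if form.endswith("z") and form[:-1].endswith(SIBILANTS):
        return form[:-1] + "əz"
    return form


def devoicing(form: str) -> str:
    """Devoice a word-final z that directly follows a voiceless sound."""
    if form.endswith("z") and form[:-1].endswith(VOICELESS):
        return form[:-1] + "s"
    return form


def plural_phonological(stem: str) -> str:
    """Analysis 2: always attach z, then let the two processes apply."""
    return devoicing(schwa_epenthesis(stem + "z"))


# Regular singular/plural pairs from the table (irregulars omitted).
REGULAR = {
    "kʰæt": "kʰæts", "sæk": "sæks", "dɔg": "dɔgz", "gɹʌb": "gɹʌbz",
    "dɪʃ": "dɪʃəz", "lɔd͡ʒ": "lɔd͡ʒəz", "pʰi": "pʰiz", "kʰaʊ": "kʰaʊz",
    "wɪf": "wɪfs",
}

for stem in REGULAR:
    print(stem, plural_morphological(stem), plural_phonological(stem))
```

Both functions produce the same outputs for the regular forms, which is exactly why the choice between the analyses has to be argued on other grounds. Note that the two processes in the second analysis say nothing about the plural: they would apply equally to any other suffix of the same shape.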

Generative phonologists argue that, in general, the phonological analysis (Analysis 2) is a better scientific explanation than the morphological analysis (Analysis 1) for the various pronunciations of the plural morpheme. The primary argument comes from the fact that multiple morphemes with the same kind of underlying form show the same patterns of allomorphy. This is the case in language after language.

On the basis of such arguments, we will conclude that morphological alternations constitute evidence for phonological transformations. Indeed, these arguments are so important they provide what I call the Fundamental Principle of Phonology:

From this principle (and the arguments for it), the rest of the field of phonology follows.

Let’s provide another example, this time from Georgian (Aronson 1982). Consider the form of the adjectival suffix below:

As with the example from the English plural, it is possible to state two analyses. One selects the correct pronunciation of the morpheme based on qualities of the root. The other fixes the phonetic form of the morpheme, affixes it to the root, and then subjects the resulting word to a series of phonological processes, or transformations.
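The second kind of analysis can be sketched for the Georgian data as well. The sketch below assumes one hypothesis that is consistent with the eight forms listed for this example: the suffix is fixed as -uri, and its r becomes l when the root contains an r that is not followed by an l later in the root. The function names are hypothetical, and the hypothesis is offered as an illustration rather than as the definitive analysis.

```python
def affix(root: str) -> str:
    """Attach the adjectival suffix in its fixed form."""
    return root + "-uri"


def dissimilate(form: str) -> str:
    """Change suffix r to l if the root has an r with no l after it."""
    root, suffix = form.rsplit("-", 1)
    last_r = root.rfind("r")
    if last_r != -1 and "l" not in root[last_r + 1:]:
        suffix = suffix.replace("r", "l")
    return root + "-" + suffix


def adjective(root: str) -> str:
    """Fix the suffix, attach it, then apply the process."""
    return dissimilate(affix(root))


ROOTS = ["phizik", "kimi", "akti", "phrang", "german", "reakti", "real", "terminal"]

for root in ROOTS:
    print(adjective(root))
```

A selection-based analysis would instead list both -uri and -uli and choose between them by inspecting the root. As with the English plural, the two approaches generate the same forms for this data set.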

Also, it is useful to ask ourselves: How did English and Georgian speakers learn these patterns? What possible purpose could such patterns serve?

Georgian	gloss
phizik-uri	‘physical’
kimi-uri	‘chemical’
akti-uri	‘active’
phrang-uli	‘French’
german-uli	‘German’
reakti-uli	‘reactive’
real-uri	‘real’
terminal-uri	‘terminal’

Contrasts

A third kind of sound pattern has to do with what is called contrast. As with phonological processes, meaning plays a role here too.

Speech sounds are contrastive if they can be used to signal different meanings. For instance, consider the words shown below from Nepali.

Here there are pairs of words which establish that aspiration [ʰ] is contrastive in Nepali. In each pair, the words are identical except for the presence or absence of aspiration on the first consonant. Therefore, in order to know what meaning is being conveyed, it is necessary to know whether aspiration is present or absent on these sounds. This is why aspiration is said to be contrastive in Nepali.

Minimal pairs for aspiration as contrastive in Nepali.
Nepali	gloss	Nepali	gloss
pir	anxiety, pain	bar	fence
pʰir	Turn on!	bʰar	burden
tal	lake	dar	a kind of tree
tʰal	plate	dʰar	edge
kal	time, death	gol	circle, charcoal
kʰal	kind, skin	gʰol	Mix! Stir!
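The reasoning behind a minimal pair can be stated as a simple test: two words with different meanings that become identical once aspiration is ignored. The following is a minimal sketch over the Nepali forms in the table, assuming aspiration is always written with the single character ʰ; the names LEXICON and is_aspiration_minimal_pair are hypothetical.

```python
# Forms and glosses copied from the Nepali table.
LEXICON = {
    "pir": "anxiety, pain", "pʰir": "Turn on!",
    "tal": "lake", "tʰal": "plate",
    "kal": "time, death", "kʰal": "kind, skin",
    "bar": "fence", "bʰar": "burden",
    "dar": "a kind of tree", "dʰar": "edge",
    "gol": "circle, charcoal", "gʰol": "Mix! Stir!",
}


def is_aspiration_minimal_pair(a: str, b: str) -> bool:
    """True if two distinct forms with different glosses differ only in ʰ."""
    same_without_aspiration = a.replace("ʰ", "") == b.replace("ʰ", "")
    return a != b and same_without_aspiration and LEXICON[a] != LEXICON[b]


words = list(LEXICON)
pairs = [(a, b) for i, a in enumerate(words) for b in words[i + 1:]
         if is_aspiration_minimal_pair(a, b)]

for a, b in pairs:
    print(a, b)
```

Finding even one such pair is enough to show that aspiration can signal a difference in meaning. Finding the same relation across several consonants, as here, suggests that the contrast is organized systematically.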

The relations that exist between contrastive units of speech are an important part of the study of contrast. In Nepali, it is clear that the relation between [p] and [pʰ] is the same as the relation between [g] and [gʰ]. However, it is not always straightforward to determine the system of contrast present in a language. For instance, what are the relations that are present in a language with a three-way contrast between the vowels [i,u,a]? As we will see, there is evidence that part of the knowledge that speakers have of the system of contrast of their language includes these relations. This evidence partly comes from the fact that languages with the same speech sounds, even the same contrastive speech sounds, organize the relations between these sounds differently, as was famously pointed out by Sapir (1925).

Furthermore, as we will see, some languages may allow certain speech sounds to be contrastive in all positions in a word, and others may allow certain speech sounds to be contrastive in only some positions. The lexicon of some languages may show that many words make use of particular contrastive features, but other contrastive features are only present in a small portion of the lexicon. Marginal and limited cases like these are of interest because they raise the question: why should a language make use of a contrast in only a limited number of circumstances?

Cross-linguistic patterns

A fourth type of sound pattern refers to the patterns that emerge when multiple languages are considered. Typological generalizations are especially interesting because they exist despite the considerable variation observed across languages. They can therefore lead to hypotheses about linguistic universals. These in turn help distinguish the humanly possible phonological grammars from the logically possible ones.

Here are some examples of typological generalizations that have been presented in the phonological literature.

Summary

Phonological patterns are the sound patterns of language. Phonotactic patterns, phonological processes, and systems of contrast are facets of a phonological system that govern the knowledge speakers have about the pronunciation of words and phrases in their language. When these phonological systems are studied around the world, typological sound patterns emerge as well. Each kind of sound pattern provides some insight into the nature of the underlying system, and consequently into the nature of how the language faculties of human minds work.

As we will see, different phonological theories intertwine systems of contrast, phonotactic patterns, and phonological processes to different degrees and in different ways. In this course, we will develop an awareness of the empirical facts and will concentrate on developing principles for phonological analysis and for evaluating theories.

The fundamental principle of phonological analysis

We thus state the fundamental principle of phonological analysis as follows.

The systematic variation of pronunciation in morphological paradigms is best explained by positing

Consequently, there is a phonological module between the output of the morphological component of the grammar and the phonetics component. The output of morphology is the input to the phonological component. The output of the phonological component is the input to the phonetics module.

Consequently, phonology as a field is primarily concerned with three questions.

To emphasize the different levels of representation that are entailed by a phonological analysis, they are distinguished by different kinds of delimiters. Slashes are standardly used to represent sequences at the abstract, underlying level, and square brackets are used at the more concrete surface level. We will refer to representations at the abstract and more concrete levels as underlying representations and surface representations, respectively.

People have tried to derive the fundamental principle from other principles, for example from a principle of economy of description (Chomsky and Halle 1968). It may very well derive from such a principle. However, establishing this depends on the descriptive formalism. So the devil, as always, is in the details.

Formal Theories of Phonology

Next we turn to different formalizations of phonological theory. Broadly speaking, there are two influential ones. They differ fundamentally in the way they answer the key questions in phonology, which are repeated here.

SPE and LP differ in key respects, but both adopt the position that the formal grammar is an ordered sequence of rules that change underlying forms step by step into surface forms. Constraint-based theories come in many variations, but Optimality Theory and its derivatives (Harmonic Grammar, Harmonic Serialism, and Maximum Entropy; Goldwater and Johnson 2003; McCarthy 2008; Pater 2009) centralize global optimization. Not all constraint-based theories are like that, however, and there are theories, both historically and today, that utilize constraint satisfaction instead of optimization.
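The difference between ordered rules and global optimization can be illustrated with a toy English plural. This is a sketch under several assumptions that are mine rather than the text's: the segment classes, the three-member candidate set, the four constraints (labeled with familiar-sounding names but defined only as coded here), and their ranking are all hypothetical simplifications, and real analyses in either framework are considerably richer.

```python
SIBILANTS = "szʃʒ"      # assumed sibilant inventory
VOICELESS = "ptkfsʃ"    # assumed voiceless inventory

# Rule-based sketch: ordered rules rewrite the form step by step.
def epenthesis(form):
    """Insert schwa between a sibilant and word-final z."""
    if len(form) >= 2 and form[-1] == "z" and form[-2] in SIBILANTS:
        return form[:-1] + "əz"
    return form

def devoicing(form):
    """Devoice word-final z directly after a voiceless sound."""
    if len(form) >= 2 and form[-1] == "z" and form[-2] in VOICELESS:
        return form[:-1] + "s"
    return form

RULES = [epenthesis, devoicing]

def derive(underlying, rules=RULES):
    form = underlying
    for rule in rules:          # the order of application matters
        form = rule(form)
    return form

# Optimization sketch: ranked constraints choose among candidates.
def gen(stem):
    """A deliberately tiny candidate set for stem plus /z/."""
    return [stem + "z", stem + "s", stem + "əz"]

def star_sib_sib(stem, cand):   # adjacent sibilants
    return sum(a in SIBILANTS and b in SIBILANTS for a, b in zip(cand, cand[1:]))

def agree_voice(stem, cand):    # voiceless sound directly before z
    return sum(a in VOICELESS and b == "z" for a, b in zip(cand, cand[1:]))

def dep(stem, cand):            # inserted segments beyond stem plus suffix
    return len(cand) - (len(stem) + 1)

def ident_voice(stem, cand):    # suffix z realized as s
    return int(cand[len(stem):] == "s")

RANKING = [star_sib_sib, agree_voice, dep, ident_voice]

def optimal(stem, ranking=RANKING):
    """Pick the candidate whose violation profile is lexicographically smallest."""
    return min(gen(stem), key=lambda c: tuple(con(stem, c) for con in ranking))

for stem in ["kæt", "dɔg", "dɪʃ"]:
    print(stem, derive(stem + "z"), optimal(stem))
```

In the rule-based sketch, the result depends on the order of the rules: applying devoicing before epenthesis to dɪʃz would yield dɪʃs. In the optimization sketch there are no intermediate steps at all, and the same outcome falls out of the constraint ranking instead.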

Broadly speaking, rule-based theories and optimization-based constraint theories have different ontologies. According to Wikipedia, the principal questions of ontology include “What can be said to exist?” and “Into what categories, if any, can we sort existing things?” In phonology, the existing thing we are interested in is our phonological knowledge (our knowledge of the lexical representations, the surface forms, and the mapping) and how this knowledge is manifest in our minds (what is psychologically real).

Take SPE and OT as specific examples. The difference between these theories can be stated as follows.

Of course, these are not the only two theories that have been studied. Especially in the 1980s, prior to the advent of OT in the early 1990s, there were many theories which mixed rules and constraints. The grammatical model called Harmonic Serialism (McCarthy 2008) also synthesizes aspects of these two earlier approaches, broadly construed. However, we will focus on the aforementioned two theories, and only mention these others as needed.

Finally, while the theories do differ, it is important to realize they all admit that the phonological component of a grammar includes underlying forms, surface forms, and a mapping between them. In other words, the common base for these two theories is the fundamental principle discussed earlier.

Albright, Adam, and Bruce Hayes. 2003. “Rules Vs. Analogy in English Past Tenses: A Computational/Experimental Study.” Cognition 90: 119–61.

Aronson, Howard. 1982. Georgian, a Reading Grammar. Slavica Publishers, Inc.

Bale, Alan, and Charles Reiss. 2018. Phonology: A Formal Introduction. The MIT Press.

Brentari, Diane. 2019. Sign Language Phonology. Cambridge University Press.

Chomsky, Noam. 1965. Aspects of the Theory of Syntax. Cambridge, MA: MIT Press.

Chomsky, Noam, and Morris Halle. 1965. “Some Controversial Questions in Phonological Theory.” Journal of Linguistics 1: 97–138.

———. 1968. The Sound Pattern of English. New York: Harper & Row.

Du, Naiyan, and Karthik Durvasula. 2025. Psycholinguistics and Phonology: The Forgotten Foundations of Generative Phonology. Cambridge University Press. https://doi.org/10.1017/9781009347631.

Durvasula, Karthik. 2025. “A Closer Look at What/How We Can Learn from Computational Modelling of Phonotactics.” https://ling.auf.net/lingbuzz/009051.

Goldwater, Sharon, and Mark Johnson. 2003. “Learning OT Constraint Rankings Using a Maximum Entropy Model.” In Proceedings of the Stockholm Workshop on Variation Within Optimality Theory, edited by Jennifer Spenader, Anders Eriksson, and Östen Dahl, 111–20.

Gorman, Kyle. 2013. “Generative Phonotactics.” PhD thesis, University of Pennsylvania.

Greenberg, Joseph. 1978. “Initial and Final Consonant Sequences.” In Universals of Human Language: Volume 2, Phonology, edited by Joseph Greenberg, 243–79. Stanford University Press.

Halle, Morris. 1978. “Knowledge Unlearned and Untaught: What Speakers Know About the Sounds of Their Language.” In Linguistic Theory and Psychological Reality. The MIT Press.

Hayes, Bruce, and Colin Wilson. 2008. “A Maximum Entropy Model of Phonotactics and Phonotactic Learning.” Linguistic Inquiry 39: 379–440.

Heinz, Jeffrey, and William Idsardi. 2017. “Computational Phonology Today.” Phonology 34 (2): 211–19.

Maddieson, Ian. 1984. Patterns of Sounds. Cambridge, UK: Cambridge University Press.

McCarthy, John J. 2008. “The Gradual Path to Cluster Simplification.” Phonology 25 (2): 271–319.

Oxford, Will. 2014. “Patterns of Contrast in Phonological Change: Evidence from Algonquian Vowel Systems.” Language.

Pater, Joe. 2009. “Weighted Constraints in Generative Linguistics.” Cognitive Science 33: 999–1035.

Prince, Alan, and Paul Smolensky. 1993. “Optimality Theory: Constraint Interaction in Generative Grammar.” 2. Rutgers University Center for Cognitive Science.

———. 2004. Optimality Theory: Constraint Interaction in Generative Grammar. Blackwell Publishing.

Sapir, Edward. 1925. “Sound Patterns in Language.” Language 1 (2): 37–51.

Sapir, Edward, and Harry Hoijer. 1967. “The Phonology and Morphology of the Navaho Language.” University of California Publications 50.

Wilson, Colin. 2006. “Learning Phonology with Substantive Bias: An Experimental and Computational Study of Velar Palatalization.” Cognitive Science 30 (5): 945–82.
