This week we will have two practice talks for the upcoming MUMM meeting:
Time: Monday 5/4, 5pm
Location: 32-D831
First speaker: Hyesun Cho
Title: The problem of generalization in a statistical learning model of phonotactics
Achieving a proper level of generality in OT-style constraints has not been a problem in most of the phonological grammar learning models up to now (such as GLA (Boersma, 1997); a Maximum Entropy model (Goldwater and Johnson, 2003); a GLA-HG model (Coetzee and Pater, 2008)), because in those models, constraints do not have to be learned. Instead, learning a grammar mainly involves finding rankings or weights of the given set of constraints. In contrast, the phonotactic learning model of Hayes and Wilson (2008) is different from others in that the model derives constraints themselves from the surface forms in the training data. This paper shows that in doing so, it is not trivial to learn constraints with a proper level of generalization.
I ran the Hayes and Wilson model on Korean training data. In the resulting grammar, the model makes overly-broad generalizations, i.e., the constraints that penalize both possible (grammatical) and impossible (ungrammatical) sequences equally, especially when the frequency differences between the possible sequences and the impossible sequences are very small. Because of this, some of the possible sequences are predicted to be worse than impossible sequences. The three problem cases include: post-obstruent tensing, diphthong restrictions, and labials-[?] sequences. I discuss two possible solutions: adjusting feature specifications and employing an additional learning bias.
The grammar learned by the Hayes and Wilson model consists of markedness constraints only. I ran a model that uses both faithfulness constraints and markedness constraints (Goldwater and Johnson, 2003) for the problem cases above. It turns out that a proper level of generalization in the constraints is crucial in a faithfulness model as well.
Second speaker: Patrick Jones
Title: Evidence for the Phonological Stem in Kinande
A number of recent studies on Bantu verbal phonology (Downing 1994, 1998, 1999, 2000; Herman 1996) have argued that the phonological processes which affect the Bantu verb, rather than referring directly to constituents of morpho-syntactic structure (M-Constituents), instead refer exclusively to morpho-prosodic constituents (P-Constituents) which are derived from them. Evidence for this view, which has its origins in the work of Selkirk (1986) and Inkelas (1989), is found in a number of phonological processes whose domains of application approximate constituents defined by morpho-syntactic structure, but do not match them exactly. In this talk, I will argue that the existence of a Phonological Stem (PStem) domain is strongly supported by the verbal phonology of Kinande, and that by recognizing the PStem as a morpho-prosodic domain to which phonological constraints can refer, it is possible to provide straightforward and unified analyses of four distinct phonological processes - Verbal Reduplication, Intonational Tone Assignment, Lexical Tone Assignment, and Purposive Suffix Affixation - that must otherwise be explained in terms of arbitrary and idiosyncratic constraints. However, I will argue against the position that only morpho-prosodic constituents, and not morpho-syntactic ones, may be referred to by phonological processes. I will argue that reference to the MStem as well as reference to the PStem is necessary in order to successfully account for Verbal Reduplication and Lexical Tone Assignment, and will therefore argue for a theory in which phonological processes make necessary, but not exclusive, reference to morpho-prosodic domains.