The meeting of the Society for Computation in Linguistics is again being organized by Gaja Jarosz and Joe Pater, this year with help from Max Nelson and Brendan O’Connor of Computer Science. Jarosz is also one of the invited speakers, along with Mark Johnson of Macquarie University. This year will also feature an exciting experiment in conference design: there will be two simultaneous locations, with at least some of the talks held jointly by videoconference. SCiL 2019 will again be co-located with the LSA, this time in NYC, Jan. 3-6, and it will also take place over the same dates in Paris, at the Laboratoire de linguistique formelle at l’Université Paris 7 Diderot. The abstract and paper deadline is Aug. 1; the full call can be found here: http://websites.umass.edu/scil/scil-2019/call-2019/.
Summer Dialect Research Project 2018
Four undergraduate students participated in the Summer Dialect Research Project (SDRP) at UMass in June, hosted by the Center for the Study of African American Language (CSAAL). The Center, directed by Lisa Green, fosters and integrates research on language in African American speech communities and applications of that research in different realms. Three of the students, Christian Muxica, Alexander Santos, and Emily Smith, are enrolled at UMass and majoring in linguistics. Janiya Gilbert is a sophomore in the animal science program at North Carolina A&T University with interests in language-related and social justice fields.
The participants gained research experience in the study of African American English (AAE), a linguistic variety spoken by some African Americans. They worked on their skills in linguistics while also building broader analytical, argumentation, and collaboration skills, completing group critical review projects and individual research projects that required analysis of data sets from AAE.
During the three-week program, participants attended lecture/discussion sessions with UMass faculty, researchers, and graduate students, who covered topics in syntax, phonology, acquisition, psycholinguistics, and natural language processing. Professors Joe Pater and Kristine Yu worked with the participants in interactive sessions on topics related to the sound patterns of AAE, such as the production of word-final -t/-d and the prosodic properties of yes-no questions in the variety. Professors Tom Roeper and Brian Dillon shared research on topics in acquisition and language processing and linked that research to data in AAE; for instance, Professor Dillon drew the connection between research on garden path sentences and subject relative constructions in AAE.
In other sessions, researchers discussed ways in which work in linguistics relates to other disciplines. Dr. Barbara Pearson, former research associate at UMass, demonstrated how research used to develop assessments in communication disorders for children who speak all varieties of English, including AAE, has drawn on linguistics. Computer science graduate student Su Lin Blodgett presented her research on natural language processing and AAE on Twitter.
One participant summed up his experience in the program in the following way: “I really enjoyed these three weeks and got a lot out of our work and hope to shape my senior year linguistic work around some of this research and our projects.”
Richard Futrell in CLC/Psycholing Workshop, Friday, April 27
All are welcome at the final CLC event this spring: Richard Futrell (MIT, BCS) will speak at Psycholinguistics Workshop on Friday, April 27th, 10am-11am, in ILC N400. Richard will also be available for individual meetings – please contact Chris Hammerly to set up an appointment. See below for abstract and title.
Memory and Locality in Natural Language
April 27th (Fri), 10am-11am, ILC N400 (Psycholing Workshop)
I explore the hypothesis that the universal properties of human languages can be explained in terms of efficient communication given fixed human information processing constraints. First, I show corpus evidence from 54 languages that word order in grammar and usage is shaped by working memory constraints in the form of dependency locality: a pressure for syntactically linked words to be close to one another in linear order. Next, I develop a new theory of human language processing cost, based on rational inference in a noisy channel, that unifies surprisal and memory effects and goes beyond dependency locality to a new principle of information locality: that words that predict each other should be close. I show corpus evidence for information locality. Finally, I show that the new processing model resolves a long-standing paradox in the psycholinguistic literature, structural forgetting, where the effects of memory on language processing appear to be language-dependent.
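To make the dependency locality measure concrete, here is a minimal Python sketch of how total dependency length is typically computed from a head-annotated sentence. The sentence and head indices are invented for illustration; this is not Futrell’s corpus code.

```python
# A minimal sketch (not from the talk) of the quantity behind dependency
# locality: the sum of linear distances between each word and its syntactic head.

def total_dependency_length(heads):
    """Sum of linear distances between each word and its head.

    `heads` is a list where heads[i] is the 1-based index of word i+1's head,
    or 0 if word i+1 is the root.
    """
    return sum(abs((i + 1) - h) for i, h in enumerate(heads) if h != 0)

# "John threw out the trash": "threw" is the root; head indices are illustrative.
print(total_dependency_length([2, 0, 2, 5, 2]))  # 1 + 1 + 1 + 3 = 6
```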
Music and language events this week
On Tuesday April 10th 3-4 pm in ILC N458, there will be a discussion of “Harmonic syntax of the 12-bar blues” by UMass Linguistics undergrad alum Jonah Katz. A link and abstract appear below.
On Friday April 13th 2:30 – 3:30 in ILC N400, Stefanie Acevedo (Yale) will present “Explaining expectation entropically: An empirical study of harmony in popular music” (abstract below).
At 3:30 on Friday the 13th, David Temperley (Eastman School of Music) will present “A Model of Emotional Expression in Rock” (abstract below).
All are welcome to all of these events. Please contact Joe Pater if you would like to meet with either Acevedo or Temperley while they are here.
==========
Jonah Katz (2017). Harmonic syntax of the 12-bar blues: a corpus study. Music Perception, 35(2), 165-192. Preprint (LingBuzz). Supplementary materials: data, statistical models, tree graphs, description of modeling.
Abstract. This paper describes the construction and analysis of a corpus of harmonic progressions from 12-bar blues forms included in the jazz repertoire collection The Real Book. A novel method of coding and analyzing such data is developed, using a notion of ‘possible harmonic change’ derived from the corpus and logit mixed-effects regression models describing the difference between actually occurring harmonic changes and possible but non-occurring ones in terms of various sets of theoretical constructs. Models using different sets of constructs are compared using the Bayesian Information Criterion, which assesses the accuracy and complexity of each model. The principal results are that: (1) transitional probabilities are better modeled using root-motion and chord-frequency information than they are using pairs of individual chords; (2) transitional probabilities are better described using a mixture model intermediate in complexity between a bigram and full trigram model; and (3) the difference between occurring and non-occurring chords is more efficiently modeled with a hierarchical, recursive context-free grammar than it is as a Markov chain. The results have implications for theories of harmony, composition, and cognition more generally.
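For readers unfamiliar with BIC-based model comparison, the sketch below illustrates the general recipe on invented chord data: fit add-one-smoothed bigram and trigram transition models and compare them by BIC (lower is better). This is only a toy analogue of the analysis described in the abstract, not Katz’s models or the Real Book corpus.

```python
# Toy comparison of bigram vs. trigram chord-transition models by BIC,
# where BIC = k * ln(n) - 2 * logL (k = free parameters, n = tokens).
import math
from collections import Counter

corpus = [["I", "IV", "I", "V", "IV", "I"],      # invented 12-bar-like progressions,
          ["I", "I", "IV", "IV", "I", "V", "I"]] # not real Real Book data

vocab = sorted({c for seq in corpus for c in seq})

def ngram_loglik_and_params(n):
    """Add-one-smoothed n-gram model: return (log-likelihood, free params, tokens)."""
    counts, context_counts = Counter(), Counter()
    for seq in corpus:
        padded = ["<s>"] * (n - 1) + seq
        for i in range(n - 1, len(padded)):
            ctx, w = tuple(padded[i - n + 1:i]), padded[i]
            counts[(ctx, w)] += 1
            context_counts[ctx] += 1
    loglik, tokens = 0.0, 0
    for seq in corpus:
        padded = ["<s>"] * (n - 1) + seq
        for i in range(n - 1, len(padded)):
            ctx, w = tuple(padded[i - n + 1:i]), padded[i]
            p = (counts[(ctx, w)] + 1) / (context_counts[ctx] + len(vocab))
            loglik += math.log(p)
            tokens += 1
    # One distribution over the vocab per observed context, |V| - 1 free params each.
    k = len(context_counts) * (len(vocab) - 1)
    return loglik, k, tokens

for n in (2, 3):
    ll, k, tokens = ngram_loglik_and_params(n)
    bic = k * math.log(tokens) - 2 * ll
    print(f"{n}-gram: logL={ll:.2f}, params={k}, BIC={bic:.2f}")
```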
Acevedo abstract: Given a preponderance of common ‘stock’ progressions in popular music, like the “Doo-Wop” (I-vi-IV-V) or the “Axis” (I-V-vi-IV) progressions, sequences of chords are often taken as a starting point for analysis. These chord sequences contextualize the sometimes ‘non-functional’ chord usage in popular music. While recent music-theoretical work uses computational methods to analyze harmonic probabilities in musical corpora and model their stylistic norms, it often focuses on analyzing lower-order probabilities such as single chord counts or chord-to-chord transitional probabilities. In this talk, I propose the use of information entropy, a measure of statistical uncertainty, as a way to segment harmonic progressions in a corpus of popular music (the McGill Billboard Corpus). The resultant harmonic segments are classified into prototypical chains based on functional categories that are determined by chord sequences as opposed to individual chords. The results and implications of the project are contextualized within recent research on popular music harmony and implicit learning of musical style.
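As a rough illustration of the entropy idea (not Acevedo’s implementation, and with invented progressions rather than the McGill Billboard Corpus), one can estimate next-chord distributions from a corpus and treat positions of peak uncertainty as candidate segment boundaries:

```python
# Toy sketch: Shannon entropy of the next-chord distribution as a segmentation cue.
import math
from collections import Counter, defaultdict

toy_corpus = [["I", "V", "vi", "IV"] * 3,   # stand-ins for Billboard progressions
              ["I", "vi", "IV", "V"] * 3]

# Estimate P(next chord | current chord) from the toy corpus.
transitions = defaultdict(Counter)
for seq in toy_corpus:
    for a, b in zip(seq, seq[1:]):
        transitions[a][b] += 1

def next_chord_entropy(chord):
    """Shannon entropy (bits) of the next-chord distribution after `chord`."""
    counts = transitions[chord]
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())

progression = ["I", "V", "vi", "IV", "I", "vi", "IV", "V"]
entropies = [next_chord_entropy(ch) for ch in progression[:-1]]
# Treat local entropy peaks as candidate segment boundaries.
boundaries = [i + 1 for i in range(1, len(entropies) - 1)
              if entropies[i] > entropies[i - 1] and entropies[i] >= entropies[i + 1]]
print(list(zip(progression, entropies)), boundaries)
```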
Temperley abstract. In this talk, I present a framework for the analysis of emotional expression in rock music. The talk surveys some of the material in my new book The Musical Language of Rock (Oxford, 2018).
I begin with a two-dimensional model of emotion, well-established in music psychology, with valence (positive versus negative emotion) on one axis and energy (also known as arousal or activity) on the other. Valence is determined mainly by pitch collection (roughly, major versus minor, though there is more to it than that); energy depends on a variety of cues such as tempo, pitch register, loudness, and textural thickness. I then add a third dimension for complexity, or (in experiential terms) tension. Tension is affected by the density of events and also by their expectedness, with faster rhythms and low-probability events being higher in tension. Low-probability events can arise from such things as surprising harmonies, shifts outside of the currently established scale, irregular phrases, and extreme or unusual syncopations.
I then apply this model to the verse-chorus unit (VCU)—a formal section containing a verse and chorus; this is the core element of conventional rock form. We find consistent trajectories across the VCU in all three expressive dimensions—valence, energy, and tension. The chorus tends to be higher in energy than the verse; in terms of valence, many songs show a “sharp-ward” shift between verse and chorus, reflected not only in simple minor-to-major shifts but also in more subtle ways. With regard to tension, however, the peak tends to be in the middle of the VCU, either in the prechorus (if there is one) or in an extension of the verse. I present a number of examples, showing how the current model sheds light on both normative and exceptional cases.
Charlie O’Hara visiting this week
Charlie O’Hara, a fifth-year phonology student at USC (http://dornsife.usc.edu/ohara), will be visiting our department for the week. He will be presenting to a joint meeting of Gaja Jarosz and Joe Pater’s graduate phonology classes on Tuesday at 11:30 (all are welcome), and will be generally available for discussion, especially about his specialty, modeling soft typology with agent-based learning.

Music and Language CogSci Incubator Tues. April 3rd at 2:30 in ILC N458
The Music and Language CogSci Incubator (https://websites.umass.edu/cogsci/2018/03/10/music-and-language-cogsci-incubator-with-acevedo-and-temperley/) will begin on Tues. April 3rd at 2:30 in ILC N458 with a discussion of David Temperley’s The Musical Language of Rock. Participants are encouraged to bring questions and discussion points that don’t assume everyone has read the book (in other words, please come even if you haven’t even cracked its spine). The book turns out to assume a fair bit of music theory, so clarification questions are very much appropriate (and we might pick something else to discuss the next week).
Michael Becker in Sound Workshop / CLC Monday April 9th at 10 am
UMass alum and current Stony Brook faculty member Michael Becker will be presenting on modeling Arabic plurals in Sound Workshop, Monday April 9th from 10am-11am in ILC N451. This is one of the meetings of the Computational Linguistics Community.
Music and Language CogSci Incubator with talks April 13
There is growing interest in the 5 Colleges in music cognition and its relation to language. To build on this, Mara Breen (Mount Holyoke Psychology and Education), Joe Pater (UMass Linguistics) and Christopher White (UMass Music and Dance) have organized the first “CogSci Incubator” event. On April 13th, Stefanie Acevedo of Yale University (https://stefanieacevedo.com) and David Temperley of the Eastman School of Music (http://davidtemperley.com) will present talks from 2:30 – 5 in N400 of the Integrative Learning Center. This will be followed by a reception at the Hangar Pub and Grill. In preparation for Temperley’s visit, there will also be two meetings, on April 3rd and 10th, from 3-4:15, to discuss his new book The Musical Language of Rock (linked below). Details on those meetings, including location, will be forthcoming in the CogSci newsletter (subscribe here).
https://global.oup.com/academic/product/the-musical-language-of-rock-9780190870522?cc=us&lang=en&#
These events are sponsored by the CogSci Initiative, the Department of Linguistics, and the Department of Music and Dance. If you are interested in “Incubating” another emerging UMass CogSci research area, please contact Joe Pater.
Prickett, Traylor and Pater in Sound Workshop and NLP Group
Brandon Prickett, Aaron Traylor and Joe Pater will present their ongoing work on learning reduplicative patterns using modern neural networks in Sound Workshop, 10-11 Monday Feb. 26th in ILC N451, and in the NLP Group, CS 303, Wednesday Feb. 28th from 4-5 (the NLP meeting is part of this semester’s CLIC series). Brandon’s earlier negative results (http://scholarworks.umass.edu/ics_owplinguist/2/) thrilled Gary Marcus, but we suspect that his new positive ones will make Yann LeCun happier.
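For those curious what the learning problem looks like, here is a toy sketch of the data side only: it generates total-reduplication input-output pairs and holds out stems containing a novel segment, the kind of generalization test at stake in the Marcus-style critique. The segments and stem shapes are invented; this is not the authors’ actual setup or model.

```python
# Toy illustration of the reduplication learning problem: map a stem to its
# full reduplication (e.g. "ba" -> "baba"), then test whether a learner
# generalizes to stems containing a segment never seen in training.
import random

random.seed(0)
train_segments = list("ptkbdgmnaiu")   # segments seen in training
novel_segment = "s"                    # withheld to probe generalization

def make_pairs(segments, n, stem_len=3):
    """Generate (stem, stem+stem) pairs from the given segment inventory."""
    pairs = []
    for _ in range(n):
        stem = "".join(random.choice(segments) for _ in range(stem_len))
        pairs.append((stem, stem + stem))  # total reduplication
    return pairs

train = make_pairs(train_segments, 500)
test_novel = [(x, y) for x, y in make_pairs(train_segments + [novel_segment], 50)
              if novel_segment in x]

print(train[:3])
print(test_novel[:3])
# A network trained on `train` (e.g. a seq2seq model) would then be scored on
# whether it copies stems in `test_novel` correctly, despite never having seen "s".
```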
Spring 2018 Computational Linguistics Community (CLC) Events
See below for our exciting line-up of CLC events this semester. All welcome! Mark your calendars!
- Soroush Vosoughi (MIT) Data Science Seminar talk
- Feb 22nd at 4pm, CS 150/1
- COLING Paper Clinic
- February 28th (Wed), 4-5pm, CS 303 (NLP Reading Group)
- Yulia Tsvetkov (CMU Computer Science)
- March 1st (Thu), 12pm, CS 150/1 (MLFL)
- Yelena Mejova (QCRI) iSchool Seminar talk
- March 6th, at 4pm, CS 150/1
- Brian Dillon (UMass Linguistics) on “Syntactic Frequency Effects in Recognition Memory”
- March 30th (Fri), 12:20-1:20, ILC N451 (Experimental Lab)
- Michael Becker (Stony Brook Linguistics) on Modeling Arabic Plurals
- April 9th (Mon), 10am-11am, ILC N451 (Sound Workshop)
- Richard Futrell (MIT BCS) on “Memory and Locality in Natural Language”
- April 27th (Fri), 10am-11am, ILC N400 (Psycholing Workshop)