The slaves who were ourselves had known terror intimately, confused sunrise with pain, & accepted indifference as kindness. – Ntozake Shange, Sassafrass, Cypress & Indigo
Sanford. Baltimore. Chicago. Staten Island. Charlotte. Cleveland. Oakland. Austin. Los Angeles. The Bronx.
Despair in the United States is nothing new. It is neither an emotion confined to the neatly-drawn borders of this land nor is it experienced more acutely by any one group of people. The vast discrepancy between the results of the popular vote and the electoral college’s selection of Donald Trump as forty-fifth president of the United States amply reveals despair to be an sentiment viscerally experienced by a wide swath of people in this country, irrespective of race, ethnicity, gender, class, or sexuality.
Such despair has been ignored, however, by those who have caused and who continue causing the suffering of peoples of both indigenous and, later, African descent. We are taught that men from what we now recognize as Europe arrived in this hemisphere in the late fifteenth century, settling initially on a strip of earth in the Caribbean Sea that would become the first site of massacre and genocide, acts which unleashed, if one lends credence to the narrator of Junot Díaz’s The Brief Wondrous Life of Oscar Wao, the fukú, the “Curse and the Doom of the New World.” The narrating voice himself characterizes the curse not in the actions of death, but in the “screams of the enslaved, [..] the death bane of the Tainos, uttered just as one world perished and another began […]” (1). The fukú resonated through the sounds that these human beings made.
Not a house in the country ain’t packed to its rafters with some dead Negro’s grief. – Toni Morrison, Beloved
The State’s unwillingness to hold George Zimmerman responsible for the murder of Trayvon Martin–and its subsequent refusal to hold any police officer accountable for the hundreds of deaths they have caused–has galvanized the United States in the last four years. Hundreds of thousands of men, women, and children alike have taken to the streets, as #BlackLivesMatter, a true and succinct sentence, has roused ghosts of the past who have never left us, who have always been present, accompanying us on this journey.
This post is not a reflection of the music that has served as a soundtrack to these protests, though there are articles that have done so, such as this one, this one, and this one. These pieces do not include the extensive list of articles that address perhaps the most widely-viewed piece of protest music thus far, Beyoncé’s “Formation” video, a scarce offering of which can be found here, here, and here. Instead, it is an essay inspired by the sounds of the protesters themselves, the noises made by the minds, bodies, and spirits of the men, women and children who have taken to public spaces and sometimes commercial zones in order to confront and object to the protections applied to those who kill men, women, and children, often of African descent.
Listen to Los Angeles in 2013. . .
. . .to Houston in 2014. . .
. . .to New York City in 2014. . .
. . .and to Charleston in 2015. . .
. . .
In his pivotal Cities of the Dead: Circum-Atlantic Performance (1996), Joseph Roach characterized New Orleans and London as urban centers marked by two simultaneous, consistent acts: appropriation by white people and white power structures of the cultures of the peoples they have violently marginalized, and then, at the same time, a clear distancing from those very cultures and peoples. Although now in its twentieth year of publication, Roach’s theorization of the circum-Atlantic world remains vastly underutilized in scholarly circles—particularly in sound studies, where it should have special resonance– and has become increasingly critical to our understanding of this historical moment, as it “insists on the centrality of the diasporic and genocidal histories of Africa and the Americas, North and South, in the creation of the culture of modernity” (4). With this configuration, Roach accomplishes two feats simultaneously: first, he decentralizes the United States as the focal point of studies about the so-called New World, instead, placing on equal footing all of the histories and cultures of the Americas. For this scholar of the literatures of the Americas, particularly those written by men and women of African descent, Roach’s is a critical gesture that facilitates comparative work across national boundaries.
Second, and most importantly, Roach emphasizes the role of murder, rape and the destruction of whole cultures indigenous to the American and African continents in the foundation of the nations of this hemisphere. Ta-Nehisi Coates is perhaps the most recent writer to remind us that the most potent legacy of such modernity, racism, “is a visceral experience, that is dislodges brains, blocks airways, rips muscle, extracts organs, cracks bones, breaks teeth” (Between the World and Me, 10). That which we know as “modernity,” itself a deeply flawed construct that remains in need of serious revision, was born of broken backs, mutilated limbs, hushed middle-of-the-night tears of indigenous and African peoples. Moans and sighs, whispers and wails, cries and screams, they are the musical score of this hemisphere’s American experiment.
The slaves who were ourselves aided Indigo’s mission, connecting soul & song, experience & unremembered rhythms –Ntozake Shange
In the face of a populace accustomed to ignoring the wailing of mothers who have buried their children, who have disregarded their dignity and the weight and shape and taste of their loss, men, women, and children have mobilized. They have made manifest that which communities of peoples of African descent have spoken of and have documented since the founding of this nation. As Roach has utilized the term performance, the literal rituals of mourning by communities of African heritage not only commemorate those who have recently passed but they also invoke the spirits of those who have long borne witness to such violence. Throughout his study, Roach distinguishes between a European heritage that begins to segregate the living from the dead during the Enlightenment (50), and more traditional cultures, particularly African ones, where spirits mingle with their human counterparts. While written texts may not, and often do not, adequately commemorate the loss of lives deemed marginal to the larger society, performance itself – chants, wails, songs – serve not only to memorialize but also as gestures of restoration.
Protesters and activists are no longer satisfied with the well-established decree that we should wait for a distant moment for a more perfect realization of the United States’s many promises. No, instead, they have identified this as the historical moment in which those oaths are to be fulfilled. They have walked, marched, and stomped through streets, on sidewalks, parks, churches, filling malls and transportation hubs with their bodies as testimony. They have repossessed and redefined spaces once thought of as simply neutral, transparent space as Katherine McKittrick refers to it in Demonic Grounds, revealing the fault-lines of difference based on class, race, gender, and sexuality in this society (xv). They have done so manipulating sound, both recycling chants used through the decades to protest injustice and, at times, simply occupying space, without a word uttered.
The silence waged in the 2014 protest in Grand Central Terminal after the non-indictment of Daniel Pantaleo in the murder of Eric Garner does not represent erasure, but rather a purposeful demonstration of the willful humanity of those unwilling to forget.
They quiet themselves. They replace the sounds of unfettered pain and grief with its absence, until all that you hear is the mechanized announcement of train schedules. The contrast is stark: the moment highlights what Claudia Rankine has identified as the condition of black life in Citizen, that of mourning (145), against a backdrop of technological advancement, that which has been built on the backs of and through the physical, emotional, and intellectual labor of black life. Here, the members of this community enact what has been called a “die-in”: simulating the physical positioning of bodies in caskets, they force onlookers to confront an uncomfortable truth about the history of this country and of the nations of this hemisphere.
All of us walk on land soaked in the blood of those who have made our lives easier and more convenient. The men and women at Grand Central make manifest what Roach terms surrogation: in the chasm left by death, they offer a replacement, one that both evokes those who have died and disturbs the complacency of survivors themselves (2). The performance serves to confront those who dare say that the violence of genocide and enslavement of past generations should remain in the past; no, these men and women and the spirits they invoke respond. Time is not linear, as we have been taught. For past, present, and future are temporal constructs used to service oppression and domination; this will no longer do.
Here, in this here place, we flesh; flesh that weeps, laughs; flesh that dances on bare feet in grass. Love it. Love it hard. – Toni Morrison
We bear witness to the reclamation of grief, of lives cut short at the hand of a government charged with protecting those human beings who inhabit its borders, at least theoretically. While, as Roach surmises, “memory [may be] a process that depends crucially on forgetting” (2), we hold space to those dedicated to not forget, to instead excavate the silences, breathe life into those histories, remembering that the stories we have heard, the pages we have read, were once human beings. We create “counter-memories” as challenge and testimony, as a sacred pledge to those who are no longer present physically in this realm (Roach 26). We recall the cultures and practices of those who lived before the written form was a tool of exclusion, when remembrance was a practice of community.
American culture, in the hemispheric sense, incorporates all such rituals, across generations; as Roach notes, it is performance that “works on behalf of living memory, by bringing the parties together as often as necessary” (138). No longer consigned to the past, the spirits of those killed by the state are revived, their existences in the human plain celebrated. They are not defined by how they died but instead by how they lived. While literacy of the written form can separate, sound and gesture more effectively bypass the fictions of difference based on race, ethnicity, gender, class, and sexuality. Cities of the Dead amplifies how “performance can articulate what otherwise may not be properly communicated” (161).
It’s so magic folks feel their own ancestors coming up out of the earth to be in the realms of their descendants – Ntozake Shange
We say their names. We say their names: Eleanor Bumpers. Anthony Báez. Sean Bell. Aiyana Stanley-Jones. Tyisha Miller. Oscar Grant. Rekia Boyd. Trayvon Martin. Tanisha Anderson. Renisha McBride. Eric Garner. Yvette Smith. Tamir Rice. Sandra Bland. Freddie Gray. Korryn Gaines. Akia Gurley. Alton Sterling. Philando Castile. Micah Jester. Deborah Danner. Walter Scott. Michelle Lee Shirley.
The list, tragically, grows, and still we say their names. We do so as an act of remembrance. As an offering. As peoples of African descent around the world do in times of ceremony, in the name of ritual. We remember those who have come before us, who have birthed this current historical moment of awakening here in the United States. We say their names.
And, as the sounds of their names said aloud echoes, we pray. Ashé.
Vanessa K. Valdés is associate professor of Spanish and Portuguese at The City College of New York; she is the editor of Let Spirit Speak! Cultural Journeys through the African Diaspora (2012) and The Future Is Now: A New Look at African Diaspora Studies (2012) and the book review editor of sx salon. She is the author of Oshun’s Daughters: The Search for Womanhood in the Americas (2014). The title of this essay is inspired by Josh Kun’s Audiotopia: Music, Race, and America, where he writes that his book is “focusing on the spaces of music, the spaces of songs, and the spaces of sounds” (25).
REWIND! . . .If you liked this post, you may also dig:
Music Meant to Make You Move: Considering the Aural Kinesthetic–Imani Kai Johnson
**This post was co-authored by Marit J. MacArthur and Lee M. Miller
Like it or not, we are now accustomed to contemporary pop vocalists manipulating their voices using Autotune and other tools or effects for pitch correction. We may exult in it, and congratulate ourselves on our sophisticated appreciation of the options available to the contemporary vocalist. In another mood, we may scream for low-fi and acoustic music, feel cynical about the possibility that we might ever hear an unmediated voice, live or recorded (if we ever did), and/or laugh off the notion of authenticity in performance entirely. Of course, rather than tricking the audience or trying to sound somehow “better” than they are, many performers manipulate their voices to pose questions about the nature of performance—Reggie Watts and Anna Deavere Smith come readily to mind—and to test essentialist assumptions about and perceptions of voice and sound.
Watts, in an exemplary 2012 TED Talk, plays with the different sorts of authority and affect conveyed by, among other voices: upper-class-British-absurd-explanatory, affectively-meaningful-nonsense-foreign-language, and caz-hip-hop-introducing-a-song-chat. Inhabiting and playing with different voices, he amuses listeners into recognizing how much intonation—the rise and fall of pitch—and other acoustic features affect our perception of a speaker’s voice, and how much we expect people to speak in ways that match our assumptions about their identities.
We cannot all be so talented at vocal imitation, however. And in sound, voice and performance studies concerned with speech, machine-assisted manipulation of vocal recordings—which we term “vocal deformance”—is much less common than in the creative industries. A playful approach to vocal deformance, as a critical and creative practice, has much to teach us about our perceptions of speech in general, and performative speech in particular. Too often, when we use archival poetry recordings in our teaching, they may reify an idea that students are often loathe to relinquish: a poem is a finished art object, weighted with authorial intention and biographical significance, with one possible interpretation (the instructor’s). When we play a single canonical recording of T.S. Eliot reading The Waste Land in 1946, for instance, his particular intonation, together with assumptions that he was a stuffy, overeducated, repressed snob, can foreclose the possibility of a fresh encounter with the many voices of the poem and a multitude of interpretations.
Using vocal deformance in the classroom and in our own research and scholarship, we can unsettle overdetermined readings of poems, essentialist assumptions about the poets who speak them and questions of poetic authority, and recover the crucial oral components of poetry. Below we offer some examples of vocal deformance of poetry readings, and consider the potential and limits of this technique for teaching and research with recordings of performative speech. As John Hyland wrote in Sounding Out! in 2014, “The act of listening to recorded poetry … poses particular analytic challenges, which become more complex when the politics of identity are brought to bear on … questions of voice and poetry.” Among these challenges are essentialist assumptions, both about identity and recording medium, which are difficult to avoid when we listen. Hyland concludes that, when we listen to recordings, “the poet’s voice falsely takes on an authoritative ‘aura,’ as Walter Benjamin used that word”; one way to counter to this is to listen to the same poem read by the poet at different points in their career, in different contexts, as Hyland does with three recordings of Amiri Baraka’s “Black Dada Nihilismus.” Another approach is to play with recordings.
The concept of deformance dates to a 1999 essay by Jerome McGann and Lisa Samuels. They take inspiration from Emily Dickinson, who sometimes liked to read poems backward, for the potential insights of reading against the form, scrambling the original sequence, and so on. According to McGann and Samuels, Dickinson’s
critical model is performative, not intellectual [. . . ]. it is anti-theoretical: not because it is opposed to theory (i.e., speculative thought), but because it places theory in a subordinated relation to practice. Deformative moves reinvestigate the terms … [of] critical commentary [, with] dramatic exposure of subjectivity as a live and highly informative option of interpretive commentary, if not indeed one of its essential features. [our italics]
Too often in the literature class room, the subjectivity of interpretation is something of a problem. While we might initially encourage a somewhat fluffy reader-response discussion of a poem, eventually we might also worry that students are simply wandering too far from it, following their own random associations with a phrase or metaphor, without learning to parse the rich intricacy of the whole poem. One effect of vocal deformance is that it makes space for the playful response, and also keeps bringing students back to the telling phrase, to the words of the poem, imagining what difference it makes if they are said in different ways, trying on different interpretations, as it were.
While vocal deformance can be applied to any performative speech, it particularly lends itself to poetry recordings. Poetry is, of course, an oral form with a fraught relationship between text and performance, and poetry reading styles are often perceived to be highly conventional, so that we feel we are listening to a Poem rather than a particular poem. From a literary and performance studies perspective, what could be more tiredly familiar than a canonical recording of a canonical poem by a canonical poet in a conventional style of poetry reading that deadens the audience to the charms and nuances of that poem? And how can we do something productive and interesting with the (sometimes extremely) idiosyncratic subjectivity of student responses to canonical texts?
As an interpretive practice, vocal deformance opens up new possibilities for testing assumptions about performance, poetic authority and gender, and, potentially, about race, class, education, region, and canonicity. Is The Waste Land (1922) the deadly serious poem that many readers often take it to be, partly because it is presented to them as an immensely influential Modernist monolith? How does T.S. Eliot’s seemingly grim reading of it, and our perception of his style, contribute to such an interpretation of the poem? After all, the working title of the poem was “He Do the Police in Different Voices,” from Charles Dickens’s Our Mutual Friend (1864-65), and it includes many different voices or speakers, from the clairvoyant Madame Sosistris to Tiresias. What better way to defamiliarize and exploit the authority of the poem than to deform Eliot’s authoritative reading voice?
How do we respond to the now-canonical voice of Eliot reading the opening lines of The Waste Land, “April is the cruellest month, breeding / lilacs out of the dead land”?
Okay. Now what if we raise his pitch? Is he suddenly his own great-aunt? What does the same lament mean, spoken by a voice that sounds like an elderly woman?
And if we leave his pitch alone, but speed up his speaking rate, does he suddenly sound like an old-school radio announcer, the poem a deranged weather forecast?
In terms of digital humanities research, a refreshing aspect of vocal deformance is that it avoids some of the easy and misleading reassurances of the empirical move. It’s not that it only clarifies what we thought we were hearing (as visualizing intonation through pitch contours can), but that it encourages multiplicity in listening.
Vocal deformance is essentially a playful strategy for defamiliarization that reminds us, in many ways, of the subjective, creative, even arbitrary nature of interpretation. In this, it has clear affinities with the OULIPO movement (which Dickinson’s practice of reading backwards presages). It may help us imagine, create and respond to alternative sequences and versions of recorded canonical texts—and to any apparently stable, singular performance of a text. The art of the glitch is one deformative practice, with the goal countering screen essentialism, the unreflective assumption that a digital artefact is immutable, stable and coherent. For an example of glitching photographs, see Trevor Owens’s “Glitching Files for Understanding: Avoiding Screen Essentialism in Three Easy Steps,” and Michael Kramer’s blog post about using audio deformance in a digital folk music history seminar at Northwestern University, “Distorting History to Make It More Accurate,” which demonstrates some potential insights gained by glitching newspaper images, photographs and music (Bob Dylan’s “Tangled Up in Blue”). John Melillo and Johanna Skibsrud’s “Two Sides for Wallace Stevens,” on Harvard’s Woodberry Poetry Room site, also offers a beguiling example of audio deformance.
Most deformative practices work with text and image, however, and the few that manipulate recordings introduce noise, skipped phrases, repetition, etc., usually without changing the acoustic features of the voice. It is well worth applying deformance more often to speech, not only in linguistics and the neurobiology of speech perception, but in humanistic study of performative speech because our perception of speech is nothing if not subjective, not to say mysterious, for two reasons.
First, our expectations of what we will hear influence what we do hear, from simple sounds to complex language comprehension. Often these expectations, which can be visual, auditory, cultural, etc., have been naturalized by the listener over time as unconscious reactions. Though many have anecdotal experience of this phenomenon (see an example about a black student, a white teacher, and a black student-teacher disagreeing on what the student said in a 2012 Sounding Out! piece by Christina Sharpe), it is has been demonstrated in many experiments as well. For instance, our perception of foreign-accented speech changes rapidly as we hear a few sentences and calibrate our internal expectations, as shown by Clarke and Garret’s 2004 study “Rapid Adaptation to Foreign-accented English.” And, according to Richard Warren’s “Perceptural Restoration of Missing Speech Sounds” (1970), “when natural speech is interrupted by noisy gaps like a cough or a slammed door, we unknowingly “fill in” the noise, vividly hearing speech sounds that do not exist acoustically. This phenomenon arises both from linguistic expectations as well as our deep familiarity with basic speech acoustics, as shown in Shahin, Bishop, and Miller’s “Neural mechanisms for illusory filling-in of degraded speech.” Similarly, in an illusion called the McGurk effect—noted by Harry McGurk and John MacDonald in 1976—just seeing a talker’s lip movements changes the perception of speech sounds categorically, say from “buck” to “duck.”
Though much of this reshaping of our acoustic perception happens unconsciously, we can also profoundly alter what we hear through selective attention. Particularly in everyday acoustic environments, we hear speech better when we expect it, and when it matches our specific expectations: from a given location, from a certain talker or type of talker, at a certain pitch, and so on (See “Speech Recognition in Adverse Conditions: A Review” by Mattys, Davis, et al. 2012). Perceptual filters fundamentally constrain our experience: if we attend to a talker in one ear, we may not even realize when a second talker in the other ear switches from English to German, as Cherry concluded in “Some Experiments on the Recognition of Speech” in 1953. Social and cultural knowledge also changes what we hear. Listening to someone whom a listener visually perceives as a “non-native speaker” can make speech sound not only more “accented” (see Donald Rubin’s “Nonlanguage Factors Affecting S Judgments of Nonnative English-Speaking Teaching Assistants” from 1992)—what we might call a subjective quality—but, as Molly Babel and Jamie Russell found in 2015’s “Expectations and Speech Intelligibility,” it can also trigger speech processing reactions that make the speech less intelligible to the listener making visual judgments regarding accented speech.
Given what we know about the brain, the fact that expectations affect perception—of recorded voices reading poems, in this case—should not come as a surprise. A growing consensus holds that the brain’s job is not merely to represent the world; rather it strives to predict the world, make inferences about it, and correct those expectations whenever a mismatch is detected (see Knill and Pouget’s “The Bayesian brain: the role of uncertainty in neural coding and computation”  and Karl Friston’s 2010 “The free-energy principle: a unified brain theory?”) In somewhat familiar environments and situations (pretty much everything after infancy), predictive inference is far more efficient than continually rendering the perceptual world de novo. This means that vocal deformance—particularly when it manipulates a known voice, as with canonical poets, or a familiar way of speaking, as with conventional poetry reading styles—waves a red flag at the brain. Change wakes up the quiescent, habitual brain to something new and potentially informative, because the voice does not fit our expectations for what the person would or should sound like. Listen to Reggie Watts!
This effect can also operate inversely; that is, if we do not expect someone to have a particular voice, we may adjust the stories we tell ourselves about our perceptions, to better match our expectations. In musicology, we might think of Nina Eidsheim’s article on the racialized reception of opera singer Marian Anderson, the first African American to sing at New York’s Metropolitan Opera:
the timbre of her voice has routinely (if often admiringly) been characterized as ‘black,’ … [despite] classical music’s minimal indulgence of individual style … this distinction [has] to be based on an assumption that the black body is intrinsically different from the white body and that even when emitting a timbre recognized as classical, the resonance of a singer’s black body is evident (3, 4).
Certainly, as Jennifer Stoever writes, “listening [is] an interpretive site where racial difference is coded, produced, and policed” (62). The same is true of gender difference and many other identity markers and cultural factors related to authority and authenticity. As Shai Burstyn notes in the article “In Quest of the Period Ear,” about attempts to imagine how contemporary audiences experienced medieval music, “culture plays a highly significant—though not exclusive—role in shaping the cognitive skills of its members” (695). If it is remarkably difficult to escape our stereotypical expectations and perceptions of what a person’s voice “should” sound like, that is partly because our brain uses such expectations to make predictions about our sonic experience. We cannot overcome our expectations through good will alone, and engaging with these issues in the classroom, which can be challenging, also provides an opportunity to help students think critically about essentialism and voice, for those moments when a student in the back of the room mutters in surprise that Langston Hughes “doesn’t sound black,” or exclaims that Walt Whitman “doesn’t sound gay.” Though it is not designed to assess stereotyping in speech perception, the Harvard implicit bias test is a good way to engage students in questions of cultural bias and perception [also, see “So You Flunked a Racism Test. Now What?”].
Furthermore, our affective responses to acoustic, non-verbal qualities of speech matter tremendously to our interpretation of verbal semantics, of the meaning of the words spoken. According to voice perception research in Foundations of Voice Studies: An Interdisciplinary Approach to Voice Production and Perception by Jody Kreiman and Diana Sidtis, when we listen to speech, “[s]ome authors … have claimed that normal adults usually believe the tone of voice rather than the words…. For example, the contrast in ‘I feel just fine’ spoken in a tense, tentative tone might be politely ignored, while, ‘I’m not angry’ spoken in hot anger would not” (304). The teacher’s boring tone of voice on the Peanuts cartoon makes the point.
In other words, we pick up on the affective meaning of a speaker’s tone of voice, and weigh it against the semantic meaning of the words spoken. While Kreiman and Sidtis argue that tone cannot be reduced to intonation patterns, “the fundamental frequency of the human voice [pitch] … heads the list of important cues for emotional meanings” (311). Pitch manipulation, then, changes the affective meaning of speech. Tone of voice is also influenced by other acoustic features, including speaking rate or tempo, and rhythm. In poetry recordings, the poet’s tone of voice influences the listener’s interpretation of a poem.
Two fundamental intonation patterns are rising or falling pitch. In American English, relatively high or relatively low pitch at the end of an utterance, compared to the beginning and middle, seems to carry distinct meanings, as demonstrated by Janet Pierrehumbert and Julia Hirschberg. They developed the ToBI (Tones and Break Indices) system for marking the prosody or intonation of speech. Rising intonation can make any utterance sound like a question, whether it is one or not. A relatively high pitch at the end of an utterance—called a high boundary tone—can make the speaker sound less confident or assertive, and more open to other’s opinions. Rising intonation implies that more is to come, that the utterance is not conclusive or concluded, that it should be understood in connection to the next utterance, and sometimes, that the speaker seeks the listener’s agreement before proceeding.
Uptalk, notably practiced among Generation Xers and now millennials, sounds conciliatory, agreeable and open, on the one hand, and lacking in confidence and authority, on the other—depending on the listener and the context. Marybeth Seitz-Brown argues that criticizing uptalk “implies that if women just spoke like men, our ideas would be valuable … [and] sexist listeners would magically understand us, and we would be taken seriously. But the problem is not with feminized qualities, of speech or otherwise, the problem is that our culture pathologizes feminine traits as something to be ashamed of or apologize for.”
Conversely, women can be criticized when they sound too much like men; see “Why Do So Many People Hate the Sound of Hillary Clinton’s Voice?” Falling intonation—and ending an utterance on a relatively low pitch, or a low boundary tone, implies conclusion, closure and confidence. The utterance, such intonation implies, finishes the argument (if there is one), does not seek the listener’s agreement or opinion, and suggests that this utterance can be understood on its own, without connection to subsequent utterances. Donald Trump, for example, is fond of falling intonation and low boundary tones (for a parody of masculine confident declarative intonation, have a listen at Troy McClure from The Simpsons.)
Of course, not all women use uptalk, and not all men use falling intonation with low boundary tones. In American culture, for better or worse, low boundary tones do seem to carry a tone of authority. And in poetry reading as well. Eliot’s original, and now canonical, reading of the opening line of The Waste Land, “April is the cruellest month,” uses falling intonation, so that it sounds like a confident assertion, with a low boundary tone on “month.”
“[B]reeding / lilacs out of the dead land” sounds like a steady, inevitable process, ending on a slightly higher relative pitch, implying that there is more to come, and that the phrase should be understood in connection to the next line, “mixing / Memory and desire, stirring / Dull roots with spring rain.”
What is so compelling and seemingly authoritative about Eliot’s reading style? In some basic sense, the falling intonation of the first phrase does it. Why does it strike many contemporary listeners as pompous? How might we undercut the seeming authority of the Eliotic voice? Make him do uptalk. Here we have simply inverted his intonation.
Suddenly he sounds doubtful. The opening line becomes a question—“April is the cruellest month[?]”—instead of a confident statement. Suddenly, Eliot himself expresses the skepticism or confusion many an undergraduate has felt—before we encountered this poem, did we not assume that spring, the return of life and fertility, is a cheerful escape from winter? And his deformed recital of “breeding / Lilacs out the dead land” suddenly sounds more like an agonized complaint, expressing the painful, reluctant awakening of desire in one who had found the dull sleep of winter comforting. Inverting the typical poetic authority of falling intonation into uptalk may embolden readers to entertain very different readings of the poem’s opening.
The editors of Poetry Archive had hopes of stimulating listeners of The Waste Land when they made available a 1935 recording of the poem, claiming: “Whilst the sound quality is understandably not so good, the recording is fascinating for Eliot’s faster, more energetic rendition. Listening to this urgent interpretation blows the dust of this iconic poem and helps us encounter it afresh.” However, if the fundamental falling intonation pattern of Eliot’s reading style doesn’t change—and overall, it doesn’t, between the 1935 and 1946 recordings—his voice may remain, for listeners, an aloof poetic authority.
Falling intonation with low boundary tones, then, is a fundental tone of poetic authority. Listen to Adrienne Rich reading from her poem, “What Kind of Times Are These” (1995), which leads the reader to a place “between two stands of trees … near … [where] the persecuted / … disappeared into the shadows.” She insists, “this isn’t a Russian poem, this is not somewhere else but here,” and concludes, “to have you listen at all it’s necessary / to talk about trees.”
She sounds like she means it. Rich has to write poems about nature, her tone implies, to wake people up to the political horrors of the American past and present. Poetry as a form, in pastoral guise, allows her to sneak in political content, potentially grabbing the attention of people who might only listen to poetry if they think it is safely, simply about nature. (Click here to hear the entire poem, starting at 4:01.)
When we invert her intonation, turning it into uptalk, she sounds as if she is questioning the wisdom of this approach, and/or chiding her listeners for making her take it. In this case, uptalk exerts a different kind of authority, the challenging question.
Is it ethical to manipulate the intonation and other vocal qualities on poetry recordings, for the purposes of teaching and research? Obviously it would not be, if we were to present the manipulated recordings as the authentic voice of a poet. And all peoples have the right to protect culturally sensitive recordings, such as sacred songs, music, dances and prayers; see “Native American Intellectual Property Issues.” Otherwise, potential conflicts are similar to those with sampling in the music industry (See Kembrew McLeod and Peter DiCola’s Creative License: The Law and Culture of Digital Sampling ). Vocal deformance, however, can help remind us that no single reading of a poem, by the poet or someone else, is the ultimately authoritative one.
In teaching writing, we (the authors) sometimes ask our students to explore alternative methods of presenting the same material. This can be as simple as writing the same sentence, the thesis for instance, in three different ways, or it can involve a different format. Write a poem, record oneself reading it, then try to represent it with a collage of images. Turn a 2,000-word essay into a 250-word presentation with verbal and sonic components. An instructive trick with the opening line of W.H. Auden’s “Musée des Beaux Arts” (1940): “About suffering they were never wrong, the Old Masters.” They were never wrong, the Old Masters, about suffering. The Old Masters were never wrong about suffering. Each version of the opening creates a subtly different emphasis, on suffering versus the wisdom of the Old Masters.
Too often, we lock ourselves into one approach, and cannot imagine an alternative. Locked into one approach, too often we cannot imagine an alternative. Alternatives we cannot imagine, if we lock into one approach too quickly. Writing three different opening paragraphs to the same essay, or rearranging the lines of the poem, stimulates our imagination and our critical faculties because it dramatizes different possibilities, possibilities that offer a different emphasis. And when we play with the pitch, intonation and speaking rate of a poem, this can change the tone as dramatically, from a challenge to confession, or an assertion into doubt.
In the classroom, poet Harryette Mullen is often popular with students, both for her poems on the page and for her expressive reading style, while students can sometimes resist recordings by Adrienne Rich (saying that she sounds lecture-y) and Louise Glück (saying that she sounds bored by her own poems), even as they are engaged by the poems on the page.
When Mullen reads “Present Tense” (2002)—a beguiling comical poem, loosely about the grammatical present and the speaker’s and the world’s present circumstances —what is it about her contrastive intonation that sounds expressive? She ends her opening phrases, “Now that my ears are connected to a random answer machine” with rising intonation and high boundary tones. This draws the reader on: keep listening, the statement’s not finished.
When we flip the intonation pattern, so that each utterance ends on a relatively low pitch, she sounds more conventional, a poetic authority declaring observations, confident and closed off.
Another tone of poetic authority approaches pure monotony. It was practiced by Alfred Lord Tennyson, Irish modernist poet W.B. Yeats and, perhaps through Yeats’s influence, by American poets such as Yvor Winters. Note how similar they sound here. Winters reads the opening of his poem “The Journey” (1931), moving into a Yeatsian monotone after the title and location of Snake River Country, “I now remembered slowly how I came[.]”
Here is Yeats reading the opening of “The Lake Isle of Innisfree”: “I will arise and go now, and go to Innisfree[.]”
All we have to do to turn Winters into Yeats is raise his pitch a bit:
Monotone performance is—at least acoustically—quite uninformative for the brain. Early parts of the auditory brain rapidly adapt or habituate to a wide array of regularities such as pitch and temporal pattern, and they only signal when the pattern changes, as noted in “Early selective-attention effect on evoked potential reinterpreted” (Näätänen, Gaillard et al., 1978). But expectations can work differently across speech’s descriptive dimensions. When speech is usually vivid, as in a direct quotation (“He said ‘I’m leaving now”), higher-level voice-processing areas in the right temporal lobe actually work harder to process (unexpectedly) monotone quotes, according to Yao, Belin, et al.’s “Brain ‘talks over’ boring quotes: top-down activation of voice-selective areas while listening to monotonous direct speech quotations” (2012). In other words, sameness of pitch often means the brain must work harder to grasp meaning.
Interestingly, David Hadbawnik relates in Sounding Out! his disappointment with the productions of three audio recordings of three poetic specimens from Middle English created with SPARSAR, because “they produced monotone outputs that fail to account for prosody.” Vocal deformance might allow him to try to approximate Middle English prosody with the specimens.
MacArthur has written elsewhere about “poet voice,” which she also calls “monotonous incantation.” But how close are contemporary canonical poets to actual monotone, compared to Tennyson and Yeats? Here is Glück, whose reading style is often mentioned as an example of Poet Voice, reading the third stanza of “The Wild Iris” (1992): “It is terrible to survive / as consciousness / buried in the dark earth”:
Not much manipulation is required to make it purely monotone, which may account for some students saying she sounds bored by her own poems—though they do not say that about Yeats. They say Yeats’s voice makes them feel like they are in church.
Ideally, Glück’s manner of reading her poem should not prevent students from appreciating it. While in other contexts we may defend women’s use of uptalk, it also seems fair to raise the point that academic poetry reading can seem to discourage the expression of affect. (See Donald Hall’s well-known polemic, “The Poetry Reading: Public Performance / Private Art” (1985) and David Groff’s “The Peril of the Poetry Reading: Page Versus Performance” .) Vocal deformance, among other strategies, might help students perceive as much drama in Glück’s poems as they do in Mullen’s—and find as much as poetic authority in both poets’ voices as they do in Yeats’s churchy one. Here, we’ve manipulated Glück’s voice to sound more like Mullen’s style of reading, with a wider pitch range and rising intonation and high boundary tones.
If we want to explore alternatives to conventional modes of reading poetry, as many do, directly deforming the acoustic qualities of canonical recordings is an excellent way to defamiliarize performance conventions. Ideally, it can help us listen to alternate versions of the history of poetic performance and to different, unimagined possibilities in the present. Given the extraordinary vitality of spoken word and slam poetry outside the academy, it would be a missed opportunity to suppress varied reading styles in the classroom. At the same time, it would be a great shame to leave behind canonical American poetry when the poets’ reading styles fail to appeal to students.
Finally, if we want to liberate students from the anti-performative tendencies of academic culture, resist essentialist readings of poems according to our assumptions about the identities of the poets who wrote them, and dramatize the idea that there are many ways to read a poem, vocal deformance can help, alongside other strategies. As Yvon Bonenfant wrote in a 2014 Sounding Out! piece, “we are mostly neurotic, or otherwise hung up on, what kinds of sounds we make, where and when.”
Instead, let’s play in different voices.
NOTE: To illustrate vocal deformance, we used Straight, a state-of-the-art open-source voice synthesis program developed by Hideki Kawahara at Wakayama University in Japan, with the Advanced Telecommunications Research Institute and the Auditory Brain Project. We also used Drift, an open-source pitch-tracking tool that uses an algorithm developed by Byung Suk Lee and Dan Ellis, implemented by Robert Ochshorn and Max Hawkins with support from MacArthur’s ACLS Digital Innovations Fellowship in 2015-16, to visualize intonation with pitch contours.
Marit J. MacArthur is associate professor of English at California State University, Bakersfield, and a research associate in Cinema and Digital Media at the University of California, Davis.
Lee M. Miller is associate professor of Neurobiology, Physiology, & Behavior at the University of California, Davis, and technical director of the Center for Mind and Brain.
Featured Image: Cropped and Enlarged version of Bill Smith’s “Voice Glitch,” Attribution 2.0 Generic (CC BY 2.0)
A Listening Mind: Sound Learning in a Literature Classroom–Nicole Furlonge
Audio Culture Studies: Scaffolding a Sequence of Assignments–Jentery Sayers
Each of the essays in our “Medieval Sound” forum focuses on sound as it, according to Steve Goodman’s essay “The Ontology of Vibrational Force,” in The Sound Studies Reader, “comes to the rescue of thought rather than the inverse, forcing it to vibrate, loosening up its organized or petrified body (70). These investigations into medieval sound lend themselves to a variety of presentation methods loosening up the “petrified body” of academic presentation. Each essay challenges concepts of how to hear the Middle Ages and how the sounds of the Middle Ages continue to echo in our own soundscapes.
Read all the previous posts here, and, HEAR YE!, in April 2017, look for a second series on Aural Ecologies of noise! –Guest Editors Dorothy Kim and Christopher Roman
As humans, we engage all of our senses in every undertaking, whether or not we consciously perceive our sensory interactions. For instance, when we consume a gourmet meal, we don’t simply taste the food—we also see it, smell it, and feel it. We might also hear it as it is being prepared and/or consumed, and the meal’s pleasure can be enhanced by conversation. Overall, our experiences are enriched (or worsened) through our multisensory engagement. Similarly, reading involves multimodal feedback. While we might think of it as solely a visual experience, both auditory and tactile interactions occur within the process. As The Handbook of Multisensory Processes (518) tells us, audiotactile (sound+touch) and visuotactile (sight+touch) interactions are of great functional importance as they link remote senses to the body.
Thus, our interactions with everyday objects are multisensory, even if we do not consciously realize that fact. Arguably, although the sense of hearing is the first to develop in the womb, it is often the sense we overlook in solitary pursuits such as reading. Nevertheless, every human action occurs within a soundscape, much like they take place within a landscape. A soundscape is “an environment of sound with emphasis on the way it is perceived and understood by the individual, or by a society. It thus depends on the relationship between the individual and any such environment” (Handbook for Acoustic Ecology, 1978). Like landscapes, soundscapes must be considered in context and in relation to multisensory experience.
In particular, audio-visual interactions have been shown to have an effect on soundscape perception. Soundscape design elements reflect this concern. For example, a plan might include adding fountains, both as a noise control element and as a deliberate introduction of a soothing sonic feature. At the same time, fountains are a managed version of a natural element (water) that incorporate certain visual effects (e.g. reflective space and sparkling sunlight) creating textured and appealing landscapes. How the element is introduced into the environment has an effect on the perception (and appreciation) of the space. Furthermore, the touch of cool, clean water can supplement the overall impression, heightening the soothing effect initiated by the tinkling sound of the moving water. This is an audiotactile experience; that is, sounds connected with the sense of touch, an ecological system that combines the haptic and the aural.
Although the field of audiotactile integration has been somewhat dormant in the biological sciences since Paul von Schiller suggested back in 1932 that sounds, especially patterned noises, could affect tactile perception of roughness, recently some researchers have conducted experiments that test audiotactile qualities of materials. Several have suggested that these results might be synthetic—that is, the impact the sounds have modulate the haptic perception of the material being touched. For the most part, there seems to be connections between the perceptions of the sounds involved in touch and the perceptions of the stiffness of material. However, one study demonstrates that synchronized movements and sounds can affect the perception of the subject’s own skin. Suffice it to say, then, that sounds and texture and material quality are linked, both physically and perceptively.
Although humans rarely display deliberate awareness of audiotactile interaction, both auditory and haptic stimulation share similar temporal and psychological patterns in human consciousness. This connection would have perhaps been even more true in the Middle Ages than it is now, since the context of parchment and manuscript production and consumption was more immediately personal than paper production and reading is today.
To understand both the historicizing of the senses and the impact of shifting modes of literacy, it is possible to recreate some of the former immediacy of parchment production. During the summer of 2015, I participated in a National Endowments for the Humanities Seminar on Manuscript Materiality. This occasion provided me with the opportunity to make a manuscript page replica, starting from the “ground up” with the creation of parchment. We also studied theory, page layout, and other material circumstances, allowing us to really think about how people—both medieval and modern—engage with manuscripts using their senses. Elsewhere I have discussed the sense of touch. Here, I want to extend that discussion to include the sense of hearing; that is, I will focus on the sounds of parchment-making and parchment-reading, as activated through touching.
Parchment (Latin pergamenum) is the general term used for an animal hide that has been prepared for writing. Vellum (Latin vitulinum) more specifically refers to prepared calfskin. Parchment is made through an extended process of skinning, cleaning (de-fleshing and de-hairing), stretching, and scraping. It is stretched and scraped on special frames with adjustable screw pegs. The parchment maker scrapes the skin to the desired thickness with a curved tool, adjusting the pegs as the skin dries and changes texture. Often the skin is rewetted, scraped, and stretched numerous times in order to achieve the desired thickness. Sometimes a pumice stone finish is used at the end to create a surface porous enough to accept and retain ink. This is a vastly different process than tanning, which involves chemical alteration of the skins.
In the seminar, our parchment master was Jesse Meyer of Pergamena. He provided tools, guidance, and expertise as we participants stumbled through the process. Parchment making is hard, smelly work. My hands ached after only a few go-rounds with the tools, which included a pumice-like concoction of over-baked bread mixed with ground glass, knives that had been reshaped and re-handled, and Jesse’s special skin-refiner tool discussed below. Jesse told us many eye-opening things over the days of parchment making; however, possibly one of the most intriguing was how parchment masters could make a parchment “sing,” and how they, through this sound, knew whether or not the skin had reached its full potential.
So when Jesse demonstrated the various techniques on his sample skin, I listened carefully to the sounds he made by scraping as well as watching what he did with his hands. As he scraped away, the parchment did indeed sing. You can hear it yourself:
Audio Clip of Jesse’s “Parchment Singing”
This aspect of parchment making fascinated me. After our seminar was done, I got back in touch with Jesse to talk further about the sounds of parchment making. He was more than forthcoming about his experiences. When Jesse first read about medieval parchment making, he ran across several mentions of the “ringing” sound that parchment masters produced when shaving their skins. He, like many of us, had never considered that aspect before. So he started paying attention to the different sounds he made as he used various tools on the skins. Right now, he uses a handmade tool that consists of a saw blade shaved to his specificity with a handmade handle on it. Each of us got a chance to hold it and try it on our own skin. It was an unwieldy tool for the uninitiated, and my parchment did not “sing” like Jesse’s did.
Jesse is an expert, but even he says he cannot quite tell the nuances among the different types of skins by sound alone, although he notes that there are similarities among the types. Perhaps that skill could be developed over years of working solely with parchment, as a master in the Middle Ages would have. What Jesse has shared, however, is valuable: thicker skins are not as flexible, but they produce a “better,” that is clearer, sound.
Thickness of parchment can be due to a number of factors including preparation technique, but also the age and type of animal. Older animals yield thicker skins. Thicker skins are usually smoother and yield a cleaner sound. Of course if the animal has been injured or diseased, the skin may not be smooth. The firmer and tighter a skin is, the denser it is, and the easier it is to shave as well. In fact, Jesse says that denser skins can also make full, warm, “drum-like” sounds. Even more intriguingly, when I asked Jesse if he had ever noticed a difference between the hair side and the flesh side of skins, he said yes: the hair side of parchment sounds better to him because it is cleaner once the hair has been removed. The flesh side often retains fibrous bits even after many scrapings, and produces a more diffuse sound. When this side is scraped, it leaves behind a “fuzzy” residue until it has seen many passes with the scraper.
The relative dryness and “freshness” of the skin can also alter the clarity of the “ringing.” For example, compare the sound from the freshly prepared goatskin last summer to the sound from scraping a drier goatskin in Jesse’s workshop:
Repeat of first audio clip of Jesse’s “parchment singing”
Comparison clip of Jesse making a drier parchment sing
Both of these clips were produced from goatskins, and both were produced by Jesse who also used the same tool on each. While the sound is very similar, and both ring true, the drier skin produces a clearer, purer sound.
Plainly, then, the sounds of parchment making are vital to quality production. Most medieval manuscripts are made of three types of animal skins: sheep, goat, and calf. The prevalence of each animal is geographically dependent (sheepskin is more common in England, calfskin in France, and goatskin in Italy), although of course manuscripts traveled, and wealthy patrons commissioned materials they preferred. (see “DNA May Reveal Origins of Medieval Manuscripts” from Livescience)
Age, breed, size, and animal health can all contribute to the audiotactile qualities of a skin. However, there are some general guidelines. Sheepskins, for instance, are stretchier than goat skins, so their “ringing” can be muffled. Goatskins, which are thinner and stiffer, make a higher pitched “ring.” Calfskins are larger and easier than the others to get clean, and thus often make the cleanest “ring” and can do so more quickly than the others.
The type of skin is not the only factor in play. A rough blade would have produced a rough sound; conversely, an even, sharp blade would have produce the cleanest sound. The tautness of the skin in the frame can also affect pitch and tone, as can its dryness, its fatty content, and the age of the animal. As noted in a recent study, “The density of collagen fibrils in calf and goat parchment, compared with a more open weave and higher fat content in sheep parchment, favors the former two species [for producing the finest parchment]” (15070). Nevertheless, master parchment makers should have been able to manipulate any skin to produce superior results. As long as the corium (the dermis layer of skin containing all the connective tissues, including collagen, elastic fibers, hair follicles, sebaceous glands, blood vessels, and a number of other components) is sufficiently ground down, the parchment produced can be made very fine. And as the corium wears away, the sound of the scraping grows ever cleaner and clearer, just as the feel of the skin grows ever smoother. Overall, the tone gets higher as the skin gets thinner, and as long as the scrape is even, the tone should remain pure.
Here’s a video of Jesse scraping a calfskin. Compare this clear “ringing” to the goatskin scrapings:
That’s all well and good for production of manuscripts, but texts are made for consumption. Earlier, I mentioned my having considered the qualities of hapacity and manuscripts involving Christ’s side wound in another blog post. One of the manuscripts I discussed, London, British Library, MS Egerton 1821, was clearly designed to be touched. This unusual manuscript contains a number of woodcuts that reflect devotion to the wounds of Christ, but, more strikingly, opens with three pages painted black, covered in drops of paint meant to emulate Christ’s flowing blood.
After a series of woodcuts, seven more pages appear. These are painted red with darker red paint splatters representing drops of blood. Although these pages do not contain specific images, they are meant to evoke interactive piety. The reader is invited to touch Christ’s wounds while praying or meditating. The worn appearance of Folio 2r demonstrates just how frequently the pages were touched and rubbed.
How did those pages feel to a medieval reader? Did they feel rough or smooth? Did the reader feel a frisson of excitement? Animal skin, such as parchment, carried with it the essence of the life of the animal, thus imbuing the images painted onto it with some semblance of life force, such as suggested by Thomas Aquinas in his Question 8 (Summa Theologica) regarding the potential for divinity placed within material objects. To a certain extent, then, touching an image of Christ was akin to touching a proxy of his body, allowing a powerful and individual haptic experience of faith. But what about the sounds made when these images became the subject of interaction? Was the medieval reader aware of touching the page, touching Christ’s wounds, even more because he or she would hear the interaction?
I took it upon myself to rub the worn folio in Egerton 1821. I did so reverently, if not because I felt a mystical connection to Christ, but because I felt awed at being able to reproduce a medieval experience (albeit 500 years later). There was a distinct sound, which you can hear in this clip:
Clip of the author rubbing the worn folio in Egerton 1821
I was surprised at the resultant sound. The worn part of the page looked soft, and the paint splatters looked cracked. Instead, to my surprise, the worn portions felt rougher than the cracked paint. Like the modern studies demonstrated, my audiotactile perceptions were altered initially by what I saw, but then by what I heard. At first, I touched hesitantly, but when the sounds produced became rhythmic, my hands felt smoother and the noise sounded more even. If I were repeatedly rubbing the same spot, in the same manner, producing the same sounds—much as a medieval reader might have done—the combined sensations would likely have produced a soporific and meditative state. That is, combining touch, particularly of a textured surface, with measured reading might have resulted in the ideal perceptive state for experiencing an immersive religious experience.
If, as numerous studies have demonstrated, vibrotactile stimuli can facilitate hearing, both for those with and without hearing impairments, then the sounds hand and fingers make when exploring a surface must contribute to an individual’s haptic perception and vice versa. How, then, would this connect on the behavioral or emotional level? Researchers have been exploring the reciprocal interactions of the auditory, tactile, and visual (sometimes referred to as cross-modal effects), often concentrating on sensory thresholds, information processing performance, and spatial navigation; however, only recently are studies beginning to investigate the emotional and physical benefits of such exchanges. For instance, one such study suggests auditory-tactile stimulation as a means to increase health and well-being.
Thus, a combination of touch felt by a reader with sound heard by a reader at the same time might influence the reader’s state of mind in a positive way, resulting in a positive effect on the body as well. A desired state can be reached more quickly through an audiotactile combination, resulting in a sensory illusion (perceiving something not physically extant but mentally present)—a powerful manner of evoking emotion. Similarly, the positive physical effects include relaxation, stress relief, and sleep enhancement. Again, this seemingly suggests that multisensory integration, especially the combination of touch and sound, might have produced a mental state in the (medieval) reader that made them particularly receptive to spiritual experience.
I would suggest, then, that as medieval scholars, we should examine how audiotactile events are processed during dynamic contact between hands and material. Since different sensory modalities are integrated in the human brain to form our perceptions as a whole, including spatial and temporal relationships, it is important that we consider multisensory interactions that code the location of external events relative to our own bodies. Thus, to think through the process of making, touching, and hearing medieval parchment opens up a lot of possibilities for the study of medieval materiality—indeed for materiality in general as a field. The importance of the whole body sensory experience, including hearing, in reading is something we need to continue to imagine, to reimagine, to recreate, and to explore.
Michelle M. Sauer is a professor at the University of North Dakota in the English Department. She recently released her latest book, titled Gender in Medieval Culture (Bloomsbury, 2015).
REWIND!…If you liked this post, you may also dig:
Mouthing the Passion: Richard Rolle’s Soundscapes–Christopher Roman
EPISODE LI: Creating New Words from Old Sounds–Marcella Ernest, Candace Gala, Leslie Harper, and Daryn McKenny