Tag Archive | Jonathan Sterne

Stir It Up: From Polyphony to Multivocality in A Brief History of Seven Killings

For many, the audiobook is a source of pleasure and distraction, a way to get through the To Read Pile while washing dishes or commuting. Audiobooks have a stealthy way of rendering invisible the labor of creating this aural experience: the writer, the narrator, the producer, the technology…here at Sounding Out! we want to render that labor visible and, moreover, think of the sound as a focus of analysis in itself.

Over the next few weeks, we will host several authors who will make all of us think differently about the audiobook selections on our phone, in our car, and in our radios. Last week we listened to a book that listens to Dublin, in a post by Shantam Goyal. Today we have seven narrators telling us the story of an assassination attempt on Bob Marley. What will the audiobook whisper to us that the book cannot speak?

—Managing Editor Liana Silva


Reviews of A Brief History of Seven Killings, Marlon James’ 686-page rendering of the echoes of an assassination attempt on Bob Marley, almost invariably invoke the concept of polyphony to name its adroit use of multiple narrators. In The New York Times, Zachary Lazar maintained that the “polyphony and scope” of the 2014 novel made it much more than a saga of drug and gang violence stretching from 1970s Kingston to 1990s New York. And the Booker Prize, which James was the first Jamaican to win, similarly praised it as a “rich, polyphonic study,” with chief judge Michael Wood calling attention to the impressive “range of voices and registers, running from the patois of the street posse to The Book of Revelation.” It was thus not only the sheer number of voices in a preliminary three-page “Cast of Characters” that critics so unanimously admired but also the variety and nuance evident within them. Norwegian publisher Mime Books even took these polyphonic features a step further by hiring not one but twelve translators in a casting process that auditioned prominent novelists, playwrights, and performers.

Cover of the book, under fair use

James recalls realizing early on that this novel would be one “driven only by voice” (687), which might make such enthusiastic responses to its plurality of perspectives seem unsurprising. But what happens when such polyphony leaves the page behind and actual material voices drive its delivery? If the audiobook is a format of the novel (and here I follow Jonathan Sterne’s definition of format in MP3: The Meaning of a Format as “a whole range of decisions that affect the look, feel, experience, and workings of a medium” [7]), what lessons can listeners learn that print cannot provide? As I argue, the 26-hour-long audiobook version of A Brief History, which Highbridge Audio produced with seven actors (Robertson Dean, Cherise Boothe, Dwight Bacquie, Ryan Anderson, Johnathan McClain, Robert Younis, Thom Rivera), allows us to engage with multivocality rather than polyphony, which is to say the multiple vocal performances of a single individual rather than the presence of many narrators within a print work. And just as this novel’s polyphonic structure destabilizes any attempt at a definitive account of the events it portrays, the multifaceted performances of its audio format work to untrain ears that have been conditioned to hear necessary ties between voices and bodies. 

Of course, this effect is not one that most listeners consciously seek, as reviews of the audiobook articulating various reasons for turning to this format as well as diverging responses to it readily attest. Gayle, on Audible, began with the print version: “but as soon as I got to the first chapter that was written in Jamaican patois I knew that I was not able to do that in my head and I was going to miss a lot.” Sound here conveys sense more swiftly than the page, the ear apparently better suited than the eye to encounter difference. (Woodsy, another reviewer, even felt emboldened to ventriloquize in text that sonically distinctive speech: “I found that listening to the Audible version was helpful. Now all me need do is stop thinking in Jamaican.”) Yet it was Andre who offered by far the most memorable characterization of the audiobook and its affordances. As he explained, in James’ novel “the language is a thick, tropical forest of words. Audiobook is the machete that slices through this forest of words so I can enjoy the treasures inside.” The violence of this metaphor matches that of the novel’s most disturbing scenes, yet what is most striking is the way it reiterates once more how reviewers found it easier to access the work aurally rather than visually.

“Studio Microphone” by Richard Feliciano, CC BY-NC-ND 2.0

These reviews, and other similarly favorable appraisals, rarely consider the audiobook on its own terms, insisting instead on comparisons with the text. Negative ones, however, often note distinctively sonic features, with some reviewers echoing one of the Booker judges—who reportedly consulted a Jamaican poet about the accuracy of James’ ear for dialogue—by questioning the veracity of the Jamaican accents in a novel that also features American, Colombian, and Cuban ones. Tending to readily identify themselves as Jamaican, these writers and listeners rarely acknowledge that at least some of the actors were born on the island when asserting that the accents are off. In any case, such efforts to link sound and authenticity, as Liana Silva has argued with respect to the audiobook, wrongly suggest that those who belong to a group must conform to a single sound. James, too, distrusts discourses of the authentic, as characters repeatedly cast suspicion and scorn on anyone uttering the phrase “real Jamaica.”

If the polyphony in James’ novel prevents any one perspective from becoming either representative or definitive, the audiobook pushes this process even further by demonstrating how a single performer’s voice can possess such range that it seems to contain multiple ones. Each performer is responsible for all the voices within the sections narrated by their primary characters, which means that the same character can occasionally be voiced by different actors. In one section, a performer does the voices of a tough-talking Chicago-born hitman and the jittery Colombians he speaks with in Miami; in others, that same performer is both a white Rolling Stone journalist from Minnesota who’s attuned to racial difference and the black Jamaicans he converses with in Kingston. Continuity or strict one-to-one correspondences between performer and character ultimately matter less than the displays of vocal difference that allow the audiobook to contest essentialized notions of voice.

As a result, the audiobook articulates just how constructed vocal divisions based on race, gender, and class are by having its performers constantly cross them. It amplifies the very arbitrariness of such divisions and thereby reveals how, if the page is the space of polyphony, then what the audiobook stages is multivocality. Although they might seem like synonyms, these two terms can actually help us appreciate crucial differences and, in doing so, highlight the specificity of the audio format. On the one hand, –phony or phōnē, as Shane Butler reminds us in The Ancient Phonograph, ambiguously refers to both voice and the human capacity for speech (36), whereas –vocality centers the voice. On the other, the shift from the Greek poly- to the Latin multi- signals a contrast in what gets counted: while polyphony names the quantity of perspectives contributing to a narrative (when introducing it in Problems of Dostoevsky’s Poetics, Mikhail Bakhtin emphasized that polyphony consisted of “a plurality of independent and unmerged voices and consciousnesses” [6]), multivocality instead specifies how the number of voices can exceed the number of performers. In this way, the concept of multivocality outlined here with respect to the audiobook resonates with its use in another context by Katherine Meizel, who mobilizes it with reference to singing and the borders of identity. In both cases, voice names a multiplicity of practices rather than an immutable or inevitable expression, which in turn aligns with Nina Sun Eidsheim’s argument in The Race of Sound about the voice being not singular but collective and not innate but cultural (9).

“Last Exit (Recording Studio)” by Flickr user Drew Ressler, CC BY-NC-ND 2.0

We can therefore say that where print-based polyphony works on the eye by placing various perspectives on a page without necessarily challenging visual perceptions of difference, multivocality in the audiobook can retrain an ear’s culturally ingrained ideas about voice. James himself has experience with these seemingly inescapable meanings assigned to vocal sounds. In a moving essay for The New York Times Magazine, he recounts how, even at the age of 28, “I was so convinced that my voice outed me as a fag that I had stopped speaking to people I didn’t know.” That was already long after high school, when, as he remembered in a New Yorker profile, he had begun “tape-recording his efforts to sound masculine, repeating words like ‘bredren’ and ‘boss.’” He was well aware of the links that listeners created between voice and identity and that could, as he suggests, prove risky in a place with overt homophobia like Jamaica. Writing, however, offered him a space to take on any voice and, at the same time, not be concerned with the sound of his own.

Yet if the page allowed James to effortlessly shift among narrative voices, the audiobook format exhibits voices that ostensibly shift without any effort. Perhaps the most compelling example emerges in the work of Cherise Boothe, whose performance of the novel’s sole female primary character presents the voices of other figures as well. Toward the end of the novel, this character, Dorcas Palmer, is a caretaker for a much older and wealthier white man with amnesia in New York. Boothe not only captures the changes as Palmer often eliminates her Jamaican accent and occasionally lets it loose but also registers the man’s moments of lucidity and confusion. Even if, as listeners, we understand that Boothe is the voice behind both of these characters, the two vocal performances are so distinct that they effectively erode the basis for any beliefs about how a certain body should sound.

Adopting different voices is certainly not unique to the audiobook, but it does provide one of the few forms of extended exposure to this practice. Yet it is worth noting that A Brief History markedly differs from the model of a more extensive cast like the one comprised of 166 voices that recorded George Saunders’ Lincoln in the Bardo. By assigning a performer to every character, such productions ultimately emphasize vocal uniqueness in roughly the same way that Adriana Cavarero conceives it, namely as an index of individuality. But there the voice remains something singular or somehow essential, for there is no opportunity to perform the plurality that appears across A Brief History. At the same time, the use of seven actors also offers a contrast with the opposite extreme: a single performer responsible for all the roles, which demonstrates multivocality but does so on such a small scale that it feels exceptional instead of ordinary. The middle ground, which is to say the model found in A Brief History, allows us to hear multiple instances of how the voice is entrained rather than essential, possibility rather than inevitability.

Screenshot from Youtube video “Marlon James: A Brief History of Seven Killings” by Chicago Humanities Festival

When briefly addressing audiobooks in an interview, James remarked that this format possesses a distinct advantage: “even something that is not necessarily plain can be translated because of tone and symbol and voice.” In other words, a voice can register its changing surroundings; conveying these subtle transformations on the page, however, is often far more difficult. This shortcoming is one that Edward Kamau Brathwaithe once memorably described when explaining why he insisted on using a tape recorder in a lecture on language in the Caribbean: “I want you to get the sound of it, rather than the sight of it.” The idiomatic familiarity of the first half, which clashes so sharply with the awkwardness of the second, suggests that the multivocality of an audiobook can open ears by accentuating how the voice is not fixed but in constant formation.

Featured Image: “Audiobook” by Flickr user ActuaLitte, CC-BY-SA-2.0

Sam Carter is a PhD Candidate in Romance Studies at Cornell University. His work on literature and sound in the Southern Cone has appeared in Latin American Textualities: History, Materiality, and Digital Media and is forthcoming in the Revista Hispánica Moderna.

tape reelREWIND! . . .If you liked this post, you may also dig:

SO! Reads: Jonathan Sterne’s MP3: The Meaning of a Format–Aaron Trammell

Radio de Acción: Violent Circuits, Contentious Voices: Caribbean Radio Histories–Alejandra Bronfman

Scenes of Subjection: Women’s Voices Narrating Black Death“–Julie Beth Napolin

Look Away and Listen: The Audiovisual Litany in Philosophy

This is an excerpt from a paper I delivered at the 2017 meeting of the Society for Phenomenology and Existential Philosophy.

“Compressed and rarefied air particles of sound waves” from Popular Science Monthly, Volume 13. In the public domain, via Wikimedia Commons.

According to sound studies scholar Jonathan Sterne in The Audible Past, many philosophers practice an “audiovisual litany,” which is a conceptual gesture that favorably opposes sound and sonic phenomena to a supposedly occularcentric status quo. He states, “the audiovisual litany…idealizes hearing (and, by extension, speech) as manifesting a kind of pure interiority. It alternately denigrates and elevates vision: as a fallen sense, vision takes us out of the world. But it also bathes us in the clear light of reason” (15).  In other words, Western culture is occularcentric, but the gaze is bad, so luckily sound and listening fix all that’s bad about it. It can seem like the audiovisual litany is everywhere these days: from Adriana Cavarero’s politics of vocal resonance, to Karen Barad’s diffraction, to, well, a ton of Deleuze-inspired scholarship from thinkers as diverse as Elizabeth Grosz and Steve Goodman, philosophers use some variation on the idea of acoustic resonance (as in, oscillatory patterns of variable pressure that interact via phase relationships) to mark their departure from European philosophy’s traditional models of abstraction, which are visual and verbal, and to overcome the skeptical melancholy that results from them. The field of philosophy seems to argue that we need to replace traditional models of philosophical abstraction, which are usually based on words or images, with sound-based models, but this argument reproduces hegemonic ideas about sight and sound.

For Sterne, the audiovisual litany is traditionally part of the “metaphysics of presence” that we get from Plato and Christianity: sound and speech offer the fullness and immediacy that vision and words deny. However, contemporary versions of the litany appeal to a different metaphysics. For example, Cavarero in For More Than One Voice argues that the privileging of vision over sound is the foundation of the metaphysics of presence. “The visual metaphor,” she argues, “is not simply an illustration; rather, it constitutes the entire metaphysical system” (38). The problem with this videocentric metaphysics is that it “legitimates the reduction of whatever is seen to an object” (Cavarero 176) and it cannot “anticipate” or “confirm the uniqueness” of each individual (4). In other words, it objectifies and abstracts, and that’s bad. If vision is the foundation of the metaphysics of presence, one way to fix its problems is to replace the foundation with something else. Cavarero thinks vocal resonance avoids the objectifying and abstracting tendencies that images and text supposedly lend to philosophy.

Similarly, in the same way that the traditional audiovisual litany “assume[s] that sound draws us into the world while vision separates us from it” (Sterne 18), Barad’s argument for agential realism in Meeting the Universe Halfway assumes that diffraction draws theorists into actual contact with matter while “reflection still holds the world at a distance” (87). Agential realism looks is the view that even the most basic units of reality, like the basic particles of matter, exercise agency as they interact to form more complex units; diffraction is Barad’s theory about how these particles interact. This litany of distance-versus-relationality and external objectivity versus immersive materiality structures Barad’s counterpoint between reflection and diffraction. For example, she contrasts traditional investment in reflective surfaces—“the belief that words, concepts, ideas, and the like accurately reflect or mirror the things to which they refer-makes a finely polished surface of this whole affair” (86)–with diffractive interiorities, which get down to “the real consequences, interventions, creative possibilities, and responsibilities of intra-acting within and as part of the world” (37). But how do we know Barad is appealing to an audiovisual litany? We know because her fundamental concept–diffraction–describes the behavior of waveforms as they encounter other things, and 21st century Western scientists and music scholars think sound is a waveform. When two or more waves interact, they produce “alternating pattern[s] of wave intensity” or “increasing and decreasing intensities” (Barad 77), like ripples in water or alternating light frequencies.

“diffracted hydrogen” by Flickr user candace, CC BY 2.0

Barad appeals to notions of consonance and dissonance to explain how these patterns interact. For example, when diffracting light waves around a razor blade, “bright spots appear in places where the waves enhance one another-that is, where there is ‘constructive interfer­ence’-and dark spots appear where the waves cancel one another-that is, where there is ‘destructive interference’” (Barad 77). This “constructive” and “destructive” interference is like audio amplification and masking: when frequencies are perfectly in sync (peaks align with peaks, valleys with valleys), they amplify; when frequencies are perfectly out of sync (peaks align with valleys), they cancel each other out (this is how noise-cancelling headphones work). Constructive interference is consonance: the synced patterns amplify one another; destructive interference is dissonance: the out-of-sync patterns mask each other. Both types of interference are varieties of resonance, a rational or irrational phase relationship among frequencies. Rational phase relationships are ones where the shorter phases or periods of higher frequencies are evenly divisible into the longer phases/periods; irrational phase relationships happen when the shorter phases can’t be evenly divided into the longer wavelengths. Abstracting from waveforms to philosophical analysis, Barad often uses resonance as a metaphor to translate wave behavior into materialist philosophical methods. However, even though most of Barad’s examples throughout Meeting the Universe Halfway are visual, she’s describing what scientists call acoustic relationships.

For example, Barad argues that “diffractively read[ing]” philosophical texts means processing “insights through one another for the patterns of resonance and dissonance they coproduce” (195; emphasis mine). Similarly, she advises her readers to tune into the “dissonant and harmonic resonances” (43) that emerge when they try “diffract­ing these insights [from an early chapter in her book] through the grating of the entire set of book chapters” (30). As patterns of higher and lower intensity that interact via ir/rational phase relationships, diffraction patterns are a type of acoustic resonance. Appealing to acoustics against representationalism, Barad practices a version of the audiovisual litany. And she’s not the only new materialist to do so—Jane Bennett’s concept of vibration and Elizabeth Grosz’s notion of “music” also ontologize a similar idea of resonance and claim it overcomes the distancing and skeptical melancholy produced by traditional methods of philosophical abstraction.

“Painter” by Flickr user Flood G., CC BY-NC-ND 2.0

There are also instances of the audiovisual litany in phenomenology. For example, Alia Al-Saji develops in the article A Phenomenology of Critical-Ethical Vision” a notion of “critical-ethical vision” against “objectifying vision,” and, via a reading of Merleau-Ponty, grounds the former, better notion of sight (and thought) in his analogy between painting and listening. According to Al-Saji, “objectifying vision” is the model of sight that has dominated much of European philosophy since the Enlightenment. “Objectifying vision” takes seeing as “merely a matter of re-cognition, the objectivation and categorization of the visible into clear-cut solids, into objects with definite contours and uses” (375). Because it operates in a two-dimensional metaphysical plane it can only see in binary terms (same/other): “Objectifying vision is thus reductive of lateral difference as relationality” (390). According to Al-Saji, Merleau-Ponty’s theory of painting develops an account of vision that is “non-objectifying” (388) and relational. We cannot see paintings as already-constituted objects, but as visualizations, the emergence of vision from a particular set of conditions. Such seeing allows us “to glimpse the intercorporeal, social and historical institution of my own vision, to remember my affective dependence on an alterity whose invisibility my [objectifying] vision takes for granted” (Al-Saji 391). Al-Saji turns to sonic language to describe such relational seeing: “more than mere looking, this is seeing that listens (391; emphasis mine).

This Merleau-Pontian vision not only departs from traditional European Enlightenment accounts of vision, it gestures toward traditional European accounts of hearing. Similarly, Fred Evans, in The Multivoiced Body uses voice as a metaphor for the Deleuzo-Guattarian metaphysics that he calls “chaosmos” or “composed chaos” (86); he then contrasts chaosmos to “homophonic” (67) Enlightenment metaphysics. According to Evans, if “‘voices,’ not individuals, the State, or social structures, are the primary participants in society” (256), then  “reciprocity” and “mutual intersection” (59) appear as fundamental social values (rather than, say, autonomy). This analysis exemplifies what is at the crux of the audiovisual litany: voices put us back in touch with what European modernity and postmodernity abstract away.

“Image from page 401 of “Surgical anatomy : a treatise on human anatomy in its application to the practice of medicine and surgery” (1901)” by Flickr user Internet Archive Book Images

The audiovisual litany is hot right now: as I’ve just shown, it’s commonly marshaled in the various attempts to move past or go beyond stale old Western modernist and postmodernist philosophy, with all their anthropocentrism and correlationism and classical liberalism. To play with Marie Thompson’s words a bit, just as there is an “ontological turn in sound studies,” there’s a “sound turn in ontological studies.” But why? What does sound DO for this specific philosophical project? And what kind of sound are we appealing to anyway?

The audiovisual litany naturalizes hegemonic concepts of sound and sight and uses these as metaphors for philosophical positions. This lets philosophical assumptions pass by unnoticed because they appear as “natural” features of various sensory modalities. Though he doesn’t use this term, Sterne’s analysis implies that the audiovisual litany is what Mary Beth Mader calls a sleight. “Sleights” are, according to Mader in Sleights of Reason,“conceptual collaborations that function as switches or ruses important to the continuing centrality and pertinence of the social category of a political system like “sex” (3). Sleights, in other words, are conceptual slippages that render underlying hegemonic structures like cisheteropatriarchy coherent. More specifically, sleights are “conceptual jacquemarts” (Mader 5). Jacquemarts are effectively the Milli Vanilli of clocks: sounds appear to come from one overtly visible, aesthetically appealing source action (figures ringing a bell) but they actually come from a hidden, less aesthetically appealing source action (hammers hitting gongs). The clock is constructed in a way to “misdirect or misindicate” (Mader 8) both who is making the sound and how they are making it. A sound exists, but its source is misattributed. This is exactly what happens in the uses of the audiovisual litany I discuss above: philosophers misdirect or misindicate the source of the distinction they use the audiovisual litany to mark. The litany doesn’t track the difference between sensory media or perceptual faculties, but between two different methods of abstraction.

Screenshot from Milli Vanilli’s video “Don’t Forget My Number”

This slippage between perceptual medium and philosophical method facilitates the continued centrality of Philosophy-capital-P: philosophy appears to reform its methods and fix its problems, while actually re-investing in its traditional boundaries, values, and commitments. For example, both new materialists and sound studies scholars have been widely critiqued for actively ignoring work on sound and resonance in black studies (e.g., by Zakiyyah Jackson, Diana Leong, Maire Thompson). As Zakiyyah Jackson argues in Outer Worlds: The Persistence of Race in Movement “Beyond the Human,” new materialism’s “gestures toward the ‘post’ or the ‘beyond’ effectively ignore praxes of humanity and critiques produced by black people” (215), and in so doing ironically reinstitute the very thing new materialism claims to supercede. Stratifying theory into “new” and not-new, new materialist “appeals to move ‘beyond’…may actually reintroduce the Eurocentric transcendentalism this movement purports to disrupt” (Jackson 215) by exclusively focusing on European philosophers’ accounts of sound and sight. Similarly, these uses of the litany often appeal only to other philosophers’ accounts of sound or music, not actual works or practices or performances. They don’t even attend to the sonic dimensions of literary texts, a method that scholars such as Jennifer Lynn Stoever and Alexander Weheliye develop in their work. Philosophers use the audiovisual litany to disguise philosophy’s ugly politics—white supremacy and Eurocentrism—behind an outwardly pleasing conceptual gesture: the turn from sight or text to sound. With this variation of the audiovisual litany, Philosophy appears to cross beyond its conventional boundaries while actually doubling-down on them.

Featured image: “soundwaves” from Flickr user istolethetv

Robin James is Associate Professor of Philosophy at UNC Charlotte. She is author of two books: Resilience & Melancholy: pop music, feminism, and neoliberalism, published by Zer0 books last year, and The Conjectural Body: gender, race and the philosophy of music was published by Lexington Books in 2010. Her work on feminism, race, contemporary continental philosophy, pop music, and sound studies has appeared in The New Inquiry, Hypatia, differences, Contemporary Aesthetics, and the Journal of Popular Music Studies. She is also a digital sound artist and musician. She blogs at its-her-factory.com and is a regular contributor to Cyborgology.

tape reelREWIND! . . .If you liked this post, you may also dig:
From “listening” to “filling in”: where “La Soeur Écoute” Teaches Us to Listen-Emmanuelle Sontag
The Listening Body in Death –Denise Gill
Re-orienting Sound Studies’ Aural Fixation: Christine Sun Kim’s “Subjective Loudness”-Sarah Mayberry Scott