Tag Archive | MP3: The Meaning of a Format

“Keep it Weird”: Listening with Jonathan Sterne (1970-2025)

Dr. Jonathan Sterne passed away earlier this year. He was, in many ways, a model scholar and colleague.

The intellectual ferment of the field now called “sound studies” is often traced to the sonic ecologists of the 1960s, but the theoretical energy of the early 2000s, generated by figures such as Ana Maria Ochoa, Alexander Weheliye, Emily Thompson, Trevor Pinch (1952-2021), and of course Jonathan Sterne, was necessary for the field to gain interdisciplinary traction in the twenty-first century. Sterne’s The Audible Past: Cultural Origins of Sound Reproduction (Duke University Press: 2003) was perhaps the single-most important book in this regard.

Trained in communications, and working in departments of communication, first at Pitt and later McGill, Sterne oriented his work toward media studies, and indeed, The Audible Past is principally about mediation. It poses questions about the role of sound in the history of mediation that earlier generations of sound studies had tended to elide, especially regarding the contingent and often cultural role of the human ear in reception.  These questions opened the door for anthropologists, historians, communications scholars and ethnomusicologists in particular to think and even identify with sound studies, and many of us who were trained in the 2000s did so enthusiastically, with Sterne’s writing a lodestar.

The enduring terms and frameworks that came from The Audible Past alone are remarkable: the audiovisual litany, a critique of the chestnut that hearing and vision are ideological binaries, for example, is practically axiomatic now in sound studies. And its clever expression of the audiovisual binary’s religious undertones (see also “The Theology of Sound: A Critique of Orality” [2011], one of many worthwhile Sternian deep cuts) further thickens the plot. Consider as well the concept of “audile techniques,” or vernacular methods of audition that emerge in response to new sound reproduction technologies, which has been used to frame countless projects in sonic histories of science, medicine, business, and technology. To revisit The Audible Past now is to witness a thinker who anticipated the central questions of an emergent field at the moment of its rekindling, with prescience and depth.

Sterne’s focus on media continued, although his second monograph—and Sounding Out!’s first book review!– MP3: The Meaning of a Format (Duke University Press: 2012) saw a move toward Marxist economic analysis, the kind of methodological shift that he would pull off time and again. The book is almost certainly the most complete treatment of what was (and to some degree remains) the world’s most important audio reproduction format, and once again he introduced a concept, that of “format theory,” that is widely and actively cited (and reviewed by Pitchfork!). The book is also funny! Sterne’s revelation of the Napster cat head logo at the end of a chapter about the role of cat heads in auditory lab experiments is the sort of superb comic timing which, to put it lightly, one doesn’t find much in academic writing.

Sterne was not limited to media-focused work, however. He was responsive to current events: his 2012 “Quebec’s #casseroles: on participation, percussion and protest,” for instance, was an on-the-spot reading of the sonic tactics of local student tuition strikes. He worked ethnographically: a 1997 article, published in Ethnomusicology and called “Sounds Like the Mall of America: Programmed Music and the Architectonics of Commercial Space,” is an immersive study of music in retail space. He was an excellent editor: his 2012 The Sound Studies Reader (Routledge) is superbly curated, remaining a cornerstone assignment. More recently, he turned to disability studies, publishing Diminished Faculties: A Political Phenomenology of Impairment (Duke University Press) in 2022. The book is philosophical as well as reflexive and personal, showcasing yet more of his range. The way that Sterne allowed curiosity to lead his research in many directions suggests a scholar who was in the business for the right reasons.

If you want to hear JS talk about Diminished Faculties or just hear him talk, you can treat yourself here.

Yet still this breadth and influence pales compared to what he arguably did best and most enthusiastically – mentoring students. He did not advise me formally, but when I was a graduate student he was friendly and accessible, which since his admission to hospice I have learned he was for many other people as well. Like the enduringly resonant concepts in The Audible Past and other books, bits of his advice still ring in my ears, and I pass those tidbits on to other students now. His letters of recommendation for his own students were inspired, indeed among the most thoughtful I’ve ever read. He respected his students enormously, and on their behalf wrote long, detailed letters in which they were cast as mature thinkers, and their scholarship as a serious project.

There are many scholars whom we might memorialize for their published contributions, but we should reserve a higher space for those whose mentorship commitments were as deep as Sterne’s. For all of his critical insights, he was motivated in the end not by status but by community, which the outpouring of sadness at his passing reveals above all. Farewell, then, to an architect of sound studies who, safe to say, was also widely loved.

***

JS was a HUGE day one supporter of Sounding Out! His belief in us helped the blog grow in innumerable, weird, and wonderful ways. JS gave us some crucial shine way back in 2012, when he generously published his post on the Casseroles Protests in Montreal before expanding it for print. JS really, really got what we were up to with SO! and we deeply appreciated his encouragement to “keep it weird” when it came to this thing called sound studies. Always, JS, always! and thank you! –JLS, SO! Ed

PS: We encourage you to leave “Sterne stories” and other memories of Jonathan in the comments to this post.

Benjamin Tausig is associate professor of music at SUNY-Stony Brook University, and author of Bangkok Is Ringing: Sound, Protest, and Constraint (Oxford, 2019) and Bangkok After Dark: Maurice Rocco, Transnational Nightlife, and the Making of Cold War Intimacies (Duke, 2025).

REWIND!…If you liked this post, you may also dig: 

Sterneworks!

Quebec’s #casseroles: on participation, percussion and protest–Jonathan Sterne

Sounding Out! Podcast #27: Interview with Jonathan Sterne

This is What It Sounds Like . . . . . . . . On Prince (1958-2016) and Interpretive Freedom–Benjamin Tausig

SO! Amplifies: The Electric Golem (Trevor Pinch and James Spitznagel)Qiushi Xu 

SO! Reads: Jonathan Sterne’s MP3: The Meaning of a Format-Aaron Trammell

Stir It Up: From Polyphony to Multivocality in A Brief History of Seven Killings

For many, the audiobook is a source of pleasure and distraction, a way to get through the To Read Pile while washing dishes or commuting. Audiobooks have a stealthy way of rendering invisible the labor of creating this aural experience: the writer, the narrator, the producer, the technology…here at Sounding Out! we want to render that labor visible and, moreover, think of the sound as a focus of analysis in itself.

Over the next few weeks, we will host several authors who will make all of us think differently about the audiobook selections on our phone, in our car, and in our radios. Last week we listened to a book that listens to Dublin, in a post by Shantam Goyal. Today we have seven narrators telling us the story of an assassination attempt on Bob Marley. What will the audiobook whisper to us that the book cannot speak?

—Managing Editor Liana Silva


Reviews of A Brief History of Seven Killings, Marlon James’ 686-page rendering of the echoes of an assassination attempt on Bob Marley, almost invariably invoke the concept of polyphony to name its adroit use of multiple narrators. In The New York Times, Zachary Lazar maintained that the “polyphony and scope” of the 2014 novel made it much more than a saga of drug and gang violence stretching from 1970s Kingston to 1990s New York. And the Booker Prize, which James was the first Jamaican to win, similarly praised it as a “rich, polyphonic study,” with chief judge Michael Wood calling attention to the impressive “range of voices and registers, running from the patois of the street posse to The Book of Revelation.” It was thus not only the sheer number of voices in a preliminary three-page “Cast of Characters” that critics so unanimously admired but also the variety and nuance evident within them. Norwegian publisher Mime Books even took these polyphonic features a step further by hiring not one but twelve translators in a casting process that auditioned prominent novelists, playwrights, and performers.

Cover of the book, under fair use

James recalls realizing early on that this novel would be one “driven only by voice” (687), which might make such enthusiastic responses to its plurality of perspectives seem unsurprising. But what happens when such polyphony leaves the page behind and actual material voices drive its delivery? If the audiobook is a format of the novel (and here I follow Jonathan Sterne’s definition of format in MP3: The Meaning of a Format as “a whole range of decisions that affect the look, feel, experience, and workings of a medium” [7]), what lessons can listeners learn that print cannot provide? As I argue, the 26-hour-long audiobook version of A Brief History, which Highbridge Audio produced with seven actors (Robertson Dean, Cherise Boothe, Dwight Bacquie, Ryan Anderson, Johnathan McClain, Robert Younis, Thom Rivera), allows us to engage with multivocality rather than polyphony, which is to say the multiple vocal performances of a single individual rather than the presence of many narrators within a print work. And just as this novel’s polyphonic structure destabilizes any attempt at a definitive account of the events it portrays, the multifaceted performances of its audio format work to untrain ears that have been conditioned to hear necessary ties between voices and bodies. 

Of course, this effect is not one that most listeners consciously seek, as reviews of the audiobook articulating various reasons for turning to this format as well as diverging responses to it readily attest. Gayle, on Audible, began with the print version: “but as soon as I got to the first chapter that was written in Jamaican patois I knew that I was not able to do that in my head and I was going to miss a lot.” Sound here conveys sense more swiftly than the page, the ear apparently better suited than the eye to encounter difference. (Woodsy, another reviewer, even felt emboldened to ventriloquize in text that sonically distinctive speech: “I found that listening to the Audible version was helpful. Now all me need do is stop thinking in Jamaican.”) Yet it was Andre who offered by far the most memorable characterization of the audiobook and its affordances. As he explained, in James’ novel “the language is a thick, tropical forest of words. Audiobook is the machete that slices through this forest of words so I can enjoy the treasures inside.” The violence of this metaphor matches that of the novel’s most disturbing scenes, yet what is most striking is the way it reiterates once more how reviewers found it easier to access the work aurally rather than visually.

“Studio Microphone” by Richard Feliciano, CC BY-NC-ND 2.0

These reviews, and other similarly favorable appraisals, rarely consider the audiobook on its own terms, insisting instead on comparisons with the text. Negative ones, however, often note distinctively sonic features, with some reviewers echoing one of the Booker judges—who reportedly consulted a Jamaican poet about the accuracy of James’ ear for dialogue—by questioning the veracity of the Jamaican accents in a novel that also features American, Colombian, and Cuban ones. Tending to readily identify themselves as Jamaican, these writers and listeners rarely acknowledge that at least some of the actors were born on the island when asserting that the accents are off. In any case, such efforts to link sound and authenticity, as Liana Silva has argued with respect to the audiobook, wrongly suggest that those who belong to a group must conform to a single sound. James, too, distrusts discourses of the authentic, as characters repeatedly cast suspicion and scorn on anyone uttering the phrase “real Jamaica.”

If the polyphony in James’ novel prevents any one perspective from becoming either representative or definitive, the audiobook pushes this process even further by demonstrating how a single performer’s voice can possess such range that it seems to contain multiple ones. Each performer is responsible for all the voices within the sections narrated by their primary characters, which means that the same character can occasionally be voiced by different actors. In one section, a performer does the voices of a tough-talking Chicago-born hitman and the jittery Colombians he speaks with in Miami; in others, that same performer is both a white Rolling Stone journalist from Minnesota who’s attuned to racial difference and the black Jamaicans he converses with in Kingston. Continuity or strict one-to-one correspondences between performer and character ultimately matter less than the displays of vocal difference that allow the audiobook to contest essentialized notions of voice.

As a result, the audiobook articulates just how constructed vocal divisions based on race, gender, and class are by having its performers constantly cross them. It amplifies the very arbitrariness of such divisions and thereby reveals how, if the page is the space of polyphony, then what the audiobook stages is multivocality. Although they might seem like synonyms, these two terms can actually help us appreciate crucial differences and, in doing so, highlight the specificity of the audio format. On the one hand, –phony or phōnē, as Shane Butler reminds us in The Ancient Phonograph, ambiguously refers to both voice and the human capacity for speech (36), whereas –vocality centers the voice. On the other, the shift from the Greek poly- to the Latin multi- signals a contrast in what gets counted: while polyphony names the quantity of perspectives contributing to a narrative (when introducing it in Problems of Dostoevsky’s Poetics, Mikhail Bakhtin emphasized that polyphony consisted of “a plurality of independent and unmerged voices and consciousnesses” [6]), multivocality instead specifies how the number of voices can exceed the number of performers. In this way, the concept of multivocality outlined here with respect to the audiobook resonates with its use in another context by Katherine Meizel, who mobilizes it with reference to singing and the borders of identity. In both cases, voice names a multiplicity of practices rather than an immutable or inevitable expression, which in turn aligns with Nina Sun Eidsheim’s argument in The Race of Sound about the voice being not singular but collective and not innate but cultural (9).

“Last Exit (Recording Studio)” by Flickr user Drew Ressler, CC BY-NC-ND 2.0

We can therefore say that where print-based polyphony works on the eye by placing various perspectives on a page without necessarily challenging visual perceptions of difference, multivocality in the audiobook can retrain an ear’s culturally ingrained ideas about voice. James himself has experience with these seemingly inescapable meanings assigned to vocal sounds. In a moving essay for The New York Times Magazine, he recounts how, even at the age of 28, “I was so convinced that my voice outed me as a fag that I had stopped speaking to people I didn’t know.” That was already long after high school, when, as he remembered in a New Yorker profile, he had begun “tape-recording his efforts to sound masculine, repeating words like ‘bredren’ and ‘boss.’” He was well aware of the links that listeners created between voice and identity and that could, as he suggests, prove risky in a place with overt homophobia like Jamaica. Writing, however, offered him a space to take on any voice and, at the same time, not be concerned with the sound of his own.

Yet if the page allowed James to effortlessly shift among narrative voices, the audiobook format exhibits voices that ostensibly shift without any effort. Perhaps the most compelling example emerges in the work of Cherise Boothe, whose performance of the novel’s sole female primary character presents the voices of other figures as well. Toward the end of the novel, this character, Dorcas Palmer, is a caretaker for a much older and wealthier white man with amnesia in New York. Boothe not only captures the changes as Palmer often eliminates her Jamaican accent and occasionally lets it loose but also registers the man’s moments of lucidity and confusion. Even if, as listeners, we understand that Boothe is the voice behind both of these characters, the two vocal performances are so distinct that they effectively erode the basis for any beliefs about how a certain body should sound.

Adopting different voices is certainly not unique to the audiobook, but it does provide one of the few forms of extended exposure to this practice. Yet it is worth noting that A Brief History markedly differs from the model of a more extensive cast like the one comprised of 166 voices that recorded George Saunders’ Lincoln in the Bardo. By assigning a performer to every character, such productions ultimately emphasize vocal uniqueness in roughly the same way that Adriana Cavarero conceives it, namely as an index of individuality. But there the voice remains something singular or somehow essential, for there is no opportunity to perform the plurality that appears across A Brief History. At the same time, the use of seven actors also offers a contrast with the opposite extreme: a single performer responsible for all the roles, which demonstrates multivocality but does so on such a small scale that it feels exceptional instead of ordinary. The middle ground, which is to say the model found in A Brief History, allows us to hear multiple instances of how the voice is entrained rather than essential, possibility rather than inevitability.

Screenshot from Youtube video “Marlon James: A Brief History of Seven Killings” by Chicago Humanities Festival

When briefly addressing audiobooks in an interview, James remarked that this format possesses a distinct advantage: “even something that is not necessarily plain can be translated because of tone and symbol and voice.” In other words, a voice can register its changing surroundings; conveying these subtle transformations on the page, however, is often far more difficult. This shortcoming is one that Edward Kamau Brathwaithe once memorably described when explaining why he insisted on using a tape recorder in a lecture on language in the Caribbean: “I want you to get the sound of it, rather than the sight of it.” The idiomatic familiarity of the first half, which clashes so sharply with the awkwardness of the second, suggests that the multivocality of an audiobook can open ears by accentuating how the voice is not fixed but in constant formation.

Featured Image: “Audiobook” by Flickr user ActuaLitte, CC-BY-SA-2.0

Sam Carter is a PhD Candidate in Romance Studies at Cornell University. His work on literature and sound in the Southern Cone has appeared in Latin American Textualities: History, Materiality, and Digital Media and is forthcoming in the Revista Hispánica Moderna.

tape reelREWIND! . . .If you liked this post, you may also dig:

SO! Reads: Jonathan Sterne’s MP3: The Meaning of a Format–Aaron Trammell

Radio de Acción: Violent Circuits, Contentious Voices: Caribbean Radio Histories–Alejandra Bronfman

Scenes of Subjection: Women’s Voices Narrating Black Death“–Julie Beth Napolin