Archive | Voice RSS for this section

Blank Space and “Asymmetries of Childhood Innocence”  

In 2015 a video of a child in an Internet café in the Philippines began to trend on social media sites. Titled, Kanta ng isang Anak para sa kanyang inang OFW “Blank Space (“Song of a child for her overseas foreign worker mother”), the video shows a girl singing via Skype to her mother who is working in an unnamed location, presumably outside of the Philippines. “Ma kakantahan ulit kita ha?” (I’ll sing for you again mom), she says, and starts singing Taylor Swift’s “Blank Space”. Her mother attentively watches and listens to her song, soon beginning to cry in longing for a daughter she has not seen in a long time. The girl’s attention is divided between the screen that shows the lyrics, the camera that films her singing, and her mother who quietly observes. This video has over 110,000 views and is one of many archived messages from a child singing or speaking to their mother who labours transnationally. Despite the videos’ jittery framing and low quality, the intended message of shared longing across cyber and transnational borders is clear.

The Spanish-American war (1899-1902) resulted in the relinquishment of the colony of the Philippines from Spain to the United States. This transfer of power instituted the imperial specter that continues to grip the archipelago. The many performances of American pop music on Youtube and on stages throughout the Philippines are what Christine Bacareza Balance calls the “musical aftermath of US imperial cultures” (2016). Having amassed over 97 million YouTube views in the Philippines, Taylor Swift’s overwhelming popularity is evidence of this continued imperial presence. In the video, the young Filipinx girl sings lyrics written by Swift: “I’m dying to see how this one ends. Grab your passport and my hand.” When sung by this child these lyrics take on different meaning than Swift likely intended. Perhaps she is anticipating an end to the necessity of separation between mother and daughter. 

Using song, the video provides evidence of what Hannah Dyer calls the ‘asymmetries of childhood innocence’ (2019), reminding its audience of the ways transnational labour and global capital impact children’s experiences of kinship and development. Dyer suggests that some children are withheld the protective hold of childhood innocence. She writes:

“Childhood innocence is a seemingly natural condition but its rhetorical maneuvers are permeated by its elisions and attempted disavowals along the lines of race, class, gender and sexuality. That is, despite the familiar rhetorical insistence that children are the future, some children are withheld the benefits of being assumed inculpable (2)” 

Ascriptions of childhood innocence thus require a child to replicate social norms including the production of the nuclear family. In the Philippines, where the liberalization of international trade and high levels of unemployment have disproportionately impacted the labour migration of women, structures of the nuclear family are being re-organized (Parreñas 2005; Tungohan 2013). Women who work outside of the Philippines and away from their families are paradoxically celebrated for their “sacrifice” while also subject to disapproval over their absence (Tungohan 2013). When mothers leave the Philippines, the care-arrangements for children are shifted. There is a growing recognition of the changing nature of motherhood within transnational contexts and the concomitant emotional consequences of negotiating “long distance intimacy” (Parreñas 2005). The demands for transnational labour reconfigure Filipinx family formations and necessitate fraught intimacies between parent and child across borders. Cyber technologies like cell phones and the Internet initiate creative opportunities for children to be “virtually present” in the lives of their mothers and vice versa.  

“Parenthood” by Flickr user Saúl Alejandro Preciado Farías, CC BY 2.0

Drawing from Dyer, we might think of children who live without the physical presence of their mothers as “queer” to normative theories of childhood development that affirm overwrought expectations of maternal presence. She suggests that discourses of childhood innocence intend to subjugate the queerness of childhood and that these elisions hold bio-political significance. Faced with social inequities, Dyer emphasizes the importance of a child’s symbolic expression. She argues that children express their psychic and social conflicts aesthetically. A child’s imagination elaborates resistances to the enclosure of childhood innocence as a barometer of value. In this way, this article suggests a child’s singing and dancing are aesthetic expressions that take notice of the entangled traces of colonialism and nation, while resisting hierarchal structures that deem some childhoods more valuable than others.

The child’s sonic performance in the YouTube video is a queer offering that creatively procures transnational connection. Her singing registers a queer frequency that destabilizes normative theories of child development that assume a mother’s physical presence as necessary to developmental success. The girl’s performance suggests that psychic and political reparations can occur in the sounds the child makes. The tactile, spatial and physical qualities of her voice forge a new relation to her mother. Her voice is affecting, seemingly moving her mother to tears and rousing the onlookers at the Internet café to reorganize their bodies and sing along. In this video, we are invited to witness a child whose world has been altered by globalization and the continued geo-political violence’s enacted by the American empire. Given these circumstances, her “creative re-interpretation[s] of kinship” serves as a reminder that the affective fortitude of her voice tests physical and emotional borders (Dyer 2019). The restraint of normative conceptions of family is ruptured when the child remakes her relation to her mother in ways that stir joy, collectivity, and pleasure. 

Screenshot from “

By observing and listening to the child’s song more closely, we can listen for its potential to re-sound and re-imagine the parent-child relationship across borders. The sounds of “OFW Blank Space” linger after the clip has ended. By listening for what is in excess of the video’s content, we can consider the affective registers that enunciate alternative understandings of migration, family and belonging. There is a humming that is ubiquitous in the video. Perhaps, it is the sound of the electric fans that run to combat the tropical heat of the Philippines. Maybe it is the collective buzzing from the computers that have been set up to provide the Internet to its cybercafé patrons. The acoustics of the space are at once mundane and haphazard, and at the same time, cogent indicators of the geopolitical truths echoing throughout the scene. With limited access to Internet in the home, the cybercafé has been a site that children frequent to communicate with family working in another country. The convergence of sound, technology and diasporic subjectivity becomes audible when the practice of listening is attuned to these methods of transnational connection. 

While listening to the pedagogical potential of the cybercafé more broadly, a focus on the vocal performance of the child reveals my investment in what the sound of her voice tells us. The video starts with greetings spoken in Tagalog, the primary language of the Philippines. When the backing track begins, the child makes a seamless transition into singing in English. In her vocal performance of the lyrics, her Filipino accent is almost undetectable. She sings with a dulcet tone that is clear and appealing. Her voice sounds well-trained and confident. If not for the video, one might believe the child to be a professional American performer. In this scene, it is her voice that is marked and constituted by a narrative of American imperial conquest and Filipino assimilation. But in a creative adaptation of American cultural production, the child re-writes this racialized script and uses American pop songs as a mechanism of care for both herself and her mother.

“Mother and daughter at home” by Flickr user Dejan Krsmanovic, CC-BY-2.0

The economic instability in the Philippines has created a state instituted transnational workforce. Women have been disproportionately affected by the demand for work in care industries such as nursing, childcare and care for the elderly (Francisco-Menchavez 2018). These gendered and racialized structures of employment privilege the presence of Filipinx women in families other than their own. The child is withheld a future that assures her the presence of her mother and their physical proximity is denied as a result of the demand for labour and capital exchange between nation-states. However, despite these circumstances, the child uses her voice to summon a beautiful intimacy, one that does not disavow the imperial history that marks its possibility, but instead uses loss as a resource to creatively mourn their separation. For the child, the act of singing is a replacement for her lost object, her mother. In the video we witness a child who is full of joy and whose strength of voice quells, if not, temporarily, whatever longing for her mother she might have. Relatedly, the child is also perceptive of her mother’s needs and uses music as a method of offering her care. Her performance creatively re-routes the presumed directionalities of care (from mother to child) which globalization has fundamentally altered.  

Featured image: “Children” by Flickr user Clive Varley, CC-BY-2.0

Casey Mecija is an accomplished multi-disciplinary artist, primarily working in the fields of music and film. She played in Ohbijou, the Canadian orchestral pop band, and released her first solo album, entitled Psychic Materials, in 2016. Casey is also an award winning filmmaker whose work has screened internationally. She is completing a PhD at The University of Toronto, where she researches sound, performance studies and Filipinx Studies as they relate to queer diaspora.

tape-reelREWIND! . . .If you liked this post, you may also dig:

Relocating Soundscapes of Schooling: Learning to Listen to Children’s Lifeworlds-Cassie Brownell

Top 40 Democracy: Taylor Swift’s Election Day Victory-Eric Weisbard

Hearing “Media-Capitalism” in Egypt–Ziad Fahmy

Contemporary Television’s Construction of Sonic New Jersey

At the start of The Soprano’s sixth season, in the wake of being accidentally shot by his dementia-suffering uncle, New Jersey mob boss Tony Soprano enters a coma-induced dreamstate in which he reimagines his life as a successful precision optics salesman. A show interested in Freudian psychology, The Sopranos is full of dream sequences, but this one stands out as the longest and most frustrating, as first-time viewers must watch as the hour-long plotline follows Tony’s convoluted dream while his family waits in agony at his hospital bedside. Within the dream sequence, Tony awakens to find himself at a sales conference, where he has mistakenly taken someone else’s briefcase, and he attempts to find its rightful owner. Despite the frustrating circumstances, Tony has lost his tough, mob boss demeanor: instead, he’s professional, polite, and patient, qualities that the former Tony rarely exhibits throughout the show’s six seasons.

Screenshot from YouTube video “The Sopranos – Join The Club /When It’s Cold I’d Like to Die 720p”

But what immediately strikes me about this dream sequence is the sudden loss of Tony’s thick Jersey accent. Gone is the fast-paced speech filled with dropped ‘r’s’ and long ‘a’s’ and ‘o’s’. Instead, Tony’s way of speaking is relatively accentless, aligning with what is considered a neutral North American accent. By dreaming of himself as an upwardly mobile, white-collar worker, Tony has not only imagined a new career, he’s also imagined a new way of speaking, one that lacks any clear markers of region, class, or ethnicity. This transformation ultimately tethers Tony’s New Jersey accent to his identity as an Italian American mobster with working-class roots, and it reinforces the idea that speech is indicative of one’s class. The dream sequence is one instance in which television constructs the New Jersey accent as signifying a certain brand of whiteness—not quite white trash, but perhaps one step above it, a form of whiteness lacking sophistication, riddled with ignorance and superficial wealth.

Here I examine contemporary television’s construction and performance of the Jersey accent in order to understand what it confers about class status and ethnic identity. As others have argued, New Jersey dialects are actually quite eclectic, though contemporary television tends to represent the state’s accent as defined by long vowels and quick, poorly articulated speech:

I’m interested in how television shows such as The Sopranos, Jersey Shore, and Real Housewives of New Jersey, among others, construct the Jersey accent as a homogenous indicator of ethnicity and social class. Within these predominantly white shows, the Jersey accent is associated with whiteness, situating characters at a distance from dialects susceptible to scrutiny and violence, such as nonwhite immigrant accents or who embody what Nina Sun Eidsheim calls sonic blackness, but it also signifies that these characters do not come from respectable backgrounds or generational wealth.

Screenshot from Season 1 Episode 1 of MTV’s Jersey Shore

New Jersey has served as a popular setting for contemporary television, and reality television in particular has capitalized on the state’s materialistic and ostentatious reputation. As Alisha Gaines argues, reality television has a “full-blown crush” on the state, as its geography serves as “a stage for class and social passing, a late capital playground of ethnic representation.” MTV’s Jersey Shore is the most well-known reality TV show to emerge out of New Jersey. Although only a few of the show’s main characters originate from the state, they all embrace a stereotypical Jersey aesthetic: the big hair, the tanned bodies, and yes, the accent. Like The Sopranos, Jersey Shore’s Italian American characters claim to have a complicated relationship to whiteness. The characters attempt to reclaim the derogatory term “guido” (or “guidette,” in the case of the show’s female characters) and admit to not fully identifying as white: “I’m not white,” the show’s Nicole Polizzi (Snooki) says at one point. “I’m tan. That’s what I am.”

In Episode 7 of the show’s first season, Snooki meets Keith, a man she’s surprised to have hit it off with not only because he’s not Italian, but also because “he talks like a cowboy.” Yet Keith does not have a Southern accent, as one might expect, but instead speaks in a standard North American accent. Snooki’s assertion that he speaks “like a cowboy,” then, points to not only how accents are perceived (in the eye of the beholder), it also centers and normalizes the characters’ Jersey accents and calls into question how American television audiences have been trained to experience and think about accented subjects.

Predictably, within New Jersey shows, accents and “improper” ways of speaking often become the butt of the joke. For instance, in The Sopranos episode “Cold Stones,” Tony gifts his wife Carmela a Louis Vuitton wallet containing thirty grand in cash. “This is the real Louis Vee-toon,” he assures her, butchering the pronunciation of the French designer’s name. Tony may be able to afford the “real thing” (and then some), but his inability to sonically perform it gives him away: this is not a lifestyle he inherited or was born into; it does not come natural to him.

In a similar vein, Bravo produces blooper reels of the New Jersey Real Housewives mispronouncing common words (skooers instead of skewers, lopter instead of lobster, bought instead of brought, for instance).

Here, these characters’ mispronunciations are intended to indicate their ignorance and lack of education, echoing the show’s hints that their female characters have mob affiliations and primarily live off their husbands’ money. Within the Real Housewives of New Jersey and other Jersey-based shows, commenting on the state’s accent often functions as a way of implying that their characters are not to be taken too seriously, thereby influencing how audiences perceive this way of speaking beyond these shows (see, for instance, this Reddit thread).

As it pertains to whiteness and class, the privilege that the Jersey accent does or does not confer is difficult to unpack. Scholars such as Jennifer Stoever and Shilpa Davé have shown how nonwhite accents are subject to surveillance and violence in ways that white accents are not. Similarly, Christie Zwahlen argues in her Sounding Out! post “Look Who’s Talking, Y’all” that “In contradistinction to ‘foreign’ sounding accents, Southern accents are a classic symbol of American cultural belonging, like apple pie for the ears.” But what version of whiteness, and more specifically, Americannes, does the Jersey accent connote? While within the shows examined here, the accent is spoken primarily by characters belonging to immigrant groups that have been encompassed within the category of whiteness (often Italian and Jewish Americans), the legitimacy of these characters’ social class and education level is often under scrutiny. These characters’ interest in flashy outfits, gold jewelry, and French Chateau style decor (you know it when you see it) is represented as trashy and artificial, a performance of wealth rather than the actual embodiment of it.

In many ways, the “improperness” of the Jersey accent becomes another way of indicating that these characters are not highly educated and therefore their words, thoughts, and even their wealth, are deserving of suspicion. And a show like The Sopranos, in which most characters have organized crime affiliations, confirms that this suspicion is well-warranted. Indeed, this is not the whiteness or social status assumed to accompany standard English or American accents.

“New Jersey” by Flickr user Doug Kerr, CC BY-SA 2.0

Unsurprisingly, these shows’ centering of middle-class whiteness and its sonic registers ignores the disparity that exists across New Jersey’s geographies. While the state is one of the nation’s wealthiest, it’s also home to poorer cities of color that continue to suffer from the effects of suburbanization and neoliberal urban development. For example, scholars such as Kevin Mumford and Ana Y. Ramos-Zayas show how a city like Newark (a frequent setting on The Sopranos) has been heavily shaped by inequitable and volatile racial politics. And yet, the shows examined here eschew these socioeconomic and racial differences, erasing New Jersey’s communities of color from the state’s cultural discourses.

In an episode of HBO’s Boardwalk Empire, set in Atlantic City during Prohibition, Irish immigrant Margaret Schroeder expresses her fear that her Irish accent makes her “sound like an immigrant,” to which city treasurer Nucky Thompson responds, “But we’re all immigrants, are we not?” While his response echoes the assimilationist myth of the U.S.-as-melting-pot, it hits on something precise about New Jersey: as the state with the third-largest immigrant population, the homogeneity of the region’s accent is largely a construct. While contemporary television presents audiences with an all-encompassing Jersey accent, in actuality, the state’s diversity makes it nearly impossible to pin down exactly what New Jersey “sounds like.” Examining New Jersey’s representations in popular television reveals how the accent has become one of the state’s most prominent and recognizable features, and shows how these representations have the potential to reductively categorize an entire population.

Featured image: “Memorial Day Weekend” by Flickr user SurFeRGiRL30, CC-BY-2.0

Shannon Mooney is a PhD student in English and American Studies at the University of Massachusetts Amherst. She received her M.A. in English from the University of Connecticut in 2018. Shannon studies contemporary multi-ethnic U.S. literature, television, and film, with a focus on cultural geography and critical race theory. Her work examines how multi-ethnic writers and artists from New Jersey engage with the state’s natural and industrial landscapes to make sense of their positions as political and historical subjects. Shannon is also the Creative Director of Paperbark Literary Magazine, a publication rooted in sustainability and environmental justice.

tape reelREWIND! . . .If you liked this post, you may also dig:

Teach Me How to Dougie Like A Mediocre White Man-Justin Burton

Speaking American-Leslie McMurtry

Vocal Gender and the Gendered Soundscape: At the Intersection of Gender Studies and Sound Studies — Christine Ehrick

Stir It Up: From Polyphony to Multivocality in A Brief History of Seven Killings

For many, the audiobook is a source of pleasure and distraction, a way to get through the To Read Pile while washing dishes or commuting. Audiobooks have a stealthy way of rendering invisible the labor of creating this aural experience: the writer, the narrator, the producer, the technology…here at Sounding Out! we want to render that labor visible and, moreover, think of the sound as a focus of analysis in itself.

Over the next few weeks, we will host several authors who will make all of us think differently about the audiobook selections on our phone, in our car, and in our radios. Last week we listened to a book that listens to Dublin, in a post by Shantam Goyal. Today we have seven narrators telling us the story of an assassination attempt on Bob Marley. What will the audiobook whisper to us that the book cannot speak?

—Managing Editor Liana Silva


Reviews of A Brief History of Seven Killings, Marlon James’ 686-page rendering of the echoes of an assassination attempt on Bob Marley, almost invariably invoke the concept of polyphony to name its adroit use of multiple narrators. In The New York Times, Zachary Lazar maintained that the “polyphony and scope” of the 2014 novel made it much more than a saga of drug and gang violence stretching from 1970s Kingston to 1990s New York. And the Booker Prize, which James was the first Jamaican to win, similarly praised it as a “rich, polyphonic study,” with chief judge Michael Wood calling attention to the impressive “range of voices and registers, running from the patois of the street posse to The Book of Revelation.” It was thus not only the sheer number of voices in a preliminary three-page “Cast of Characters” that critics so unanimously admired but also the variety and nuance evident within them. Norwegian publisher Mime Books even took these polyphonic features a step further by hiring not one but twelve translators in a casting process that auditioned prominent novelists, playwrights, and performers.

Cover of the book, under fair use

James recalls realizing early on that this novel would be one “driven only by voice” (687), which might make such enthusiastic responses to its plurality of perspectives seem unsurprising. But what happens when such polyphony leaves the page behind and actual material voices drive its delivery? If the audiobook is a format of the novel (and here I follow Jonathan Sterne’s definition of format in MP3: The Meaning of a Format as “a whole range of decisions that affect the look, feel, experience, and workings of a medium” [7]), what lessons can listeners learn that print cannot provide? As I argue, the 26-hour-long audiobook version of A Brief History, which Highbridge Audio produced with seven actors (Robertson Dean, Cherise Boothe, Dwight Bacquie, Ryan Anderson, Johnathan McClain, Robert Younis, Thom Rivera), allows us to engage with multivocality rather than polyphony, which is to say the multiple vocal performances of a single individual rather than the presence of many narrators within a print work. And just as this novel’s polyphonic structure destabilizes any attempt at a definitive account of the events it portrays, the multifaceted performances of its audio format work to untrain ears that have been conditioned to hear necessary ties between voices and bodies. 

Of course, this effect is not one that most listeners consciously seek, as reviews of the audiobook articulating various reasons for turning to this format as well as diverging responses to it readily attest. Gayle, on Audible, began with the print version: “but as soon as I got to the first chapter that was written in Jamaican patois I knew that I was not able to do that in my head and I was going to miss a lot.” Sound here conveys sense more swiftly than the page, the ear apparently better suited than the eye to encounter difference. (Woodsy, another reviewer, even felt emboldened to ventriloquize in text that sonically distinctive speech: “I found that listening to the Audible version was helpful. Now all me need do is stop thinking in Jamaican.”) Yet it was Andre who offered by far the most memorable characterization of the audiobook and its affordances. As he explained, in James’ novel “the language is a thick, tropical forest of words. Audiobook is the machete that slices through this forest of words so I can enjoy the treasures inside.” The violence of this metaphor matches that of the novel’s most disturbing scenes, yet what is most striking is the way it reiterates once more how reviewers found it easier to access the work aurally rather than visually.

“Studio Microphone” by Richard Feliciano, CC BY-NC-ND 2.0

These reviews, and other similarly favorable appraisals, rarely consider the audiobook on its own terms, insisting instead on comparisons with the text. Negative ones, however, often note distinctively sonic features, with some reviewers echoing one of the Booker judges—who reportedly consulted a Jamaican poet about the accuracy of James’ ear for dialogue—by questioning the veracity of the Jamaican accents in a novel that also features American, Colombian, and Cuban ones. Tending to readily identify themselves as Jamaican, these writers and listeners rarely acknowledge that at least some of the actors were born on the island when asserting that the accents are off. In any case, such efforts to link sound and authenticity, as Liana Silva has argued with respect to the audiobook, wrongly suggest that those who belong to a group must conform to a single sound. James, too, distrusts discourses of the authentic, as characters repeatedly cast suspicion and scorn on anyone uttering the phrase “real Jamaica.”

If the polyphony in James’ novel prevents any one perspective from becoming either representative or definitive, the audiobook pushes this process even further by demonstrating how a single performer’s voice can possess such range that it seems to contain multiple ones. Each performer is responsible for all the voices within the sections narrated by their primary characters, which means that the same character can occasionally be voiced by different actors. In one section, a performer does the voices of a tough-talking Chicago-born hitman and the jittery Colombians he speaks with in Miami; in others, that same performer is both a white Rolling Stone journalist from Minnesota who’s attuned to racial difference and the black Jamaicans he converses with in Kingston. Continuity or strict one-to-one correspondences between performer and character ultimately matter less than the displays of vocal difference that allow the audiobook to contest essentialized notions of voice.

As a result, the audiobook articulates just how constructed vocal divisions based on race, gender, and class are by having its performers constantly cross them. It amplifies the very arbitrariness of such divisions and thereby reveals how, if the page is the space of polyphony, then what the audiobook stages is multivocality. Although they might seem like synonyms, these two terms can actually help us appreciate crucial differences and, in doing so, highlight the specificity of the audio format. On the one hand, –phony or phōnē, as Shane Butler reminds us in The Ancient Phonograph, ambiguously refers to both voice and the human capacity for speech (36), whereas –vocality centers the voice. On the other, the shift from the Greek poly- to the Latin multi- signals a contrast in what gets counted: while polyphony names the quantity of perspectives contributing to a narrative (when introducing it in Problems of Dostoevsky’s Poetics, Mikhail Bakhtin emphasized that polyphony consisted of “a plurality of independent and unmerged voices and consciousnesses” [6]), multivocality instead specifies how the number of voices can exceed the number of performers. In this way, the concept of multivocality outlined here with respect to the audiobook resonates with its use in another context by Katherine Meizel, who mobilizes it with reference to singing and the borders of identity. In both cases, voice names a multiplicity of practices rather than an immutable or inevitable expression, which in turn aligns with Nina Sun Eidsheim’s argument in The Race of Sound about the voice being not singular but collective and not innate but cultural (9).

“Last Exit (Recording Studio)” by Flickr user Drew Ressler, CC BY-NC-ND 2.0

We can therefore say that where print-based polyphony works on the eye by placing various perspectives on a page without necessarily challenging visual perceptions of difference, multivocality in the audiobook can retrain an ear’s culturally ingrained ideas about voice. James himself has experience with these seemingly inescapable meanings assigned to vocal sounds. In a moving essay for The New York Times Magazine, he recounts how, even at the age of 28, “I was so convinced that my voice outed me as a fag that I had stopped speaking to people I didn’t know.” That was already long after high school, when, as he remembered in a New Yorker profile, he had begun “tape-recording his efforts to sound masculine, repeating words like ‘bredren’ and ‘boss.’” He was well aware of the links that listeners created between voice and identity and that could, as he suggests, prove risky in a place with overt homophobia like Jamaica. Writing, however, offered him a space to take on any voice and, at the same time, not be concerned with the sound of his own.

Yet if the page allowed James to effortlessly shift among narrative voices, the audiobook format exhibits voices that ostensibly shift without any effort. Perhaps the most compelling example emerges in the work of Cherise Boothe, whose performance of the novel’s sole female primary character presents the voices of other figures as well. Toward the end of the novel, this character, Dorcas Palmer, is a caretaker for a much older and wealthier white man with amnesia in New York. Boothe not only captures the changes as Palmer often eliminates her Jamaican accent and occasionally lets it loose but also registers the man’s moments of lucidity and confusion. Even if, as listeners, we understand that Boothe is the voice behind both of these characters, the two vocal performances are so distinct that they effectively erode the basis for any beliefs about how a certain body should sound.

Adopting different voices is certainly not unique to the audiobook, but it does provide one of the few forms of extended exposure to this practice. Yet it is worth noting that A Brief History markedly differs from the model of a more extensive cast like the one comprised of 166 voices that recorded George Saunders’ Lincoln in the Bardo. By assigning a performer to every character, such productions ultimately emphasize vocal uniqueness in roughly the same way that Adriana Cavarero conceives it, namely as an index of individuality. But there the voice remains something singular or somehow essential, for there is no opportunity to perform the plurality that appears across A Brief History. At the same time, the use of seven actors also offers a contrast with the opposite extreme: a single performer responsible for all the roles, which demonstrates multivocality but does so on such a small scale that it feels exceptional instead of ordinary. The middle ground, which is to say the model found in A Brief History, allows us to hear multiple instances of how the voice is entrained rather than essential, possibility rather than inevitability.

Screenshot from Youtube video “Marlon James: A Brief History of Seven Killings” by Chicago Humanities Festival

When briefly addressing audiobooks in an interview, James remarked that this format possesses a distinct advantage: “even something that is not necessarily plain can be translated because of tone and symbol and voice.” In other words, a voice can register its changing surroundings; conveying these subtle transformations on the page, however, is often far more difficult. This shortcoming is one that Edward Kamau Brathwaithe once memorably described when explaining why he insisted on using a tape recorder in a lecture on language in the Caribbean: “I want you to get the sound of it, rather than the sight of it.” The idiomatic familiarity of the first half, which clashes so sharply with the awkwardness of the second, suggests that the multivocality of an audiobook can open ears by accentuating how the voice is not fixed but in constant formation.

Featured Image: “Audiobook” by Flickr user ActuaLitte, CC-BY-SA-2.0

Sam Carter is a PhD Candidate in Romance Studies at Cornell University. His work on literature and sound in the Southern Cone has appeared in Latin American Textualities: History, Materiality, and Digital Media and is forthcoming in the Revista Hispánica Moderna.

tape reelREWIND! . . .If you liked this post, you may also dig:

SO! Reads: Jonathan Sterne’s MP3: The Meaning of a Format–Aaron Trammell

Radio de Acción: Violent Circuits, Contentious Voices: Caribbean Radio Histories–Alejandra Bronfman

Scenes of Subjection: Women’s Voices Narrating Black Death“–Julie Beth Napolin

“Recorder of Dublin”: Ulysses’ FX in 1982

For many, the audiobook is a source of pleasure and distraction, a way to get through the To Read Pile while washing dishes or commuting. Audiobooks have a stealthy way of rendering invisible the labor of creating this aural experience: the writer, the narrator, the producer, the technology…here at Sounding Out! we want to render that labor visible and, moreover, think of the sound as a focus of analysis in itself.

Over the next few weeks, we will host several authors who will make all of us think differently about the audiobook selections on our phone, in our car, and in our radios. Today we start things off with a close listen of the 1982 audiobook edition of James Joyce’s Ulysses. Watch out for the hoooooooooooooonk of the SO! train pulling into the station!

—Managing Editor Liana Silva


To think about James Joyce’s Ulysses is to think about the first instant when it truly seized your ears. Accordingly, my Ulysses begins in its final episode, “Penelope”: Molly Bloom is lying down or sitting up next to a passed-out Leopold Bloom when she hears the “frseeeeeeeefronnnng train somewhere whistling.” Her train does not go chug, choo, or chuff, but it rhymes with her “Loves old sweeeetsonnnng” (1669) with an infectious insouciance for the codes of language. Let us call this the Ulysses of 1922 (though the definitive edition of James Joyce’s book whose page numbers are cited here was produced in 1984 by Hans Walter Gabler).

The Ulysses of 1922 is what Jacques Derrida called gramophonic. It plays back to us something recorded without filtering out the noise and is to be heard more than it is to be read. We listen to the book, but we are second-in-line. The first listener is the book itself, which listens to Dublin and records everything with an odd sonic democracy, discriminating little amid its recording of all sounds vivid or vapid, giving equal importance to cats, carts, bells, machines, laughter, coughs, and language. The book saunters about the city, listening and recording, and we listen to the book like we would to a scratchy, static-filled recording of a concert the morning-after. It is a reminder of something Michel Serres once said in The Five Senses: “Meaning trails this long comet tail behind it. A certain kind of æsthetics… take as their object this brilliant trail” (120). Ulysses’ elusive modern city glows in this comet tail of noise and background static more than it pivots around conventionally meaningful language content. Eventually, industrial and technological modernity catches up with artistic modernism and in 1924, Joyce reads and records parts of the “Aeolus” episode of Ulysses, and later in 1929 he records a section of Finnegans Wake. Many years after, in 1982 – the centenary year of Joyce’s birth – Ulysses comes home to Dublin and is recorded in full by Irish national radio.

Ulysses 1982 broadcast CD cover, under fair use

The 1982 Ulysses Broadcast was an uninterrupted twenty-nine-and-a-half-hour reading of the entire unabridged text on Ireland’s RTÉ Radio on 16th June – Bloomsday – produced by Micheál Ó hAodha. Among this and the two film versions, one from 1967 and the other from 2003, and other recordings such as the ones by LibriVox volunteers and a more recent one by BBC Radio 4, the 1982 Ulysses Broadcast was the first complete recording of the text. Director William Styles called upon voice actors from the Radio Éireann Players to dramatize and act Ulysses out.

My Ulysses of 1982 seizes me differently from the book. From the first seconds of the 1982 Broadcast, I reacted to Buck Mulligan stepping down the stairs inside the Martello Tower with surprise, because the reading is somewhat copiously accompanied; the sounds of loud waves outside of the walls of the seaside tower were part of the soundscape I was thrown into:

Immersion was of the essence. Not that the Ulysses of 1922 is by any means a silent text, but this accompaniment was a simultaneous roar. Sounds in the written text take up space, and as these sounds are being “played” in the book, there is a length of text where nothing else is happening. Think, for instance, of the machinery in the “Aeolus” episode: “Almost human the way it sllt to call attention” (251). As the “sllt” is recorded by the book, it is not over or behind any other sound or voice. It takes up its own space, unlike in the Broadcast

The layering of Buck Mulligan’s voice over the sounds of the sea becomes possible in the move from the spatial-visual of the page to the temporal-aural of a recording. However, listening to the Broadcast prompts me to ask: Is the sonic democracy of recording the soundscape still there?

Most critical work on the audiobook focuses on readerly reception and pleasure, almost indicating that we can hear the Ulysses of 1922 but we must read the Broadcast of 1982; the book provides for more direct sensory engagement while with the Broadcast, we must focus on analyzing the mechanics of our reception. We also get terms like Reinhart Meyer-Kalkus’ “hear-reading” (179) or Matthew Rubery’s “ear contact” (72) which are concerned with the link between the playback of the recorded text and the reading ear. We hear-read when we listen to the voice in our heads recite aloud to us what we are reading, and we establish ear contact, much like eye contact, when we find our ears bound to voices instead of people. Both these concepts are concerned with reception. If we steer clear of our listening of the Broadcast and turn the focus to the Broadcast’s listening of Ulysses, what we find is a rich sonic world, but it is one which takes us away from the linguistic play of the text.

For instance, the book gives cues for the ambient sounds of Dublin clamor surrounding any voice which might be speaking at that moment. “Stream of life” (327) signals in the Broadcast the coming alive of the city soundscape. What is described as a “sudden screech of laughter” (255) in the book is layered upon loud laughter in the Broadcast, as is “a loud cough” (281) upon a loud cough, and a telephone which “whirred” (283) upon the sound of an actual ringing telephone. Later, in the “Circe” episode, a mention of whistling (1169) is also whistled out.

Trams, the clatter of plates and glasses, desks being rapped, coins and bells ringing and jingling, cannon-firing, all these sounds are played as accompaniments again and again as their descriptions are being voiced in the Broadcast. Like in bedtime storytelling, says Brigette Ouvry-Vial, sound effects as uncomplicated accompaniments are never in conflict with the voiced text. Think of pictures and illustrations alongside words in children’s literature (185). The background sound effects of the broadcast add nothing to the sonic democracy of the book even if they do not detract from it.

The Ulysses of 1922 is also rife with non-lexical, unpronounceable sounds, like the one’s Bloom’s cat makes. The many different cat sounds, for example “Mkgnao!” and “Mrkgnao!” and “Mrkrgnao!” (107-8), are not voiced at all in the broadcast, and are instead replaced by the mimicked sounds of a cat meowing, almost exactly the same each time:

“Miaow!” (133) and “Prr” (107), which are Bloom’s responses to his cat, are voiced by him. When the “door of Ruttledge’s office whispered: ee: cree” (243), there is no voicing – only the sound of a creaking door. Yet, when we are in Bloom’s thoughts, like when he remembers a glorious gust of wind which blew up Molly’s skirt, he voices the gust of wind in the Broadcast going “Brrfoo!” (329), pronouncing the non-lexical word with a close-approximation. Would not the non-lexical sounds in his head suggest that he is thinking in sound rather than in language, much like many of us who can hear sounds in our heads? Often but not always, environmental sounds are retained as actual sounds while the sounds in Bloom’s head are sublimated into pronounceable, phonetic language. But mostly there is an insistence on adding sound effects wherever possible.

“JAMES JOYCE [ST. STEPHEN’S GREEN] REF-1085622” by Flickr user William Murphy, CC BY-SA 2.0

Whether the book describes the sound or sounds it with a non-lexical string of words, the Broadcast attaches its effects. If we look at the book as a recorder, its movements are staggeringly complex as it moves in and out of multiple spaces. When it is in Bloom’s head, the environment is muted, and when it is inside a carriage, unless it is poked out an open window, it does not record the street. Ssave for a few instances, the Broadcast’s insistence on effects attests to its rich production, but not to its vitality. It therefore stands as an accompaniment to the book, not as a text in its own right given its compositional inconsistencies. So, the several variations on Bloom’s flatulence with “Rrrrrr” (625), “Fff. Oo. Rrpr,” and “Pprrpffrrppfff” are all erased and instead fart sounds are recorded.

On the same page, when Bloom tries to mask his own sounds of bodily release under the din of the passing tram, the “Krandlkrakran” (629) is both voiced by Bloom and recorded as the sound of a noisily ringing tram in the background. But only an actual train whistles in “Penelope,” with no voice in the Broadcast attempting to say “frseeeeeeeefronnnng” (1669).

For Charles Bernstein, the sound of a work of literature, much like the shape of poetry on the page, might be an element which is “extralexical but… not extrasemantic” (5). It is different from the written word but it is not a meaningless ornament. For the Broadcast, however, it might as well be the case that sound is made irrelevant to meaning. Or, we can argue that the meaning being made is in the realm of performance studies and not literature. The pure temporality of the Broadcast helps. We can stop reading the book to look, but we cannot stop the Broadcast and still listen. Moreover, when the Broadcast records, it is listening to the book’s listening of Dublin, removed by another degree from the soundscape of Dublin.

The Broadcast is not however without value. Bernstein echoes Serres when he aggrandizes the “sheer noise of language” (22) which must take precedence over the impulse to decode everything. The Broadcast answers this need to not immediately rationalize and sublimate in analysis everything that is heard, but to rather hear without listening. Cue the poet Robert Carleton Brown who once said that writing since the very beginning has been “bottled up” inside of books (23). And in 1982, the stopper on Joyce’s spuming prose was popped.

Featured Image: “telemachus: the tower, 8 a.m., theology, white/gold, heir, narrative (young)” by Flickr user brad lindert, CC-BY-2.0

Shantam Goyal studies English Literature at the State University of New York at Buffalo for his PhD. He completed his M.Phil in 2018 from the University of Delhi with a dissertation titled “Listen Ulysses: Joyce and Sound.” He hopes to continue this thread for his doctoral research on Finnegans Wake and mishearing. Besides Joyce Studies and Sound Studies, he works on Poetics and Jazz Studies, and is also attempting to translate parts of Ulysses into Hindi as a personal project. His reviews, articles, and creative work have appeared in The Print, The Hindu Business Line, Vayavya, ColdNoon, Daath Voyage, and Café Dissensus among other publications. He prefers that any appellations for him such as academic, poet, or person be prefaced with “Delhi-based.”

tape reelREWIND! . . .If you liked this post, you may also dig:

This Is How You Listen: Reading Critically Junot Diaz’s Audiobook-Liana Silva

“‘HOW YOU SOUND??’: The Poet’s Voice, Aura, and the Challenge of Listening to Poetry”-John Hyland

“This Is Not A Sound”: The Treachery Of Sound in Comic Books-Osvaldo Oyola

%d bloggers like this: