Look Away and Listen: The Audiovisual Litany in Philosophy

This is an excerpt from a paper I delivered at the 2017 meeting of the Society for Phenomenology and Existential Philosophy.

“Compressed and rarefied air particles of sound waves” from Popular Science Monthly, Volume 13. In the public domain, via Wikimedia Commons.

According to sound studies scholar Jonathan Sterne in The Audible Past, many philosophers practice an “audiovisual litany,” which is a conceptual gesture that favorably opposes sound and sonic phenomena to a supposedly ocularcentric status quo. He states, “the audiovisual litany…idealizes hearing (and, by extension, speech) as manifesting a kind of pure interiority. It alternately denigrates and elevates vision: as a fallen sense, vision takes us out of the world. But it also bathes us in the clear light of reason” (15). In other words, Western culture is ocularcentric, but the gaze is bad, so luckily sound and listening fix all that’s bad about it. It can seem like the audiovisual litany is everywhere these days: from Adriana Cavarero’s politics of vocal resonance, to Karen Barad’s diffraction, to, well, a ton of Deleuze-inspired scholarship from thinkers as diverse as Elizabeth Grosz and Steve Goodman, philosophers use some variation on the idea of acoustic resonance (as in, oscillatory patterns of variable pressure that interact via phase relationships) to mark their departure from European philosophy’s traditional models of abstraction, which are visual and verbal, and to overcome the skeptical melancholy that results from them. The field of philosophy seems to argue that we need to replace traditional models of philosophical abstraction, which are usually based on words or images, with sound-based models, but this argument reproduces hegemonic ideas about sight and sound.

For Sterne, the audiovisual litany is traditionally part of the “metaphysics of presence” that we get from Plato and Christianity: sound and speech offer the fullness and immediacy that vision and words deny. However, contemporary versions of the litany appeal to a different metaphysics. For example, Cavarero in For More Than One Voice argues that the privileging of vision over sound is the foundation of the metaphysics of presence. “The visual metaphor,” she argues, “is not simply an illustration; rather, it constitutes the entire metaphysical system” (38). The problem with this videocentric metaphysics is that it “legitimates the reduction of whatever is seen to an object” (Cavarero 176) and it cannot “anticipate” or “confirm the uniqueness” of each individual (4). In other words, it objectifies and abstracts, and that’s bad. If vision is the foundation of the metaphysics of presence, one way to fix its problems is to replace the foundation with something else. Cavarero thinks vocal resonance avoids the objectifying and abstracting tendencies that images and text supposedly lend to philosophy.

Similarly, in the same way that the traditional audiovisual litany “assume[s] that sound draws us into the world while vision separates us from it” (Sterne 18), Barad’s argument for agential realism in Meeting the Universe Halfway assumes that diffraction draws theorists into actual contact with matter while “reflection still holds the world at a distance” (87). Agential realism is the view that even the most basic units of reality, like the basic particles of matter, exercise agency as they interact to form more complex units; diffraction is Barad’s theory about how these particles interact. This litany of distance versus relationality and external objectivity versus immersive materiality structures Barad’s counterpoint between reflection and diffraction. For example, she contrasts traditional investment in reflective surfaces – “the belief that words, concepts, ideas, and the like accurately reflect or mirror the things to which they refer – makes a finely polished surface of this whole affair” (86) – with diffractive interiorities, which get down to “the real consequences, interventions, creative possibilities, and responsibilities of intra-acting within and as part of the world” (37). But how do we know Barad is appealing to an audiovisual litany? We know because her fundamental concept – diffraction – describes the behavior of waveforms as they encounter other things, and 21st century Western scientists and music scholars think sound is a waveform. When two or more waves interact, they produce “alternating pattern[s] of wave intensity” or “increasing and decreasing intensities” (Barad 77), like ripples in water or alternating light frequencies.

“diffracted hydrogen” by Flickr user candace, CC BY 2.0

Barad appeals to notions of consonance and dissonance to explain how these patterns interact. For example, when diffracting light waves around a razor blade, “bright spots appear in places where the waves enhance one another – that is, where there is ‘constructive interference’ – and dark spots appear where the waves cancel one another – that is, where there is ‘destructive interference’” (Barad 77). This “constructive” and “destructive” interference is like audio amplification and masking: when frequencies are perfectly in sync (peaks align with peaks, valleys with valleys), they amplify; when frequencies are perfectly out of sync (peaks align with valleys), they cancel each other out (this is how noise-cancelling headphones work). Constructive interference is consonance: the synced patterns amplify one another; destructive interference is dissonance: the out-of-sync patterns mask each other. Both types of interference are varieties of resonance, a rational or irrational phase relationship among frequencies. Rational phase relationships are ones where the shorter phases or periods of higher frequencies are evenly divisible into the longer phases/periods; irrational phase relationships happen when the shorter phases can’t be evenly divided into the longer wavelengths. Abstracting from waveforms to philosophical analysis, Barad often uses resonance as a metaphor to translate wave behavior into materialist philosophical methods. However, even though most of Barad’s examples throughout Meeting the Universe Halfway are visual, she’s describing what scientists call acoustic relationships.
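For readers who want to see the wave behavior behind this analogy, here is a minimal Python sketch of constructive and destructive interference. It simply adds two sampled sine waves of the same frequency, once in phase and once half a cycle apart. The function names (`sine_wave`, `superpose`, `peak_amplitude`), the 5 Hz frequency, and the sampling settings are illustrative assumptions, not anything drawn from Barad or Sterne.

```python
import math


def sine_wave(freq_hz, phase_rad, n_samples=1000, duration_s=1.0):
    """Sample a unit-amplitude sine wave at evenly spaced times."""
    return [
        math.sin(2 * math.pi * freq_hz * (i * duration_s / n_samples) + phase_rad)
        for i in range(n_samples)
    ]


def superpose(wave_a, wave_b):
    """Add two waves sample by sample (the superposition principle)."""
    return [a + b for a, b in zip(wave_a, wave_b)]


def peak_amplitude(samples):
    """Largest absolute value reached by the sampled wave."""
    return max(abs(s) for s in samples)


# Hypothetical 5 Hz reference wave, chosen only for illustration.
base = sine_wave(5.0, 0.0)

# In sync: peaks align with peaks, so the waves reinforce one another.
in_phase = superpose(base, sine_wave(5.0, 0.0))

# Half a cycle out of sync: peaks align with valleys, so the waves cancel.
out_of_phase = superpose(base, sine_wave(5.0, math.pi))

constructive_peak = peak_amplitude(in_phase)
destructive_peak = peak_amplitude(out_of_phase)

print("constructive peak:", constructive_peak)
print("destructive peak:", destructive_peak)
```

Under these assumptions the in-phase sum peaks at roughly twice the amplitude of either wave alone, while the out-of-phase sum stays at essentially zero, which is the cancellation that the noise-cancelling headphone example relies on.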

For example, Barad argues that “diffractively read[ing]” philosophical texts means processing “insights through one another for the patterns of resonance and dissonance they coproduce” (195; emphasis mine). Similarly, she advises her readers to tune into the “dissonant and harmonic resonances” (43) that emerge when they try “diffracting these insights [from an early chapter in her book] through the grating of the entire set of book chapters” (30). As patterns of higher and lower intensity that interact via ir/rational phase relationships, diffraction patterns are a type of acoustic resonance. Appealing to acoustics against representationalism, Barad practices a version of the audiovisual litany. And she’s not the only new materialist to do so: Jane Bennett’s concept of vibration and Elizabeth Grosz’s notion of “music” also ontologize a similar idea of resonance and claim it overcomes the distancing and skeptical melancholy produced by traditional methods of philosophical abstraction.

“Painter” by Flickr user Flood G., CC BY-NC-ND 2.0

There are also instances of the audiovisual litany in phenomenology. For example, Alia Al-Saji develops in the article “A Phenomenology of Critical-Ethical Vision” a notion of “critical-ethical vision” against “objectifying vision,” and, via a reading of Merleau-Ponty, grounds the former, better notion of sight (and thought) in his analogy between painting and listening. According to Al-Saji, “objectifying vision” is the model of sight that has dominated much of European philosophy since the Enlightenment. “Objectifying vision” takes seeing as “merely a matter of re-cognition, the objectivation and categorization of the visible into clear-cut solids, into objects with definite contours and uses” (375). Because it operates in a two-dimensional metaphysical plane, it can only see in binary terms (same/other): “Objectifying vision is thus reductive of lateral difference as relationality” (390). According to Al-Saji, Merleau-Ponty’s theory of painting develops an account of vision that is “non-objectifying” (388) and relational. We cannot see paintings as already-constituted objects, but only as visualizations, the emergence of vision from a particular set of conditions. Such seeing allows us “to glimpse the intercorporeal, social and historical institution of my own vision, to remember my affective dependence on an alterity whose invisibility my [objectifying] vision takes for granted” (Al-Saji 391). Al-Saji turns to sonic language to describe such relational seeing: “more than mere looking, this is seeing that listens” (391; emphasis mine).

This Merleau-Pontian vision not only departs from traditional European Enlightenment accounts of vision, it gestures toward traditional European accounts of hearing. Similarly, Fred Evans, in The Multivoiced Body, uses voice as a metaphor for the Deleuzo-Guattarian metaphysics that he calls “chaosmos” or “composed chaos” (86); he then contrasts chaosmos to “homophonic” (67) Enlightenment metaphysics. According to Evans, if “‘voices,’ not individuals, the State, or social structures, are the primary participants in society” (256), then “reciprocity” and “mutual intersection” (59) appear as fundamental social values (rather than, say, autonomy). This analysis exemplifies what is at the crux of the audiovisual litany: voices put us back in touch with what European modernity and postmodernity abstract away.

“Image from page 401 of “Surgical anatomy : a treatise on human anatomy in its application to the practice of medicine and surgery” (1901)” by Flickr user Internet Archive Book Images

The audiovisual litany is hot right now: as I’ve just shown, it’s commonly marshaled in the various attempts to move past or go beyond stale old Western modernist and postmodernist philosophy, with all their anthropocentrism and correlationism and classical liberalism. To play with Marie Thompson’s words a bit, just as there is an “ontological turn in sound studies,” there’s a “sound turn in ontological studies.” But why? What does sound DO for this specific philosophical project? And what kind of sound are we appealing to anyway?

The audiovisual litany naturalizes hegemonic concepts of sound and sight and uses these as metaphors for philosophical positions. This lets philosophical assumptions pass by unnoticed because they appear as “natural” features of various sensory modalities. Though he doesn’t use this term, Sterne’s analysis implies that the audiovisual litany is what Mary Beth Mader calls a sleight. “Sleights” are, according to Mader in Sleights of Reason, “conceptual collaborations that function as switches or ruses important to the continuing centrality and pertinence of the social category of a political system like ‘sex’” (3). Sleights, in other words, are conceptual slippages that render underlying hegemonic structures like cisheteropatriarchy coherent. More specifically, sleights are “conceptual jacquemarts” (Mader 5). Jacquemarts are effectively the Milli Vanilli of clocks: sounds appear to come from one overtly visible, aesthetically appealing source action (figures ringing a bell) but they actually come from a hidden, less aesthetically appealing source action (hammers hitting gongs). The clock is constructed in a way to “misdirect or misindicate” (Mader 8) both who is making the sound and how they are making it. A sound exists, but its source is misattributed. This is exactly what happens in the uses of the audiovisual litany I discuss above: philosophers misdirect or misindicate the source of the distinction they use the audiovisual litany to mark. The litany doesn’t track the difference between sensory media or perceptual faculties, but between two different methods of abstraction.

Screenshot from Milli Vanilli’s video “Don’t Forget My Number”

This slippage between perceptual medium and philosophical method facilitates the continued centrality of Philosophy-capital-P: philosophy appears to reform its methods and fix its problems, while actually re-investing in its traditional boundaries, values, and commitments. For example, both new materialists and sound studies scholars have been widely critiqued for actively ignoring work on sound and resonance in black studies (e.g., by Zakiyyah Jackson, Diana Leong, Marie Thompson). As Zakiyyah Jackson argues in Outer Worlds: The Persistence of Race in Movement “Beyond the Human,” new materialism’s “gestures toward the ‘post’ or the ‘beyond’ effectively ignore praxes of humanity and critiques produced by black people” (215), and in so doing ironically reinstitute the very thing new materialism claims to supersede. Stratifying theory into “new” and not-new, new materialist “appeals to move ‘beyond’…may actually reintroduce the Eurocentric transcendentalism this movement purports to disrupt” (Jackson 215) by exclusively focusing on European philosophers’ accounts of sound and sight. Similarly, these uses of the litany often appeal only to other philosophers’ accounts of sound or music, not actual works or practices or performances. They don’t even attend to the sonic dimensions of literary texts, a method that scholars such as Jennifer Lynn Stoever and Alexander Weheliye develop in their work. Philosophers use the audiovisual litany to disguise philosophy’s ugly politics (white supremacy and Eurocentrism) behind an outwardly pleasing conceptual gesture: the turn from sight or text to sound. With this variation of the audiovisual litany, Philosophy appears to cross beyond its conventional boundaries while actually doubling down on them.

Featured image: “soundwaves” from Flickr user istolethetv

Robin James is Associate Professor of Philosophy at UNC Charlotte. She is author of two books: Resilience & Melancholy: pop music, feminism, and neoliberalism, published by Zer0 books last year, and The Conjectural Body: gender, race and the philosophy of music, published by Lexington Books in 2010. Her work on feminism, race, contemporary continental philosophy, pop music, and sound studies has appeared in The New Inquiry, Hypatia, differences, Contemporary Aesthetics, and the Journal of Popular Music Studies. She is also a digital sound artist and musician. She blogs at its-her-factory.com and is a regular contributor to Cyborgology.

The Screech Within Speech

Welcome back to SO!’s Sonic Shadows series, which focuses on what it means to “have a voice.” In the first post in the series, I considered the role of the novel in sound studies, and how, paradoxically, this led us back to the embodied voice of the writer. In Joseph Conrad’s prose, traces of accent and translingualism shape the sonic space of difference, but also reframe the novel as a social, yet ambiguous act of communication.

This week, I’m happy to welcome Dominic Pettman, who picks up the question of the embodied human voice as it brushes up against the animal in what he calls the “voice of the world.” Next week, the series will conclude with uncanny mechanical sounds of early recording that trouble the voice of the human from within.

— Julie Beth Napolin, Guest Editor

This is the sound of “the loneliest whale in the world.”

Scientists have tracked this mournful creature for several years, intrigued by the melancholy songs, which go unanswered. The call of this singular cetacean, an Internet cult figure of unidentified species, registers at the unusual frequency of 52hz, much higher than that of all other types of whale.

These days, in general, whales have been forced into relatively tiny sonic boxes because of the din created by ship engines and various audio probings of the marine environment by military and industry alike. As bio-acoustician Christopher Clark suggests, this assault and subsequent diminishment of the whale’s soundscape must be extremely traumatic for the animal, whose overall umwelt has shrunk from large swathes of the watery planet to barely a mile or so in any given direction. The noisier the ocean becomes, the lonelier whales are likely to become.

“anim1083” by Flickr user NOAA Photo Library, CC BY 2.0

The 52hz whale is a bit like an outsider artist, offering personalized songs to the sub-aquatic world, only to be snubbed by the more “vocal” members of the whale community. Cetaceans could arguably be considered the first instance of global communication, many millions of years ago, since their calls could travel astonishing distances – up to 500 miles under water. Songs of the humpback, for instance, can “sweep across the Pacific in just a few years,” as biologists from the University of Queensland explain. “In any given year, all the males in a population sing the same song, but the songs change from year to year. The changes are more than incremental; they represent whole new repertoires.”

Can we really, however, speak of singing in such cases? Many would argue that simply using the organ of vocalization does not equate to singing in that it lacks the element of self-reflection necessary for true expression; for artistry. Others have conversely argued that humans were likely taught to sing by other creatures, especially the birds. These perspectives on the question of the interspecies voice have a long and complex history, crisscrossing epochs, as well as those divergent orientations to the natural world crudely divided into “East” and “West.” In this post, I focus on what it means to try to hear the animal beyond or through human terms, to explore the question of who or what can rightly claim to have a voice – is it a property or capacity that belongs to a subject, even a nonhuman subject? Might we consider voice to include “expression” of the elements themselves? Might the world itself, whatever such a grand phrase might denote, have a vox mundi – a voice of the planet?

“Angry” by Pixabay user PublicDomainPictures, public domain

Such questions deserve long and careful consideration [and SO! has housed a series of reflections on acoustic ecology and a singing planet]. But in this brief context, I focus on the historically contested existence of a creaturely voice – one which describes a plurality of vocal expressions, distributed among those species blessed with the capacity to make sounds with their bodies. As Tobias Menely explains in a wonderful new book, the creaturely voice, like the human one, forms the vector of sympathy, and is thus suspended between the individual producing the sound and the one listening to it. Through “the voice of nature” we understand our essential “creaturely entanglement” with other animals. This perspective pushes Mladen Dolar’s psychoanalytic theory that voice ties self to other to include the nonhuman experience of the animal realm.

Menely argues for a condition of social identity in “creaturely voice,” which is a way of testing the world, and one’s location, role, and value in it. In other words, monkeys, birds, whales, and so on, test their own existence when they emit non-symbolic equivalents of, “I’m here.” “Where are you?” “Are you really there?” “Who are you?” “Marco.” “Polo.” These are the unspoken – and yet at least partially communicated – messages woven into the ever-vanishing, yet always returning, medium of the voice.

Take, for instance, the parrot or cockatoo. We humans have been fascinated by these birds, largely by virtue of their perceived organic capacity to “record” our own voices, and throw these back at us, like trickster ventriloquists, long before the invention of the phonograph. Certainly, this can create an uncanny effect in the human listener: hearing our own voice echoed back from the larynx of a creature so different from ourselves – a creature that may or may not have its own mind or soul. Historically speaking, many people who had their figurative feathers ruffled by the impertinence of parrots deflected the discomfort they felt upon hearing their own words screeched back at them.

This pet parrot, who had clearly been in the room when its owner was watching X-rated material, recently became famous. The instant mirth, and/or discomfort, that this clip produces is a function of hearing ourselves, as humans, echoed back by an animal. Our words are “rebroadcast” back to us by an entity that has no sense of irony or decorum. It is literally obscene. It is as if the world were engaged in objective parody of the planet’s most arrogant animal: revealing one of our most sacred activities (“making love”) to be little more than a kind of crude ventriloquial trick. This parrot is not deliberately lampooning us, yet, the refrain created by the bird’s imitative tendencies means that we are lampooned nevertheless.

Another famous pet cockatoo was given to a new couple after a bitter divorce obliged it to find a new home. The details of the break-up remain obscure to the second owners. However, this (traumatized?) cockatoo re-enacts the tone, pitch, and vehemence of the arguments that it was obliged to witness in its previous life. While most of the “words” the cockatoo screeches are not clear enough to be translated, the emotions that initially launched them are obvious to all within hearing distance. The bird even bobs its head, and spreads its wings, in imitation of the angry body language of a wife scorned, spurned, or otherwise so aggrieved that she can only incessantly shriek at the man who made her so miserable. Whose voice is this, then?

Parrots are like children, some might claim, squawking back syllables they will never comprehend. One might as well yell into a cave, and be astonished that the words return as a consequence of physics. Bird songs, according to such a concept, create what Gilles Deleuze and Felix Guattari call “a refrain,” which in turn generates a territory through the act of sonically diagramming it. This operation is not limited to the natural world, however, since we may say the same about television sets or saxophones.

Consider how children, or lovers, playfully imitate the speech of the other. In doing so, they assert their own identity, while also putting such an identity under erasure. Many animals (including humans) may thus be creatures who continue to flesh themselves out in(to) this territory. But instead of the animal echoing back the human, what about the reverse? As a final example, consider one famous instance of simulated human suffering, “devolving” into a creaturely register; namely, the old literature professor, Dr. Immanuel Rath, who experiences a nervous breakdown when he succumbs to intense jealousy and a broken heart, at the climax of Josef von Sternberg’s classic film, The Blue Angel (1930).

Just as the full weight of his rejection, at the hands of Lola Lola (Marlene Dietrich) is being registered in his psyche, the professor – who has quit teaching to follow his beloved in the cabaret world – is ushered out onto the theatrical stage, dressed as a clown. The audience waits in skeptical anticipation of an amusing performance, but the haunted ex-professor can only unleash a torrent of repressed anguish at his broken heart, and his humiliation at the hands of the vulgar mob. The horrible sound he releases, silencing the crowd, is part spurned lover, part rooster, and wholly abject. The professor seems to lose almost all his humanity, which was once verifiable in his composed and authoritative teaching voice, but is now some kind of demonic bird, screeching in misery, fury, and defeat. As this seemingly mindless force of vengeance tries to strangle his romantic obsession backstage, and as he continues to struggle against those who restrain him, the ex-professor has become creaturely: a supposedly subhuman status signified more by his inhuman voice than by anything else.

And yet, as we have seen, there is no simple hierarchy here, where the human occasionally – in times of great distress – finds themselves, by this logic, reduced to being “an animal.” We might call this the vox mundi – the voice of the world – in which, like the shadowy depths of the ocean, there is a swath of sound shared by human and animal. The creaturely voice can be sweet, like the nightingale. Or it can be harsh, like the traumatized cockatoo or the green-eyed professor-clown. There is an intimate link between the voices of animals and those of humans, which cannot be reduced to a concept like “communication,” but which nevertheless impacts and influences all those in hearing distance.

“Humpback Whales” by Flickr user Christopher Michel, CC BY 2.0

That is, unless one happens to be a whale, singing at 52hz. In which case, we are likely to keep singing into the inky darkness, without any reply.

Dominic Pettman is Chair of Liberal Studies, New School for Social Research, and Professor of Culture & Media, Eugene Lang College. He is the author of several books, including Look at the Bunny: Totem, Taboo, Technology (Zero books), and the forthcoming Infinite Distraction: Paying Attention to Social Media (Polity).

Featured image: “Humpback Whales” by Flickr user Christopher Michel, CC BY 2.0

