Echo and the Chorus of Female Machines
Editor’s Note: February may be over, but our forum is still on! Today I bring you installment #5 of Sounding Out!‘s blog forum on gender and voice. Last week Art Blake talked about how his experience shifting his voice from feminine to masculine as a transgender man intersects with his work on John Cage. Before that, Regina Bradley put the soundtrack of Scandal in conversation with race and gender. The week before I talked about what it meant to have people call me, a woman of color, “loud.” That post was preceded by Christine Ehrick‘s selections from her forthcoming book, on the gendered soundscape. We have one more left! Robin James will round out our forum with an analysis of how ideas of what women should sound like have roots in Greek philosophy.
This week Canadian artist and writer AO Roberts takes us into the arena of speech synthesis and makes us wonder about what it means that the voices are so often female. So, lean in, close your eyes, and don’t be afraid of the robots’ voices. –Liana M. Silva, Managing Editor
—
I used Apple’s SIRI for the first time on an iPhone 4S. After hundreds of miles in a van full of people on a cross-country tour, all of the music had been played and the comedy mp3s entirely depleted. So, like so many first time SIRI users, we killed time by asking questions that went from the obscure to the absurd. Passive, awaiting command, prone to glitches: there was something both comedic and insidious about SIRI as female-gendered program, something that seemed to bind up the technology with stereotypical ideas of femininity.
Speech synthesis is the artificial simulation of the human voice through hardware or software, and SIRI is but one incarnation of the historical chorus of machines speaking what we code to be female. Starting from the early 20th century Voder, to the Cold-War era Silvia and Audrey, up to Amazon’s newly released Echo, researchers have by and large developed these applications as female personae. Each program articulates an individual timbre and character, soothing soft spoken or matter of fact; this is your mother, sister, or lover, here to affirm your interests while reminding you about that missed birthday. She is easy to call up in memory, tones rounded at the edges, like Scarlett Johansson’s smoky conviviality as Samantha in Spike Jonze’s Her, a bodiless purr. Simulated speech articulates a series of assumptions about what neutral articulation is, what a female voice is, and whose voice technology can ventriloquize.
The ways computers hear and speak the human voice are as complex as they are rapidly expanding. But in robotics gender is charted down to actual wavelength, actively policed around 100-150 HZ (male) and 200-250 HZ (female). Now prevalent in entertainment, navigation, law enforcement, surveillance, security, and communications, speech synthesis and recognition hold up an acoustic mirror to the dominant cultures from which they materialize. While they might provide useful tools for everything from time management to self-improvement, they also reinforce cisheteronormative definitions of personhood. Like the binary code that now gives it form, the development of speech recognition separated the entire spectrum of vocal expression into rigid biologically based categories. Ideas of a real voice vs. fake voice, in all their resonances with passing or failing one’s gender performance, have through this process been designed into the technology itself.
A SERIES OF MISERABLE GRUNTS

“Kempelen Speakingmachine” by Fabian Brackhane (Quintatoen), Saarbrücken – Own work. Licensed under Public Domain via Wikimedia Commons –
The first voice to be synthesized was a reed and bellows box invented by Wolfgang Von Kempelen in 1791 and shown off in the courts of the Hapsburg Empire. Von Kempelen had gained renown for his chess-playing Turk, a racist cartoon of an automaton that made waves amongst the nobles until it was revealed that underneath the tabletop was a small man secretly moving the chess player’s limbs. Von Kempelen’s second work, the speaking machine, wowed its audiences thoroughly. The player wheedled and squeezed the contraption, pushing air through its reed larynx to replicate simple words like mama and papa.
Synthesizing the voice has always required some level of making strange, of phonemic abstraction. Bell Laboratories originally developed The Voder, the earliest incarnation of the vocoder, as a cryptographic device for WWII military communications. The machine split the human voice into a spectral representation, fragmenting the source into number of different frequencies that were then recombined into synthetic speech. Noise and unintelligibility shielded the Allies’ phone calls from Nazi interception. The Vocoder’s developer, Ralph Miller, bemoaned the atrocities the machine performed on language, reducing it to a “series of miserable grunts.”

From website Binary Heap-
In his history of the The Vocoder, How to Wreck a Nice Beach, Dave Tompkins tells how the apparatus originally took up an entire wall and was played solely by female phone operators, but the pitch of the female voice was said to be too high to be heard by the nascent technology. In fact, when it debuted at the 1939 World’s Fair, only men were chosen to experience the roboticization of their voice. The Voder was, in fact, originally created to only hear pitches in the range of 100-150 HZ, a designed exclusion from the start. So when the Signal Corps of the Army convinced President Eisenhower to call his wife via Voder from North Africa, Miller and the developers panicked for fear she wouldn’t be heard. Entering the Pentagon late at night, Mamie Eisenhower spoke into the telephone and a fragmented version of her words travelled across the Atlantic. Resurfacing in angular vocoded form, her voice urged her husband to come home, and he had no problem hearing her. Instead of giving the developers pause to question their own definitions of gender, this interaction is told as a derisive footnote of in the history of the sound and technology: the punchline being that the first lady’s voice was heard because it was as low as a man’s.
WAKE WORDS
In fall 2014 Amazon launched Echo, their new personal assistant device. Echo is a 12-inch long plain black cone that stands upright on a tabletop, similar in appearance to a telephoto camera lens. Equipped with far field mics, Echo has a female voice, connected to the cloud and always on standby. Users engage Echo with their own chosen ‘wake’ word. The linguistic similarity to a BDSM safe word could have been lost on developers. Although here inverted, the word is used to engage rather than halt action, awakening an instrument that lays dormant awaiting command.
Amazon’s much-parodied promotional video for Echo is narrated by the innocent voice of the youngest daughter in a happy, straight, white, middle-class family. While the son pitches Oedipal jabs at the father for his dubious role as patriarchal translator of technology, each member of the family soon discovers the ways Echo is useful to them. They name it Alexa and move from questions like: “Alexa how many teaspoons in a tablespoon” and “How tall is Mt. Everest?” to commands for dance mixes and cute jokes. Echo enacts a hybrid role as mother, surrogate companion, and nanny of sorts not through any real aspects of labor but through the intangible contribution of information. As a female-voiced oracle in the early pantheon of the Internet of Things, Echo’s use value is squarely placed in the realm of cisheteronormative domestic knowledge production. Gone are the tongue-in-cheek existential questions proffered to SIRI upon its release. The future with Echo is clean, wholesome, and absolutely SFW. But what does it mean for Echo to be accepted into the home, as a female gendered speaking subject?
Concerns over privacy and surveillance quickly followed Echo’s release, alarms mostly sounding over its “always on” function. Amazon banks on the safety and intimacy we culturally associate with the female voice to ease the transition of robots and AI into the home. If the promotional video painted an accurate picture of Echo’s usage, it would appear that Amazon had successfully launched Echo as a bodiless voice over the uncanny valley, the chasm below littered with broken phalanxes of female machines. Masahiro Mori coined the now familiar term uncanny valley in 1970 to describe the dip in empathic response to humanoid robots as they approach realism.
If we listen to the litany of reactions to robot voices through the filters of gender and sexuality it reveals the stark inclines of what we might think of as a queer uncanny valley. Paulina Palmer wrote in The Queer Uncanny about reoccurring tropes in queer film and literature, expanding upon what Freud saw as a prototypical aspect of the uncanny: the doubling and interchanging of the self. In the queer uncanny we see another kind of rift: that between signifier and signified embodied by trans people, the tearing apart of gender from its biological basis. The non-linear algebra of difference posed by queer and trans bodies is akin to the blurring of divisions between human and machine represented by the cyborg. This is the coupling of transphobic and automatonophobic anxieties, defined always in relation to the responses and preoccupations of a white, able bodied, cisgendered male norm. This is the queer uncanny valley. For the synthesized voice to function here, it must ease the chasm, like Echo: sutured by a voice coded as neutral, but premised upon the imagined body of a white, heterosexual, educated middle class woman.
22% Female
My own voice spans a range that would have dismayed someone like Ralph Miller. I sang tenor in Junior High choir until I was found out for straying, and then warned to stay properly in the realms of alto, but preferably soprano range. Around the same time I saw a late night feature of Audrey Hepburn in My Fair Lady, struggling to lose her crass proletariat inflection. So I, a working class gender ambivalent kid, walked around with books on my head muttering The Rain In Spain Falls Mainly on the Plain for weeks after. I’m generally loud, opinionated and people remember me for my laugh. I have sung in doom metal and grindcore punk bands, using both screeching highs and the growling “cookie monster” vocal technique mostly employed by cismales.
Given my own history of toying with and estrangement from what my voice is supposed to sound like, I was interested to try out a new app on the market, the Exceptional Voice App (EVA ), touted as “The World’s First and Only Transgender Voice Training App.” Functioning as a speech recognition program, EVA analyzes the pitch, respiration, and character of your voice with the stated goal of providing training to sound more like one’s authentic self. Behind EVA is Kathe Perez, a speech pathologist and businesswoman, the developer and provider of code to the circuit. And behind the code is the promise of giving proper form to rough sounds, pitch-perfect prosody, safety, acceptance, and wholeness. Informational and training videos are integrated with tonal mimicry for phrases like hee, haa, and ooh. User progress is rated and logged with options to share goals reached on Twitter and Facebook. Customers can buy EVA for Gals or EVA for Guys. I purchased the app online for my iPhone for $5.97.
My initial EVA training scores informed me I was 22% female; a recurring number I receive in interfaces with identity recognition software. Facial recognition programs consistently rate my face at 22% female. If I smile I tend to get a higher female response than my neutral face, coded and read as male. Technology is caught up in these translations of gender: we socialize women to smile more than men, then write code for machines to recognize a woman in a face that smiles.
As for EVA’s usage, it seems to be a helpful pedagogical tool with more people sharing their positive results and reviews on trans forums every day. With violence against trans people persisting—even increasing—at alarming rates, experienced worst by trans women of color, the way one’s voice is heard and perceived is a real issue of safety. Programs like EVA can be employed to increase ease of mobility throughout the world. However, EVA is also out of reach to many, a classed capitalist venture that tautologically defines and creates users with supply. The context for EVA is the systems of legal, medical, and scientific categories inherited from Foucault’s era of discipline; the predetermined hallucination of normal sexuality, the invention of biological criteria to define the sexes and the pathologization of those outside each box, controlled by systems of biopower.
Despite all these tools we’ll never really know how we sound. It is true that the resonant chamber of our own skull provides us with a different acoustic image of our own voice. We hate to hear our voice recorded because suddenly we catch a sonic glimpse of what other people hear: sharper more angular tones, higher pitch, less warmth. Speech recognition and synthesis work upon the same logic, the shifting away from interiority; a just off the mark approximation. So the question remains what would a gender variant voice synthesis and recognition sound like? How much is reliant upon the technology and how much depends upon individual listeners, their culture, and what they project upon the voice? As markets grow, so too have more internationally accented English dialects been added to computer programs with voice synthesis. Thai, Indian, Arabic and Eastern European English were added to Mac OSX Lion in 2011. Can we hope to soon offer our voices to the industry not as a set of data to be mined into caricatures, but as a way to assist in the opening up in gender definitions? We would be better served to resist the urge to chime in and listen to the field in the same way we suddenly hear our recorded voice played back, with a focus on the sour notes of cold translation.
—
Featured image: “Golden People love Gold Jewelry Robots” by Flickr user epSos.de, CC BY 2.0
—
AO Roberts is a Canadian intermedia artist and writer based in Oakland whose work explores gender, technology and embodiment through sound, installation and print. A founding member of Winnipeg’s NGTVSPC feminist artist collective, they have shown their work at galleries and festivals internationally. They have also destroyed their vocal chords, played bass and made terrible sounds in a long line of noise projects and grindcore bands, including VOR, Hoover Death, Kursk and Wolbachia. They hold a BFA from the University of Manitoba and a MFA in Sculpture from California College of the Arts.
—
REWIND!…If you liked this post, you may also dig:
Hearing Queerly: NBC’s “The Voice”—Karen Tongson
On Sound and Pleasure: Meditations on the Human Voice—Yvon Bonefant
I Been On: BaddieBey and Beyoncé’s Sonic Masculinity—Regina Bradley
The Hell, the High Water, and the Funk of It All: Sounding Power in Scandal

Editor’s Note: Here’s installment #3 of Sounding Out!‘s blog forum on gender and voice! Last week I talked about what it meant to have people call me, a woman of color, “loud.” The week before that we hosted Christine Ehrick‘s selections from her forthcoming book; she introduced us to the idea of the gendered soundscape, which she uses in her analysis on women’s radio speech from the 1930s to the 1950s. In the next few weeks we’ll have A.O. Roberts with synthesized voices and gender, Art Blake with his reflections on how his experience shifting his voice from feminine to masculine as a transgender man intersects with his work on John Cage, and lastly Robin James with an analysis of how ideas of what women should sound like have roots in Greek philosophy.
This week regular writer Regina Bradley puts the soundtrack of Scandal in conversation with the agency of the show’s protagonist, a black woman in manages crises for a living. So, lean in and close your eyes, but keep your ears open for any spies creeping in. –Liana M. Silva, Managing Editor
–––
9:00 pm (Eastern). The quick shutter of an invisible camera calls the attention of the viewers to Scandal. The clicking re-emphasizes the show’s title, bringing to mind paparazzi and their capturing of scandalous behavior. The shuttering also signifies the literal and sonic fast paced timing of Shonda Rhimes’ most popular ABC prime time show: quickened plots, fast talks, and blink-and-you’ll-miss-something-important visual details. Scandal’s central character, Washington D.C. crisis manager Olivia Pope (portrayed by Kerry Washington), is known mostly for her sharp professional outfits and no-nonsense approach to work. In Olivia, Rhimes has created a black female character that is perfectly flawed, a symbol of both the potential power and victimization of black women. Olivia Pope is neither just the savior nor is she solely a victim.
Scandal evokes intense debate about race and power because of its visual politics, but rarely is Scandal’s scoring prominent in those discussions. The soundtrack acts as an indicator of contemporary black women’s agency in popular culture. As both Rhimes and Scandal music director Alexandra Patsavas reveal, Scandal’s ‘vintage’ soundtrack is an opportunity to buoy the plot and add a unique alternative perspective to the action taking place on the show. The soundtrack’s nods to yesteryear artists – including Stevie Wonder, The Ohio Players, The O’Jays, Sam Cooke, Earth, Wind, and Fire, and Nina Simone sonically narrate additional layers of agency and identity on the show. Are these tracks giving Olivia a voice? What does Scandal’s scoring suggest about race, place, and power scripts for black women in contemporary popular culture?
Scandal takes place in Washington D.C., a location full of physical and sonic significance in national lore and the black popular imagination. In a national narrative D.C. is the epicenter of political agency, power, and the visibility of whiteness as a form of power. It is America’s city. Yet D.C. in the black imagination is the Chocolate City, a space that serves as a living archive of black folks’ attempts to intervene into a national narrative that would rather overlook the contributions of black bodies and culture. Washington, D.C. is the home of the Moorland-Springarn Research Center and multiple black cultural archives, Howard University and its place as the black mecca of Black Greek Letter Organizations, GoGo Music, and (embattled) social-political policies and endeavors for black people. It is a site of black identity that goes much farther than the place where everyone saw how a certain somebody had an American Dream. On the other hand, the increasing gentrification of the city raises questions of whether or not the nickname “Chocolate City” is applicable.Thus, Washington, D.C. exists at the crux of the romanticization of Americanness as a form of worldly power and the reality that (white) Americanness does not include all Americans.
Yet Washington D.C. as a site of complex and rich black experiences does not alone buoy Scandal’s use of Washington, D.C. as a site where a black woman “handles” the hustle and bustle of American power and its upheaval. This type of work takes place in the scoring, particularly because the show is not culturally recognizable as a “black show.” Its inherent blackness is sonic, using black music to revisit tropes of power and racial politics.
One possible and albeit slightly heavy handed approach for thinking through Scandal’s leaning on funk and soul music is to point out how the show uses black cultural forms to invoke power. For example, soul songs like Otis Redding’s “Mr. Pitiful,” Edwin Starr’s “War,” and Sly and the Family Stone’s “Everyday People” are not only used to accentuate the action in a scene but the possibility of Olivia as a power figure. The use of black men’s voices as they yell, scream, and moan sonically allude to power as a masculine concept. Yet Olivia’s connection to these songs signifies her potential to wield power in unorthodox ways not associated with black women. For example, the crescendo of music before Olivia delivers a demand to her team sets up her agency as a political figure. Her blackness is amplified and earmarked by the music. This pairing amplifies the question of race and power in a useful way. The dominantly black musical script offers the critique and engagement with Olivia Pope’s blackness that many viewers and critics complain are lacking. (See the brilliant synopsis presented by Dr. Jessica Marie Johnson about Shonda Rhimes’ portrayal of black women and popular culture that took place at Duke University last month.)
Scandal serves vintage musical scoring as a double entendre: the sound of black music from previous eras evokes ‘vintage’ scripts of race, gender, and power from that era that seep into this present moment. Scandal’s use of soul, funk, and disco sonically allude to larger questions lingering from the Civil Rights Movement: integration as an equalizer of power and privilege, the hypermasculinity of the Civil Rights Era, its cultural producers, and the immediate aftermath of these scripts on (black) American society in the late 1960s and 1970s. We frequently annotate black agency through the men creating (singing?) the music. This is equally true for the black cultural productions of the era, as they aesthetically supplemented the understanding that black folks mattering connected to the uplift and healthy presence of black men. Even with soul and funk music, which stand as antithetical responses to the problematic expectations of classist respectability politics, black women’s agency was associated with the sexual, emotional, or physical agency of black men.
The blackness and “maleness” of the funk and soul used in Scandal’s score subverts the power that Olivia Pope exerts in her dealings with her clients, her lovers, and team. For example, in season two Olivia and President Fitz’s sex tape is threatened to be leaked to the public. It is important to note that the tape is an audio tape, suggestively alluding the absence of physical and visual rhetoric to address the interracial relationship. As Olivia gives the word to leak the sex tape, The Ohio Players’ track “Love Rollercoaster” begins to play. It sonically stabilizes Olivia’s decision to “leak” her sexuality as a power move while also leaving room to question the deeper implications of how the viewer navigates her blackness and womanhood using physical, aural, and cultural markers of sexuality. Using male funk and soul artists allows Pope to ‘codeswitch’ between cultural scripts of power as masculine and womanhood as opposite power. It amplifies her authority and agency while signifying that her physical appearance and voice may not have the ability to confer her worth to the audience.
This tug-and-pull of power and agency is most amplified in Olivia Pope’s dealings with her father Rowan Pope, played by Joe Morton (who plays the HELL out of this role, by the way). Rowan Pope is a literal and figurative double agent: He is Elijah Pope, a curator of antiquities at the Smithsonian, and Rowan Pope, head of the top secret and lethal U.S. organization called B613. His fragmented life speaks to the constant negotiation of “safe” black masculinity. He also embodies the anxieties about black men as violent and bloodthirsty. Rowan/Elijah encapsulates all of the swagger and vitriol associated with conceptualizations of black power and black men from the Black Liberation Era. He is cold and calculating, and he complicates the rhetoric of racial uplift and expected from the Civil Rights/Black Power movements. He speaks in hardened, hushed tones with conviction, while snarling his words with spite for white authority. Not to mention, his is the character that brings up race overtly in the show.
This balance between hushed tones and snarled words comes through in Rowan’s early interaction with Olivia during Scandal’s season three premiere. Olivia, on the run because her name is leaked as the President’s mistress, is recovered by her father and told to flee the country. Rowan is not a doting and concerned father in this scene. Rather, he is disappointed by her lack of prowess and failure to aspire to higher forms of power and authority than “first lady.” Rowan recognizes there is no power in being the wife of the President, especially as a black woman, and he criticizes her for not following the first rule of black folks’ survival: “You need to be twice as good to get half of what they have.” “They” is a collective noun for white folks, often spoken behind closed doors as a means to inspire young black folks to do better. Rowan demands she state out loud what they need to be twice of. Olivia’s voice cracks and is breathless as she whispers “twice as good to get half of what they have.” Rowan exaggerates a “yes” and dismisses Olivia as “mediocre.” It is a painful and powerful scene where multiple dichotomies take place: a father scolding his daughter, a black man undermining black women’s agency, and the fear/anxiety about black women’s sexuality as a sign of weakness and lacking privilege. The wavering volume of Olivia’s voice signifies her quickly plummeting ability to voice her power. Olivia’s loss of words amplifies Rowans’ own authority, embodied in his voice when he adamantly declares “I am the hell and the high water!” No soundtrack can save her here.
However, Rowan does have human moments, reaching out to his estranged daughter Olivia with wine and music, specifically Stevie Wonder. Her record collection is filled with Stevie Wonder. It is important that she has a record collection instead of a collection of CDs or playlist. Not only does this detail speak to the trope of “vintage” that runs through the show but also gives credence to how Olivia establishes her power. Her major moments are annotated by Stevie Wonder: when her name is leaked “Higher Ground” plays in the background. When she is kidnapped at the end of the episode for past December’s Winter finale “Don’t You Worry Bout a Thing” takes center stage. Again, the sounds of a man, her father in this instance, are the soundtrack to her work. The choice of music subverts the gender balance of power. Through male artists, the show gives Olivia her authority.
The most prominent sonic signifier is the instrumental accompaniment from the artist The Album Leaf titled “The Light.” Also known as Olivia and President Fitz’s “song,” the track plays each time the two characters interact and share intimacy (physical and otherwise).
Notes from what sounds like an electric piano playing a scale are short and sweet to the ear. The track lends its innocence and vulnerability to Olivia and Fitz’s affair and offers a possibility that their love for each other can be read as star-crossed instead of in bad taste.
Scott Poulson-Bryant offered an intriguing read on his Facebook page on “The Light” as an allusion to the Civil Rights’ theme song “We Shall Overcome.” This reading of “The Light” as the context to Olivia and Fitz’s relationship makes room to complicate how Olivia’s agency as a black woman is historically and politically bound to women before–she alludes to her similarity to Sally Hemmings in one episode. Olivia’s Sally Hemmings reference uses Hemmings as the genesis point for understanding the complexity of Olivia’s sexual encounters as well as how to navigate black women’s sexual agency – and pleasure – in popular spaces. Sally Hemmings’ relationship to President Thomas Jefferson lends historical credence to Olivia and Fitz’s Scandal but also signifies the gray area of historical memory, cultural expectations, and consent as a form of power for African American women. “The Light” instrumental is not only a sonic accompaniment of Olivia as she relates to Fitz but her own struggles to recognize and balance her public and personal agency.
The soundtrack of Scandal gives a voice to not just Olivia’s authority in a place where race and power are intertwined but also a voice on national television to how whiteness and political power operate. Scandal’s controversial protagonist/anti-hero Olivia Pope is often central to recent discussions of race, gender, and popular culture. But the soundtrack to the show asks viewers to not just watch closely but also listen closely. Tune out and you might miss something.
—
Regina Bradley recently completed her PhD at Florida State University in African American Literature. Her dissertation is titled “Race to Post: White Hegemonic Capitalism and Black Empowerment in 21st Century Black Popular Culture and Literature.” She is a regular writer for Sounding Out!
—
REWIND!…If you liked this post, you may also dig:
SO! Reads: Shana Redmond’s Anthem: Social Movements and the Sound of Solidarity in the African Diaspora—Ashon Crawley
Freedom Back: Sounding Black Feminist History, Courtesy the Artists–Tavia Nyong’o
They Do Not All Sound Alike: Sampling Kathleen Cleaver, Assata Shakur, and Angela Davis—Tara Betts





















Recent Comments