UTEP Study Findings Could Help Make AI Voices More Natural-Sounding
New research from the University of Texas at El Paso suggests that the secret to humanizing AI voices isn't better pronunciation but phonetic reduction: the natural tendency to "mumble" when we are in a good mood. Led by Dr. Nigel Ward, the study found that humans articulate words less precisely when expressing positive feelings. By comparing neutral and positive recordings, the researchers found that English speakers were 33% more likely to "cut corners" in pronunciation when happy, while Spanish speakers showed a similar trend, with nearly 40% of positive speech classified as reduced.
The problem with current systems like Siri and Alexa is that they are engineered for maximum clarity. While functional, this makes them sound cold and robotic. Dr. Ward argues that to build trust and rapport, AI needs to sacrifice some precision for social warmth. This has real-world implications: a banking AI, for example, could sound strictly businesslike during transactions but shift to a more supportive, less articulated tone if it detects that a user is stressed. Likewise, "embodied AI" in self-driving cars or service robots could use these speech cues to communicate intent more naturally.
To move the industry forward, the team has released ReduEst, a tool for researchers to measure speech reduction, and will present their findings at Interspeech 2026. Ultimately, the research suggests that to make AI feel like a partner rather than a tool, we need to teach it how to mumble.