AI Headphones: Hear One Voice in a Crowd

May 30, 2024

A system called “Target Speech Hearing” suppresses background noise and plays back only the voice of one chosen speaker in real time, even as the listener moves around in noisy surroundings. A team of researchers presented the technology at the ACM CHI Conference on Human Factors in Computing Systems in Honolulu on May 14. The code for the proof-of-concept device is available for others to build on, but the system itself is not commercially available.

Shyam Gollakota, a professor in the University of Washington's Paul G. Allen School of Computer Science & Engineering and senior author of the study, said the project applies artificial intelligence to auditory perception rather than to text. “We tend to think of AI now as web-based chatbots that answer questions. But in this project, we develop AI to modify the auditory perception of anyone wearing headphones, given their preferences. With our devices you can now hear a single speaker clearly even if you are in a noisy environment with lots of other people talking.”

To use the system, a person wearing standard headphones fitted with microphones presses a button while facing the speaker they want to hear. Sound from that speaker's voice should then reach the microphones on both sides of the headset at about the same time, within a 16-degree margin of error. The headphones send that signal to an on-board embedded computer, where the team's machine learning software learns the vocal patterns of the chosen speaker. The system then keeps playing the enrolled voice back to the listener, even as the listener and the speaker move around. Its ability to focus on that voice improves as the speaker keeps talking, which gives the system more training data.
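The facing condition described above can be illustrated with a little geometry. The Python sketch below is not the researchers' code; their system relies on machine learning software, while this example only shows why a voice from straight ahead reaches both microphones at nearly the same time. It uses a simple far-field model, and the microphone spacing, the function names, and the idea of comparing a measured delay against a threshold are all assumptions made for illustration. Only the 16-degree margin comes from the article.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # approximate speed of sound in air at room temperature
MIC_SPACING_M = 0.18        # ASSUMED distance between left and right microphones
TOLERANCE_DEG = 16.0        # margin of error reported in the article


def arrival_delay_s(angle_deg: float, spacing_m: float = MIC_SPACING_M) -> float:
    """Delay between the two microphones for a distant source.

    angle_deg is measured from the direction the listener is facing,
    so 0 means the speaker is straight ahead and the delay is zero.
    """
    return spacing_m * math.sin(math.radians(angle_deg)) / SPEED_OF_SOUND_M_S


def is_facing_speaker(
    measured_delay_s: float,
    spacing_m: float = MIC_SPACING_M,
    tolerance_deg: float = TOLERANCE_DEG,
) -> bool:
    """Return True if the measured delay fits a source within the tolerance.

    Hypothetical helper: it treats the angular margin as a limit on how far
    apart in time the voice may arrive at the two microphones.
    """
    max_delay_s = arrival_delay_s(tolerance_deg, spacing_m)
    return abs(measured_delay_s) <= max_delay_s


if __name__ == "__main__":
    for angle in (0, 10, 30):
        delay = arrival_delay_s(angle)
        print(f"{angle:>2} degrees off-axis -> within margin: {is_facing_speaker(delay)}")
```

Under these assumptions, a speaker 10 degrees off-axis falls inside the margin and a speaker 30 degrees off-axis does not. A real headset would also have to cope with echoes, head shadowing, and several people talking at once, which this model ignores.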

By homing in on one speaker's voice and filtering out other sounds, the “Target Speech Hearing” system could help people who need to follow a conversation in difficult acoustic environments. The technology has the potential to change how people communicate in noisy settings by making the voice they care about easier to hear and understand.

As researchers continue to refine the technology, it points toward communication tools that use artificial intelligence to enhance human perception. It also shows AI being applied beyond familiar domains such as chatbots, in personalized and adaptive devices that respond to an individual's preferences and needs.