Find ASIC Vendors

Revolutionizing Keyword-Spotting with Innovative Architecture

May 22, 2024

Get a Price Quote

The latest breakthrough in architecture for audio feature extraction has been unveiled, showcasing a cutting-edge approach that leverages time-domain signal processing on oscillators locked by injection. This innovative technology is specifically designed for devices powered by energy harvesters, operating efficiently even with power supply levels below 0.5V. The research paper titled “0.4V 988nW Time-Domain Audio Feature Extraction for Keyword Spotting Using Injection-Locked Oscillators” highlights the remarkable achievement of achieving accurate speech recognition while consuming less than one microwatt of power.

The paper introduces the concept of injection-locked, oscillator-based time-domain audio feature extraction (TD-FEx), a pioneering technique that enables keyword spotting at voltages as low as 0.4V. Impressively, the system achieved an outstanding accuracy rate of 91 percent when tested on a set of 10 words. Unlike traditional voltage-based coding, TD-FEx represents information through the time delay of signals from two clocks. This approach not only caters to advanced nodes but also offers digital-like implementation, operates at low supply voltages, and boasts superior noise immunity compared to existing systems.

While digital audio feature extraction (FEx) units have been prevalent, the paper sheds light on the advantages of analog-based FEx units utilizing multi-channel Gm-C bandpass filters. These analog units exhibit ten times the power efficiency of their digital counterparts within a similar silicon footprint. However, the research points out that analog FEx circuits have faced limitations in keyword spotting applications, struggling to recognize more than four keywords effectively. Additionally, they are plagued by challenges such as large physical footprint, technology migration complexities, and restricted dynamic range at low supply voltages, despite speech signals inherently possessing a wide dynamic range.

Despite the advancements in analog-based FEx units, the injection-locked, oscillator-based TD-FEx system developed by CEA-Leti has emerged as a game-changer in the field of audio processing. By demonstrating precise speech recognition capabilities while consuming less than 1 µW of power, this technology sets a new benchmark for efficiency and accuracy in keyword spotting applications. The successful implementation of TD-FEx not only paves the way for enhanced performance in energy-constrained devices but also opens up possibilities for further innovations in low-power audio processing systems.

In conclusion, the integration of time-domain signal processing with injection-locked oscillators represents a significant leap forward in audio feature extraction technology. With its ability to operate at ultra-low voltages and deliver exceptional accuracy in keyword spotting tasks, the TD-FEx system showcases the potential for revolutionizing energy-efficient audio processing in various applications. As researchers continue to explore the capabilities of this groundbreaking technology, the future of audio feature extraction looks promising, offering new avenues for enhancing speech recognition and audio processing in resource-constrained environments.