Continuous speech recognition from ECoG

Dominic Heger, Christian Herff, Adriana De Pesters, Dominic Telaar, Peter Brunner, Gerwin Schalk, Tanja Schultz

Research output: Contribution to journalConference articlepeer-review

4 Scopus citations

Abstract

Continuous speech production is a highly complex process involving many parts of the human brain. To date, no fundamental representation that allows for decoding of continuous speech from neural signals has been presented. Here we show that techniques from automatic speech recognition can be applied to decode a textual representation of spoken words from neural signals. We model phones as the fundamental unit of the speech process in invasively measured brain activity (intracranial electrocorticographic (ECoG)) recordings. These phone models give insights into timings and locations of neural processes associated with the continuous production of speech and can be used in a speech recognizer to decode the neural data into their textual representations. When restricting the dictionary to small subsets, Word Error Rates as low as 25% can be achieved. As the brain activity data sets are fairly small, alternative approaches to Gaussian models are investigated by relying on robust, regularized discriminative models.

Original languageEnglish
Pages (from-to)1131-1135
Number of pages5
JournalProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
Volume2015-January
StatePublished - 2015
Event16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 - Dresden, Germany
Duration: Sep 6 2015Sep 10 2015

Keywords

  • Brain-computer interface
  • ECoG
  • Electrocorticography
  • Speech recognition

Fingerprint

Dive into the research topics of 'Continuous speech recognition from ECoG'. Together they form a unique fingerprint.

Cite this