TY - JOUR
T1 - Identifying the attended speaker using electrocorticographic (ECoG) signals
AU - Dijkstra, K. V.
AU - Brunner, P.
AU - Gunduz, A.
AU - Coon, W.
AU - Ritaccio, A. L.
AU - Farquhar, J.
AU - Schalk, G.
N1 - Funding Information:
This work was supported by the NIH (EB006356 (GS), EB00856 (GS), and EB018783 (GS)), the US Army Research Office W911NF-07-1-0415 (GS), W911NF-08-1-0216 (GS) and W911NF-14-1-0440 (GS)), and Fondazione Neurone.
Funding Information:
This work was supported by the NIH (EB006356 (GS), EB00856 (GS), and EB018783 (GS)), the US Army Research Office W911NF-07-1-0415 (GS), W911NF-08-1-0216 (GS) and W911NF-14-1-0440 (GS)), and Fondazione Neurone. The authors acknowledge Marcia Sanders for her invaluable assistance in editing the manuscript.
Publisher Copyright:
© 2015, © 2015 Taylor & Francis.
PY - 2015/10/2
Y1 - 2015/10/2
N2 - People affected by severe neuro-degenerative diseases (e.g., late-stage amyotrophic lateral sclerosis (ALS) or locked-in syndrome) eventually lose all muscular control. Thus, they cannot use traditional assistive communication devices that depend on muscle control, or brain-computer interfaces (BCIs) that depend on the ability to control gaze. While auditory and tactile BCIs can provide communication to such individuals, their use typically entails an artificial mapping between the stimulus and the communication intent. This makes these BCIs diffcult to learn and use. In this study, we investigated the use of selective auditory attention to natural speech as an avenue for BCI communication. In this approach, the user communicates by directing his/her attention to one of two simultaneously presented speakers. We used electrocorticographic (ECoG) signals in the gamma band (70–170 Hz) to infer the identity of the attended speaker, thereby removing the need to learn such an artificial mapping. Our results from 12 human subjects show that a single cortical location over superior temporal gyrus or premotor cortex is typically sufficient to identify the attended speaker within 10 s and with 77% accuracy (50% accuracy due to chance). These results lay the groundwork for future studies that may determine the real-time performance of BCIs based on selective auditory attention to speech.
AB - People affected by severe neuro-degenerative diseases (e.g., late-stage amyotrophic lateral sclerosis (ALS) or locked-in syndrome) eventually lose all muscular control. Thus, they cannot use traditional assistive communication devices that depend on muscle control, or brain-computer interfaces (BCIs) that depend on the ability to control gaze. While auditory and tactile BCIs can provide communication to such individuals, their use typically entails an artificial mapping between the stimulus and the communication intent. This makes these BCIs diffcult to learn and use. In this study, we investigated the use of selective auditory attention to natural speech as an avenue for BCI communication. In this approach, the user communicates by directing his/her attention to one of two simultaneously presented speakers. We used electrocorticographic (ECoG) signals in the gamma band (70–170 Hz) to infer the identity of the attended speaker, thereby removing the need to learn such an artificial mapping. Our results from 12 human subjects show that a single cortical location over superior temporal gyrus or premotor cortex is typically sufficient to identify the attended speaker within 10 s and with 77% accuracy (50% accuracy due to chance). These results lay the groundwork for future studies that may determine the real-time performance of BCIs based on selective auditory attention to speech.
KW - auditory attention
KW - brain-computer interface (BCI)
KW - cocktail party
KW - electrocorticography (ECoG)
UR - http://www.scopus.com/inward/record.url?scp=85006451037&partnerID=8YFLogxK
U2 - 10.1080/2326263X.2015.1063363
DO - 10.1080/2326263X.2015.1063363
M3 - Article
AN - SCOPUS:85006451037
SN - 2326-263X
VL - 2
SP - 161
EP - 173
JO - Brain-Computer Interfaces
JF - Brain-Computer Interfaces
IS - 4
ER -