TY - GEN
T1 - Decoding syllables from human fMRI activity
AU - Otaka, Yohei
AU - Osu, Rieko
AU - Kawato, Mitsuo
AU - Liu, Meigen
AU - Murata, Satoshi
AU - Kamitani, Yukiyasu
PY - 2008
Y1 - 2008
AB - Language plays an essential role in human cognition and social communication; a technology for reading out speech from non-invasively measured brain activity would therefore have both scientific and clinical merit. Here, we examined whether individual syllables can be decoded from human fMRI activity. Four healthy subjects participated in the experiments. In a decoding session, the subjects repeatedly uttered a syllable presented on a screen, at 3 Hz for a 12-s block. Nine different syllables were presented in a single experimental run, which was repeated eight times. We also identified the voxels showing articulation-related activity during utterance of all the syllables of Japanese phonology in a conventional task-rest sequence. We then used either all of these voxels, or the subset of them lying within anatomically specified ROIs (M1, cerebellum), during the decoding sessions as data samples for training and testing a decoder (a linear support vector machine) that classifies brain activity patterns for different syllables. To evaluate decoding performance, we performed cross-validation, testing the samples from one decoding session with a decoder trained on the samples from the remaining sessions. Syllables were correctly decoded at above-chance levels. These results suggest the possibility of using non-invasively measured brain activity to read out the intended speech of patients with impaired speech motor control.
UR - http://www.scopus.com/inward/record.url?scp=54049120292&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=54049120292&partnerID=8YFLogxK
DO - 10.1007/978-3-540-69162-4_102
M3 - Conference contribution
AN - SCOPUS:54049120292
SN - 3540691596
SN - 9783540691594
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 979
EP - 986
BT - Neural Information Processing - 14th International Conference, ICONIP 2007, Revised Selected Papers
T2 - 14th International Conference on Neural Information Processing, ICONIP 2007
Y2 - 13 November 2007 through 16 November 2007
ER -