EEG dataset for speech decoding
收藏OpenNeuro2025-04-08 更新2026-03-14 收录
下载链接:
https://openneuro.org/datasets/ds006104
下载链接
链接失效反馈官方服务:
资源简介:
EEG dataset for speech decoding
============================
Dataset Overview
---------------
This dataset contains EEG recordings from a phoneme discrimination task with TMS.
The data were collected during two related studies in 2019 and 2021.
Study 1 (2019, Session 01):
- 8 participants (P01-P08)
- Focus on CV and VC phoneme pairs
- 2 blocks: CV pairs and VC pairs
- TMS targeted to LipM1 (-56, -8, 46) and TongueM1 (-60, -10, 25)
Study 2 (2021, Session 02):
- 16 participants (S01-S16)
- Expanded to include single phonemes and phoneme triplets
- 4 blocks: single phonemes, CV pairs, real words, and pseudowords
- Additional TMS targets included Broca's area (BA 44: -51, 7, 23) and verbal memory region (BA 6: -46, 1, 41)
Task Description
---------------
Participants listened to speech sounds and identified stimuli with a button-press response.
The stimuli included:
1. Single phonemes - Consonants (/b/, /p/, /d/, /t/, /s/, /z/) and vowels (/i/, /E/, /A/, /u/, /oU/)
2. Phoneme pairs - CV and VC combinations of the phonemes
3. Phoneme triplets - Real and pseudowords constructed of CVC sequences
TMS Methodology
--------------
Detailed information about TMS parameters can be found in the sourcedata/tms_metadata/tms_parameters.json file.
TMS was applied using a Magstim Super Rapid Plus1 stimulator with a figure-of-eight 40 mm coil.
Stimulation was delivered at 110% of resting motor threshold as paired pulses with 50ms interpulse interval.
Detailed information about the methodology and results can be found in the associated publication:
Moreira et al. "An open-access EEG dataset for speech decoding: Exploring the role of articulation and coarticulation"
Directory Structure
------------------
The dataset follows BIDS convention with the following structure:
/sub-[subject]/ses-[session]/eeg/
Where subject is P01-P08 for Study 1 and S01-S16 for Study 2.
Session is 01 for Study 1 and 02 for Study 2.
Contact Information
------------------
For questions about this dataset, please contact Lindy Comstock at lbcomstock@ucla.edu
用于语音解码的脑电图(EEG)数据集
============================
数据集概览
---------------
本数据集包含结合经颅磁刺激(Transcranial Magnetic Stimulation, TMS)的音素辨别任务的脑电图记录数据。该数据集的数据采集自2019年与2021年的两项相关研究。
研究1(2019年,会话01):
- 8名受试者(编号P01-P08)
- 实验聚焦于辅音-元音(CV)与元音-辅音(VC)音素对
- 分为2个实验块:CV音素对块与VC音素对块
- 经颅磁刺激靶点为唇部初级运动皮层(LipM1)(-56, -8, 46)与舌部初级运动皮层(TongueM1)(-60, -10, 25)
研究2(2021年,会话02):
- 16名受试者(编号S01-S16)
- 实验范围拓展至单音素与音素三元组
- 分为4个实验块:单音素块、CV音素对块、真实词汇块与伪词汇块
- 新增经颅磁刺激靶点:布洛卡区(Broca's area,BA 44:-51, 7, 23)与言语记忆脑区(BA 6:-46, 1, 41)
任务描述
---------------
受试者聆听语音刺激,并通过按键反应完成刺激物识别任务。本次实验的刺激物包含以下三类:
1. 单音素:包括辅音(/b/、/p/、/d/、/t/、/s/、/z/)与元音(/i/、/E/、/A/、/u/、/oU/)
2. 音素对:由上述音素组合而成的CV与VC音素对
3. 音素三元组:由辅音-元音-辅音(CVC)序列构成的真实词汇与伪词汇
经颅磁刺激方法学
---------------
经颅磁刺激参数的详细信息可参见sourcedata/tms_metadata/tms_parameters.json文件。
本次实验采用Magstim Super Rapid Plus1型刺激器搭配8字形40mm线圈实施经颅磁刺激。刺激强度设置为静息运动阈值的110%,采用脉冲间隔为50ms的成对脉冲刺激模式。
本数据集的方法学细节与实验结果可参见相关学术出版物:Moreira等人发表的《用于语音解码的开放获取脑电图数据集:探究发音与协同发音的作用》("An open-access EEG dataset for speech decoding: Exploring the role of articulation and coarticulation")
数据集目录结构
---------------
本数据集遵循脑成像数据结构(Brain Imaging Data Structure, BIDS)规范,目录结构如下:
/sub-[subject]/ses-[session]/eeg/
其中,subject对于研究1为P01-P08,研究2为S01-S16;session对于研究1为01,研究2为02。
联系方式
---------------
若对本数据集有任何疑问,请联系Lindy Comstock,邮箱地址为lbcomstock@ucla.edu
创建时间:
2025-04-08
搜集汇总
数据集介绍

背景与挑战
背景概述
该数据集包含两个相关研究中的EEG记录,用于音素辨别任务,共24名参与者。数据集重点关注单音素、音素对和音素三元组的神经解码,并使用了TMS技术,遵循BIDS标准组织。
以上内容由遇见数据集搜集并总结生成



