FLAMENCO Learning Disabilities Dataset
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/10492891
下载链接
链接失效反馈官方服务:
资源简介:
In the context of the FLAMENCO project, we have released a dataset designed for predicting potential deficiencies in children's communication skills, tailored for Federated Learning. This dataset specifically focuses on addressing two prevalent deficiencies in communication skill development in children: autism and intellectual disability. For each deficiency, two CSV files are provided—one for training machine learning models and another for testing them. Each entry in these CSV files includes the following details:
- case_id: An anonymized identifier used to distinguish cases.
- client_id: Identifies the client to which the case belongs, useful for dataset splitting in federated settings.
- A series of scores measuring specific communication skills: These scores, such as Verbalization, Voicing, Syntax, etc., are derived from the child's performance in specialised gamified exercises and have been computed with the assistance of expert clinicians.
target: Can be -1 (no clinician's diagnosis available for the case), 0 (no diagnosed deficiency in the case), 1 (indicates a positive diagnosis of communication deficiency by a clinician)
TThis project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreements No. 957406 (TERMINET).
在FLAMENCO项目框架下,我们发布了一款专为联邦学习(Federated Learning)设计的数据集,用于预测儿童沟通技能潜在缺陷。该数据集专门针对儿童沟通技能发展中的两类常见缺陷:自闭症与智力障碍。针对每类缺陷,均提供两份CSV文件:一份用于机器学习模型训练,另一份用于模型测试。上述CSV文件中的每条数据均包含以下信息:
- case_id:用于区分不同样本的匿名化标识符
- client_id:标识该样本所属的客户端,在联邦学习场景下可用于数据集拆分
- 多项沟通技能专项评分:此类评分(如Verbalization、Voicing、Syntax等)源自儿童在专业化游戏化训练任务中的表现,并经临床专家协助计算得出
- target:取值包括-1(该样本无临床医生诊断结果)、0(该样本未被诊断存在沟通缺陷)、1(该样本经临床医生确诊存在沟通缺陷)
本项目获欧盟地平线2020研究与创新计划资助,资助协议编号为957406(TERMINET)。
创建时间:
2024-01-11



