five

Past Written Texts Dataset

收藏
NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/record/2670060
下载链接
链接失效反馈
官方服务:
资源简介:
The dataset consists of features extracted from older adults’ text. The texts were written by the older person either in an electronic mean (eg. older e-mail), or in paper form and were transcribed by the project's clinical nurses. The texts were then translated to English using the MyMemory service (https://mymemory.translated.net/), and a series of features were generated that can be used for sentiment analysis. The list of fields of this dataset is presented below: - Part_id: The user ID, which should be a 4-digit number - Date: The recording date, which follows the “DD-MM-YY” format (eg. 14 September 2017, is formatted as 14-09-17) - Clinical_visit: As several clinical evaluations were performed to each older adult, this number shows for which clinical evaluation these measurements refer to - Transcript: If the text was written by the older adult (0) or was transcribed by a nurse (1) - Language: The original language of the text (0 = Greek) - Text_length, Number_of_sentences, Number_of_words, Number_of_words_per_sentence, Text_entropy: Statistical Measures - Desc_image_ENG_sentiment, Desc_event_sentiment, Prev_text_ENG_sentiment: Sentiment Analysis - Tf-XX: Term frequency – Inverse document frequency - Tf-pos-XX: Part of Speech analysis, using tf-idf methodology
创建时间:
2020-01-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作