ParisNeo/Word_in_Sentence_Database
收藏Hugging Face2023-11-19 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/ParisNeo/Word_in_Sentence_Database
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
task_categories:
- table-question-answering
language:
- en
pretty_name: Word in Sentence database
---
# WIS database
This database contains a question answer list about text
This database was built using my this workflow:
1- load a raw text file
2- split into paragraphs
3- split paragraphs into sentences
4- for each word, ask question about its position and answer with the position, then ask about the word length and answer with the actual length of the word
5- ask a question about the number of words in the sentence and answer it
6- build a json database using this.
To do this, I kindly got the concent of Keith Curtis to use his website content as fuel to this algorithm.
The website can be found here:
https://keithcu.com/wordpress/?page_id=599
Best regards.
提供机构:
ParisNeo
原始信息汇总
WIS数据库
概述
WIS数据库包含关于文本的问题答案列表。
构建流程
- 加载原始文本文件。
- 将文本分割成段落。
- 将段落分割成句子。
- 对每个单词提问其位置并回答位置,提问单词长度并回答实际长度。
- 提问句子中单词数量并回答。
- 使用上述信息构建JSON数据库。
许可
Apache 2.0
任务类别
- 表格问答
语言
- 英语
名称
Word in Sentence database



