five

ParisNeo/Word_in_Sentence_Database

收藏
Hugging Face2023-11-19 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/ParisNeo/Word_in_Sentence_Database
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: apache-2.0 task_categories: - table-question-answering language: - en pretty_name: Word in Sentence database --- # WIS database This database contains a question answer list about text This database was built using my this workflow: 1- load a raw text file 2- split into paragraphs 3- split paragraphs into sentences 4- for each word, ask question about its position and answer with the position, then ask about the word length and answer with the actual length of the word 5- ask a question about the number of words in the sentence and answer it 6- build a json database using this. To do this, I kindly got the concent of Keith Curtis to use his website content as fuel to this algorithm. The website can be found here: https://keithcu.com/wordpress/?page_id=599 Best regards.
提供机构:
ParisNeo
原始信息汇总

WIS数据库

概述

WIS数据库包含关于文本的问题答案列表。

构建流程

  1. 加载原始文本文件。
  2. 将文本分割成段落。
  3. 将段落分割成句子。
  4. 对每个单词提问其位置并回答位置,提问单词长度并回答实际长度。
  5. 提问句子中单词数量并回答。
  6. 使用上述信息构建JSON数据库。

许可

Apache 2.0

任务类别

  • 表格问答

语言

  • 英语

名称

Word in Sentence database

5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作