RickBrannan/categorize_bib_lang_grammar
Source: Hugging Face. Updated 2024-11-22; indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/RickBrannan/categorize_bib_lang_grammar
Description:
The dataset contains over 2,700 sentences, each categorized as `0` (`NOT-GRAMMAR`) or `1` (`GRAMMAR`). The categorizations are human-curated. Most sentences come from the unfoldingWord Greek Grammar and the unfoldingWord Hebrew Grammar, with a smaller portion from SIL Open Translators Notes. The dataset is intended for locating sentences that use grammatical terminology in resources such as SIL Open Translators Notes, and it is described as particularly suitable for models like DistilBERT.
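The `0`/`1` label scheme described above can be illustrated with a minimal Python sketch. The field names `text` and `label` and the three sample rows are assumptions made up for illustration; they are not taken from the dataset, whose actual column names may differ.

```python
from collections import Counter

# Label scheme as stated in the dataset description.
LABEL_NAMES = {0: "NOT-GRAMMAR", 1: "GRAMMAR"}

# Hypothetical rows for illustration only; not taken from the dataset.
# The field names "text" and "label" are assumptions.
rows = [
    {"text": "The verb here is an aorist participle.", "label": 1},
    {"text": "Paul wrote this letter from prison.", "label": 0},
    {"text": "This noun is in the construct state.", "label": 1},
]


def label_counts(records):
    """Count records per human-readable label name."""
    return Counter(LABEL_NAMES[r["label"]] for r in records)


def grammar_sentences(records):
    """Return the text of every record labeled 1 (GRAMMAR)."""
    return [r["text"] for r in records if r["label"] == 1]


counts = label_counts(rows)
grammar = grammar_sentences(rows)
print(counts)
print(grammar)
```

Checking the label balance this way is a common first step before fine-tuning a binary classifier, since a skewed split between `GRAMMAR` and `NOT-GRAMMAR` would affect how results are evaluated.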
Provider:
RickBrannan



