mediabiasgroup/DefExtra
Source: Hugging Face. Updated 2026-02-06; indexed 2026-02-07.
Download link:
https://hf-mirror.com/datasets/mediabiasgroup/DefExtra
Description:
DefExtra contains 268 definition records from 75 papers; each record has a term, definition, context, and type. Because the source excerpts are copyrighted and cannot be redistributed, we do not ship any text from the papers. Instead, we ship localization markers plus scripts that let users hydrate (reconstruct) the dataset from their own copies of the PDFs. The dataset is intended for tasks such as text classification, question answering, and text retrieval.
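The marker-based hydration idea can be illustrated with a small Python sketch. This is not the DefExtra script or its marker format: the `Marker` fields, `make_marker`, and `hydrate` are hypothetical names invented for illustration, and the sketch assumes the PDF has already been converted to plain text. It shows one way a marker could locate a span by character offsets and verify it with a hash, so that no copyrighted text has to be distributed.

```python
import hashlib
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Marker:
    """Hypothetical localization marker; the real DefExtra format may differ."""
    paper_id: str
    start: int   # character offset where the span begins
    end: int     # character offset just past the span
    sha256: str  # hash of the span, so the text itself is never shipped


def _digest(span: str) -> str:
    return hashlib.sha256(span.encode("utf-8")).hexdigest()


def make_marker(paper_id: str, text: str, start: int, end: int) -> Marker:
    """Build a marker from text the dataset author has access to."""
    return Marker(paper_id, start, end, _digest(text[start:end]))


def hydrate(marker: Marker, paper_text: str) -> Optional[str]:
    """Recover the span from the user's own text, or None if it does not match."""
    if not (0 <= marker.start < marker.end <= len(paper_text)):
        return None
    span = paper_text[marker.start:marker.end]
    # Reject the span if the user's extraction differs from the author's.
    return span if _digest(span) == marker.sha256 else None


if __name__ == "__main__":
    text = "Intro. Media bias is slanted coverage. End."
    marker = make_marker("p1", text, text.index("Media"), text.index(" End."))
    print(hydrate(marker, text))
```

In this sketch a mismatch returns `None` instead of a wrong span, which matters because different PDF extractors can shift character offsets.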
Provider:
mediabiasgroup



