MuLD
收藏MuLD: The Multitask Long Document Benchmark
数据集概述
MuLD(Multitask Long Document Benchmark)是一个包含6个自然语言处理任务的数据集,其中输入文本至少包含10,000个单词。该基准涵盖了多种任务类型,包括翻译、摘要、问答和分类。此外,输出长度从单个单词的分类标签到比输入文本更长的输出不等。
数据集加载
可以使用Huggingface Datasets库加载数据集: python import datasets ds = datasets.load_dataset("ghomasHudson/muld", "NarrativeQA") ds = datasets.load_dataset("ghomasHudson/muld", "HotpotQA") ds = datasets.load_dataset("ghomasHudson/muld", "Character Archetype Classification") ds = datasets.load_dataset("ghomasHudson/muld", "OpenSubtitles") ds = datasets.load_dataset("ghomasHudson/muld", "AO3 Style Change Detection") ds = datasets.load_dataset("ghomasHudson/muld", "VLSP")
或者通过克隆仓库加载: python import datasets ds = datasets.load_dataset("./muld.py", "NarrativeQA") ...
手动下载
如果需要手动下载数据文件,可以访问以下链接:
- NarrativeQA
- HotpotQA
- Character Archetype Classification
- OpenSubtitles
- AO3 Style Change Detection
- VLSP
引用
如果使用该基准,请引用以下论文:
@InProceedings{hudson-almoubayed:2022:LREC, author = {Hudson, George and Al Moubayed, Noura}, title = {MuLD: The Multitask Long Document Benchmark}, booktitle = {Proceedings of the Language Resources and Evaluation Conference}, month = {June}, year = {2022}, address = {Marseille, France}, publisher = {European Language Resources Association}, pages = {3675--3685}, url = {https://aclanthology.org/2022.lrec-1.392} }
数据集元数据
以下表格是该数据集被搜索引擎索引所需的元数据: <div itemscope itemtype="http://schema.org/Dataset"> <table> <tr> <th>属性</th> <th>值</th> </tr> <tr> <td>名称</td> <td><code itemprop="name">MuLD</code></td> </tr> <tr> <td>别名</td> <td><code itemprop="alternateName">Multitask Long Document Benchmark</code></td> </tr> <tr> <td>URL</td> <td><code itemprop="url">https://github.com/ghomasHudson/muld</code></td> </tr> <tr> <td>描述</td> <td><code itemprop="description">MuLD (Multitask Long Document Benchmark) is a set of 6 NLP tasks where the inputs consist of at least 10,000 words. The benchmark covers a wide variety of task types including translation, summarization, question answering, and classification. Additionally there is a range of output lengths from a single word classification label all the way up to an output longer than the input text.</code></td> </tr> <tr> <td>引用</td> <td><code itemprop="citation">https://arxiv.org/abs/2202.07362</code></td> </tr> <tr> <td>创建者</td> <td> <div itemscope itemtype="http://schema.org/Person" itemprop="creator"> <table> <tr> <th>属性</th> <th>值</th> </tr> <tr> <td>名称</td> <td><code itemprop="name">Thomas Hudson</code></td> </tr> <tr> <td>sameAs</td> <td><code itemprop="sameAs">https://orcid.org/0000-0003-3562-3593</code></td> </tr> </table> </div> </td> </tr> </table> </div>




