
Pedagogical Roles of Natural Language Processing Documents

Source: Figshare. Updated: 2017-07-14. Indexed: 2026-04-08.
Download link:
https://figshare.com/articles/dataset/Pedagogical_Roles_of_Natural_Language_Processing_Documents/5202424/1
Resource description:
Description

To allow a computational exploration of the learning utility ("pedagogical value") between a learner and a document, we introduce the notion of "pedagogical roles" of documents as an intermediary component. This dataset is a novel annotated corpus of the pedagogical roles of documents from an expanded ACL Anthology corpus.

The current version includes the following pedagogical roles:
- Survey: Is this document a broad survey? A broad survey examines or compares across a broad concept.
- Tutorial: Is this document a tutorial? Tutorials describe a coherent process about how to use tools or understand a concept, and teach by example.
- Resource: Does this document describe the authors' implementation of a system, corpus, or other resource that has been distributed (e.g. public data sets or tools that have been released under an open-source license or are commercially available)?
- Reference Work: Is this document a collection of authoritative facts intended for others to refer to? Reports of novel, experimental results are not authoritative facts; the statement "grass is green" is. Reference Works describe different subtopics within a concept.
- Empirical Results: Does this document describe results of the authors' experiments?
- Software Manual: Is this document a manual describing how to use different components of a piece of software?
- Other: Other role (this includes theoretical papers, papers that present a rebuttal for a claim, thought experiments, etc.)

Files

- annotations_raw_average.tsv: Averaged raw annotations. Each pedagogical role score is an average over all annotations of the role for the document.
- annotations_bin.tsv: Binarized version of the annotations. A document belongs to a pedagogical role if a majority of the annotators agree.
- pedagogical_roles.bib: Metadata of documents in the annotated corpus. The documents with a source of "web-supplementary" are supplementary documents that were annotated internally.

Papers

If you use this dataset, please cite the following paper. We present annotation guidelines, analysis, and initial baseline classification results.

@InProceedings{ShengEtAl2017,
  author = {Emily Sheng and Prem Natarajan and Jonathan Gordon and Gully Burns},
  year = {2017},
  title = {An Investigation into the Pedagogical Features of Documents},
  booktitle = {Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications}
}

Associated work that makes use of this corpus:

@InProceedings{GordonEtAl2017,
  author = {Jonathan Gordon and Stephen Aguilar and Emily Sheng and Gully Burns},
  year = {2017},
  title = {Structured Generation of Technical Reading Lists},
  booktitle = {Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications}
}

Acknowledgements

This research is based upon work supported in part by the Office of the Director of National Intelligence (ODNI), Intelligence Advanced Research Projects Activity (IARPA), via Air Force Research Laboratory (AFRL). The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either expressed or implied, of ODNI, IARPA, AFRL, or the U.S. Government. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright annotation thereon.
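The relationship between the two annotation files described under Files can be illustrated with a small sketch. It is not the authors' processing code: it assumes each annotator gives a 0/1 vote per role (the raw annotation scale is not stated here), treats "majority" as strictly more than half, and uses hypothetical function names and in-memory data rather than the actual TSV layout.

```python
# Hypothetical sketch: averaging per-role votes and binarizing by majority.
# Assumes binary (0/1) votes per annotator; the real raw scale may differ.

ROLES = [
    "Survey", "Tutorial", "Resource", "Reference Work",
    "Empirical Results", "Software Manual", "Other",
]


def average_scores(annotations):
    """Average each role's votes over all annotators of one document.

    `annotations` is a non-empty list of dicts mapping role name -> 0 or 1,
    one dict per annotator (an assumed in-memory format).
    """
    n = len(annotations)
    return {role: sum(a[role] for a in annotations) / n for role in ROLES}


def binarize(averages):
    """Assign a role when strictly more than half of the annotators agree.

    How ties are resolved in the dataset is not stated; a tie counts as
    'no majority' here, which is an assumption.
    """
    return {role: int(score > 0.5) for role, score in averages.items()}


def make_vote(*chosen):
    """Build one annotator's vote dict with the chosen roles set to 1."""
    return {role: int(role in chosen) for role in ROLES}


if __name__ == "__main__":
    votes = [
        make_vote("Survey", "Empirical Results"),
        make_vote("Survey"),
        make_vote("Tutorial"),
    ]
    averages = average_scores(votes)
    print(averages)
    print(binarize(averages))
```

Under these assumptions, a role chosen by two of three annotators has an average of about 0.67 and is kept after binarization, while a role chosen by only one annotator is dropped.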
Provided by:
Pnataraj@Isi.Edu; Burns@Isi.Edu
Created:
2017-07-14