relbert/conceptnet_relational_similarity
收藏Hugging Face2023-08-01 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/relbert/conceptnet_relational_similarity
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- en
license:
- other
multilinguality:
- monolingual
size_categories:
- n<1K
pretty_name: ConceptNet with High Confidence
---
# Dataset Card for "relbert/conceptnet_relation_similarity"
## Dataset Description
- **Repository:** [RelBERT](https://github.com/asahi417/relbert)
- **Paper:** [https://home.ttic.edu/~kgimpel/commonsense.html](https://home.ttic.edu/~kgimpel/commonsense.html)
- **Dataset:** Relational similarity dataset based on the high-confidence subset of ConceptNet
### Dataset Summary
The selected subset of ConceptNet used in [this work](https://home.ttic.edu/~kgimpel/commonsense.html), which compiled
to fine-tune [RelBERT](https://github.com/asahi417/relbert) model.
We removed `NotCapableOf` and `NotDesires` to keep the positive relation only.
We consider the original test set as test set, dev1 as the training set, and dev2 as the validation set.
## Dataset Structure
### Data Instances
An example of `train` looks as follows.
```shell
{
"relation_type": "AtLocation",
"positives": [["fish", "water"], ["cloud", "sky"], ["child", "school"], ... ],
"negatives": [["pen", "write"], ["sex", "fun"], ["soccer", "sport"], ["fish", "school"], ... ]
}
```
### Data Splits
| train |validation| test|
|--------:|---------:|---------:|
| 28| 34 | 16|
### Citation Information
```
@InProceedings{P16-1137,
author = "Li, Xiang
and Taheri, Aynaz
and Tu, Lifu
and Gimpel, Kevin",
title = "Commonsense Knowledge Base Completion",
booktitle = "Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) ",
year = "2016",
publisher = "Association for Computational Linguistics",
pages = "1445--1455",
location = "Berlin, Germany",
doi = "10.18653/v1/P16-1137",
url = "http://aclweb.org/anthology/P16-1137"
}
```
提供机构:
relbert
原始信息汇总
数据集概述
数据集描述
- 名称: ConceptNet with High Confidence
- 类型: 关系相似性数据集
- 基础: 基于ConceptNet的高置信度子集
- 用途: 用于微调RelBERT模型
数据集结构
数据实例
- 示例: shell { "relation_type": "AtLocation", "positives": [["fish", "water"], ["cloud", "sky"], ["child", "school"], ... ], "negatives": [["pen", "write"], ["sex", "fun"], ["soccer", "sport"], ["fish", "school"], ... ] }
数据分割
| train | validation | test |
|---|---|---|
| 28 | 34 | 16 |
引用信息
@InProceedings{P16-1137, author = "Li, Xiang and Taheri, Aynaz and Tu, Lifu and Gimpel, Kevin", title = "Commonsense Knowledge Base Completion", booktitle = "Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)", year = "2016", publisher = "Association for Computational Linguistics", pages = "1445--1455", location = "Berlin, Germany", doi = "10.18653/v1/P16-1137", url = "http://aclweb.org/anthology/P16-1137" }



