muhammadravi251001/translated-indo-nli
收藏数据集概述
标签
- 标签:
- translated-indonli
- 许可证:
- bigscience-openrail-m
- 数据集:
- indonli
数据来源
- 数据文件:
translate_train.tar.gz - 来源链接:
https://github.com/ir-nlp-csui/indonli/tree/main/data
使用方法
-
下载数据: python !wget https://huggingface.co/datasets/muhammadravi251001/translated-indo-nli/raw/main/dev.jsonl !wget https://huggingface.co/datasets/muhammadravi251001/translated-indo-nli/resolve/main/train.jsonl
-
加载数据: python import pandas as pd data_train_translated_indonli = pd.read_json(path_or_buf=train.jsonl, lines=True) data_dev_translated_indonli = pd.read_json(path_or_buf=dev.jsonl, lines=True)
参考文献
-
数据集来源: IndoNLI
-
参考文献:
@inproceedings{indonli, title = "IndoNLI: A Natural Language Inference Dataset for Indonesian", author = "Mahendra, Rahmad and Aji, Alham Fikri and Louvan, Samuel and Rahman, Fahrurrozi and Vania, Clara", booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing", month = nov, year = "2021", publisher = "Association for Computational Linguistics", }



