muhammadravi251001/augmented-indo-nli
收藏数据集概述
标签
- augmented-indonli
许可证
- bigscience-openrail-m
包含的数据集
- indonli
数据文件
translate_train.tar.gztrain.jsonldev.jsonl
数据来源
https://github.com/ir-nlp-csui/indonli/tree/main/data
使用方法
python !wget https://huggingface.co/datasets/muhammadravi251001/augmented-indo-nli/raw/main/dev_augmented.jsonl !wget https://huggingface.co/datasets/muhammadravi251001/augmented-indo-nli/resolve/main/train_augmented.jsonl
import pandas as pd data_train_augmented_indonli = pd.read_json(path_or_buf=train.jsonl, lines=True) data_dev_augmented_indonli = pd.read_json(path_or_buf=dev.jsonl, lines=True)
参考文献
@inproceedings{indonli, title = "IndoNLI: A Natural Language Inference Dataset for Indonesian", author = "Mahendra, Rahmad and Aji, Alham Fikri and Louvan, Samuel and Rahman, Fahrurrozi and Vania, Clara", booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing", month = nov, year = "2021", publisher = "Association for Computational Linguistics", }



