five

ChuGyouk/Arguinas

收藏
Hugging Face2026-04-21 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/ChuGyouk/Arguinas
下载链接
链接失效反馈
官方服务:
资源简介:
--- size_categories: - 1K<n<10K language: - en configs: - config_name: default data_files: - split: train path: data/train-* - split: test path: data/test-* dataset_info: features: - name: messages list: - name: content dtype: string - name: role dtype: string - name: title dtype: string - name: background dtype: string - name: argument dtype: string - name: fallacy_info struct: - name: rationale dtype: string - name: type dtype: string - name: sections struct: - name: check_faithfulness struct: - name: faithfulness dtype: bool - name: feedback_faithfulness dtype: string - name: check_validity struct: - name: final_formalized_conclusion dtype: string - name: necessary_formalized_premises dtype: string - name: valid_formalized_premises list: string - name: validity dtype: string - name: z3_program dtype: string - name: reconstruction struct: - name: conclusion dtype: string - name: definition dtype: string - name: formalized_conclusion dtype: string - name: formalized_intermediate_conclusions dtype: string - name: formalized_premises dtype: string - name: intermediate_conclusions dtype: string - name: premises dtype: string - name: streamlined struct: - name: valid_conclusion dtype: string - name: valid_premises list: string - name: explicit_premises list: string - name: implicit_premises list: string splits: - name: train num_bytes: 61877399 num_examples: 2934 - name: test num_bytes: 4997520 num_examples: 241 download_size: 24779661 dataset_size: 66874919 --- # Argument Reconstruction as Supervision for Critical Thinking in LLMs [![arXiv](https://img.shields.io/badge/Paper-arXiv:2603.17432-Green)](https://arxiv.org/abs/2603.17432) [![BibTex](https://img.shields.io/badge/Paper-BibTex-yellow)](#bibtex) **Arguinas** (**Argu**ment reconstruct**i**o**n**) dat**a**set as presented in our paper: [**Argument Reconstruction as Supervision for Critical Thinking in LLMs**](https://arxiv.org/abs/2603.17432) by Hyun Ryu<sup>\*1,2</sup>, Gyouk Chu<sup>\*2</sup>, Gregor Betz<sup>3</sup>, Eunho Yang<sup>2</sup>, Carolyn Rosé<sup>†1</sup>, and Sean Welleck<sup>†1</sup> <sup>1</sup>Language Technologies Institute, Carnegie Mellon University &nbsp;&nbsp; <sup>2</sup>Graduate School of AI, Korea Advanced Institute of Science & Technology &nbsp;&nbsp; <sup>3</sup>Department of Philosophy, Karlsruhe Institute of Technology &nbsp;&nbsp; <sup>\*</sup>Equal Contribution &nbsp;&nbsp; <sup>†</sup>Equal Advising <p align="center"> <img src="gaar_engine_github.png" alt="GAAR" width="80%" /> </p> --- ## 🔔 Updates - [✔] (25.04.21) The Arguinas dataset are out. - [✔] (26.03.18) Paper is out! [here](https://arxiv.org/abs/2603.17432) --- ## 🏋️ Data Our train and test Arguinas datasets are open! **See [`data/README.md`](https://github.com/GyoukChu/Arguinas/blob/main/data/README.md) for the full data format (top-level columns, `fallacy_info`, `sections`, etc.).** The source dataset of arguments are [DebateLabKIT/arguments-and-debates](https://huggingface.co/datasets/DebateLabKIT/arguments-and-debates), [Anthropic/persuasion](https://huggingface.co/datasets/Anthropic/persuasion), and [webis/args_me](https://huggingface.co/datasets/webis/args_me). ### Statistics | Sources | # Data | Avg. Words in Argument | Author of Argument | # Premises | % Implicit Premises | |----------------------------------|--------|-------------------------|--------------------|------------|---------------------| | Procon.org | 282 | 178.01 ± 100.68 | Staff Editors | 6.46 ± 2.95| 45.39 ± 19.39 | | Pros-and-cons-1950 | 119 | 43.41 ± 8.98 | Educators | 5.59 ± 1.82| 47.41 ± 18.08 | | Pros-and-cons-2010 | 373 | 83.53 ± 25.81 | Educators | 5.79 ± 2.20| 51.99 ± 18.13 | | NYT-room-for-debate | 297 | 398.21 ± 92.02 | Journalists | 8.13 ± 3.39| 41.68 ± 15.54 | | Anthropic/Persuasion | 287 | 252.33 ± 37.34 | Human / Claude | 8.47 ± 3.15| 40.42 ± 17.57 | | Synthetic Arguments | 1,520 | 332.60 ± 190.55 | GPT-5 / GPT-5.1 | 9.14 ± 4.28| 37.88 ± 15.29 | | Synthetic *Fallacious* Arguments | 297 | 296.45 ± 160.88 | GPT-5.2 | 8.40 ± 4.28| 35.70 ± 19.11 | | **Total** | 3,175 | 269.48 ± 177.21 | - | 8.15 ± 3.94| 40.94 ± 17.46 | <a id="bibtex"></a> ## 📚 BibTeX If you find this repo useful for your research, please consider citing us: ``` @article{ryu2026argument, title={Argument Reconstruction as Supervision for Critical Thinking in LLMs}, author={Ryu, Hyun and Chu, Gyouk and Betz, Gregor and Yang, Eunho and Rose, Carolyn and Welleck, Sean}, journal={arXiv preprint arXiv:2603.17432}, year={2026} } ``` ## ✉️ Contact If you have any questions or feedback, feel free to reach out: - Hyun Ryu: ryuhyun1905@kaist.ac.kr - Gyouk Chu: kyouwook@kaist.ac.kr
提供机构:
ChuGyouk
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作