zhliu/ArxivMIA
Source: Hugging Face · Updated 2024-06-01 · Indexed 2024-06-12
Download link:
https://hf-mirror.com/datasets/zhliu/ArxivMIA
Resource description:
---
license: cc-by-4.0
task_categories:
- text-classification
language:
- en
configs:
- config_name: arxiv_mia
  data_files: data/arxiv_mia.jsonl
  default: true
- config_name: arxiv_mia_dev
  data_files: data/arxiv_mia_dev.jsonl
- config_name: arxiv_mia_test
  data_files: data/arxiv_mia_test.jsonl
---
## Dataset Card for ArxivMIA
To evaluate pre-training data detection methods in a more challenging scenario, we introduce ArxivMIA, a new benchmark comprising abstracts from the fields of Computer Science (CS) and Mathematics (Math) sourced from arXiv.
- **Repository:** https://github.com/zhliu0106/probing-lm-data
- **Paper:** Probing Language Models for Pre-training Data Detection
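
The three configs declared in the card's header each point to one JSON Lines file. As a minimal sketch, assuming the dataset repository has been cloned locally, the mapping can be written out and a config loaded with the standard library alone. The helper names `read_jsonl` and `load_config` are hypothetical, and the record fields are not documented in this card, so the code makes no assumption about them.

```python
import json
from pathlib import Path

# Config names and file paths copied from the card's YAML header.
CONFIG_FILES = {
    "arxiv_mia": "data/arxiv_mia.jsonl",
    "arxiv_mia_dev": "data/arxiv_mia_dev.jsonl",
    "arxiv_mia_test": "data/arxiv_mia_test.jsonl",
}
DEFAULT_CONFIG = "arxiv_mia"  # marked `default: true` in the header


def read_jsonl(path):
    """Read a JSON Lines file into a list of dicts, skipping blank lines."""
    records = []
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip()
            if line:
                records.append(json.loads(line))
    return records


def load_config(repo_root, config_name=DEFAULT_CONFIG):
    """Load one config from a local clone of the dataset repository.

    `repo_root` is the directory containing the `data/` folder
    (an assumption about the local layout, mirroring the paths above).
    """
    return read_jsonl(Path(repo_root) / CONFIG_FILES[config_name])
```

With the Hugging Face `datasets` library installed, `load_dataset("zhliu/ArxivMIA", "arxiv_mia_dev")` should select the same file by config name; omitting the second argument should fall back to the default `arxiv_mia` config.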
Provider:
zhliu
Summary of original information
Dataset overview
Dataset name
- ArxivMIA
License
- CC-BY-4.0
Task category
- Text classification
Language
- English
Configuration
- config_name: arxiv_mia
  - Data file: data/arxiv_mia.jsonl
  - Default config: true
- config_name: arxiv_mia_dev
  - Data file: data/arxiv_mia_dev.jsonl
- config_name: arxiv_mia_test
  - Data file: data/arxiv_mia_test.jsonl



