waashk/medline
Source: Hugging Face | Updated: 2025-04-02 | Indexed: 2025-04-12
Download link:
https://hf-mirror.com/datasets/waashk/medline
Description:
The MEDLINE dataset is a text classification dataset containing more than 100K but fewer than 1M records. It includes text data and the corresponding encoded labels, stored in .parquet format. It also includes split files in .pkl format for k-fold cross-validation, and each split has its own training and test sets. The dataset is used to evaluate automatic text classification methods and to make experimental results reproducible.
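The way fixed split files support reproducibility can be illustrated with a minimal sketch. The layout below is an assumption, not the dataset's documented schema: each fold is taken to be a pair of train and test index lists, and the toy texts, labels, and the helper name `iter_folds` are hypothetical. The point is only that every experiment reuses the same stored indices instead of resampling.

```python
import io
import pickle


def iter_folds(texts, labels, splits):
    """Yield (train_texts, train_labels, test_texts, test_labels) per fold."""
    for train_idx, test_idx in splits:
        # A record must not appear on both sides of the same fold.
        assert not set(train_idx) & set(test_idx)
        yield (
            [texts[i] for i in train_idx],
            [labels[i] for i in train_idx],
            [texts[i] for i in test_idx],
            [labels[i] for i in test_idx],
        )


# Toy stand-ins for the text column and the encoded-label column.
texts = ["doc a", "doc b", "doc c", "doc d"]
labels = [0, 1, 0, 1]

# Assumed split layout: a list of (train_indices, test_indices) pairs.
toy_splits = [([0, 1], [2, 3]), ([2, 3], [0, 1])]

# Round-trip through pickle to mimic reading a .pkl split file from disk.
buffer = io.BytesIO()
pickle.dump(toy_splits, buffer)
buffer.seek(0)
splits = pickle.load(buffer)

folds = list(iter_folds(texts, labels, splits))
for fold_id, (x_train, y_train, x_test, y_test) in enumerate(folds):
    print(fold_id, len(x_train), len(x_test))
```

With the real files, the text and label columns would typically be read with `pandas.read_parquet` and the splits with `pickle.load`. Inspect the loaded objects first, since their actual structure may differ from what is assumed here.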
Provider:
waashk



