lopozz/CulturaViva-Retrieval
Hugging Face · updated 2026-04-20 · indexed 2026-04-26
Download link:
https://hf-mirror.com/datasets/lopozz/CulturaViva-Retrieval
Resource overview:
---
dataset_info:
- config_name: corpus
  features:
  - name: _id
    dtype: string
  - name: title
    dtype: string
  - name: text
    dtype: string
  splits:
  - name: train
    num_bytes: 314059992
    num_examples: 51719
  - name: test
    num_bytes: 34851568
    num_examples: 5747
  download_size: 190349876
  dataset_size: 348911560
- config_name: qrels
  features:
  - name: query-id
    dtype: string
  - name: corpus-id
    dtype: string
  - name: score
    dtype: int64
  splits:
  - name: train
    num_bytes: 2149978
    num_examples: 51719
  - name: test
    num_bytes: 216166
    num_examples: 5747
  download_size: 795963
  dataset_size: 2366144
- config_name: queries
  features:
  - name: _id
    dtype: string
  - name: text
    dtype: string
  splits:
  - name: train
    num_bytes: 19445996
    num_examples: 51719
  - name: test
    num_bytes: 2137728
    num_examples: 5747
  download_size: 10734939
  dataset_size: 21583724
configs:
- config_name: corpus
  data_files:
  - split: train
    path: corpus/train-*
  - split: test
    path: corpus/test-*
- config_name: qrels
  data_files:
  - split: train
    path: qrels/train-*
  - split: test
    path: qrels/test-*
- config_name: queries
  data_files:
  - split: train
    path: queries/train-*
  - split: test
    path: queries/test-*
---
## Description
**CulturaViva-Retrieval** is a dataset written natively in Italian and designed to capture the complexity and cultural richness of the language. Unlike datasets produced by translation, CulturaViva-ITA draws directly from Italian culture, literature, and history, offering linguistically accurate and culturally significant data.
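The metadata above lists three configurations (`corpus`, `queries`, `qrels`), each with a `train` split of 51,719 rows and a `test` split of 5,747 rows. The following is a minimal sketch of how they might be loaded and joined with the `datasets` library. It rests on two assumptions the card does not state: that `query-id` and `corpus-id` in `qrels` refer to the `_id` columns of `queries` and `corpus`, and that a `score` above zero marks a relevant pair. The helper names `build_relevance_map` and `load_split` are hypothetical and not part of the dataset.

```python
from collections import defaultdict
from typing import Any, Dict, Iterable, Mapping, Set

REPO_ID = "lopozz/CulturaViva-Retrieval"


def build_relevance_map(qrels: Iterable[Mapping[str, Any]]) -> Dict[str, Set[str]]:
    """Group qrels rows into {query-id: {corpus-id, ...}}.

    Assumption: rows with score > 0 are relevant; other rows are skipped.
    """
    relevant: Dict[str, Set[str]] = defaultdict(set)
    for row in qrels:
        if row["score"] > 0:
            relevant[row["query-id"]].add(row["corpus-id"])
    return dict(relevant)


def load_split(split: str = "test"):
    """Load the three configurations for one split.

    Requires the `datasets` package and network access; the import is
    deferred so that build_relevance_map has no third-party dependency.
    """
    from datasets import load_dataset

    corpus = load_dataset(REPO_ID, "corpus", split=split)
    queries = load_dataset(REPO_ID, "queries", split=split)
    qrels = load_dataset(REPO_ID, "qrels", split=split)
    return corpus, queries, qrels
```

With network access, `corpus, queries, qrels = load_split("test")` followed by `build_relevance_map(qrels)` would give a lookup from each query ID to its relevant document IDs, which is the usual starting point for retrieval evaluation.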
## Reference & Citation
For more information, visit the official repository: [DeepMount00/CulturaViva-ITA](https://huggingface.co/datasets/DeepMount00/CulturaViva-ITA)
Please cite this dataset as follows:
```bibtex
@misc{culturaviva-ita,
author = {Montebovi, Michele},
title = {CulturaViva-ITA: An Authentically Italian Dataset for NLP},
year = {2025},
publisher = {Michele Montebovi},
url = {https://huggingface.co/datasets/DeepMount00/CulturaViva-ITA}
}
```
## License
CulturaViva-ITA is released under the **Creative Commons Attribution 4.0 International (CC BY 4.0)** license.
Provider:
lopozz



