vesteinn/FC3
收藏Hugging Face2023-03-23 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/vesteinn/FC3
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc
language:
- fo
pretty_name: FC3
---
This is the Faroese Common Crawl corpus. The largest dataset of mono-lingual Faroese text, it was extracted from the Common Crawl.
If you find this dataset useful, please cite
```
@inproceedings{snaebjarnarson-etal-2023-transfer,
title = "{T}ransfer to a Low-Resource Language via Close Relatives: The Case Study on Faroese",
author = "Snæbjarnarson, Vésteinn and
Simonsen, Annika and
Glavaš, Goran and
Vulić, Ivan",
booktitle = "Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)",
month = "may 22--24",
year = "2023",
address = "Tórshavn, Faroe Islands",
publisher = {Link{\"o}ping University Electronic Press, Sweden},
}
```
提供机构:
vesteinn
原始信息汇总
数据集概述
数据集名称
- FC3
数据集描述
- 这是法罗语的Common Crawl语料库,是最大的单语法罗语文本数据集,从Common Crawl中提取。
语言
- 法罗语
许可证
- CC



