gianlucar/rugby_test_2
Hugging Face · Updated 2023-12-15 · Indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/gianlucar/rugby_test_2
Description:
---
task_categories:
- text-generation
language:
- en
tags:
- fine-tuning
- touch rugby
size_categories:
- n<1K
---
# Touch Rugby Rules Dataset (for embeddings)
`train.csv` is taken from the FIT 5th Edition Rulebook PDF on the [International Touch Website](https://cdn.internationaltouch.org/public/FIT%205th%20Edition%20Rulebook.pdf).

`test.csv` is copied and pasted from the abbreviated rules on the [UK Touch website](https://www.englandtouch.org.uk/develop/coaching/the-rules/). Note that this bypasses the PDF-to-text stage, because the text comes straight from a web page.

All text is chunked to a length of 100 tokens with 50% overlap, so each chunk shares its first 50 tokens with the end of the previous chunk.

For educational and non-commercial use only.
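
The chunking described above (100 tokens, 50% overlap) can be illustrated with a minimal sketch. The card does not say which tokenizer was used, so this example assumes plain whitespace tokens; the function names `chunk_tokens` and `chunk_text` are hypothetical and are not part of the dataset's actual preprocessing code.

```python
from typing import List


def chunk_tokens(tokens: List[str], chunk_size: int = 100,
                 overlap: float = 0.5) -> List[List[str]]:
    """Split a token list into fixed-size windows that overlap.

    With chunk_size=100 and overlap=0.5 the window advances 50 tokens
    at a time, so consecutive chunks share 50 tokens.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < 1:
        raise ValueError("overlap must be in [0, 1)")

    # Number of tokens the window moves forward on each step.
    stride = max(1, int(chunk_size * (1 - overlap)))

    chunks: List[List[str]] = []
    for start in range(0, len(tokens), stride):
        chunks.append(tokens[start:start + chunk_size])
        # Stop once the current window has reached the end of the text.
        if start + chunk_size >= len(tokens):
            break
    return chunks


def chunk_text(text: str, chunk_size: int = 100,
               overlap: float = 0.5) -> List[str]:
    """Chunk raw text, assuming whitespace tokenization (an assumption)."""
    tokens = text.split()
    return [" ".join(c) for c in chunk_tokens(tokens, chunk_size, overlap)]
```

A real pipeline would likely swap `text.split()` for the tokenizer of the target embedding model, since token counts differ between tokenizers. The last chunk may be shorter than `chunk_size` when the text length is not a multiple of the stride.
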
This dataset is designed for text generation tasks and contains English text on the rules of touch rugby. It is small, with fewer than 1,000 samples. The text comes from the International Touch website and the UK Touch website, and all of it is chunked into 100-token segments with 50% overlap. It is intended for educational and non-commercial use only.
Provider:
gianlucar



