ritup3/SporTabSet
收藏Hugging Face2026-04-20 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/ritup3/SporTabSet
下载链接
链接失效反馈官方服务:
资源简介:
---
pretty_name: SporTabSet
tags:
- datasets
- text
- tabular
- sports
- basketball
- cricket
configs:
- config_name: basketball
data_files:
- split: train
path:
- basketball/final/original.json
- basketball/final/anonymized.json
- basketball/final/counterfactual.json
- config_name: basketball_temporal
data_files:
- split: old
path: basketball/final/original_old.json
- split: new
path: basketball/final/original_new.json
- config_name: cricket_odi_original
data_files:
- split: train
path: cricket/all_data_json/original.json
- config_name: cricket_odi_counterfactual
data_files:
- split: train
path: cricket/all_data_json/counterfactual.json
- config_name: cricket_odi_masked
data_files:
- split: train
path: cricket/all_data_json/masked.json
- config_name: cricket_odi_unmasked
data_files:
- split: train
path: cricket/all_data_json/unmasked.json
- config_name: cricket_odi_paraphrase
data_files:
- split: train
path: cricket/all_data_json/paraphrase.json
- config_name: cricket_odi_symbolic
data_files:
- split: train
path: cricket/all_data_json/symbolic.json
- config_name: cricket_odi_temporal
data_files:
- split: train
path: cricket/all_data_json/odi_complete_temporal.json
- config_name: cricket_t20_original
data_files:
- split: train
path: cricket/all_data_t20_json/original.json
- config_name: cricket_t20_counterfactual
data_files:
- split: train
path: cricket/all_data_t20_json/counterfactual.json
- config_name: cricket_t20_masked
data_files:
- split: train
path: cricket/all_data_t20_json/masked.json
- config_name: cricket_t20_unmasked
data_files:
- split: train
path: cricket/all_data_t20_json/unmasked.json
- config_name: cricket_t20_paraphrase
data_files:
- split: train
path: cricket/all_data_t20_json/paraphrase.json
- config_name: cricket_t20_symbolic
data_files:
- split: train
path: cricket/all_data_t20_json/symbolic.json
- config_name: cricket_t20_linguistic
data_files:
- split: train
path: cricket/all_data_t20_json/linguistic.json
---
# Coral Sports Commentary
This repository packages the finalized basketball data, the cricket ODI and T20 variants, and the temporal subsets needed for Hugging Face upload.
## Included Data
- `basketball`: finalized basketball commentary variants from `basketball/final`.
- `basketball_temporal`: basketball temporal partition exposed as `old` and `new` splits.
- `cricket_odi_*`: ODI cricket variants, excluding `old_2025.json` and `new_2025.json`.
- `cricket_odi_temporal`: ODI temporal cricket data from `odi_complete_temporal.json`.
- `cricket_t20_*`: T20 cricket variants.
## Data Fields
### Basketball
- `id`: unique game identifier.
- `commentary`: full game commentary text.
- `boxscore`: player-level boxscore stored as aligned arrays for team, player, and stat columns.
### Cricket ODI and T20 Core Variants
- `commentaries`: source commentary text.
- `commentary_final`: final transformed commentary text.
- `batsman_gnd_truth_table`: batting ground-truth table.
- `bolwer_gnd_truth_table`: bowling ground-truth table.
- `mapping`: replacement mapping used in counterfactual or symbolic variants when present.
- `ref_table`: source match/table reference when present.
- `commentaries_2`: auxiliary transformed commentary field used in some variants, often describing the masked commentary.
- `commentaries_hindi`: auxiliary variant-specific commentary field used in some T20 files.
- `commentaries_symbolic`: symbolic intermediate commentary field used in T20 symbolic data.
- `commentaries_counterfactual`: counterfactual intermediate commentary field used in T20 counterfactual data.
### Cricket ODI Temporal
- `commentaries`: source commentary text segment.
- `commentaries_masked`: masked version of the same commentary segment.
- `batsman_gnd_truth`: batting ground-truth records for the segment.
- `bowler_gnd_truth`: bowling ground-truth records for the segment.
提供机构:
ritup3



