TAUR-dev/agg__BC__lmfd__rewritten_v3_to_4omini__train
Source: Hugging Face | Updated: 2025-04-02 | Indexed: 2025-04-12
Download link:
https://hf-mirror.com/datasets/TAUR-dev/agg__BC__lmfd__rewritten_v3_to_4omini__train
Description:
The dataset includes multiple feature fields, among them repository information (repo), raw data (raw), normalized spans (norm_spans), normalized line breaks (norm_line_breaks), number of samples (n_samples), average number of line breaks (avg_line_breaks), total average (total_avg), and average word count (word_count_avg). It has a single training split (train). The byte size and number of examples are provided for that split, and the total download size and on-disk size of the dataset are also stated.
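The field list above can be checked against the actual columns once the data is loaded. The following is a minimal sketch, not an official loader: it assumes the Hugging Face `datasets` library is installed, that the repository is publicly readable under the ID taken from the download link, and that the column names match the field names in the description. The helpers `load_train_split` and `missing_fields` are hypothetical names introduced here for illustration.

```python
# Sketch under stated assumptions: dataset ID taken from the download link,
# field names taken from the description; neither is verified here.
DATASET_ID = "TAUR-dev/agg__BC__lmfd__rewritten_v3_to_4omini__train"

EXPECTED_FIELDS = [
    "repo",
    "raw",
    "norm_spans",
    "norm_line_breaks",
    "n_samples",
    "avg_line_breaks",
    "total_avg",
    "word_count_avg",
]


def missing_fields(column_names):
    """Return the described fields that are absent from column_names."""
    present = set(column_names)
    return [name for name in EXPECTED_FIELDS if name not in present]


def load_train_split():
    """Load the train split (hypothetical helper; needs network access)."""
    # Imported lazily so the rest of the module works without `datasets`.
    from datasets import load_dataset

    return load_dataset(DATASET_ID, split="train")
```

A typical use would be to call `load_train_split()`, pass its `column_names` attribute to `missing_fields`, and treat a non-empty result as a sign that the description and the published schema have drifted apart.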
Provided by:
TAUR-dev



