v1ctor10/cos_sim_raw_2018to2020_exp
Source: Hugging Face. Updated 2025-03-19; indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/v1ctor10/cos_sim_raw_2018to2020_exp
Description:
The dataset contains integer identifiers for Company1 and Company2, the year (stored as a string), and four numeric features: a correlation coefficient, the sum of absolute differences, the normalized sum of absolute differences, and the cosine similarity. It provides a training split with approximately 2,895,940 examples and a file size of 150,950,873 bytes. The default configuration specifies the path to the training split's data files.
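The listing does not say how these features were derived or what the columns are called, so the following is only a minimal sketch of how such pairwise features could be computed for two numeric vectors. The function name `pair_features`, the dictionary keys, and the input vectors are hypothetical, and the normalization shown (dividing by the combined absolute magnitude of both vectors) is an assumption, not the dataset author's documented method.

```python
import math


def pair_features(a, b):
    """Compute four pairwise features for two equal-length numeric vectors.

    All key names below are hypothetical; the real column names are not
    given in the listing.
    """
    if len(a) != len(b) or not a:
        raise ValueError("vectors must be non-empty and of equal length")

    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n

    # Pearson correlation coefficient.
    # Raises ZeroDivisionError if either vector is constant.
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    correlation = cov / math.sqrt(var_a * var_b)

    # Sum of absolute element-wise differences.
    sum_abs_diff = sum(abs(x - y) for x, y in zip(a, b))

    # ASSUMPTION: normalize by the total absolute magnitude of both vectors,
    # which bounds the value to [0, 1]. The dataset may use another scheme.
    total_magnitude = sum(abs(x) for x in a) + sum(abs(y) for y in b)
    normalized_sum_abs_diff = sum_abs_diff / total_magnitude

    # Cosine similarity: dot product divided by the product of the norms.
    # Raises ZeroDivisionError if either vector is all zeros.
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    cosine_similarity = dot / (norm_a * norm_b)

    return {
        "correlation": correlation,
        "sum_abs_diff": sum_abs_diff,
        "normalized_sum_abs_diff": normalized_sum_abs_diff,
        "cosine_similarity": cosine_similarity,
    }


# Hypothetical example: the second vector is exactly twice the first.
features = pair_features([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
print(features)
```

Because the second vector is a positive multiple of the first, the correlation and the cosine similarity both come out at 1 (up to floating-point rounding), while the sum of absolute differences still registers the difference in scale.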
Provided by:
v1ctor10



