withpi/embedding-msmarco-bert-ensemble-margin-mse-large_embedding_tokenized_8k_1_embedding
收藏Hugging Face2025-05-29 更新2025-05-31 收录
下载链接:
https://hf-mirror.com/datasets/withpi/embedding-msmarco-bert-ensemble-margin-mse-large_embedding_tokenized_8k_1_embedding
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含文本特征和标签信息的NLP数据集,适用于文本分类或文本匹配任务。数据集包含训练集和测试集,特征字段包括标签、类别、查询及其关注掩码、正例和反例的输入ID及其关注掩码、文本长度信息和一个自动生成的主键。每个样本都有对应的标签和文本特征,可以用于训练机器学习模型进行文本相关的预测任务。
This dataset is an NLP dataset containing text features and label information, suitable for text classification or text matching tasks. The dataset includes training and test sets, with feature fields including label, category, query and its attention mask, positive and negative input IDs and their attention masks, text length information, and an auto-generated primary key. Each sample has corresponding labels and text features that can be used to train machine learning models for text-related prediction tasks.
提供机构:
withpi



