sentence-transformers/ccnews
收藏数据集概述
基本信息
- 语言: 英语
- 多语言性: 单语
- 大小范围: 10万<n<100万
- 任务类别: 特征提取, 句子相似度
- 美观名称: CC News
- 标签: sentence-transformers
数据集配置
- 配置名称: pair
- 特征:
- 名称: title
- 数据类型: 字符串
- 名称: article
- 数据类型: 字符串
- 名称: title
数据集拆分
- 训练集:
- 字节数: 1529462734
- 示例数量: 614664
- 下载大小: 960719023
- 数据集大小: 1529462734
数据集子集
- 子集名称: pair
-
列: "title", "article"
-
列类型: 字符串, 字符串
-
示例: python { title: Tennessee joins states urging court to reinstate travel ban, article: NASHVILLE, Tenn. (AP) – Tennessee is joining more than a dozen other states in urging an appeals court to reinstate President Donald Trump’s revised travel ban. State Senate Majority Leader Mark Norris, a Collierville Republican considering a bid for governor next year, lauded Attorney General Herbert Slatery’s office for filing a brief with the 9th U.S. Circuit Court of Appeals in San Francisco. The states argue the ban falls within the president’s authority to block foreigners from the U.S. They also reject the argument that it targets Muslims. Norris last year sponsored legislation to allow the General Assembly to hire its own attorneys to file a legal challenge seeking to halt the federal refugee resettlement program in Tennessee after Slatery and Gov. Bill Haslam declined to sue over the issue., }
-
收集策略: 从embedding-training-data读取CC News数据集
-
去重: 否
-



