auto-create-v2
收藏魔搭社区2025-05-24 更新2024-12-28 收录
下载链接:
https://modelscope.cn/datasets/wxzhuyeah/auto-create-v2
下载链接
链接失效反馈官方服务:
资源简介:
# Dataset Overview
dataset: cohere-v3-small
## Metadata
- **Creation Time**: 2024-12-23 16:23:41
- **Update Time**: 2024-12-23 09:15:33+0000
- **Source**: N/A
- **Task**: N/A
- **Train Samples**: 13000
- **Test Samples**: 100
- **License**: DISCLAIMER AND LICENSE NOTICE:
1. This dataset is intended for benchmarking and research purposes only.
2. The source data used in this dataset retains its original license and copyright. Users must comply with the respective licenses of the original data sources.
3. The ground truth part of the dataset (including but not limited to annotations, labels, and evaluation metrics) is licensed under Apache 2.0.
4. This dataset is provided 'AS IS' without any warranty. The dataset maintainers are not responsible for any copyright violations arising from the use of the source data.
5. If you are the copyright holder of any source data and believe it has been included inappropriately, please contact us for prompt removal.
6. Commercial use of this dataset must ensure compliance with the original data sources' licenses and obtain necessary permissions where required.
## Dataset Statistics
| Split | Name | Size | Num Rows | Num Columns | Schema | Num Files |
| --- | --- | --- | --- | --- | --- | --- |
| train | cohere-v3-small | 2.209 MB | 1000 | 6 | { "chunk_id": "string", "url": "string", "title": "string", "text": "string", "emb": "list", "idx": "int64"} | 1 |
| test | cohere-v3-small | 0.265 MB | 100 | 6 | { "chunk_id": "string", "url": "string", "title": "string", "text": "string", "emb": "list", "idx": "int64"} | 1 |
| neighbors | cohere-v3-small | 0.859 MB | 100 | 5 | { "idx": "int64", "neighbors_id": "list", "distance": "list", "metric": "string", "query_expr": "null"} | 1 |
# 数据集概览
数据集:cohere-v3-small
## 元数据
- **创建时间**:2024-12-23 16:23:41
- **更新时间**:2024-12-23 09:15:33+0000
- **来源**:无
- **任务类型**:无
- **训练样本数**:13000
- **测试样本数**:100
- **许可协议**:免责声明与许可须知:
1. 本数据集仅用于基准测试与研究用途。
2. 本数据集所使用的源数据保留其原有许可与版权,用户须遵守各原始数据源的相应许可协议。
3. 本数据集的真值部分(包括但不限于标注、标签与评估指标)采用Apache 2.0协议授权。
4. 本数据集按“现状”提供,不附带任何形式的担保。数据集维护方不对因使用源数据引发的任何版权侵权问题承担责任。
5. 若您是某源数据的版权持有者,且认为其被不当纳入本数据集,请联系我们以便及时移除。
6. 商业使用本数据集须确保遵守原始数据源的许可要求,并在必要时获取相关许可。
## 数据集统计信息
| 拆分集 | 名称 | 大小 | 行数 | 列数 | 数据结构 | 文件数 |
| --- | --- | --- | --- | --- | --- | --- |
| train | cohere-v3-small | 2.209 MB | 1000 | 6 | { "chunk_id": "字符串类型", "url": "字符串类型", "title": "字符串类型", "text": "字符串类型", "emb": "列表类型", "idx": "int64"} | 1 |
| test | cohere-v3-small | 0.265 MB | 100 | 6 | { "chunk_id": "字符串类型", "url": "字符串类型", "title": "字符串类型", "text": "字符串类型", "emb": "列表类型", "idx": "int64"} | 1 |
| neighbors | cohere-v3-small | 0.859 MB | 100 | 5 | { "idx": "int64", "neighbors_id": "列表类型", "distance": "列表类型", "metric": "字符串类型", "query_expr": "空值"} | 1 |
提供机构:
maas
创建时间:
2024-12-27



