datasets-examples/doc-formats-csv-3
收藏Hugging Face2023-11-23 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/datasets-examples/doc-formats-csv-3
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files: "data.csv"
delimiter: "|"
header: 1
names: ["kind", "sound"]
size_categories:
- n<1K
---
# [doc] formats - csv - 3
This dataset contains one csv file at the root:
- [data.csv](./data.csv)
```csv
# ignored comment
col1|col2
dog|woof
cat|meow
pokemon|pika
human|hello
```
We define the config name in the YAML config, as well as the exact location of the file, the separator as `"|"`, the name of the columns, and the number of rows to ignore (the row #1 is a row of column headers, that will be replaced by the `names` option, and the row #0 is ignored). The reference for the options is the [documentation of pandas.read_csv()](https://pandas.pydata.org/docs/reference/api/pandas.read_csv.html).
```yaml
---
configs:
- config_name: default
data_files: "data.csv"
delimiter: "|"
header: 1
names: ["kind", "sound"]
size_categories:
- n<1K
---
```
提供机构:
datasets-examples
原始信息汇总
数据集概述
数据文件
- 文件名:
data.csv - 分隔符:
| - 列名: ["kind", "sound"]
数据内容示例
csv
ignored comment
col1|col2 dog|woof cat|meow pokemon|pika human|hello
配置信息
- 配置名称:
default - 数据文件位置:
data.csv - 分隔符:
| - 列头行: 1
- 列名: ["kind", "sound"]
数据规模
- 数据行数: n<1K



