FAIRsharing record for: Tabular Data Package
Source: Mendeley Data. Updated: 2024-01-31. Indexed: 2024-06-30.
Download link:
https://fairsharing.org/10.25504/FAIRsharing.082881
Resource description:
This FAIRsharing record describes Tabular Data Package, a simple container format for publishing and sharing tabular data. The format emphasizes simplicity and ease of use, especially online. It targets data that can be presented in a tabular structure, and it aims to make tabular data packages easy to produce from (and consume in) spreadsheets and relational databases.
The key features of the format are: CSV (comma-separated values) for data files; a single JSON file (datapackage.json) that describes the dataset, including a schema for each data file; and reuse of existing work, including other Frictionless Data specifications.
As the name suggests, Tabular Data Package extends and specializes the Data Package specification for the case where the data is tabular.
In a Tabular Data Package, each CSV file must have a schema defined using Table Schema and may optionally have a dialect defined using the CSV Dialect Description Format (CSV-DDF). An application or library that consumes Tabular Data Packages must therefore understand not only the full Data Package specification but also Table Schema and CSV-DDF.
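As a rough illustration of the structure described above, the following Python sketch builds a minimal datapackage.json descriptor for one CSV resource, with a Table Schema and an optional dialect, and then uses that descriptor to read the CSV. The package name, resource name, file path, field names, and sample rows are all hypothetical, and the descriptor property names (`profile`, `resources`, `schema`, `fields`, `dialect`) reflect one reading of the Frictionless Data specifications; they should be checked against the current specification text. The reader handles only three field types and is not a substitute for a real Frictionless library.

```python
import csv
import io
import json

# Hypothetical sample data; in a real package this would be the file at "path".
CSV_TEXT = "id,name,height_m\n1,Alice,1.70\n2,Bob,1.82\n"

# Minimal descriptor sketch (assumed property names, see the note above).
descriptor = {
    "name": "example-package",            # hypothetical package name
    "profile": "tabular-data-package",
    "resources": [
        {
            "name": "people",             # hypothetical resource name
            "path": "people.csv",         # hypothetical CSV path
            "profile": "tabular-data-resource",
            # Table Schema: required for every CSV in the package.
            "schema": {
                "fields": [
                    {"name": "id", "type": "integer"},
                    {"name": "name", "type": "string"},
                    {"name": "height_m", "type": "number"},
                ]
            },
            # CSV dialect: optional.
            "dialect": {"delimiter": ",", "header": True},
        }
    ],
}

# Only the three field types used in this sketch are supported.
CASTS = {"integer": int, "number": float, "string": str}


def read_rows(csv_text, resource):
    """Parse CSV text using the resource's schema and (optional) dialect."""
    fields = resource["schema"]["fields"]
    delimiter = resource.get("dialect", {}).get("delimiter", ",")
    reader = csv.reader(io.StringIO(csv_text), delimiter=delimiter)

    # The header row must match the field names declared in the schema.
    header = next(reader)
    expected = [field["name"] for field in fields]
    if header != expected:
        raise ValueError(f"header {header} does not match schema {expected}")

    rows = []
    for record in reader:
        rows.append(
            {
                field["name"]: CASTS[field["type"]](value)
                for field, value in zip(fields, record)
            }
        )
    return rows


# Serialize as it would appear in datapackage.json, then read it back.
descriptor_json = json.dumps(descriptor, indent=2)
rows = read_rows(CSV_TEXT, json.loads(descriptor_json)["resources"][0])
print(rows)
```

The sketch keeps the descriptor and the data separate, mirroring the format's split between datapackage.json and the CSV files it describes; a consumer reads the descriptor first and only then interprets the CSV.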
Created:
2024-01-31



