five

MMFS directories for LargeRDFBench datasets

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/13954799
下载链接
链接失效反馈
官方服务:
资源简介:
The file mmfs.tar.zst contains all cleaned (RDF 1.1) datasets from the LargeRDFBench, encoded with the Mapped-Memory Friendly Store  (MMFS) format, which is used in experiments which code, scripts and results are also available as a dataset in Zenodo: 10.5281/zenodo.13960678. Uncompressed, the 13 directories will cost 36.317 GiB in disk space. The list of directories inside mmfs.tar.zst is: Affymetrix ChEBI DBPedia-Subset DrugBank GeoNames Jamendo KEGG LinkedTCGA-A LinkedTCGA-E LinkedTCGA-M LMDB NYT SWDFood Each MMFS directory contains 4 files: shared: suffixes and prefixes that are shared between multiple RDF terms; strings: The dictionary that maps the N-Triples representation of each RDF term in the graph to a unique 64-bit sequential identifier. This dictionary also decomposes the N-Triples strings into a local part, stored at the end of this file, and a shared prefix/suffix, stored at the shared file; lexical: This is a small list L of bitsets, where L[hash(t) % L.length] is the set of literal suffixes for which there MAY be a term u such that str(u) = str(t). spo: This is a triples index which stores a sorted (by ID) list of predicate and object pairs for each subject; pso: Similar to spo, but stores subject-object pairs for a given predicate; ops: Similar to spo, but stores predicate-subject pairs given an object.
创建时间:
2025-02-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作