five

acrion/zelph

收藏
Hugging Face2026-03-14 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/acrion/zelph
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-4.0 task_categories: - graph-ml language: - en tags: - semantic-network - knowledge-graph - wikidata - graph-ml - semantic-inference - token-compression pretty_name: 'zelph Binaries: Semantic Networks from Knowledge Bases (e.g., Wikidata)' size_categories: - 100M<n<1B --- # zelph Binaries Dataset This dataset provides pre-compiled binary files (.bin) for use with [zelph](https://zelph.org), a sophisticated semantic network system. These binaries are derived from large knowledge bases like Wikidata, optimized for fast loading and efficient querying. ## Dataset Description zelph binaries enable users to work with semantic networks without the need to import raw dumps (e.g., JSON files), which can take hours. Instead, these .bin files load in minutes, though they require substantial RAM. The dataset includes both full and pruned variants: - **Full binaries**: Contain the complete network, suitable for comprehensive use but demanding high RAM (Wikidata: ~ 210 GB RAM). - **Pruned binaries**: Reduced versions with removed domains (e.g., biology, chemistry, astronomy) to lower RAM requirements (~ 16 GB RAM) while preserving core connections. For detailed information on each binary, including sizes, creation dates, pruning details, and updates, visit [https://zelph.org/binaries](https://zelph.org/binaries). ## How to Use 1. Download the desired .bin file from this dataset. 2. In zelph interactive mode, load it with: ``` .load /path/to/your-file.bin ``` 3. Run queries, define rules, perform inferences or run complete scripts (see [zelph on GitHub](https://github.com/acrion/zelph?tab=readme-ov-file#performing-inference) for details). ## LLM-Friendly Outputs zelph can generate rule-based inferences in a compressed text format optimized for LLM training or processing. This uses a token encoder that maps Wikidata IDs (Q/P) to compact UTF-8 symbols (CJK range), reducing input length while preserving structure. This feature is currently focused on Wikidata, but it can be adapted for similar use cases. Use it to export inferences from loaded binaries for LLM datasets – see the [command documentation on GitHub](https://github.com/acrion/zelph?tab=readme-ov-file#exporting-deduced-facts-to-file) for details. ## Citation If you use this dataset, cite as: ``` @dataset{zelph, author = {Stefan Zipproth}, title = {zelph Binaries Dataset}, year = {2026}, url = {https://huggingface.co/datasets/acrion/zelph} } ```
提供机构:
acrion
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作