
SKGInstruct

ModelScope Community · Updated 2025-12-05 · Indexed 2024-06-01
Download link:
https://modelscope.cn/datasets/TIGER-Lab/SKGInstruct
Resource description:
# 🏗️ StructLM: Towards Building Generalist Models for Structured Knowledge Grounding

SKGInstruct is an instruction tuning dataset constructed from 19 structured knowledge grounding datasets, mixed with 🤗 [SlimOrca](https://huggingface.co/datasets/Open-Orca/SlimOrca)

Project Page: [https://tiger-ai-lab.github.io/StructLM/](https://tiger-ai-lab.github.io/StructLM/)

Paper: [https://arxiv.org/pdf/2402.16671.pdf](https://arxiv.org/pdf/2402.16671.pdf)

Code: [https://github.com/TIGER-AI-Lab/StructLM](https://github.com/TIGER-AI-Lab/StructLM)

Models:

7B | [StructLM-7B](https://huggingface.co/TIGER-Lab/StructLM-7B)

13B | [StructLM-13B](https://huggingface.co/TIGER-Lab/StructLM-13B)

34B | [StructLM-34B](https://huggingface.co/TIGER-Lab/StructLM-34B)

## **License**

| Dataset Name | License Type |
|--------------|----------------|
| TabMWP | [Attribution-ShareAlike 4.0 International](https://creativecommons.org/licenses/by-sa/4.0/)|
| SlimOrca | MIT |
| everything else | [Attribution-NonCommercial-ShareAlike 4.0 International](https://creativecommons.org/licenses/by-nc-sa/4.0/)|

## **Citation**

Please cite our paper if you use our data, model or code. Please also kindly cite the original dataset papers.

```
@misc{zhuang2024structlm,
  title={StructLM: Towards Building Generalist Models for Structured Knowledge Grounding},
  author={Alex Zhuang and Ge Zhang and Tianyu Zheng and Xinrun Du and Junjie Wang and Weiming Ren and Stephen W. Huang and Jie Fu and Xiang Yue and Wenhu Chen},
  year={2024},
  eprint={2402.16671},
  archivePrefix={arXiv},
  primaryClass={cs.CL}
}
```
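
Because the License section of this card assigns different terms to different components of the mixture, it may help to encode that table as a small lookup when filtering data for a particular use. The following is a minimal sketch, not part of the StructLM code base: the names `LICENSE_OVERRIDES`, `DEFAULT_LICENSE`, `license_for`, and `is_noncommercial` are hypothetical, and `"some_other_component"` is a placeholder rather than a real dataset name. It is not legal advice.

```python
# Hypothetical helper mirroring the License table of this dataset card.
# None of these names come from the StructLM repository.

# Fallback for the "everything else" row of the table.
DEFAULT_LICENSE = "CC BY-NC-SA 4.0"

# The two components the table lists with their own license.
LICENSE_OVERRIDES = {
    "TabMWP": "CC BY-SA 4.0",
    "SlimOrca": "MIT",
}


def license_for(dataset_name: str) -> str:
    """Return the license listed for a component dataset of SKGInstruct."""
    return LICENSE_OVERRIDES.get(dataset_name.strip(), DEFAULT_LICENSE)


def is_noncommercial(dataset_name: str) -> bool:
    """True when the listed license carries a NonCommercial (NC) clause."""
    return "-NC" in license_for(dataset_name)


if __name__ == "__main__":
    # "some_other_component" is a placeholder for any other source dataset.
    for name in ("TabMWP", "SlimOrca", "some_other_component"):
        print(name, "->", license_for(name), "| non-commercial:", is_noncommercial(name))
```

Keeping the "everything else" row as a default rather than enumerating the remaining source datasets avoids guessing component names that this card does not list.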

Provider:
maas
Created:
2024-05-29