SKGInstruct
收藏魔搭社区2025-12-05 更新2024-06-01 收录
下载链接:
https://modelscope.cn/datasets/TIGER-Lab/SKGInstruct
下载链接
链接失效反馈官方服务:
资源简介:
# 🏗️ StructLM: Towards Building Generalist Models for Structured Knowledge Grounding
SKGInstruct is an instruction tuning dataset constructed from 19 structured knowledge grounding datasets, mixed with 🤗 [SlimOrca](https://huggingface.co/datasets/Open-Orca/SlimOrca)
Project Page: [https://tiger-ai-lab.github.io/StructLM/](https://tiger-ai-lab.github.io/StructLM/)
Paper: [https://arxiv.org/pdf/2402.16671.pdf](https://arxiv.org/pdf/2402.16671.pdf)
Code: [https://github.com/TIGER-AI-Lab/StructLM](https://github.com/TIGER-AI-Lab/StructLM)
Models:
7B | [StructLM-7B](https://huggingface.co/TIGER-Lab/StructLM-7B)
13B | [StructLM-13B](https://huggingface.co/TIGER-Lab/StructLM-13B)
34B | [StructLM-34B](https://huggingface.co/TIGER-Lab/StructLM-34B)
## **License**
| Dataset Name | License Type |
|--------------|----------------|
| TabMWP | [Attribution-ShareAlike 4.0 International](https://creativecommons.org/licenses/by-sa/4.0/)|
| SlimOrca | MIT |
| everything else | [Attribution-NonCommercial-ShareAlike 4.0 International](https://creativecommons.org/licenses/by-nc-sa/4.0/)|
## **Citation**
Please cite our paper if you use our data, model or code. Please also kindly cite the original dataset papers.
```
@misc{zhuang2024structlm,
title={StructLM: Towards Building Generalist Models for Structured Knowledge Grounding},
author={Alex Zhuang and Ge Zhang and Tianyu Zheng and Xinrun Du and Junjie Wang and Weiming Ren and Stephen W. Huang and Jie Fu and Xiang Yue and Wenhu Chen},
year={2024},
eprint={2402.16671},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
```
# 🏗️ StructLM:面向结构化知识锚定(Structured Knowledge Grounding, SKG)的通用大模型构建
SKGInstruct 是一个指令微调数据集,由19个结构化知识锚定数据集构建而来,并融合了🤗 [SlimOrca](https://huggingface.co/datasets/Open-Orca/SlimOrca) 数据集。
项目页面:[https://tiger-ai-lab.github.io/StructLM/](https://tiger-ai-lab.github.io/StructLM/)
论文链接:[https://arxiv.org/pdf/2402.16671.pdf](https://arxiv.org/pdf/2402.16671.pdf)
代码仓库:[https://github.com/TIGER-AI-Lab/StructLM](https://github.com/TIGER-AI-Lab/StructLM)
模型版本:
7B 参数量 | [StructLM-7B](https://huggingface.co/TIGER-Lab/StructLM-7B)
13B 参数量 | [StructLM-13B](https://huggingface.co/TIGER-Lab/StructLM-13B)
34B 参数量 | [StructLM-34B](https://huggingface.co/TIGER-Lab/StructLM-34B)
## **许可证**
| 数据集名称 | 许可证类型 |
|--------------|----------------|
| TabMWP | ["署名-相同方式共享4.0国际版"](https://creativecommons.org/licenses/by-sa/4.0/)|
| SlimOrca | MIT 许可证 |
| 其余所有数据集 | ["署名-非商业性使用-相同方式共享4.0国际版"](https://creativecommons.org/licenses/by-nc-sa/4.0/)|
## **引用说明**
若您使用本数据集、模型或代码,请引用本文献。同时也请一并引用对应原始数据集的相关学术论文。
@misc{zhuang2024structlm,
title={StructLM: Towards Building Generalist Models for Structured Knowledge Grounding},
author={Alex Zhuang and Ge Zhang and Tianyu Zheng and Xinrun Du and Junjie Wang and Weiming Ren and Stephen W. Huang and Jie Fu and Xiang Yue and Wenhu Chen},
year={2024},
eprint={2402.16671},
archivePrefix={arXiv},
primaryClass={cs.CL}
}
提供机构:
maas
创建时间:
2024-05-29



