midah/license-features
收藏Hugging Face2026-04-21 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/midah/license-features
下载链接
链接失效反馈官方服务:
资源简介:
License Features数据集包含744个软件和AI许可证的LLM提取特征值,涵盖了Nordlander、Oliner & Woo (2004)、Kapitsaki et al. (2019)以及AI模型许可证的ML特定扩展的分类法。数据集分为三个parquet文件,分别对应不同的分类法特征。每个文件包含相同的`spdx_id`和`model`键列,可以通过连接这些文件来获取每个许可证的完整25个特征向量。数据集还包括724个SPDX许可证和23个AI特定许可证。
LLM-extracted feature values for 744 software and AI licenses, covering the taxonomies from Nordlander, Oliner & Woo (2004), Kapitsaki et al. (2019), and an ML-specific extension for AI model licenses. The dataset splits features by taxonomy into three parquets, all sharing the same `spdx_id` + `model` key columns. Join across files to get the full 25-feature vector per license per model. The corpus includes 724 SPDX licenses and 23 AI-specific licenses.
提供机构:
midah



