The Virtual Patent (VP-WPI) Test Collection
Source: DataCite Commons. Updated: 2025-11-10. Indexed: 2026-05-06.
Download link:
https://researchdata.tuwien.ac.at/doi/10.48436/x309z-a9q08
Description:
The VP-WPI Test Collection is a novel dataset that implements the Virtual Patent (VP) concept. A Virtual Patent is a synthesized document that represents a single patent, created by merging the most up-to-date information from its various publication stages (e.g., kind codes A1, A2, B1, B2).
Specifically, VP-WPI is a specialized vertical of the WPI+ resource. It offers a unified, non-redundant view of patents by aggregating all relevant documents from the WPI test collection at the kind-code level into unified VP documents.
This collection serves as an abstraction layer over WPI, designed to:
- Simplify analysis by reducing document redundancy.
- Enhance data consistency by providing a single source of truth.
- Preserve traceability with links back to all original source documents.
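The merging idea described above can be illustrated with a minimal Python sketch. It is not the WPI+ implementation: the `PatentDoc` schema, the field names, the made-up patent identifiers, and the merge rule (the most recent non-empty value wins for each field) are all assumptions chosen for illustration. The actual data specification and creation process are documented in the WPI+ repository.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class PatentDoc:
    """One publication of a patent (hypothetical, simplified schema)."""
    patent_id: str  # shared by all publication stages of the same patent
    kind_code: str  # e.g. "A1", "B1"
    pub_date: str   # ISO date, so string order equals chronological order
    fields: dict = field(default_factory=dict)  # e.g. {"title": ...}


def build_virtual_patents(docs):
    """Merge all publications of each patent into one Virtual Patent dict."""
    by_patent = defaultdict(list)
    for doc in docs:
        by_patent[doc.patent_id].append(doc)

    virtual = {}
    for patent_id, group in by_patent.items():
        group.sort(key=lambda d: d.pub_date)  # oldest publication first
        merged = {}
        for doc in group:
            # Assumed rule: later publications overwrite earlier values,
            # but empty values never erase existing information.
            for name, value in doc.fields.items():
                if value:
                    merged[name] = value
        virtual[patent_id] = {
            "fields": merged,
            # Traceability: keep a reference to every source document.
            "sources": [f"{d.patent_id}-{d.kind_code}" for d in group],
        }
    return virtual


# Made-up example records, not taken from the collection.
docs = [
    PatentDoc("EP-1000000", "A1", "2000-06-01",
              {"title": "Widget", "abstract": "An early abstract."}),
    PatentDoc("EP-1000000", "B1", "2003-02-15",
              {"title": "Improved widget", "abstract": ""}),
    PatentDoc("EP-2000000", "A2", "2001-01-10", {"title": "Gadget"}),
]
vps = build_virtual_patents(docs)
print(vps["EP-1000000"])
```

In this toy input, the two publications of the first patent collapse into a single record that takes the title from the later B1 document, keeps the abstract from the A1 document because the B1 abstract is empty, and lists both publications under `sources`.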
Further Information
For full technical details, including collection statistics, data specifications, and the creation process, please refer to:
WPI+ Resource - Documentation & Source Code: WPI+ GitHub Repository
Resources:
VP-WPI Test Collection on TU-Wien (this page): VP-WPI Collection.
WPI Test Collection on Zenodo: WPI Test Collection.
Comprehensive Thesis (in Greek): Papadopoulos, C., MSc Thesis, International Hellenic University. https://repository.ihu.gr/handle/11544/47881.
Provider:
TU Wien
Created:
2025-11-10



