common-pile/uspto
收藏Hugging Face2025-06-06 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/common-pile/uspto
下载链接
链接失效反馈官方服务:
资源简介:
USPTO数据集包含了来自美国专利和商标局(USPTO)的大约20294152个专利文档,这些文档是从Google Patents Public Data dataset中获取的。文档时间跨度从1782年至今,所有文档均为政府作品进入公共领域。数据集提供了经过处理的清洁文本,并保留了标准化的专利文档结构,同时将数学表达式和方程式转换为LaTeX格式。
The USPTO dataset includes approximately 20,294,152 patent documents from the United States Patent and Trademark Office (USPTO), sourced from the Google Patents Public Data dataset. The documents span from 1782 to the present, all of which are in the public domain as government works. The dataset provides clean, processed text while preserving the standardized structure of patent documents, with mathematical expressions and equations converted into LaTeX format.
提供机构:
common-pile



