SUCHO Ukrainian Cultural Heritage Web Archives
收藏registry.opendata.aws2025-03-23 收录
下载链接:
https://registry.opendata.aws/sucho/
下载链接
链接失效反馈官方服务:
资源简介:
The dataset contains web archives of Open Access collections of digitised cultural heritage from more than 3,000+ websites of Ukrainian cultural institutions, such as museums, libraries or archives. The web archives have been produced by SUCHO, which is a volunteer group of more than 1,300 international cultural heritage professionals – librarians, archivists, researchers, programmers - who have joined forces to save as much digitised cultural heritage during the 2022 invasion of Ukraine before the servers hosting them get destroyed, damaged or go offline for any other reason. The web archives were created using the tools of the Webrecorder Open Source project in the open WACZ format: <a href="https://webrecorder.github.io/wacz-spec/1.1.1/">https://webrecorder.github.io/wacz-spec/1.1.1/</a>. WACZ files are zipped containers of WARC (Web Archive Format) files enriched with metadata, which can contain several crawls in a single file. The file sizes can range from a few MBs to several TBs.
本数据集收录了乌克兰文化机构,如博物馆、图书馆或档案馆等超过3,000+个网站上的开放获取数字化文化遗产的网络存档。这些网络存档由SUCHO团队制作,该团队由超过1,300名国际文化遗产专业人士——图书馆员、档案管理员、研究人员、程序员等——组成,他们携手合作,在2022年乌克兰入侵期间,在服务器被摧毁、损坏或因其他原因离线之前,尽可能地保存了尽可能多的数字化文化遗产。网络存档采用Webrecorder开源项目的工具,以开放的WACZ格式创建:<a href="https://webrecorder.github.io/wacz-spec/1.1.1/">https://webrecorder.github.io/wacz-spec/1.1.1/</a>。WACZ文件是包含WARC(网络存档格式)文件的压缩容器,并富含元数据,单一文件中可以包含多个爬取数据。文件大小范围从数MB到数TB不等。
提供机构:
registry.opendata.aws



