five

Lessons learned from twelve years’ operation of the Web Archiving Project (WARP)

收藏
IFLA Repository2025-11-19 更新2026-05-16 收录
下载链接:
https://repository.ifla.org/items/29e4c1a0-8aa1-4391-94ac-2f85684ea033
下载链接
链接失效反馈
官方服务:
资源简介:
The National Diet Library (NDL) has been operating the Web ARchiving Project (WARP) since 2002, to collect and keep available for future access websites published in Japan. This paper describes the purpose of, history behind, and system used for this project, and introduces actual case studies to demonstrate the challenges faced in fulfilling the potential of this project. WARP has been attempting to create a comprehensive archive of websites published by public agencies in Japan, as prescribed in the 2010 revision of the NDL Law. It also archives, with permission of the publishers, the websites of private universities, websites promoting cultural or international events held in Japan, and websites related to the Great East Japan Earthquake. As of March 2015, the archived content reached 85,764 items, comprising 533 TB of data and 3.1 billion files. WARP was created using Open Source Software (OSS), such as Heritrix, Wayback and Solr, with some original software and user interfaces. Publications significant for public use, which are included in the collected websites, are cataloged individually, and made accessible together with other digitized materials. WARP metadata can also be searchable via other integrated search services. Some public agencies even guide their users to WARP in order to ensure access to older information that is no longer available on their own websites. Since it does not seem practicable for individual public libraries in Japan to conduct web archiving on their own, the NDL will take a step further in promoting WARP within the framework of digital resource sharing programs. We consider this an important part of the NDL’s mission as a national library responsible for disseminating cultural heritage through configuration of platforms and networks for digital resource sharing.
提供机构:
International Federation of Library Associations and Institutions
创建时间:
2025-09-24
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作