File group profile dataset
收藏Mendeley Data2024-01-31 更新2024-06-28 收录
下载链接:
https://figshare.com/articles/dataset/File_group_profile_dataset/1297321
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains file group profiles for specific combinations of file formats and their properties. Combinations are defined by the Analysis Groups dataset (X). Currently three different sets are available. PDF_filtered covers PDF file format versions. Years covered are 1996-2010. HTML_filtered coveres HTML and XHTML file format versions. Years covered are 1996-2010. Distiller_filtered coveres 9 major releases of Adobe Distiller. Years covered are 1996-2010. This dataset is an intermediary result of the analysis workflow (https://github.com/datascience/FormatAnalysis). It is created by filtering and conflict reduction of the Format profile dataset from the UK web archive (DOI: 10.5259/ukwa.ds.2/fmt/1). First 2 columns cover combinations of property value pairs for different properties (mime type and software for Distiller_filtered and mie type and version for HTML_filtered and PDF_filtered). Last two columns cover year and amount of files with defined combination of properties that were available in a given dataset (UK web archive format profile) at specific year.
创建时间:
2024-01-31



