ratarmount indexes for PMC OpenAccess subset
收藏academictorrents.com2025-01-22 收录
下载链接:
https://academictorrents.com/details/e95526a0bc4f39a5bbf423b24708d65fa4542d20
下载链接
链接失效反馈官方服务:
资源简介:
## the problem PMC Open Access bulk article (commercial and non-commercial) is a hefty set of files that weight in compressed at 79G and uncompressed at 388G. Archive decompression time in itself can take hours. A bittorrent mirror exists on: ## the solution ratarmount (), a python application, allows us to use FUSE (through fusepy) to mount a compressed archive as a disk, allowing us randomly access files in the archive as a disk without first decompression. To achieve good performance, it creates an index (an sqlite database per archive). This set of indexes still weight in at 1.4G uncompressed (345M compressed). ## usage * decompress all indexes in the same directory you ve downloaded oa_bulk * install ratarmount * use ratarmount to mount the oa_bulk archives on the disk a sample script mount.sh is provided as an example ## distribution we also use bittorrent to d
该PMC开放获取批量文章(包括商业和非商业用途)构成了一个庞大的文件集合,其压缩后体积为79G,解压缩后体积达388G。仅解压缩过程本身就可能耗时数小时。在指定的镜像站点上存在一个BitTorrent镜像:
解决方案方面,ratarmount(一个Python应用程序)允许我们通过FUSE(借助fusepy)将压缩存档挂载为磁盘,从而实现在不解压缩的情况下,以磁盘的形式随机访问存档中的文件。为了实现良好的性能,ratarmount会创建一个索引(每个存档对应一个sqlite数据库)。这套索引在不压缩的情况下体积为1.4G(压缩后为345M)。
使用说明:
* 在您下载的oa_bulk相同目录下解压缩所有索引
* 安装ratarmount
* 使用ratarmount将oa_bulk存档挂载到磁盘上。一个示例脚本mount.sh已提供。
分发方式:我们同样使用BitTorrent进行数据分发。
提供机构:
academictorrents.com



