Real Data Corpus - Naval Postgraduate School (2006-01-01 to 2014-12-31)
收藏Mendeley Data2024-01-31 更新2024-06-28 收录
下载链接:
https://www.impactcybertrust.org/dataset_view?idDataset=790
下载链接
链接失效反馈官方服务:
资源简介:
Real Data Corpus The Real Data Corpus (RDC) is a collection of disk images extracted from secondary storage devices that were acquired from second-hand markets around the world. In total, the RDC currently consists of 58 TiB of data contained in 3,127 disk images from 29 countries. A variety of devices are represented, including magnetic media and solid state storage from laptops, desktops, mobile phones, USB memory sticks, and other media. The dataset is hosted in the HPC infrastructure at the Naval Postgraduate School, as well as in AWS Govcloud. Potential Uses The Real Data Corpus is a one-of-a-kind scientific resource for: -Developing and validating forensic and data recovery tools. -Training students in forensics and data recovery -Developing and validating document translation software. -Exploring and characterizing real-world computing practices, configuration choices, and option settings. -Studying the storage allocation strategies of file systems under real-world conditions The RDC has been cited in over 60 articles. See our current list here. Access and Availability Please contact us if you would like access to the Real Data Corpus. In general, due to privacy concerns, we do not release copies of the data to private individuals. However, depending on the requirements of the project, we may be able to offer access through one of two methods: 1.Mediated Access. Researchers submit source code, build instructions, and detailed instructions for running their experiment. We return sanitized results. This is the most expedient option in cases where the desired experiment does not involve human subjects research. 2.Direct Access. Researchers create virtual machines on Amazon GovCloud, and these machines are granted access to the dataset. Because this method may involve direct contact with sensitive data, it involves additional review. Please be aware that due to limited staff we cannot always accommodate all requests. Efforts are underway to develop infrastructure that will allow us to meet a wider range of research requirements without unduly increasing privacy risks. For more information or if you're interested in access to the Real Data Corpus, please contract: Brittany Ramsey - Research Associate blramsey@nps.edu (831) 656-2014
创建时间:
2024-01-31



