This is a non-IMPACT record, meaning that access to the data is not controlled by IMPACT. For access, see the directions below.

Disclaimer:
This Resource is offered and provided outside of the IMPACT mediation framework. IMPACT and the IMPACT Coordination Council/Blackfire Technology, Inc. expressly disclaim all conditions, representations and warranties including but not limited to Resource availability, quality, accuracy, non-infringement, and non-interference. All Resource information and access is controlled by entities and under terms that are external to the IMPACT legal framework.

Summary

DS-0790
Real Data Corpus - Naval Postgraduate School
External Dataset
Naval Postgraduate School
Naval Postgraduate School
01/01/2006
12/31/2014
50 (lowest rank is 50)

Category & Restrictions

Generic Network/Behavior Data
forensics
Restricted
true

Please see provider for details

Description


The Real Data Corpus (RDC) is a collection of disk images extracted from secondary storage devices that were acquired from second-hand markets around the world. In total, the RDC currently consists of 58 TiB of data contained in 3,127 disk images from 29 countries.

Real Data Corpus

The Real Data Corpus (RDC) is a collection of disk images extracted from secondary storage devices that were acquired from second-hand markets around the world. In total, the RDC currently consists of 58 TiB of data contained in 3,127 disk images from 29 countries. A variety of devices are represented, including magnetic media and solid state storage from laptops, desktops, mobile phones, USB memory sticks, and other media. The dataset is hosted in the HPC infrastructure at the Naval Postgraduate School, as well as in AWS Govcloud.

Potential Uses

The Real Data Corpus is a one-of-a-kind scientific resource for:
-Developing and validating forensic and data recovery tools.
-Training students in forensics and data recovery
-Developing and validating document translation software.
-Exploring and characterizing real-world computing practices, configuration choices, and option settings.
-Studying the storage allocation strategies of file systems under real-world conditions

The RDC has been cited in over 60 articles. See our current list here. Access and Availability

Please contact us if you would like access to the Real Data Corpus. In general, due to privacy concerns, we do not release copies of the data to private individuals. However, depending on the requirements of the project, we may be able to offer access through one of two methods: 1.Mediated Access. Researchers submit source code, build instructions, and detailed instructions for running their experiment. We return sanitized results. This is the most expedient option in cases where the desired experiment does not involve human subjects research.
2.Direct Access. Researchers create virtual machines on Amazon GovCloud, and these machines are granted access to the dataset. Because this method may involve direct contact with sensitive data, it involves additional review.

Please be aware that due to limited staff we cannot always accommodate all requests. Efforts are underway to develop infrastructure that will allow us to meet a wider range of research requirements without unduly increasing privacy risks. For more information or if you're interested in access to the Real Data Corpus, please contract:

Brittany Ramsey - Research Associate

blramsey@nps.edu (831) 656-2014

Additional Details

1.0TB
false
transaction processing, backup, public universities and colleges in california, computer buses, usb, information sensitivity, translation software, classes of computers, amazon web services, file system, laptop, 790, solid state computer storage, computer data storage, data recovery, secrecy, computer architecture, storage software, text, physical layer protocols, peripheral, solid state storage, videotelephony, virtualization software, data security, source code, personal computing, computer assisted translation, real data corpus - naval postgraduate school, usb flash drive, personal computer, amazon, naval postgraduate school, virtual machine, mobile phone, translation