Reviewing for restrictions in digital material is mostly the same as reviewing physical material except there is software that we use to search the files for restrictions as well.
Reports from the transfer workflow
During the digital transfer workflow when using CCA tools, a tool is run in the background called bulk_extractor that will look for PII, social security numbers, and keywords related to MIT restrictions within the text of files in the transfer. This does not work on audio, video, images, and some types of files (PDFs of scanned files, etc.). You can review the output of this tool using the Bulk Reviewer software (see section below).
Bulk Reviewer all requires use of BitCurator, a customized Linux operating system, that we run on a virtual machine through Windows. To access do the following steps:
Click on the Oracle Virtual Box shortcut on the Windows desktop screen
Virtual Box will open, click on the BitCurator with the latest version number after it
Wait for the operating system to load. Once you see a screen with a blue background, BitCurator is ready to use.
Bulk Reviewer (pre-existing reports)
- Click on the Bulk Reviewer shortcut on the BitCurator Desktop.
Archivematica
In addition to bulk reviewer, you can look choose the "examine contents" option when transfer the material through Archivematica. This won't cover as much mainly surfacing credit card numbers, social security numbers, telephone numbers, and email addresses through their interface. More information can be found in the appraisal tab section of the Archivematica documentation.
Creating and reviewing new reports
If you did not use one of the CCA tools, you can create reports for looking for restricted materials by running reports directly through Bulk Reviewer.