Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: clarify that pre-existing reports unlikely to be present

...

During the digital transfer workflow when using CCA tools, a tool can be run in the background called bulk_extractor that will look for PII, social security numbers, and keywords related to MIT restrictions within the text of files in the transfer. This does not work on audio, video, images, and some types of files (PDFs of scanned files, etc.). You can review the output of this tool using the Bulk Reviewer software (see Bulk Reviewer (pre-existing reports) section below) (most likely will not exist from non disk image CCA tools as they need to be updated for this to work). You can also run bulk_extractor through the Bulk Reviewer if you did use one of the CCA tools or chose not to create reports at time of transfer, as outlined in the Creating new reports section.

...

During the digital transfer workflow when using CCA tools, sometimes restricted content reports will be generated (most likely will not exist from non disk image CCA tools as they need to be updated for this to work, currently can only get them by running a modified version for disk images on the command line).You can review the output of this tool using the Bulk Reviewer software (see section below).

...

In addition to bulk reviewer, you can look choose the "examine contents" option when transfer the material through Archivematica. This won't cover as much mainly surfacing credit card numbers, social security numbers, telephone numbers, and email addresses through their interface. More information can be found in the appraisal tab section of the Archivematica documentation. (link)

Creating new reports

If you did not use one of the CCA tools or chose not to create reports at time of transfer, you can create reports for looking for restricted materials by running reports directly through Bulk Reviewer.

...