Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Comment: Migration of unmigrated content due to installation of a new plugin

...

If you have made changes to the package created with CCA tools (from logical transfer or disk image extraction), such as deleting or changing organization of the packagethe files in the "objects" folder, you will have to make changes to the appropriate checksum.md5 file within the SIP. Making sure it is accurate ensures that Archivematica will be able to check all the files transferred correctly. If not, proceed to step 4.

  1. Copy the checksum.md5 file into the submissionDocumentation folder and rename it original_checksum.md5
  2. Return to the checksum.md5 file in the metadata folder and open it in a text editor to remove the lines for the files or directories that have been deleted.
  3. Edit the paths listed for items that you have moved since initial package creation so they reflect their current location.
  4. If you have not done so, record your rearrangement and appraisal actions in the processing information note in the finding aid. (link)

Adding required metadata files to the package

Through Archivematica has we can use specific metadata files that can be added to in the transfer package so metadata is transferred correctly with the package. If you have analyzed the transfer files already for possible restricted content and found that there is a restriction, proceed to the Creating and adding the rights restriction file section. If you have found no restrictions you can proceed directly to the Creating and adding the metadata file section.

Creating and adding the rights restriction file 

Adding information about restrictions is done through a CSV file called "rights.csv" and is required if you have found restricted content. DDC uses this file for the options included for recording restrictions but there are also options for actual usage rights (licenses, copyright, etc.), hence the name of the file. 

There is a template included in the Transfer folder that you can edit for your needs, which is titled "rights.csv", this is the same as what the file will be named that you include in the package. The template includes three rows; an example of a restriction for the full package, an example of a restriction for a folder, and an example for a restriction of an individual item.

Warning

The rights csv only works on the file level at present. If not at the file level, the Archivematica transfer will fail. We are hoping to get this changed and therefore have left the other examples in place. You can describe restrictions/rights later at the full-package level through the Archivematica dashboard following step 5b under Ingest in this documentation.

The three examples also are example examples of the three most common basis types of restriction; policy, statute, and donor ("other" is also an option in rare cases where a restriction does not fit those categories). In the event there is a copyright or license situation that should be noted, you would also have an entry for copyright or license. Below are the fields used in the csv CSV with descriptions for each, images come from the example template.

  1. file: the path that the package/folder/file is located. If describing a restriction of the entire package, this will always be just "objects". In the case of folders or files you will list objects/ first and then the rest of the path. If your content is in a bag, you will need to put /data instead of /objects for folders and files so Archivematica looks in the correct place.
  2. basis: the basis of restriction, there are three choices we generally use: policy, statute. and donor ("other" is also an option in rare cases where a restriction does not fit those categories). In the event there is a copyright or license situation that should be noted, you would also have an entry for copyright or license.
  3. status: his field is only used for copyrighted content. This should always be "copyrighted" for copyrighted content.
  4. determination_date: the date you determined the statute applied to these records, most likely the present date. It is in YYYY-MM-DD format. This only used for restrictions with the basis: statute or copyright. It will allow us in the future to determine what version of the statute was in place at the time of the determination.
    Image Added
  5. jurisdiction: the jurisdiction where the statute or copyright applies, it is not used for the other restriction types. This will be entered using ISO 3166 alpha-2 country codes or subdivision code (often for states), most often being used are "us" for United States  or "us-ma" for Massachusetts.
  6. start_date: the start date of the restriction, if on the package or folder level, the oldest dates of a restricted file within the grouping. The start date of the copyright or license will be determined by relevant copyright law and/or donor or license agreement.
  7. end_date: the end date of the restriction often the restriction time period after the start date (the start date plus 50 years, for instance). If on the package or folder level, this will be the date plus the restriction period of the newest file with a restriction (newest file date plus 50 years, for instance). If the restriction end date is based on a donor policy reliant on an unknown end date (for instance the death of the creator), then this field should be left blank. The end date of the copyright or license will be determined by relevant copyright law and/or donor or license agreement.
  8.  terms: terms is only used when describing a license. This may be when a donor retains copyright to some/all of their collection but assigns a license for use to the content. The most common being Creative Commons licenses. For instance: "This content is licensed under the Creative Commons Attribution-Share Alike 3.0 Unported license"
    citation: the citation to the statute relevant to this descriptionrestriction. This is only used with basis: statute. The most often used are FERPA, Massachusetts MA 201 CMR subsection 17,  and HIPAA.
  9. note: 
  10. grant_act
  11. grant_restriction
  12. grant_start_date
  13. grant_end_date
  14. section 17, HIPAA, Attorney-client privilege, and Legal work product.
    Image Added
  15. note: the description of the restriction based on restriction type. For example, "student records, restricted for 75 years". In regards to copyright and license, note any additional information as necessary.
    Image Added
  16. grant_act: indicates the type of preservation repository action being described. For restricted content, this will always be "disseminate". For license/copyright it would be "use".
  17. grant_restriction: indicates whether the action in step 9 is allowed or not. For restricted content, this will always be "Disallow". For license/copyright this can be Allow, Disallow, or Conditional depending on the terms.
  18. grant_start_date: the same date listed in the first start_date field above.
  19. grant_end_date: the same date listed in the first end_date field above.
    Image Added
  20. When finished, save a copy of file as a CSV in the “metadata” folder within the package you are transferring in Archivematica.
     

    Info
    titleNote

    If you have a bag where the “metadata” folder was created within the “data” folder or if you have not determined the restriction status because you are waiting to analyze the files in Archivematica, adding the restrictions metadata will wait until a later step in Archivematica and would only be for the entire package (see Archivematica processing step 5b under Ingest). In the case of this type of bag, adding it to the metadata folder now would cause the bag to be invalid.


  21. Move onto the Creating and adding the metadata file section.grant_note

Creating and adding the metadata file

If you have analyzed the files already for possible restricted content and/or have determined what level of digital preservation commitment these files require, you should create a metadata csv file with this information to be included with the package(s) going through Archivematica. If you have not analyzed the files yet, go to the section above. If you are just transferring the files into Archivematica Backlog for processing later, continue to step 2.

  1. There is a template included in the Transfer folder that you can edit for your needs, which is titled metadata.csv, this is the same as what the file will be named that you include in the package. 
  2. Open the template, there should be at least two heading columns.
  3. Column 1 is the “filename”, this should always be “objects”. 
  4. Column 2 is the “Level_of_DPCommitment” which should be the appropriate level for the specific package, such as “Level 3”.
  5. If you are putting through multiple packages (each with it’s own metadata file) as part of the same transfer (e.g. 2020_029_001, 2020_029_002, etc.), you will need a third column called “dc“dcterms.isPartOf”. In that field you will enter the identifier for the group of packages, adding “AIC#” to the beginning. For instance, “AIC#2020_029acc”.
    Image Modified
  6. Save the file as a copy in the “metadata” folder within the package you are transferring in Archivematica.

    Info
    titleNote

    If you have a bag where the “metadata” folder was created within the “data” folder or if you have not determined the level of digital preservation commitment because you are waiting to analyze the files in Archivematica, adding the CSV metadata will wait until a later step in Archivematica (see Archivematica processing step 6 5a under Ingest). (link) In the case of this type of the bag, adding it to the metadata folder now would cause the bag to be invalid.

     

Final steps

...

Open Archivematica and click “Browse” and make sure the folder is available. Now you can proceed to the Archivematica processing steps

...

.

  1.