Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Reviewed Oct. 2020

Information gathered in July 2008:

BCR-CDP's Digital Imaging Best Practices, version 2.0
http://www.bcr.org/cdp/best/index.html

University of Michigan's Digital Image Specifications.
http://www.lib.umich.edu/lit/dlps/dcs/UMichDigitizationSpecifications20070501.pdf

  1. (Yale) use the bibliographic record identification number for the specific resource to be scanned that is found in the MARC record for the title in your OPAC.   Most of the materials that we are digitally reformatting are cataloged in our OPAC.  Call numbers can change, several books can have the same title, and using truncated titles for file names frequently don't offer much information.  The bibliographic record number is unique, does not change, and we use this as the persistent identifier for the files.  Also, data from OPACs already have a fairly reliable track record of being migrated into the future.
  2. The New York State Library employs such a convention, with OCLC number or local control number being the major portion of most file names (not all items imaged have been cataloged).  Some of our larger items are scanned in parts, some imaging equipment saves raw scans at one file per page (later combined for a use copy), and some of our imaged titles are multi-part sets or serials.  And so while the bib record identification number is a good start, we necessarily create file names that may include Volume, Number, Year, Month, Day, Part, Page, etc.  
  3. (Sibley/Univ of Rochester) We base our file names on the bar code assigned to the copy we digitize, since it is logically and permanently linked both to the specific copy we digitized (we retain the original with its bar code attached to the housing) and to the bib record.  
  4. We also use our bib identifier for most special and older projects. The materials being scanned by (or by us for) the Google Michigan Digitization Project use the barcode attached to the bib record.
  5. I posted a reply to this on the ARSC Associated Audio Archivists Message Board because it was easier to format that way. Here's the link:
    http://arsc-aaa.invisionzone.com/forums/index.php?showtopic=38
  6. it is extremely important to document the filenaming scheme some where else as well.  I keep a running word document for each collection we load with a written explanation of the file naming scheme and why we chose it.  It is really for in-house use and for future employees.
  7. (Indiana Univ) The Archives of Traditional Music updated its file naming scheme in 2006, working with our Digital Library Program which was simultaneously developing the recommendations presented by the first link in Nancy's message, below. You can see our implementation for audio files in chapter 3 of the publication Sound Directions: Best Practices for Audio Preservation, available at  http://www.dlib.indiana.edu/projects/sounddirections/papersPresent/index.shtml
  8. (Rochester?) When I was researching file naming some time ago, I had bookmarked these pages (I found the first to be particularly helpful):
    http://wiki.dlib.indiana.edu/confluence/display/INF/Filename+Requirements+for+Digital+Objects
    http://www.archives.gov/preservation/technical/guidelines.pdf (see page 60)
    http://www.controlledvocabulary.com/imagedatabases/filename_limits.html
    http://edocs.lib.sfu.ca/projects/Doukhobor-Collection/technical.html
    http://staffweb.library.northwestern.edu/dl/adhocdigitization/storage/
  9. Here at the Northeast Document Conservation Center, we use the following:
    • for our in-house client work: first five letters of client name, plus date stamp (yymmdd), plus workstation code(a,b,c,d,etc.), plus four digit ascension number (three digit number with leading zero)
    • for our digital conservation treatment photography: client job number (without spaces or points), department code (p, b, i, f), job sub number, status code (bt, dt, at), leading zero and three digit ascension number.

...