"The Open Content Alliance (OCA) represents the collaborative efforts of a group of cultural, technology, nonprofit, and governmental organizations from around the world that will help build a permanent archive of multilingual digitized text and multimedia content." - OCA website

Link to the OCA: http://www.opencontentalliance.org/

Link to the Internet Archive: http://www.archive.org/index.php

Please see Attachments for:

  • OCA Digitization Criteria
  • Music notation & OCR example

NEW PRICING GUIDELINE for 2011: http://www.archive.org/~pnguyen/pricing/index.html

Criteria for scanning

Digitization Criteria to be used by Library and IA

  • Criteria to be used for determining if the materials may be digitized are listed below and will include, but not necessarily be limited to the following:
    • Any special preservation standards that came on cart from the Library Partner.
  • Materials that have multiple titles/physical volume (eg, 63 vols, 20-40 pamphlets per vol, of "Forestry Pamphlets" without analytics) will be reviewed to ensure all proper meta data is understood.
  • Materials will be screened for size- materials not fitting the requirements shown below will be returned unscanned. 
  • Size-
    • 9.7 wide by 14.5 high at the max. Books as small as 3 inches x 3 inches may possibly be scanned.
    • less than 3 inches thick are ok, greater than 3 inches will need to be reviewed.
    • Books should on average should be 200 pages or larger. If a collection is mostly under 100 pages a review should be undertaken between IA and BLC to ensure the quoted price per scanned page can be met.
  • Book Style-
    • Side bound Monographs, no single sheets, no top bound books.
    • Rebound books need to be checked for how tight the gutter or binding is, or if the text runs outside the margin (it will show the cradle).
    • When there are more than one title within a bound book; each of these has to be clearly marked with a paper strip; each of these will be counted as a separate book. Deviations from this must be approved between IA/Library Partner.
    • Soft cover books are ok if they are bound.
    • Covers that are almost separated from book and appear too fragile may be rejected unless agreed to in advance by Library Partner.
    • Material condition-Materials should be of similar condition or quality to what would be put into circulation. Materials deemed not robust to go into circulation should be reviewed with Scanning Center Coordinator before scanning.
    • Tight bindings that will not lay open for digitization per IA specification limits will be rejected. 
  • Paper style-
    • Most paper styles can be scanned, except highly acidic paper that disintegrates to the touch. Note that if a hard-to-scan paper is to be digitized; a review of time to scan versus any ‘damage’ will be undertaken.
    • All pages should be pre-cut. Unless otherwise instructed, books with uncut pages will not be scanned.
    • Pages should be able to be lifted and turned with normal effort. Sticky pages or the like will not be scanned.
    • Pages should not be excessively dusty, have excessive mildew or be moldy.
    • Microfilmed reproductions should be reviewed with Scanning Center before being scanned. If the paper/image looks like a film negative (Xerox), these won’t be scanned. If the microfilm reproductions have more than one page on each leaf these will be rejected.
  • Gutters/Margins-
    • Any book where the text is less than a quarter inch off the gutter, on an approximate 75-degree angle will be unscannable.
    • Text that runs to the edge of the page or margin can be scanned but the presentation will be poor, as the cradle will show. The Library Partner must approve this.
  • Bibliographic data-
    • Multiple books or multi volume set-The usual problem with bibliographic info is that there is only one bib record for a set. The Library Partner will make a decision as to how this should be handled. 
    • Books that are out of approved copyright range. These will be set aside, unless otherwise agreed to by Library Partner and IA.
    • A book will be set aside when a Marc record can’t be located   when a Call ID/Book ID or equivalent is inputted.

Articles

Brewster Kahle's presentation on the Internet Archive at the TED Conference: http://blog.ted.com/2008/09/brewster_kahle.php
Wired Magazine's article and gallery about the Internet Archive: http://www.wired.com/entertainment/theweb/multimedia/2008/03/gallery_internet_archive

 Sample Batch

9/15/08: Per Beverly's email message to Ann Marie, the files from the OCA sample batch will not be loaded into DOME at this time.  Loading them will be discussed when the collections of which they are a part are scanned and loaded. 

The physical items that were scanned as part of the sample batch will be reshelved.

Review of OCA Samples

IA URLs for OCA sample scans:

 

Item

Book ID (Note 1)

Catalog System Number

Library Item Barcode

Catalog Year

Library Item Description

Call Number

Catalog Title

Catalog Author Name

Date Sent

Notes

URLs


MIT 04/11/08

1

particularsofear00sava

000258531

39080003902670

1967


HD28.M414 no.236- 67

The particulars of early change.

Savage, Charles H.

4/10/2008


http://www.archive.org/details/particularsofear00sava

MIT 04/11/08

2

scienceofstrateg00mint

000258532

39080003902662

1967


HD28.M414 no.237- 67

The science of strategy-making; managerial methods and planner programs.

Mintzberg, Henry.

4/10/2008


http://www.archive.org/details/scienceofstrateg00mint

MIT 04/11/08

3

stochasticrespon00mont

000258535

39080003902621

1967


HD28.M414 no.238- 67

A stochastic response model with application to brand choice.

Montgomery, David Bruce.

4/10/2008


http://www.archive.org/details/stochasticrespon00mont

MIT 04/11/08

4

unprogrammeddeci00soel

000258536

39080003902605

1967


HD28.M414 no.239- 67

Unprogrammed decision making.

Soelberg, Peer (Peer Olav)

4/10/2008


http://www.archive.org/details/unprogrammeddeci00soel

MIT 04/11/08

5

stochasticmodels00mont

000258537

39080003871610

1967


HD28.M414 no.240- 67

Stochastic models of consumer response.

Montgomery, David Bruce.

4/10/2008


http://www.archive.org/details/stochasticmodels00mont

MIT 04/11/08

6

noteontaxescorpo00farr

000258538

39080003871628

1967


HD28.M414 no.241- 67

Note on taxes, corporate financial policy and the cost of capital to the firm [by] Donald E. Farrar

Farrar, Donald Eugene.

4/10/2008


http://www.archive.org/details/noteontaxescorpo00farr

MIT 04/11/08

7

measuringprotect00trav

000258539

39080003871644

1967


HD28.M414 no.242- 67

Measuring protection.

Travis, William P. (William Penfield)

4/10/2008


http://www.archive.org/details/measuringprotect00trav

MIT 04/11/08

8

managementinform00zann

000259057

39080003871677

1967


HD28.M414 no.243- 67

Management information systems and the management process: new directions.

Zannetos, Zenon S.

4/10/2008


http://www.archive.org/details/managementinform00zann

MIT 04/11/08

9

implicationsofch00scot

000408861

39080005829251

1984


HD28.M414 no.1408- 83 1984b

Implications of changes in information technology for corporate strategy / Michael S. Scott Morton,

Scott Morton, Michael S.

4/10/2008


http://www.archive.org/details/implicationsofch00scot

MIT 04/11/08

10

davidtaylorrepor267usex

001327389

39080027535787

  0

no.267 (1930)

V393.R46

David Taylor report.


4/10/2008


http://www.archive.org/details/davidtaylorrepor267usex

MIT 04/11/08

11

technicalnote1962davi

001477631

39080019132494

  0

SML-760-34 (1962)

TA645.T43

Technical note / Structural Mechanics Laboratory, Ship Dynamics Division, David Taylor Model Basin.

Technical note (David W. Taylor Model Basin. Structural Mechanics Laboratory)

4/10/2008


http://www.archive.org/details/technicalnote1962davi

MIT 04/11/08

12

davidtaylorrepor189usex

001327389

39080027535928

  0

no.189 (1928)

V393.R46

David Taylor report.


4/10/2008


http://www.archive.org/details/davidtaylorrepor189usex

MIT 04/11/08

13

artprinciplesinh00clar

000416959

39080010038575

1921


NA7110.C592

Art principles in house, furniture, and village building; an exposition of designing principles whic

Clark, Arthur Bridgman, 1866-1948.

4/10/2008


http://www.archive.org/details/artprinciplesinh00clar

MIT 04/11/08

14

transactionsofam0219211922amer

000291693

39080024848894

1920

v.2 (1921/1922:Oct.-Sep.)

TN.A514

Transactions of the American Society for Steel Treating.

American Society for Steel Treating.

4/10/2008


http://www.archive.org/details/transactionsofam0219211922amer

MIT 04/11/08

15

transactionsofam0119201921amer

000291693

39080024850254

1920

v.1 (1920/1921:Oct.-Oct.)

TN.A514

Transactions of the American Society for Steel Treating.

American Society for Steel Treating.

4/10/2008


http://www.archive.org/details/transactionsofam0119201921amer

MIT 04/11/08

16

principleofrelat00eins

001020855

39080024178235

1920


QC6.E45x 1920

The principle of relativity; original papers by A. Einstein and H. Minkowski, tr. into English by M.

Einstein, Albert, 1879-1955.

4/10/2008


http://www.archive.org/details/principleofrelat00eins