Comment: Migrated to Confluence 4.0

...

+ Project Description/Overview
- BookReader as DSpace add-on
- BookReader is open source and freely available from the Internet Archive's
OpenLibrary (http://openlibrary.org)
- Integrating OpenLibrary BookReader functionality with Dome
(dome.mit.edu) for displaying multiple page content

+ Describe behavior and benefits of the BookReader
- OpenLibrary BookReader consists of a server and a client; see
https://github.com/openlibrary/bookreader/blob/master/BookReaderIA/inc/BookReader.inc
for more background
- performs functions such as single-page, two-page, and multi-page view,
page-turning, zoom in/out, pan, jump to page, and full-text search; features
listed at http://openlibrary.org/dev/docs/bookreader:
Single-Page, Two-page, and Thumbnail view
Zoom
Right-to-left page progression (e.g. for Yiddish and Chinese)
Full-text search with highlighting of search results
Support for foldouts and variable page size
In-Browser Text-To-Speech
Embeddable
Bookmark-friendly URLs
Works with a variety of image servers, or a simple directory of images
Simple access control
- Typically used in conjunction with JPEG files that the client gets from
a JP2 image server, such as Djatoka (which knows how to most efficiently
handle JP2-to-JPEG conversion), or static files on a local or remote
filesystem
- BookReader client displays JPEG or PNG images
- BookReader client can consume BookReader files anywhere, even remote
- Either as files 'published' from a JP2 image server (Djatoka), files
sitting on the file system, or files at a remote URL
- Describe what types of content BookReader could be useful for...
[ fill in the blanks... ]
- More and more digital library web applications are using 'page-turner'
type functionality to display complex, multi-page content. For example,
documents may contain images and text, benefit from additional
navigational aids because of their length (e.g. hundreds of pages), or have
a complex structure where the user benefits from being able to move easily
and quickly from one part of the document to the next (the document does
not need to be downloaded in its entirety in order to view the document
'map').
- There are a few existing DSpace models, such as the Brasiliana site
(http://www.brasiliana.usp.br/bbd), that have made admirable progress
towards integrating BookReader with DSpace in some fashion
- More on Brasiliana site: the BookReader has been modified to work with
TIFF and PDF files rather than only with JP2 files; it also directly
accesses certain DSpace programming 'hooks' in a way that may be considered
'counter' to recommended practice
- BookReader functionality not needed for all items in all collections

+ MIT Content Questions
+ Identify collections and/or groups of individual items that would
benefit from a 'book reader' view
- Collections currently under consideration include:
- Future projects: track potential content that would benefit from a
bookreader view?
+ List characteristics that render an item (or collection of items)
suitable for JP2 conversion
- Multi-page (navigation would be enhanced by ability to rapidly move
around the document)
- why isn't PDF necessarily good enough?

- mimics the natural way of browsing through a physical document
- complex formatting: documents may contain text, images, hand-written
notes
- zoomable content (illustrations, small details of printed text,
handwritten notes, fragments, anything where the user might want a closer
view)
- What kinds of text-only documents should be included as suitable?
- Does an all-text document qualify (e.g. see President's Reports below)?
- Retrospective conversion of older material to JP2, or are we only
talking about using it for future projects?
+ Edgerton Books(?) and Notebooks
- brief description: approx. 36 notebooks, 400 dpi, ca. 45-50 MB .tif files;
.pdf and .tif master scans
- mixed text, some typed, some handwritten notes, sketches, photographs
- cataloged metadata in Archivists' Toolkit
- See current MIT Edgerton site:
http://edgerton-digital-collections.org/ where page-turning view has
already been implemented:
http://edgerton-digital-collections.org/notebooks/11

...

+ Vail Balloon Collection
- general description: newspapers, broadsides, and other material on
ballooning in the 19th century
- approx. 1331 .tif files, 1 - 3 pages per item, 300 dpi, approx. 46 MB
per page, PDF for each item
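
As a rough sizing aid, the figures above can be turned into a back-of-the-envelope storage estimate. The sketch below assumes that each of the approx. 1331 .tif files is a single page of about 46 MB and uses decimal units (1 GB = 1000 MB); both are assumptions for illustration, not measurements of the collection.

```python
# Back-of-the-envelope size of the Vail Balloon TIFF masters.
# Assumption: one .tif file per page, about 46 MB each (figures from the notes).


def estimate_total_gb(file_count: int, mb_per_file: float) -> float:
    """Return the estimated total size in decimal gigabytes (1 GB = 1000 MB)."""
    return file_count * mb_per_file / 1000


vail_total_gb = estimate_total_gb(1331, 46)
print(f"Vail Balloon TIFF masters: roughly {vail_total_gb:.1f} GB")
```

The same function could be reused for other collections once page counts and per-page sizes are known.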

+ Misc. digital objects from Institute Archives collection(s)
- These may be single instances of unique objects
+ Other collections? [ Ask Beverly ]

+ MIT Course catalogs (historical to present; scanned from microfiche to
born digital)
- more description

...

+ President's Reports (70 - 700 pages, text only), more recent reports
are very large in size and
- description

+ JPEG 2000
+ Working with JP2
- JP2 not suitable for all types of scanned images... [ describe what
we mean ]
- What are the criteria for identifying and analyzing collections or
individual images that have the appropriate characteristics for conversion?
- structural criteria
- other criteria
- benefits
- drawbacks
- system-wide workflow implications of using JP2?

+ Converting TIFFs to JP2
- Work with Jenn Morris to test creating JP2 files
- JP2 conversion can be more complicated because of the many parameters
that need to be set (or can be set?) in order to do a successful conversion.
How expert do we have to be to generate JP2s to begin testing?
- conversion parameters
- tiling considerations
- what about pages that don't need tiling (is tiling optional?)
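
To get a feel for the tiling question, the arithmetic below counts how many tiles a page would be divided into for a given tile size. The page dimensions (an 8.5 x 11 inch page scanned at 400 dpi) and the 1024-pixel tile size are illustrative assumptions only; they are not taken from any MIT collection or from any encoder's recommended settings.

```python
import math


def tile_grid(width_px: int, height_px: int, tile_px: int) -> tuple[int, int]:
    """Return (columns, rows) of square tiles needed to cover the image.

    Partial tiles at the right and bottom edges count as whole tiles.
    """
    return math.ceil(width_px / tile_px), math.ceil(height_px / tile_px)


# Hypothetical page: 8.5 x 11 inches at 400 dpi.
width_px, height_px = int(8.5 * 400), 11 * 400  # 3400 x 4400 pixels
cols, rows = tile_grid(width_px, height_px, tile_px=1024)
print(f"{cols} x {rows} = {cols * rows} tiles")

# A tile at least as large as the page yields a single tile, i.e. no tiling.
print(tile_grid(width_px, height_px, tile_px=8192))
```

In the JPEG 2000 model, an image encoded without tiling is treated as one tile covering the whole image, which corresponds to the last case; whether small pages are better left untiled is still the open question noted above.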

...