Národní knihovna CR
Albertina icome Praha

Contents:
1. Categories of DOBM files
2. Structure of the BIBLDESCR file
3. Structure of the TECHDESCR file
4. Structure of the BOOK file
5. Structure of the PAGE file
6. Other statements
7. Literature:

 

This proposal has been prepared to enable the digitization of damaged and endangered modern books, so that they can be replaced by their digital copies, which are then used for library services if the books are out of print and there is no problem with copyright.

For this purpose, NKP//MANUSCRIPT 2.1, the concrete application of the DOBM format for manuscripts and old printed books, will be adapted as the specification NKP//MANUSCRIPT 2.2.

1. Categories of DOBM files

The metadata files are divided into five categories:

  1. BIBLDESCR (Bibliographic Description)
     Bibliographic description of the document with cataloguing data. From here significant references are made to the file of the TECHDESCR category and the file of the BOOK category.

  2. TECHDESCR (Technical Description)
     A file with the description of the digitization process.

  3. BOOK (Book)
     This file represents the book; it can include its contents. Significant references are made from here to individual pages (files of the PAGE category).

  4. PAGE (Page)
     This file represents one page of the book. It contains a preview image and it can also contain the description of the page. References to data files of the digital copies of pages are made from here. The data files are divided into categories following the recommended quality levels of the digital images.
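
The references between the categories can be sketched as a small data model. The Python below is only an illustration: the class names, field names, and file names are hypothetical and are not part of the DOBM specification. It mirrors only what is stated above (BIBLDESCR refers to TECHDESCR and BOOK, BOOK refers to PAGE files, and each PAGE refers to image data files).

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Page:
    """One PAGE file: preview image, optional description, image references."""
    preview: str                       # hypothetical file name of the preview image
    description: Optional[str] = None
    images: Dict[str, str] = field(default_factory=dict)  # quality tag -> data file


@dataclass
class Book:
    """The BOOK file: represents the book and refers to its PAGE files."""
    contents: Optional[str] = None
    pages: List[Page] = field(default_factory=list)


@dataclass
class TechDescr:
    """The TECHDESCR file: description of the digitization process."""
    capture: str


@dataclass
class BiblDescr:
    """The BIBLDESCR file: bibliographic statements plus references onward."""
    statements: Dict[str, str]
    techdescr: TechDescr
    book: Book


def count_metadata_files(root: BiblDescr) -> int:
    """BIBLDESCR + TECHDESCR + BOOK, plus one PAGE file per page."""
    return 3 + len(root.book.pages)


# Made-up example values, for illustration only.
record = BiblDescr(
    statements={"MAINTTL": "Example title"},
    techdescr=TechDescr(capture="example capture notes"),
    book=Book(pages=[
        Page(preview="p001_preview.jpg", images={"NORMALQ": "p001_normal.jpg"}),
        Page(preview="p002_preview.jpg"),
    ]),
)
print(count_metadata_files(record))  # 5
```

A book with two pages therefore consists of five metadata files in this model, in addition to the image data files that the pages refer to.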


2. Structure of the BIBLDESCR file

The file of the BIBLDESCR category is the root of the tree structure and it covers the digital copy of the book. The BIBLDESCR category contains the bibliographic description. The list of the categories of the bibliographic record is as follows:

AACR2 - this marks the statements whose origin is in the AACR2 standard
doc - the values of the statements are in the language of the described document
orig - the value is in the language of the original document (in case we digitize a translation)
descr - the values of the statements are in English or in the cataloguing language

 

Tag          Label                              Markers      Description
MAINTTL      Title                              AACR2 doc    Main title (mandatory)
OTHERTTL     Other Titles                       AACR2 doc    Other information about the title
ORIGTTL      Original Title                     AACR2 orig   Title of the original
FSTOFRESP    Author                             AACR2 orig   First statement of responsibility (author)
OTHSTOFRESP  Other statement of responsibility  AACR2 orig   Other statement of responsibility
EDITST       Edition                            AACR2 doc    Edition (1, 2, etc.)
GMD          Type of Document                   AACR2 descr  General material designation (type of the document), e.g. manuscript, periodical (mandatory)
LANGDOC      Language of the Document           AACR2 descr  Language of the document
LANGORIG     Language of the Original           AACR2 descr  Language of the original
PBLSHER      Publisher                          AACR2 doc    Publisher
PLACEPBL     Place of Publication               AACR2 doc    Place of publication
DATOFPUBL    Datation                           AACR2 descr  Date of publication (mandatory)
SERIES       Series                             AACR2 doc    Series
PRINTER      Printer                            AACR2 doc    Printer
PLACEPRT     Place of Printing                  AACR2 doc    Place of printing
PHYSDESCR    Physical Description               AACR2 descr  Physical description (mandatory)
PHYSDESCR can contain these more detailed statements:
SIZE         Size                               descr        Size
EXTENT       Extent                             descr        Extent (number of pages or folios)
ATTMAT       Attached Material                  orig/descr   Attached material
DEFECTS      Defects                            descr        Discovered defects
ANNOT        Annotation                         descr        Annotation (mandatory)
ANNOT can contain these more detailed statements:
BASINF       Basic Information                  descr        Basic information
CONTENTS     Contents                           doc/descr    List of contents
INDEX        Index                              doc/descr    Index of important places in the document
LIT          Literature                         doc/descr    Literature/bibliography
SHELFNO      Shelf-number                       descr        Shelf-number
LIBRARY      Library                            descr        Place of storage
ACCESSIB     Accessibility                      descr        Accessibility/availability
KEYWORD      Keyword                            descr        Keyword
UDC          UDC                                descr        UDC (Universal Decimal Classification)
ISBN         ISBN                               AACR2 descr  ISBN
NOTES        Notes                              AACR2 descr  Notes
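
A check for the statements marked as mandatory above (MAINTTL, GMD, DATOFPUBL, PHYSDESCR, ANNOT) might look like the following sketch. It assumes, purely for illustration, that a BIBLDESCR record has already been read into a Python dictionary keyed by statement tag. The specification does not prescribe this representation, and the function names and example values are hypothetical.

```python
# Tags marked "(mandatory)" in the list of BIBLDESCR statements.
MANDATORY = ("MAINTTL", "GMD", "DATOFPUBL", "PHYSDESCR", "ANNOT")


def is_filled(value) -> bool:
    """True if a statement has content (non-blank text or a non-empty container)."""
    if isinstance(value, str):
        return bool(value.strip())
    return bool(value)


def missing_mandatory(statements: dict) -> list:
    """Return the mandatory tags that are absent or empty, in the order above."""
    return [tag for tag in MANDATORY if not is_filled(statements.get(tag))]


# Made-up record, for illustration only.
example = {
    "MAINTTL": "Example title",
    "GMD": "book",
    "DATOFPUBL": "1925",
    "PHYSDESCR": {"SIZE": "", "EXTENT": "120 pages"},  # detailed statements as a nested dict
    "ANNOT": "   ",                                    # blank text counts as missing
}
print(missing_mandatory(example))  # ['ANNOT']
```

Representing PHYSDESCR and ANNOT as nested dictionaries is one possible way to hold their more detailed statements; a flat layout would work equally well with the same check.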

 

3. Structure of the TECHDESCR file

The file of the TECHDESCR category contains technical information about how the original was digitized: description of technological devices used, original resolution, or information about intermediate formats used for the production of final digital data files. The TECHDESCR category contains this information in one category labelled as shown below:

CAPTURE      Image Capturing Data               Technical data about digitization

 

4. Structure of the BOOK file

The BOOK category file represents a book; it contains no other statements. From here, a reference to the gallery of individual pages can be made.

5. Structure of the PAGE file

Each file of the PAGE category represents one page of the book. The PAGE file can contain various statements as well as references to images of various quality levels:

  • GALLERYQ
    Gallery Quality Picture
  • PREVIEWQ
    Preview Quality Picture
  • NORMALQ
    Normal Quality Picture
  • INTERNETQ
    Internet Quality Picture
  • EXCELLENTQ
    Excellent Quality Picture
  • DETAILQ
    Detail
  • WATERMARK
    Watermark

It can also contain a reference to the text file related to the page:

  • TEXTCOPY (TEXT)
    Text copy
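
How an application might choose among the image references of a PAGE file can be sketched as follows. The tags and labels are taken from the list above; everything else is an assumption made for illustration: the dictionary representation, the function name, the file names, and in particular the preference order, which the caller supplies because this document does not define a ranking of the quality levels.

```python
from typing import Dict, Optional, Sequence, Tuple

# Image reference tags of the PAGE file and their labels, as listed above.
IMAGE_TAGS = {
    "GALLERYQ": "Gallery Quality Picture",
    "PREVIEWQ": "Preview Quality Picture",
    "NORMALQ": "Normal Quality Picture",
    "INTERNETQ": "Internet Quality Picture",
    "EXCELLENTQ": "Excellent Quality Picture",
    "DETAILQ": "Detail",
    "WATERMARK": "Watermark",
}


def pick_image(page_refs: Dict[str, str],
               preference: Sequence[str]) -> Optional[Tuple[str, str]]:
    """Return (tag, file) for the first preferred tag the page refers to, else None."""
    for tag in preference:
        if tag not in IMAGE_TAGS:
            raise ValueError(f"unknown image tag: {tag}")
        if tag in page_refs:
            return tag, page_refs[tag]
    return None


# Hypothetical references of one page; the file names are made up.
page_refs = {"PREVIEWQ": "p001_preview.jpg", "NORMALQ": "p001_normal.jpg"}

# The caller decides the order; here a viewer asks for the largest image it can get.
print(pick_image(page_refs, ["EXCELLENTQ", "NORMALQ", "PREVIEWQ"]))
# ('NORMALQ', 'p001_normal.jpg')
```

Keeping the preference order outside the function lets one application favour small previews while another asks for the highest quality available.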

 

6. Other statements

If larger or more detailed descriptions of individual pages are needed, other statements/categories can be added as desired.

 

7. Literature:

[1] Mayer T., Knoll A.: The Structure of Digital Copies of Old Books and Manuscripts. Version 2.1, 1997, AiP, NK Praha
[2] Vomlel J., Knoll A.: Digitization of Old Books, Manuscripts, and Other Documents: Format for Storage of Metadata. Version 2.1, 1997, AiP, NK Praha