Národní knihovna CR
Albertina icome Praha
images/space.gif

 

colle1.jpg (6164 bytes)
colle2.jpg (7638 bytes)
colle3.jpg (7400 bytes)

 

Contents:
1. Basic principles
1.1 .New specification - nkp//manuscript 2.5
2. Categories of DOBM files
3. Structure of the COLLECTION file
4. Literature
  
Examples: Example of the enlargement of a concrete aplication to describe a collection of digitized documents
Example of the mapping DOBMENT file for the manuscript collection

 

1. Basic principles

1.1. New specification - nkp//manuscript 2.5

A collection of compound digital documents (e.g. that of digitized manuscripts or sound recordings) consists of several complex documents whose root files are individual bibliographic descriptions BIBLDESCR (DESCR.HTM). For them the concrete DOBM formats as defined for single items are the basic structural platform.

To map a collection in its DOBMENT.SGM, it is necessary to add the COLLECTION TAG into the list of categories applied (into the CTGSET tag).

It will have the category label “COLLECTION” and the English name “Collection” (e.g., the Czech name will be “sbírka”) as shown below:

CTGLABEL="COLLECTION" NAME="Collection"

 

2. Categories of DOBM files

The files with metadata for each type of document are added the category COLLECTION to as follows for manuscripts and sound recordings taken as examples:

  1. COLLECTION (Collection) - File with general description of the collection. From this file, a significant reference is made to the BIBLDESCR files describing the items of which the collection consists.
  2. BIBLDESCR - (Bibliographic Description) File with bibliographic description of the book (work). From this file, a significant reference is made to the file of the TECHDESCR category as well as to the file of the BOOK (WORK) category.
  3. TECHDESCR (Technical Description) - File with description of the digitization process.
  4. Files describing individual items of the collection (BOOK, WORK) - File representing the book, sound work; here as parts of the collection. From this file, significant references are made to individual parts of the book or work (files of the PAGE or PART category).
  5. PAGE (Page), PART (Part) - File representing one part of the book, sound recording (part of a symphony, song, volume, variation, ...). It contains the description of the page or part and references to data files of digital copies divided into categories defined by their quality, given by the data format or the compression ratio.

coll.jpg (67811 bytes)

 

3. Structure of the COLLECTION file

The COLLECTION file contains the description of the collection, especially its main characteristics, number of items, historical circumstances of its building, and other interesting information about the collection as a whole. It has in fact a character of an annotation.

The COLLECTION category contains this information in one category labelled as shown below:

CTGLABEL="COLLDESCR" NAME="Collection Description"
This statement is mandatory.

 

EXAMPLE OF THE ENLARGEMENT OF A CONCRETE APPLICATION TO DESCRIBE A COLLECTION OF DIGITIZED DOCUMENTS

For manuscripts and old printed books the CTGSET will be added and inserted the following to:

<DOBM SPEC="NKP//MANUSCRIPT 2.5" CTGLABEL="COLLECTION" NAME="Collection">

<DX CTGLABEL="COLLDESCR" NAME="Collection Description" TYPE="TEXT">

</DX>

</DOBM>

For sound recordings the CTGSET will be added and inserted the following to:

<DOBM SPEC="NKP//MANUSCRIPT 2.5" CTGLABEL="COLLECTION" NAME="Collection">

<DX CTGLABEL="COLLDESCR" NAME="Collection Description" TYPE="TEXT">

</DX>

</DOBM>

 

The ROOTS tag will look like as follows:

<ROOTS>
<!-- Reference to the top of the tree structure of the description - ->
<BEG HREF="COLLECT.HTM" CTGLABEL="COLLECTION" NAME="Collection Description">
</ROOTS>

The structures of the BIBLDESCR, TECHDESCR, BOOK (WORK, ...), PAGE (PART, ...) files remain the same as defined by the concrete specifications for each type of document.

 

4. Literature:

[1]  Mayer T., Knoll A.: The structure of digital copies of old books and manuscripts II, version 2, In: Digitization of Rare Library Materials. Storage and Access to Data: The Solution for the Compound Document, Manuscripts and Old Printed Books, Prague, National Library and Albertina icome Praha, 1997. CD-ROM.
[2] Vomlel J., Knoll A.: Digitization of manuscripts, old printed books, and other documents, Format for storage of metadata, version 2. 1, In: Digitization of Rare Library Materials. Storage and Access to Data: The Solution for the Compound Document, Manuscripts and Old Printed Books, Prague, National Library and Albertina icome Praha, 1997. CD-ROM.

 

Example of the mapping DOBMENT file for the manuscript collection

 

<!DOCTYPE DOBMENT PUBLIC "-//AIP//DTD DOBMENT//EN">

<DOBMENT>

<ROOTS><!-- Reference to the top of the tree structure of the description - ->
<BEG HREF="COLLECT.HTM" CTGLABEL="COLLECTION" NAME="Collection Description">

</ROOTS>

<CTGSET>

<DOBM SPEC="NKP//MANUSCRIPT 2. 5" CTGLABEL="COLLECTION" NAME="Collection">

<DX CTGLABEL="COLLDESCR" NAME="Collection Description" TYPE="TEXT">

</DX>

</DOBM>

<DOBM SPEC="NKP//MANUSCRIPT 5" CTGLABEL="PAGE" NAME="Page">

<DATA CTGLABEL="GALLERYQ" NAME="Gallery Quality Picture" TYPE="IMAGE">

<DATA CTGLABEL="PREVIEWQ" NAME="Preview Quality Picture" TYPE="IMAGE">

<DATA CTGLABEL="INTERNETQ" NAME="Internet Quality Picture" TYPE="IMAGE">

<DATA CTGLABEL="NORMALQ" NAME="Normal Quality Picture" TYPE="IMAGE">

<DATA CTGLABEL="EXCELLENTQ" NAME="Excellent Quality Picture" TYPE="IMAGE">

<DATA CTGLABEL="DETAIL" NAME="Detail" TYPE="IMAGE">

<DATA CTGLABEL="WATERMARK" NAME="Watermark" TYPE="IMAGE">

</DOBM>

<DOBM SPEC="NKP//MANUSCRIPT 2.5" CTGLABEL="BOOK" NAME="Book">

<REFERENCE SPEC="NKP//MANUSCRIPT 2.1" CTGLABEL="PAGE">

</DOBM>

<DOBM SPEC="NKP//MANUSCRIPT 2.5" CTGLABEL="TECHDESCR" NAME="Technical Description">

<DX CTGLABEL="CAPTURE" NAME="Image Capturing Data" TYPE="TEXT">

</DX>

</DOBM>

<DOBM SPEC="NKP//MANUSCRIPT 2.5" CTGLABEL="BIBLDESCR" NAME="Bibliographic Description">

<REFERENCE SPEC="NKP//MANUSCRIPT 2.1.coll" CTGLABEL="TECHDESCR">

<REFERENCE SPEC="NKP//MANUSCRIPT 2.1.coll" CTGLABEL="BOOK">

<DX CTGLABEL="SHELFNO" NAME="Shelf-number" TYPE="TEXT">

</DX>

<DX CTGLABEL="LIBRARY" NAME="Library" TYPE="TEXT">

</DX>

<DX CTGLABEL="OWNER" NAME="Owner" TYPE="TEXT">

</DX>

<DX CTGLABEL="MAINTTL" NAME="Title" TYPE="TEXT">

</DX>

<DX CTGLABEL="FSTOFRESP" NAME="Author" TYPE="TEXT">

</DX>

<DX CTGLABEL="EDITST" NAME="Edition" TYPE="TEXT">

</DX>

<DX CTGLABEL="GMD" NAME="Type of Document" TYPE="TEXT">

</DX>

<DX CTGLABEL="PBLSHER" NAME="Publisher" TYPE="TEXT">

</DX>

<DX CTGLABEL="PLACEPBL" NAME="Place of Publication" TYPE="TEXT">

</DX>

<DX CTGLABEL="PRINTER" NAME="Printer" TYPE="TEXT">

</DX>

<DX CTGLABEL="PLACEPRT" NAME="Place of Printing" TYPE="TEXT">

</DX>

<DX CTGLABEL="DATOFPUBL" NAME="Datation" TYPE="TEXT">

</DX>

<DX CTGLABEL="PHYSDESCR" NAME="Physical Description" TYPE="TEXT">

<DX CTGLABEL="MATERIAL" NAME="Material" TYPE="TEXT">

</DX>

<DX CTGLABEL="SIZE" NAME="Size" TYPE="TEXT">

</DX>

<DX CTGLABEL="EXTENT" NAME="Extent" TYPE="TEXT">

</DX>

<DX CTGLABEL="ILLUM" NAME="Illuminations" TYPE="TEXT">

</DX>

<DX CTGLABEL="NOTATION" NAME="Notation" TYPE="TEXT">

</DX>

</DX>

<DX CTGLABEL="NOTES" NAME="Notes" TYPE="TEXT">

</DX>

<DX CTGLABEL="ANNOT" NAME="Annotation" TYPE="TEXT">

<DX CTGLABEL="BASINF" NAME="Basic Information" TYPE="TEXT">

</DX>

<DX CTGLABEL="CONTENTS" NAME="Contents" TYPE="TEXT">

</DX>

<DX CTGLABEL="INDEX" NAME="Index" TYPE="TEXT">

</DX>

<DX CTGLABEL="LIT" NAME="Literature" TYPE="TEXT">

</DX>

</DX>

</DOBM>

</CTGSET>

</DOBMENT>