Le Courrier

sommaire dossier
d'ici... opinion notre planete
ethiques signes connexions dires
Digital disappearances
Sara Gould and Marie-Thérèse Varlamoff. Sara Gould is programme officer for the Universal Availability of Publications at IFLA (International Federation of Library Associations and Institutions). Marie-Thérèse Varlamoff is chief curator at the French National Library and director of the Preservation and Conservation Programme at IFLA.
photo
National libraries with the financial means have been steadily digitizing their collections.










In the near future, those who study literary history will be at a loss if all they have is a diskette containing the final “clean” version






Obsolete machines and the ephemeral nature of web-based materials are just some of the hurdles confronting archivists as they struggle to preserve the world’s digital heritage

The wobble of Neil Armstrong’s legendary leaps on the moon are engraved in the memory of at least a generation. Yet 30 years later, much of the actual data from that mission has been lost. Apparently,
NASA lacked the foresight to save the equipment and software required to read it. If the scourge of technological obsolescence can cause such a loss for the world’s leading space agency, imagine the challenge awaiting national libraries in their task to preserve the world’s digital heritage.
Just a few years ago, digital technologies emerged as a godsend for archivists. As e-mail and the Internet spread, pundits began taking bets on how long it would take for paper to disappear. Libraries with the financial means began digitizing their collections. Instead of submitting their precious collections to page-fraying, spine-breaking habits of scholars in a merciless search for key passages, librarians handed them a slim pair of
CD-ROMs, digital copies of the great works, which could be scanned in minutes. With a flip of a switch, libraries could signficantly expand access to their collections.
Yet in the frenzy to digitize, precious resources may have been wasted. Without a co-ordinated strategy, some sister organizations have made digital copies of similar sections of their collections. With a bit more foresight, they might have worked together to offer a more diverse selection. Converted to the cult of the new technologies, others have been a bit too zealous in cleaning up—throwing out—their ageing documents like old newspapers. And finally, some of the more prestigious institutions have fallen upon a double-edged sword after proudly displaying the gems of their collections on the Internet. The digital versions whetted the general public’s appetite to see the “real thing” which means that crowning documents like the Magna Carta risk becoming veritable attractions.
Ironically, the supposed saviour, digitization, may be an enemy in the battle to build and preserve perpetual archives. To begin with, the technology is so new that we cannot be certain of the longevity of, for example,
CD-ROMs. A more immediate threat lies in the extraordinary speed of technological progress. “The great creator becomes the great eraser,” warns an American futurist, Stewart Brand. We can read a thousand-year-old manuscript, yet archivists cannot decipher some materials that are less than 20 years old.
Remember the old floppy discs of the not-so-distant past? If by some miracle a library finds some of those clunky machines, the chances of acquiring spare parts, the right software required and a trained technician are just about nil. Today, “backward compatible” software—that designed to read old versions—generally only covers one or two generations of changes, which offers little relief when the average life cycle of hardware and software is a mere 18 months. Indeed, the U.S. based Research Libraries Group found in a 1998 survey that nearly half (15 of 36) of its member institutions reported losing access to part of their digital holdings because of obsolescence.
Awaiting a long-term solution, archivists continue to rely on a veritable relic: microfilm for crumbling paper collections. But the old stand-by does little to resolve the latest thorn in their side: the new wave of “born-digitals,” works created on computer-like websites and electronic journals.
Consider the case of continually up-dated documents on the Web. Should we preserve all the various drafts of the document or only the final version? The drafts of great works of literature are treasure troves for scholars. For instance, Victor Hugo’s splendid handwriting and the powerful drawings he used to fill in the margins of the pale blue paper he favoured are rich in historical significance. In the near future, those who study literary history will be at a loss if all they have is a diskette containing the final “clean” version with no draft, no hesitation, no notes, no drawings nor doodles.
The same is true of e-mail. A century ago, famous writers may have recorded their movements, discoveries and emotions to friends or family in letters which have often been preserved as part of our cultural heritage, helping to set literary works in the context of the writer’s life and thought. E-mail storage is increasingly seen as a burden on a computer system. Will the memory of today’s literary giants lose out to that of the computer?
And what about all the links an electronic publication might have to other websites? The exhilaration that grips us when we surf the Net quickly turns to vertigo with the thought of preserving that sea of information.
National libraries are testing two different approaches. Given the extraordinary volume of web-based materials, Australian archivists are becoming “gatekeepers,” sifting through and selecting the most important materials for a national electronic repository. To be part of the depot, an item must either be about the country or particularly relevant to its interests. Alternatively, it can be the work of an Australian, recognized as an authority on an issue of international significance. While this approach appears to be the most practical, there is a danger in becoming overly selective. After Napoleon ransacked the Vatican archives and sent them back to Paris in 1810, the French only saved the documents of immediate and apparent interest.
Alternatively, archivists might try to save the entire electronic domain of a country. Sweden has embarked on such an ambitious project, in collaboration with other Nordic states. Thanks in large part to a robot that takes regular “snapshots” of the Internet, the Swedes have already amassed more than 58,000 items—from online publications to conference proceedings—in building a central electronic repository.
Both of these routes lead to two major obstacles: copyright conflicts and costs. In some countries, publishers must deposit copies of every new publication in national libraries. These laws, however, do not always apply in the electronic domain. While some publishers voluntarily deposit copies of “handheld” works, like
CD-ROMs, ephemeral works like electronic journals are practically untouchable for legal reasons. By subscribing to those journals, libraries or individuals aren’t actually buying a “copy,” just a license to access the material.

Taking a digital refresher
Do they have the right to offer that access to the public? And what if a library stops a subscription? Then they don’t necessarily have the right to display or offer public access to back issues for which they paid.
Enormous financial pressure is building on national libraries and publishers alike to iron out new business arrangements and partnerships. Publishers are leery of the responsibility for preservation, yet at the same time cannot afford to see their collections become obsolete. Meanwhile, the archivists are finding it far more expensive to acquire an electronic publication than a traditional one. The National Library of Australia, for example, estimates that it takes one person a full working day to acquire the first version of an online publication—a task five times more labour intensive than adding a print item. A study by the British Library suggested that the cost of managing and preserving a digital publication over a 25-year-period is about 20 times greater than it is for print.
A hefty chunk of the costs go to “refreshing” digital documents. Every five or ten years, electronic collections are supposed to embark on a “migration,” in archival lingo, to an updated computer configuration. Unfortunately, the journey is not without its casualties. Formats and presentations may change and some sections risk disappearing altogether. Does it matter if the look and “feel” of a website changes, so long as the contents remain?
For some archivists, migration is just a standby for “emulation,” which would involve a combination of software and hardware capable of mimicking the behaviour of obsolete platforms and operating systems. In short, the aim is to develop a kind of mini-archive to “remind” computers of their past. But the technological wizards can only forge part of the key to preserving our digital documents. Ultimately, we must reconfigure relations between libraries and publishers.