Home | Digitisation (Summary)
Full Text: Page 1 | Page 2 | Page 3 | Page 4

GOOD PRACTICE GUIDELINES

The process of digitisation is superficially simple, since an inexpensive PC and scanner can be used with standard software to create a digital copy of any human-readable artefact. However, when issues such as the quality of the resulting image, the format used to store it, its description, its intended use and its preservation are considered, the process becomes much more complex. The very simplicity of the basic process can lead librarians and policy makers into a false belief that a digitisation programme will be straightforward and inexpensive. It will be neither.

The delivery of digitised materials can be relatively straightforward, since the development of the Internet and the World Wide Web has provided the infrastructure, software and technical standards needed. Again, however, the very ease with which the Web can be accessed masks some of the underlying difficulties. For example, if images are not simply to be given away, complex software and procedures may be needed to collect revenue and to guard against infringement against intellectual property rights.

It is important to recognise that digitisation is often taken to mean only the creation of digital images of originals. More complex processes which seek to capture the meaning of the text (or image or whatever) – such as Optical Character Recognition (OCR) – may also be used and these are considered briefly below.

Planning
It is first necessary to be clear as to why digitisation is felt to be appropriate, the precise selection criteria and the audience being addressed. It will be noted while none of these issues are first and foremost strictly technical, they are all essential to a successful project. They lead on to a decision on what should be digitised. (See also diverse cultural content)

For an example of digitisation selection criteria see University of California Selection criteria for digitisation. Lee (see below) summarises the necessary questions and suggests the use of a ‘decision matrix’. Colet’s ‘Planning an Imaging Project’ will be found useful.

The scope of the project will have to be considered carefully – it is easy to underestimate the effort needed to digitise traditional materials, and it is almost inevitable that only parts of most collections can be handled. Cost, staffing and other issues (see below) will contribute to these decisions.

IPR (Intellectual Property Rights) (see also copyright)

There are two aspects to this:

  • Does the library hold the right to make digital copies? If it will be necessary to secure the rights to the material, this will inevitably be a long drawn out and probably costly process.

  • What rights to the digitised images does the library intend to impose? Will all images be made freely available, or will thumbnails be displayed on the web and a charge be made for higher definition copies?

File formats
The output from a digitisation process is an image, or other information object, in digital format. However, there are many different formats in use and that chosen will have to be selected with care. Bear in mind that it is usual to hold both a master and delivery copies, and it is often appropriate for these to be in different formats. Common formats used for digitisation projects include TIFF (for high quality masters) and JPEG or GIF (for delivery copies), but remember that there are different versions of all formats. The following criteria need to be considered (based on C. W. Brown and B. J. Sheperd. ‘Graphics File Formats, Reference and Guide’. Greenwich, Connecticut, USA: Prentice Hall, 1995, as described in Franziska Frey ‘File Formats for Digital Masters’ in the US Guides to Quality in Visual Resource Imaging series,).

Although a large number of formats are available it is likely that the practical choice will be fairly limited. Taking a lead from other projects, and taking expert advice, is recommended.

Digitisation hardware and software
The hardware choices, broadly, are:

  • Scanners. It should be noted that simple flat-bed scanners are not likely to be suitable for bound volumes and can cause considerable damage. Where it is possible to split originals into separate pages a document feeder will be found to be advantageous, as it can speed up processing and improve accuracy. Specialised scanners are available for digitising slides and microforms. Don Williams has written a guide to scanner selection in the US Guides to Quality in Visual Resource Imaging series.

  • Digital cameras. Where a book cannot be opened flat, or where material is delicate, a camera is much more suitable than a scanner since it avoids physical contact between the equipment and the original object. It will be necessary to build or acquire a book cradle and to provide suitable, low temperature lighting.

For non-visual media, such as audio recordings, equipment can be purchased to provide digital output from analogue media.

The software used in the digitisation process needs to be chosen with equal care. There are three processes to consider: capture, processing and delivery – the last of these is considered separately below. Image capture software is often bought with the hardware, and this can be a good solution. It must support common standards (e.g. TWAIN) and should provide basic management features such as image preview – this enables adjustments to be made without a full scan. The chosen file formats must, of course, be supported. Image processing software may range from the simple to the highly sophisticated, depending on the intended use. At one end of the spectrum, a project may decide not to process captured images at all. Another project, however, may decide to ‘clean’ the images to remove scratches or stains which appear on the original and to adjust colour balance etc. manually.

The choice of appropriate hardware and software will be dictated by a number of other considerations, including purchase and maintenance costs, ease of use (need for special training), throughput requirements, adaptability and extensibility, ability (quality and choice of output), standards compliance, reliability and so on.

Home | Digitisation (Summary)
Full Text: Page 1 | Page 2 | Page 3 | Page 4


Select a country to view information on public libraries


Digital Guidelines Manuals
Click here to view


The PULMAN
Online Database of Education Resources


Private Section for PULMAN partners only.
Click here to Enter

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 


Last updated 11/05/2004
Site best viewed with IE 4.0 or above