Interpretive journeys and METS: Determining requirements for the effective management of complex digital objects in a National Park

Dennis Nicholson (Centre for Digital Library Research, Strathclyde University, Glasgow, UK)

Journal of Documentation

ISSN: 0022-0418

Article publication date: 1 March 2006

821

Abstract

Purpose

To report on initial work carried out on a Metadata Options Appraisal undertaken for the Loch Lomond and Trossachs National Park in the context of plans to create and manage complex digital objects called Interpretive Journeys, PDA‐based multimedia guides to Park tours. Design/methodology/approach – After some initial work, a phased approach was adopted. In Phase I (reported here), metadata schema for managing structurally complex digital objects were researched to provide a range of appropriate options. Their characteristics were then matched against the Park's general requirements as determined through discussions with staff and an examination of relevant documentation. Findings – It was determined that the METS metadata schema met Park requirements for managing Interpretive Journeys at a general level and could be used both, to inform the development of the planned Digital Asset Management System (an early priority), and to act as the main basis of a framework for determining a detailed metadata solution for the Park. Research limitations/implications – Further work is required in Phase II to determine the detailed metadata requirement for the Park. Practical implications – A useful guide to dealing with a range of management and metadata issues associated with managing complex digital objects, and to managing projects where guidance on metadata requirements is needed before the objects to be described are available. Originality/value – This paper offers an introduction to the management and metadata issues raised by Interpretive Journeys and digital composites of similar complexity, and an indication of how METS addresses these at a general level.

Keywords

Citation

Nicholson, D. (2006), "Interpretive journeys and METS: Determining requirements for the effective management of complex digital objects in a National Park", Journal of Documentation, Vol. 62 No. 2, pp. 271-290. https://doi.org/10.1108/00220410610653334

Publisher

:

Emerald Group Publishing Limited

Copyright © 2006, Emerald Group Publishing Limited


Introduction

The Metadata Options Appraisal

This paper deals with a Metadata Options Appraisal commissioned by the Loch Lomond and Trossachs National Park in Scotland in the context of a Europe‐wide project called MoPark Project (2004). Its remit is to look at three levels of the Park's metadata requirements: those specific to MoPark (level 1), those likely to arise if the scope of the MoPark work were extended (level 2, and those associated with the management of Park materials generally (level 3). However, since the MoPark requirements are the most complex, these have been the major focus of the work to date. The Appraisal was originally planned as a single initiative to be conducted in a number of steps (see Methodology), but a decision was taken on the basis of early findings to adopt a phased approach. This paper covers the work leading up to that decision – that is, it covers what is now Phase I of two‐phase appraisal, Phase I outcomes, and the reasons for the adoption of a phased approach.

MoPark and interpretive journeys

The aim of the MoPark project is to encourage green tourism in the Park by providing opportunities for sustainable travel and innovative interpretation. The interpretive facet is intended to help visitors understand and enjoy the Park's special qualities and will be delivered electronically via a variety of output devices, including personal digital assistants (PDAs), mobile phones, audio tours, and talking posts. In the case of PDAs, in particular, but perhaps also mobile phones, this will entail the creation of complex digital objects called Interpretive Journeys. These are multimedia‐based Park tours following marked out paths with a number of stops. Each stop is enhanced and interpreted for the visitor through an interactive programme consisting of still images, video and audio clips, animations, games, and text relevant to the flora, fauna, landscape formations and features, history, and culture represented at that point in the tour.

Interpretive journeys in other environments

In the context of the MoPark Project, an Interpretive Journey is typically a hand‐held electronic guide to a Park trail – whether it is a walk in the woods or a trip on a solar boat. However, the potential of this kind of digital multimedia presentation as a means of enhancing an individual's experience and understanding of a range of environments is clearly significant. Leisure and recreation environments – parks, museums, tourist‐trails, and similar – are probably the most likely areas of application. Education and other forms of training are another. This being so, the use of such packages seems likely to grow, which means that there is likely to be a increasing demand from a range of organisation‐types for information scientists who can advise on the issues covered in this paper – that is, on the problems raised by Interpretive Journeys and similar multimedia programmes, on the metadata required to adequately describe them, and on the design and operation of digital repository software to house and manage both.

Interpretive journeys as highly structured digital content

Interpretive Journeys (and other similar packages) are not simply collections of component digital objects with a common theme or a related set of such themes. They are pieces of highly structured digital content whose function, value, and meaning would be largely lost if their structure was lost – just as the function, value, and meaning of a book would be lost if it were cut up into paragraphs or sentences presented in randomised order. This means that the metadata required to manage them must, amongst other things, be able to capture and describe that structure, particularly if, as is envisaged in MoPark, there is (or may be) a requirement to disseminate these complex digital objects to other repositories, or to ensure their long‐term digital preservation. In the case of MoPark, for example, the metadata must be able to capture and describe structural elements such as the sequential and hierarchical nature of the journey, and the inter‐relationships between component digital objects at different sequential or hierarchical levels (for example, a map that can be called up at any journey stop and which itself has hierarchical levels in the shape of a ‘zoom in’ to a more detailed map, a key to the map, and so on).

Objectives of the Metadata Options Appraisal

The Appraisal has two objectives. The first is to determine the metadata requirements for the Park's proposed digital repository in the context of the ongoing use and long‐term management of these and other digital objects, looking at:

  1. 1.

    MoPark project requirements (Level 1).

  2. 2.

    Requirements if MoPark is extended to other topics and Park areas (Level 2).

  3. 3.

    General requirements of the National Park beyond MoPark (Level 3).

The second is to make recommendations on the implementation of the digital repository, taking in issues such as metadata schema options, staffing requirements, training, and the design of the proposed Digital Asset Management System (DAMS). Although Interpretive Journeys are not the only digital objects of interest in the Appraisal, they are, by far, the most complex, and were, therefore, the primary focus of the initial work. More precisely, written and spoken descriptions of the Park's plans for the creation of such journeys were the initial focus of the work, since, in the original plan, the appraiser was charged with identifying the metadata requirements before the actual Interpretive Journeys were available.

This, of course, was a less than ideal circumstance in which to conduct the Appraisal. Following discussion, however, it became clear that it was unavoidable. Park staff required some guidance on metadata requirements if they were to specify their requirements for the Digital Asset Management System to the software designers and programmers, and the project schedule did not permit the long delay involved in waiting for the creation of the actual Interpretive Journeys. The appraiser was therefore faced with a requirement to deal with the situation, ideal or not – a real world circumstance which led to the proposal for the phased approach described below, and one which may well be encountered in other organisations as the digital revolution gathers pace.

Digital repositories and metadata

Digitisation programmes, as well as individual initiatives, have been something of a growth area in recent times (see, for example, the NOF Digi programme's database of projects (New Opportunities Fund, 2003), the sites of the members of the Digital Library Federation (Digital Library Federation, 2004), and the Register of Digital Initiatives in New Zealand (National Digital Forum, 2004)). One result of this is that it is possible to draw upon a good deal of readily available experience and expertise in respect of the management of digital objects when considering metadata options. Another is that there is a wide (and apparently still growing) range of metadata schema designed for use with digital objects and repositories available to choose from: DC, MODS, e‐GMS, TEI, METS, EAD, MPEG‐7, SCORM, MPEG‐21, IMS‐CP, MARCXML, MIX, to name only a few of those available (MetaMap, 2004; National Information Standards Organization, 2004; Farb and Riggio, 2004).

Fortunately, it was clear at an early stage of the Appraisal that the list of possible schema choices could be reduced considerably by focusing on digital repository initiatives dealing with composite digital objects whose complexity was of an order similar to that of MoPark Interpretive Journeys. Consideration of the issues encountered in such projects (Alemneh et al., 2002; Bekaert 2003, 2004; Chandler, 2002; Chen, 2000, 2002; Farb and Riggio, 2004; Guenther and McCallum, 2003; Proffitt, 2004; Staples et al., 2003; Tennant, 2003a), and of the kinds of metadata schema employed to deal with them, allowed the list of possible schema relevant to MoPark to be narrowed to those capable of managing structured content – the defining feature of complex digital objects like Interpretive Journeys. With proprietary solutions considered inappropriate and unwise because of a likely long‐term need to interoperate with other repositories in Scotland and beyond, the range of possibilities to choose from was narrowed to METS, MPEG‐21, and IMS‐CP, three options which, whilst not directly comparable in respect of their primary purpose, are recognised as roughly interchangeable standards for managing structured content (New York University Libraries, 2004), and are used as such by the various projects examined in the Appraisal:

METS

The Metadata Encoding and Transmission Standard (METS) is a schema developed following ground breaking work at the University of California Digital Library in the context of the Making of America II project (Cundiff, 2004; National Information Standards Organization, 2004; Coalition for Networked Information, 2003). Maintained by the Library of Congress (Library of Congress, 2004a), it has the support of the influential Digital Library Federation, and is designed to be used with a range of “extension schemas” such as MODS, MIX, MARCXML and TextMD (Cundiff, 2004; Library of Congress (2004b, c); Ockerbloom et al., 2004; Tennant, 2003a). In use in, or recommended by, a growing range of initiatives (Cundiff, 2004; Chandler, 2002; Guenther and McCallum, 2003; Institute of Museum and Library Services, 2001; Niu, 2002; Proffitt, 2004; Semple, 2004; Staples et al., 2003; Stehno, 2003; Tennant, 2003a), METS is also an integral part of at least one digital library management package (Staples et al., 2003) and was at the time of the work reportedly being considered for future use in MIT's widely‐implemented DSpace software (Smith et al., 2003).

MPEG‐21

MPEG‐21 is a Moving Picture Expert Group standard (International Organisation for Standardisation, 2002), expected to be a preferred approach in the commercial sector (Coalition for Networked Information, 2003). It was found to be less developed than METS (Coalition for Networked Information, 2003; Ockerbloom et al., 2004) and to be in use in one repository (Bekaert (2003, 2004)).

IMS‐CP

IMS‐CP, the IMS content packaging standard (IMS Global Learning Consortium Inc., 2004), is designed for use in the learning community, a community that MoPark will at least look to interoperate with. IMS‐CP is in use in various projects (Australian Flexible Learning Framework, 2004a, b, c; Coalition for Networked Information, 2003, p19), and is recognised as the learning community's alternative to METS and MPEG‐21 (Coalition for Networked Information, 2003; Ockerbloom et al., 2004). SCORM (Advanced Distributed Learning, 2004), another learning community solution also based on IMS specifications (Coalition for Networked Information, 2003) is also in use (Lubas, 2004). However, by the time it was identified as a possible candidate, it had already been agreed that the proposed future MoPark connection with the learning community was a relatively weak one and that there was no need to assess SCORM as one of the options.

Methodology

Metadata and effective management of the digital repository

From the start, it was recognised that the underlying aim in conducting the Appraisal was to determine the requirement for the effective management of the proposed digital repository and the digital materials it would contain. The metadata requirements of any system are primarily related to function. In order to determine the metadata requirements for MoPark and for the National Park, therefore, it was first necessary to determine the needs in respect of handling or managing digital information and information products; that is, to determine which things needed to be done to, or, with what information and information products, by or for whom, in what circumstances (when, how, where), and for what purpose (why) in order to meet the management and service requirements of the system (National Information Standards Organization, 2004; Haynes, 2004; Farb and Riggio, 2004). It was also necessary to consider other issues affecting decisions on approaches to metadata: resources available for implementation and ongoing management, existing constraints in terms of the existence of legacy metadata repositories and legacy systems used by organisational partners, the need to interoperate with services outwith the group of organisational partners, and so on (Coalition for Networked Information, 2003, p. 23). Key requirements of a successful study were therefore an examination of representative examples of the various kinds of digital objects that would have to be managed and in‐depth discussions with Park staff on the management issues the various types of object would entail.

Initial methodology

A methodology for conducting the Appraisal was agreed in the early stages of discussions on project requirements. This consisted of the following steps:

  1. 1.

    Participation in an initial briefing session with the client group and suppliers.

  2. 2.

    An initial analysis of the outcomes, followed by an in‐depth consultation with project leaders on information needs, the amount and level of cataloguing required, the types of digital object likely to be stored and presented, the likely modes of presentation of these within interpretative journeys, and any requirements to access distributed information sources and integrate with other repositories, either now or in future.

  3. 3.

    An in‐depth examination – via the literature and online examinations of sites – of a range of internal project and Park documents on issues relating to, or having an impact on, metadata needs.

  4. 4.

    The creation of an outline, but detailed, sketch of the scope and complexity of the likely general needs of the project and the National Park in respect of metadata for digital objects.

  5. 5.

    An examination of prominent global digital library initiatives and the metadata issues and solutions adopted by them.

  6. 6.

    The identification of a digital object metadata framework sufficiently complex and flexible to meet the needs of the MoPark project and the National Park.

  7. 7.

    The identification of a range of areas and issues requiring further investigation before the detailed metadata requirements of the project and the Park could be specified.

  8. 8.

    The creation of a set of detailed questions that had to be answered – primarily by Park staff in discussion with the appraiser – in order to draw out and agree the detailed requirement.

  9. 9.

    A specification of the detailed requirement

  10. 10.

    The drawing up of recommendations on associated issues such as staffing and training requirements, costs, and the metadata management facilities required in the DAMS.

The need for a phased approach

In the event, the methodology had to be adjusted to take account of the information that emerged from the early part of the study. By the completion of step 4 of the process described above, two things were clear. First, that there was sufficient detail available from initial discussions with key MoPark and Park personnel (step 1), from follow‐up discussions with project leaders (step 2), and from project and Park documentation (step 3), to enable, both the general needs of the project and Park in respect of digital object metadata (steps 4‐6), and the further work needed to draw out the full requirement (steps 6‐8), to be specified in the timescale originally envisaged for this work. Second, that specifying the need in detail (step 9), together with associated staffing and training needs and detailed DAMS requirements (step 10), would be more difficult. It would require, at minimum, further work with actual examples of the complex digital objects (Interpretive Journeys) likely to form the primary elements of the Level 1 and 2 requirements, more experience amongst project and Park personnel of the issues and problems associated with managing complex digital objects, and an in‐depth survey of the likely range of other digital objects, their usage, and their life‐cycles.

Since the original assumption made by MoPark had been that the metadata requirement might be specified prior to the creation of the initial Interpretive Journeys and their use in the field, it was necessary to agree an adjusted methodology with project staff. This entailed adopting a phased approach to the completion of the Appraisal, with Phase I setting out, both a general framework within which the full requirement could be met, and the elements of the follow‐up phase, and Phase II stretching out over the full lifetime of the project (three years) and gradually specifying the detailed requirement as actual Interpretive Journeys emerged and were field‐tested, and as staff familiarity with these objects and understanding of the management issues they entailed, grew. This paper, of necessity, reports only on the outcomes of Phase I of this process. The outcomes of Phase II will be reported at a later date.

MoPark: general requirements

The completion of step 4 – the creation of an outline, but detailed, sketch of the scope and complexity of the likely general needs of the project and the National Park in respect of metadata for digital objects – produced the following agreed conclusions as regards general requirements:

Interpretive journeys

The primary need is to deal with Interpretive Journeys. These are complex digital objects comprising coherent aggregates of a range of simpler constituent digital objects in a variety of formats. They will require a metadata framework that encompasses:

  1. 1.

    Descriptive metadata (e.g. a MARC record) for both the aggregate object and the constituent objects.

  2. 2.

    Administrative metadata for both – including technical detail on format and use characteristics, intellectual property rights information, descriptive and administrative metadata on the analogue source of the digital object, digital provenance metadata.

  3. 3.

    Metadata on the files encompassed by the aggregate object.

  4. 4.

    Metadata on the hierarchical structure of the digital object and on hyperlinks between levels (structural metadata).

  5. 5.

    Metadata on executable programs required to run some of the constituent digital objects.

User‐type specific variations

User‐type specific variations are an issue. It is envisaged that different types of Park visitor will be presented with different versions of each Interpretive Journey and that these versions will themselves vary according to visitor choice of, for example, preferred terrain, and mode of transport. It is not clear at this stage whether each journey will be one single aggregate object or many similar, but different, aggregates. Either way, the metadata framework will have to be sufficiently flexible to cope with the inter‐relationships involved and the complex management issues entailed.

Output‐device‐specific variations

The issues described in the two points above are further complicated by the fact that different presentations of the various Interpretive Journeys are likely to be required for a range of different output modes (PDAs, web‐sites, mobile phones, and others).

Workflow control

Workflow control during the digital object management process is needed to ensure that accuracy, legal, metadata quality, and other requirements are met. The creation and management of Interpretive Journeys is likely to involve a range of Park personnel with different skills and expertise and this will have implications in respect of the need for a range of record statuses and the ability to monitor staff involvement in upkeep and creation in the metadata. Issues like information currency, accuracy, authority, access rights, metadata quality, and so on will make such involvement necessary and make workflow control and associated metadata needs essential elements of a working system

Format variations

Initial discussions suggest that the constituent digital objects that will make up the detail of the Interpretive Journeys are likely to be in a wide range of formats, including (but not necessarily limited to), textual materials, audio material, video materials, still images, animated sequences, 3D animations, 3D and 2D maps, interactive packages of various kinds. Any framework will have to be able to deal with the metadata requirements associated with each.

GPS issues

Special software will be needed to handle some of the constituent digital objects involved in an Interpretive Journey and it will have to be possible to specify this in the metadata framework. Included in this requirement is the need to obtain and respond to GPS data and to interact with an external GIS database system and repository.

Referencing descriptive metadata

It is likely that the descriptive metadata required for the aggregates and the constituent objects will in some cases be held on external systems (e.g. maps metadata and, possibly, MARC or Dublin Core metadata). In other cases, it may need to be embedded in the record for the object itself. The metadata framework will have to be able to handle either or both situations

Flexibility

The range and nature of digital objects likely to be managed when the needs of the Park as a whole are taken into account is at present unknown in detail. The framework adopted must therefore be flexible and wide‐ranging to enable it to cope with future requirements

Digital preservation

There will, in time, be a need to preserve some or all of the digital objects in the repository, so the metadata framework adopted must be able to deal with this likelihood.

Mapping to different metadata standards; XML output

A need to present or otherwise disseminate the metadata in various forms and various formats for different purposes in different situations is envisaged. Possible requirements in this respect include the need to interoperate with Library, Museum and (possibly) Archive community systems in Scotland1, to exchange data with other Parks or project partners in other countries, to meet the requirements of e‐GIF, the e‐Government Interoperability Framework (Broadley, 2004; Cabinet Office, Office of the e‐Envoy, 2004a) and, in particular, e‐GMS the e‐Government Metadata Schema (Cabinet Office, Office of the e‐Envoy, 2004b). It would be wise to allow for the use of a variety of standards, of mapping to different standards, and output in various formats. Either storing the metadata in XML (W3C, 2004) format or permitting its export and import in XML is advisable. Adopting national and international standards where possible is also advisable.

METS: a framework for Phase II; clarification on methodology

The methodology detailed earlier in this paper implies a linear progression from the identification of the elements of the general requirements (step 4), through the investigation of metadata solutions adopted elsewhere (step 5), to the identification of a digital objects metadata framework (step 6) capable of meeting the requirement identified in step 4. Although this is essentially the process followed, the actual progression from requirement to framework was less clear‐cut. In reality, an initial sketch of the requirement, together with an investigation of solutions adopted elsewhere, led to the identification of the METS standard, maintained by the Library of Congress, as a likely solution for the Park. This then helped focus further work on the requirement and on solutions adopted elsewhere, leading gradually, in consultation with project and Park staff, to the confirmation of both the final details of the general requirement2 and the suitability of METS as the metadata framework that best meets that requirement.

METS: points in favour

By the completion of step 5, it was clear that METS has a number of points in its favour as the schema most able to meet Park requirements in respect of a framework. In summary, these are as follows:

  1. 1.

    It is designed for complex digital objects, XML‐based, and internationally supported. The METS: Metadata Encoding and Transmission Standard is a new standard that has been developed specifically to provide an overall framework within which metadata for all types of digital materials – digitised video, sound files still images, electronic texts, and others – can be integrated. Expressed in the World Wide Web Consortium's XML schema language and maintained by the Network Development and MARC Standards Office of the Library of Congress, it provides a flexible and coherent framework within which to meet MoPark and National Park digital materials metadata requirements as detailed above. There is an official METS web site (Library of Congress, 2004a) with good support and training information and the JISC web site (Joint Information Systems Committee, 2004) also has information on the standard (Gartner, 2002) and on MODS (Gartner, 2003) an accredited METS extension schema covering descriptive metadata.

  2. 2.

    It meets the complex range of requirements identified as relevant to MoPark needs. METS aims to provide an XML document format for encoding metadata necessary for both the management of digital library objects within a repository and the exchange of such objects between repositories (or between repositories and their users). It caters for all of the types of metadata (descriptive, administrative, structural, executable behaviors etc) used to describe digital library objects. It permits both the embedding of the metadata within the document and the referencing of externally held metadata (which may be a MoPark requirement). Hierarchical structures and links between levels can be encoded and pointers to software required to run particular constituent objects can be specified. Since the data is held in XML format, it is platform and software independent and should be readily exchangeable with partners and cooperating organisations. The schema is a new one but has, as already indicated, been adopted by a number of digital library projects.

  3. 3.

    It is the most widely used and appropriate of the three relevant standards identified. Of the three options – METS, IMS‐CP, and MPEG‐21 – identified by the Appraisal as potentially suitable for dealing with complex digital objects of the kind envisaged in MoPark:

  1. 1.

    It is supported by key organisations and regarded as a stable standard. The use of METS by major organisations such as Library of Congress, the British Library, Research Libraries Group, and the Digital Library Federation itself, means that METS is supported by significant “players” in digital object management, an important consideration for the Park, given the likely need for collaboration and interoperability in future. There are also indications that METS is considered stable enough for submission to formal standards bodies such as ISO and NISO (Guenther and McCallum, 2003).

  2. 2.

    It has available a growing community of practice, and good support documentation. The existence of a community of users or potential users, some of whom (The British Library, Oxford University (Cundiff, 2004), The National Library of Wales (Guenther and McCallum, 2003), Edinburgh University (Semple, 2004)) are in the UK is an important consideration, as is the availability of good documentation (Library of Congress Help Desk, 2003) and simplified accounts of its place in the scheme of things (Tennant, 2004, 2003b, 2002). As RLG puts it (Proffitt, 2004, p. 2) “METS provides us with a standard to follow and an active community of colleagues with whom to confer on issues unique to this topic”.

  3. 3.

    In METS, Interpretive Journey stops can be treated as individual complex objects. It is possible to create hierarchies of METS documents (Cundiff, 2004), a possible issue for MoPark, given that Interpretive Journeys are made up of a variety of “stops”, each of which comprises a complex digital object in its own right and might be used either as a stand‐alone package in some circumstances, or as a component in more than one Interpretive Journey (where walks overlap, for example).

  4. 4.

    It can be used as a comprehensive framework for Phase II then discarded if desired. The aim is to use METS as a framework that would ensure the iterative approach proposed would be comprehensive. It would be feasible to discard it later should the detailed requirements suggest this to be advisable for some reason.

  5. 5.

    The possible future need for interaction with the learning community is provided for. Although use of IMS‐CP itself is not considered appropriate, it is envisaged that there would in time be a need to interoperate with the learning community. The existence of initial work on a mapping between METS and IMS‐CP is therefore considered a useful development (Ockerbloom et al., 2004; Yee, 2004)

  6. 6.

    METS can deal with hardcopy materials if necessary. By the time the framework was being considered, it had become clear that the DAMS might have to deal with hardcopy books, journals and reports. There are indications that METS would be able to cope with this (Tennant, 2004).

  7. 7.

    It is able to deal with digital preservation requirements. Long‐term preservation of digital assets is an issue for the Park and metadata is a key part of any solution (Alemneh et al., 2002). Many different approaches to preservation metadata exist (ERPANET, 2003). METS could provide one solution, in that METS documents could serve as “Information Packages” – containers of context information and preservation description information – within the Open Archival Information System (OAIS) reference model (CCSDS, 2001), a proposed architecture for the preservation of digital materials that has either informed, or been explicitly adopted by, many prominent digital preservation initiatives (OCLC/RLG Working Group on Preservation Metadata, 2002; Guenther and McCallum, 2003). In fact, METS was designed to work with OAIS (Coalition for Networked Information, 2003, p. 20).

  8. 8.

    Open source software for handling METS metadata is available if required. Although the approach was subsequently rejected, the availability of open source software capable of handling METS (Staples et al., 2003), or likely to be able to handle it in future (Smith et al., 2003) was also seen as a positive factor when the decision was made.

  9. 9.

    Looked at in detail, METS is a Good Fit for MoPark requirements. Specific points in favour of METS as the basis of a solution for MoPark can be seen by considering some of the ways in which the various elements of a METS document (Cundiff, 2004) apply to the MoPark general requirement detailed above.

  10. 10.

    The METS Header contains metadata describing the METS document itself, including such information as creator, staff involved in creation and maintenance and their roles, and record status. This might enable some MoPark requirements as regards staff roles and metadata and digital object maintenance workflows to be managed.

  11. 11.

    The descriptive metadata section holds or points to pieces of metadata describing an aggregate digital object and its constituent parts. In the MoPark context, it might contain a section pointing to descriptive metadata for Park area maps held on another server, another pointing to a MARC database containing records describing MoPark digital content at the aggregate or interpretive journey level, and several embedded sections containing actual descriptive metadata for the videos and sound files and images that are the parts of the interpretive journey.

  12. 12.

    The administrative metadata section has four sub‐sections: technical metadata (information regarding a digital object file's creation, format, and use characteristics), intellectual property rights metadata (copyright and license information), source metadata (descriptive and administrative metadata regarding the analog source from which a digital library object derives), and digital provenance metadata (information regarding source/destination relationships between files, including master/derivative relationships between files and information regarding migrations/transformations employed on files between original digitisation of an artifact and its current incarnation as a digital library object). MoPark and the National Park would definitely require the first two of these types, and would probably require the last type (for digital preservation purposes). Further investigation is required to determine whether it would require metadata on the analog source of digitised objects, but it is possible that it will, and the section is, in any case, optional.

  13. 13.

    The files metadata section lists, references and gives unique identification numbers to, the constituent electronic files of a digital object and would permit the details of the various constituent digital objects of an Interpretive Journey to be listed and referenced.

  14. 14.

    The structural map metadata section records information on the structure of a digital object. It would store information on the complex structure of an Interpretive Journey – hierarchical levels and sub‐levels and so on. It would be a key part of Interpretive Journey metadata – the main thing required to turn an otherwise loosely connected set of constituent digital objects into a coherent journey.

  15. 15.

    The structural links metadata section records information on links between hierarchical levels of digital composites. This would store information about links between hierarchical levels of an interpretive journey. For example, a link from texts describing a native bird on one web‐page to a lower web page offering an image, a sound file of the bird's song, and an video of it nesting (and probably back again).

  16. 16.

    The behaviour metadata section records executable behaviours associated with “playing” a constituent digital object. This would store information about executable behaviours associated with constituent parts of an interpretive journey and about the programs required to run them – for example the name and location of a program required to run a video sequence or an animation.

  17. 17.

    An additional point in its favour is the increasing availability of METS extension schemas designed to deal with a range of digital object types (text files, sound files, images, video files) in commonly accepted ways (Guenther, 2004; Library of Congress, 2004b; McCallum, 2004).

Other elements of the framework; Phase II

Other elements of the framework

In addition to the proposal to adopt METS as the basis for the framework for Phase II, two further recommendations were made on the basis of the analysis of general requirements itemised above.

  1. 1.

    That the framework should assume the adoption, where possible and appropriate, of national and international standards relevant to the field. For example, in respect of descriptive metadata, it is sensible to investigate whether the use of Library of Congress Subject Headings (LCSH) on the one hand, and of the Dewey Decimal Classification (DDC) system on the other, provide the best basis for subject description and classification of MoPark and Park digital materials. A good deal of work is going on as regards interoperability in Scotland (Dunsire and Macgregor, 2003; Law et al., 2002; Nicholson and Macgregor, 2003) and this has recognised the significance of the HILT project, which has DDC and LCSH as key schemes (Nicholson, 2003; Shiri et al., 2004).

  2. 2.

    The framework should allow for and facilitate cooperation with other key players with similar needs and interests to harmonise approaches and ensure interoperability beyond Park systems. An obvious partner in this area is the Scottish Cultural Portal Project (Winch, 2003), which has similar aims in a number of areas and an overlap in terms of client groups served. An example of possible cooperation here would be in the area of subject terminology. Even if LCSH were used as the basis of MoPark subject description, it is likely that it would have to be adapted and important that adaptations be harmonised with those made by other Scottish players. This and other related matters should be explored during Phase II, at which time other possible partners may also be identified. For example, it would be sensible to determine whether there is a need to work with other bodies engaged in providing digital learning materials to schools.

Phase II

As will be evident, the METS‐based framework proposed above provides a suitable framework within which to meet the metadata requirements of MoPark and the Park. As indicated earlier, however, there are problems in identifying the detailed requirement at this point in project development – problems that cannot be readily resolved through discussion and analysis as yet. Moving forward will require – at minimum – further work with actual examples of the complex digital objects (Interpretive Journeys) likely to form the primary elements of the Level 1 and 2 requirements, more experience amongst project and Park personnel of the issues and problems associated with managing complex digital objects, and an in‐depth survey of the likely range of other digital objects, their usage, and their life‐cycles.

A phased approach to determining the detailed requirements as regards metadata, metadata standards, and metadata management is therefore both necessary and sensible. This approach has been discussed and agreed with the project leaders and will be reported in a later paper.

Discussion

Two points of interest worth noting emerged from the initial stages of the Appraisal process. First, whilst this type of phased approach to the determination of metadata requirements will not be either appropriate or necessary in all cases, it is likely to be of value in cases with similarities to the MoPark situation. Specifically, where:

  1. 1.

    The approach is practical, being no immediate pressure to create large numbers of metadata records quickly.

  2. 2.

    The approach is necessary and beneficial, because:

  • The objects to be described are not available for examination and some aspects of their nature are unclear.

  • The staff who will have to manage them have insufficient familiarity with them – and the issues they are likely to raise – to respond appropriately to an investigator's questions regarding management requirements.

The author is aware of at least one other current project (DIDET Project, 2004) where similar conditions apply and where a similar phased approach is both appropriate and necessary.

The challenge in MoPark – and one that is likely to arise in other, similar, projects – was to convince the project team that a phased approach need not threaten project scheduling. The initial assumption of the MoPark team had been that metadata requirements would be determined in the first few months of the project, would inform the design of the DAMS, and inform the training of the staff who would use it to manage the digital repository. This would have allowed management processes to begin as soon as the digital objects became available to catalogue – around the end of year 1 of the project.

In the event, it was relatively easy to persuade the team, not only that this approach was neither possible nor in the best interests of the project, but also that it was, given their circumstances, not necessary. Only two of the complex Interpretive Journeys were likely to be available by the end of year 1 – and, whilst these would comprise, between them, much larger numbers of component digital objects, the pressure to create metadata quickly to manage them was not as great as initially proposed. Implications for the design of the DAMS were an issue, but this issue too was resolved easily enough. METS provided an indication of the likely types of metadata, of the likely need to be able to output the metadata as a METS document in XML, and of the need to allow a Park DAMS administrator to add new fields as required under each metadata type within DAMS. This was more than enough to inform the initial work on the system prototype – particularly when combined with other facilities likely to be needed in any such system (in‐context help, error checking, authority control, and so on).

Second, a working assumption was formulated that will inform the development and field‐testing of both the DAMS and (albeit to a lesser extent) the detailed metadata requirements as they emerge. This is that a key requirement for the effective management of the digital composites called Interpretive Journeys is the creation of an online environment for metadata management that will not be a barrier in any sense3 to the individual and collective application of staff expertise to the level required for the various tasks entailed in managing these composite materials and their constituent digital objects.

Conclusion

Within the timescale allotted for the original Metadata Options Appraisal it has been possible to determine the general requirements of MoPark and the Loch Lomond and Trossachs National Park in respect of metadata, metadata standards, and metadata management, and to identify a framework within which these requirements can be met.

Since early analysis suggests that it will not be feasible to determine all of the detail of the requirement without creating and field‐testing at least one of the proposed Interpretive Journeys, and that fully determining the requirements in respect of the other two levels (and Level 3 requirements in particular) will face similar (but longer term) difficulties, a phased approach to agreeing the full and detailed requirement was recommended. Phase 1:

  1. 1.

    Set out a framework within which the full requirement can safely develop.

  2. 2.

    Proposed a flexible forward development path that will progressively facilitate the specification of detailed metadata‐related needs for Levels 1, 2, and 3 and ultimately lead to the determination of the full requirement.

The framework proposed has three elements:
  1. 1.

    Adoption of the XML‐based Metadata Encoding and Transmission Standard (METS) developed as an initiative of the Digital Library Federation and maintained in the Network Development and MARC Standards Office of the Library of Congress. METS provides an integrated and coherent infrastructure for all of the metadata types likely to be required within MoPark and the Park – descriptive metadata, administrative metadata (technical metadata, rights metadata, analogue source information, digital object files provenance), Files metadata, Structural Map metadata, Structural Links metadata, and Behaviour metadata. It also provides for the levels of complexity, structure, and interoperability likely to be required as the digital repository progresses.

  2. 2.

    The adoption, where possible and appropriate, of national and international standards relevant to the field (for example, the METS format would allow the use of MARC and LCSH in the descriptive metadata section should Phase II establish the requirement).

  3. 3.

    Cooperation with other key players with similar needs and interests to harmonise approaches and ensure interoperability beyond Park systems (for example, the Scottish Cultural Portal project which has needs and a client group that overlap with those of MoPark).

This framework will guide Phase II of the Appraisal, the results of which will be reported in a later paper.

Notes

Required to allow visitors at Visitor Centre web sites to explore cultural, historical or scientific themes on other Scottish systems or, indeed, potential visitors to find links from external web sites to Park Interpretive Journeys

This is reflected in the final statement of the general requirement, particularly conclusion 1, which closely follows the METS metadata types pattern.

“In any sense” is taken to encompass a wide range of aspects, including the need for staff to be able to switch seamlessly from managing complex digital objects and their constituent parts to managing other digital objects outwith Interpretive Journeys and even hard‐copy materials such as books.

Appendix. Glossary/abbreviations

DAMS Digital Asset Management System (proprietary software used in the MoPark project) DC Dublin Core Metadata Element Set (http://dublincore.org/documents/dces/) DDC Dewey Decimal Classification (www.oclc.org/dewey/) DIDET Digital Libraries for Global Distributed Innovative Design, Education and Teamwork Project (http://dmem1.ds.strath.ac.uk/didet/) EAD Encoded Archival Description (www.loc.gov/ead/) e‐GIF UK e‐Government Interoperability Framework (www.govtalk.gov.uk/schemasstandards/egif.asp) e‐GMS UK e‐Government Metadata Standard (www.govtalk.gov.uk/schemasstandards/metadata.asp) GIS Geographic Information System (www.gis.com/whatisgis/overview.html) GPS Global Positioning System HILT High‐Level Thesaurus Project (http://hilt.cdlr.strath.ac.uk/) IMS Instructional Management Systems IMS‐CP IMS Global Learning Consortium – Content Packaging Information Model (www.imsglobal.org/content/packaging/cpv1p1p2/imscp_infov1p1p2.html) ISO International Standards Organization (www.iso.org/) LCSH Library of Congress Subject Headings (http://authorities.loc.gov/) MARC Machine Readable Cataloging (www.loc.gov/marc/) MARCXML XML framework for MARC (www.loc.gov/standards/marcxml/) METS Metadata Encoding Transmission Standard (www.loc.gov/standards/mets/) MIX Metadata for Images in XML Standard (www.loc.gov/standards/mix/) MODS Metadata Object Description Schema (www.loc.gov/standards/mods/) MPEG‐7 Moving Picture Expert Group 7 – Multimedia Content Description Interface (www.chiariglione.org/mpeg/standards/mpeg‐7/mpeg‐7.htm) MPEG‐21 Moving Pictures Expert Group 21 – Multimedia Framework initiative (www.chiariglione.org/mpeg/standards/mpeg‐21/mpeg‐21.htm) NISO National Information Standards Organization (www.niso.org/) OCLC Online Computer Library Center (www.oclc.org) OAIS Open Archival Information System (www.ccsds.org/RP9905/) PDA Personal Digital Assistant RLG The Research Libraries Group (www.rlg.org) TEI Text Encoding Initiative (www.tei‐c.org/) TextMD Schema for Technical Metadata for Text (http://dlib.nyu.edu/METS/textmd.xsd) SCORM Sharable Courseware Object Reference Model (www.adlnet.org/scorm/index.cfm) XML Extensible Markup Language (www.w3.org/XML/)

Corresponding author

Dennis Nicholson can be contacted at: [email protected]

References

Advanced Distributed Learning (2004), SCORM Overview, available at: www.adlnet.org/index.cfm?fuseaction=scormabt (accessed 17 September 2004).

Alemneh, D.G. et al., (2002), “A metadata approach to preservation of digital resources: the University of North Texas Libraries' experience”, First Monday, Vol. 7 No. 8, available at: http://firstmonday.org/issues/issue7_8/alemneh/index.html (accessed 13 September 2004).

Australian Flexible Learning Framework (2004a), Content Packaging, available at: www.flexiblelearning.net.au/interop/topics/packaging.htm#1 (accessed 17 September 2004).

Australian Flexible Learning Framework (2004b), Projects: Learning Object Repository Implementation, available at: http://flexiblelearning.net.au/projects/learningobjectimplementation.htm (accessed 17 September 2004).

Australian Flexible Learning Framework (2004c), Projects: Quality e‐Learning Resources, available at: http://flexiblelearning.net.au/projects/toolboxes2004.htm (accessed 17 September 2004).

Bekaert, J. et al. (2003), “Using MPEG‐21 DIDL to represent complex digital objects in the Los Alamos National Laboratory Digital Library”, D‐Lib Magazine, Vol. 9 No. 11, available at: www.dlib.org/dlib/november03/bekaert/11bekaert.html (accessed 13 September 2004).

Bekaert, J. et al. (2004), “Using MPEG‐21 DIP and NISO OpenURL for the dynamic dissemination of complex digital objects in the Los Alamos National Laboratory Digital Library”, D‐Lib Magazine, Vol. 10 No. 2, available at: www.dlib.org/dlib/february04/bekaert/02bekaert.html (accessed 13 September 2004).

Broadley, L. (2004), “Metadata and the e‐GMS”, Update, Vol. 3 Nos 7‐8, pp. 323.

Cabinet Office, Office of the e‐Envoy (2004a), e‐Government Interoperability Framework Version 6.0, available at: www.govtalk.gov.uk/documents/e‐gif‐v6‐0.pdf (accessed 15 September 2004).

Cabinet Office, Office of the e‐Envoy (2004b), e‐Government Metadata Standard Version 3.0, available at: www.govtalk.gov.uk/documents/eGovMetadataStandard%2020040429.pdf (accessed 15 September 2004).

CCSDS (2001), Reference Model for an Open Archival Information System (OAIS), available at: www.ccsds.org/documents/pdf/CCSDS‐650.0‐R‐2.pdf (accessed 24 September 2004).

Chandler, R.L. (2002), “Museums in the online archive of California (MOAC): building digital collections across libraries and museums”, First Monday, Vol. 7 No. 5, available at: www.firstmonday.org/issues/issue7_5/chandler/index.html (accessed 17 September 2004).

Chen, C. (2002), “The design of metadata for the Digital Museum Initiative in Taiwan”, Online Information Review, Vol. 26 No. 5, pp. 295306.

Chen, Y. (2000), “A case study in designing Chinese metadata”, Online Information Review, Vol. 24 No. 3, pp. 22934.

Coalition for Networked Information (2003), Current Practices in Digital Asset Management. Version 0.9, available at: www.internet2.edu/pubs/200310‐BP‐DAM.pdf (accessed 20 September 2004).

Cundiff, M.V. (2004), “An introduction to the Metadata Encoding and Transmission Standard (METS)”, Library Hi Tech, Vol. 22 No. 1, pp. 5264.

DIDET Project (2004), DIDET Project, available at: http://dmem1.ds.strath.ac.uk/didet/ (accessed 13 September 2004).

Digital Library Federation (2004), About the DLF, available at: www.diglib.org/about.htm (accessed 17 September 2004).

Dunsire, G. and Macgregor, G. (2003), “Clumps and collection description in the information environment in the UK, with particular reference to Scotland”, Program, Vol. 13 No. 4.

ERPANET (2003), Metadata in Digital Preservation ERPANET Training Seminar, Marburg September 3‐5, 2003, available at: www.erpanet.org/www/products/marburg/documents/erpaTraining‐Marburg_BriefingPaper.pdf (accessed 20 September 2004).

Farb, S.E. and Riggio, A. (2004), “Medium or message? A new look at standards, structures, and schemata for managing electronic resources”, Library Hi Tech, Vol. 22 No. 2, pp. 14452.

Gartner, R. (2002), METS: Metadata Encoding and Transmission Standard, JISC Techwatch report, TSW 02‐05, available at: www.jisc.ac.uk/index.cfm?name=techwatch_report_0205 (accessed 15 September 2004).

Gartner, R. (2003), MODS: Metadata Object Description Schema, JISC Techwatch Report, TSW 03‐06, 2003, available at: www.jisc.ac.uk/index.cfm?name=techwatch_report_0306 (accessed 20 September 2004).

Guenther, R. and McCallum, S. (2003), “New metadata standards for digital resources: MODS and METS”, Bulletin of the American Society for Information Science and Technology, Vol. 29 No. 2, pp. 1215.

Guenther, R.S. (2004), “Using the Metadata Object Description Schema (MODS) for resource description: guidelines and applications”, Library Hi Tech, Vol. 22 No. 1, pp. 8998.

Haynes, D. (2004), “The five purposes of metadata”, Update, Vol. 3 Nos 7‐8, pp. 301.

IMS Global Learning Consortium Inc. (2004), IMS Global Learning Consortium Inc., available at: www.imsproject.org/content/packaging/ (accessed 24 September 2004).

Institute of Museum and Library Services (2001), A Framework of Guidance for Building Good Digital Collections, available at: www.imls.gov/pubs/forumframework.htm (accessed 15 September 2004).

International Organisation for Standardisation (2002), MPEG‐21 Overview V.5, available at: www.chiariglione.org/mpeg/standards/mpeg‐21/mpeg‐21.htm (accessed 24 September 2004).

Joint Information Systems Committee (2004), Joint Information Systems Committee, available at: www.jisc.ac.uk/ (accessed 13 September 2004).

Law, D., Nicholson, D. and Dunsire, G. (2002), “Whole environment research on distributed and collaborative digital and non‐digital networked libraries in Scotland”, Bibliothek. Forschung und Praxis, Vol. 26 No. 2, pp. 11323.

Library of Congress (2004a), METS Official Web site, available at: www.loc.gov/standards/mets/ (accessed 15 September 2004).

Library of Congress (2004b), AV Prototype Project Working Documents: Extension Schemas for the Metadata Encoding and Transmission Standard, available at: http://lcweb.loc.gov/rr/mopic/avprot/metsmenu2.html (accessed 13 September 2004).

Library of Congress (2004c), Audio‐Visual Prototyping Project: VideoMD Data Dictionary, available at: www.loc.gov/rr/mopic/avprot/DD_VMD.html (accessed 16 September 2004).

Library of Congress Help Desk (2003), METS: An Overview and Tutorial, available at: www.loc.gov/standards/mets/METSOverview.html (accessed 17 September 2004).

Lubas, R.L. (2004), “Creating metadata practices for MIT's OpenCourseWare project”, Library Hi Tech, Vol. 22 No. 2, pp. 13843.

McCallum, S.H. (2004), “An introduction to the Metadata Object Description Schema (MODS)”, Library Hi Tech, Vol. 22 No. 1, pp. 828.

MetaMap (2004), MetaMap, available at: http://mapageweb.umontreal.ca/turner/meta/english/index.html (accessed 13 September 2004).

MoPark Project (2004), MoPark Project, available at: www.mopark.net/ (accessed 13 September).

National Digital Forum (2004), Register of Digital Initiatives, available at: http://ndf.natlib.govt.nz/index.htm (accessed 12 September 2004).

National Information Standards Organization (2004), Understanding Metadata, available at: www.niso.org/standards/resources/UnderstandingMetadata.pdf (accessed 15 September 2004).

New Opportunities Fund (2003), EnrichUK, available at: www.nof‐digitise.org/ (accessed 15 September 2004).

New York University Libraries (2004), CNI Structured Content Workshop, available at: http://library.nyu.edu/diglib/StructuredContent/ (accessed 10 June 2005).

Nicholson, D. (2003), “Subject‐based interoperability: issues from the High Level Thesaurus (HILT) Project”, International Cataloguing and Bibliographic Control, Vol. 32 No. 1.

Nicholson, D. and Macgregor, G. (2003), “Devloping the Scottish cooperative infrastructure: the what, who, where, when and why of SPIER”, WIDWISAWN, Vol. 2 No. 1, available at: http://widwisawn.cdlr.strath.ac.uk/Issues/issue2.html (accessed 24 September 2004).

Niu, J. (2002), “A metadata framework developed at the Tsinghua University Library to aid in the preservation of digital resources,”, D‐Lib Magazine, Vol. 8 No. 11, available at: www.dlib.org/dlib/november02/niu/11niu.html (accessed 13 September 2004).

Ockerbloom, J.M. et al. (2004), Digital Library Content and Course Management Systems: Issues of Interoperation July, 2004 Appendix 2: Supporting the Use of Digital Content in Electronic Learning Applications. A Checklist of Digital Repository Service Requirements, With Recommended Best Practices, available at: www.diglib.org/pubs/cmsdl0407/cmsdl0407app2.pdf (accessed 17 September 2004).

OCLC/RLG Working Group on Preservation Metadata (2002), Preservation Metadata and the OAIS Information Model: A Metadata Framework to Support the Preservation of Digital Objects, available at: www.oclc.org/research/projects/pmwg/pm_framework.pdf (accessed 24 September 2004).

Proffitt, M. (2004), “Pulling it all together: use of METS in RLG cultural materials service”, Library Hi Tech, Vol. 22 No. 1, pp. 658.

Semple, N. (2004), “Developing a digital preservation strategy at Edinburgh University Library”, VINE, Vol. 34 No. 1, pp. 337.

Shiri, A., Nicholson, D. and McCulloch, E. (2004), “User evaluation of a pilot terminologies server for a distributed multi‐scheme environment”, Online Information Review, Vol. 28 No. 4, pp. 27383.

Smith, M. et al., (2003), “DSpace: an open source dynamic digital repository”, D‐Lib Magazine, Vol. 9 No. 1, available at: www.dlib.org/dlib/january03/smith/01smith.html (accessed 20 September 2004).

Staples, T. et al., (2003), “The Fedora Project: An open‐source digital object repository management system”, D‐Lib Magazine, Vol. 9 No. 4, available at: www.dlib.org/dlib/april03/staples/04staples.html (accessed 20 September 2004).

Stehno, B. (2003), “METAe‐Automated Encoding of Digitised Texts”, Literary and Linguistic Computing, Vol. 18 No. 1, pp. 7788.

Tennant, R. (2002), “Metadata as if libraries depended on it”, Library Journal, Vol. 127 No. 7, pp. 324.

Tennant, R. (2003a), “A bibliographic metadata infrastructure for the twenty‐first century”, Library Hi Tech, Vol. 22 No. 2, pp. 17581.

Tennant, R. (2003b), “The engine of interoperability”, Library Journal, Vol. 128 No. 20, p. 33.

Tennant, R. (2004), “It's opening day for METS”, Library Journal, Vol. 129 No. 9, p. 28.

W3C (2004), Extensible Markup Language (XML), available at: www.w3.org/XML/ (accessed 24 September 2004).

Winch, S. (2003), “Scotland's portal”, Information Scotland, Vol. 1 No. 6, available at: www.slainte.org.uk/publications/serials/infoscot/voll(6)/make.html (accessed 24 September 2004).

Yee, R. (2004), “A preliminary crosswalk from METS to IMS content packaging”, Library Hi Tech, Vol. 22 No. 1, pp. 6981.

Further Reading

Seadle, M. (2002), “METS and the metadata marketplace”, Library Hi Tech, Vol. 20 No. 3, pp. 2557.

Winch, S. (2003), “Scotland's portal”, Information Scotland, Vol. 1 No. 6, available at: www.slainte.org.uk/publications/serials/infoscot/vol1(6)/make.html (accessed 24 September 2004).

Related articles