AccessMyLibrary provides FREE access to over 30 million articles from top publications available through your library.

Establishing a global digital format registry.(Digital Library Federation)

Library Trends

| June 22, 2005 | Abrams, Stephen L. | COPYRIGHT 2008 Johns Hopkins University Press. This material is published under license from the publisher through the Gale Group, Farmington Hills, Michigan.  All inquiries regarding rights should be directed to the Gale Group. (Hide copyright information)Copyright

ABSTRACT

Detailed knowledge of the internal properties of digital representation formats is necessary to interpret properly the full information content of otherwise opaque digital objects. These properties form an important component of the representation information needed by repository workflows regardless of local preservation strategy and infrastructure decisions. The Digital Library Federation (DLF) has sponsored preliminary investigations toward establishing a Global Digital Format Registry (GDFR) that will function as a sustainable utility for maintaining the bindings between public identifiers for digital formats and the significant syntactic and semantic properties of those formats. A sustainable GDFR should prove to be of great utility to archives, libraries, digital repositories, and other organizations and individuals interested in the long-term viability of digital assets.

DIGITAL FORMATS

It has become commonplace for digital objects to be acceptable and valued assets under the collection development policies of many libraries, archives, museums, and other scientific and cultural heritage repositories with long-term preservation mandates. In general, a digital object can be considered as the encapsulation in digital form of some piece of abstract intellectual content. More specifically, a digital object is the aggregation of one or more formatted content streams representing the primary content of the object as well as associated descriptive, administrative, technical, and structural metadata. Without a thorough understanding of the format of those content streams, the ability to recover the original intellectual content from which those streams were derived is severely compromised, if not made impossible. Furthermore, common agreement on the syntax and semantics associated with an object's formatted content streams is necessary for the effective interchange of that object, whether between institutions implementing different technological infrastructures or between the various processing steps applied to the object as it passes through its intra-institutional life cycle. In essence, a format is the property associated with a content stream that provides the typing information necessary for its proper interpretation.

More formally, a format is a reversible, byte-serialized encoding of an abstract information model, which is itself a formal expression of exchangeable knowledge (International Organization for Standardization, 2003). A format defines the syntactic and semantic rules for the mapping from an information model to a byte stream and the inverse mapping from that byte stream back to the original information model. Historically, discussions of formats have been couched in terms of "file formats." However, as there are many contexts, such as the network transport of formatted content streams or consideration of content streams at a level of granularity finer than that of an entire file, where specific reference to "file" is inappropriate, the more general term "digital formats" will be used in this article.

FORMAT DEPENDENCIES IN REPOSITORY OPERATION

Digital repository operations can be distinguished into two broad categories: (1) those that are performed independent of the internal properties of its digital objects; and (2) those that are performed dependent upon the internal characteristics of the objects or, in other words, their format. With regard to the latter category, format dependencies exist in many, if not most, phases of repository operation. Figure 1 presents an idealized repository workflow based on the Open Archival Information System (OAIS) reference model (International Organization for Standardization, 2003). Although originally developed by the space science community, the OAIS model defines a general approach that is broadly applicable to repositories operating in nonscientific domains. It has been widely adopted as the conceptual framework for repository architecture and operation and has become part of the lingua franca within the digital preservation community.

[FIGURE 1 OMITTED]

Ingest Dependencies

In OAIS terms, digital objects are delivered to an archive or repository in the form of a Submission Information Package (SIP), a conceptual data structure that encapsulates both primary content and representation information about that content. Representation information is information that is necessary to map object content into more meaningful constructs relative to some designated community--in other words, metadata (Holdsworth & Sergeant, 2000). The specific format of an object content stream within a SIP is an important technical component of SIP metadata.

The OAIS Ingest function is responsible for Quality Assurance (QA) validation of SIP content. Some repositories may operate under local policies or statutory regimes that mandate an obligation to accept all SIPs regardless of validation status, while others may implement more stringent policies that reject SIPs that are not well formed or well characterized. Regardless, it is a reasonable repository best practice to validate incoming SIP content streams relative to the stated or inferred formats of those streams. Even for repositories that do not use validation status as an acceptance criterion, that status is nevertheless an important preservation metadata property that characterizes the state of a digital object at the point of ingest. Validation is performed with respect to the specific syntactic and semantic rules established by the format to which a content stream purportedly conforms. The Ingest function is the most effective point at which to detect and remediate errors occurring in archival materials (National Archives and Records Administration et al., 1999). Once digital objects are accepted into a repository, where they may not be accessed for significant periods of time, effective channels of communication with the original creators to ascertain their authorial intent with respect to those objects may become difficult, if not impossible.

The Ingest function is also responsible for disaggregating a SIP, passing the descriptive metadata to the archive Data Management function, and transforming the SIP into an Archival Information Package (ALP) encapsulating primary content and administrative and technical metadata. It is not necessary for object content streams within an ALP to have the same formats as the corresponding content streams in the SIP. In the interest of data homogeneity and its concomitant impact on operational efficiencies, many repositories may choose to define a restricted set of canonical AlP formats to which SIP content streams are transformed during the SIP-to-ALP conversion process. Quality assurance checks must be applied subsequent to all content stream…

Related articles from newspapers, magazines, journals, and more
BigBand Teams Up With Verimatrix on Secure Ad Insertion - BigBand MSP2000...
Press release article from: M2 Presswire November 24, 2008 700+ words
...Enables Advertisement Splicing into Verimatrix VCAS Encrypted Content Streams while Maintaining the Security of Primary Programming Content...partnership with Verimatrix, we can now produce MPEG-compliant content streams including seamless ad insertions. As a result, our network...
Viacast Introduces New Broadband Networking Solution; Forte 80 IP Gateway...
Press release article from: PR Newswire February 3, 2000 700+ words
...class" Gateway is fully Digital Video Broadcast (DVB) compliant and ideally suited to provide media-rich broadband IP content streams to a satellite uplink facility, cable headend, multiple system operator (MSO) or corporate data center over satellite...
CTO CONNECTION: The show must grow on - As rich content streams into the...
Magazine article from: InfoWorld Dickerson, Chad October 14, 2002 700+ words
AS TECHNOLOGY becomes less about data processing and more about enabling communication and information delivery -- whether it's headline news or a Webcast to your shareholders -- most companies are now accidental media outlets. This is forcing CTOs to reckon with the same issues traditionally found
Toshiba and ViXS Agree to Develop Wireless PVR Products; Multiple Analog and...
Press release article from: PR Newswire January 7, 2004 700+ words
LAS VEGAS, Jan. 7 /PRNewswire/ -- ViXS Systems (Booth #23274), the leading developer of video networking chipsets and software, announced today it is closely working with Toshiba America Electronic Components, Inc. (TAEC) to develop a cost-effective personal video recorder (PVR) reference platform
Faroudja, Inc. Presents New Line of Digital Format Translators For...
Press release article from: Business Wire March 27, 1998 700+ words
...professional broadcasters, the Faroudja Digital Format Translators(tm). The line of Digital Format Translators provides a range of modular...broadcasters, whatever their choice of digital format turns out to be." "Faroudja's Digital...
Faroudja Inc. Reports First-Quarter Results; Digital Format Translator for...
Press release article from: Business Wire April 21, 1998 700+ words
...are the introduction of Faroudja's Digital Format Translators for broadcasters, and...We introduced our new family of Digital Format Translators at the National Association...pricing was outstanding. "The DFT 5000 Digital Format Translator with Aspect Ratio Control...
Hemingway's work in digital format from January 5.
News wire article from: PTI - The Press Trust of India Ltd. December 31, 2008 700+ words
Hemingway's work in digital format from January 5 Washington, Dec...novels, will soon be available in digital format. About 3000 documents left behind...documents are so far reproduced in digital format, like telegrams, letters...
Baltimore theater trying to convert to digital format.
Newspaper article from: Daily Record (Baltimore, MD) March 25, 2002 700+ words
...to present movies in an electronic digital format.<p>Theater owner Tom...it will be equipped to show films in digital format as well as on traditional 35mm film...Clones set to be released May 16 in a digital format. It was Lukas hope that more than...
THOMCAST Teams with Faroudja Prominent System Integrator to Market Digital...
Press release article from: Business Wire February 2, 2000 700+ words
...today that it will be incorporating Digital Format Translator(TM) (&uot;DFT...THOMCAST announces the sale of Faroudja Digital Format Translators products to several prominent...amp;uot; Mr. Sakata added. The Digital Format Translator provides HDTV-quality...
Northwest Airlines makes in-flight magazine available in digital format.
Newspaper article from: Telecomworldwire April 14, 2008 700+ words
...makes in-flight magazine available in digital format(C)1994-2008 M2 COMMUNICATIONS...NYSE:NWA), is now available in digital format, online, at http://www.digitalnwaworldtraveler...flight magazine to move to a true digital format. It claimed the magazine will now...
For more facts and information, see all results
©2010 Gale, a part of Cengage Learning. All rights reserved. About us | FAQs | Contact us | Privacy policy | Terms and conditions
Other Gale sites: Encyclopedia.com | HighBeam Research | Acquire Content | Books & Authors | Goliath | MovieRetriever | Smart QandA

The AccessMyLibrary advertising network includes: womensforum.com GlamFamily