METADATA FOR INFORMATION RESOURCES
WORKSHOP BACKGROUND, ACKNOWLEDGEMENTS,
AND CONTACT INFORMATION
Workshop Background, Acknowledgements, and Contact Information
This workshop and all related materials are the direct result of a two-year grant to the State Archives Department of the Minnesota Historical Society (MHS) from the National Historical Publications and Records Commission (NHPRC). Work on the “Educating Archivists and Their Constituencies” project began in January 2001 and was completed in May 2003.
The project sought to address a critical responsibility that archives have discovered in their work with electronic records: the persistent need to educate a variety of constituencies about the principles, products, and resources necessary to implement archival considerations in the application of information technology to government functions. Several other goals were also supported:
- raise the level of knowledge and understanding of essential electronic records skills and tools among archivists,
- help archivists reach the electronic records creators who are their key constituencies,
- provide the means to form with those constituencies communities of learning that will support and sustain collaboration, and
- raise the profile of archivists in their own organizations and promote their involvement in the design and analysis of recordkeeping systems.
MHS administered the project and worked in collaboration with several partners: the Delaware Public Archives, the Indiana University Archives, the Ohio Historical Society, the San Diego Supercomputer Center, the Smithsonian Institution Archives, and the State of Kentucky. This list represents a variety of institutions, records environments, constituencies, needs, and levels of electronic records expertise. At MHS, Robert Horton served as the Project Director, Shawn Rounds as the Project Manager, and Jennifer Johnson as the Project Archivist.
MHS gratefully acknowledges the contribution of Advanced Strategies, Inc. (ASI) of Atlanta, Georgia, and Saint Paul, Minnesota, which specializes in a user-centric approach to all aspects of information technology planning and implementation. MHS project staff received training and guidance from ASI in adult education strategies and workshop development. The format of this course book is directly based on the design used by ASI in its own classes. For more information about ASI, visit
For more information regarding the workshop, contact MHS staff or visit the workshop web site at
Robert Horton: / 651-215-5866
Shawn Rounds: / 651-296-7953
Metadata for Information ResourcesWorkshop Background, Acknowledgements, and Contact Information-1
State Archives Department, Minnesota Historical Society, 345 Kellogg Boulevard West, Saint Paul, Minnesota, 55102-1906 / / 651-297-4502 May 2003
INTRODUCTION
Introduction
This unit includes:
Course objectives
Course approach
Course schedule
Administrative items
Participant introductions
Introduction
Course objectives
Upon completion of this course, you will be able to:
understand basic information technology concepts and terminology
understand the definition of metadata
understand how you currently use metadata in your work
discuss the differences between metadata for paper documents and metadata for digital objects
discuss what additional metadata may be needed for digital objects
describe different functions of metadata
discuss the varying metadata standards
Introduction
Course approach
Spiral approach
About the exercises
oThe exercises in this course will be “live”
oThe exercises will be limited in number
oThe exercises may require some oral participation from students
oThere is not a single, 100% correct answer; it is impossible to get an A, because there are many “right” answers
Parking lot
Team teaching
Introduction
Course Schedule
IntroductionUnit I: Defining Metadata
Break
Unit II: Using Metadata with Information Resources
Break
Unit III: Learning More About Metadata and its Functions
Lunch
Unit IV: Recordkeeping Metadata
Break
Unit V: Access Metadata
Unit VI: Preservation Metadata
Break
Unit VII: Putting It All Together
Appendices
Very flexible - driven by your needs. /
Introduction
Administrative Items
Class will begin promptly at the scheduled time.
Need a break: squirm, lie down, tell me!
If something is not clear, if you have a question, or if I say something that does not make any sense:
Stop me!
Discussion is always better than lecture. Success of this workshop is dependent on all participants.
Facilities (Parking, Restrooms, Telephones, Exits)
We will proceed informally.
Introduction
Participant Introductions
NAME and ORGANIZATION:______
Tell us a little bit about your program (think of what you wrote in your self-evaluation):
What training have you had related to metadata?
______
______
Does your organization have a metadata program planned or in place?
______
______
Who are the principal groups, agencies, partners, etc. that have participated in or supported your metadata efforts?
______
______
Are you currently using metadata? What kind and for what projects?
______
______
Metadata for Information ResourcesIntroduction-1
State Archives Department, Minnesota Historical Society, 345 Kellogg Boulevard West, Saint Paul, Minnesota, 55102-1906 / / 651-297-4502 May 2003
UNIT I: DEFINING METADATA
Defining Metadata
This unit includes:
What do we mean by information resources, digital objects, and electronic records?
One person or organization cannot do it all.
Definitions of metadata.
Metadata for traditional paper documents.
Defining Metadata
What do we mean by information resources, digital objects, and electronic records?
Information resources: The content of your information technology projects (data, information, records, images, digital objects, etc.)
Digital object:Information that is inscribed on a tangible medium or that is stored in an electronic or other medium and is retrievable in perceivable form. An object created, generated, sent, communicated, received, or stored by electronic means. [1]
An electronic record is a specific type of digital object with unique characteristics described by archivists and records managers.
Types of digital objects:
e-mailPortable Document Format (PDF) files
web pagesPowerPoint presentations
databasesdigital images
spreadsheets…and many more
word processing documents
Defining Metadata
Digital objects have three components:
Content: Informational substance of the object.
Structure:Technical characteristics of the objects (e.g., presentation, appearance, display).
Context: Information outside the object which provides illumination or understanding about it, or assigns meaning to it.
Defining Metadata
Defining digital objects
Pittsburgh Project Definition
/Order of Values
/ Information Technology ArchitectureContent / Data / Data
Structure / Information / Format
Context / Knowledge / Application
Defining Metadata
One person or organization cannot do it all
Components: X + Y + Z = resources, tools, standards/methodologies, education, technology, partners
Resources: skilled and informed staff, organizational funding, grants, …
Tools: metadata, markup languages, …
Standards/methodologies: Dublin Core, XML, …
Education: formal/informal education, workshops, conferences, …
Technology: TagGen, XMetal, …
Partners: stakeholders [record creators, agencies, users], other archivists, professional organizations [Midwest Archives
Conference (MAC), Society of American Archivists (SAA), National Association of Government Archives and Records Administrators (NAGARA), Association of Records Managers and Administrators (ARMA)], …
This class will cover a necessary tool: Metadata
Defining Metadata
Exercise: What do you think metadata is?
Defining Metadata
Different people and professions have different definitions of metadata
data about data
information about information
data about objects
descriptive information which facilitates management of, and access to, other information
evaluation tool
Defining Metadata
Different people and professions also use metadata to fulfill different functions
Description: what is in the object, what the object is about
Discovery: the location of the object
Evaluation: the value of the object, is this the object I want to use
Management: control of the access, storage, preservation, and disposal of an object
Defining Metadata
Metadata for traditional paper documents
- entry in a library card catalog
- standards: description, archival practices (donor agreements, finding aids, acquisition reports, etc.)
- records retention schedule
- book/journal publication information
legend on a map
[2]
One person’s data is another person’s metadata.
Defining Metadata
Key Messages
Digital objects are investments which we all need to manage and understand.
One person or organization cannot do it all.
Metadata is used for…
oDescription, Discovery, Evaluation, and Management
Metadata is a tool.
Metadata for Information ResourcesI-1
State Archives Department, Minnesota Historical Society, 345 Kellogg Boulevard West, Saint Paul, Minnesota, 55102-1906 / / 651-297-4502 May 2003
UNIT II: USING METADATA WITH INFORMATION RESOURCES
Using Metadata with Information Resources
This unit includes:
Metadata for digital objects.
What are some options for storing metadata?
Why is metadata useful?
Using Metadata with Information Resources
What is metadata for digital objects?
Description: what is in the object, what the object is about
Discovery: the location of the object
Evaluation: the value of the object, is this the object I want to use
Management: control of the access, storage, preservation, and disposal of an object
Using Metadata with Information Resources
Exercise: E-mail (management, description, discovery)
From: Rounds, Shawn
Sent:Tuesday, February 19, 2002 9:09 AM
To:Craig A Steiner (E-mail); Eileen Mccormack (E-mail); Horton, Robert; James Taylor (E-mail); Johnson, Jennifer; Karen Bondy (E-mail); Linda Feist (E-mail); Marsha K Haagenson (E-mail); Patricia Dunlop (E-mail); Steve Retzlaff (E-mail)
Cc:Bruce Yurich (E-mail); Ed Potter (E-mail); Janice Zarkin (E-mail); Jay. Achenbach (E-mail)
Subject:Metadata Comm: New Draft
Hello --
As promised at our last meeting, here is the latest draft of the standard, now complete with all elements. I've cleaned it up, removing the track-changes marks and doing some editing to provide consistency in phrasing, formatting, etc. It's NOT the final draft, but it's the starting point for our next round of work. It should also be online at the committee web site in the next few days (
Believe it or not, we're nearing the end! Thanks for your continued interest and participation --
Shawn
Using Metadata with Information Resources
Exercise: Web Page (evaluation, description, discovery)
Using Metadata with Information Resources
Exercise: Word processing document (discovery and management)
Using Metadata with Information Resources
Where are some options for storing metadata?
1.As part of the object or document being described:
oa web page
2.A separate file that can be externally accessed but is linked to the object or document:
oGeoGateway: a Minnesota Geographic Data Clearinghouse
3.A separate file:
oPALS: Minnesota Historical Society online catalog
Using Metadata with Information Resources
Why is metadata useful?
Everyone needs metadata to help manage and use digital objects.
Collaboration with partners and stakeholders is crucial to ensure that everyone’s requirements are met and that efforts are coordinated.
Using Metadata with Information Resources
Why is metadata useful?
New questions and issues are raised by information technology.
- Who created the file and who has accessed it? Is the file reliable?
- Who owns the file?
- Where is my file?
- Which files are duplicates?
- What version of the file is this? Is it the most current one? Is it the official copy?
- Where are the backup files?
- What storage media are the digital objects saved on? What software/hardware was used to save these digital objects?
- When do I need to migrate or convert my data?
- What is the retention period for my records?
- …and many more
Using Metadata with Information Resources
Why is metadata useful?
Metadata helps with:
- discovery and admissibility issues
- data access requirements (including privacy and data sharing issues)
- records retention requirements
- legislation and organizational policies
- …and much, much more
Using Metadata with Information Resources
Different metadata standards serve different functions
Data Modeling
Systems Management
GIS (Geographic Information System)
Recordkeeping
Access
Preservation
Using Metadata with Information Resources
Key Messages
We need more metadata for digital objects than paper documents.
o(e.g., creator and modifier, time and date created and modified, file format, version, hardware and software stored and created on)
Some options for storing metadata include:
oAs part of the object or document being described
oA separate file that can be externally accessed but is linked to the object or document
oA separate file
Metadata is useful for everyone.
oMetadata helps us address new questions and issues raised by information technology
oOur legal and business mandates require metadata
There are different metadata standards for different functions.
Metadata for Information ResourcesII-1
State Archives Department, Minnesota Historical Society, 345 Kellogg Boulevard West, Saint Paul, Minnesota, 55102-1906 / / 651-297-4502 May 2003
UNIT III: LEARNING MORE ABOUT METADATA AND ITS FUNCTIONS
Learning More About Metadata and Its Functions
This unit includes:
Primary and secondary uses of data.
An introduction to different types of metadata standards.
oData Modeling
oSystems Management
oGeographic Information System (GIS)
Learning More About Metadata and Its Functions
Primary and secondary uses of data requires metadata
Primary use: Why you create or use data.
Secondary use:When anyone else wants to use the data.
Metadata makes re-use possible. Metadata standards allow for more consistent and efficient description, discovery, evaluation, and management.
Learning More About Metadata and Its Functions
Introduction to some different types of metadata standards: Data Modeling [3]
Data modeling is a graphic representation of a system and its metadata. It is systems documentation that facilitates the function of management.
The Process of Modeling [4]
Learning More About Metadata and Its Functions
Introduction to some different types of metadata standards: Data Modeling (continued)
Data modeling is a business and technical tool that encompasses a set of techniques for analyzing business requirements and designing information system components to manage associated data needs. Data models graphically capture and record business decisions, facilitate planning, and offer a means of understanding information relationships, structures, and processes.
Important definitions:
Entity:anything we want to capture in our model - e.g., a person, a thing, or a function.
Attribute:information about an entity - e.g., a person's name, address, or age.
Used for depicting systems and functions
Associated with business processes
Users include data administrators, business analysts, system developers, auditors
Numerous modeling tools available
oComputer Aided Software Engineering (CASE) tools [5]
Learning More About Metadata and Its Functions
Introduction to some different types of metadata standards: Systems Management
Necessary for day-to-day system functions
Associated with data administration, databases, data warehouses
Examples include field size, allowable values
Users include systems analysts, data administrators, business analysts, software developers, planners, and auditors
Learning More About Metadata and Its Functions
Introduction to some different types of metadata standards: Systems Management (continued)
Specification and Standardization of Data Elements. ISO/IEC 11179, Final draft international standard. [6]
ISO/IEC 11179: Metadata Registries (2001 draft revisions)
Part 1: Framework for the Specification and Standardization of Data Elements [7]
Part 2: Classification for Data Elements
Part 3: Basic Attributes of Data Elements (Registry Metamodel) [8]
Part 4: Rules and Guidelines for the Formulation of Data Definitions
Part 5: Naming and Identification Principles for Data Elements [9]
Part 6: Registration of Data Elements
Purpose of standard:“to give concrete guidance on the formulation and maintenance of discrete data element descriptions and semantic content (metadata) that shall be used to formulate data elements in a consistent, standard manner. It also provides guidance for establishing a data element registry.”
Learning More About Metadata and Its Functions
Introduction to some different types of metadata standards: Systems Management (continued)
Useful for data warehouses
What is a data warehouse? [10]
“Data warehouses are computer-based information systems that are home for "secondhand" data that originated from other applications and/or external systems or sources. Warehouses optimize database query and reporting tools because of their ability to analyze data, often from disparate databases and in interesting ways. They are a way for managers and decision makers to extract information quickly and easily in order to answer questions about their business. In other words, data warehouses are read-only, integrated databases designed to answer comparative and "what if" questions. Unlike operational databases that are set up to handle transactions and that are kept up to date as of the last transaction, data warehouses are analytical, subject-oriented and are structured to aggregate transactions as a snapshot in time.”
This metadata helps you evaluate data and answer the following questions:
oWhat’s the source of the data?
oHas the data recently been cleansed or transformed?
oIs this data appropriate for my needs?
Learning More About Metadata and Its Functions
Systems management metadata example No.1
Learning More About Metadata and Its Functions
Systems management metadata example No.2
Learning More About Metadata and its Functions
Introduction to some different types of metadata standards: Geographic Information Systems (GIS) [11][12][13]
The seven sections of the Minnesota Geographic Metadata Guidelines:
- Identification Information
- Data Quality Information
- Spatial Data Organization Information
- Spatial Reference Information
- Entity and Attribute Information
- Distribution Information
- Metadata Reference Information
Learning More About Metadata and Its Functions