Lecture Thursday 24 February 2005
Reflection paper due 3/10/05 15%
Identify a theme/thesis/queston
Choose your own form
metaphors of text mining/knowledge discovery/hypothesis generation and the know;ledge maanagement paradigm
comes doewn to defining the role of information scientists working as professionals always cognizant of the work context instead of as computer scientists
Metadata describes the attributes of an information bearing object (IBO) - document, data set, database, image, artifact, collection, etc.; metadata acts as a surrogate representation of the IBO. A metadata record can include representations of the content, context, structure, quality, provenance, condition, and other characteristics of an IBO for the purposes of representing the IBO to a potential user - for discovery, evaluation for fitness for use, access, transfer, and citation. See also, Meta-information. Examples of metadata format are the MARC format used by the library community Content Standards for Digital Geospatial Metadata developed by the Federal Geographic Data Committee Directory Interchange Format (DIF) used by NASA's Global Change Master Directory Government Information Locator Service (GILS), and Dublin Core set of attributes for electronic resources developed with the lead of the Online Computer Library Center (OCLC). [LH] (fat-albert.alexandria.ucsb.edu:8827/glossary.html)
like a map index
data upon data, layers of description
metadata also used to describe data about "information bearing objects"
metaphor of assigning a card catalog to animals in a zoo
reasonable approach to information
interplay of information technology and people
representation of a thing
isn't that what data is?
metadata can describe itself?
"An alternative solution that promises to mediate these extremes involves the creation of a record that is more informative than an index entry but is less complete than a formal cataloging record. If only a small amount of human effort were required to create such records, more objects could be described, especially if the author of the resource could be encouraged to create the description. And if the description followed an established standard, only the creation of the record would require human intervention; automated tools could discover these descriptions and collect them." (http://www.dlib.org/dlib/July95/07weibel.html#intro)
CAPTURE & REPRESENT
intrinsicness
worth is in part due to collecitons rather than individual items
problem of retrospective conversion: given the things that have beencreated without metadata, who is going to update it?
Intrinsicality
The Dublin Core concentrates on describing intrinsic properties of the object. Intrinsic data refer to the properties of the work that could be discovered by having the work in hand, such as its intellectual content and physical form. This is distinguished from extrinsic data, which describe the context in which the work is used. For example, the "Subject" element is intrinsic data, while transaction information such as cost and access considerations are extrinsic data. The focus on intrinsic data in no way demeans the importance of other varieties of data, but simply reflects the need to keep the scope of deliberations narrowly focussed.
(http://www.dlib.org/dlib/July95/07weibel.html#intro)
retrospective metadata
problem is you have to have those attributes in place that are adequate for describing them
knowledge is justified true belief
the means by which you arrive at the knowledge must be the right means
data is not very good at suggesting its inadequacy, and neither are researchers attempting to use it in order to publish
sometimes it's the attributes that have never been identified that bear the most information
Identify a theme/thesis/queston
Choose your own form
metaphors of text mining/knowledge discovery/hypothesis generation and the know;ledge maanagement paradigm
comes doewn to defining the role of information scientists working as professionals always cognizant of the work context instead of as computer scientists
Metadata describes the attributes of an information bearing object (IBO) - document, data set, database, image, artifact, collection, etc.; metadata acts as a surrogate representation of the IBO. A metadata record can include representations of the content, context, structure, quality, provenance, condition, and other characteristics of an IBO for the purposes of representing the IBO to a potential user - for discovery, evaluation for fitness for use, access, transfer, and citation. See also, Meta-information. Examples of metadata format are the MARC format used by the library community Content Standards for Digital Geospatial Metadata developed by the Federal Geographic Data Committee Directory Interchange Format (DIF) used by NASA's Global Change Master Directory Government Information Locator Service (GILS), and Dublin Core set of attributes for electronic resources developed with the lead of the Online Computer Library Center (OCLC). [LH] (fat-albert.alexandria.ucsb.edu:8827/glossary.html)
like a map index
data upon data, layers of description
metadata also used to describe data about "information bearing objects"
metaphor of assigning a card catalog to animals in a zoo
reasonable approach to information
interplay of information technology and people
representation of a thing
isn't that what data is?
metadata can describe itself?
"An alternative solution that promises to mediate these extremes involves the creation of a record that is more informative than an index entry but is less complete than a formal cataloging record. If only a small amount of human effort were required to create such records, more objects could be described, especially if the author of the resource could be encouraged to create the description. And if the description followed an established standard, only the creation of the record would require human intervention; automated tools could discover these descriptions and collect them." (http://www.dlib.org/dlib/July95/07weibel.html#intro)
CAPTURE & REPRESENT
intrinsicness
worth is in part due to collecitons rather than individual items
problem of retrospective conversion: given the things that have beencreated without metadata, who is going to update it?
Intrinsicality
The Dublin Core concentrates on describing intrinsic properties of the object. Intrinsic data refer to the properties of the work that could be discovered by having the work in hand, such as its intellectual content and physical form. This is distinguished from extrinsic data, which describe the context in which the work is used. For example, the "Subject" element is intrinsic data, while transaction information such as cost and access considerations are extrinsic data. The focus on intrinsic data in no way demeans the importance of other varieties of data, but simply reflects the need to keep the scope of deliberations narrowly focussed.
(http://www.dlib.org/dlib/July95/07weibel.html#intro)
retrospective metadata
problem is you have to have those attributes in place that are adequate for describing them
knowledge is justified true belief
the means by which you arrive at the knowledge must be the right means
data is not very good at suggesting its inadequacy, and neither are researchers attempting to use it in order to publish
sometimes it's the attributes that have never been identified that bear the most information