Inf 722 Information Organisation

Fall, 2007

Home Assignment 2 (Gangolly)

This home assignment is due on October 29, 2007.

The objective of this assignment is to familiarize you with the way ontologies could be built using software such as Protégé, provide some hands-on experience in building a small ontology, and help you prepare for the semester project.

You may use the same text documents for this assignment, or use some other document in the domain of your interest. You may use a relatively small document just to gain some experience, or, (specially if you foresee continued interest in KOM) use a longer document and use this assignment as a stepping-stone for the semester paper.

If you expect to have an enduring interest in KOM, you may like to use this assignment as well as the final paper/project to narrow your interests so that you can use it as the first step towards your dissertations. We will discuss the paper requirement of the course in the class on Monday.

In doing this assignment as well as the final paper/project for my part of the course, you will find the following reading extremely helpful:

Ontology Development 101: A Guide to Creating Your First Ontology

Natalya F. Noy and Deborah L. McGuinness

http://protege.stanford.edu/publications/ontology_development/ontology101-noy-mcguinness.html

______

Step 1: Choose the text from which you will extract, at least, partially, the vocabulary to be the basis for the construction of the ontology.

Step 2: Extract the vocabulary, and choose a few, say 6-8 concepts (sometimes called classes) for further analysis. You may use your judgment based on your knowledge of the domain from which you selected the text.

Step 3: Extract the properties of each concept describing various features and attributes of the concept (slots, sometimes called roles or properties), and restrictions on slots (facets, sometimes called role restrictions).

Step 4: Identify all the instances in the chosen text of the classes (concepts) you chose in step 2.

Step 4: Use Protégé to create the ontology for the concepts extracted.