Asymmetric Information Distances for Automated Taxonomy Construction

13 Pages Posted: 27 Aug 2008  

Wei Lee Woon

Masdar Institute of Science and Technology (MIST)

Stuart Madnick

Massachusetts Institute of Technology (MIT) - Sloan School of Management

Date Written: August 25, 2008

Abstract

A novel method for automatically constructing taxonomies for specific research domains is presented. The proposed methodology uses term co-occurence frequencies as an indicator of the semantic closeness between terms. To support the automated creation of taxonomies or subject classifications we present a simple modification to the basic distance measure, and describe a set of procedures by which these measures may be converted into estimates of the desired taxonomy. To demonstrate the viability of this approach, a pilot study on renewable energy technologies is conducted, where the proposed method is used to construct a hierarchy of terms related to alternative energy. These techniques have many potential applications, but one activity in which we are particularly interested is the mapping and subsequent prediction of future developments in the technology and research.

Keywords: Taxonomy Construction, Asymmetric Information

Suggested Citation

Woon, Wei Lee and Madnick, Stuart, Asymmetric Information Distances for Automated Taxonomy Construction (August 25, 2008). MIT Sloan Research Paper No. 4712-08. Available at SSRN: https://ssrn.com/abstract=1256562 or http://dx.doi.org/10.2139/ssrn.1256562

Wei Lee Woon (Contact Author)

Masdar Institute of Science and Technology (MIST) ( email )

MASDAR
PO Box 54115
Abu Dhabi
United Arab Emirates

Stuart E. Madnick

Massachusetts Institute of Technology (MIT) - Sloan School of Management ( email )

E53-321
Cambridge, MA 02142
United States
617-253-6671 (Phone)
617-253-3321 (Fax)

Paper statistics

Downloads
86
Rank
241,366
Abstract Views
5,566