Kno.e.sis Publications

TaxaMiner: An Experimentation Framework for Automated Taxonomy Bootstrapping

Document Type

Article

Publication Date

2005

Abstract

Constructing domain ontologies for the Semantic Web is a human- and resource-intensive process, and efforts to reduce this cost are crucial for the Semantic Web to scale. We present a framework for automated taxonomy construction that involves: (a) generation of a cluster hierarchy from a document corpus using statistical clustering and NLP techniques; (b) extraction of a topic hierarchy from this cluster hierarchy; and (c) assignment of labels to nodes in the topic hierarchy. We identify metrics for estimating topic hierarchy quality and the parameters of an experimentation framework. MEDLINE served as the document corpus and the MeSH thesaurus as the gold standard.

Comments

Attached is the unpublished, author's version of this article. The final, publisher's version can be found at http://inderscience.metapress.com/content/a3ull2nmnexl6xla/?genre=article&issn=1741-1106&volume=1&issue=2&spage=240.

Repository Citation

Kashyap, V., Ramakrishnan, C., Thomas, C., & Sheth, A. P. (2005). TaxaMiner: An Experimentation Framework for Automated Taxonomy Bootstrapping. International Journal of Web and Grid Services, 1(2), 240-266.
https://corescholar.libraries.wright.edu/knoesis/744

Included in

Bioinformatics Commons, Communication Technology and New Media Commons, Databases and Information Systems Commons, OS and Networks Commons, Science and Technology Studies Commons