Document Type
Conference Proceeding
Publication Date
2008
Abstract
In this paper we investigate unsupervised population of a biomedical ontology via information extraction from biomedical literature. Relationships in text seldom connect simple entities. We therefore focus on identifying compound entities rather than mentions of simple entities. We present a method based on rules over grammatical dependency structures for unsupervised segmentation of sentences into compound entities and relationships. We complement the rule-based approach with a statistical component that prunes structures with low information content, thereby reducing false positives in the prediction of compound entities, their constituents and relationships. The extraction is manually evaluated with respect to the UMLS Semantic Network by analyzing the conformance of the extracted triples with the corresponding UMLS relationship type definitions.
Repository Citation
Ramakrishnan, C., Mendes, P. N., Wang, S., & Sheth, A. P. (2008). Unsupervised Discovery of Compound Entities for Relationship Extraction. Lecture Notes in Computer Science, 5268, 146-155.
https://corescholar.libraries.wright.edu/knoesis/49
DOI
10.1007/978-3-540-87696-0_15
Included in
Bioinformatics Commons, Communication Technology and New Media Commons, Databases and Information Systems Commons, OS and Networks Commons, Science and Technology Studies Commons
Comments
The featured PDF document is the unpublished, peer-reviewed version of this article.
The featured abstract was published in the final version of this article, which appeared in Lecture Notes in Computer Science, volume 5268, pp. 146-155, and may be found at http://link.springer.com/content/pdf/10.1007%2F978-3-540-87696-0_15.pdf.
This paper was presented at the 16th International Conference on Knowledge Engineering and Knowledge Management Knowledge Patterns (EKAW), Acitrezza, Catania, Italy, September 29-October 3, 2008.