Using Semantic and Structural Properties of the Unified Medical Language System to Discover Potential Terminological Relationships

Academic Article

Abstract

  • Objective: To use the semantic and structural properties in the Unified Medical Language System (UMLS) Metathesaurus to characterize and discover potential relationships. Design: The UMLS integrates knowledge from several biomedical terminologies. This knowledge can be used to discover implicit semantic relationships between concepts. In this paper, the authors propose a problem-independent approach for discovering potential terminological relationships that employs semantic abstraction of indirect relationship paths to perform classification and analysis of network theoretical measures such as topological overlap, preferential attachment, graph partitioning, and number of indirect paths. Using different versions of the UMLS, the authors evaluate the proposed approach's ability to predict newly added relationships. Measurements: Classification accuracy, precision-recall. Results: Strong discriminative characteristics were observed with a semantic abstraction based classifier (classification accuracy of 91%), the average number of indirect paths, preferential attachment, and graph partitioning to identify potential relationships. The proposed relationship prediction algorithm resulted in 56% recall in top 10 results for new relationships added to subsequent versions of the UMLS between 2005 and 2007. Conclusions: The UMLS has sufficient knowledge to enable discovery of potential terminological relationships. © 2009 J Am Med Inform Assoc.
  • Digital Object Identifier (doi)

    Author List

  • Patel CO; Cimino JJ
  • Start Page

  • 346
  • End Page

  • 353
  • Volume

  • 16
  • Issue

  • 3