Building a Spanish MMTx by using automatic translation and biomedical ontologies

Publisher

Springer

Abstract

The use of domain ontologies is becoming increasingly popular in medical natural language processing systems. A wide variety of knowledge bases in multiple languages have been integrated into the Unified Medical Language System (UMLS) to create a huge knowledge source that can be accessed with diverse lexical tools. MetaMap (and its Java version, MMTx) is a tool for extracting medical concepts from free text, but no Spanish version currently exists. Our ongoing research centers on the application of biomedical concepts to cross-lingual text classification, which makes a Spanish MMTx necessary. We have combined automatic translation techniques with biomedical ontologies and the existing English MMTx to produce a Spanish version of MMTx. We have evaluated different approaches and applied several types of evaluation according to different concept representations for text classification. Our results show that existing translation tools such as Google Translate produce translations that are highly similar to the original texts in terms of extracted concepts.
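To make the idea of "similarity in terms of extracted concepts" concrete, the following is a minimal sketch and not the authors' evaluation code. The abstract does not name the similarity measures used, so Jaccard overlap (concepts as a set) and cosine similarity (concepts as a frequency bag) are assumptions chosen to illustrate two possible concept representations. The concept identifiers are made-up placeholders, and the extraction step itself (running MMTx on the original and on the translated text) is assumed to have happened already.

```python
from collections import Counter
from math import sqrt


def jaccard(concepts_a, concepts_b):
    """Set-based overlap: each concept counts once, frequency is ignored."""
    a, b = set(concepts_a), set(concepts_b)
    if not a and not b:
        return 1.0  # two empty concept lists are treated as identical
    return len(a & b) / len(a | b)


def cosine(concepts_a, concepts_b):
    """Frequency-based similarity: repeated concepts weigh more."""
    ca, cb = Counter(concepts_a), Counter(concepts_b)
    dot = sum(ca[k] * cb[k] for k in ca.keys() & cb.keys())
    norm = sqrt(sum(v * v for v in ca.values())) * sqrt(sum(v * v for v in cb.values()))
    return dot / norm if norm else 0.0


# Hypothetical concept identifiers (placeholders, not real UMLS entries).
# "original" stands for concepts extracted from an English source text,
# "translated" for concepts extracted from its machine-translated counterpart.
original = ["C0000001", "C0000002", "C0000003", "C0000002"]
translated = ["C0000001", "C0000002", "C0000004"]

print(f"Jaccard: {jaccard(original, translated):.3f}")
print(f"Cosine:  {cosine(original, translated):.3f}")
```

A set-based measure only asks whether the same concepts appear, while a frequency-based one also rewards agreement on how often they appear. Which of the two is more appropriate depends on the concept representation fed to the text classifier.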

Bibliographic reference

Carrero, F., Cortizo, J. C., & Gómez, J. M. (2008). Building a Spanish MMTx by using automatic translation and biomedical ontologies. In International Conference on Intelligent Data Engineering and Automated Learning: IDEAL 2008 (pp. 346-353). Berlin: Springer. DOI: 10.1086/588455
