Journal of Systems Integration, Vol 1, No 4 (2010)

Multi-Paradigm and Multi-Lingual Information Extraction as Support for Medical Web Labelling Authorities

Martin Labsky, Vojtech Svatek, Marek Nekvasil


Until recently, quality labelling of medical web content was a predominantly manual activity. Advances in automated text processing have, however, opened the way to computerised support for this activity. The core enabling technology is information extraction (IE). The heterogeneity of websites offering medical content nevertheless imposes particular requirements on the IE techniques to be applied. In this paper we discuss these requirements and describe a multi-paradigm approach to IE that addresses them. Experiments on multi-lingual data are reported. The research was carried out within the EU MedIEQ project.


ISSN: 1804-2724

This work is licensed under a Creative Commons Attribution-Noncommercial 3.0 Czech Republic License.