Мобильная версия

Доступно журналов:

3 288

Доступно статей:

3 891 637

 

Скрыть метаданые

Автор CATALÀ, NEUS
Автор CASTELL, NÚRIA
Автор MARTÍN, MARIO
Дата выпуска 2003
dc.description The main issue when building Information Extraction (IE) systems is how to obtain the knowledge needed to identify relevant information in a document. Most approaches require expert human intervention in many steps of the acquisition process. In this paper we describe ESSENCE, a new method for acquiring IE patterns that significantly reduces the need for human intervention. The method is based on ELA, a specifically designed learning algorithm for acquiring IE patterns without tagged examples. The distinctive features of ESSENCE and ELA are that (1) they permit the automatic acquisition of IE patterns from unrestricted and untagged text representative of the domain, due to (2) their ability to identify regularities around semantically relevant concept-words for the IE task by (3) using non-domain-specific lexical knowledge tools such as WordNet, and (4) restricting the human intervention to defining the task, and validating and typifying the set of IE patterns obtained. Since ESSENCE does not require a corpus annotated with the type of information to be extracted and it uses a general purpose ontology and widely applied syntactic tools, it reduces the expert effort required to build an IE system and therefore also reduces the effort of porting the method to any domain. The results of the application of ESSENCE to the acquisition of IE patterns in an MUC-like task are shown.
Издатель Cambridge University Press
Название A portable method for acquiring information extraction patterns without annotated corpora
DOI 10.1017/S1351324902003042
Electronic ISSN 1469-8110
Print ISSN 1351-3249
Журнал Natural Language Engineering
Том 9
Первая страница 151
Последняя страница 179
Аффилиация CATALÀ NEUS; Technical University of Catalonia
Аффилиация CASTELL NÚRIA; Technical University of Catalonia
Аффилиация MARTÍN MARIO; Technical University of Catalonia
Выпуск 2

Скрыть метаданые