Мобильная версия

Доступно журналов:

3 288

Доступно статей:

3 891 637

 

Скрыть метаданые

Автор RILOFF, ELLEN
Автор SHEPHERD, JESSICA
Дата выпуска 1999
dc.description Many applications need a lexicon that represents semantic information but acquiring lexical information is time consuming. We present a corpus-based bootstrapping algorithm that assists users in creating domain-specific semantic lexicons quickly. Our algorithm uses a representative text corpus for the domain and a small set of ‘seed words’ that belong to a semantic class of interest. The algorithm hypothesizes new words that are also likely to belong to the semantic class because they occur in the same contexts as the seed words. The best hypotheses are added to the seed word list dynamically, and the process iterates in a bootstrapping fashion. When the bootstrapping process halts, a ranked list of hypothesized category words is presented to a user for review. We used this algorithm to generate a semantic lexicon for eleven semantic classes associated with the MUC-4 terrorism domain.
Издатель Cambridge University Press
Название A corpus-based bootstrapping algorithm for Semi-Automated semantic lexicon construction This research is supported in part by the National Science Foundation under grants IRI-9509820 and IRI-9704240.
Electronic ISSN 1469-8110
Print ISSN 1351-3249
Журнал Natural Language Engineering
Том 5
Первая страница 147
Последняя страница 156
Аффилиация RILOFF ELLEN; University of Utah
Аффилиация SHEPHERD JESSICA; University of Utah
Выпуск 2

Скрыть метаданые