[期刊论文][Full-length article]


An unsupervised method for word sense disambiguation

作   者:
Nazreena Rahman;Bhogeswar Borah;

出版年:2022

页     码:6643 - 6651
出版社:Elsevier


摘   要:

Word sense disambiguation (WSD) finds the actual meaning of a word according to its context. This paper presents a novel WSD method to find the correct sense of a word present in a sentence. The proposed method uses both the WordNet lexical dictionary and the Wikipedia corpus. Initially, we find all the probable senses of the target word using WordNet. For each of the words present in a sense, we calculate the collocation extraction score with the other words in the sentence. The collocation extraction score finds the probability of the occurrence of two words together in the Wikipedia corpus. The maximum collocation extraction score assigns the proper sense for that context of the sentence. Our method is not limited to the bi-grams that are made up of only two consecutive words. Our method can find the probability of having two words together in a sentence when other words separate these two words. To compare our WSD method with current knowledge-based unsupervised and supervised systems, we use different Senseval and SemEval datasets for doing WSD on English words. Finally, the experimental analysis illustrates the significance of the proposed approach over many baseline and current systems.



关键字:

Word sense disambiguation (WSD) ; WordNet lexical dictionary ; Wikipedia corpus ; Collocation extraction score ; Knowledge-based unsupervised and supervised systems ; Senseval and SemEval datasets


全文
所属期刊
Journal of King Saud University @?C Computer and Information Sciences
ISSN: 1319-1578
来自:Elsevier