talnarchives

Une archive numérique francophone des articles de recherche en Traitement Automatique de la Langue.

Hypernym extraction from Wikipedia

Adel Ghamnia

Abstract : This work contributes to demostrate how the structure of the documents available on the web canbe exploited to enrich semantic knowledge bases. These knowledge bases play a key role in manyapplications like artificial intelligence, semantic web, information retrieval, diagnostic support, etc.In this context, we are interested here in the identification of hypernymy relations from Wikipediadisambiguation pages, which are highly structured documents. A hypernymy relation extractordedicated to this kind of page and based on lexico-syntactic patterns was designed, developed andevaluated. The results show a precision of 0.68 and a recall of 0.75 for the patterns that we havedefined, and an enrichment rate up to 33% for both BabelNet and DBPedia semantic resources.

Keywords : Hypernym extraction,Knowledge Base,Morpho-syntactic patterns