talnarchives

Une archive numérique francophone des articles de recherche en Traitement Automatique de la Langue.

Lexicalizing a shallow parser

Roberto Basili, Maria Teresa Pazienza, Fabio Massimo Zanzotto

Abstract : Current NL parsers are expected to run with throughput rate suitable to satisfy ”time constraints” in real applications. The aim of the present work is, on the one hand, to investigate the effects of lexical information in a shallow parsing environment, on the other hand, to study the limits of a bootstrapping architecture that, automatically learning the lexical information in an unsupervised fashion, guarantees the reliability and portability of the parser to different domains. The investigated parser is Chaos (Chunk analysis oriented system), a robust parser based on stratification and lexicalization. Large scale evaluation over a standard tree bank is discussed.