talnarchives

Une archive numérique francophone des articles de recherche en Traitement Automatique de la Langue.

A prototype dependency treebank for Breton

Francis M. Tyers, Vinit Ravishankar

Abstract : This paper describes the development of the first syntactically-annotated corpus of Breton. The corpus is part of the Universal Dependencies project. In the paper we describe how the corpus was prepared, some Breton-specific constructions that required special treatment, and in addition we give results for parsing Breton using a number of off-the-shelf data-driven parsers.

Keywords : breton,dependency parsing,treebank