Improving the Computational Morphological Analysis of a Swahili Corpus for Lexicographic Purposes

  • G De Pauw
  • G-M de Schryver

Abstract

Abstract: Computational morphological analysis is an important first step in the automatic treatment of natural language and a useful lexicographic tool. This article describes a corpus-based approach to the morphological analysis of Swahili. We particularly focus our discussion on its ability to retrieve lemmas for word forms and evaluate it as a tool for corpus-based dictionary compilation. Keywords: LEXICOGRAPHY, MORPHOLOGY, CORPUS ANNOTATION, LEMMATIZATION, MACHINE LEARNING, SWAHILI (KISWAHILI)
Section
Articles

Journal Identifiers


eISSN: 2224-0039
print ISSN: 1684-4904