    • Proceedings of the Workshop on Language Technology for Normalisation of Less-Resourced Languages 

      De Pauw, G; de Schryver, G-M; Forcada, M L; Sarasola, K; Tyers, F M; Wagacha, P W (University of Nairobi, 2012)
      This paper describes the stages involved in implementing a corpus of spoken Irish. This pilot project (consisting of approximately 140K words of transcribed data) implements part of the design of a larger corpus of spoken ...
    • The SAWA corpus: a parallel corpus English - Swahili 

      De Pauw, G; Wagacha, P W; de Schryver, G-M (2009)
      Research in data-driven methods for Machine Translation has greatly benefited from the increasing availability of parallel corpora. Processing the same text in two different languages yields useful information on how words ...