Search
Now showing items 1-2 of 2
The SAWA corpus: a parallel corpus English - Swahili
(2009)
Research in data-driven methods for Machine Translation has greatly benefited from the increasing availability of parallel corpora. Processing the same text in two different languages yields useful information on how words ...
Proceedings of the Workshop on Language Technology for Normalisation of Less-Resourced Languages
(University of Nairobi, 2012)
This paper describes the stages involved in implementing a corpus of spoken Irish. This pilot project (consisting of approximately
140K words of transcribed data) implements part of the design of a larger corpus of spoken ...