Show simple item record

dc.contributor.authorPauw, Guy De
dc.contributor.authorWagacha, Peter Waiganjo
dc.contributor.authorde Schryver, Gilles-Maurice
dc.date.accessioned2013-06-21T13:12:59Z
dc.date.available2013-06-21T13:12:59Z
dc.date.issued2009
dc.identifier.citationProceedings of the EACL 2009 Workshop on Language Technologies for African Languages – AfLaT 2009, pages 9–16, Athens, Greece, 31 March 2009en
dc.identifier.isbn1-932432-25-6
dc.identifier.urihttp://dl.acm.org/citation.cfm?id=1564511
dc.identifier.urihttp://hdl.handle.net/11295/37612
dc.description.abstractResearch in data-driven methods for Machine Translation has greatly benefited from the increasing availability of parallel corpora. Processing the same text in two different languages yields useful information on how words and phrases are translated from a source language into a target language. To investigate this, a parallel corpus is typically aligned by linking linguistic tokens in the source language to the corresponding units in the target language. An aligned parallel corpus therefore facilitates the automatic development of a machine translation system and can also bootstrap annotation through projection. In this paper, we describe data collection and annotation efforts and preliminary experimental results with a parallel corpus English - Swahili.en
dc.language.isoenen
dc.publisherAssociation for Computational Linguisticsen
dc.titleThe SAWA corpus: a parallel corpus English - Swahilien
dc.typePresentationen
local.publisherSchool of Computing and Informatics, University of Nairobi, Kenyaen
local.publisherAfrican Languages and Cultures, Ghent University, Belgium Xhosa Department, University of the Western Cape, South Africaen
local.publisherCNTS - Language Technology Group, University of Antwerp, Belgiumen


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record