dc.contributor.author: Pauw, Guy De
dc.contributor.author: Maajabu, Naomi
dc.contributor.author: Wagacha, Peter Waiganjo
dc.date.accessioned: 2013-06-21T13:38:03Z
dc.date.available: 2013-06-21T13:38:03Z
dc.date.issued: 2010
dc.identifier.citation: Proceedings of the Second Workshop on African Language Technology - AfLaT 2010
dc.identifier.uri: http://www.clips.ua.ac.be/sites/default/files/depauw.pdf
dc.identifier.uri: http://hdl.handle.net/11295/37631
dc.description.abstract: This paper describes the collection and exploitation of a small trilingual corpus English - Swahili - Luo (Dholuo). Taking advantage of existing morphosyntactic annotation tools for English and Swahili and the unsupervised induction of Luo morphological segmentation patterns, we are able to perform fairly accurate word alignment on factored data. This not only enables the development of a workable bidirectional statistical machine translation system English - Luo, but also allows us to part-of-speech tag the Luo data using the projection of annotation technique. The experiments described in this paper demonstrate how this knowledge-light and language-independent approach to machine translation and part-of-speech tagging can result in the fast development of language technology components for a resource-scarce language.
dc.language.iso: en
dc.title: A Knowledge-Light Approach to Luo Machine Translation and Part-of-Speech Tagging
dc.type: Article
local.publisher: School of Computing & Informatics, University of Nairobi
local.publisher: CLiPS - Computational Linguistics Group, University of Antwerp, Belgium

