Proceedings of the Workshop on Language Technology for Normalisation of Less-Resourced Languages
![Thumbnail](/bitstream/handle/11295/87785/Wagacha_Proceedings_of_the_Workshop_on....pdf.jpg?sequence=6&isAllowed=y)
Date
2012Author
De Pauw, G
de Schryver, G-M
Forcada, M L
Sarasola, K
Tyers, F M
Wagacha, P W
Language
enMetadata
Show full item recordAbstract
This paper describes the stages involved in implementing a corpus of spoken Irish. This pilot project (consisting of approximately
140K words of transcribed data) implements part of the design of a larger corpus of spoken Irish which it is hoped will contain
approximately 2 million words when complete. It hoped that such a corpus will provide material for linguistic research, lexicography,
the teaching of Irish and for development of language technology for the Irish language.
Publisher
University of Nairobi