Proceedings of the Workshop on Language Technology for Normalisation of Less-Resourced Languages
Date
2012
Author
De Pauw, G
de Schryver, G-M
Forcada, M L
Sarasola, K
Tyers, F M
Wagacha, P W
Language
en
Abstract
This paper describes the stages involved in implementing a corpus of spoken Irish. The pilot project (approximately
140K words of transcribed data) implements part of the design of a larger corpus of spoken Irish, which it is hoped will contain
approximately 2 million words when complete. It is hoped that such a corpus will provide material for linguistic research, lexicography,
the teaching of Irish, and the development of language technology for the Irish language.
Publisher
University of Nairobi