Monday, October 22, 2007

more Maltese utterology

Another aspect of my ongoing research on Maltese, something that I am very pleased about, is the freshly unveiled Maltese token corpus. It's still in its initial stages, but thanks to the brilliance of Jerid Francom and Dainon Woudstra (my talented research assistants), we now have the first (of perhaps, one of the first) token corpora of Maltese, based on a Maltese newspaper. At over 500,000 tokens, it's pretty big for a first stab, and will hopefully get bigger very soon! It was great fun sharing this corpus with my Maltese colleagues in Bremen.

0 Comments:

Post a Comment

<< Home