Polish word sketches
Loading...
Date
2011-11
Advisor
Editor
Journal Title
Journal ISSN
Volume Title
Publisher
Fundacja Uniwersytetu im. A. Mickiewicza
Title alternative
Abstract
Word sketches are one-page automatic, corpus-based summaries of a word's grammatical and collocational behaviour. They were first used in the production of the Macmillan English Dictionary (Rundell 2002). At that point, word sketches only existed for English. Today, the Sketch Engine is available, a corpus tool which takes as input a corpus of any language and corresponding grammar patterns and which generates word sketches for the words of that language. It also automatically generates a thesaurus and 'sketch differences', which specify similarities and differences between near-synonyms. A web corpus of Polish was morpho-syntactically tagged and loaded into the Sketch Engine. We describe the Polish Sketch Grammar and show how the resulting word sketches can be used in lexicography and for other linguistic purposes. The results show that word sketches could significantly facilitate lexicographic work for Polish, as they have for other languages.
Description
Sponsor
Keywords
Polish, Lexical profiling, Sketch Engine, Word sketch, Concordancer, Web corpus, Collocation, Thesaurus, Morpho-syntactic description
Citation
Radziszewski, Adam, Kilgarriff, Adam and Lew, Robert. 2011. ‘Polish Word Sketches’ in Vetulani, Zygmunt (ed.), Human Language Technologies as a Challenge for Computer Science and Linguistics. Proceedings of the 5th Language & Technology Conference. Poznań: Fundacja Uniwersytetu im. A. Mickiewicza.
Seria
ISBN
978-83-932640-1-8