Hungarian WordNet (HuWN)
Introduction
Wordnets are lexical databases in which words are organized into clusters based on their meanings, and they are linked to each other through different semantic and lexical relations, yielding a conceptual hierarchy (i.e. lexical ontology) of words. The Hungarian WordNet (HuWN) was developed by the Research Institute for Linguistics of the Hungarian Academy of Sciences, the Department of Informatics of the University of Szeged, and MorphoLogic Ltd. in a 3 year project. As a result, HuWN now consists of over 42.000 synsets, out of which 2.000 synsets form part of a subontology in the business domain and later, 650 synsets were added from the legal domain.
The Princeton WordNet 2.0 served as a basis for the construction of HuWN, that is, synsets belonging to the BalkaNet Concept Set were selected from PWN 2.0 and then translated into Hungarian.
Licensing
The database can be used free of charge for research and educational purposes under the licence MetaShare Commons NonCommercial-NoRedistribution (MSCommons_NoCOM-NC-NR). If you would like to use HuWN for a commercial or non-research purpose, please contact us.
Downloads
- The Hungarian WordNet in XML format
- Description of HuWN
Reference
- Miháltz, Márton; Hatvani, Csaba; Kuti, Judit; Szarvas, György; Csirik, János; Prószéky, Gábor; Váradi, Tamás 2008: Methods and Results of the Hungarian WordNet Project. In: Proceedings of the Fourth Global WordNet Conference GWC 2008, pp. 310-320.
For further information please contact Veronika Vincze (vinczev AT inf.u-szeged.hu).