Downloads
Software
- magyarlanc: a toolkit for the basic linguistic processing of Hungarian
- Named Entity Recognition tool for English and Hungarian
- HTML annotation tool (Firefox extension)
- TextAnnotator: a tool for the linguistic annotation of natural language texts
- CRF-based tool for identifying light verb constructions in English texts
Language Resources
- The Szeged Treebank
- Szeged Dependency Treebank
- Hungarian Wordnet
- The BioScope corpus
- Hungarian Named Entity corpora
- Named Entity lemmatization database
- Corpora for uncertainty detection
- Corpora of multiword expressions
- Hungarian word sense disambiguated
- The affiliation HTML corpus
- The SzegedParallel English-Hungarian corpus
- The HunOr Russian-Hungarian parallel corpus
- Hungarian forum corpus for Opinion Mining
- A dataset for opinionated keyphrase extraction
- HunLearner, a learners' corpus of Hungarian
- SzegedTrip, an English corpus of travel blogs annotated for opinions and personality traits
- HuSent, a Hungarian sentiment corpus
External references