Databases, Portals and Tools
Portals and Databases
- Dictionary portal
- Terminology portal
- Lexika slovenských terénnych názvov
- Etymologická databáza slovenskej lexiky
- “Retrográdny slovník súčasnej slovenčiny” – Web Portal
- Slovak WordNet
- Frequencies and ARF (Araneum Slovacum VII Maximum) dataset
Corpora
- Slovak National Corpus
- ARANEA corpora
- Slovak Legislative Corpus
- Corpus of Court Decisions
- Error Corpus of Slovak “CHIBY”
- Corpus of the journal “Slovenská reč”
- Corpus of Rusyn Wikipedia
- HPLT web corpus
Tools
- mistral-sk-7b, generative Slovak LLM
- Lemmatization, Morphological Analysis and Disambiguation
- Lemmatization, Morphological Analysis and Disambiguation (Slovak written without diacritics)
- Word embeddings
- Paraphrase Slovak (and Czech)
- Rekonstruction of Diacritics
- Vitvorťe si Štúrovskuo meno
- Named Entity Recognition, Demo
- Machine Translation of Slovak into the L. Štúr version
- Timeline of word occurrences in the corpus
- Visualization of Collocations
- Transliteration of Slovak or Czech into Glagolitics