Accessible Data

The data from the DMII is accessible in two versions. Either of the two versions is accessible in two forms, CSV and SQL. Please note that uninfiectable words are not a part of the DMII, a list of 3,711 such words needs to be accessed separately.

Sigrún's format

The version named Sigrún’s format (Sigrúnarsnið) links inflectionals form and headword and it contains grammatical tags for each inflectional form. This version is named for Sigrún Helgadóttir, and it was initially produced to be used in the creation of the Tagged Icelandic Corpus (Mörkuð íslensk málheild; the MÍM Corpus).

Please note that some additional data needs to be accessed separately, e.g. personal and reflexive pronouns, the definite article, etc. The same applies to clitized forms of verbs, used in questions.

List of word forms

The second version is a simple list of word forms, without grammatical tags.

Access the DMII data here

 The Árni Magnússon Institute for Icelandic Studies is the rightsholder of the DMII.

For more linguistic resources and tools for Icelandic, see malfong.is.

 

KB 1.10.2013