Lexical diversity is measured to account for common and rarely used words in texts, and to gauge the richness of the vocabulary of particular authors or interlocutors. The diagram shows the vocabulary richness of individual authors using the MSTTR method (mean segmental type-token ratio). The text is divided into equal segments (in this example, 100 words per segment); the type-token ratio (TTR), that is, the number of distinct words divided by the total number of words, is calculated for each segment, and the arithmetic mean of these TTR values gives the MSTTR.
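The MSTTR calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the platform's own implementation: the function names `tokenize` and `msttr` are hypothetical, the tokenizer is a simple lowercase word match, and the sketch assumes that a trailing segment shorter than the segment size is discarded, which the description above does not specify.

```python
import re


def tokenize(text):
    """Split text into lowercase word tokens (a simplifying assumption)."""
    return re.findall(r"\w+", text.lower())


def msttr(tokens, segment_size=100):
    """Mean segmental type-token ratio over complete segments.

    Assumption: tokens left over after the last complete segment are ignored.
    """
    n_segments = len(tokens) // segment_size
    if n_segments == 0:
        raise ValueError("text is shorter than one segment")

    ratios = []
    for i in range(n_segments):
        segment = tokens[i * segment_size:(i + 1) * segment_size]
        # TTR of one segment: distinct words (types) / all words (tokens)
        ratios.append(len(set(segment)) / segment_size)

    # MSTTR is the arithmetic mean of the per-segment TTR values
    return sum(ratios) / n_segments
```

For example, with a segment size of 4, the tokens `a b a b c d e f` form two segments with TTR values 0.5 and 1.0, so `msttr` returns 0.75. Because every segment has the same length, the result is less sensitive to overall text length than a single TTR computed over the whole text.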
The NoSketch platform implementation at the National Library of Latvia (nosketch.lnb.lv) provides access to thematic corpora in the fields of literature, folklore, and history. The main features of the platform are word frequency, concordance, collocation, and term extraction tools.