
Lexical diversity

Lexical diversity measures analyze how often common and uncommon words occur in a text and are used to assess the vocabulary richness of a particular author or speaker. The diagram shows the vocabulary richness of individual authors, measured with the MSTTR method (Mean Segmental Type-Token Ratio).

In this approach, the text is divided into segments of equal length (for instance, 100 words each), and the type-token ratio (TTR), that is, the number of distinct words (types) divided by the total number of words (tokens), is calculated for each segment. The MSTTR is the arithmetic mean of these per-segment TTR values, which gives a single quantitative measure of lexical variety.
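The calculation described above can be sketched in a few lines of Python. This is a minimal illustration, not the implementation used for the diagram: the function names, the lowercasing step, the regular-expression tokenizer, and the decision to drop a trailing segment shorter than the chosen length are all assumptions made for the example.

```python
import re


def type_token_ratio(tokens):
    """Distinct words (types) divided by total words (tokens)."""
    return len(set(tokens)) / len(tokens)


def msttr(text, segment_size=100):
    """Mean Segmental Type-Token Ratio of a text.

    Assumptions (not taken from the source): tokens are lowercased runs
    of word characters, and a final segment shorter than `segment_size`
    is discarded so that all segments have equal length.
    """
    tokens = re.findall(r"\w+", text.lower())

    # Cut the token list into consecutive full-length segments.
    segments = [
        tokens[start:start + segment_size]
        for start in range(0, len(tokens) - segment_size + 1, segment_size)
    ]
    if not segments:
        raise ValueError("text is shorter than one segment")

    # Arithmetic mean of the per-segment TTR values.
    return sum(type_token_ratio(seg) for seg in segments) / len(segments)
```

For example, `msttr("a b c d a a b b", segment_size=4)` yields two segments with TTR values of 1.0 and 0.5, so the result is 0.75. Because every segment has the same length, the values stay comparable across texts of different sizes, which a plain TTR over the whole text would not guarantee.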

Baklāne, A., Saulespurēns, V. Leksiskās daudzveidības kvantitatīvā analīze latviešu prozas izpētē. Aktuālas problēmas literatūras un kultūras pētniecībā = Current Issues in Research of Literature and Culture 27. Liepāja: LiePA. 2022