Odia is an Eastern Indo-Aryan language belonging to the Indo-Aryan language family. It is thought to be directly descended from a OdraMagadhi Prakrit similar to Ardha Magadhi, which was spoken in eastern India over 1,500 years ago, and is the primary language used in early Jain texts. Odia appears to have had relatively little influence from Persian and Arabic, compared to other major North Indian languages. Odia is an Indian language, belonging to the Indo-Aryan branch of the Indo-European language family. It is mainly spoken in the Indian states of Odisha and in parts of West Bengal, Jharkhand, Chhattisgarh and Andhra Pradesh. (Source: Wikipedia)


The Odia alphabet and digits are placed in cltk/corpus/odia/alphabet.py.

The digits are placed in a list NUMERALS with the digit the same as the list index (0-9). For example, the odia digit for 4 can be accessed in this manner:

In [1]: from cltk.corpus.odia.alphabet import NUMERALS
In [2]: NUMERALS[4]
Out[2]: '୪'

The vowels are places in a list VOWELS and can be accessed in this manner :

In [1]: from cltk.corpus.odia.alphabet import VOWELS
In [2]: VOWELS
Out[2]: ['ଅ', 'ଆ', 'ଇ', 'ଈ', 'ଉ', 'ଊ', 'ଋ', 'ୠ', 'ଌ', 'ୡ', 'ଏ', 'ଐ', 'ଓ', 'ଔ']

The rest of the alphabets are UNSTRUCTURED_CONSONANTS and STRUCTURED_CONSONANTS that can be accessed in a similar way.