Ottoman Turkish, or the Ottoman language, is the variety of the Turkish language that was used in the Ottoman Empire. Ottoman Turkish was highly influenced by Arabic and Persian. Arabic and Persian words in the language accounted for up to 88% of its vocabulary. As in most other Turkic and other foreign languages of Islamic communities, the Arabic borrowings were not originally the result of a direct exposure of Ottoman Turkish to Arabic, a fact that is evidenced by the typically Persian phonological mutation of the words of Arabic origin. (Source: Wikipedia)
The Ottoman digits and alphabet are placed in cltk/corpus/ottoman/alphabet.py.
The digits are placed in a dict
NUMERALS with the digit the same as the index (0-9). There is a dictionary named NUMERALS_WRITINGS for their writing also. For example, the persian digit for 5 can be accessed in this manner:
In : from cltk.corpus.ottoman.alphabet import NUMERALS, NUMERALS_WRITINGS In : NUMERALS Out: '۵' In : NUMERALS_WRITINGS Out: 'بش'
One can also have the alphabetic orders of the charachters form ALPHABETIC_ORDER dictionary. The keys are the characters and the values are their order. The corresponding dictionary can be imported:
In : from cltk.corpus.ottoman.alphabet import ALPHABETIC_ORDER, CIM In : ALPHABETIC_ORDER[CIM] Out: 6