Coptic¶
Coptic is the latest stage of the Egyptian language, a northern Afroasiatic language spoken in Egypt until at least the 17th century. Coptic flourished as a literary language from the second to thirteenth centuries, and its Bohairic dialect continues to be the liturgical language of the Coptic Orthodox Church of Alexandria. (Source: Wikipedia)
Corpora¶
Use CorpusImporter()
or browse the CLTK GitHub organization (anything beginning with coptic_
) to discover available Coptic corpora.
In [1]: from cltk.corpus.utils.importer import CorpusImporter
In [2]: c = CorpusImporter('coptic')
In [3]: c.list_corpora
Out[3]: ['coptic_text_scriptorium']
Swadesh¶
The corpus module has a class for generating a Swadesh list for Coptic.
In[1]: from cltk.corpus.swadesh import Swadesh
In[2]: swadesh = Swadesh('cop')
In[3]: swadesh.words()[:10]
Out[3]: ['ⲁⲛⲟⲕ', 'ⲛⲧⲟⲕ, ⲛⲧⲟ', 'ⲛⲧⲟϥ, ⲛⲧⲟⲥ', 'ⲁⲛⲟⲛ', 'ⲛⲧⲟⲧⲛ', 'ⲛⲧⲟⲩ', '-ⲉⲓ', 'ⲡⲓ-, ϯ-, ⲛⲓ-', 'ⲡⲉⲓⲙⲁ', 'ⲙⲙⲁⲩ']