Coptic is the latest stage of the Egyptian language, a northern Afroasiatic language spoken in Egypt until at least the 17th century. Coptic flourished as a literary language from the second to thirteenth centuries, and its Bohairic dialect continues to be the liturgical language of the Coptic Orthodox Church of Alexandria. (Source: Wikipedia)


Use CorpusImporter() or browse the CLTK GitHub organization (anything beginning with coptic_) to discover available Coptic corpora.

In [1]: from cltk.corpus.utils.importer import CorpusImporter

In [2]: c = CorpusImporter('coptic')

In [3]: c.list_corpora
Out[3]: ['coptic_text_scriptorium']