Corpus oana-fa: the 1984 corpus, Farsi version – statistics and info

The annotatoed 1984 Persian Corpus in the MULTEXT-EAST Framework

Counts
Tokens108437
Words95682
Sentences6605
Paragraphs1266
Documents1
General info
LanguagePersian
EncodingUTF-8
Compiled11/09/2015 15:38:07
Tagset doc Description
Infolink More info
Lexicon sizes
word11322
tag428
lemma6612
rtag12
lc11322
lemma_lc6612

Structures and attributes

hide detail