Corpus M-Grams – statistics and info
M-Grams
Counts | |
---|---|
Tokens | 80391906 |
Words | 72623708 |
Sentences | 10893667 |
Paragraphs | 0 |
Documents | 2104787 |
General info | |
---|---|
Language | English |
Encoding | UTF-8 |
Compiled | 11/07/2015 00:22:49 |
Tagset doc | Description |
Infolink | More info |
Lexicon sizes | |
---|---|
word | 309938 |
tag | 45 |
lemma | 278253 |
lc | 309938 |
lemma_lc | 278251 |
Structures and attributes
- doc 2104787
-
year 8205
-
artist 49586
-
id 2104787
-
- s 10893667
- g 8993600