Susanne
This action may take several minutes for large corpora, please wait.

Susanne

Susanne corpus of an approximately 130,000-word subset of the Brown Corpus of American English, annotated in accordance with the SUSANNE annotation scheme (attempting to provide a method of representing all aspects of English grammar which are sufficiently definite to be susceptible of formal annotation). Tagged with TreeTagger v2.2.

Counts
Tokens150426
Words128998
Sentences6861
Paragraphs1923
Documents149
General info
Corpus description Document
LanguageEnglish
EncodingUTF-8
Compiled04/20/2018 10:23:24
Tagset Description
Lexicon sizes
word16447
lempos11655
tag420
status4
lemma10416
lc 14721
lempos_lc 11590
lemma_lc 10330
Tags legend
adjectiveJJ.*
adverbRR.*
conjuctionCC.*
determinerAT.*
nounN.*
numeralM.*
prepositionI.*
pronounPP.*
verbV.*
Lempos suffixes
adjective-j
adverb-a
conjunction-c
noun-n
preposition-i
pronoun-d
verb-v

Structures and attributes

Subcorpora statistics

Subcorpus Tokens Words %
doc_file_starting_with_A 37307 ~ 31992 24.8008987808