A treebank is a linguistic corpus in which each sentence has been annotated with its syntactic structure, typically in the form of a parse tree. These annotations represent the grammatical structure of sentences: how words group into phrases, how words and phrases relate to one another, and what syntactic role each plays. Treebanks are used in computational linguistics, natural language processing (NLP), and linguistic research. Several well-known treebanks exist, and they differ in design and purpose, for example in whether they annotate phrase structure (constituency) or dependency relations between words.
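To make the idea of a parse-tree annotation concrete, here is a minimal sketch that reads one sentence written in a bracketed notation similar to the style used by constituency treebanks. The example sentence, its labels (S, NP, VP, and so on), and the helper names `parse_bracketed` and `leaves` are illustrative assumptions, not taken from any particular treebank or library.

```python
def parse_bracketed(text):
    """Parse a bracketed tree string into nested (label, children) tuples."""
    # Put spaces around parentheses so a plain split() yields clean tokens.
    tokens = text.replace("(", " ( ").replace(")", " ) ").split()
    pos = 0

    def parse_node():
        nonlocal pos
        if tokens[pos] != "(":
            raise ValueError("expected '(' at token %d" % pos)
        pos += 1
        label = tokens[pos]  # phrase or part-of-speech label
        pos += 1
        children = []
        while tokens[pos] != ")":
            if tokens[pos] == "(":
                children.append(parse_node())  # nested phrase
            else:
                children.append(tokens[pos])   # a word (leaf)
                pos += 1
        pos += 1  # consume the closing ")"
        return (label, children)

    tree = parse_node()
    if pos != len(tokens):
        raise ValueError("unexpected tokens after the end of the tree")
    return tree


def leaves(tree):
    """Return the words of the sentence in left-to-right order."""
    _label, children = tree
    words = []
    for child in children:
        if isinstance(child, tuple):
            words.extend(leaves(child))
        else:
            words.append(child)
    return words


# Hypothetical annotated sentence, written for illustration only.
EXAMPLE = (
    "(S (NP (DT the) (NN cat)) "
    "(VP (VBD sat) (PP (IN on) (NP (DT the) (NN mat)))))"
)

tree = parse_bracketed(EXAMPLE)
print(tree[0])             # label of the root node
print(" ".join(leaves(tree)))  # the original sentence, recovered from the tree
```

The nested tuples keep both the words and the phrase labels, which is the information a constituency annotation adds on top of the raw text. A real project would more likely use an existing treebank reader than a hand-written parser like this one.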