OBC

Full Name
The Old Bailey Corpus
Composer
Magnus Huber, Magnus Nissel, Patrick Maiwald, and Bianca Widlitzki (University of Giessen )
Language
English
British English
Register
Spoken
Genre
Other
Speech
Style
Formal
Period
1900-2000 AD
1800-1900 AD
1700-1800 AD
Period (details)
1720-1913
Number of words
> 100.000.000
Number of words (details)
ca. 113 million
Annotation
POS tagging
Tokenization
Format
Download
Data collection
Semi-elicited
Multimedia
Transcription only
Availability
Free subscription
Remarks

The Proceedings of the Old Bailey, London's central criminal court, were published from 1674 to 1913 and constitute a large body of texts from the beginning of Present Day English. The 2163 volumes contain almost 200,000 trials, totalling ca. 134 million words. Since the proceedings were taken down in shorthand by scribes in the courtroom, the verbatim passages are arguably as near as we can get to the spoken word of the period. The material thus offers the rare opportunity of analyzing spoken language in a period that has been neglected both with regard to the compilation of primary linguistic data and the description of the structure, variability, and change of English.