Diachronic English Web Corpus

Full Name

Composer

Research and Development Unit for English Studies, Birmingham City University

URL

https://www.webcorp.org.uk/wcx/lse/corpora

Language

English

Register

Written

Genre

Internet

Style

Formal and Informal

Period

2000-2010 AD

Number of words

> 100,000,000

Number of words (details)

128,951,238

Annotation

Tokenization

Annotation remarks

This corpus consists of 128,951,238 words (tokens) of web-extracted text. It covers the period January 2000 to December 2010, with approximately 1 million words per month.
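
As a rough consistency check on the figures above, the total token count can be divided by the number of months in the stated period. The sketch below is illustrative only: the names TOTAL_TOKENS, START, END and months_inclusive are assumptions introduced here, not part of any corpus tool, and the result is an average, so individual months may differ.

```python
from datetime import date

# Figures taken from this record; the variable names are illustrative assumptions.
TOTAL_TOKENS = 128_951_238
START = date(2000, 1, 1)   # first month covered (Jan 2000)
END = date(2010, 12, 1)    # last month covered (Dec 2010)


def months_inclusive(start: date, end: date) -> int:
    """Count calendar months from start to end, including both endpoints."""
    return (end.year - start.year) * 12 + (end.month - start.month) + 1


months = months_inclusive(START, END)
average_per_month = TOTAL_TOKENS / months

print(f"months covered: {months}")
print(f"average tokens per month: {average_per_month:,.0f}")
```

Under these assumptions the period spans 132 months, giving an average of roughly 977,000 tokens per month, which agrees with the stated figure of approximately 1 million words per month.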

Format

Online

Availability

Open access