COMIT

Full Name
Corpus Multimodal De Informativos Televisados
Compiler
Universitat de Barcelona, Universitat Politècnica de Catalunya, Universitat d'Alacant, Euskal Herriko Unibertsitatea
Language
Spanish
Iberian Spanish
Register
Spoken
Genre
Media talk
Style
Formal
Period
2000-2100 AD
1900-2000 AD
Number of words
< 500,000
Number of words (details)
99,000 words
Annotation
Prosodic annotation
Other
Annotation remarks

COMIT includes downloadable transcripts of 9 television news programmes broadcast in Spain (by TVE 1, La 2 and Antena 3). It aims to represent the characteristic audiovisual dimension of TV news, that is, the interaction of the visual and sound modes. The transcription is orthographic and includes paralinguistic information (pauses, falling and rising intonation) as well as extralinguistic information about the images and ambient noises that accompany the transcribed speech. The corpus does not include a search engine and does not provide access to the recordings.

Format
Download
Data collection
Elicited
Multimedia
Transcription only
Availability
Free subscription