Spoken Corpus
PRESEEA-MA corpus is part of the international research project on Castilian and American Spanish (PRESEEA-Project). PRESEEA aims at collecting and digitalising a synchronic spoken corpus of Spanish including educational, age and gender variation from a large set of cities and towns. As it is composed of urban data from very different dialect areas and situations, it will present a representative image of how Spanish is spoken.
PRESEEA-MA corpus is a collection of spoken texts produced by a representative sample of Malaga town speakers from both sexes. All these texts are already available as they have been included in a book in three volumes: the first one (Vida-Castro, 2007) contains spoken texts produced by 24 primary school speakers; the second one (Ávila-Muñoz, Lasarte-Cervantes and Villena-Ponsoda, 2008) includes texts by 24 secondary school speakers, and the third volume provides spoken data from 24 university speakers (Lasarte-Cervantes, Sánchez-Sáez and Villena-Ponsoda Ávila-Muñoz, 2009).
To guarantee anonymity of speakers, all data permitting, somehow, their identification have been omitted. The full version of these materials, which includes sound files, is restricted to researchers within the international research group.
Sociological profile of the town Labelling
Obtaining the speech samples | GENERATION 1 (20-34 years) | GENERATION 2 (35-54 years) | GENERATION 3 (more than 55 years) | |||
Men | Women | Men | Women | Men | Women | |
Primary school | 1 | 1 | 1 | 1 | 1 | 1 |
Secondary school | 1 | 1 | 1 | 1 | 1 | 1 |
University | 1 | 1 | 1 | 1 | 1 | 1 |