English
 
Help Privacy Policy Disclaimer
  Advanced SearchBrowse

Item

ITEM ACTIONSEXPORT

Released

Journal Article

CoVox: A dataset of contrasting vocalizations

MPS-Authors
/persons/resource/persons243246

Bruder,  Camila       
Department of Music, Max Planck Institute for Empirical Aesthetics, Max Planck Society;

/persons/resource/persons179725

Larrouy-Maestri,  Pauline       
Department of Music, Max Planck Institute for Empirical Aesthetics, Max Planck Society;

External Resource
No external resources are shared
Fulltext (restricted access)
There are currently no full texts shared for your IP range.
Fulltext (public)

mus-25-bru-01-covox.pdf
(Publisher version), 3MB

Supplementary Material (public)
There is no public supplementary material available
Citation

Bruder, C., & Larrouy-Maestri, P. (2025). CoVox: A dataset of contrasting vocalizations. Behavior Research Methods, 57: 142. doi:10.3758/s13428-025-02664-9.


Cite as: https://hdl.handle.net/21.11116/0000-0011-0C26-6
Abstract
The human voice is remarkably versatile and can vary greatly in sound depending on how it is used. An increasing number of studies have addressed the differences and similarities between the singing and the speaking voice. However, finding adequate stimuli material that is at the same time controlled and ecologically valid is challenging, and most datasets lack variability in terms of vocal styles performed by the same voice. Here, we describe a curated stimulus set of vocalizations where 22 female singers performed the same melody excerpts in three contrasting singing styles (as a lullaby, as a pop song, and as an opera aria) and spoke the text aloud in two speaking styles (as if speaking to an adult or to an infant). All productions were made with the songs’ original lyrics, in Brazilian Portuguese, and with a/lu/sound. This ecologically valid dataset of 1320 vocalizations was validated through a forced-choice lab experiment (N = 25 for each stimulus) where lay listeners could recognize the intended vocalization style with high accuracy (proportion of correct recognition superior to 69% for all styles). We also provide acoustic characterization of the stimuli, depicting clear and contrasting acoustic profiles depending on the style of vocalization. All recordings are made freely available under a Creative Commons license and can be downloaded at https://osf.io/cgexn/.