Blowing in the wind: using ‘North Wind and the Sun’ texts to sample phoneme 
inventories

Baird, Louise; Evans, Nicholas; Greenhill, Simon J.

doi:10.1017/S002510032000033X

Baird, L., Evans, N., & Greenhill, S. J. (2022). Blowing in the wind: using ‘North Wind and the Sun’ texts to sample phoneme inventories. Journal of the International Phonetic Association, 52(3): S002510032000033X, pp. 453-494. doi:10.1017/S002510032000033X.

Item is Released

Basic

Item Permalink: https://hdl.handle.net/21.11116/0000-0008-BC9D-4
Version Permalink: https://hdl.handle.net/21.11116/0000-000C-CC96-5

Genre: Journal Article

Files

shh2970.pdf (Publisher version), 4MB

File Permalink:
https://hdl.handle.net/21.11116/0000-0008-BC9F-2

Name:
shh2970.pdf

Description:
OA . - first view

OA-Status:
Not specified

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Copyright Date:
-

Copyright Info:
-

License:
http://creativecommons.org/licenses/by/4.0/

Creators

Creators:
Baird, Louise, Author
Evans, Nicholas, Author
Greenhill, Simon J.¹, Author

Affiliations:
¹ Linguistic and Cultural Evolution, Max Planck Institute for the Science of Human History, Max Planck Society, ou_2074311

Content

Free keywords: -

Abstract: Language documentation faces a persistent and pervasive problem: How much material is enough to represent a language fully? How much text would we need to sample the full phoneme inventory of a language? In the phonetic/phonemic domain, what proportion of the phoneme inventory can we expect to sample in a text of a given length? Answering these questions in a quantifiable way is tricky, but asking them is necessary. The cumulative collection of Illustrative Texts published in the Illustration series in this journal over more than four decades (mostly renditions of the ‘North Wind and the Sun’) gives us an ideal dataset for pursuing these questions. Here we investigate a tractable subset of the above questions, namely: What proportion of a language’s phoneme inventory do these texts enable us to recover, in the minimal sense of having at least one allophone of each phoneme? We find that, even with this low bar, only three languages (Modern Greek, Shipibo and the Treger dialect of Breton) attest all phonemes in these texts. Unsurprisingly, these languages sit at the low end of phoneme inventory sizes (respectively 23, 24 and 36 phonemes). We then estimate the rate at which phonemes are sampled in the Illustrative Texts and extrapolate to see how much text it might take to display a language’s full inventory. Finally, we discuss the implications of these findings for linguistics in its quest to represent the world’s phonetic diversity, and for JIPA in its design requirements for Illustrations and in particular whether supplementary panphonic texts should be included.
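The coverage question posed in the abstract can be illustrated with a minimal sketch. It assumes a transcript that has already been reduced to a sequence of phoneme labels, so that any allophone counts as an attestation of its phoneme. The function names and the toy inventory and transcript are hypothetical and are not taken from the article or its dataset.

```python
from typing import Iterable, List, Set


def coverage(inventory: Set[str], transcript: Iterable[str]) -> float:
    """Proportion of the inventory attested at least once in the transcript."""
    if not inventory:
        raise ValueError("inventory must not be empty")
    attested = inventory & set(transcript)
    return len(attested) / len(inventory)


def cumulative_coverage(inventory: Set[str], transcript: List[str]) -> List[float]:
    """Coverage after each successive segment of the transcript."""
    if not inventory:
        raise ValueError("inventory must not be empty")
    seen: Set[str] = set()
    curve: List[float] = []
    for segment in transcript:
        # Only segments that belong to the inventory count towards coverage.
        if segment in inventory:
            seen.add(segment)
        curve.append(len(seen) / len(inventory))
    return curve


# Hypothetical toy data, invented purely for illustration.
toy_inventory = {"p", "t", "k", "m", "n", "a", "i", "u"}
toy_transcript = ["t", "a", "k", "a", "n", "i", "t", "a", "m", "a"]

toy_coverage = coverage(toy_inventory, toy_transcript)
toy_curve = cumulative_coverage(toy_inventory, toy_transcript)
toy_missing = sorted(toy_inventory - set(toy_transcript))

print(f"coverage: {toy_coverage:.2f}, unattested: {toy_missing}")
```

The cumulative curve flattens as the frequent phonemes are exhausted, which is the kind of sampling rate the abstract describes extrapolating from. The article's own estimation method is not reproduced here.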

Details

Language(s): eng - English

Dates: Published Online: 2021-06-07
Date issued: 2022-12

Publication Status: Issued

Pages: 42

Publishing info: -

Table of Contents:
1 Introduction
2 Language sample
3 Data coding
3.1 Length/gemination
3.2 Diphthongs vs. vowel sequences
3.3 Tonal contrasts
3.4 Illustration of an Illustration: Shilluk
4 Overview of the JIPA Illustration text corpus
5 Transcript coverage
6 The nature of phoneme frequency distributions
7 Recovering the full phoneme inventory
8 Evaluating the methods against a larger corpus
9 Returning to the cross-linguistic data
10 Estimating the amount of audio needed
11 Effect on recovery of cross-linguistic frequency/rarity
12 Discussion
12.1 Recommendations for JIPA
12.2 How many data are needed to fully capture a language’s phoneme inventory?
13 Conclusion

Rev. Type: Peer

Identifiers: DOI: 10.1017/S002510032000033X
Other: shh2970

Degree: -

Source 1

Title: Journal of the International Phonetic Association

Source Genre: Journal

Creator(s):

Affiliations:

Publ. Info: London? : The Association

Pages: -
Volume / Issue: 52 (3)
Sequence Number: S002510032000033X
Start / End Page: 453 - 494
Identifier: ISSN: 0025-1003
CoNE: https://pure.mpg.de/cone/journals/resource/110978977281466