Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jayden L. Macklin-Cordes

Challenges of sampling and how phylogenetic comparative methods help: With a case study of the Pama-Nyungan laminal contrast

Jan 01, 2022

Jayden L. Macklin-Cordes, Erich R. Round

Figure 1 for Challenges of sampling and how phylogenetic comparative methods help: With a case study of the Pama-Nyungan laminal contrast

Figure 2 for Challenges of sampling and how phylogenetic comparative methods help: With a case study of the Pama-Nyungan laminal contrast

Figure 3 for Challenges of sampling and how phylogenetic comparative methods help: With a case study of the Pama-Nyungan laminal contrast

Figure 4 for Challenges of sampling and how phylogenetic comparative methods help: With a case study of the Pama-Nyungan laminal contrast

Abstract:Phylogenetic comparative methods are new in our field and are shrouded, for most linguists, in at least a little mystery. Yet the path that led to their discovery in comparative biology is so similar to the methodological history of balanced sampling, that it is only an accident of history that they were not discovered by a typologist. Here we clarify the essential logic behind phylogenetic comparative methods and their fundamental relatedness to a deep intellectual tradition focussed on sampling. Then we introduce concepts, methods and tools which will enable typologists to use these methods in everyday typological research. The key commonality of phylogenetic comparative methods and balanced sampling is that they attempt to deal with statistical non-independence due to genealogy. Whereas sampling can never achieve independence and requires most comparative data to be discarded, phylogenetic comparative methods achieve independence while retaining and using all data. We discuss the essential notions of phylogenetic signal; uncertainty about trees; typological averages and proportions that are sensitive to genealogy; comparison across language families; and the effects of areality. Extensive supplementary materials illustrate computational tools for practical analysis and we illustrate the methods discussed with a typological case study of the laminal contrast in Pama-Nyungan.

* Accepted for publication in Linguistic Typology. Supplementary data at https://doi.org/10.5281/zenodo.5602216. 96 total pages (Main text: 41 pages, 6 figures, 3 tables. Supplementary S1: 34 pages, 1 figure. Supplementary S2: 21 pages)

Via

Access Paper or Ask Questions

Re-evaluating phoneme frequencies

Jun 09, 2020

Jayden L. Macklin-Cordes, Erich R. Round

Figure 1 for Re-evaluating phoneme frequencies

Figure 2 for Re-evaluating phoneme frequencies

Figure 3 for Re-evaluating phoneme frequencies

Figure 4 for Re-evaluating phoneme frequencies

Abstract:Causal processes can give rise to distinctive distributions in the linguistic variables that they affect. Consequently, a secure understanding of a variable's distribution can hold a key to understanding the forces that have causally shaped it. A storied distribution in linguistics has been Zipf's law, a kind of power law. In the wake of a major debate in the sciences around power-law hypotheses and the unreliability of earlier methods of evaluating them, here we re-evaluate the distributions claimed to characterize phoneme frequencies. We infer the fit of power laws and three alternative distributions to 168 Australian languages, using a maximum likelihood framework. We find evidence supporting earlier results, but also qualifying and nuancing them. Most notably, phonemic inventories appear to have a Zipfian-like frequency structure among their most-frequent members (though perhaps also a lognormal structure) but a geometric (or exponential) structure among the least-frequent. We highlight implications for causal accounts.

* 24pp (2 figures, 3 tables). This article has been submitted but not yet accepted for publication. Supplementary information, data and code available at http://doi.org/10.5281/zenodo.3886212

Via

Access Paper or Ask Questions

Phylogenetic signal in phonotactics

Feb 03, 2020

Jayden L. Macklin-Cordes, Claire Bowern, Erich R. Round

Figure 1 for Phylogenetic signal in phonotactics

Figure 2 for Phylogenetic signal in phonotactics

Figure 3 for Phylogenetic signal in phonotactics

Figure 4 for Phylogenetic signal in phonotactics

Abstract:Phylogenetic methods have broad potential in linguistics beyond tree inference. Here, we show how a phylogenetic approach opens the possibility of gaining historical insights from entirely new kinds of linguistic data--in this instance, statistical phonotactics. We extract phonotactic data from 128 Pama-Nyungan vocabularies and apply tests for phylogenetic signal, quantifying the degree to which the data reflect phylogenetic history. We test three datasets: (1) binary variables recording the presence or absence of biphones (two-segment sequences) in a lexicon (2) frequencies of transitions between segments, and (3) frequencies of transitions between natural sound classes. Australian languages have been characterised as having a high degree of phonotactic homogeneity. Nevertheless, we detect phylogenetic signal in all datasets. Phylogenetic signal is higher in finer-grained frequency data than in binary data, and highest in natural-class-based data. These results demonstrate the viability of employing a new source of readily extractable data in historical and comparative linguistics.

* Main text: 26 pages, 13 figures, 1 table. Supplementary Information: 17 pages, 1 figure. Code and data available at http://doi.org/10.5281/zenodo.3610089. This article has been submitted but not yet accepted for publication in a book or journal

Via

Access Paper or Ask Questions