Search | arXiv e-print repository

Learning spatio-temporal patterns with Neural Cellular Automata

Authors: Alex D. Richardson, Tibor Antal, Richard A. Blythe, Linus J. Schumacher

Abstract: Neural Cellular Automata (NCA) are a powerful combination of machine learning and mechanistic modelling. We train NCA to learn complex dynamics from time series of images and PDE trajectories. Our method is designed to identify underlying local rules that govern large scale dynamic emergent behaviours. Previous work on NCA focuses on learning rules that give stationary emergent structures. We extend NCA to capture both transient and stable structures within the same system, as well as learning rules that capture the dynamics of Turing pattern formation in nonlinear Partial Differential Equations (PDEs). We demonstrate that NCA can generalise very well beyond their PDE training data, we show how to constrain NCA to respect given symmetries, and we explore the effects of associated hyperparameters on model performance and stability. Being able to learn arbitrary dynamics gives NCA great potential as a data driven modelling framework, especially for modelling biological pattern formation.

Submitted 22 April, 2024; v1 submitted 23 October, 2023; originally announced October 2023.

Comments: For videos referenced in appendix, see: https://github.com/AlexDR1998/NCA/tree/main/Videos

arXiv:2305.15914 [pdf, other]

Reliable Detection and Quantification of Selective Forces in Language Change

Authors: Juan Guerrero Montero, Andres Karjus, Kenny Smith, Richard A. Blythe

Abstract: Language change is a cultural evolutionary process in which variants of linguistic variables change in frequency through processes analogous to mutation, selection and genetic drift. In this work, we apply a recently-introduced method to corpus data to quantify the strength of selection in specific instances of historical language change. We first demonstrate, in the context of English irregular verbs, that this method is more reliable and interpretable than similar methods that have previously been applied. We further extend this study to demonstrate that a bias towards phonological simplicity overrides that favouring grammatical simplicity when these are in conflict. Finally, with reference to Spanish spelling reforms, we show that the method can also detect points in time at which selection strengths change, a feature that is generically expected for socially-motivated language change. Together, these results indicate how hypotheses for mechanisms of language change can be tested quantitatively using historical corpus data.

Submitted 21 August, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

arXiv:2303.04691 [pdf, other]

doi 10.1093/genetics/iyad092

Self-contained Beta-with-Spikes Approximation for Inference Under a Wright-Fisher Model

Authors: Juan Guerrero Montero, Richard A. Blythe

Abstract: We construct a reliable estimation of evolutionary parameters within the Wright-Fisher model, which describes changes in allele frequencies due to selection and genetic drift, from time-series data. Such data exists for biological populations, for example via artificial evolution experiments, and for the cultural evolution of behavior, such as linguistic corpora that document historical usage of different words with similar meanings. Our method of analysis builds on a Beta-with-Spikes approximation to the distribution of allele frequencies predicted by the Wright-Fisher model. We introduce a self-contained scheme for estimating the parameters in the approximation, and demonstrate its robustness with synthetic data, especially in the strong-selection and near-extinction regimes where previous approaches fail. We further apply to allele frequency data for baker's yeast (Saccharomyces cerevisiae), finding a significant signal of selection in cases where independent evidence supports such a conclusion. We further demonstrate the possibility of detecting time-points at which evolutionary parameters change in the context of a historical spelling reform in the Spanish language.

Submitted 11 May, 2023; v1 submitted 8 March, 2023; originally announced March 2023.

arXiv:2104.10210 [pdf, other]

doi 10.1371/journal.pone.0252582

How individuals change language

Authors: Richard A Blythe, William Croft

Abstract: Languages emerge and change over time at the population level through interactions between individual speakers. It is, however, hard to directly observe how a single speaker's linguistic innovation precipitates a population-wide change in the language, and many theoretical proposals exist. We introduce a very general mathematical model that encompasses a wide variety of individual-level linguistic behaviours and provides statistical predictions for the population-level changes that result from them. This model allows us to compare the likelihood of empirically-attested changes in definite and indefinite articles in multiple languages under different assumptions on the way in which individuals learn and use language. We find that accounts of language change that appeal primarily to errors in childhood language acquisition are very weakly supported by the historical data, whereas those that allow speakers to change incrementally across the lifespan are more plausible, particularly when combined with social network effects.

Submitted 20 April, 2021; originally announced April 2021.

Comments: 50 pages, 11 figures

Journal ref: PLoS ONE 16(6): e0252582 (2021)

arXiv:2103.11024 [pdf, other]

doi 10.1111/cogs.13035

Conceptual similarity and communicative need shape colexification: an experimental study

Authors: Andres Karjus, Richard A. Blythe, Simon Kirby, Tianyu Wang, Kenny Smith

Abstract: Colexification refers to the phenomenon of multiple meanings sharing one word in a language. Cross-linguistic lexification patterns have been shown to be largely predictable, as similar concepts are often colexified. We test a recent claim that, beyond this general tendency, communicative needs play an important role in shaping colexification patterns. We approach this question by means of a series of human experiments, using an artificial language communication game paradigm. Our results across four experiments match the previous cross-linguistic findings: all other things being equal, speakers do prefer to colexify similar concepts. However, we also find evidence supporting the communicative need hypothesis: when faced with a frequent need to distinguish similar pairs of meanings, speakers adjust their colexification preferences to maintain communicative efficiency, and avoid colexifying those similar meanings which need to be distinguished in communication. This research provides further evidence to support the argument that languages are shaped by the needs and preferences of their speakers.

Submitted 1 September, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

Journal ref: Cognitive Science (2021) 45 e1303

arXiv:2006.09277 [pdf, other]

Communicative need modulates competition in language change

Authors: Andres Karjus, Richard A. Blythe, Simon Kirby, Kenny Smith

Abstract: All living languages change over time. The causes for this are many, one being the emergence and borrowing of new linguistic elements. Competition between the new elements and older ones with a similar semantic or grammatical function may lead to speakers preferring one of them, and leaving the other to go out of use. We introduce a general method for quantifying competition between linguistic elements in diachronic corpora which does not require language-specific resources other than a sufficiently large corpus. This approach is readily applicable to a wide range of languages and linguistic subsystems. Here, we apply it to lexical data in five corpora differing in language, type, genre, and time span. We find that changes in communicative need are consistently predictive of lexical competition dynamics. Near-synonymous words are more likely to directly compete if they belong to a topic of conversation whose importance to language users is constant over time, possibly leading to the extinction of one of the competing words. By contrast, in topics which are increasing in importance for language users, near-synonymous words tend not to compete directly and can coexist. This suggests that, in addition to direct competition between words, language change can be driven by competition between topics or semantic subspaces.

Submitted 16 June, 2020; originally announced June 2020.

arXiv:1811.01275 [pdf, other]

doi 10.5334/gjgl.909

Challenges in detecting evolutionary forces in language change using diachronic corpora

Authors: Andres Karjus, Richard A. Blythe, Simon Kirby, Kenny Smith

Abstract: Newberry et al. (Detecting evolutionary forces in language change, Nature 551, 2017) tackle an important but difficult problem in linguistics, the testing of selective theories of language change against a null model of drift. Having applied a test from population genetics (the Frequency Increment Test) to a number of relevant examples, they suggest stochasticity has a previously under-appreciated role in language evolution. We replicate their results and find that while the overall observation holds, results produced by this approach on individual time series can be sensitive to how the corpus is organized into temporal segments (binning). Furthermore, we use a large set of simulations in conjunction with binning to systematically explore the range of applicability of the Frequency Increment Test. We conclude that care should be exercised with interpreting results of tests like the Frequency Increment Test on individual series, given the researcher degrees of freedom available when applying the test to corpus data, and fundamental differences between genetic and linguistic data. Our findings have implications for selection testing and temporal binning in general, as well as demonstrating the usefulness of simulations for evaluating methods newly introduced to the field.

Submitted 13 November, 2019; v1 submitted 3 November, 2018; originally announced November 2018.

Journal ref: Glossa: a journal of general linguistics, 5(1) (2020), p.45

arXiv:1809.11047 [pdf, other]

Cross-situational learning of large lexicons with finite memory

Authors: James Holehouse, Richard A. Blythe

Abstract: Cross-situational word learning, wherein a learner combines information about possible meanings of a word across multiple exposures, has previously been shown to be a very powerful strategy to acquire a large lexicon in a short time. However, this success may derive from idealizations that are made when modeling the word-learning process. In particular, an earlier model assumed that a learner could perfectly recall all previous instances of a word's use and the inferences that were drawn about its meaning. In this work, we relax this assumption and determine the performance of a model cross-situational learner who forgets word-meaning associations over time. Our main finding is that it is possible for this learner to acquire a human-scale lexicon by adulthood with word-exposure and memory-decay rates that are consistent with empirical research on childhood word learning, as long as the degree of referential uncertainty is not too high or the learner employs a mutual exclusivity constraint. Our findings therefore suggest that successful word learning does not necessarily demand either highly accurate long-term tracking of word and meaning statistics or hypothesis-testing strategies.

Submitted 28 September, 2018; originally announced September 2018.

Comments: 39 pages, 16 figures

arXiv:1806.00699 [pdf, other]

doi 10.1163/22105832-01001200

Quantifying the dynamics of topical fluctuations in language

Authors: Andres Karjus, Richard A. Blythe, Simon Kirby, Kenny Smith

Abstract: The availability of large diachronic corpora has provided the impetus for a growing body of quantitative research on language evolution and meaning change. The central quantities in this research are token frequencies of linguistic elements in texts, with changes in frequency taken to reflect the popularity or selective fitness of an element. However, corpus frequencies may change for a wide variety of reasons, including purely random sampling effects, or because corpora are composed of contemporary media and fiction texts within which the underlying topics ebb and flow with cultural and socio-political trends. In this work, we introduce a simple model for controlling for topical fluctuations in corpora - the topical-cultural advection model - and demonstrate how it provides a robust baseline of variability in word frequency changes over time. We validate the model on a diachronic corpus spanning two centuries, and a carefully-controlled artificial language change scenario, and then use it to correct for topical fluctuations in historical time series. Finally, we use the model to show that the emergence of new words typically corresponds with the rise of a trending topic. This suggests that some lexical innovations occur due to growing communicative need in a subspace of the lexicon, and that the topical-cultural advection model can be used to quantify this.

Submitted 21 June, 2019; v1 submitted 2 June, 2018; originally announced June 2018.

Comments: Code to run the analyses described in this paper is now available at https://github.com/andreskarjus/topical_cultural_advection_model . A previous shorter version of this paper outlining the basic model appeared as an extended abstract in the proceedings of the Society for Computation in Linguistics (Karjus et al. 2018, Topical advection as a baseline model for corpus-based lexical dynamics)

Journal ref: Language Dynamics and Change, 10(1) 2020, 86-125

arXiv:1505.00122 [pdf, other]

doi 10.1140/epjb/e2015-60347-3

Hierarchy of Scales in Language Dynamics

Authors: Richard A. Blythe

Abstract: Methods and insights from statistical physics are finding an increasing variety of applications where one seeks to understand the emergent properties of a complex interacting system. One such area concerns the dynamics of language at a variety of levels of description, from the behaviour of individual agents learning simple artificial languages from each other, up to changes in the structure of languages shared by large groups of speakers over historical timescales. In this Colloquium, we survey a hierarchy of scales at which language and linguistic behaviour can be described, along with the main progress in understanding that has been made at each of them, much of which has come from the statistical physics community. We argue that future developments may arise by linking the different levels of the hierarchy together in a more coherent fashion, in particular where this allows more effective use of rich empirical data sets.

Submitted 30 September, 2015; v1 submitted 1 May, 2015; originally announced May 2015.

Comments: Colloquium (short review paper) solicited by European Physical Journal B. 18 pages, 3 figures. accepted v2 contains more text, figures, references and coherence

Journal ref: EPJB (2015) v88 295

arXiv:1412.2487 [pdf, other]

doi 10.1016/j.cognition.2016.02.017

Word learning under infinite uncertainty

Authors: Richard A. Blythe, Andrew D. M. Smith, Kenny Smith

Abstract: Language learners must learn the meanings of many thousands of words, despite those words occurring in complex environments in which infinitely many meanings might be inferred by the learner as a word's true meaning. This problem of infinite referential uncertainty is often attributed to Willard Van Orman Quine. We provide a mathematical formalisation of an ideal cross-situational learner attempting to learn under infinite referential uncertainty, and identify conditions under which word learning is possible. As Quine's intuitions suggest, learning under infinite uncertainty is in fact possible, provided that learners have some means of ranking candidate word meanings in terms of their plausibility; furthermore, our analysis shows that this ranking could in fact be exceedingly weak, implying that constraints which allow learners to infer the plausibility of candidate word meanings could themselves be weak. This approach lifts the burden of explanation from 'smart' word learning constraints in learners, and suggests a programme of research into weak, unreliable, probabilistic constraints on the inference of word meaning in real word learners.

Submitted 10 February, 2016; v1 submitted 8 December, 2014; originally announced December 2014.

Comments: 30 pages, 4 figures, contains considerable extra discussion and relaxation of original model assumptions. Version to appear in Cognition

Journal ref: Cognition (2016) v151 pp18-27

arXiv:1302.5526 [pdf, ps, other]

doi 10.1103/PhysRevLett.110.258701

Stochastic dynamics of lexicon learning in an uncertain and nonuniform world

Authors: Rainer Reisenauer, Kenny Smith, Richard A. Blythe

Abstract: We study the time taken by a language learner to correctly identify the meaning of all words in a lexicon under conditions where many plausible meanings can be inferred whenever a word is uttered. We show that the most basic form of cross-situational learning - whereby information from multiple episodes is combined to eliminate incorrect meanings - can perform badly when words are learned independently and meanings are drawn from a nonuniform distribution. If learners further assume that no two words share a common meaning, we find a phase transition between a maximally-efficient learning regime, where the learning time is reduced to the shortest it can possibly be, and a partially-efficient regime where incorrect candidate meanings for words persist at late times. We obtain exact results for the word-learning process through an equivalence to a statistical mechanical problem of enumerating loops in the space of word-meaning mappings.

Submitted 31 May, 2013; v1 submitted 22 February, 2013; originally announced February 2013.

Comments: 7 pages, 3 figures. Version 2 contains additional discussion and will appear in Phys. Rev. Lett

Journal ref: Phys Rev Lett (2013) 110 258701

arXiv:1108.1275 [pdf, ps, other]

doi 10.1142/S0219525911003414

Neutral evolution: A null model for language dynamics

Authors: R. A. Blythe

Abstract: We review the task of aligning simple models for language dynamics with relevant empirical data, motivated by the fact that this is rarely attempted in practice despite an abundance of abstract models. We propose that one way to meet this challenge is through the careful construction of null models. We argue in particular that rejection of a null model must have important consequences for theories about language dynamics if modelling is truly to be worthwhile. Our main claim is that the stochastic process of neutral evolution (also known as genetic drift or random copying) is a viable null model for language dynamics. We survey empirical evidence in favour and against neutral evolution as a mechanism behind historical language changes, highlighting the theoretical implications in each case.

Submitted 5 August, 2011; originally announced August 2011.

Comments: 20 pages, 2 figures. To appear in a special issue of ACS - Advances in Complex Systems on language dynamics

Journal ref: ACS - Advances in Complex Systems (2012) 15 1150015

arXiv:1104.0529 [pdf, ps, other]

doi 10.1142/S0219525911003396

Random copying in space

Authors: Richard A Blythe

Abstract: Random copying is a simple model for population dynamics in the absence of selection, and has been applied to both biological and cultural evolution. In this work, we investigate the effect that spatial structure has on the dynamics. We focus in particular on how a measure of the diversity in the population changes over time. We show that even when the vast majority of a population's history may be well-described by a spatially-unstructured model, spatial structure may nevertheless affect the expected level of diversity seen at a local scale. We demonstrate this phenomenon explicitly by examining the random copying process on small-world networks, and use our results to comment on the use of simple random-copying models in an empirical context.

Submitted 4 April, 2011; originally announced April 2011.

Comments: 26 pages, 11 figures. Based on invited talk at AHRC CECD Conference on "Cultural Evolution in Spatially Structured Populations" at UCL, September 2010. To appear in ACS - Advances in Complex Systems

Journal ref: ACS - Advances in Complex Systems (2012) 15 1150012

Showing 1–14 of 14 results for author: Blythe, R A