Showing 1–4 of 4 results for author: Christiansen, M H

Searching in archive cs.
  1. arXiv:2005.03521  [pdf, other]

    cs.CL

    The Danish Gigaword Project

    Authors: Leon Strømberg-Derczynski, Manuel R. Ciosici, Rebekah Baglini, Morten H. Christiansen, Jacob Aarup Dalsgaard, Riccardo Fusaroli, Peter Juel Henrichsen, Rasmus Hvingelby, Andreas Kirkedal, Alex Speed Kjeldsen, Claus Ladefoged, Finn Årup Nielsen, Malte Lau Petersen, Jonathan Hvithamar Rystrøm, Daniel Varab

    Abstract: Danish language technology has been hindered by a lack of broad-coverage corpora at the scale modern NLP prefers. This paper describes the Danish Gigaword Corpus, the result of a focused effort to provide a diverse and freely-available one billion word corpus of Danish text. The Danish Gigaword corpus covers a wide array of time periods, domains, speakers' socio-economic status, and Danish dialect…

    Submitted 12 May, 2021; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: Identical to the NoDaLiDa 2021 version

  2. Memory limitations are hidden in grammar

    Authors: Carlos Gómez-Rodríguez, Morten H. Christiansen, Ramon Ferrer-i-Cancho

    Abstract: The ability to produce and understand an unlimited number of different sentences is a hallmark of human language. Linguists have sought to define the essence of this generative capacity using formal grammars that describe the syntactic dependencies between constituents, independent of the computational limitations of the human brain. Here, we evaluate this independence assumption by sampling sente…

    Submitted 5 April, 2022; v1 submitted 19 August, 2019; originally announced August 2019.

    Comments: Improved with reviewer feedback once again. In press in Glottometrics

    Journal ref: Glottometrics (2022) 52, 39-64

  3. arXiv:1304.6736  [pdf]

    physics.soc-ph cs.SI q-bio.NC

    Networks in Cognitive Science

    Authors: Andrea Baronchelli, Ramon Ferrer-i-Cancho, Romualdo Pastor-Satorras, Nick Chater, Morten H. Christiansen

    Abstract: Networks of interconnected nodes have long played a key role in Cognitive Science, from artificial neural networks to spreading activation models of semantic memory. Recently, however, a new Network Science has been developed, providing insights into the emergence of global, system-scale properties in contexts as diverse as the Internet, metabolic reactions, and collaborations among scientists…

    Submitted 5 July, 2013; v1 submitted 24 April, 2013; originally announced April 2013.

    Journal ref: Trends in Cognitive Sciences 17, 348-360 (2013)

  4. arXiv:1302.2937  [pdf]

    physics.soc-ph cs.MA q-bio.PE

    The Biological Origin of Linguistic Diversity

    Authors: Andrea Baronchelli, Nick Chater, Romualdo Pastor-Satorras, Morten H. Christiansen

    Abstract: In contrast with animal communication systems, diversity is characteristic of almost every aspect of human language. Languages variously employ tones, clicks, or manual signs to signal differences in meaning; some languages lack the noun-verb distinction (e.g., Straits Salish), whereas others have a proliferation of fine-grained syntactic categories (e.g., Tzeltal); and some languages do without m…

    Submitted 12 February, 2013; originally announced February 2013.

    Journal ref: PLoS ONE 7(10): e48029 (2012)