Search | arXiv e-print repository

arXiv:2402.01932 [pdf, other]

A Virtual Solar Wind Monitor at Mars with Uncertainty Quantification using Gaussian Processes

Authors: A. R. Azari, E. Abrahams, F. Sapienza, J. Halekas, J. Biersteker, D. L. Mitchell, F. Pérez, M. Marquette, M. J. Rutala, C. F. Bowers, C. M. Jackman, S. M. Curry

Abstract: Single spacecraft missions do not measure the pristine solar wind continuously because of the spacecraft's orbital trajectory. The infrequent spatiotemporal cadence of measurement fundamentally limits conclusions about solar wind-magnetosphere coupling throughout the solar system. At Mars, such single spacecraft missions result in limitations for assessing the solar wind's role in causing lower altitude observations such as auroral dynamics or atmospheric loss. In this work, we detail the development of a virtual solar wind monitor from the Mars Atmosphere and Volatile Evolution (MAVEN) mission, a single spacecraft. This virtual solar wind monitor provides a continuous estimate of the solar wind upstream from Mars with uncertainties. We specifically employ Gaussian process regression to estimate the upstream solar wind and uncertainty estimations that scale with the data sparsity of our real observations. This proxy enables continuous solar wind estimation at Mars with representative uncertainties for the majority of the time since late 2014. We conclude by discussing suggested uses of this virtual solar wind monitor for statistical studies of the Mars space environment and heliosphere.

Submitted 6 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

Comments: submitted to JGR: Machine Learning and Computation

arXiv:2308.15769 [pdf, other]

Vector Autoregression in Cryptocurrency Markets: Unraveling Complex Causal Networks

Authors: Cameron Cornell, Lewis Mitchell, Matthew Roughan

Abstract: Methodologies to infer financial networks from the price series of speculative assets vary; however, they generally involve bivariate or multivariate predictive modelling to reveal causal and correlational structures within the time series data. The required model complexity intimately relates to the underlying market efficiency, where one expects a highly developed and efficient market to display very few simple relationships in price data. This has spurred research into the applications of complex nonlinear models for developed markets. However, it remains unclear if simple models can provide meaningful and insightful descriptions of the dependency and interconnectedness of the rapidly developed cryptocurrency market. Here we show that multivariate linear models can create informative cryptocurrency networks that reflect economic intuition, and demonstrate the importance of high-influence nodes. The resulting network confirms that node degree, a measure of influence, is significantly correlated to the market capitalisation of each coin ($ρ=0.193$). However, there remains a proportion of nodes whose influence extends beyond what their market capitalisation would imply. We demonstrate that simple linear model structure reveals an inherent complexity associated with the interconnected nature of the data, supporting the use of multivariate modelling to prevent surrogate effects and achieve accurate causal representation.
In a reductive experiment we show that most of the network structure is contained within a small portion of the network, consistent with the Pareto principle, whereby a fraction of the inputs generates a large proportion of the effects. Our results demonstrate that simple multivariate models provide nontrivial information about cryptocurrency market dynamics, and that these dynamics largely depend upon a few key high-influence coins.

Submitted 30 August, 2023; originally announced August 2023.

arXiv:2308.02858 [pdf, other]

doi 10.1016/j.nima.2023.168622

Radiation Damage of $2 \times 2 \times 1 \ \mathrm{cm}^3$ Pixelated CdZnTe Due to High-Energy Protons

Authors: Daniel Shy, David Goodman, Ryan Parsons, Michael Streicher, Willy Kaye, Lee Mitchell, Zhong He, Bernard Phlips

Abstract: Pixelated CdZnTe detectors are a promising imaging-spectrometer for gamma-ray astrophysics due to their combination of relatively high energy resolution with room temperature operation negating the need for cryogenic cooling. This reduces the size, weight, and power requirements for telescope-based radiation detectors. Nevertheless, operating CdZnTe in orbit will expose it to the harsh radiation environment of space. This work, therefore, studies the effects of $61 \ \mathrm{MeV}$ protons on $2 \times 2 \times 1 \ \mathrm{cm}^3$ pixelated CdZnTe and quantifies proton-induced radiation damage of fluences up to $2.6 \times 10^8 \ \mathrm{p/cm^2}$. In addition, we studied the effects of irradiation on two separate instruments: one was biased and operational during irradiation while the other remained unbiased. Following final irradiation, the $662 \ \mathrm{keV}$ centroid and nominal $1\%$ resolution of the detectors were degraded to $642.7 \ \mathrm{keV}, 4.9 \% \ ( \mathrm{FWHM})$ and $653.8 \ \mathrm{keV}, 1.75 \% \ (\mathrm{FWHM})$ for the biased and unbiased systems respectively. We therefore observe a possible bias dependency on proton-induced radiation damage in CdZnTe. This work also reports on the resulting activation and recovery of the instrument following room temperature and $60^{\circ}\mathrm{C}$ annealing.

Submitted 5 August, 2023; originally announced August 2023.

arXiv:2308.02720 [pdf, other]

doi 10.1029/2023JA031546

Magnetic Field Draping in Induced Magnetospheres: Evidence from the MAVEN Mission to Mars

Authors: A. R. Azari, E. Abrahams, F. Sapienza, D. L. Mitchell, J. Biersteker, S. Xu, C. Bowers, F. Pérez, G. A. DiBraccio, Y. Dong, S. Curry

Abstract: The Mars Atmosphere and Volatile EvolutioN (MAVEN) mission has been orbiting Mars since 2014 and now has over 10,000 orbits which we use to characterize Mars' dynamic space environment. Through global field line tracing with MAVEN magnetic field data we find an altitude dependent draping morphology that differs from expectations of induced magnetospheres in the vertical ($\hat Z$ Mars Sun-state, MSO) direction. We quantify this difference from the classical picture of induced magnetospheres with a Bayesian multiple linear regression model to predict the draped field as a function of the upstream interplanetary magnetic field (IMF), remanent crustal fields, and a previously underestimated induced effect. From our model we conclude that unexpected twists in high altitude dayside draping ($>$800 km) are a result of the IMF component in the $\pm \hat X$ MSO direction. We propose that this is a natural outcome of current theories of induced magnetospheres but has been underestimated due to approximations of the IMF as solely $\pm \hat Y$ directed. We additionally estimate that distortions in low altitude ($<$800 km) dayside draping along $\hat Z$ are directly related to remanent crustal fields. We show dayside draping traces down tail and previously reported inner magnetotail twists are likely caused by the crustal field of Mars, while the outer tail morphology is governed by an induced response to the IMF direction.
We conclude with an updated understanding of induced magnetospheres which details dayside draping for multiple directions of the incoming IMF and discuss the repercussions of this draping for magnetotail morphology.

Submitted 20 October, 2023; v1 submitted 4 August, 2023; originally announced August 2023.

Comments: Accepted in Journal of Geophysical Research: Space Physics

arXiv:2307.05710 [pdf, other]

A Vacuum-Compatible Cylindrical Inertial Rotation Sensor with Picoradian Sensitivity

Authors: M. P. Ross, J. van Dongen, Y. Huang, P. Zhou, Y. Chowdhury, S. K. Apple, C. M. Mow-Lowry, A. L. Mitchell, N. A. Holland, B. Lantz, E. Bonilla, A. Engl, A. Pele, D. Griffith, E. Sanchez, E. A. Shaw, C. Gettings, J. H. Gundlach

Abstract: We describe an inertial rotation sensor with a 30-cm cylindrical proof-mass suspended from a pair of 14-$μ$m thick BeCu flexures. The angle between the proof-mass and support structure is measured with a pair of homodyne interferometers which achieve a noise level of $\sim 5\ \text{prad}/\sqrt{\text{Hz}}$. The sensor is entirely made of vacuum compatible materials and the center of mass can be adjusted remotely.

Submitted 14 September, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

arXiv:2306.03219 [pdf, other]

A simple model for the emergence of relaxation-oscillator convection

Authors: Francisco E. Spaulding-Astudillo, Jonathan L. Mitchell

Abstract: Earth's tropics are characterized by quasi-steady precipitation with small oscillations about a mean value, which has led to the hypothesis that moist convection is in a state of quasi-equilibrium (QE). In contrast, very warm simulations of Earth's tropical convection are characterized by relaxation-oscillator-like (RO) precipitation, with short-lived convective storms and torrential rainfall forming and dissipating at regular intervals with little to no precipitation in between. We develop a model of moist convection by combining a zero-buoyancy model of bulk-plume convection with a QE heat engine model, and we use it to show that QE is violated at high surface temperatures. We hypothesize that the RO state emerges when the equilibrium condition of the convective heat engine is violated, i.e., when the net cooling times a thermodynamic efficiency exceeds the work that can be performed. We test our hypothesis against one- and three-dimensional numerical simulations and find that it accurately predicts the onset of RO convection. The proposed mechanism for RO emergence from QE breakdown is agnostic of the condensing component, and can be applied to any planetary atmosphere undergoing moist convection. To date, RO states have only been demonstrated in three-dimensional convection-resolving simulations, which has made it seem that the physics of the RO state requires simulations that can explicitly resolve the three-dimensional interaction of cloudy plumes and their environment.
We demonstrate that RO states also exist in single-column simulations of radiative-convective equilibrium with parameterized convection, albeit in a different surface temperature range and with much longer storm-free intervals.

Submitted 26 April, 2024; v1 submitted 5 June, 2023; originally announced June 2023.

Comments: 30 pages, 9 figures, and 1 table

arXiv:2301.11452 [pdf]

doi 10.1029/2022GL101734

Transient Foreshock Structures Upstream of Mars: Implications of the Small Martian Bow Shock

Authors: H. Madanian, N. Omidi, D. G. Sibeck, L. Andersson, R. Ramstad, S. Xu, J. R. Gruesbeck, S. J. Schwartz, R. A. Frahm, D. A. Brain, P. Kajdic, F. G. Eparvier, D. L. Mitchell, S. M. Curry

Abstract: We characterize the nature of magnetic structures in the foreshock region of Mars associated with discontinuities in the solar wind. The structures form at the upstream edge of moving foreshocks caused by slow rotations in the interplanetary magnetic field (IMF). The solar wind plasma density and the IMF strength noticeably decrease inside the structures' core, and a compressional shock layer is present at their sunward side, making them consistent with foreshock bubbles (FBs). Ion populations responsible for these structures include backstreaming ions that only appear within the moving foreshock, and accelerated reflected ions from the quasi-perpendicular bow shock. Both ion populations accumulate near the upstream edge of the moving foreshock which facilitates FB formation. Reflected ions with hybrid trajectories that straddle between the quasi-perpendicular and quasi-parallel bow shocks during slow IMF rotations contribute to formation of foreshock transients.

Submitted 26 January, 2023; originally announced January 2023.

Comments: Submitted to Geophysical Research Letters

Journal ref: Geophysical Research Letters, Volume 50, Issue 8, 28 April 2023, e2022GL101734

arXiv:2211.05350 [pdf, other]

The entropy rate of Linear Additive Markov Processes

Authors: Bridget Smart, Matthew Roughan, Lewis Mitchell

Abstract: This work derives a theoretical value for the entropy of a Linear Additive Markov Process (LAMP), an expressive model able to generate sequences with a given autocorrelation structure. While a first-order Markov Chain model generates new values by conditioning on the current state, the LAMP model takes the transition state from the sequence's history according to some distribution which does not have to be bounded. The LAMP model captures complex relationships and long-range dependencies in data with similar expressibility to a higher-order Markov process. While a higher-order Markov process has a polynomial parameter space, a LAMP model is characterised only by a probability distribution and the transition matrix of an underlying first-order Markov Chain. We prove that the theoretical entropy rate of a LAMP is equivalent to the theoretical entropy rate of the underlying first-order Markov Chain. This surprising result is explained by the randomness introduced by the random process which selects the LAMP transitioning state, and provides a tool to model complex dependencies in data while retaining useful theoretical results. We use the LAMP model to estimate the entropy rate of the LastFM, BrightKite, Wikispeedia and Reuters-21578 datasets. We compare estimates calculated using frequency probability estimates, a first-order Markov model and the LAMP model, and consider two approaches to ensuring the transition matrix is irreducible.
In most cases the LAMP entropy rates are lower than those of the alternatives, suggesting that the LAMP model is better at accommodating structural dependencies in the processes.

Submitted 9 January, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

Comments: 9 pages, code available on Github

arXiv:2208.07038 [pdf, other]

doi 10.1007/978-3-031-19097-1_3

#IStandWithPutin versus #IStandWithUkraine: The interaction of bots and humans in discussion of the Russia/Ukraine war

Authors: Bridget Smart, Joshua Watt, Sara Benedetti, Lewis Mitchell, Matthew Roughan

Abstract: The 2022 Russian invasion of Ukraine emphasises the role social media plays in modern-day warfare, with conflict occurring in both the physical and information environments. There is a large body of work on identifying malicious cyber-activity, but less focusing on the effect this activity has on the overall conversation, especially with regards to the Russia/Ukraine Conflict. Here, we employ a variety of techniques including information theoretic measures, sentiment and linguistic analysis, and time series techniques to understand how bot activity influences wider online discourse. By aggregating account groups we find significant information flows from bot-like accounts to non-bot accounts with behaviour differing between sides. Pro-Russian non-bot accounts are most influential overall, with information flows to a variety of other account groups. No significant outward flows exist from pro-Ukrainian non-bot accounts, with significant flows from pro-Ukrainian bot accounts into pro-Ukrainian non-bot accounts. We find that bot activity drives an increase in conversations surrounding angst (with p = 2.450 x 1e-4) as well as those surrounding work/governance (with p = 3.803 x 1e-18). Bot activity also shows a significant relationship with non-bot sentiment (with p = 3.76 x 1e-4), where we find the relationship holds in both directions. This work extends and combines existing techniques to quantify how bots are influencing people in the online conversation around the Russia/Ukraine invasion.
It opens up avenues for researchers to understand quantitatively how these malicious campaigns operate, and what makes them impactful.

Submitted 19 August, 2022; v1 submitted 15 August, 2022; originally announced August 2022.

Comments: 12 pages, 7 figures, to be published in SocInfo 2022. Dataset available at https://figshare.com/articles/dataset/Tweet_IDs_Botometer_results/20486910

arXiv:2205.06029 [pdf]

doi 10.1016/j.osnem.2022.100231

Information flow estimation: a study of news on Twitter

Authors: Tobin South, Bridget Smart, Matthew Roughan, Lewis Mitchell

Abstract: News media has long been an ecosystem of creation, reproduction, and critique, where news outlets report on current events and add commentary to ongoing stories. Understanding the dynamics of news information creation and dispersion is important to accurately ascribe credit to influential work and understand how societal narratives develop. These dynamics can be modelled through a combination of information-theoretic natural language processing and networks; and can be parameterised using large quantities of textual data. However, it is challenging to see "the wood for the trees", i.e., to detect small but important flows of information in a sea of noise. Here we develop new comparative techniques to estimate temporal information flow between pairs of text producers. Using both simulated and real text data we compare the reliability and sensitivity of methods for estimating textual information flow, showing that a metric that normalises by local neighbourhood structure provides a robust estimate of information flow in large networks. We apply this metric to a large corpus of news organisations on Twitter and demonstrate its usefulness in identifying influence within an information ecosystem, finding that average information contribution to the network is not correlated with the number of followers or the number of tweets. This suggests that small local organisations and right-wing organisations which have lower average follower counts still contribute significant information to the ecosystem.
Further, the methods are applied to smaller full-text datasets of specific news events across news sites and Russian troll accounts on Twitter. The information flow estimation reveals and quantifies features of how these events develop and the role of groups of trolls in setting disinformation narratives.

Submitted 28 September, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

Journal ref: Online Social Networks and Media, Volume 31, September 2022, 100231

arXiv:2112.07731 [pdf, other]

Toward a unified theory for the Hadley cell descending and ascending edges

Authors: Spencer A. Hill, Simona Bordoni, Jonathan L. Mitchell

Abstract: We present theories for the latitudinal extents of both Hadley cells throughout the annual cycle by combining our recent scaling for the ascending edge latitude (Hill et al. 2021) with the uniform Rossby number (Ro), baroclinic instability-based theory for the poleward, descending edge latitudes of Kang and Lu 2012. The resulting analytic expressions for all three Hadley cell edges are predictive except for diagnosed values of Ro and two proportionality constants. The theory captures the climatological annual cycle of the ascending and descending edges in an Earth-like simulation in an idealized aquaplanet general circulation model (GCM), provided the descending edge prediction is lagged by one month. In simulations in this and two other idealized GCMs with varied planetary rotation rate ($Ω$), the winter, descending edge of the solsticial, cross-equatorial Hadley cell scales approximately as $Ω^{-1/2}$ and the summer, ascending edge as $Ω^{-2/3}$, both in accordance with our theory.

Submitted 14 December, 2021; originally announced December 2021.

Comments: 11 pages, 5 figures, 1 table, submitted to Journal of the Atmospheric Sciences

arXiv:2112.04627 [pdf]

doi 10.1016/j.icarus.2021.114835

Hypotheses for Triton's Plumes: New Analyses and Future Remote Sensing Tests

Authors: Jason D. Hofgartner, Samuel P. D. Birch, Julie Castillo, Will M. Grundy, Candice J. Hansen, Alexander G. Hayes, Carly J. A. Howett, Terry A. Hurford, Emily S. Martin, Karl L. Mitchell, Tom A. Nordheim, Michael J. Poston, Louise M. Prockter, Lynnae C. Quick, Paul Schenk, Rebecca N. Schindhelm, Orkan M. Umurhan

Abstract: At least two active plumes were observed on Neptune's moon Triton during the Voyager 2 flyby in 1989. Models for Triton's plumes have previously been grouped into five hypotheses, two of which are primarily atmospheric phenomena and are generally considered unlikely, and three of which include eruptive processes and are plausible. These hypotheses are compared, including new arguments, such as comparisons based on current understanding of Mars, Enceladus, and Pluto. An eruption model based on a solar-powered, solid-state greenhouse effect was previously considered the leading hypothesis for Triton's plumes, in part due to the proximity of the plumes to the subsolar latitude during the Voyager 2 flyby and the distribution of Triton's fans that are putatively deposits from former plumes. The other two eruption hypotheses are powered by internal heat, not solar insolation. Based on new analyses of the ostensible relation between the latitude of the subsolar point on Triton and the geographic locations of the plumes and fans, we argue that neither the locations of the plumes nor fans are strong evidence in favor of the solar-powered hypothesis. We conclude that all three eruption hypotheses should be considered further. Five tests are presented that could be implemented with remote sensing observations from future spacecraft to confidently distinguish among the eruption hypotheses for Triton's plumes.
The five tests are based on the: (1) composition and thickness of Triton's southern hemisphere terrains, (2) composition of fan deposits, (3) distribution of active plumes, (4) distribution of fans, and (5) surface temperature at the locations of plumes and/or fans. The tests are independent, but complementary, and implementable with a single flyby mission such as the Trident mission concept. We note that, in the case of the solar-driven hypothesis, the 2030s and 2040s may be the last ...

Submitted 8 December, 2021; originally announced December 2021.

Comments: Accepted for publication in Icarus

arXiv:2012.08029 [pdf, ps, other]

doi 10.1029/2020JA028984

Observations of Energized Electrons in the Martian Magnetosheath

Authors: K. Horaites, L. Andersson, S. J. Schwartz, S. Xu, D. L. Mitchell, C. Mazelle, J. Halekas, J. Gruesbeck

Abstract: This observational study demonstrates that the magnitude and location of energization of electrons in the Martian magnetosheath is more complex than previous studies suggest. Electrons in Mars's magnetosheath originate in the solar wind and are accelerated by an electric field when they cross the bow shock. Assuming that this acceleration is localized solely to the shock, the field-aligned electron distributions in the sheath are expected to be highly asymmetric. However, such an asymmetry is not observed in this study. Based on the analysis here, it is suggested that an additional parallel acceleration takes place downstream of the Martian bow shock. This additional acceleration suppresses the expected asymmetry of the electron distribution. Consequently, along a flux tube in the magnetosheath that is tied on both ends to the bow shock the difference in energization between parallel and anti-parallel electrons is less than about 20 eV. Where this energization difference is expected to be maximal, we find the energization difference to be at most 25% of the predicted value.

Submitted 14 December, 2020; originally announced December 2020.

Comments: 30 pages, 7 figures

arXiv:2011.05966 [pdf, other]

doi 10.1175/JAS-D-20-0341.1

Solsticial Hadley Cell ascending edge theory from supercriticality

Authors: Spencer A. Hill, Simona Bordoni, Jonathan L. Mitchell

Abstract: How far the Hadley circulation's ascending branch extends into the summer hemisphere is a fundamental but incompletely understood characteristic of Earth's climate. Here, we present a predictive, analytical theory for this ascending edge latitude based on the extent of supercritical forcing. Supercriticality sets the minimum extent of a large-scale circulation based on the angular momentum and absolute vorticity distributions of the hypothetical state were the circulation absent. We explicitly simulate this latitude-by-latitude radiative-convective equilibrium (RCE) state. Its depth-averaged temperature profile is suitably captured by a simple analytical approximation that increases linearly with $\sin\varphi$, where $\varphi$ is latitude, from the winter to the summer pole. This, in turn, yields a one-third power-law scaling of the supercritical forcing extent with the thermal Rossby number. In moist and dry idealized GCM simulations under solsticial forcing performed with a wide range of planetary rotation rates, the ascending edge latitudes largely behave according to this scaling.

Submitted 2 March, 2021; v1 submitted 11 November, 2020; originally announced November 2020.

Comments: 15 pages, 7 figures, revised for Journal of the Atmospheric Sciences

Journal ref: Journal of the Atmospheric Sciences 2021

arXiv:2008.06193 [pdf, other]

Risk mapping for COVID-19 outbreaks in Australia using mobility data

Authors: Cameron Zachreson, Lewis Mitchell, Michael J. Lydeamore, Nicolas Rebuli, Martin Tomko, Nicholas Geard

Abstract: COVID-19 is highly transmissible and containing outbreaks requires a rapid and effective response. Because infection may be spread by people who are pre-symptomatic or asymptomatic, substantial undetected transmission is likely to occur before clinical cases are diagnosed. Thus, when outbreaks occur there is a need to anticipate which populations and locations are at heightened risk of exposure. In this work, we evaluate the utility of aggregate human mobility data for estimating the geographic distribution of transmission risk. We present a simple procedure for producing spatial transmission risk assessments from near-real-time population mobility data. We validate our estimates against three well-documented COVID-19 outbreak scenarios in Australia. Two of these were well-defined transmission clusters and one was a community transmission scenario. Our results indicate that mobility data can be a good predictor of geographic patterns of exposure risk from transmission centres, particularly in scenarios involving workplaces or other environments associated with habitual travel patterns. For community transmission scenarios, our results demonstrate that mobility data adds the most value to risk predictions when case counts are low and spatially clustered. Our method could assist health systems in the allocation of testing resources, and potentially guide the implementation of geographically-targeted restrictions on movement and social interaction.

Submitted 4 December, 2020; v1 submitted 14 August, 2020; originally announced August 2020.

arXiv:2008.02250 [pdf, other]

doi 10.1140/epjds/s13688-021-00260-3

Generalized Word Shift Graphs: A Method for Visualizing and Explaining Pairwise Comparisons Between Texts

Authors: Ryan J. Gallagher, Morgan R. Frank, Lewis Mitchell, Aaron J. Schwartz, Andrew J. Reagan, Christopher M. Danforth, Peter Sheridan Dodds

Abstract: A common task in computational text analyses is to quantify how two corpora differ according to a measurement like word frequency, sentiment, or information content. However, collapsing the texts' rich stories into a single number is often conceptually perilous, and it is difficult to confidently interpret interesting or unexpected textual patterns without looming concerns about data artifacts or measurement validity. To better capture fine-grained differences between texts, we introduce generalized word shift graphs, visualizations which yield a meaningful and interpretable summary of how individual words contribute to the variation between two texts for any measure that can be formulated as a weighted average. We show that this framework naturally encompasses many of the most commonly used approaches for comparing texts, including relative frequencies, dictionary scores, and entropy-based measures like the Kullback-Leibler and Jensen-Shannon divergences. Through several case studies, we demonstrate how generalized word shift graphs can be flexibly applied across domains for diagnostic investigation, hypothesis generation, and substantive interpretation. By providing a detailed lens into textual shifts between corpora, generalized word shift graphs help computational social scientists, digital humanists, and other text analysis practitioners fashion more robust scientific narratives.

Submitted 5 August, 2020; originally announced August 2020.

Comments: 20 pages, 7 figures, 2 tables

Journal ref: EPJ Data Science, 10(4), 2021

arXiv:2003.08213 [pdf]

doi 10.1016/j.nima.2020.164798

Radiation damage assessment of SensL SiPMs

Authors: Lee J. Mitchell, Bernard Phlips, W. Neil Johnson, Mary Johnson-Rambert, Anika N. Kansky, Richard Woolf

Abstract: Silicon Photomultipliers (SiPMs) are quickly replacing traditional photomultiplier tubes (PMTs) as the readout of choice for gamma-ray scintillation detectors in space. While they offer substantial size, weight and power savings, they have been shown to be susceptible to radiation damage. SensL SiPMs with different cell sizes were irradiated with 64 MeV protons and 8 MeV electrons. In general, results show larger cell sizes are more susceptible to radiation damage, with the largest 50 µm SiPMs showing the greatest increase in current as a function of dose. Current increases were observed for doses as low as ~2 rad(Si) for protons and ~20 rad(Si) for electrons. The U.S. Naval Research Laboratory's (NRL) Strontium Iodide Radiation Instrument (SIRI-1) experienced a 528 µA increase in the bias current of the on-board 2x2 SensL J-series 60035 SiPM over its one-year mission in sun-synchronous orbit. The work here focuses on the increase in bulk current observed with increasing radiation damage and was performed to better quantify this effect as a function of dose for future missions. These include the future NRL mission SIRI-2, the follow-on to SIRI-1, Glowbug and the GAGG Radiation Instrument (GARI).

Submitted 17 March, 2020; originally announced March 2020.

Comments: 36 pages, 49 Figures, To be presented at the Nuclear and Space Radiation Effects Conference (NSREC 2020)

arXiv:2002.05035 [pdf, other]

doi 10.3390/e22030265

Complex contagion features without social reinforcement in a model of social information flow

Authors: Tyson Pond, Saranzaya Magsarjav, Tobin South, Lewis Mitchell, James P. Bagrow

Abstract: Contagion models are a primary lens through which we understand the spread of information over social networks. However, simple contagion models cannot reproduce the complex features observed in real-world data, leading to research on more complicated complex contagion models. A noted feature of complex contagion is social reinforcement, whereby individuals require multiple exposures to information before they begin to spread it themselves. Here we show that the quoter model, a model of the social flow of written information over a network, displays features of complex contagion, including the weakness of long ties and that increased density inhibits rather than promotes information flow. Interestingly, the quoter model exhibits these features despite having no explicit social reinforcement mechanism, unlike complex contagion models. Our results highlight the need to complement contagion models with an information-theoretic view of information spreading to better understand how network properties affect information flow and what are the most necessary ingredients when modeling social behavior.

Submitted 26 February, 2020; v1 submitted 12 February, 2020; originally announced February 2020.

Comments: 18 pages, 9 figures, 1 table

Journal ref: Entropy 2020, 22(3), 265

arXiv:1911.05860 [pdf, other]

Constraints from invariant subtropical vertical velocities on the scalings of Hadley cell strength and downdraft width with rotation rate

Authors: Jonathan L. Mitchell, Spencer A. Hill

Abstract: Weak-temperature-gradient influences from the tropics and quasigeostrophic influences from the extratropics plausibly constrain the subtropical-mean static stability in terrestrial atmospheres. Because mean descent acting on this static stability is a leading-order term in the thermodynamic balance, a state-invariant static stability would impose constraints on the Hadley cells, which this paper explores in simulations of varying planetary rotation rate. If downdraft-averaged effective heating (the sum of diabatic heating and eddy heat flux convergence) too is invariant, so must be vertical velocity -- an "omega governor." In that case, the Hadley circulation overturning strength and downdraft width must scale identically -- the cell can strengthen only by widening or weaken only by narrowing. Simulations in two idealized, dry GCMs with a wide range of planetary rotation rates exhibit nearly unchanging downdraft-averaged static stability, effective heating, and vertical velocity, as well as nearly identical scalings of the Hadley cell downdraft width and strength. In one, eddy stresses set this scaling directly (the Rossby number remains small); in the other, eddy stress and bulk Rossby number changes compensate to yield the same, ${\sim}Ω^{-1/3}$ scaling. The consistency of this power law for cell width and strength variations may indicate a common driver, and we speculate that Ekman pumping could be the mechanism responsible for this behavior. Extending to moist atmospheres, in an idealized aquaplanet GCM the subtropical static stability is also insensitive to rotation rate but the effective heating and vertical velocity are not.

Submitted 1 December, 2020; v1 submitted 13 November, 2019; originally announced November 2019.

Comments: 20 pages (6200 words) main text, 12 figures

arXiv:1908.03318 [pdf, other]

Bayesian inference of network structure from information cascades

Authors: Caitlin Gray, Lewis Mitchell, Matthew Roughan

Abstract: Contagion processes are strongly linked to the network structures on which they propagate, and learning these structures is essential for understanding and intervention on complex network processes such as epidemics and (mis)information propagation. However, using contagion data to infer network structure is a challenging inverse problem. In particular, it is imperative to have appropriate measures of uncertainty in network structure estimates; however, these are largely ignored in most machine-learning approaches. We present a probabilistic framework that uses samples from the distribution of networks that are compatible with the dynamics observed to produce network and uncertainty estimates. We demonstrate the method using the well known independent cascade model to sample from the distribution of networks P(G) conditioned on the observation of a set of infections C. We evaluate the accuracy of the method by using the marginal probabilities of each edge in the distribution, and show the benefits of quantifying uncertainty to improve estimates and understanding, particularly with small amounts of data.

Submitted 9 August, 2019; originally announced August 2019.

arXiv:1906.08403 [pdf, other]

How the Avengers assemble: Ecological modelling of effective cast sizes for movies

Authors: Matthew Roughan, Lewis Mitchell, Tobin South

Abstract: The number of characters in a movie is an interesting feature. However, it is non-trivial to measure directly. Naive metrics such as the number of credited characters vary wildly. Here, we show that a metric based on the notion of "ecological diversity" as expressed through a Shannon-entropy based metric can characterise the number of characters in a movie, and is useful in taxonomic classification. We also show how the metric can be generalised using Jensen-Shannon divergence to provide a measure of the similarity of characters appearing in different movies, for instance of use in recommender systems, e.g., Netflix. We apply our measures to the Marvel Cinematic Universe (MCU), and show what they teach us about this highly successful franchise of movies. In particular, these measures provide a useful predictor of "success" for films in the MCU, as well as a natural means to understand the relationships between the stories in the overall film arc.

Submitted 19 June, 2019; originally announced June 2019.

arXiv:1904.01153 [pdf, other]

Semi-supervised graph labelling reveals increasing partisanship in the United States Congress

Authors: Max Glonek, Jonathan Tuke, Lewis Mitchell, Nigel Bean

Abstract: Graph labelling is a key activity of network science, with broad practical applications, and close relations to other network science tasks, such as community detection and clustering. While a large body of work exists on both unsupervised and supervised labelling algorithms, the class of random walk-based supervised algorithms requires further exploration, particularly given their relevance to social and political networks. This work refines and expands upon a new semi-supervised graph labelling method, the GLaSS method, that exactly calculates absorption probabilities for random walks on connected graphs. The method models graphs exactly as discrete-time Markov chains, treating labelled nodes as absorbing states. The method is applied to roll call voting data for 42 meetings of the United States House of Representatives and Senate, from 1935 to 2019. Analysis of the 84 resultant political networks demonstrates strong and consistent performance of GLaSS when estimating labels for unlabelled nodes in graphs, and reveals a significant trend of increasing partisanship within the United States Congress.

Submitted 16 June, 2019; v1 submitted 1 April, 2019; originally announced April 2019.

Comments: 18 pages, 4 figures, submitted to Applied Network Science

arXiv:1811.05063 [pdf, ps, other]

SMERC: Social media event response clustering using textual and temporal information

Authors: Peter Mathews, Caitlin Gray, Lewis Mitchell, Giang T. Nguyen, Nigel G. Bean

Abstract: Tweet clustering for event detection is a powerful modern method to automate the real-time detection of events. In this work we present a new tweet clustering approach, using a probabilistic approach to incorporate temporal information. By analysing the distribution of time gaps between tweets we show that the gaps between pairs of related tweets exhibit exponential decay, whereas the gaps between unrelated tweets are approximately uniform. Guided by this insight, we use probabilistic arguments to estimate the likelihood that a pair of tweets are related, and build an improved clustering method. Our method Social Media Event Response Clustering (SMERC) creates clusters of tweets based on their tendency to be related to a single event. We evaluate our method at three levels: through traditional event prediction from tweet clustering, by measuring the improvement in quality of clusters created, and also comparing the clustering precision and recall with other methods. By applying SMERC to tweets collected during a number of sporting events, we demonstrate that incorporating temporal information leads to state of the art clustering performance.

Submitted 12 November, 2018; originally announced November 2018.

arXiv:1811.01467 [pdf, other]

The one comparing narrative social network extraction techniques

Authors: Michelle Edwards, Lewis Mitchell, Jonathan Tuke, Matthew Roughan

Abstract: Analysing narratives through their social networks is an expanding field in quantitative literary studies. Manually extracting a social network from any narrative can be time consuming, so automatic extraction methods of varying complexity have been developed. However, the effect of different extraction methods on the analysis is unknown. Here we model and compare three extraction methods for social networks in narratives: manual extraction, co-occurrence automated extraction and automated extraction using machine learning. Although the manual extraction method produces more precise results in the network analysis, it is much more time consuming and the automatic extraction methods yield comparable conclusions for density, centrality measures and edge weights. Our results provide evidence that social networks extracted automatically are reliable for many analyses. We also describe which aspects of analysis are not reliable with such a social network. We anticipate that our findings will make it easier to analyse more narratives, which will help us improve our understanding of how stories are written and evolve, and how people interact with each other.

Submitted 4 November, 2018; originally announced November 2018.

arXiv:1810.11105 [pdf, other]

doi 10.1175/JAS-D-18-0306.1

Axisymmetric constraints on cross-equatorial Hadley cell extent

Authors: Spencer Hill, Simona Bordoni, Jonathan L. Mitchell

Abstract: We consider the relevance of known constraints from each of Hide's theorem, the angular momentum conserving (AMC) model, and the equal-area model on the extent of cross-equatorial Hadley cells. These theories respectively posit that a Hadley circulation must span: all latitudes where the radiative convective equilibrium (RCE) absolute angular momentum ($M_\mathrm{rce}$) satisfies $M_\mathrm{rce}>Ωa^2$ or $M_\mathrm{rce}<0$ or where the RCE absolute vorticity ($η_\mathrm{rce}$) satisfies $fη_\mathrm{rce}<0$; all latitudes where the RCE zonal wind exceeds the AMC zonal wind; and over a range such that depth-averaged potential temperature is continuous and that energy is conserved. The AMC model requires knowledge of the ascent latitude $\varphi_\mathrm{a}$, which need not equal the RCE forcing maximum latitude $\varphi_\mathrm{m}$. Whatever the value of $\varphi_\mathrm{a}$, we demonstrate that an AMC cell must extend at least as far into the winter hemisphere as the summer hemisphere. The equal-area model predicts $\varphi_\mathrm{a}$, always placing it poleward of $\varphi_\mathrm{m}$. As $\varphi_\mathrm{m}$ is moved poleward (at a given thermal Rossby number), the equal-area predicted Hadley circulation becomes implausibly large, while both $\varphi_\mathrm{m}$ and $\varphi_\mathrm{a}$ become increasingly displaced poleward of the minimal cell extent based on Hide's theorem (i.e. of supercritical forcing). In an idealized dry general circulation model, cross-equatorial Hadley cells are generated, some spanning nearly pole-to-pole. All homogenize angular momentum imperfectly, are roughly symmetric in extent about the equator, and appear in extent controlled by the span of supercritical forcing.

Submitted 23 May, 2019; v1 submitted 25 October, 2018; originally announced October 2018.

Comments: 18 pages, 9 figures, published

Journal ref: Journal of the Atmospheric Sciences, Volume 76, pp 1547-1564, 2019

arXiv:1805.07011 [pdf, other]

doi 10.1016/j.physd.2018.05.002

A shadowing-based inflation scheme for ensemble data assimilation

Authors: Thomas Bellsky, Lewis Mitchell

Abstract: Artificial ensemble inflation is a common technique in ensemble data assimilation, whereby the ensemble covariance is periodically increased in order to prevent deviation of the ensemble from the observations and possible ensemble collapse. This manuscript introduces a new form of covariance inflation for ensemble data assimilation based upon shadowing ideas from dynamical systems theory. We present results from a low order nonlinear chaotic system that supports using shadowing inflation, demonstrating that shadowing inflation is more robust to parameter tuning than standard multiplicative covariance inflation, outperforming in observation-sparse scenarios and often leading to longer forecast shadowing times.

Submitted 6 September, 2018; v1 submitted 17 May, 2018; originally announced May 2018.

arXiv:1802.05039 [pdf, other]

Super-blockers and the effect of network structure on information cascades

Authors: Caitlin Gray, Lewis Mitchell, Matthew Roughan

Abstract: Modelling information cascades over online social networks is important in fields from marketing to civil unrest prediction; however, the underlying network structure strongly affects the probability and nature of such cascades. Even with simple cascade dynamics the probability of large cascades is almost entirely dictated by network properties, with well-known networks such as Erdős-Rényi and Barabási-Albert producing wildly different cascades from the same model. Indeed, the notion of 'super-spreaders' has arisen to describe highly influential nodes promoting global cascades in a social network. Here we use a simple model of global cascades to show that the presence of locality in the network increases the probability of a global cascade due to the increased vulnerability of connecting nodes. Rather than 'super-spreaders', we find that the presence of these highly connected 'super-blockers' in heavy-tailed networks in fact reduces the probability of global cascades, while promoting information spread when targeted as the initial spreader.

Submitted 21 March, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

arXiv:1711.08552 [pdf, other]

doi 10.5194/gmd-11-4359-2018

Thetis coastal ocean model: discontinuous Galerkin discretization for the three-dimensional hydrostatic equations

Authors: Tuomas Kärnä, Stephan C. Kramer, Lawrence Mitchell, David A. Ham, Matthew D. Piggott, António M. Baptista

Abstract: Unstructured grid ocean models are advantageous for simulating the coastal ocean and river-estuary-plume systems. However, unstructured grid models tend to be diffusive and/or computationally expensive, which limits their applicability to real life problems. In this paper, we describe a novel discontinuous Galerkin (DG) finite element discretization for the hydrostatic equations. The formulation is fully conservative and second-order accurate in space and time. Monotonicity of the advection scheme is ensured by using a strong stability preserving time integration method and slope limiters. Compared to previous DG models, advantages include a more accurate mode splitting method, revised viscosity formulation, and new second-order time integration scheme. We demonstrate that the model is capable of simulating baroclinic flows in the eddying regime with a suite of test cases. Numerical dissipation is well-controlled, being comparable to or lower than in existing state-of-the-art structured grid models.

Submitted 18 October, 2018; v1 submitted 22 November, 2017; originally announced November 2017.

Comments: Submitted to Geoscientific Model Development

Journal ref: Geoscientific Model Development 11:4359-4382 (2018)

arXiv:1711.00326 [pdf, other]

doi 10.1063/1.5011403

The quoter model: a paradigmatic model of the social flow of written information

Authors: James P. Bagrow, Lewis Mitchell

Abstract: We propose a model for the social flow of information in the form of text data, which simulates the posting and sharing of short social media posts. Nodes in a graph representing a social network take turns generating words, leading to a symbolic time series associated with each node. Information propagates over the graph via a quoting mechanism, where nodes randomly copy short segments of text from each other. We characterize information flows from these texts via information-theoretic estimators, and we derive analytic relationships between model parameters and the values of these estimators. We explore and validate the model with simulations on small network motifs and larger random graphs. Tractable models such as ours that generate symbolic data while controlling the information flow allow us to test and compare measures of information flow applicable to real social media data. In particular, by choosing different network structures, we can develop test scenarios to determine whether or not measures of information flow can distinguish between true and spurious interactions, and how topological network properties relate to information flow.

Submitted 11 July, 2018; v1 submitted 1 November, 2017; originally announced November 2017.

Comments: 11 pages, 9 figures

Journal ref: Chaos 28, 075304 (2018)

arXiv:1708.04575 [pdf, other]

doi 10.1038/s41562-018-0510-5

Information flow reveals prediction limits in online social activity

Authors: James P. Bagrow, Xipei Liu, Lewis Mitchell

Abstract: Modern society depends on the flow of information over online social networks, and users of popular platforms generate significant behavioral data about themselves and their social ties. However, it remains unclear what fundamental limits exist when using these data to predict the activities and interests of individuals, and to what accuracy such predictions can be made using an individual's social ties. Here we show that 95% of the potential predictive accuracy for an individual is achievable using their social ties only, without requiring that individual's data. We use information theoretic tools to estimate the predictive information within the writings of Twitter users, providing an upper bound on the available predictive information that holds for any predictive or machine learning methods. As few as 8-9 of an individual's contacts are sufficient to obtain predictability comparable to that of the individual alone. Distinct temporal and social effects are visible by measuring information flow along social ties, allowing us to better study the dynamics of online activity. Our results have distinct privacy implications: information is so strongly embedded in a social network that in principle one can profile an individual from their available social ties even when the individual forgoes the platform completely.

Submitted 9 February, 2019; v1 submitted 15 August, 2017; originally announced August 2017.

Comments: 15 pages, 4 figures, supplementary information included

Journal ref: Nature Human Behaviour 3 (2019) 122-128

arXiv:1703.06361 [pdf, other]

Which friends are more popular than you? Contact strength and the friendship paradox in social networks

Authors: James P. Bagrow, Christopher M. Danforth, Lewis Mitchell

Abstract: The friendship paradox states that in a social network, egos tend to have lower degree than their alters, or, "your friends have more friends than you do". Most research has focused on the friendship paradox and its implications for information transmission, but treating the network as static and unweighted. Yet, people can dedicate only a finite fraction of their attention budget to each social interaction: a high-degree individual may have less time to dedicate to individual social links, forcing them to modulate the quantities of contact made to their different social ties. Here we study the friendship paradox in the context of differing contact volumes between egos and alters, finding a connection between contact volume and the strength of the friendship paradox. The most frequently contacted alters exhibit a less pronounced friendship paradox compared with the ego, whereas less-frequently contacted alters are more likely to be high degree and give rise to the paradox. We argue therefore for a more nuanced version of the friendship paradox: "your closest friends have slightly more friends than you do", and in certain networks even: "your best friend has no more friends than you do". We demonstrate that this relationship is robust, holding in both a social media and a mobile phone dataset. These results have implications for information transfer and influence in social networks, which we explore using a simple dynamical model.

Submitted 18 March, 2017; originally announced March 2017.

arXiv:1703.05545 [pdf, other]

doi 10.1145/3041021.3053903

The nature and origin of heavy tails in retweet activity

Authors: Peter Mathews, Lewis Mitchell, Giang T. Nguyen, Nigel G. Bean

Abstract: Modern social media platforms facilitate the rapid spread of information online. Modelling phenomena such as social contagion and information diffusion are contingent upon a detailed understanding of the information-sharing processes. In Twitter, an important aspect of this occurs with retweets, where users rebroadcast the tweets of other users. To improve our understanding of how these distributions arise, we analyse the distribution of retweet times. We show that a power law with exponential cutoff provides a better fit than the power laws previously suggested. We explain this fit through the burstiness of human behaviour and the priorities individuals place on different tasks.

Submitted 16 March, 2017; originally announced March 2017.

Comments: To appear in MSM 2017: 8th International Workshop on Modelling Social Media: Machine Learning and AI for Modelling and Analysing Social Media, April 2017, Perth, Australia

arXiv:1609.08283 [pdf, other]

A data-driven model for influenza transmission incorporating media effects

Authors: Lewis Mitchell, Joshua V. Ross

Abstract: Numerous studies have attempted to model the effect of mass media on the transmission of diseases such as influenza, however quantitative data on media engagement has until recently been difficult to obtain. With the recent explosion of "big data" coming from online social media and the like, large volumes of data on a population's engagement with mass media during an epidemic are becoming available to researchers. In this study we combine an online data set comprising millions of shared messages relating to influenza with traditional surveillance data on flu activity to suggest a functional form for the relationship between the two. Using this data we present a simple deterministic model for influenza dynamics incorporating media effects, and show that such a model helps explain the dynamics of historical influenza outbreaks. Furthermore, through model selection we show that the proposed media function fits historical data better than other media functions proposed in earlier studies.

Submitted 27 September, 2016; originally announced September 2016.

Comments: To appear in Royal Society Open Science

arXiv:1608.06313 [pdf, other]

doi 10.1103/PhysRevE.95.052301

Simon's fundamental rich-get-richer model entails a dominant first-mover advantage

Authors: Peter Sheridan Dodds, David Rushing Dewhurst, Fletcher F. Hazlehurst, Colin M. Van Oort, Lewis Mitchell, Andrew J. Reagan, Jake Ryland Williams, Christopher M. Danforth

Abstract: Herbert Simon's classic rich-get-richer model is one of the simplest empirically supported mechanisms capable of generating heavy-tail size distributions for complex systems. Simon argued analytically that a population of flavored elements growing by either adding a novel element or randomly replicating an existing one would afford a distribution of group sizes with a power-law tail. Here, we show that, in fact, Simon's model does not produce a simple power law size distribution as the initial element has a dominant first-mover advantage, and will be overrepresented by a factor proportional to the inverse of the innovation probability. The first group's size discrepancy cannot be explained away as a transient of the model, and may therefore be many orders of magnitude greater than expected. We demonstrate how Simon's analysis was correct but incomplete, and expand our alternate analysis to quantify the variability of long term rankings for all groups. We find that the expected time for a first replication is infinite, and show how an incipient group must break the mechanism to improve their odds of success. We present an example of citation counts for a specific field that demonstrates a first-mover advantage consistent with our revised view of the rich-get-richer mechanism. Our findings call for a reexamination of preceding work invoking Simon's model and provide an expanded understanding going forward.

Submitted 4 May, 2017; v1 submitted 16 August, 2016; originally announced August 2016.

Comments: 8 pages, 3 figures

Journal ref: Phys. Rev. E 95, 052301 (2017)

arXiv:1605.00492 [pdf, other]

doi 10.1016/j.jcp.2016.09.037

High level implementation of geometric multigrid solvers for finite element problems: applications in atmospheric modelling

Authors: Lawrence Mitchell, Eike Hermann Müller

Abstract: The implementation of efficient multigrid preconditioners for elliptic partial differential equations (PDEs) is a challenge due to the complexity of the resulting algorithms and corresponding computer code. For sophisticated finite element discretisations on unstructured grids an efficient implementation can be very time consuming and requires the programmer to have in-depth knowledge of the mathematical theory, parallel computing and optimisation techniques on manycore CPUs. In this paper we show how the development of bespoke multigrid preconditioners can be simplified significantly by using a framework which allows the expression of each component of the algorithm at the correct abstraction level. Our approach (1) allows the expression of the finite element problem in a language which is close to the mathematical formulation of the problem, (2) guarantees the automatic generation and efficient execution of parallel optimised low-level computer code and (3) is flexible enough to support different abstraction levels and give the programmer control over details of the preconditioner. We use the composable abstractions of the Firedrake/PyOP2 package to demonstrate the efficiency of this approach for the solution of strongly anisotropic PDEs in atmospheric modelling. The weak formulation of the PDE is expressed in Unified Form Language (UFL) and the lower PyOP2 abstraction layer allows the manual design of computational kernels for a bespoke geometric multigrid preconditioner.
We compare the performance of this preconditioner to a single-level method and hypre's BoomerAMG algorithm. The Firedrake/PyOP2 code is inherently parallel and we present a detailed performance analysis for a single node (24 cores) on the ARCHER supercomputer. Our implementation utilises a significant fraction of the available memory bandwidth and shows very good weak scaling on up to 6,144 compute cores.

Submitted 14 September, 2016; v1 submitted 2 May, 2016; originally announced May 2016.

Comments: 22 pages, 5 figures, 9 tables. Submitted to JCP

MSC Class: 65F08; 65N55; 76M10; 86A10 ACM Class: D.2.2; G.1.3; G.1.8; G.4; J.2

Journal ref: Journal of Computational Physics 327:1-18 (2016)

arXiv:1508.05938 [pdf, other]

Tracking the Teletherms: The spatiotemporal dynamics of the hottest and coldest days of the year

Authors: Peter Sheridan Dodds, Lewis Mitchell, Andrew J. Reagan, Christopher M. Danforth

Abstract: Instabilities and long term shifts in seasons, whether induced by natural drivers or human activities, pose great disruptive threats to ecological, agricultural, and social systems. Here, we propose, measure, and explore two fundamental markers of location-sensitive seasonal variations: the Summer and Winter Teletherms---the on-average annual dates of the hottest and coldest days of the year. We analyse daily temperature extremes recorded at 1218 stations across the contiguous United States from 1853--2012, and observe large regional variation with the Summer Teletherm falling up to 90 days after the Summer Solstice, and 50 days for the Winter Teletherm after the Winter Solstice. We show that Teletherm temporal dynamics are substantive with clear and in some cases dramatic shifts reflective of system bifurcations. We also compare recorded daily temperature extremes with output from two regional climate models finding considerable though relatively unbiased error. Our work demonstrates that Teletherms are an intuitive, powerful, and statistically sound measure of local climate change, and that they pose detailed, stringent challenges for future theoretical and computational models.

Submitted 16 March, 2016; v1 submitted 24 August, 2015; originally announced August 2015.

Comments: Manuscript: 13 pages, 8 Figures; Supplementary: 19 pages, 21 Figures

arXiv:1507.05098 [pdf, other]

The Lexicocalorimeter: Gauging public health through caloric input and output on social media

Authors: S. E. Alajajian, J. R. Williams, A. J. Reagan, S. C. Alajajian, M. R. Frank, L. Mitchell, J. Lahne, C. M. Danforth, P. S. Dodds

Abstract: We propose and develop a Lexicocalorimeter: an online, interactive instrument for measuring the "caloric content" of social media and other large-scale texts. We do so by constructing extensive yet improvable tables of food and activity related phrases, and respectively assigning them with sourced estimates of caloric intake and expenditure. We show that for Twitter, our naive measures of "caloric input", "caloric output", and the ratio of these measures are all strong correlates with health and well-being measures for the contiguous United States. Our caloric balance measure in many cases outperforms both its constituent quantities, is tunable to specific health and well-being measures such as diabetes rates, has the capability of providing a real-time signal reflecting a population's health, and has the potential to be used alongside traditional survey data in the development of public policy and collective self-awareness. Because our Lexicocalorimeter is a linear superposition of principled phrase scores, we also show we can move beyond correlations to explore what people talk about in collective detail, and assist in the understanding and explanation of how population-scale conditions vary, a capacity unavailable to black-box type methods.

Submitted 10 January, 2017; v1 submitted 17 July, 2015; originally announced July 2015.

Comments: Manuscript: 17 pages, 8 figures, 1 table, Supplementary Information: 10 pages, 7 figures, 3 tables

arXiv:1507.03886 [pdf, other]

doi 10.1103/PhysRevE.93.052314

The game story space of professional sports: Australian Rules Football

Authors: D. P. Kiley, A. J. Reagan, L. Mitchell, C. M. Danforth, P. S. Dodds

Abstract: Sports are spontaneous generators of stories. Through skill and chance, the script of each game is dynamically written in real time by players acting out possible trajectories allowed by a sport's rules. By properly characterizing a given sport's ecology of `game stories', we are able to capture the sport's capacity for unfolding interesting narratives, in part by contrasting them with random walks. Here, we explore the game story space afforded by a data set of 1,310 Australian Football League (AFL) score lines. We find that AFL games exhibit a continuous spectrum of stories rather than distinct clusters. We show how coarse-graining reveals identifiable motifs ranging from last minute comeback wins to one-sided blowouts. Through an extensive comparison with biased random walks, we show that real AFL games deliver a broader array of motifs than null models, and we provide consequent insights into the narrative appeal of real games.

Submitted 23 May, 2016; v1 submitted 25 June, 2015; originally announced July 2015.

Comments: 15 pages, 19 figures

Journal ref: Phys. Rev. E 93, 052314 (2016)

arXiv:1505.06750 [pdf, other]

doi 10.1073/pnas.1505647112

Reply to Garcia et al.: Common mistakes in measuring frequency dependent word characteristics

Authors: P. S. Dodds, E. M. Clark, S. Desu, M. R. Frank, A. J. Reagan, J. R. Williams, L. Mitchell, K. D. Harris, I. M. Kloumann, J. P. Bagrow, K. Megerdoomian, M. T. McMahon, B. F. Tivnan, C. M. Danforth

Abstract: We demonstrate that the concerns expressed by Garcia et al. are misplaced, due to (1) a misreading of our findings in [1]; (2) a widespread failure to examine and present words in support of asserted summary quantities based on word usage frequencies; and (3) a range of misconceptions about word usage frequency, word rank, and expert-constructed word lists. In particular, we show that the English component of our study compares well statistically with two related surveys, that no survey design influence is apparent, and that estimates of measurement error do not explain the positivity biases reported in our work and that of others. We further demonstrate that for the frequency dependence of positivity---of which we explored the nuances in great detail in [1]---Garcia et al. did not perform a reanalysis of our data---they instead carried out an analysis of a different, statistically improper data set and introduced a nonlinearity before performing linear regression.

Submitted 28 May, 2015; v1 submitted 25 May, 2015; originally announced May 2015.

Comments: 5 pages, 2 figures, 1 table. Expanded version of reply appearing in PNAS 2015

arXiv:1505.03804 [pdf, other]

doi 10.1371/journal.pone.0136092

Climate change sentiment on Twitter: An unsolicited public opinion poll

Authors: Emily M. Cody, Andrew J. Reagan, Lewis Mitchell, Peter Sheridan Dodds, Christopher M. Danforth

Abstract: The consequences of anthropogenic climate change are extensively debated through scientific papers, newspaper articles, and blogs. Newspaper articles may lack accuracy, while the severity of findings in scientific papers may be too opaque for the public to understand. Social media, however, is a forum where individuals of diverse backgrounds can share their thoughts and opinions. As consumption shifts from old media to new, Twitter has become a valuable resource for analyzing current events and headline news. In this research, we analyze tweets containing the word "climate" collected between September 2008 and July 2014. Through use of a previously developed sentiment measurement tool called the Hedonometer, we determine how collective sentiment varies in response to climate change news, events, and natural disasters. We find that natural disasters, climate bills, and oil-drilling can contribute to a decrease in happiness while climate rallies, a book release, and a green ideas contest can contribute to an increase in happiness. Words uncovered by our analysis suggest that responses to climate change news are predominately from climate change activists rather than climate change deniers, indicating that Twitter is a valuable resource for the spread of climate change awareness.

Submitted 30 July, 2015; v1 submitted 14 May, 2015; originally announced May 2015.

Comments: 11 pages, 10 figures

arXiv:1410.1393 [pdf, other]

Constructing a taxonomy of fine-grained human movement and activity motifs through social media

Authors: Morgan R. Frank, Jake Ryland Williams, Lewis Mitchell, James P. Bagrow, Peter Sheridan Dodds, Christopher M. Danforth

Abstract: Profiting from the emergence of web-scale social data sets, numerous recent studies have systematically explored human mobility patterns over large populations and large time scales. Relatively little attention, however, has been paid to mobility and activity over smaller time-scales, such as a day. Here, we use Twitter to identify people's frequently visited locations along with their likely activities as a function of time of day and day of week, capitalizing on both the content and geolocation of messages. We subsequently characterize people's transition pattern motifs and demonstrate that spatial information is encoded in word choice.

Submitted 11 May, 2015; v1 submitted 28 September, 2014; originally announced October 2014.

arXiv:1409.0589 [pdf, other]

doi 10.1002/qj.2451

Accounting for model error due to unresolved scales within ensemble Kalman filtering

Authors: Lewis Mitchell, Alberto Carrassi

Abstract: We propose a method to account for model error due to unresolved scales in the context of the ensemble transform Kalman filter (ETKF). The approach extends to this class of algorithms the deterministic model error formulation recently explored for variational schemes and extended Kalman filter. The model error statistic required in the analysis update is estimated using historical reanalysis increments and a suitable model error evolution law. Two different versions of the method are described; a time-constant model error treatment where the same model error statistical description is time-invariant, and a time-varying treatment where the assumed model error statistics is randomly sampled at each analysis step. We compare both methods with the standard method of dealing with model error through inflation and localization, and illustrate our results with numerical simulations on a low order nonlinear system exhibiting chaotic dynamics. The results show that the filter skill is significantly improved through the proposed model error treatments, and that both methods require far less parameter tuning than the standard approach. Furthermore, the proposed approach is simple to implement within a pre-existing ensemble based scheme. The general implications for the use of the proposed approach in the framework of square-root filters such as the ETKF are also discussed.

Submitted 1 September, 2014; originally announced September 2014.

Comments: 12 pages, 9 figures, to appear in Quarterly Journal of the Royal Meteorological Society

arXiv:1406.3855 [pdf, other]

Human language reveals a universal positivity bias

Authors: Peter Sheridan Dodds, Eric M. Clark, Suma Desu, Morgan R. Frank, Andrew J. Reagan, Jake Ryland Williams, Lewis Mitchell, Kameron Decker Harris, Isabel M. Kloumann, James P. Bagrow, Karine Megerdoomian, Matthew T. McMahon, Brian F. Tivnan, Christopher M. Danforth

Abstract: Using human evaluation of 100,000 words spread across 24 corpora in 10 languages diverse in origin and culture, we present evidence of a deep imprint of human sociality in language, observing that (1) the words of natural human language possess a universal positivity bias; (2) the estimated emotional content of words is consistent between languages under translation; and (3) this positivity bias is strongly independent of frequency of word usage. Alongside these general regularities, we describe inter-language variations in the emotional spectrum of languages which allow us to rank corpora. We also show how our word evaluations can be used to construct physical-like instruments for both real-time and offline measurement of the emotional content of large-scale texts.

Submitted 15 June, 2014; originally announced June 2014.

Comments: Manuscript: 7 pages, 4 figures; Supplementary Material: 49 pages, 43 figures, 6 tables. Online appendices available at http://www.uvm.edu/storylab/share/papers/dodds2014a/

arXiv:1312.6122 [pdf, other]

Shadow networks: Discovering hidden nodes with models of information flow

Authors: James P. Bagrow, Suma Desu, Morgan R. Frank, Narine Manukyan, Lewis Mitchell, Andrew Reagan, Eric E. Bloedorn, Lashon B. Booker, Luther K. Branting, Michael J. Smith, Brian F. Tivnan, Christopher M. Danforth, Peter S. Dodds, Joshua C. Bongard

Abstract: Complex, dynamic networks underlie many systems, and understanding these networks is the concern of a great span of important scientific and engineering problems. Quantitative description is crucial for this understanding yet, due to a range of measurement problems, many real network datasets are incomplete. Here we explore how accidentally missing or deliberately hidden nodes may be detected in networks by the effect of their absence on predictions of the speed with which information flows through the network. We use Symbolic Regression (SR) to learn models relating information flow to network topology. These models show localized, systematic, and non-random discrepancies when applied to test networks with intentionally masked nodes, demonstrating the ability to detect the presence of missing nodes and where in the network those nodes are likely to reside.

Submitted 20 December, 2013; originally announced December 2013.

Comments: 12 pages, 3 figures

arXiv:1306.3488 [pdf, other]

doi 10.1175/MWR-D-13-00200.1

Non-global parameter estimation using local ensemble Kalman filtering

Authors: Thomas Bellsky, Jesse Berwald, Lewis Mitchell

Abstract: We study parameter estimation for non-global parameters in a low-dimensional chaotic model using the local ensemble transform Kalman filter (LETKF). By modifying existing techniques for using observational data to estimate global parameters, we present a methodology whereby spatially-varying parameters can be estimated using observations only within a localized region of space. Taking a low-dimensional nonlinear chaotic conceptual model for atmospheric dynamics as our numerical testbed, we show that this parameter estimation methodology accurately estimates parameters which vary in both space and time, as well as parameters representing physics absent from the model.

Submitted 22 November, 2013; v1 submitted 14 June, 2013; originally announced June 2013.

MSC Class: 37N10; 37M05; 62-07; 62H12

Journal ref: Monthly Weather Review, 142, 2150-2164, 2014

arXiv:1304.1296 [pdf, other]

doi 10.1038/srep02625

Happiness and the Patterns of Life: A Study of Geolocated Tweets

Authors: Morgan R. Frank, Lewis Mitchell, Peter S. Dodds, Christopher M. Danforth

Abstract: The patterns of life exhibited by large populations have been described and modeled both as a basic science exercise and for a range of applied goals such as reducing automotive congestion, improving disaster response, and even predicting the location of individuals. However, these studies previously had limited access to conversation content, rendering changes in expression as a function of movement invisible. In addition, they typically use the communication between a mobile phone and its nearest antenna tower to infer position, limiting the spatial resolution of the data to the geographical region serviced by each cellphone tower. We use a collection of 37 million geolocated tweets to characterize the movement patterns of 180,000 individuals, taking advantage of several orders of magnitude of increased spatial accuracy relative to previous work. Employing the recently developed sentiment analysis instrument known as the 'hedonometer', we characterize changes in word usage as a function of movement, and find that expressed happiness increases logarithmically with distance from an individual's average location.

Submitted 12 September, 2013; v1 submitted 4 April, 2013; originally announced April 2013.

Comments: 12 page main document, 12 page supplement, 21 figures

Journal ref: Scientific Reports, Vol 3, No 2625, 2013

arXiv:1302.3299 [pdf, other]

doi 10.1371/journal.pone.0064417

The Geography of Happiness: Connecting Twitter sentiment and expression, demographics, and objective characteristics of place

Authors: Lewis Mitchell, Kameron Decker Harris, Morgan R. Frank, Peter Sheridan Dodds, Christopher M. Danforth

Abstract: We conduct a detailed investigation of correlations between real-time expressions of individuals made across the United States and a wide range of emotional, geographic, demographic, and health characteristics. We do so by combining (1) a massive, geo-tagged data set comprising over 80 million words generated over the course of several recent years on the social network service Twitter and (2) annually-surveyed characteristics of all 50 states and close to 400 urban populations. Among many results, we generate taxonomies of states and cities based on their similarities in word use; estimate the happiness levels of states and cities; correlate highly-resolved demographic characteristics with happiness levels; and connect word choice and message length with urban characteristics such as education levels and obesity rates. Our results show how social media may potentially be used to estimate real-time levels and changes in population-level measures such as obesity rates.

Submitted 18 May, 2013; v1 submitted 13 February, 2013; originally announced February 2013.

Journal ref: PLoS ONE 8(5): e64417, 2013

arXiv:1210.5246 [pdf, ps, other]

doi 10.1093/mnras/sts228

Collisionless Stellar Hydrodynamics as an Efficient Alternative to N-body Methods

Authors: Nigel L. Mitchell, Eduard I. Vorobyov, Gerhard Hensler

Abstract: For simulations that deal only with dark matter or stellar systems, the conventional N-body technique is fast, memory efficient, and relatively simple to implement. However when including the effects of gas physics, mesh codes are at a distinct disadvantage compared to SPH. Whilst implementing the N-body approach into SPH codes is fairly trivial, the particle-mesh technique used in mesh codes to couple collisionless stars and dark matter to the gas on the mesh, has a series of significant scientific and technical limitations. These include spurious entropy generation resulting from discreteness effects, poor load balancing and increased communication overhead which spoil the excellent scaling in massively parallel grid codes. We propose the use of the collisionless Boltzmann moment equations as a means to model collisionless material as a fluid on the mesh, implementing it into the massively parallel FLASH AMR code. This approach, which we term "collisionless stellar hydrodynamics" enables us to do away with the particle-mesh approach. Since the parallelisation scheme is identical to that used for the hydrodynamics, it preserves the excellent scaling of the FLASH code already demonstrated on peta-flop machines. We find the classic hydrodynamic equations and Boltzmann moment equations can be reconciled under specific conditions, allowing us to generate analytic solutions for collisionless systems using conventional test problems.
We confirm the validity of our approach using a suite of demanding test problems, including the use of a modified Sod shock test. We conclude by demonstrating the ability of our code to model complex phenomena by simulating the evolution of a spiral galaxy whose properties agree with those predicted by swing amplification theory. (Abridged)

Submitted 18 October, 2012; originally announced October 2012.

Comments: Accepted for publication in the Monthly Notices of the Royal Astronomical Society

arXiv:1204.1999 [pdf, ps, other]

doi 10.1063/1.4704805

On finite-size Lyapunov exponents in multiscale systems

Authors: Lewis Mitchell, Georg A. Gottwald

Abstract: We study the effect of regime switches on finite size Lyapunov exponents (FSLEs) in determining the error growth rates and predictability of multiscale systems. We consider a dynamical system involving slow and fast regimes and switches between them. The surprising result is that due to the presence of regimes the error growth rate can be a non-monotonic function of initial error amplitude. In particular, troughs in the large scales of FSLE spectra are shown to be a signature of slow regimes, whereas fast regimes are shown to cause large peaks in the spectra where error growth rates far exceed those estimated from the maximal Lyapunov exponent. We present analytical results explaining these signatures and corroborate them with numerical simulations. We show further that these peaks disappear in stochastic parametrizations of the fast chaotic processes, and the associated FSLE spectra reveal that large scale predictability properties of the full deterministic model are well approximated whereas small scale features are not properly resolved.

Submitted 9 April, 2012; originally announced April 2012.

Comments: Accepted for publication in Chaos

Journal ref: Chaos, 22 (2), 023115, 2012

arXiv:1110.6671 [pdf, ps, other]

doi 10.1175/JAS-D-11-0145.1

Data assimilation in slow-fast systems using homogenized climate models

Authors: Lewis Mitchell, Georg A. Gottwald

Abstract: A deterministic multiscale toy model is studied in which a chaotic fast subsystem triggers rare transitions between slow regimes, akin to weather or climate regimes. Using homogenization techniques, a reduced stochastic parametrization model is derived for the slow dynamics. The reliability of this reduced climate model in reproducing the statistics of the slow dynamics of the full deterministic model for finite values of the time scale separation is numerically established. The statistics, however, are sensitive to uncertainties in the parameters of the stochastic model. It is investigated whether the stochastic climate model can be beneficial as a forecast model in an ensemble data assimilation setting, in particular in the realistic setting when observations are only available for the slow variables. The main result is that reduced stochastic models can indeed improve the analysis skill, when used as forecast models instead of the perfect full deterministic model. The stochastic climate model is far superior at detecting transitions between regimes. The observation intervals for which skill improvement can be obtained are related to the characteristic time scales involved. The reason why stochastic climate models are capable of producing superior skill in an ensemble setting is the finite ensemble size; ensembles obtained from the perfect deterministic forecast model lack sufficient spread even for moderate ensemble sizes. Stochastic climate models provide a natural way to provide sufficient ensemble spread to detect transitions between regimes. This is corroborated with numerical simulations. The conclusion is that stochastic parametrizations are attractive for data assimilation despite their sensitivity to uncertainties in the parameters.

Submitted 30 October, 2011; originally announced October 2011.

Comments: Accepted for publication in Journal of the Atmospheric Sciences

ACM Class: J.2

Journal ref: Journal of the Atmospheric Sciences, 69(4), pp. 1359-1377, 2012

Showing 1–50 of 55 results for author: Mitchell, L