-
Decoding the Molecular Universe -- Workshop Report
Abstract: On August 9-10, 2023, a workshop was convened at the Pacific Northwest National Laboratory (PNNL) in Richland, WA that brought together a group of internationally recognized experts in metabolomics, natural products discovery, chemical ecology, chemical and biological threat assessment, cheminformatics, computational chemistry, cloud computing, artificial intelligence, and novel technology develop… ▽ More
Submitted 19 November, 2023; originally announced November 2023.
-
DEIMoS: an open-source tool for processing high-dimensional mass spectrometry data
Abstract: We present DEIMoS: Data Extraction for Integrated Multidimensional Spectrometry, a Python application programming interface (API) and command-line tool for high-dimensional mass spectrometry data analysis workflows that offers ease of development and access to efficient algorithmic implementations. Functionality includes feature detection, feature alignment, collision cross section (CCS) calibrati… ▽ More
Submitted 6 December, 2021; originally announced December 2021.
-
arXiv:2108.07815 [pdf, ps, other]
Long-term stability of planets in and around binary stars
Abstract: Planets are observed to orbit the component star(s) of stellar binary systems on so-called circumprimary or circumsecondary orbits, as well as around the entire binary system on so-called circumbinary orbits. Depending on the orbital parameters of the binary system a planet will be dynamically stable if it orbits within some critical separation of the semimajor axis in the circumprimary case, or b… ▽ More
Submitted 17 August, 2021; originally announced August 2021.
Comments: 13 pages, 7 figures, 2 short appendices, accepted for publication in MNRAS. A video abstract created by Helena Gibbon is available here: https://youtu.be/76kANnTK9-s
-
arXiv:2108.05215 [pdf, ps, other]
Communicating extraterrestrial intelligence (CETI) interaction models based on the Drake Equation
Abstract: The Drake Equation has proven fertile ground for speculation about the abundance, or lack thereof, of communicating extraterrestrial intelligences (CETIs) for decades. It has been augmented by subsequent authors to include random variables in order to understand its probabilistic behavior. However, in most cases, the emergence and lifetime of CETIs are assumed to be independent of each other. In t… ▽ More
Submitted 17 October, 2022; v1 submitted 18 June, 2021; originally announced August 2021.
Comments: 12 pages, 3 figures
Journal ref: International Journal of Astrobiology (2022), p. 1-9
-
arXiv:2005.12710 [pdf, ps, other]
An improved estimate of the inverse binary entropy function
Abstract: Two estimates for the inverse binary entropy function are derived using the property of information entropy to estimate combinatorics of sequences as well as related formulas from population genetics for the effective number of alleles. The second estimate shows close correspondence to the actual value of the inverse binary entropy function and can be seen as a close approximation away from low va… ▽ More
Submitted 5 May, 2020; originally announced May 2020.
Comments: 5 pages, 2 figures
-
arXiv:1712.10287 [pdf, ps, other]
Bitcoin Average Dormancy: A Measure of Turnover and Trading Activity
Abstract: Attempts to accurately measure the monetary velocity or related properties of bitcoin used in transactions have often attempted to either directly apply definitions from traditional macroeconomic theory or to use specialized metrics relative to the properties of the Blockchain like bitcoin days destroyed. In this paper, it is demonstrated that beyond being a useful metric, bitcoin days destroyed h… ▽ More
Submitted 7 February, 2018; v1 submitted 20 December, 2017; originally announced December 2017.
Comments: 12 pages, 5 figures; accepted and to appear in Ledger
Journal ref: Ledger, 3, 91-99 (2018)
-
arXiv:1512.00750 [pdf, ps, other]
A Mutual Information Approach to Calculating Nonlinearity
Abstract: A new method to measure nonlinear dependence between two variables is described using mutual information to analyze the separate linear and nonlinear components of dependence. This technique, which gives an exact value for the proportion of linear dependence, is then compared with another common test for linearity, the Brock, Dechert and Scheinkman (BDS) test.
Submitted 29 November, 2015; originally announced December 2015.
Comments: 13 pages, 2 figures
Journal ref: STAT, 4, 291-303 (2015)
-
arXiv:1501.05606 [pdf, ps, other]
A rapid algorithm to calculate joint probability matrices for joint entropies of arbitrary order
Abstract: There is no closed form analytical equation or quick method to calculate probabilities based only on the entropy of a signal or process. Except in the cases where there are constraints on the state probabilities, one must typically derive the underlying probabilities through search algorithms. These become more computationally expensive as entropies of higher orders are investigated. In this paper… ▽ More
Submitted 30 November, 2014; originally announced January 2015.
Comments: 4 pages 1 figure
-
arXiv:1410.8082 [pdf, ps, other]
Malware "Ecology" Viewed as Ecological Succession: Historical Trends and Future Prospects
Abstract: The development and evolution of malware including computer viruses, worms, and trojan horses, is shown to be closely analogous to the process of community succession long recognized in ecology. In particular, both changes in the overall environment by external disturbances, as well as, feedback effects from malware competition and antivirus coevolution have driven community succession and the dev… ▽ More
Submitted 24 September, 2014; originally announced October 2014.
Comments: 13 pages, 3 figures
-
arXiv:1308.3616 [pdf, ps, other]
Complexity in animal communication: Estimating the size of N-Gram structures
Abstract: In this paper, new techniques that allow conditional entropy to estimate the combinatorics of symbols are applied to animal communication studies to estimate the communication's repertoire size. By using the conditional entropy estimates at multiple orders, the paper estimates the total repertoire sizes for animal communication across bottlenose dolphins, humpback whales, and several species of bi… ▽ More
Submitted 16 December, 2013; v1 submitted 14 August, 2013; originally announced August 2013.
Comments: 17 pages, 4 figures, 4 tables; accepted and to appear in Entropy
Journal ref: Entropy 2014, 16(1), 526-542
-
arXiv:1307.5251 [pdf, ps, other]
Period doubling, information entropy, and estimates for Feigenbaum's constants
Abstract: The relationship between period doubling bifurcations and Feigenbaum's constants has been studied for nearly 40 years and this relationship has helped uncover many fundamental aspects of universal scaling across multiple nonlinear dynamical systems. This paper will combine information entropy with symbolic dynamics to demonstrate how period doubling can be defined using these tools alone. In addit… ▽ More
Submitted 3 August, 2013; v1 submitted 17 July, 2013; originally announced July 2013.
Comments: 5 pages; accepted to the International Journal of Bifurcation and Chaos
Journal ref: Int. J. Bifurcation Chaos, 23, 1350190 (2013)
-
Distinct word length frequencies: distributions and symbol entropies
Abstract: The distribution of frequency counts of distinct words by length in a language's vocabulary will be analyzed using two methods. The first, will look at the empirical distributions of several languages and derive a distribution that reasonably explains the number of distinct words as a function of length. We will be able to derive the frequency count, mean word length, and variance of word length b… ▽ More
Submitted 14 July, 2012; v1 submitted 10 July, 2012; originally announced July 2012.
Comments: 16 pages, 4 figures
Journal ref: Glottometrics 23, 2012, 7-22
-
arXiv:1201.3580 [pdf, ps, other]
A drift formulation of Gresham's Law
Abstract: In this paper we analyze Gresham's Law, in particular, how the rate of inflow or outflow of currencies is affected by the demand elasticity of arbitrage and the difference in face value ratios inside and outside of a country under a bimetallic system. We find that these equations are very similar to those used to describe drift in systems of free charged particles. In addition, we look at how Gres… ▽ More
Submitted 8 April, 2012; v1 submitted 17 January, 2012; originally announced January 2012.
Comments: 6 pages, 7 figures
Journal ref: Hyperion international journal of econophysics & new economy, volume 5, issue 1, 2012, p. 71-84
-
arXiv:1108.5580 [pdf, ps, other]
Five Differences Between Ecological and Economic Networks
Abstract: Ecological and economic networks have many similarities and are often compared. However, the comparison is often more apt as metaphor than a direct equivalence. In this paper, five key differences are explained which should inform any analysis which compares the two.
Submitted 29 August, 2011; originally announced August 2011.
Comments: 4 pages
-
arXiv:1103.5625 [pdf, ps, other]
Information Theory and Population Genetics
Abstract: The key findings of classical population genetics are derived using a framework based on information theory using the entropies of the allele frequency distribution as a basis. The common results for drift, mutation, selection, and gene flow will be rewritten both in terms of information theoretic measurements and used to draw the classic conclusions for balance conditions and common features of o… ▽ More
Submitted 8 June, 2012; v1 submitted 21 March, 2011; originally announced March 2011.
Comments: 29 pages, 11 figures
-
arXiv:1103.4983 [pdf, ps, other]
Propagation of Cascades in Complex Networks: From Supply Chains to Food Webs
Abstract: A general theory of top-down cascades in complex networks is described which explains two similar types of perturbation amplifications in the complex networks of business supply chains (the `bullwhip effect') and ecological food webs (trophic cascades). The dependence of the strength of the effects on the interaction strength and covariance in the dynamics as well as the graph structure allows bot… ▽ More
Submitted 28 January, 2012; v1 submitted 22 March, 2011; originally announced March 2011.
Comments: 16 pages, 3 figures
-
arXiv:1101.1154 [pdf, ps, other]
Liquid chromatography mass spectrometry-based proteomics: Biological and technological aspects
Abstract: Mass spectrometry-based proteomics has become the tool of choice for identifying and quantifying the proteome of an organism. Though recent years have seen a tremendous improvement in instrument performance and the computational tools used, significant challenges remain, and there are many opportunities for statisticians to make important contributions. In the most widely used "bottom-up" approach… ▽ More
Submitted 6 January, 2011; originally announced January 2011.
Comments: Published in at http://dx.doi.org/10.1214/10-AOAS341 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)
Report number: IMS-AOAS-AOAS341
Journal ref: Annals of Applied Statistics 2010, Vol. 4, No. 4, 1797-1823
-
arXiv:1006.5490 [pdf, ps, other]
Is high-frequency trading inducing changes in market microstructure and dynamics?
Abstract: Using high-frequency time series of stock prices and share volumes sizes from January 2002-May 2009, this paper investigates whether the effects of the onset of high-frequency trading, most prominent since 2005, are apparent in the dynamics of the dollar traded volume. Indeed it is found in almost all of 14 heavily traded stocks, that there has been an increase in the Hurst exponent of dollar trad… ▽ More
Submitted 21 September, 2010; v1 submitted 28 June, 2010; originally announced June 2010.
Comments: 21 pages, 10 figures, 2 tables; v2 corrected small omission (tilde) in Eq. 8; v3 - changed NWSA to NWS (News Corp), explicitly stated TAQ sale condition codes (and added a few) and trade correction indicator exclusions. No substantive changes to graphs or conclusions
-
A note on the entropy of repetitive sequences of symmetry group permutations
Abstract: The paper makes the observation that all orders of information entropy are equal in signals composed of repeating units of distinct symbols where the units can be classified as a member of a symmetry group. This leads to an improved metric for measuring the information content of higher order entropies in data such as text, signals, or genetics and another measure of similarity to compare the incr… ▽ More
Submitted 13 July, 2010; v1 submitted 8 April, 2010; originally announced April 2010.
Comments: Paper has been witdrawn pending corrections to equations and formalism; 3 pages, no figures
-
arXiv:0912.2201 [pdf, ps, other]
Crossing the Line: Towards increasingly fruitful complex systems research for the physics community
Abstract: This article addresses broad trends in interdisciplinary research in physics where interactions with colleagues in fields such as computer science, ecology, or economics can often be derailed by unintentional clashes of methodologies and perspectives on the core science. Key causes of such breakdowns in interdisciplinary work are detailed and solutions offered.
Submitted 26 September, 2011; v1 submitted 11 December, 2009; originally announced December 2009.
Comments: 15 pages
-
arXiv:0911.2158 [pdf, ps, other]
Right on time: Measuring Kuramoto model coupling from a survey of wristwatches
Abstract: Using a survey of wristwatch synchronization from a randomly selected group of independent volunteers, one can model the system as a Kuramoto-type coupled oscillator network. Based on the phase data, both the order parameter and an estimated value of the coupling is derived and the possibilities for similar research to deduce topology from dynamics are discussed.
Submitted 25 March, 2010; v1 submitted 11 November, 2009; originally announced November 2009.
Comments: 7 pages, 3 figures
-
arXiv:0904.3797 [pdf, ps, other]
Internet Traffic Periodicities and Oscillations: A Brief Review
Abstract: Internet traffic displays many persistent periodicities (oscillations) on a large range of time scales. This paper describes the measurement methodology to detect Internet traffic periodicities and also describes the main periodicities in Internet traffic.
Submitted 27 April, 2009; v1 submitted 24 April, 2009; originally announced April 2009.
Comments: 10 pages 2 figures; submitted to Computer Networks
-
arXiv:0901.3863 [pdf, ps, other]
Broadcasting but not receiving: density dependence considerations for SETI signals
Abstract: This paper develops a detailed quantitative model which uses the Drake equation and an assumption of an average maximum radio broadcasting distance by an communicative civilization to derive a minimum civilization density for contact between two civilizations to be probable in a given volume of space under certain conditions, the amount of time it would take for a first contact, and whether reci… ▽ More
Submitted 6 February, 2010; v1 submitted 24 January, 2009; originally announced January 2009.
Comments: 11 pages, 2 figures; note revised 2/6/2010 to correct minor mathematical errors. The conclusions and final numbers of the paper only change slightly so its findings are not invalidated. Quote as Int. J. Astrobio but arxiv version is most correct and updated
Journal ref: International Journal of Astrobiology, volume 8, issue 02, pp. 101-105 (2009)
-
arXiv:0901.1392 [pdf, ps, other]
The Spread of the Credit Crisis: View from a Stock Correlation Network
Abstract: The credit crisis roiling the world's financial markets will likely take years and entire careers to fully understand and analyze. A short empirical investigation of the current trends, however, demonstrates that the losses in certain markets, in this case the US equity markets, follow a cascade or epidemic flow like model along the correlations of various stocks. This phenomenon will be shown b… ▽ More
Submitted 8 June, 2009; v1 submitted 10 January, 2009; originally announced January 2009.
Comments: 4 pages, 3 figures; to appear in the Journal of the Physical Society of Korea; animations of credit crisis spread available at: http://reggiesmithsci.googlepages.com/creditcrisis
Journal ref: J Korean Phys. Soc. 54, 6, p. 2460-2463 (2009)
-
arXiv:0808.3616 [pdf, ps, other]
Constructing word similarities in Meroitic as an aid to decipherment
Abstract: Meroitic is the still undeciphered language of the ancient civilization of Kush. Over the years, various techniques for decipherment such as finding a bilingual text or cognates from modern or other ancient languages in the Sudan and surrounding areas has not been successful. Using techniques borrowed from information theory and natural language statistics, similar words are paired and attempts… ▽ More
Submitted 30 March, 2009; v1 submitted 26 August, 2008; originally announced August 2008.
Comments: 10 pages; 2 figures; to appear in British Museum studies in Ancient Egypt and Sudan
Journal ref: British Museum Studies in Ancient Egypt and Sudan, 12, 1-10 (2009)
-
Investigation of the Zipf-plot of the extinct Meroitic language
Abstract: The ancient and extinct language Meroitic is investigated using Zipf's Law. In particular, since Meroitic is still undeciphered, the Zipf law analysis allows us to assess the quality of current texts and possible avenues for future investigation using statistical techniques.
Submitted 21 August, 2008; originally announced August 2008.
Comments: 10 pages, 2 figures
Journal ref: Glottometrics 15, 2007, 53-61
-
Phase Diagrams of Network Traffic
Abstract: This paper has been withdrawn due to errors in the analysis of data with Carrier Access Rate control and statistical methodologies.
Submitted 17 March, 2009; v1 submitted 27 July, 2008; originally announced July 2008.
Comments: This paper has been withdrawn
-
arXiv:0807.3374 [pdf, ps, other]
The Dynamics of Internet Traffic: Self-Similarity, Self-Organization, and Complex Phenomena
Abstract: The Internet is the most complex system ever created in human history. Therefore, its dynamics and traffic unsurprisingly take on a rich variety of complex dynamics, self-organization, and other phenomena that have been researched for years. This paper is a review of the complex dynamics of Internet traffic. Departing from normal treatises, we will take a view from both the network engineering and… ▽ More
Submitted 5 September, 2010; v1 submitted 21 July, 2008; originally announced July 2008.
Comments: 63 pages, 7 figures, 7 tables, submitted to Advances in Complex Systems
Journal ref: Advances in Complex Systems, 14, 6 p. 905-949 (2011)
-
arXiv:0803.0367 [pdf, ps, other]
Plant-Mycorrhiza Percent Infection as Evidence of Coupled Metabolism
Abstract: A common feature of mycorrhizal observation is the growth of the infection on the plant root as a percent of the infected root or root tip length. Often, this is measured as a logistic curve with an eventual, though usually transient, plateau. It is shown in this paper that the periods of stable percent infection in the mycorrhizal growth cycle correspond to periods where both the plant and myco… ▽ More
Submitted 28 February, 2009; v1 submitted 3 March, 2008; originally announced March 2008.
Comments: 11 pages; accepted by the Journal of Theoretical Biology (in press)
Journal ref: Journal of Theoretical Biology 259 (2009), pp. 172-175
-
arXiv:0802.3554 [pdf, ps, other]
Data Traffic Dynamics and Saturation on a Single Link
Abstract: The dynamics of User Datagram Protocol (UDP) traffic over Ethernet between two computers are analyzed using nonlinear dynamics which shows that there are two clear regimes in the data flow: free flow and saturated. The two most important variables affecting this are the packet size and packet flow rate. However, this transition is due to a transcritical bifurcation rather than phase transition i… ▽ More
Submitted 20 February, 2009; v1 submitted 24 February, 2008; originally announced February 2008.
Comments: 10 pages, 5 figures
Journal ref: International Journal of Computer, Information, and Systems Science, and Engineering, vol 3, no. 1, 11-16 2009
-
arXiv:0710.2947 [pdf, ps, other]
Average Path Length in Complex Networks: Patterns and Predictions
Abstract: A simple and accurate relationship is demonstrated that links the average shortest path, nodes, and edges in a complex network. This relationship takes advantage of the concept of link density and shows a large improvement in fitting networks of all scales over the typical random graph model. The relationships herein can allow researchers to better predict the shortest path of networks of almost… ▽ More
Submitted 13 January, 2008; v1 submitted 15 October, 2007; originally announced October 2007.
Comments: 5 pages, 3 figures; submitted to Journal of Statistical Mechanics
-
The Network of Collaboration Among Rappers and its Community Structure
Abstract: The social network formed by the collaboration between rappers is studied using standard statistical techniques for analyzing complex networks. In addition, the community structure of the rap music community is analyzed using a new method that uses weighted edges to determine which connections are most important and revealing among all the communities. The results of this method as well as possi… ▽ More
Submitted 19 January, 2006; v1 submitted 25 November, 2005; originally announced November 2005.
Comments: 8 pages, 1 figure, 4 tables Accepted for publication in The Journal of Statistical Mechanics: Theory and Experiment. Minor changes in TeX style, added comments, table formatting corrected
Journal ref: J. Stat. Mech. (2006) P02006
-
Instant Messaging as a Scale-Free Network
Abstract: The topology of an instant messaging system is described. Statistical measures of the network are given and compared with the statistics of a comparable random graph. The scale-free character of the network is examined and implications are given for the structure of social networks and instant messenger security.
Submitted 19 August, 2002; v1 submitted 19 June, 2002; originally announced June 2002.
Comments: 5 pages, 4 figures