-
De Bruijn Polyominoes
Authors:
D. Condon,
Yuxin Wang,
E. Yang
Abstract:
We introduce the notions of de Bruijn polyominoes and prismatic polyominoes, which generalize the notions of de Bruijn sequences and arrays. Given a small fixed polyomino $p$ and a set of colors $[n]$, a de Bruijn polyomino for $(p,n)$ is a colored fixed polyomino $P$ with cells colored from $[n]$ such that every possible coloring of $p$ from $[n]$ exists as a subset of $P$. We call de Bruijn polyominoes for $(p,n)$ of minimum size $(p,n)$-prismatic. We discuss for some values of $p$ and $n$ the shape of a $(p,n)$-prismatic polyomino $P$, the construction of a coloring of $P$, and the enumeration of the colorings of $P$. We find evidence that the difficulty of these problems may depend on the parity of the size of $p$.
Submitted 28 May, 2024;
originally announced May 2024.
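The de Bruijn polyomino definition in the abstract above can be illustrated with a brute-force check. The following is a minimal sketch, not the authors' method: it assumes polyominoes are given as sets of (row, column) cells, that "fixed" means only translations of $p$ are considered, and that the colors $[n]$ are represented as 0, ..., n-1. The names `patterns_seen`, `is_de_bruijn`, `domino`, and `strip` are hypothetical and chosen only for illustration.

```python
from itertools import product


def patterns_seen(p_cells, big_coloring):
    """Return every coloring of p that occurs in the colored polyomino
    under some translation (assumption: no rotations or reflections)."""
    p = sorted(p_cells)
    r0, c0 = p[0]
    # Offsets of p's cells relative to its first cell in sorted order.
    offsets = [(r - r0, c - c0) for r, c in p]
    seen = set()
    for (r, c) in big_coloring:
        cells = [(r + dr, c + dc) for dr, dc in offsets]
        # The translated copy of p must lie entirely inside the big polyomino.
        if all(cell in big_coloring for cell in cells):
            seen.add(tuple(big_coloring[cell] for cell in cells))
    return seen


def is_de_bruijn(p_cells, big_coloring, n):
    """True if all n**|p| colorings of p appear in the colored polyomino."""
    seen = patterns_seen(p_cells, big_coloring)
    return all(pat in seen for pat in product(range(n), repeat=len(p_cells)))


# Illustrative data: p is a horizontal domino, n = 2 colors.
domino = {(0, 0), (0, 1)}
# A 1x5 strip colored 0,0,1,1,0 (a linear de Bruijn sequence read as a strip).
strip = {(0, i): color for i, color in enumerate([0, 0, 1, 1, 0])}
# Dropping the last cell loses the pattern (1, 0).
short_strip = {(0, i): color for i, color in enumerate([0, 0, 1, 1])}

print(is_de_bruijn(domino, strip, 2))        # True
print(is_de_bruijn(domino, short_strip, 2))  # False
```

For a one-row $p$ this reduces to the familiar sliding-window check for de Bruijn sequences; the same function accepts any cell set for $p$ and $P$, though the exhaustive loop over all $n^{|p|}$ colorings is only practical for small cases.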
-
Deep Lexical Hypothesis: Identifying personality structure in natural language
Authors:
Andrew Cutler,
David M. Condon
Abstract:
Recent advances in natural language processing (NLP) have produced general models that can perform complex tasks such as summarizing long passages and translating across languages. Here, we introduce a method to extract adjective similarities from language models as done with survey-based ratings in traditional psycholexical studies but using millions of times more text in a natural setting. The correlational structure produced through this method is highly similar to that of self- and other-ratings of 435 terms reported by Saucier and Goldberg (1996a). The first three unrotated factors produced using NLP are congruent with those in survey data, with coefficients of 0.89, 0.79, and 0.79. This structure is robust to many modeling decisions: adjective set, including those with 1,710 terms (Goldberg, 1982) and 18,000 terms (Allport & Odbert, 1936); the query used to extract correlations; and language model. Notably, Neuroticism and Openness are only weakly and inconsistently recovered. This is a new source of signal that is closer to the original (semantic) vision of the Lexical Hypothesis. The method can be applied where surveys cannot: in dozens of languages simultaneously, with tens of thousands of items, on historical text, and at extremely large scale for little cost. The code is made public to facilitate reproduction and fast iteration in new directions of research.
Submitted 3 March, 2022;
originally announced March 2022.
-
Simple Relationships Between Lozenge Tiling Functions of Related Regions
Authors:
Daniel Condon
Abstract:
We give a formula for the number of symmetric tilings of hexagons on the triangular lattice with unit triangles removed from arbitrary positions along two non-adjacent non-opposite sides. We show that for certain families of such regions, the ratios of their numbers of symmetric tilings are given by simple product formulas. We also prove that for certain weighted regions which arise when applying Ciucu's Factorization Theorem, the formulas for the weighted and unweighted counts of tilings have a simple explicit relationship.
Submitted 18 December, 2021;
originally announced December 2021.
-
Lozenge Tiling Function Ratios for Hexagons with Dents on Two Sides
Authors:
Daniel Condon
Abstract:
We give a formula for the number of lozenge tilings of a hexagon on the triangular lattice with unit triangles removed from arbitrary positions along two non-adjacent, non-opposite sides. Our formula implies that for certain families of such regions, the ratios of their numbers of tilings are given by simple product formulas.
Submitted 12 February, 2020; v1 submitted 5 February, 2020;
originally announced February 2020.
-
Predicting the kinetics of RNA oligonucleotides using Markov state models
Authors:
Giovanni Pinamonti,
Jianbo Zhao,
David E. Condon,
Fabian Paul,
Frank Noé,
Douglas H. Turner,
Giovanni Bussi
Abstract:
Nowadays, different experimental techniques, such as single-molecule or relaxation experiments, can provide dynamic properties of biomolecular systems, but the amount of detail obtainable with these methods is often limited in terms of time or spatial resolution. Here we use state-of-the-art computational techniques, namely atomistic molecular dynamics and Markov state models, to provide insight into the rapid dynamics of short RNA oligonucleotides, in order to elucidate the kinetics of stacking interactions. Analysis of multiple microsecond-long simulations indicates that the main relaxation modes of such molecules can consist of transitions between alternative folded states, rather than between random coils and native structures. After properly removing structures that are artificially stabilized by known inaccuracies of the current RNA AMBER force field, the kinetic properties predicted are consistent with the timescales of previously reported relaxation experiments.
Submitted 22 December, 2016;
originally announced December 2016.
-
Automorphisms of $S_6$ and the Colored Cubes Puzzle
Authors:
Ethan Berkove,
David Cervantes Nava,
Daniel Condon,
Rachel Katz
Abstract:
Given a palette of six colors, a colored cube is a cube where each face is colored with exactly one color and each color appears on some face. Starting with an arbitrary collection of unit-length colored cubes, one can try to arrange a subset of the collection into an $n \times n \times n$ cube where each face is a single color. This is the Colored Cubes Puzzle. In this paper, we determine minimum-size sets of cubes required to complete an $n \times n \times n$ cube's frame (its corners and edges). We answer this problem for all $n$, and in particular show that for $n \geq 4$ one has the best possible result: as long as there are enough cubes to build a frame, it can always be done, regardless of the cubes in the collection. Part of our analysis involves the set of $6$-colored cubes and its associated $S_6$ action. In addition to the problem simplification this action provides, it also gives another way to visualize the outer automorphism of $S_6$.
Submitted 24 March, 2015;
originally announced March 2015.
-
On generalizations of separating and splitting families
Authors:
Daniel Condon,
Samuel Coskey,
Luke Serafin,
Cody Stockdale
Abstract:
The work in this article is concerned with two different types of families of finite sets: separating families and splitting families (they are also called "systems"). These families have applications in combinatorial search, coding theory, cryptography, and related fields. We define and study generalizations of these two notions, which we have named $n$-separating families and $n$-splitting families. For each of these new notions, we outline their basic properties and connections with the well-studied notions. We then spend the greatest effort obtaining lower and upper bounds on the minimal size of the families. For $n$-separating families we obtain bounds which are asymptotically tight within a linear factor. For $n$-splitting families this appears to be much harder; we provide partial results and open questions.
Submitted 7 May, 2015; v1 submitted 15 December, 2014;
originally announced December 2014.
-
Reuse, Temporal Dynamics, Interest Sharing, and Collaboration in Social Tagging Systems
Authors:
Elizeu Santos-Neto,
David Condon,
Nazareno Andrade,
Adriana Iamnitchi,
Matei Ripeanu
Abstract:
User-generated content is shaping the dynamics of the World Wide Web. Indeed, an increasingly large number of systems provide mechanisms to support the growing demand for content creation, sharing, and management. Tagging systems are a particular class of these systems where users share and collaboratively annotate content such as photos and URLs. This collaborative behavior and the pool of user-generated metadata create opportunities to improve existing systems and to design new mechanisms. However, to realize this potential, it is necessary to understand the usage characteristics of current systems. This work addresses this issue by characterizing three tagging systems (CiteULike, Connotea and del.icio.us) while focusing on three aspects: i) the patterns of information (tags and items) production; ii) the temporal dynamics of users' tag vocabularies; and iii) the social aspects of tagging systems.
Submitted 25 January, 2013;
originally announced January 2013.
-
Asymptotic expansion of the difference of two Mahler measures
Authors:
John D. Condon
Abstract:
We show that for almost every polynomial $P(x,y)$ with complex coefficients, the difference of the logarithmic Mahler measures of $P(x,y)$ and $P(x,x^n)$ can be expanded in a type of formal series similar to an asymptotic power series expansion in powers of $1/n$. This generalizes a result of Boyd. We also show that such an expansion is unique and provide a formula for its coefficients. When $P$ has algebraic coefficients, the coefficients in the expansion are linear combinations of polylogarithms of algebraic numbers, with algebraic coefficients.
Submitted 10 November, 2011; v1 submitted 1 November, 2011;
originally announced November 2011.