-
A family of permutationally invariant quantum codes
Authors:
Arda Aydin,
Max A. Alekseyev,
Alexander Barg
Abstract:
We construct a new family of permutationally invariant codes that correct $t$ Pauli errors for any $t\ge 1$. We also show that codes in the new family correct quantum deletion errors as well as spontaneous decay errors. Our construction contains some of the previously known permutationally invariant quantum codes as particular cases, which also admit transversal gates. In many cases, the codes in…
▽ More
We construct a new family of permutationally invariant codes that correct $t$ Pauli errors for any $t\ge 1$. We also show that codes in the new family correct quantum deletion errors as well as spontaneous decay errors. Our construction contains some of the previously known permutationally invariant quantum codes as particular cases, which also admit transversal gates. In many cases, the codes in the new family are shorter than the best previously known explicit permutationally invariant codes for Pauli errors and deletions. Furthermore, our new code family includes a new $((4,2,2))$ optimal single-deletion-correcting code. As a separate result, we generalize the conditions for permutationally invariant codes to correct $t$ Pauli errors from the previously known results for $t=1$ to any number of errors. For small $t$, these conditions can be used to construct new examples of codes by computer.
△ Less
Submitted 15 April, 2024; v1 submitted 8 October, 2023;
originally announced October 2023.
-
On computing sets of integers with maximum number of pairs summing to powers of 2
Authors:
Max A. Alekseyev
Abstract:
We address the problem of finding sets of integers of a given size with maximum number of pairs summing to powers of $2$. By fixing particular pairs this problem reduces to finding a labeling of the vertices of a given graph with pairwise distinct integers such that the endpoint labels for each edge sum to a power of $2$. We propose an efficient algorithm for this problem, which we use to determin…
▽ More
We address the problem of finding sets of integers of a given size with maximum number of pairs summing to powers of $2$. By fixing particular pairs this problem reduces to finding a labeling of the vertices of a given graph with pairwise distinct integers such that the endpoint labels for each edge sum to a power of $2$. We propose an efficient algorithm for this problem, which we use to determine the maximum size of graphs of order $n$ that admit such a labeling for all $n\leq 18$. We also identify the minimal forbidden subgraphs of order $\leq 11$, whose presence prevents graphs from having such a labeling.
△ Less
Submitted 5 March, 2023;
originally announced March 2023.
-
Orienting Ordered Scaffolds: Complexity and Algorithms
Authors:
Sergey Aganezov,
Pavel Avdeyev,
Nikita Alexeev,
Yongwu Rong,
Max A. Alekseyev
Abstract:
Despite the recent progress in genome sequencing and assembly, many of the currently available assembled genomes come in a draft form. Such draft genomes consist of a large number of genomic fragments (scaffolds), whose order and/or orientation (i.e., strand) in the genome are unknown. There exist various scaffold assembly methods, which attempt to determine the order and orientation of scaffolds…
▽ More
Despite the recent progress in genome sequencing and assembly, many of the currently available assembled genomes come in a draft form. Such draft genomes consist of a large number of genomic fragments (scaffolds), whose order and/or orientation (i.e., strand) in the genome are unknown. There exist various scaffold assembly methods, which attempt to determine the order and orientation of scaffolds along the genome chromosomes. Some of these methods (e.g., based on FISH physical map**, chromatin conformation capture, etc.) can infer the order of scaffolds, but not necessarily their orientation. This leads to a special case of the scaffold orientation problem (i.e., deducing the orientation of each scaffold) with a known order of the scaffolds.
We address the problem of orientating ordered scaffolds as an optimization problem based on given weighted orientations of scaffolds and their pairs (e.g., coming from pair-end sequencing reads, long reads, or homologous relations). We formalize this problem using notion of a scaffold graph (i.e., a graph, where vertices correspond to the assembled contigs or scaffolds and edges represent connections between them). We prove that this problem is NP-hard, and present a polynomial-time algorithm for solving its special case, where orientation of each scaffold is imposed relatively to at most two other scaffolds. We further develop an FPT algorithm for the general case of the OOS problem.
△ Less
Submitted 25 November, 2019;
originally announced November 2019.
-
On partitions into squares of distinct integers whose reciprocals sum to 1
Authors:
Max A. Alekseyev
Abstract:
In 1963, Graham proved that all integers greater than 77 (but not 77 itself) can be partitioned into distinct positive integers whose reciprocals sum to 1. He further conjectured that for any sufficiently large integer, it can be partitioned into squares of distinct positive integers whose reciprocals sum to 1. In this study, we establish the exact bound for existence of such partitions by proving…
▽ More
In 1963, Graham proved that all integers greater than 77 (but not 77 itself) can be partitioned into distinct positive integers whose reciprocals sum to 1. He further conjectured that for any sufficiently large integer, it can be partitioned into squares of distinct positive integers whose reciprocals sum to 1. In this study, we establish the exact bound for existence of such partitions by proving that 8542 is the largest integer with no such partition.
△ Less
Submitted 23 April, 2018; v1 submitted 17 January, 2018;
originally announced January 2018.
-
Combinatorial Scoring of Phylogenetic Networks
Authors:
Nikita Alexeev,
Max A. Alekseyev
Abstract:
Construction of phylogenetic trees and networks for extant species from their characters represents one of the key problems in phylogenomics. While solution to this problem is not always uniquely defined and there exist multiple methods for tree/network construction, it becomes important to measure how well the constructed networks capture the given character relationship across the species.
In…
▽ More
Construction of phylogenetic trees and networks for extant species from their characters represents one of the key problems in phylogenomics. While solution to this problem is not always uniquely defined and there exist multiple methods for tree/network construction, it becomes important to measure how well the constructed networks capture the given character relationship across the species.
In the current study, we propose a novel method for measuring the specificity of a given phylogenetic network in terms of the total number of distributions of character states at the leaves that the network may impose. While for binary phylogenetic trees, this number has an exact formula and depends only on the number of leaves and character states but not on the tree topology, the situation is much more complicated for non-binary trees or networks. Nevertheless, we develop an algorithm for combinatorial enumeration of such distributions, which is applicable for arbitrary trees and networks under some reasonable assumptions.
△ Less
Submitted 8 August, 2016; v1 submitted 8 February, 2016;
originally announced February 2016.
-
Making Walks Count: From Silent Circles to Hamiltonian Cycles
Authors:
Max A. Alekseyev,
Gerard P. Michon
Abstract:
We illustrate the application of the matrix-transfer method for a number of enumeration problems concerning the party game Silent Circles, Hamiltonian cycles in the antiprism graphs, and simple paths and cycles of a fixed length in arbitrary graphs.
We illustrate the application of the matrix-transfer method for a number of enumeration problems concerning the party game Silent Circles, Hamiltonian cycles in the antiprism graphs, and simple paths and cycles of a fixed length in arbitrary graphs.
△ Less
Submitted 25 August, 2017; v1 submitted 3 February, 2016;
originally announced February 2016.
-
Weighted de Bruijn Graphs for the Menage Problem and Its Generalizations
Authors:
Max A. Alekseyev
Abstract:
We address the problem of enumeration of seating arrangements of married couples around a circular table such that no spouses sit next to each other and no k consecutive persons are of the same gender. While the case of k=2 corresponds to the classical problème des ménages with a well-studied solution, no closed-form expression for the number of seating arrangements is known when k>=3.
We propos…
▽ More
We address the problem of enumeration of seating arrangements of married couples around a circular table such that no spouses sit next to each other and no k consecutive persons are of the same gender. While the case of k=2 corresponds to the classical problème des ménages with a well-studied solution, no closed-form expression for the number of seating arrangements is known when k>=3.
We propose a novel approach for this type of problems based on enumeration of walks in certain algebraically weighted de Bruijn graphs. Our approach leads to new expressions for the menage numbers and their exponential generating function and allows one to efficiently compute the number of seating arrangements in general cases, which we illustrate in detail for the ternary case of k=3.
△ Less
Submitted 9 August, 2016; v1 submitted 27 October, 2015;
originally announced October 2015.
-
Computing the inverses, their power sums, and extrema for Euler's totient and other multiplicative functions
Authors:
Max A. Alekseyev
Abstract:
We propose a generic algorithm for computing the inverses of a multiplicative function under the assumption that the set of inverses is finite. More generally, our algorithm can compute certain functions of the inverses, such as their power sums (e.g., cardinality) or extrema, without direct enumeration of the inverses. We illustrate our algorithm with Euler's totient function $\varphi(\cdot)$ and…
▽ More
We propose a generic algorithm for computing the inverses of a multiplicative function under the assumption that the set of inverses is finite. More generally, our algorithm can compute certain functions of the inverses, such as their power sums (e.g., cardinality) or extrema, without direct enumeration of the inverses. We illustrate our algorithm with Euler's totient function $\varphi(\cdot)$ and the $k$-th power sum of divisors $σ_k(\cdot)$. For example, we can establish that the number of solutions to $σ_1(x) = 10^{1000}$ is 15,512,215,160,488,452,125,793,724,066,873,737,608,071,476, while it is intractable to iterate over the actual solutions.
△ Less
Submitted 17 May, 2016; v1 submitted 23 January, 2014;
originally announced January 2014.
-
On the minimal teaching sets of two-dimensional threshold functions
Authors:
Max A. Alekseyev,
Marina G. Basova,
Nikolai Yu. Zolotykh
Abstract:
It is known that a minimal teaching set of any threshold function on the twodimensional rectangular grid consists of 3 or 4 points. We derive exact formulae for the numbers of functions corresponding to these values and further refine them in the case of a minimal teaching set of size 3. We also prove that the average cardinality of the minimal teaching sets of threshold functions is asymptoticall…
▽ More
It is known that a minimal teaching set of any threshold function on the twodimensional rectangular grid consists of 3 or 4 points. We derive exact formulae for the numbers of functions corresponding to these values and further refine them in the case of a minimal teaching set of size 3. We also prove that the average cardinality of the minimal teaching sets of threshold functions is asymptotically 7/2.
We further present corollaries of these results concerning some special arrangements of lines in the plane.
△ Less
Submitted 19 July, 2014; v1 submitted 3 July, 2013;
originally announced July 2013.
-
On integral points on biquadratic curves and near-multiples of squares in Lucas sequences
Authors:
Max A. Alekseyev,
Szabolcs Tengely
Abstract:
We describe an algorithmic reduction of the search for integral points on a curve y^2 = ax^4 + bx^2 + c with nonzero ac(b^2-4ac) to solving a finite number of Thue equations. While existence of such reduction is anticipated from arguments of algebraic number theory, our algorithm is elementary and to best of our knowledge is the first published algorithm of this kind. In combination with other met…
▽ More
We describe an algorithmic reduction of the search for integral points on a curve y^2 = ax^4 + bx^2 + c with nonzero ac(b^2-4ac) to solving a finite number of Thue equations. While existence of such reduction is anticipated from arguments of algebraic number theory, our algorithm is elementary and to best of our knowledge is the first published algorithm of this kind. In combination with other methods and powered by existing software Thue equations solvers, it allows one to efficiently compute integral points on biquadratic curves.
We illustrate this approach with a particular application of finding near-multiples of squares in Lucas sequences. As an example, we establish that among Fibonacci numbers only 2 and 34 are of the form 2m^2+2; only 1, 13, and 1597 are of the form m^2-3; and so on.
As an auxiliary result, we also give an algorithm for solving a Diophantine equation k^2 = f(m,n)/g(m,n) in integers m,n,k, where f and g are homogeneous quadratic polynomials.
△ Less
Submitted 20 May, 2014; v1 submitted 4 June, 2013;
originally announced June 2013.
-
Solving the Tower of Hanoi with Random Moves
Authors:
Max A. Alekseyev,
Toby Berger
Abstract:
We prove the exact formulae for the expected number of moves to solve several variants of the Tower of Hanoi puzzle with 3 pegs and n disks, when each move is chosen uniformly randomly from the set of all valid moves. We further present an alternative proof for one of the formulae that couples a theorem about expected commute times of random walks on graphs with the delta-to-wye transformation use…
▽ More
We prove the exact formulae for the expected number of moves to solve several variants of the Tower of Hanoi puzzle with 3 pegs and n disks, when each move is chosen uniformly randomly from the set of all valid moves. We further present an alternative proof for one of the formulae that couples a theorem about expected commute times of random walks on graphs with the delta-to-wye transformation used in the analysis of three-phase AC systems for electrical power distribution.
△ Less
Submitted 18 September, 2014; v1 submitted 13 April, 2013;
originally announced April 2013.
-
BayesHammer: Bayesian clustering for error correction in single-cell sequencing
Authors:
Sergey I. Nikolenko,
Anton I. Korobeynikov,
Max A. Alekseyev
Abstract:
Error correction of sequenced reads remains a difficult task, especially in single-cell sequencing projects with extremely non-uniform coverage. While existing error correction tools designed for standard (multi-cell) sequencing data usually come up short in single-cell sequencing projects, algorithms actually used for single-cell error correction have been so far very simplistic.
We introduce s…
▽ More
Error correction of sequenced reads remains a difficult task, especially in single-cell sequencing projects with extremely non-uniform coverage. While existing error correction tools designed for standard (multi-cell) sequencing data usually come up short in single-cell sequencing projects, algorithms actually used for single-cell error correction have been so far very simplistic.
We introduce several novel algorithms based on Hamming graphs and Bayesian subclustering in our new error correction tool BayesHammer. While BayesHammer was designed for single-cell sequencing, we demonstrate that it also improves on existing error correction tools for multi-cell sequencing data while working much faster on real-life datasets. We benchmark BayesHammer on both $k$-mer counts and actual assembly results with the SPAdes genome assembler.
△ Less
Submitted 12 November, 2012;
originally announced November 2012.
-
On pairwise distances and median score of three genomes under DCJ
Authors:
Sergey Aganezov, Jr.,
Max A. Alekseyev
Abstract:
In comparative genomics, the rearrangement distance between two genomes (equal the minimal number of genome rearrangements required to transform them into a single genome) is often used for measuring their evolutionary remoteness. Generalization of this measure to three genomes is known as the median score (while a resulting genome is called median genome). In contrast to the rearrangement distanc…
▽ More
In comparative genomics, the rearrangement distance between two genomes (equal the minimal number of genome rearrangements required to transform them into a single genome) is often used for measuring their evolutionary remoteness. Generalization of this measure to three genomes is known as the median score (while a resulting genome is called median genome). In contrast to the rearrangement distance between two genomes which can be computed in linear time, computing the median score for three genomes is NP-hard. This inspires a quest for simpler and faster approximations for the median score, the most natural of which appears to be the halved sum of pairwise distances which in fact represents a lower bound for the median score.
In this work, we study relationship and interplay of pairwise distances between three genomes and their median score under the model of Double-Cut-and-Join (DCJ) rearrangements. Most remarkably we show that while a rearrangement may change the sum of pairwise distances by at most 2 (and thus change the lower bound by at most 1), even the most "powerful" rearrangements in this respect that increase the lower bound by 1 (by moving one genome farther away from each of the other two genomes), which we call strong, do not necessarily affect the median score. This observation implies that the two measures are not as well-correlated as one's intuition may suggest.
We further prove that the median score attains the lower bound exactly on the triples of genomes that can be obtained from a single genome with strong rearrangements. While the sum of pairwise distances with the factor 2/3 represents an upper bound for the median score, its tightness remains unclear. Nonetheless, we show that the difference of the median score and its lower bound is not bounded by a constant.
△ Less
Submitted 22 October, 2012; v1 submitted 1 August, 2012;
originally announced August 2012.
-
Weighted genomic distance can hardly impose a bound on the proportion of transpositions
Authors:
Shuai Jiang,
Max A. Alekseyev
Abstract:
Genomic distance between two genomes, i.e., the smallest number of genome rearrangements required to transform one genome into the other, is often used as a measure of evolutionary closeness of the genomes in comparative genomics studies. However, in models that include rearrangements of significantly different "power" such as reversals (that are "weak" and most frequent rearrangements) and transp…
▽ More
Genomic distance between two genomes, i.e., the smallest number of genome rearrangements required to transform one genome into the other, is often used as a measure of evolutionary closeness of the genomes in comparative genomics studies. However, in models that include rearrangements of significantly different "power" such as reversals (that are "weak" and most frequent rearrangements) and transpositions (that are more "powerful" but rare), the genomic distance typically corresponds to a transformation with a large proportion of transpositions, which is not biologically adequate.
Weighted genomic distance is a traditional approach to bounding the proportion of transpositions by assigning them a relative weight α > 1. A number of previous studies addressed the problem of computing weighted genomic distance with α \leq 2.
Employing the model of multi-break rearrangements on circular genomes, that captures both reversals (modelled as 2-breaks) and transpositions (modelled as 3-breaks), we prove that for α \in (1,2], a minimum-weight transformation may entirely consist of transpositions, implying that the corresponding weighted genomic distance does not actually achieve its purpose of bounding the proportion of transpositions. We further prove that for α \in (1,2), the minimum-weight transformations do not depend on a particular choice of α from this interval. We give a complete characterization of such transformations and show that they coincide with the transformations that at the same time have the shortest length and make the smallest number of breakages in the genomes.
Our results also provide a theoretical foundation for the empirical observation that for α < 2, transpositions are favored over reversals in the minimum-weight transformations.
△ Less
Submitted 10 December, 2010;
originally announced December 2010.
-
On the intersections of Fibonacci, Pell, and Lucas numbers
Authors:
Max A. Alekseyev
Abstract:
We describe how to compute the intersection of two Lucas sequences of the forms $\{U_n(P,\pm 1) \}_{n=0}^{\infty}$ or $\{V_n(P,\pm 1) \}_{n=0}^{\infty}$ with $P\in\mathbb{Z}$ that includes sequences of Fibonacci, Pell, Lucas, and Lucas-Pell numbers. We prove that such an intersection is finite except for the case $U_n(1,-1)$ and $U_n(3,1)$ and the case of two $V$-sequences when the product of thei…
▽ More
We describe how to compute the intersection of two Lucas sequences of the forms $\{U_n(P,\pm 1) \}_{n=0}^{\infty}$ or $\{V_n(P,\pm 1) \}_{n=0}^{\infty}$ with $P\in\mathbb{Z}$ that includes sequences of Fibonacci, Pell, Lucas, and Lucas-Pell numbers. We prove that such an intersection is finite except for the case $U_n(1,-1)$ and $U_n(3,1)$ and the case of two $V$-sequences when the product of their discriminants is a perfect square. Moreover, the intersection in these cases also forms a Lucas sequence. Our approach relies on solving homogeneous quadratic Diophantine equations and Thue equations. In particular, we prove that 0, 1, 2, and 5 are the only numbers that are both Fibonacci and Pell, and list similar results for many other pairs of Lucas sequences. We further extend our results to Lucas sequences with arbitrary initial terms.
△ Less
Submitted 5 November, 2010; v1 submitted 8 February, 2010;
originally announced February 2010.