-
Phase transition in the computational complexity of the shortest common superstring and genome assembly
Authors:
L. A. Fernandez,
V. Martin-Mayor,
D. Yllanes
Abstract:
Genome assembly, the process of reconstructing a long genetic sequence by aligning and merging short fragments, or reads, is known to be NP-hard, either as a version of the shortest common superstring problem or in a Hamiltonian-cycle formulation. That is, the computing time is believed to grow exponentially with the the problem size in the worst case. Despite this fact, high-throughput technologi…
▽ More
Genome assembly, the process of reconstructing a long genetic sequence by aligning and merging short fragments, or reads, is known to be NP-hard, either as a version of the shortest common superstring problem or in a Hamiltonian-cycle formulation. That is, the computing time is believed to grow exponentially with the the problem size in the worst case. Despite this fact, high-throughput technologies and modern algorithms currently allow bioinformaticians to handle datasets of billions of reads. Using methods from statistical mechanics, we address this conundrum by demonstrating the existence of a phase transition in the computational complexity of the problem and showing that practical instances always fall in the 'easy' phase (solvable by polynomial-time algorithms). In addition, we propose a Markov-chain Monte Carlo method that outperforms common deterministic algorithms in the hard regime.
△ Less
Submitted 11 March, 2024; v1 submitted 18 October, 2022;
originally announced October 2022.
-
Janus II: a new generation application-driven computer for spin-system simulations
Authors:
Janus Collaboration,
M. Baity-Jesi,
R. A. Baños,
A. Cruz,
L. A. Fernandez,
J. M. Gil-Narvion,
A. Gordillo-Guerrero,
D. Iñiguez,
A. Maiorano,
F. Mantovani,
E. Marinari,
V. Martin-Mayor,
J. Monforte-Garcia,
A. Muñoz Sudupe,
D. Navarro,
G. Parisi,
S. Perez-Gaviro,
M. Pivanti,
F. Ricci-Tersenghi,
J. J. Ruiz-Lorenzo,
S. F. Schifano,
B. Seoane,
A. Tarancon,
R. Tripiccione,
D. Yllanes
Abstract:
This paper describes the architecture, the development and the implementation of Janus II, a new generation application-driven number cruncher optimized for Monte Carlo simulations of spin systems (mainly spin glasses). This domain of computational physics is a recognized grand challenge of high-performance computing: the resources necessary to study in detail theoretical models that can make cont…
▽ More
This paper describes the architecture, the development and the implementation of Janus II, a new generation application-driven number cruncher optimized for Monte Carlo simulations of spin systems (mainly spin glasses). This domain of computational physics is a recognized grand challenge of high-performance computing: the resources necessary to study in detail theoretical models that can make contact with experimental data are by far beyond those available using commodity computer systems. On the other hand, several specific features of the associated algorithms suggest that unconventional computer architectures, which can be implemented with available electronics technologies, may lead to order of magnitude increases in performance, reducing to acceptable values on human scales the time needed to carry out simulation campaigns that would take centuries on commercially available machines. Janus II is one such machine, recently developed and commissioned, that builds upon and improves on the successful JANUS machine, which has been used for physics since 2008 and is still in operation today. This paper describes in detail the motivations behind the project, the computational requirements, the architecture and the implementation of this new machine and compares its expected performances with those of currently available commercial systems.
△ Less
Submitted 3 October, 2013;
originally announced October 2013.
-
Reconfigurable computing for Monte Carlo simulations: results and prospects of the Janus project
Authors:
Janus Collaboration,
M. Baity-Jesi,
R. A. Banos,
A. Cruz,
L. A. Fernandez,
J. M. Gil-Narvion,
A. Gordillo-Guerrero,
M. Guidetti,
D. Iniguez,
A. Maiorano,
F. Mantovani,
E. Marinari,
V. Martin-Mayor,
J. Monforte-Garcia,
A. Munoz Sudupe,
D. Navarro,
G. Parisi,
M. Pivanti,
S. Perez-Gaviro,
F. Ricci-Tersenghi,
J. J. Ruiz-Lorenzo,
S. F. Schifano,
B. Seoane,
A. Tarancon,
P. Tellez
, et al. (2 additional authors not shown)
Abstract:
We describe Janus, a massively parallel FPGA-based computer optimized for the simulation of spin glasses, theoretical models for the behavior of glassy materials. FPGAs (as compared to GPUs or many-core processors) provide a complementary approach to massively parallel computing. In particular, our model problem is formulated in terms of binary variables, and floating-point operations can be (almo…
▽ More
We describe Janus, a massively parallel FPGA-based computer optimized for the simulation of spin glasses, theoretical models for the behavior of glassy materials. FPGAs (as compared to GPUs or many-core processors) provide a complementary approach to massively parallel computing. In particular, our model problem is formulated in terms of binary variables, and floating-point operations can be (almost) completely avoided. The FPGA architecture allows us to run many independent threads with almost no latencies in memory access, thus updating up to 1024 spins per cycle. We describe Janus in detail and we summarize the physics results obtained in four years of operation of this machine; we discuss two types of physics applications: long simulations on very large systems (which try to mimic and provide understanding about the experimental non-equilibrium dynamics), and low-temperature equilibrium simulations using an artificial parallel tempering dynamics. The time scale of our non-equilibrium simulations spans eleven orders of magnitude (from picoseconds to a tenth of a second). On the other hand, our equilibrium simulations are unprecedented both because of the low temperatures reached and for the large systems that we have brought to equilibrium. A finite-time scaling ansatz emerges from the detailed comparison of the two sets of simulations. Janus has made it possible to perform spin-glass simulations that would take several decades on more conventional architectures. The paper ends with an assessment of the potential of possible future versions of the Janus architecture, based on state-of-the-art technology.
△ Less
Submitted 18 April, 2012;
originally announced April 2012.
-
The Invar tensor package: Differential invariants of Riemann
Authors:
Jose M. Martin-Garcia,
David Yllanes,
Renato Portugal
Abstract:
The long standing problem of the relations among the scalar invariants of the Riemann tensor is computationally solved for all 6x10^23 objects with up to 12 derivatives of the metric. This covers cases ranging from products of up to 6 undifferentiated Riemann tensors to cases with up to 10 covariant derivatives of a single Riemann. We extend our computer algebra system Invar to produce within se…
▽ More
The long standing problem of the relations among the scalar invariants of the Riemann tensor is computationally solved for all 6x10^23 objects with up to 12 derivatives of the metric. This covers cases ranging from products of up to 6 undifferentiated Riemann tensors to cases with up to 10 covariant derivatives of a single Riemann. We extend our computer algebra system Invar to produce within seconds a canonical form for any of those objects in terms of a basis. The process is as follows: (1) an invariant is converted in real time into a canonical form with respect to the permutation symmetries of the Riemann tensor; (2) Invar reads a database of more than 6x10^5 relations and applies those coming from the cyclic symmetry of the Riemann tensor; (3) then applies the relations coming from the Bianchi identity, (4) the relations coming from commutations of covariant derivatives, (5) the dimensionally-dependent identities for dimension 4, and finally (6) simplifies invariants that can be expressed as product of dual invariants. Invar runs on top of the tensor computer algebra systems xTensor (for Mathematica) and Canon (for Maple).
△ Less
Submitted 11 February, 2008;
originally announced February 2008.