-
Del Visual al Auditivo: Sonorización de Escenas Guiada por Imagen
Authors:
María Sánchez,
Laura Fernández,
Julián Arias,
Mateo Cámara,
Giulia Comini,
Adam Gabrys,
José Luis Blanco,
Juan Ignacio Godino,
Luis Alfonso Hernández
Abstract:
Recent advances in image, video, text and audio generative techniques, and their use by the general public, are leading to new forms of content generation. Usually, each modality was approached separately, which poses limitations. The automatic sound recording of visual sequences is one of the greatest challenges for the automatic generation of multimodal content. We present a processing flow that…
▽ More
Recent advances in image, video, text and audio generative techniques, and their use by the general public, are leading to new forms of content generation. Usually, each modality was approached separately, which poses limitations. The automatic sound recording of visual sequences is one of the greatest challenges for the automatic generation of multimodal content. We present a processing flow that, starting from images extracted from videos, is able to sound them. We work with pre-trained models that employ complex encoders, contrastive learning, and multiple modalities, allowing complex representations of the sequences for their sonorization. The proposed scheme proposes different possibilities for audio map** and text guidance. We evaluated the scheme on a dataset of frames extracted from a commercial video game and sounds extracted from the Freesound platform. Subjective tests have evidenced that the proposed scheme is able to generate and assign audios automatically and conveniently to images. Moreover, it adapts well to user preferences, and the proposed objective metrics show a high correlation with the subjective ratings.
△ Less
Submitted 2 February, 2024;
originally announced February 2024.
-
Impact of Automatic Image Classification and Blind Deconvolution in Improving Text Detection Performance of the CRAFT Algorithm
Authors:
Clarisa V. Albarillo,
Proceso L. Fernandez Jr
Abstract:
Text detection in natural scenes has been a significant and active research subject in computer vision and document analysis because of its wide range of applications as evidenced by the emergence of the Robust Reading Competition. One of the algorithms which has good text detection performance in the said competition is the Character Region Awareness for Text Detection (CRAFT). Employing the ICDA…
▽ More
Text detection in natural scenes has been a significant and active research subject in computer vision and document analysis because of its wide range of applications as evidenced by the emergence of the Robust Reading Competition. One of the algorithms which has good text detection performance in the said competition is the Character Region Awareness for Text Detection (CRAFT). Employing the ICDAR 2013 dataset, this study investigates the impact of automatic image classification and blind deconvolution as image pre-processing steps to further enhance the text detection performance of CRAFT. The proposed technique automatically classifies the scene images into two categories, blurry and non-blurry, by utilizing of a Laplacian operator with 100 as threshold. Prior to applying the CRAFT algorithm, images that are categorized as blurry are further pre-processed using blind deconvolution to reduce the blur. The results revealed that the proposed method significantly enhanced the detection performance of CRAFT, as demonstrated by its IoU h-mean of 94.47% compared to the original 91.42% h-mean of CRAFT and this even outperformed the top-ranked SenseTime, whose h-mean is 93.62%.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
Phase transition in the computational complexity of the shortest common superstring and genome assembly
Authors:
L. A. Fernandez,
V. Martin-Mayor,
D. Yllanes
Abstract:
Genome assembly, the process of reconstructing a long genetic sequence by aligning and merging short fragments, or reads, is known to be NP-hard, either as a version of the shortest common superstring problem or in a Hamiltonian-cycle formulation. That is, the computing time is believed to grow exponentially with the the problem size in the worst case. Despite this fact, high-throughput technologi…
▽ More
Genome assembly, the process of reconstructing a long genetic sequence by aligning and merging short fragments, or reads, is known to be NP-hard, either as a version of the shortest common superstring problem or in a Hamiltonian-cycle formulation. That is, the computing time is believed to grow exponentially with the the problem size in the worst case. Despite this fact, high-throughput technologies and modern algorithms currently allow bioinformaticians to handle datasets of billions of reads. Using methods from statistical mechanics, we address this conundrum by demonstrating the existence of a phase transition in the computational complexity of the problem and showing that practical instances always fall in the 'easy' phase (solvable by polynomial-time algorithms). In addition, we propose a Markov-chain Monte Carlo method that outperforms common deterministic algorithms in the hard regime.
△ Less
Submitted 11 March, 2024; v1 submitted 18 October, 2022;
originally announced October 2022.
-
A Scalable Data Streaming Infrastructure for Smart Cities
Authors:
Jesus Arias Fisteus,
Luis Sanchez Fernandez,
Victor Corcoba Magaña,
Mario Muñoz Organero,
Jorge Yago Fernandez,
Juan Antonio Alvarez Garcia
Abstract:
Many of the services a smart city can provide to its citizens rely on the ability of its infrastructure to collect and process in real time vast amounts of continuous data that sensors deployed through the city produce. In this paper we present the server infrastructure we have designed in the context of the HERMES project to collect the data from sensors and aggregate it in streams for their use…
▽ More
Many of the services a smart city can provide to its citizens rely on the ability of its infrastructure to collect and process in real time vast amounts of continuous data that sensors deployed through the city produce. In this paper we present the server infrastructure we have designed in the context of the HERMES project to collect the data from sensors and aggregate it in streams for their use in services of the smart city.
△ Less
Submitted 8 March, 2017;
originally announced March 2017.
-
Wearable Embroidered Muscle Activity Sensing Device for the Human Upper Leg
Authors:
R. B. Ribas Manero,
J. Grewal,
B. Michael,
A. Shafti,
K. Althoefer,
J. Ll. Ribas Fernandez,
M. J. Howard
Abstract:
Within the last decade, running has become one of the most popular physical activities in the world. Although the benefits of running are numerous, there is a risk of Running Related Injuries (RRI) of the lower extremities. Electromyography (EMG) techniques have previously been used to study causes of RRIs, but the complexity of this technology limits its use to a laboratory setting. As running is…
▽ More
Within the last decade, running has become one of the most popular physical activities in the world. Although the benefits of running are numerous, there is a risk of Running Related Injuries (RRI) of the lower extremities. Electromyography (EMG) techniques have previously been used to study causes of RRIs, but the complexity of this technology limits its use to a laboratory setting. As running is primarily an outdoors activity, this lack of technology acts as a barrier to the study of RRIs in natural environments. This study presents a minimally invasive wearable muscle sensing device consisting of jogging leggings with embroidered surface EMG (sEMG) electrodes capable of recording muscle activity data of the quadriceps group. To test the use of the device, a proof of concept study consisting of $N=2$ runners performing a set of $5km$ running trials is presented in which the effect of running surfaces on muscle fatigue, a potential cause of RRIs, is evaluated. Results show that muscle fatigue can be analysed from the sEMG data obtained through the wearable device, and that running on soft surfaces (such as sand) may increase the likelihood of suffering from RRIs.
△ Less
Submitted 15 February, 2016;
originally announced February 2016.
-
Janus II: a new generation application-driven computer for spin-system simulations
Authors:
Janus Collaboration,
M. Baity-Jesi,
R. A. Baños,
A. Cruz,
L. A. Fernandez,
J. M. Gil-Narvion,
A. Gordillo-Guerrero,
D. Iñiguez,
A. Maiorano,
F. Mantovani,
E. Marinari,
V. Martin-Mayor,
J. Monforte-Garcia,
A. Muñoz Sudupe,
D. Navarro,
G. Parisi,
S. Perez-Gaviro,
M. Pivanti,
F. Ricci-Tersenghi,
J. J. Ruiz-Lorenzo,
S. F. Schifano,
B. Seoane,
A. Tarancon,
R. Tripiccione,
D. Yllanes
Abstract:
This paper describes the architecture, the development and the implementation of Janus II, a new generation application-driven number cruncher optimized for Monte Carlo simulations of spin systems (mainly spin glasses). This domain of computational physics is a recognized grand challenge of high-performance computing: the resources necessary to study in detail theoretical models that can make cont…
▽ More
This paper describes the architecture, the development and the implementation of Janus II, a new generation application-driven number cruncher optimized for Monte Carlo simulations of spin systems (mainly spin glasses). This domain of computational physics is a recognized grand challenge of high-performance computing: the resources necessary to study in detail theoretical models that can make contact with experimental data are by far beyond those available using commodity computer systems. On the other hand, several specific features of the associated algorithms suggest that unconventional computer architectures, which can be implemented with available electronics technologies, may lead to order of magnitude increases in performance, reducing to acceptable values on human scales the time needed to carry out simulation campaigns that would take centuries on commercially available machines. Janus II is one such machine, recently developed and commissioned, that builds upon and improves on the successful JANUS machine, which has been used for physics since 2008 and is still in operation today. This paper describes in detail the motivations behind the project, the computational requirements, the architecture and the implementation of this new machine and compares its expected performances with those of currently available commercial systems.
△ Less
Submitted 3 October, 2013;
originally announced October 2013.
-
Quid Pro Quo: A Mechanism for Fair Collaboration in Networked Systems
Authors:
Agustín Santos Méndez,
Antonio Fernández Anta,
Luis López Fernández
Abstract:
Collaboration may be understood as the execution of coordinated tasks (in the most general sense) by groups of users, who cooperate for achieving a common goal. Collaboration is a fundamental assumption and requirement for the correct operation of many communication systems. The main challenge when creating collaborative systems in a decentralized manner is dealing with the fact that users may beh…
▽ More
Collaboration may be understood as the execution of coordinated tasks (in the most general sense) by groups of users, who cooperate for achieving a common goal. Collaboration is a fundamental assumption and requirement for the correct operation of many communication systems. The main challenge when creating collaborative systems in a decentralized manner is dealing with the fact that users may behave in selfish ways, trying to obtain the benefits of the tasks but without participating in their execution. In this context, Game Theory has been instrumental to model collaborative systems and the task allocation problem, and to design mechanisms for optimal allocation of tasks. In this paper, we revise the classical assumptions and propose a new approach to this problem. First, we establish a system model based on heterogenous nodes (users, players), and propose a basic distributed mechanism so that, when a new task appears, it is assigned to the most suitable node. The classical technique for compensating a node that executes a task is the use of payments (which in most networks are hard or impossible to implement). Instead, we propose a distributed mechanism for the optimal allocation of tasks without payments. We prove this mechanism to be robust event in the presence of independent selfish or rationally limited players. Additionally, our model is based on very weak assumptions, which makes the proposed mechanisms susceptible to be implemented in networked systems (e.g., the Internet).
△ Less
Submitted 25 July, 2012;
originally announced July 2012.
-
Reconfigurable computing for Monte Carlo simulations: results and prospects of the Janus project
Authors:
Janus Collaboration,
M. Baity-Jesi,
R. A. Banos,
A. Cruz,
L. A. Fernandez,
J. M. Gil-Narvion,
A. Gordillo-Guerrero,
M. Guidetti,
D. Iniguez,
A. Maiorano,
F. Mantovani,
E. Marinari,
V. Martin-Mayor,
J. Monforte-Garcia,
A. Munoz Sudupe,
D. Navarro,
G. Parisi,
M. Pivanti,
S. Perez-Gaviro,
F. Ricci-Tersenghi,
J. J. Ruiz-Lorenzo,
S. F. Schifano,
B. Seoane,
A. Tarancon,
P. Tellez
, et al. (2 additional authors not shown)
Abstract:
We describe Janus, a massively parallel FPGA-based computer optimized for the simulation of spin glasses, theoretical models for the behavior of glassy materials. FPGAs (as compared to GPUs or many-core processors) provide a complementary approach to massively parallel computing. In particular, our model problem is formulated in terms of binary variables, and floating-point operations can be (almo…
▽ More
We describe Janus, a massively parallel FPGA-based computer optimized for the simulation of spin glasses, theoretical models for the behavior of glassy materials. FPGAs (as compared to GPUs or many-core processors) provide a complementary approach to massively parallel computing. In particular, our model problem is formulated in terms of binary variables, and floating-point operations can be (almost) completely avoided. The FPGA architecture allows us to run many independent threads with almost no latencies in memory access, thus updating up to 1024 spins per cycle. We describe Janus in detail and we summarize the physics results obtained in four years of operation of this machine; we discuss two types of physics applications: long simulations on very large systems (which try to mimic and provide understanding about the experimental non-equilibrium dynamics), and low-temperature equilibrium simulations using an artificial parallel tempering dynamics. The time scale of our non-equilibrium simulations spans eleven orders of magnitude (from picoseconds to a tenth of a second). On the other hand, our equilibrium simulations are unprecedented both because of the low temperatures reached and for the large systems that we have brought to equilibrium. A finite-time scaling ansatz emerges from the detailed comparison of the two sets of simulations. Janus has made it possible to perform spin-glass simulations that would take several decades on more conventional architectures. The paper ends with an assessment of the potential of possible future versions of the Janus architecture, based on state-of-the-art technology.
△ Less
Submitted 18 April, 2012;
originally announced April 2012.
-
JANUS: an FPGA-based System for High Performance Scientific Computing
Authors:
F. Belletti,
M. Cotallo,
A. Cruz,
L. A. Fernández,
A. Gordillo,
M. Guidetti,
A. Maiorano,
F. Mantovani,
E. Marinari,
V. Martín-Mayor,
A. Muñoz-Sudupe,
D. Navarro,
G. Parisi,
S. Pérez-Gaviro,
M. Rossi,
J. J. Ruiz-Lorenzo,
S. F. Schifano,
D. Sciretti,
A. Tarancón,
R. Tripiccione,
J. L. Velasco
Abstract:
This paper describes JANUS, a modular massively parallel and reconfigurable FPGA-based computing system. Each JANUS module has a computational core and a host. The computational core is a 4x4 array of FPGA-based processing elements with nearest-neighbor data links. Processors are also directly connected to an I/O node attached to the JANUS host, a conventional PC. JANUS is tailored for, but not…
▽ More
This paper describes JANUS, a modular massively parallel and reconfigurable FPGA-based computing system. Each JANUS module has a computational core and a host. The computational core is a 4x4 array of FPGA-based processing elements with nearest-neighbor data links. Processors are also directly connected to an I/O node attached to the JANUS host, a conventional PC. JANUS is tailored for, but not limited to, the requirements of a class of hard scientific applications characterized by regular code structure, unconventional data manipulation instructions and not too large data-base size. We discuss the architecture of this configurable machine, and focus on its use on Monte Carlo simulations of statistical mechanics. On this class of application JANUS achieves impressive performances: in some cases one JANUS processing element outperfoms high-end PCs by a factor ~ 1000. We also discuss the role of JANUS on other classes of scientific applications.
△ Less
Submitted 8 April, 2008; v1 submitted 18 October, 2007;
originally announced October 2007.
-
Simulating spin systems on IANUS, an FPGA-based computer
Authors:
F. Belletti,
M. Cotallo,
A. Cruz,
L. A. Fernández,
A. Gordillo,
A. Maiorano,
F. Mantovani,
E. Marinari,
V. Martín-Mayor,
A. Muñoz-Sudupe,
D. Navarro,
S. Pérez-Gaviro,
J. J. Ruiz-Lorenzo,
S. F. Schifano,
D. Sciretti,
A. Tarancón,
R. Tripiccione,
J. L. Velasco
Abstract:
We describe the hardwired implementation of algorithms for Monte Carlo simulations of a large class of spin models. We have implemented these algorithms as VHDL codes and we have mapped them onto a dedicated processor based on a large FPGA device. The measured performance on one such processor is comparable to O(100) carefully programmed high-end PCs: it turns out to be even better for some sele…
▽ More
We describe the hardwired implementation of algorithms for Monte Carlo simulations of a large class of spin models. We have implemented these algorithms as VHDL codes and we have mapped them onto a dedicated processor based on a large FPGA device. The measured performance on one such processor is comparable to O(100) carefully programmed high-end PCs: it turns out to be even better for some selected spin models. We describe here codes that we are currently executing on the IANUS massively parallel FPGA-based system.
△ Less
Submitted 26 April, 2007;
originally announced April 2007.