-
Lessons learned from a performance analysis and optimization of a multiscale cellular simulation
Authors:
Marc Clascà,
Marta Garcia-Gasulla,
Arnau Montagud,
Jose Carbonell Caballero,
Alfonso Valencia
Abstract:
This work presents a comprehensive performance analysis and optimization of a multiscale agent-based cellular simulation. The optimizations applied are guided by detailed performance analysis and include memory management, load balance, and a locality-aware parallelization. The outcome of this paper is not only the speedup of 2.4x achieved by the optimized version with respect to the original PhysiCell code, but also the lessons learned and best practices when developing parallel HPC codes to obtain efficient and highly performant applications, especially in the computational biology field.
Submitted 20 June, 2023;
originally announced June 2023.
-
A Comparison of Tiny-nerf versus Spatial Representations for 3d Reconstruction
Authors:
Saulo Abraham Gante,
Juan Irving Vasquez,
Marco Antonio Valencia,
Mauricio Olguín Carbajal
Abstract:
Neural rendering has emerged as a powerful paradigm for synthesizing images, offering many benefits over classical rendering by using neural networks to reconstruct surfaces, represent shapes, and synthesize novel views, either for objects or scenes. In neural rendering, the environment is encoded into a neural network. We believe that these new representations can be used to encode the scene for a mobile robot. Therefore, in this work, we compare a popular neural rendering method, called tiny-NeRF, with other volume representations that are commonly used as maps in robotics, such as voxel maps, point clouds, and triangular meshes. The goal is to identify the advantages and disadvantages of neural representations in the robotics context. The comparison is made in terms of spatial complexity and the processing time needed to obtain a model. Experiments show that tiny-NeRF requires three times less memory space compared to other representations. In terms of processing time, tiny-NeRF takes about six times longer to compute the model.
Submitted 26 January, 2023;
originally announced January 2023.
-
Parallel Model Exploration for Tumor Treatment Simulations
Authors:
Charilaos Akasiadis,
Miguel Ponce-de-Leon,
Arnau Montagud,
Evangelos Michelioudakis,
Alexia Atsidakou,
Elias Alevizos,
Alexander Artikis,
Alfonso Valencia,
Georgios Paliouras
Abstract:
Computational systems and methods are often used in biological research, including the understanding of cancer and the development of treatments. Simulations of tumor growth and its response to different drugs are of particular importance, but also of challenging complexity. The main challenges are, first, to calibrate the simulators so as to reproduce real-world cases, and second, to search for specific values of the parameter space concerning effective drug treatments. In this work, we combine a multi-scale simulator for tumor cell growth and a Genetic Algorithm (GA) as a heuristic search method for finding good parameter configurations in reasonable time. The two modules are integrated into a single workflow that can be executed in parallel on high performance computing infrastructures. In effect, the GA is used to calibrate the simulator, and then to explore different drug delivery schemes. Among these schemes, we aim to find those that minimize tumor cell size and the probability of emergence of drug resistant cells in the future. Experimental results illustrate the effectiveness and computational efficiency of the approach.
Submitted 22 February, 2022; v1 submitted 25 March, 2021;
originally announced March 2021.
-
Linear codes over signed graphs
Authors:
Jose Martinez-Bernal,
Miguel A. Valencia,
Rafael H. Villarreal
Abstract:
We give formulas, in terms of graph theoretical invariants, for the minimum distance and the generalized Hamming weights of the linear code generated by the rows of the incidence matrix of a signed graph over a finite field, and for those of its dual code. Then we determine the regularity of the ideals of circuits and cocircuits of a signed graph, and prove an algebraic formula in terms of the multiplicity for the frustration index of an unbalanced signed graph.
Submitted 21 September, 2019; v1 submitted 20 April, 2019;
originally announced April 2019.
-
Algorithmic complexity in computational biology: basics, challenges and limitations
Authors:
Davide Cirillo,
Miguel Ponce-de-Leon,
Alfonso Valencia
Abstract:
Computational problems can be classified according to their algorithmic complexity, which is defined based on how the resources needed to solve the problem, e.g. the execution time, scale with the problem size. Many problems in computational biology are computationally infeasible in the sense that the exhaustive search for the optimal solution is prohibitive in practical terms. As a consequence, these problems are tackled through heuristics and approximations aiming to overcome the excessive computational requirements at the cost of providing suboptimal solutions. The importance of defining the computational complexity of computational biology algorithms is a topic rarely surveyed for broad audiences of bioinformaticians and users of bioinformatics tools. However, recognizing the underlying complexity of any algorithm is essential for understanding its potential and limitations. Thus, the aim of this review is to survey the main algorithmic solutions to intractable problems in computational biology, highlighting the importance of High-Performance Computing in this area.
Submitted 28 July, 2021; v1 submitted 18 November, 2018;
originally announced November 2018.
-
Accurate Demarcation of Protein Domain Linkers based on Structural Analysis of Linker Probable Region
Authors:
Vivekanand Samant,
Arvind Hulgeri,
Alfonso Valencia,
Ashish V. Tendulkar
Abstract:
In multi-domain proteins, the domains are connected by a flexible unstructured region called a protein domain linker. The accurate demarcation of these linkers is key to understanding their biochemical and evolutionary attributes. This knowledge helps in designing a suitable linker for engineering stable multi-domain chimeric proteins. Here we propose a novel method for the demarcation of the linker based on a three-dimensional protein structure and a domain definition. The proposed method is based on biological knowledge about the structural flexibility of linkers. We performed structural analysis on a linker probable region (LPR) around domain boundary points of known SCOP domains. The LPR was described using a set of overlapping peptide fragments of fixed size. Each peptide fragment was then described by geometric invariants (GIs) and subjected to a clustering process in which the fragments corresponding to the actual linker emerge as outliers. We then identify the actual linkers by finding the longest continuous stretch of outlier fragments from LPRs. This method was evaluated on a benchmark dataset of 51 continuous multi-domain proteins, where it achieves an F1 score of 0.745 (0.83 precision and 0.66 recall). When the method was applied to 725 continuous multi-domain proteins, it was able to identify novel linkers that were not reported previously. This method can be used in combination with supervised / sequence based linker prediction methods for accurate linker demarcation.
Submitted 23 November, 2012;
originally announced November 2012.
-
Mirroring co-evolving trees in the light of their topologies
Authors:
Iman Hajirasouliha,
Alexander Schönhuth,
David Juan,
Alfonso Valencia,
S. Cenk Sahinalp
Abstract:
Determining the interaction partners among protein/domain families poses hard computational problems, in particular in the presence of paralogous proteins. Available approaches aim to identify interaction partners among protein/domain families through maximizing the similarity between trimmed versions of their phylogenetic trees. Since maximization of any natural similarity score is computationally difficult, many approaches employ heuristics to maximize the distance matrices corresponding to the tree topologies in question. In this paper we devise an efficient deterministic algorithm which directly maximizes the similarity between two leaf labeled trees with edge lengths, obtaining a score-optimal alignment of the two trees in question.
Our algorithm is significantly faster than methods based on distance matrix comparison: 1 minute on a single processor vs. 730 hours on a supercomputer. Furthermore, it has advantages over the current state-of-the-art heuristic search approach in terms of precision as well as a recently suggested overall performance measure for mirrortree approaches, while incurring only acceptable losses in recall.
A C implementation of the method demonstrated in this paper is available at http://compbio.cs.sfu.ca/mirrort.htm
Submitted 26 October, 2011;
originally announced October 2011.