Showing 1–16 of 16 results for author: Calì, S

  1. arXiv:2209.08600  [pdf, other]

    cs.AR cs.DS q-bio.GN

    GenPIP: In-Memory Acceleration of Genome Analysis via Tight Integration of Basecalling and Read Mapping

    Authors: Haiyu Mao, Mohammed Alser, Mohammad Sadrosadati, Can Firtina, Akanksha Baranwal, Damla Senol Cali, Aditya Manglik, Nour Almadhoun Alserr, Onur Mutlu

    Abstract: Nanopore sequencing is a widely-used high-throughput genome sequencing technology that can sequence long fragments of a genome into raw electrical signals at low cost. Nanopore sequencing requires two computationally-costly processing steps for accurate downstream genome analysis. The first step, basecalling, translates the raw electrical signals into nucleotide bases (i.e., A, C, G, T). The secon…

    Submitted 17 December, 2023; v1 submitted 18 September, 2022; originally announced September 2022.

    Comments: 17 pages, 13 figures

  2. Scrooge: A Fast and Memory-Frugal Genomic Sequence Aligner for CPUs, GPUs, and ASICs

    Authors: Joël Lindegger, Damla Senol Cali, Mohammed Alser, Juan Gómez-Luna, Nika Mansouri Ghiasi, Onur Mutlu

    Abstract: Pairwise sequence alignment is a very time-consuming step in common bioinformatics pipelines. Speeding up this step requires heuristics, efficient implementations, and/or hardware acceleration. A promising candidate for all of the above is the recently proposed GenASM algorithm. We identify and address three inefficiencies in the GenASM algorithm: it has a high amount of data movement, a large mem…

    Submitted 12 April, 2023; v1 submitted 21 August, 2022; originally announced August 2022.

  3. Neural-network preconditioners for solving the Dirac equation in lattice gauge theory

    Authors: Salvatore Calì, Daniel C. Hackett, Yin Lin, Phiala E. Shanahan, Brian Xiao

    Abstract: This work develops neural-network–based preconditioners to accelerate solution of the Wilson-Dirac normal equation in lattice quantum field theories. The approach is implemented for the two-flavor lattice Schwinger model near the critical point. In this system, neural-network preconditioners are found to accelerate the convergence of the conjugate gradient solver compared with the solution of unp…

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: 11 pages, 6 figures, and 2 tables

  4. arXiv:2207.09765  [pdf, other]

    cs.AR cs.AI cs.LG q-bio.GN q-bio.QM

    ApHMM: Accelerating Profile Hidden Markov Models for Fast and Energy-Efficient Genome Analysis

    Authors: Can Firtina, Kamlesh Pillai, Gurpreet S. Kalsi, Bharathwaj Suresh, Damla Senol Cali, Jeremie Kim, Taha Shahroodi, Meryem Banu Cavlak, Joel Lindegger, Mohammed Alser, Juan Gómez Luna, Sreenivas Subramoney, Onur Mutlu

    Abstract: Profile hidden Markov models (pHMMs) are widely employed in various bioinformatics applications to identify similarities between biological sequences, such as DNA or protein sequences. In pHMMs, sequences are represented as graph structures. These probabilities are subsequently used to compute the similarity score between a sequence and a pHMM graph. The Baum-Welch algorithm, a prevalent and highl…

    Submitted 21 October, 2023; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted to ACM TACO

  5. SeGraM: A Universal Hardware Accelerator for Genomic Sequence-to-Graph and Sequence-to-Sequence Mapping

    Authors: Damla Senol Cali, Konstantinos Kanellopoulos, Joel Lindegger, Zülal Bingöl, Gurpreet S. Kalsi, Ziyi Zuo, Can Firtina, Meryem Banu Cavlak, Jeremie Kim, Nika Mansouri Ghiasi, Gagandeep Singh, Juan Gómez-Luna, Nour Almadhoun Alserr, Mohammed Alser, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu

    Abstract: A critical step of genome sequence analysis is the mapping of sequenced DNA fragments (i.e., reads) collected from an individual to a known linear reference genome sequence (i.e., sequence-to-sequence mapping). Recent works replace the linear reference sequence with a graph-based representation of the reference genome, which captures the genetic variations and diversity across many individuals in…

    Submitted 31 May, 2022; v1 submitted 12 May, 2022; originally announced May 2022.

    Comments: To appear in ISCA'22

  6. arXiv:2203.15561  [pdf, ps, other]

    cs.AR

    Algorithmic Improvement and GPU Acceleration of the GenASM Algorithm

    Authors: Joël Lindegger, Damla Senol Cali, Mohammed Alser, Juan Gómez-Luna, Onur Mutlu

    Abstract: We improve on GenASM, a recent algorithm for genomic sequence alignment, by significantly reducing its memory footprint and bandwidth requirement. Our algorithmic improvements reduce the memory footprint by 24$\times$ and the number of memory accesses by 12$\times$. We efficiently parallelize the algorithm for GPUs, achieving a 4.1$\times$ speedup over a CPU implementation of the same algorithm, a…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: To appear at the 21st IEEE International Workshop on High Performance Computational Biology (HiCOMB) 2022

  7. arXiv:2202.10400  [pdf, other]

    cs.AR cs.DC cs.OS q-bio.GN

    GenStore: A High-Performance and Energy-Efficient In-Storage Computing System for Genome Sequence Analysis

    Authors: Nika Mansouri Ghiasi, Jisung Park, Harun Mustafa, Jeremie Kim, Ataberk Olgun, Arvid Gollwitzer, Damla Senol Cali, Can Firtina, Haiyu Mao, Nour Almadhoun Alserr, Rachata Ausavarungnirun, Nandita Vijaykumar, Mohammed Alser, Onur Mutlu

    Abstract: Read mapping is a fundamental, yet computationally-expensive step in many genomics applications. It is used to identify potential matches and differences between fragments (called reads) of a sequenced genome and an already known genome (called a reference genome). To address the computational challenges in genome analysis, many prior works propose various approaches such as filters that select th…

    Submitted 6 April, 2023; v1 submitted 21 February, 2022; originally announced February 2022.

    Comments: Published at ASPLOS 2022

  8. arXiv:2202.05838  [pdf, ps, other]

    hep-lat cs.LG hep-ph

    Applications of Machine Learning to Lattice Quantum Field Theory

    Authors: Denis Boyda, Salvatore Calì, Sam Foreman, Lena Funcke, Daniel C. Hackett, Yin Lin, Gert Aarts, Andrei Alexandru, Xiao-Yong Jin, Biagio Lucini, Phiala E. Shanahan

    Abstract: There is great potential to apply machine learning in the area of numerical lattice quantum field theory, but full exploitation of that potential will require new strategies. In this white paper for the Snowmass community planning process, we discuss the unique requirements of machine learning for lattice quantum field theory research and outline what is needed to enable exploration and deployment…

    Submitted 10 February, 2022; originally announced February 2022.

    Comments: 10 pages, contribution to Snowmass 2022

    Report number: MIT-CTP/5405

  9. arXiv:2111.01916  [pdf, other]

    cs.AR q-bio.GN

    Accelerating Genome Sequence Analysis via Efficient Hardware/Algorithm Co-Design

    Authors: Damla Senol Cali

    Abstract: Genome sequence analysis plays a pivotal role in enabling many medical and scientific advancements in personalized medicine, outbreak tracing, and forensics. However, the analysis of genome sequencing data is currently bottlenecked by the computational power and memory bandwidth limitations of existing systems. In this dissertation, we propose four major works, where we characterize the real-syste…

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: Ph.D. Dissertation of Damla Senol Cali (Carnegie Mellon University, August 2021)

  10. FPGA-Based Near-Memory Acceleration of Modern Data-Intensive Applications

    Authors: Gagandeep Singh, Mohammed Alser, Damla Senol Cali, Dionysios Diamantopoulos, Juan Gómez-Luna, Henk Corporaal, Onur Mutlu

    Abstract: Modern data-intensive applications demand high computation capabilities with strict power constraints. Unfortunately, such applications suffer from a significant waste of both execution cycles and energy in current computing systems due to the costly data movement between the computation units and the memory units. Genome analysis and weather prediction are two examples of such applications. Recen…

    Submitted 3 July, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: This is an extended and updated version of a paper published in IEEE Micro, vol. 41, no. 4, pp. 39-48, 1 July-Aug. 2021

  11. arXiv:2009.07692  [pdf, other]

    cs.AR q-bio.GN

    GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis

    Authors: Damla Senol Cali, Gurpreet S. Kalsi, Zülal Bingöl, Can Firtina, Lavanya Subramanian, Jeremie S. Kim, Rachata Ausavarungnirun, Mohammed Alser, Juan Gomez-Luna, Amirali Boroumand, Anant Nori, Allison Scibisz, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu

    Abstract: Genome sequence analysis has enabled significant advancements in medical and scientific areas such as personalized medicine, outbreak tracing, and the understanding of evolution. Unfortunately, it is currently bottlenecked by the computational power and memory bandwidth limitations of existing systems, as many of the steps in genome sequence analysis must process a large amount of data. A major co…

    Submitted 16 September, 2020; originally announced September 2020.

    Comments: To appear in MICRO 2020

  12. arXiv:2008.00961  [pdf, other]

    cs.AR q-bio.GN stat.CO

    Accelerating Genome Analysis: A Primer on an Ongoing Journey

    Authors: Mohammed Alser, Zülal Bingöl, Damla Senol Cali, Jeremie Kim, Saugata Ghose, Can Alkan, Onur Mutlu

    Abstract: Genome analysis fundamentally starts with a process known as read mapping, where sequenced fragments of an organism's genome are compared against a reference genome. Read mapping is currently a major bottleneck in the entire genome analysis pipeline, because state-of-the-art genome sequencing technologies are able to sequence a genome much faster than the computational techniques employed to analy…

    Submitted 22 September, 2020; v1 submitted 30 July, 2020; originally announced August 2020.

    Comments: This is an extended and updated version of a paper published in IEEE Micro, vol. 40, no. 5, pp. 65-75, 1 Sept.-Oct. 2020, https://doi.org/10.1109/MM.2020.3013728

    Journal ref: IEEE Micro, Volume: 40, Issue: 5, Sept.-Oct. 1 2020

  13. arXiv:1912.08735  [pdf, other]

    q-bio.GN cs.CE

    AirLift: A Fast and Comprehensive Technique for Remapping Alignments between Reference Genomes

    Authors: Jeremie S. Kim, Can Firtina, Meryem Banu Cavlak, Damla Senol Cali, Mohammed Alser, Nastaran Hajinazar, Can Alkan, Onur Mutlu

    Abstract: As genome sequencing tools and techniques improve, researchers are able to incrementally assemble more accurate reference genomes, which enable sensitivity in read mapping and downstream analysis such as variant calling. A more sensitive downstream analysis is critical for a better understanding of the genome donor (e.g., health characteristics). Therefore, read sets from sequenced samples should…

    Submitted 21 November, 2022; v1 submitted 18 December, 2019; originally announced December 2019.

  14. arXiv:1902.07609  [pdf, other]

    cs.AR cs.PF

    Understanding the Interactions of Workloads and DRAM Types: A Comprehensive Experimental Study

    Authors: Saugata Ghose, Tianshi Li, Nastaran Hajinazar, Damla Senol Cali, Onur Mutlu

    Abstract: It has become increasingly difficult to understand the complex interaction between modern applications and main memory, composed of DRAM chips. Manufacturers are now selling and proposing many different types of DRAM, with each DRAM type catering to different needs (e.g., high throughput, low power, high memory density). At the same time, the memory access patterns of prevalent and emerging worklo…

    Submitted 18 October, 2019; v1 submitted 20 February, 2019; originally announced February 2019.

  15. Apollo: A Sequencing-Technology-Independent, Scalable, and Accurate Assembly Polishing Algorithm

    Authors: Can Firtina, Jeremie S. Kim, Mohammed Alser, Damla Senol Cali, A. Ercument Cicek, Can Alkan, Onur Mutlu

    Abstract: Long reads produced by third-generation sequencing technologies are used to construct an assembly (i.e., the subject's genome), which is further used in downstream genome analysis. Unfortunately, long reads have high sequencing error rates and a large proportion of bps in these long reads are incorrectly identified. These errors propagate to the assembly and affect the accuracy of genome analysis.…

    Submitted 7 March, 2020; v1 submitted 12 February, 2019; originally announced February 2019.

    Comments: 9 pages, 1 figure. Accepted in Bioinformatics

    Journal ref: Bioinformatics. 2020 Jun 1;36(12):3669-3679

  16. GRIM-Filter: Fast Seed Location Filtering in DNA Read Mapping Using Processing-in-Memory Technologies

    Authors: Jeremie S. Kim, Damla Senol Cali, Hongyi Xin, Donghyuk Lee, Saugata Ghose, Mohammed Alser, Hasan Hassan, Oguz Ergin, Can Alkan, Onur Mutlu

    Abstract: Motivation: Seed location filtering is critical in DNA read mapping, a process where billions of DNA fragments (reads) sampled from a donor are mapped onto a reference genome to identify genomic variants of the donor. State-of-the-art read mappers 1) quickly generate possible mapping locations for seeds (i.e., smaller segments) within each read, 2) extract reference sequences at each of the mappin…

    Submitted 2 November, 2017; originally announced November 2017.

    Comments: arXiv admin note: text overlap with arXiv:1708.04329

    Journal ref: BMC Genomics, 19 (Suppl 2):89, 2018