Skip to main content

Showing 1–6 of 6 results for author: Ochoa, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.12670  [pdf, other

    cs.LG q-bio.QM

    Towards a more inductive world for drug repurposing approaches

    Authors: Jesus de la Fuente, Guillermo Serrano, Uxía Veleiro, Mikel Casals, Laura Vera, Marija Pizurica, Antonio Pineda-Lucena, Idoia Ochoa, Silve Vicent, Olivier Gevaert, Mikel Hernaez

    Abstract: Drug-target interaction (DTI) prediction is a challenging, albeit essential task in drug repurposing. Learning on graph models have drawn special attention as they can significantly reduce drug repurposing costs and time commitment. However, many current approaches require high-demanding additional information besides DTIs that complicates their evaluation process and usability. Additionally, stru… ▽ More

    Submitted 24 November, 2023; v1 submitted 21 November, 2023; originally announced November 2023.

  2. arXiv:2310.12804  [pdf, other

    hep-ex cs.LG hep-ph physics.data-an

    Differentiable Vertex Fitting for Jet Flavour Tagging

    Authors: Rachel E. C. Smith, Inês Ochoa, Rúben Inácio, Jonathan Shoemaker, Michael Kagan

    Abstract: We propose a differentiable vertex fitting algorithm that can be used for secondary vertex fitting, and that can be seamlessly integrated into neural networks for jet flavour tagging. Vertex fitting is formulated as an optimization problem where gradients of the optimized solution vertex are defined through implicit differentiation and can be passed to upstream or downstream neural network compone… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: 11 pages

  3. arXiv:1911.03572  [pdf, other

    cs.LG cs.IT stat.ML

    DZip: improved general-purpose lossless compression based on novel neural network modeling

    Authors: Mohit Goyal, Kedar Tatwawadi, Shubham Chandak, Idoia Ochoa

    Abstract: We consider lossless compression based on statistical data modeling followed by prediction-based encoding, where an accurate statistical model for the input data leads to substantial improvements in compression. We propose DZip, a general-purpose compressor for sequential data that exploits the well-known modeling capabilities of neural networks (NNs) for prediction, followed by arithmetic coding.… ▽ More

    Submitted 18 September, 2020; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: Updated manuscript and an efficient implementation added

  4. arXiv:1811.08162  [pdf, other

    cs.CL eess.SP q-bio.GN

    DeepZip: Lossless Data Compression using Recurrent Neural Networks

    Authors: Mohit Goyal, Kedar Tatwawadi, Shubham Chandak, Idoia Ochoa

    Abstract: Sequential data is being generated at an unprecedented pace in various forms, including text and genomic data. This creates the need for efficient compression mechanisms to enable better storage, transmission and processing of such data. To solve this problem, many of the existing compressors attempt to learn models for the data and perform prediction-based compression. Since neural networks are k… ▽ More

    Submitted 20 November, 2018; originally announced November 2018.

  5. arXiv:1207.5184  [pdf, ps, other

    q-bio.GN cs.IT q-bio.QM

    Lossy Compression of Quality Values via Rate Distortion Theory

    Authors: Himanshu Asnani, Dinesh Bharadia, Mainak Chowdhury, Idoia Ochoa, Itai Sharon, Tsachy Weissman

    Abstract: Motivation: Next Generation Sequencing technologies revolutionized many fields in biology by enabling the fast and cheap sequencing of large amounts of genomic data. The ever increasing sequencing capacities enabled by current sequencing machines hold a lot of promise as for the future applications of these technologies, but also create increasing computational challenges related to the analysis a… ▽ More

    Submitted 21 July, 2012; originally announced July 2012.

    Comments: 7 Pages, 8 Figures, Submitted to Bioinformatics

  6. Reference Based Genome Compression

    Authors: Bobbie Chern, Idoia Ochoa, Alexandros Manolakos, Albert No, Kartik Venkat, Tsachy Weissman

    Abstract: DNA sequencing technology has advanced to a point where storage is becoming the central bottleneck in the acquisition and mining of more data. Large amounts of data are vital for genomics research, and generic compression tools, while viable, cannot offer the same savings as approaches tuned to inherent biological properties. We propose an algorithm to compress a target genome given a known refere… ▽ More

    Submitted 9 April, 2012; originally announced April 2012.

    Comments: 5 pages; Submitted to the IEEE Information Theory Workshop (ITW) 2012