-
Towards a more inductive world for drug repurposing approaches
Authors:
Jesus de la Fuente,
Guillermo Serrano,
Uxía Veleiro,
Mikel Casals,
Laura Vera,
Marija Pizurica,
Antonio Pineda-Lucena,
Idoia Ochoa,
Silve Vicent,
Olivier Gevaert,
Mikel Hernaez
Abstract:
Drug-target interaction (DTI) prediction is a challenging, albeit essential, task in drug repurposing. Graph learning models have drawn special attention as they can significantly reduce the cost and time commitment of drug repurposing. However, many current approaches require demanding additional information besides DTIs, which complicates their evaluation and usability. Additionally, structural differences in the learning architecture of current models hinder their fair benchmarking. In this work, we first perform an in-depth evaluation of current DTI datasets and prediction models through a robust benchmarking process, and show that DTI prediction methods based on transductive models lack generalization and lead to inflated performance when evaluated as previously done in the literature, and hence are not suited for drug repurposing approaches. We then propose a novel biologically-driven strategy for negative edge subsampling and show through in vitro validation that newly discovered interactions are indeed true. We envision this work as the underpinning for future fair benchmarking and robust model design. All generated resources and tools are publicly available as a Python package.
Submitted 24 November, 2023; v1 submitted 21 November, 2023;
originally announced November 2023.
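The abstract's point about transductive evaluation inflating performance can be illustrated with an inductive split, in which every drug in the test set is absent from training. The following is a minimal sketch under stated assumptions: the function name `inductive_split`, the toy pairs, and the choice to hold out drugs (rather than targets) are all hypothetical and are not taken from the paper or its Python package.

```python
import random

def inductive_split(pairs, test_fraction=0.2, seed=0):
    """Split (drug, target) pairs so that no test drug appears in training.

    Hypothetical helper for illustration; not the paper's protocol.
    """
    rng = random.Random(seed)
    drugs = sorted({drug for drug, _ in pairs})
    rng.shuffle(drugs)
    # Hold out at least one drug entirely.
    n_test = max(1, int(len(drugs) * test_fraction))
    test_drugs = set(drugs[:n_test])
    train = [(d, t) for d, t in pairs if d not in test_drugs]
    test = [(d, t) for d, t in pairs if d in test_drugs]
    return train, test

# Toy interaction list (made-up identifiers).
pairs = [("d1", "t1"), ("d1", "t2"), ("d2", "t1"),
         ("d3", "t3"), ("d4", "t2"), ("d5", "t3")]
train, test = inductive_split(pairs)
```

A random split over pairs would instead let the same drug appear on both sides, which is the setting in which a transductive model can look better than it generalizes.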
-
Sweetwater: An interpretable and adaptive autoencoder for efficient tissue deconvolution
Authors:
Jesus de la Fuente,
Naroa Legarra,
Guillermo Serrano,
Irene Marin-Goni,
Aintzane Diaz-Mazkiaran,
Markel Benito Sendin,
Ana Garcia Osta,
Krishna R. Kalari,
Carlos Fernandez-Granda,
Idoia Ochoa,
Mikel Hernaez
Abstract:
Single-cell RNA-sequencing (scRNA-seq) stands as a powerful tool for deciphering cellular heterogeneity and exploring gene expression profiles at high resolution. However, its high cost renders it impractical for extensive sample cohorts within routine clinical care, hindering its broader applicability. Hence, many methodologies have recently arisen to estimate cell type proportions from bulk RNA-seq samples (known as deconvolution methods). However, they have several limitations. First, many depend on selecting a robust scRNA-seq reference dataset, which is often challenging. Second, building reliable pseudobulk samples requires determining the optimal number of genes or cells involved in the simulated data generation process, which has not been studied in depth. Moreover, pseudobulk and bulk RNA-seq samples often exhibit distribution shifts. Finally, most modern deconvolution approaches behave as a black box, and the underlying mechanisms of the deconvolution task are still unknown, which can compromise the reliability of the results. In this work, we present Sweetwater, an adaptive and interpretable autoencoder able to efficiently deconvolve bulk RNA-seq and microarray samples leveraging multiple classes of reference data, such as scRNA-seq and single-nuclei RNA-seq. Moreover, it can be trained on a mixture of FACS-sorted FASTQ files, which we newly propose to use as this reduces platform-specific biases and may potentially outperform single-cell-based references. We also demonstrate that Sweetwater effectively uncovers biologically meaningful patterns during the training process, increasing the reliability of the results. Sweetwater is available at https://github.com/ubioinformat/Sweetwater, and we anticipate it will facilitate and expedite the accurate examination of high-throughput clinical data across diverse applications.
Submitted 17 March, 2024; v1 submitted 20 November, 2023;
originally announced November 2023.
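The pseudobulk samples mentioned in the abstract are commonly simulated by summing the counts of cells sampled from a single-cell reference, so that the true cell type fractions are known. The sketch below illustrates that general idea only; the function `make_pseudobulk`, its parameters, and the toy count matrix are assumptions and do not reproduce Sweetwater's actual data generation.

```python
import random

def make_pseudobulk(counts, cell_types, n_cells=100, seed=0):
    """Sum sampled single-cell profiles into one simulated bulk profile.

    counts: list of per-cell gene count lists; cell_types: one label per cell.
    Hypothetical helper for illustration; not Sweetwater's implementation.
    """
    rng = random.Random(seed)
    types = sorted(set(cell_types))
    cells_by_type = {t: [i for i, c in enumerate(cell_types) if c == t]
                     for t in types}
    # Random mixing weights (normalised Gamma(1, 1) draws act like a flat Dirichlet).
    weights = [rng.gammavariate(1.0, 1.0) for _ in types]
    drawn_types = rng.choices(types, weights=weights, k=n_cells)
    n_genes = len(counts[0])
    profile = [0] * n_genes
    for t in drawn_types:
        cell = rng.choice(cells_by_type[t])  # sample a cell with replacement
        for g in range(n_genes):
            profile[g] += counts[cell][g]
    # Ground-truth fractions are the realised share of each cell type.
    fractions = {t: drawn_types.count(t) / n_cells for t in types}
    return profile, fractions

# Toy reference: 6 cells x 3 genes (made-up counts).
counts = [[5, 0, 1], [4, 1, 0], [0, 6, 2], [1, 7, 1], [0, 0, 9], [1, 0, 8]]
cell_types = ["A", "A", "B", "B", "C", "C"]
profile, fractions = make_pseudobulk(counts, cell_types)
```

The `n_cells` argument corresponds to one of the design choices the abstract notes has not been studied in depth, namely how many cells go into each simulated sample.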
-
A Flexible Channel Coding Approach for Short-Length Codewords
Authors:
Mikel Hernaez,
Pedro M. Crespo,
Javier Del Ser
Abstract:
This letter introduces a novel channel coding design framework for short-length codewords that permits balancing the tradeoff between the bit error rate floor and waterfall region by modifying a single real-valued parameter. The proposed approach is based on combining convolutional coding with a $q$-ary linear combination and unequal energy allocation, the latter being controlled by the aforementioned parameter. EXIT charts are used to shed light on the convergence characteristics of the associated iterative decoder, which is described in terms of factor graphs. Simulation results show that the proposed scheme is able to adjust its end-to-end error rate performance efficiently and easily, in contrast to previous approaches that require a full code redesign when the error rate requirements of the application change. Simulations also show that, at mid-range bit error rates, there is a small performance penalty with respect to the previous approaches. However, the EXIT chart analysis and the simulation results suggest that for very low bit error rates the proposed system will exhibit lower error floors than previous approaches.
Submitted 22 March, 2012;
originally announced March 2012.
-
On the Design of a Novel Joint Network-Channel Coding Scheme for the Multiple Access Relay Channel
Authors:
Mikel Hernaez,
Pedro M. Crespo,
Javier Del Ser
Abstract:
This paper proposes a novel joint non-binary network-channel code for the Time-Division Decode-and-Forward Multiple Access Relay Channel (TD-DF-MARC), where the relay linearly combines -- over a non-binary finite field -- the coded sequences from the source nodes. A method based on an EXIT chart analysis is derived for selecting the best coefficients of the linear combination. Moreover, it is shown that for different setups of the system, different coefficients should be chosen in order to improve the performance. This conclusion contrasts with previous works where a random selection was considered. Monte Carlo simulations show that the proposed scheme outperforms, in terms of its gap to the outage probabilities, the previously published joint network-channel coding approaches. Besides, this gain is achieved by using very short-length codewords, which makes the scheme particularly attractive for low-latency applications.
Submitted 21 March, 2012;
originally announced March 2012.