Skip to main content

Showing 1–43 of 43 results for author: Pérez-Cruz, F

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.00456  [pdf, other

    cs.LG cs.AI

    Counterfactual Explanations for Deep Learning-Based Traffic Forecasting

    Authors: Rushan Wang, Yanan Xin, Yatao Zhang, Fernando Perez-Cruz, Martin Raubal

    Abstract: Deep learning models are widely used in traffic forecasting and have achieved state-of-the-art prediction accuracy. However, the black-box nature of those models makes the results difficult to interpret by users. This study aims to leverage an Explainable AI approach, counterfactual explanations, to enhance the explainability and usability of deep learning-based traffic forecasting models. Specifi… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 24 pages

  2. arXiv:2403.03593  [pdf, other

    cs.CR cs.AI

    Do You Trust Your Model? Emerging Malware Threats in the Deep Learning Ecosystem

    Authors: Dorjan Hitaj, Giulio Pagnotta, Fabio De Gaspari, Sediola Ruko, Briland Hitaj, Luigi V. Mancini, Fernando Perez-Cruz

    Abstract: Training high-quality deep learning models is a challenging task due to computational and technical requirements. A growing number of individuals, institutions, and companies increasingly rely on pre-trained, third-party models made available in public repositories. These models are often used directly or integrated in product pipelines with no particular precautions, since they are effectively ju… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 16 pages, 9 figures

  3. arXiv:2402.12242  [pdf, other

    cs.LG

    Synthetic location trajectory generation using categorical diffusion models

    Authors: Simon Dirmeier, Ye Hong, Fernando Perez-Cruz

    Abstract: Diffusion probabilistic models (DPMs) have rapidly evolved to be one of the predominant generative models for the simulation of synthetic data, for instance, for computer vision, audio, natural language processing, or biomolecule generation. Here, we propose using DPMs for the generation of synthetic individual location trajectories (ILTs) which are sequences of variables representing physical loc… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

  4. arXiv:2311.11749  [pdf, other

    physics.soc-ph cs.LG cs.SI

    Revealing behavioral impact on mobility prediction networks through causal interventions

    Authors: Ye Hong, Yanan Xin, Simon Dirmeier, Fernando Perez-Cruz, Martin Raubal

    Abstract: Deep neural networks are increasingly utilized in mobility prediction tasks, yet their intricate internal workings pose challenges for interpretability, especially in comprehending how various aspects of mobility behavior affect predictions. This study introduces a causal intervention framework to assess the impact of mobility-related factors on neural networks designed for next location predictio… ▽ More

    Submitted 18 March, 2024; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: 31 pages, 6 figures

  5. arXiv:2311.06965  [pdf, other

    cs.LG stat.ML

    Anchor Data Augmentation

    Authors: Nora Schneider, Shirin Goshtasbpour, Fernando Perez-Cruz

    Abstract: We propose a novel algorithm for data augmentation in nonlinear over-parametrized regression. Our data augmentation algorithm borrows from the literature on causality and extends the recently proposed Anchor regression (AR) method for data augmentation, which is in contrast to the current state-of-the-art domain-agnostic solutions that rely on the Mixup literature. Our Anchor Data Augmentation (AD… ▽ More

    Submitted 27 November, 2023; v1 submitted 12 November, 2023; originally announced November 2023.

  6. arXiv:2311.01606  [pdf, other

    cs.CL

    KG-FRUS: a Novel Graph-based Dataset of 127 Years of US Diplomatic Relations

    Authors: Gökberk Özsoy, Luis Salamanca, Matthew Connelly, Raymond Hicks, Fernando Pérez-Cruz

    Abstract: In the current paper, we present the KG-FRUS dataset, comprised of more than 300,000 US government diplomatic documents encoded in a Knowledge Graph (KG). We leverage the data of the Foreign Relations of the United States (FRUS) (available as XML files) to extract information about the documents and the individuals and countries mentioned within them. We use the extracted entities, and associated… ▽ More

    Submitted 30 October, 2023; originally announced November 2023.

    Comments: 11 pages, 5 figures, 2 tables, submitted to NeurIPS databases. Mixed of social sciences and data analysis content

  7. arXiv:2311.00474  [pdf, ps, other

    cs.LG stat.ML

    Diffusion models for probabilistic programming

    Authors: Simon Dirmeier, Fernando Perez-Cruz

    Abstract: We propose Diffusion Model Variational Inference (DMVI), a novel method for automated approximate inference in probabilistic programming languages (PPLs). DMVI utilizes diffusion models as variational approximations to the true posterior distribution by deriving a novel bound to the marginal likelihood objective used in Bayesian modelling. DMVI is easy to implement, allows hassle-free inference in… ▽ More

    Submitted 21 November, 2023; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: * Fix mathematical typos * Add conference info

  8. arXiv:2311.00377  [pdf, other

    cs.LG stat.AP

    Uncertainty quantification and out-of-distribution detection using surjective normalizing flows

    Authors: Simon Dirmeier, Ye Hong, Yanan Xin, Fernando Perez-Cruz

    Abstract: Reliable quantification of epistemic and aleatoric uncertainty is of crucial importance in applications where models are trained in one environment but applied to multiple different environments, often seen in real-world applications for example, in climate science or mobility analysis. We propose a simple approach using surjective normalizing flows to identify out-of-distribution data sets in dee… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  9. arXiv:2308.01054  [pdf, other

    stat.ML cs.LG stat.ME

    Simulation-based inference using surjective sequential neural likelihood estimation

    Authors: Simon Dirmeier, Carlo Albert, Fernando Perez-Cruz

    Abstract: We present Surjective Sequential Neural Likelihood (SSNL) estimation, a novel method for simulation-based inference in models where the evaluation of the likelihood function is not tractable and only a simulator that can generate synthetic data is available. SSNL fits a dimensionality-reducing surjective normalizing flow model and uses it as a surrogate likelihood function which allows for convent… ▽ More

    Submitted 23 February, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

  10. arXiv:2306.15283  [pdf, other

    stat.ML cs.LG

    Adaptive Annealed Importance Sampling with Constant Rate Progress

    Authors: Shirin Goshtasbpour, Victor Cohen, Fernando Perez-Cruz

    Abstract: Annealed Importance Sampling (AIS) synthesizes weighted samples from an intractable distribution given its unnormalized density function. This algorithm relies on a sequence of interpolating distributions bridging the target to an initial tractable distribution such as the well-known geometric mean path of unnormalized distributions which is assumed to be suboptimal in general. In this paper, we p… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

  11. arXiv:2306.01545  [pdf, other

    cs.CL cs.AI cs.CR

    PassGPT: Password Modeling and (Guided) Generation with Large Language Models

    Authors: Javier Rando, Fernando Perez-Cruz, Briland Hitaj

    Abstract: Large language models (LLMs) successfully model natural language from vast amounts of text without the need for explicit supervision. In this paper, we investigate the efficacy of LLMs in modeling passwords. We present PassGPT, a LLM trained on password leaks for password generation. PassGPT outperforms existing methods based on generative adversarial networks (GAN) by guessing twice as many previ… ▽ More

    Submitted 14 June, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

  12. arXiv:2303.08984  [pdf, other

    physics.acc-ph cs.LG physics.data-an

    Forecasting Particle Accelerator Interruptions Using Logistic LASSO Regression

    Authors: Sichen Li, Jochem Snuverink, Fernando Perez-Cruz, Andreas Adelmann

    Abstract: Unforeseen particle accelerator interruptions, also known as interlocks, lead to abrupt operational changes despite being necessary safety measures. These may result in substantial loss of beam time and perhaps even equipment damage. We propose a simple yet powerful binary classification model aiming to forecast such interruptions, in the case of the High Intensity Proton Accelerator complex at th… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 12 pages, 13 figures

  13. arXiv:2302.10062  [pdf, other

    cs.LG physics.ao-ph

    An evaluation of deep learning models for predicting water depth evolution in urban floods

    Authors: Stefania Russo, Nathanaël Perraudin, Steven Stalder, Fernando Perez-Cruz, Joao Paulo Leitao, Guillaume Obozinski, Jan Dirk Wegner

    Abstract: In this technical report we compare different deep learning models for prediction of water depth rasters at high spatial resolution. Efficient, accurate, and fast methods for water depth prediction are nowadays important as urban floods are increasing due to higher rainfall intensity caused by climate change, expansion of cities and changes in land use. While hydrodynamic models models can provide… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

  14. Design Space Exploration and Explanation via Conditional Variational Autoencoders in Meta-model-based Conceptual Design of Pedestrian Bridges

    Authors: Vera M. Balmer, Sophia V. Kuhn, Rafael Bischof, Luis Salamanca, Walter Kaufmann, Fernando Perez-Cruz, Michael A. Kraus

    Abstract: For conceptual design, engineers rely on conventional iterative (often manual) techniques. Emerging parametric models facilitate design space exploration based on quantifiable performance metrics, yet remain time-consuming and computationally expensive. Pure optimisation methods, however, ignore qualitative aspects (e.g. aesthetics or construction methods). This paper provides a performance-driven… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Journal ref: Automation in Construction Volume 163, July 2024, 105411

  15. Vision Paper: Causal Inference for Interpretable and Robust Machine Learning in Mobility Analysis

    Authors: Yanan Xin, Natasa Tagasovska, Fernando Perez-Cruz, Martin Raubal

    Abstract: Artificial intelligence (AI) is revolutionizing many areas of our lives, leading a new era of technological advancement. Particularly, the transportation sector would benefit from the progress in AI and advance the development of intelligent transportation systems. Building intelligent transportation systems requires an intricate combination of artificial intelligence and mobility analysis. The pa… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: accepted by ACM SIGSPATIAL 2022 Conference

    ACM Class: I.2; J.2

  16. arXiv:2209.13226  [pdf, other

    stat.ML cs.LG

    Optimization of Annealed Importance Sampling Hyperparameters

    Authors: Shirin Goshtasbpour, Fernando Perez-Cruz

    Abstract: Annealed Importance Sampling (AIS) is a popular algorithm used to estimates the intractable marginal likelihood of deep generative models. Although AIS is guaranteed to provide unbiased estimate for any set of hyperparameters, the common implementations rely on simple heuristics such as the geometric average bridging distributions between initial and the target distribution which affect the estima… ▽ More

    Submitted 8 October, 2022; v1 submitted 27 September, 2022; originally announced September 2022.

  17. Facilitated machine learning for image-based fruit quality assessment

    Authors: Manuel Knott, Fernando Perez-Cruz, Thijs Defraeye

    Abstract: Image-based machine learning models can be used to make the sorting and grading of agricultural products more efficient. In many regions, implementing such systems can be difficult due to the lack of centralization and automation of postharvest supply chains. Stakeholders are often too small to specialize in machine learning, and large training data sets are unavailable. We propose a machine learn… ▽ More

    Submitted 9 January, 2023; v1 submitted 10 July, 2022; originally announced July 2022.

  18. arXiv:2206.08612  [pdf, other

    eess.IV cs.CV

    OADAT: Experimental and Synthetic Clinical Optoacoustic Data for Standardized Image Processing

    Authors: Firat Ozdemir, Berkan Lafci, Xosé Luís Deán-Ben, Daniel Razansky, Fernando Perez-Cruz

    Abstract: Optoacoustic (OA) imaging is based on excitation of biological tissues with nanosecond-duration laser pulses followed by subsequent detection of ultrasound waves generated via light-absorption-mediated thermoelastic expansion. OA imaging features a powerful combination between rich optical contrast and high resolution in deep tissues. This enabled the exploration of a number of attractive new appl… ▽ More

    Submitted 3 May, 2023; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: Accepted to TMLR. 32 pages, 24 figures, 9 tables

    Journal ref: Transactions on Machine Learning Research (2023) 2835-8856

  19. arXiv:2205.11266  [pdf, other

    cs.CV cs.AI cs.LG

    What You See is What You Classify: Black Box Attributions

    Authors: Steven Stalder, Nathanaël Perraudin, Radhakrishna Achanta, Fernando Perez-Cruz, Michele Volpi

    Abstract: An important step towards explaining deep image classifiers lies in the identification of image regions that contribute to individual class scores in the model's output. However, doing this accurately is a difficult task due to the black-box nature of such networks. Most existing approaches find such attributions either using activations and gradients or by repeatedly perturbing the input. We inst… ▽ More

    Submitted 7 October, 2022; v1 submitted 23 May, 2022; originally announced May 2022.

  20. arXiv:2202.06091  [pdf, other

    cs.CR cs.IT cs.LG

    TATTOOED: A Robust Deep Neural Network Watermarking Scheme based on Spread-Spectrum Channel Coding

    Authors: Giulio Pagnotta, Dorjan Hitaj, Briland Hitaj, Fernando Perez-Cruz, Luigi V. Mancini

    Abstract: Watermarking of deep neural networks (DNNs) has gained significant traction in recent years, with numerous (watermarking) strategies being proposed as mechanisms that can help verify the ownership of a DNN in scenarios where these models are obtained without the permission of the owner. However, a growing body of work has demonstrated that existing watermarking mechanisms are highly susceptible to… ▽ More

    Submitted 3 June, 2024; v1 submitted 12 February, 2022; originally announced February 2022.

    Comments: 12 pages

  21. arXiv:2201.12059  [pdf, other

    cs.LG stat.ME stat.ML

    Learning Summary Statistics for Bayesian Inference with Autoencoders

    Authors: Carlo Albert, Simone Ulzega, Firat Ozdemir, Fernando Perez-Cruz, Antonietta Mira

    Abstract: For stochastic models with intractable likelihood functions, approximate Bayesian computation offers a way of approximating the true posterior through repeated comparisons of observations with simulated model outputs in terms of a small set of summary statistics. These statistics need to retain the information that is relevant for constraining the parameters but cancel out the noise. They can thus… ▽ More

    Submitted 23 May, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

  22. arXiv:2201.09841  [pdf, ps, other

    cs.IT cs.NI

    Deep Reinforcement Learning for Random Access in Machine-Type Communication

    Authors: Muhammad Awais Jadoon, Adriano Pastore, Monica Navarro, Fernando Perez-Cruz

    Abstract: Random access (RA) schemes are a topic of high interest in machine-type communication (MTC). In RA protocols, backoff techniques such as exponential backoff (EB) are used to stabilize the system to avoid low throughput and excessive delays. However, these backoff techniques show varying performance for different underlying assumptions and analytical models. Therefore, finding a better transmission… ▽ More

    Submitted 24 January, 2022; originally announced January 2022.

    Comments: 6 pages, 9 figures, conference paper accepted in IEEE WCNC'22

  23. arXiv:2201.08786  [pdf, other

    cs.CR cs.LG

    FedComm: Federated Learning as a Medium for Covert Communication

    Authors: Dorjan Hitaj, Giulio Pagnotta, Briland Hitaj, Fernando Perez-Cruz, Luigi V. Mancini

    Abstract: Proposed as a solution to mitigate the privacy implications related to the adoption of deep learning, Federated Learning (FL) enables large numbers of participants to successfully train deep neural networks without having to reveal the actual private training data. To date, a substantial amount of research has investigated the security and privacy properties of FL, resulting in a plethora of innov… ▽ More

    Submitted 17 May, 2023; v1 submitted 21 January, 2022; originally announced January 2022.

    Comments: 13 pages

  24. arXiv:2109.13235  [pdf, other

    cs.LG cs.AI

    Probabilistic modeling of lake surface water temperature using a Bayesian spatio-temporal graph convolutional neural network

    Authors: Michael Stalder, Firat Ozdemir, Artur Safin, Jonas Sukys, Damien Bouffard, Fernando Perez-Cruz

    Abstract: Accurate lake temperature estimation is essential for numerous problems tackled in both hydrological and ecological domains. Nowadays physical models are developed to estimate lake dynamics; however, computations needed for accurate estimation of lake surface temperature can get prohibitively expensive. We propose to aggregate simulations of lake temperature at a certain depth together with a rang… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: 7 pages, 2 figures

  25. arXiv:2109.12014  [pdf, other

    cs.SD cs.LG eess.AS

    A data acquisition setup for data driven acoustic design

    Authors: Romana Rust, Achilleas Xydis, Kurt Heutschi, Nathanaël Perraudin, Gonzalo Casas, Chaoyu Du, Jürgen Strauss, Kurt Eggenschwiler, Fernando Perez-Cruz, Fabio Gramazio, Matthias Kohler

    Abstract: In this paper, we present a novel interdisciplinary approach to study the relationship between diffusive surface structures and their acoustic performance. Using computational design, surface structures are iteratively generated and 3D printed at 1:10 model scale. They originate from different fabrication typologies and are designed to have acoustic diffusion and absorption effects. An automated r… ▽ More

    Submitted 24 September, 2021; originally announced September 2021.

    Journal ref: Building Acoustics. February 2021

  26. arXiv:2108.10764  [pdf, ps, other

    cs.CL cs.LG

    Regularizing Transformers With Deep Probabilistic Layers

    Authors: Aurora Cobo Aguilera, Pablo Martínez Olmos, Antonio Artés-Rodríguez, Fernando Pérez-Cruz

    Abstract: Language models (LM) have grown with non-stop in the last decade, from sequence-to-sequence architectures to the state-of-the-art and utter attention-based Transformers. In this work, we demonstrate how the inclusion of deep generative models within BERT can bring more versatile models, able to impute missing/noisy words with richer text or even improve BLEU score. More precisely, we use a Gaussia… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

  27. arXiv:2102.00786  [pdf, other

    physics.acc-ph cs.LG eess.SP

    A Novel Approach for Classification and Forecasting of Time Series in Particle Accelerators

    Authors: Sichen Li, Mélissa Zacharias, Jochem Snuverink, Jaime Coello de Portugal, Fernando Perez-Cruz, Davide Reggiani, Andreas Adelmann

    Abstract: The beam interruptions (interlocks) of particle accelerators, despite being necessary safety measures, lead to abrupt operational changes and a substantial loss of beam time. A novel time series classification approach is applied to decrease beam time loss in the High Intensity Proton Accelerator complex by forecasting interlock events. The forecasting is performed through binary classification of… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Journal ref: Information 2021, 12(3), 121

  28. arXiv:2006.02734  [pdf, ps, other

    cs.LG stat.ML

    Robust Sampling in Deep Learning

    Authors: Aurora Cobo Aguilera, Antonio Artés-Rodríguez, Fernando Pérez-Cruz, Pablo Martínez Olmos

    Abstract: Deep learning requires regularization mechanisms to reduce overfitting and improve generalization. We address this problem by a new regularization method based on distributional robust optimization. The key idea is to modify the contribution from each sample for tightening the empirical risk bound. During the stochastic training, the selection of samples is done according to their accuracy in such… ▽ More

    Submitted 5 June, 2020; v1 submitted 4 June, 2020; originally announced June 2020.

    Comments: 8 pages, 3 figures

  29. arXiv:1911.01425  [pdf, other

    stat.ML cs.CV cs.LG

    Improved BiGAN training with marginal likelihood equalization

    Authors: Pablo Sánchez-Martín, Pablo M. Olmos, Fernando Perez-Cruz

    Abstract: We propose a novel training procedure for improving the performance of generative adversarial networks (GANs), especially to bidirectional GANs. First, we enforce that the empirical distribution of the inverse inference network matches the prior distribution, which favors the generator network reproducibility on the seen samples. Second, we have found that the marginal log-likelihood of the sample… ▽ More

    Submitted 23 May, 2020; v1 submitted 4 November, 2019; originally announced November 2019.

  30. arXiv:1910.06569  [pdf, other

    cs.LG eess.SP stat.ML

    Probabilistic Time of Arrival Localization

    Authors: Fernando Perez-Cruz, Pablo M. Olmos, Michael Minyi Zhang, Howard Huang

    Abstract: In this paper, we take a new approach for time of arrival geo-localization. We show that the main sources of error in metropolitan areas are due to environmental imperfections that bias our solutions, and that we can rely on a probabilistic model to learn and compensate for them. The resulting localization error is validated using measurements from a live LTE cellular network to be less than 10 me… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

    Comments: IEEE Signal Processing Letters, 2019

  31. Probabilistic MIMO Symbol Detection with Expectation Consistency Approximate Inference

    Authors: Javier Cépedes, Pablo M. Olmos, Matilde Sánchez-Fernández, Fernando Pérez-Cruz

    Abstract: In this paper we explore low-complexity probabilistic algorithms for soft symbol detection in high-dimensional multiple-input multiple-output (MIMO) systems. We present a novel algorithm based on the Expectation Consistency (EC) framework, which describes the approximate inference problem as an optimization over a non-convex function. EC generalizes algorithms such as Belief Propagation and Expect… ▽ More

    Submitted 2 October, 2019; originally announced October 2019.

    Journal ref: IEEE Transactions on Vehicular Technology ( Volume: 67 , Issue: 4 , April 2018 )

  32. arXiv:1901.09557  [pdf, other

    cs.LG stat.ML

    Out-of-Sample Testing for GANs

    Authors: Pablo Sánchez-Martín, Pablo M. Olmos, Fernando Pérez-Cruz

    Abstract: We propose a new method to evaluate GANs, namely EvalGAN. EvalGAN relies on a test set to directly measure the reconstruction quality in the original sample space (no auxiliary networks are necessary), and it also computes the (log)likelihood for the reconstructed samples in the test set. Further, EvalGAN is agnostic to the GAN algorithm and the dataset. We decided to test it on three state-of-the… ▽ More

    Submitted 28 January, 2019; originally announced January 2019.

  33. arXiv:1810.09261  [pdf, other

    eess.SP cs.IT cs.LG stat.ML

    Infinite Factorial Finite State Machine for Blind Multiuser Channel Estimation

    Authors: Francisco J. R. Ruiz, Isabel Valera, Lennart Svensson, Fernando Perez-Cruz

    Abstract: New communication standards need to deal with machine-to-machine communications, in which users may start or stop transmitting at any time in an asynchronous manner. Thus, the number of users is an unknown and time-varying parameter that needs to be accurately estimated in order to properly recover the symbols transmitted by all users in the system. In this paper, we address the problem of joint c… ▽ More

    Submitted 18 October, 2018; originally announced October 2018.

    Comments: 15 pages, 15 figures

    Journal ref: IEEE Transactions on Cognitive Communications and Networking, June 2018, Vol 2, Issue 2, pages 177-191

  34. arXiv:1806.11518  [pdf, ps, other

    cs.LG stat.ML

    Sparse Three-parameter Restricted Indian Buffet Process for Understanding International Trade

    Authors: Melanie F. Pradier, Viktor Stojkoski, Zoran Utkovski, Ljupco Kocarev, Fernando Perez-Cruz

    Abstract: This paper presents a Bayesian nonparametric latent feature model specially suitable for exploratory analysis of high-dimensional count data. We perform a non-negative doubly sparse matrix factorization that has two main advantages: not only we are able to better approximate the row input distributions, but the inferred topics are also easier to interpret. By combining the three-parameter and rest… ▽ More

    Submitted 29 June, 2018; originally announced June 2018.

    Comments: To appear in the proceedings of ICASSP 2018

  35. arXiv:1709.00440  [pdf, other

    cs.CR cs.LG stat.ML

    PassGAN: A Deep Learning Approach for Password Guessing

    Authors: Briland Hitaj, Paolo Gasti, Giuseppe Ateniese, Fernando Perez-Cruz

    Abstract: State-of-the-art password guessing tools, such as HashCat and John the Ripper, enable users to check billions of passwords per second against password hashes. In addition to performing straightforward dictionary attacks, these tools can expand password dictionaries using password generation rules, such as concatenation of words (e.g., "password123456") and leet speak (e.g., "password" becomes "p4s… ▽ More

    Submitted 14 February, 2019; v1 submitted 1 September, 2017; originally announced September 2017.

    Comments: This is an extended version of the paper which appeared in NeurIPS 2018 Workshop on Security in Machine Learning (SecML'18), see https://github.com/secml2018/secml2018.github.io/raw/master/PASSGAN_SECML2018.pdf

  36. arXiv:1702.07464  [pdf, other

    cs.CR cs.LG stat.ML

    Deep Models Under the GAN: Information Leakage from Collaborative Deep Learning

    Authors: Briland Hitaj, Giuseppe Ateniese, Fernando Perez-Cruz

    Abstract: Deep Learning has recently become hugely popular in machine learning, providing significant improvements in classification accuracy in the presence of highly-structured and large databases. Researchers have also considered privacy implications of deep learning. Models are typically trained in a centralized manner with all the data being processed by the same training algorithm. If the data is a… ▽ More

    Submitted 14 September, 2017; v1 submitted 24 February, 2017; originally announced February 2017.

    Comments: ACM CCS'17, 16 pages, 18 figures

  37. Complex-Valued Kernel Methods for Regression

    Authors: Rafael Boloix-Tortosa, Juan José Murillo-Fuentes, Irene Santos Velázquez, Fernando Pérez-Cruz

    Abstract: Usually, complex-valued RKHS are presented as an straightforward application of the real-valued case. In this paper we prove that this procedure yields a limited solution for regression. We show that another kernel, here denoted as pseudo kernel, is needed to learn any function in complex-valued fields. Accordingly, we derive a novel RKHS to include it, the widely RKHS (WRKHS). When the pseudo-ker… ▽ More

    Submitted 31 October, 2016; originally announced October 2016.

    Comments: 8 pages, 9 figures

    Journal ref: IEEE Transactions on Signal Processing (Volume: 65, Issue: 19, Oct.1, 1 2017)

  38. arXiv:1502.05988  [pdf, other

    cs.LG cs.AI

    Deep Learning for Multi-label Classification

    Authors: Jesse Read, Fernando Perez-Cruz

    Abstract: In multi-label classification, the main focus has been to develop ways of learning the underlying dependencies between labels, and to take advantage of this at classification time. Develo** better feature-space representations has been predominantly employed to reduce complexity, e.g., by eliminating non-helpful feature attributes from the input space prior to (or during) training. This is an im… ▽ More

    Submitted 17 December, 2014; originally announced February 2015.

  39. arXiv:1401.7620  [pdf, other

    stat.ML cs.LG

    Bayesian nonparametric comorbidity analysis of psychiatric disorders

    Authors: Francisco J. R. Ruiz, Isabel Valera, Carlos Blanco, Fernando Perez-Cruz

    Abstract: The analysis of comorbidity is an open and complex research field in the branch of psychiatry, where clinical experience and several studies suggest that the relation among the psychiatric disorders may have etiological and treatment implications. In this paper, we are interested in applying latent feature modeling to find the latent structure behind the psychiatric disorders that can help to exam… ▽ More

    Submitted 29 January, 2014; originally announced January 2014.

    Comments: Submitted to Journal of Machine Learning Research

  40. arXiv:1303.2823  [pdf, other

    cs.LG cs.IT stat.ML

    Gaussian Processes for Nonlinear Signal Processing

    Authors: Fernando Pérez-Cruz, Steven Van Vaerenbergh, Juan José Murillo-Fuentes, Miguel Lázaro-Gredilla, Ignacio Santamaria

    Abstract: Gaussian processes (GPs) are versatile tools that have been successfully employed to solve nonlinear estimation problems in machine learning, but that are rarely used in signal processing. In this tutorial, we present GPs for regression as a natural nonlinear extension to optimal Wiener filtering. After establishing their basic formulation, we discuss several important aspects and extensions, incl… ▽ More

    Submitted 27 September, 2013; v1 submitted 12 March, 2013; originally announced March 2013.

    Journal ref: IEEE Signal Processing Magazine, vol.30, no.4, pp.40-50, July 2013

  41. Tree-Structure Expectation Propagation for LDPC Decoding over the BEC

    Authors: Pablo M. Olmos, Juan José Murillo-Fuentes, Fernando Pérez-Cruz

    Abstract: We present the tree-structure expectation propagation (Tree-EP) algorithm to decode low-density parity-check (LDPC) codes over discrete memoryless channels (DMCs). EP generalizes belief propagation (BP) in two ways. First, it can be used with any exponential family distribution over the cliques in the graph. Second, it can impose additional constraints on the marginal distributions. We use this se… ▽ More

    Submitted 13 August, 2012; v1 submitted 3 January, 2012; originally announced January 2012.

    Journal ref: IEEE Transactions on Information Theory 2013

  42. arXiv:1009.4287   

    cs.IT

    Tree-Structure Expectation Propagation for LDPC Decoding in Erasure Channels

    Authors: Pablo M. Olmos, Juan José Murillo-Fuentes, Fernando Pérez-Cruz

    Abstract: In this paper we present a new algorithm, denoted as TEP, to decode low-density parity-check (LDPC) codes over the Binary Erasure Channel (BEC). The TEP decoder is derived applying the expectation propagation (EP) algorithm with a tree- structured approximation. Expectation Propagation (EP) is a generalization to Belief Propagation (BP) in two ways. First, it can be used with any exponential famil… ▽ More

    Submitted 4 January, 2012; v1 submitted 22 September, 2010; originally announced September 2010.

    Comments: This paper has been withdrawn to be replaced by a corrected version under a different title: "Tree-Structure Expectation Propagation for LDPC Decoding over the BEC"

  43. arXiv:1006.0795  [pdf, other

    cs.IT

    Channel Decoding with a Bayesian Equalizer

    Authors: Luis Salamanca, Juan José Murillo-Fuentes, Fernando Pérez-Cruz

    Abstract: Low-density parity-check (LPDC) decoders assume the channel estate information (CSI) is known and they have the true a posteriori probability (APP) for each transmitted bit. But in most cases of interest, the CSI needs to be estimated with the help of a short training sequence and the LDPC decoder has to decode the received word using faulty APP estimates. In this paper, we study the uncertainty i… ▽ More

    Submitted 4 June, 2010; originally announced June 2010.

    Comments: 5 pages, 6 figures, ISIT 2010 conference