Search | arXiv e-print repository

arXiv:2305.19901 [pdf, other]

Adaptive Conformal Regression with Jackknife+ Rescaled Scores

Authors: Nicolas Deutschmann, Mattia Rigotti, Maria Rodriguez Martinez

Abstract: Conformal regression provides prediction intervals with global coverage guarantees, but often fails to capture local error distributions, leading to non-homogeneous coverage. We address this with a new adaptive method based on rescaling conformal scores with an estimate of local score distribution, inspired by the Jackknife+ method, which enables the use of calibration data in conformal scores wit… ▽ More Conformal regression provides prediction intervals with global coverage guarantees, but often fails to capture local error distributions, leading to non-homogeneous coverage. We address this with a new adaptive method based on rescaling conformal scores with an estimate of local score distribution, inspired by the Jackknife+ method, which enables the use of calibration data in conformal scores without breaking calibration-test exchangeability. Our approach ensures formal global coverage guarantees and is supported by new theoretical results on local coverage, including an a posteriori bound on any calibration score. The strength of our approach lies in achieving local coverage without sacrificing calibration set size, improving the applicability of conformal prediction intervals in various settings. As a result, our method provides prediction intervals that outperform previous methods, particularly in the low-data regime, making it especially relevant for real-world applications such as healthcare and biomedical domains where uncertainty needs to be quantified accurately despite low sample data. △ Less

Submitted 31 May, 2023; originally announced May 2023.

Comments: 24 pages, 7 figures

arXiv:2302.02406 [pdf, other]

Pre-screening breast cancer with machine learning and deep learning

Authors: Rolando Gonzales Martinez, Daan-Max van Dongen

Abstract: We suggest that deep learning can be used for pre-screening cancer by analyzing demographic and anthropometric information of patients, as well as biological markers obtained from routine blood samples and relative risks obtained from meta-analysis and international databases. We applied feature selection algorithms to a database of 116 women, including 52 healthy women and 64 women diagnosed with… ▽ More We suggest that deep learning can be used for pre-screening cancer by analyzing demographic and anthropometric information of patients, as well as biological markers obtained from routine blood samples and relative risks obtained from meta-analysis and international databases. We applied feature selection algorithms to a database of 116 women, including 52 healthy women and 64 women diagnosed with breast cancer, to identify the best pre-screening predictors of cancer. We utilized the best predictors to perform k-fold Monte Carlo cross-validation experiments that compare deep learning against traditional machine learning algorithms. Our results indicate that a deep learning model with an input-layer architecture that is fine-tuned using feature selection can effectively distinguish between patients with and without cancer. Additionally, compared to machine learning, deep learning has the lowest uncertainty in its predictions. These findings suggest that deep learning algorithms applied to cancer pre-screening offer a radiation-free, non-invasive, and affordable complement to screening methods based on imagery. The implementation of deep learning algorithms in cancer pre-screening offer opportunities to identify individuals who may require imaging-based screening, can encourage self-examination, and decrease the psychological externalities associated with false positives in cancer screening. The integration of deep learning algorithms for both screening and pre-screening will ultimately lead to earlier detection of malignancy, reducing the healthcare and societal burden associated to cancer treatment. △ Less

Submitted 5 February, 2023; originally announced February 2023.

arXiv:2106.10086 [pdf, other]

It's FLAN time! Summing feature-wise latent representations for interpretability

Authors: An-phi Nguyen, Maria Rodriguez Martinez

Abstract: Interpretability has become a necessary feature for machine learning models deployed in critical scenarios, e.g. legal system, healthcare. In these situations, algorithmic decisions may have (potentially negative) long-lasting effects on the end-user affected by the decision. In many cases, the representational power of deep learning models is not needed, therefore simple and interpretable models… ▽ More Interpretability has become a necessary feature for machine learning models deployed in critical scenarios, e.g. legal system, healthcare. In these situations, algorithmic decisions may have (potentially negative) long-lasting effects on the end-user affected by the decision. In many cases, the representational power of deep learning models is not needed, therefore simple and interpretable models (e.g. linear models) should be preferred. However, in high-dimensional and/or complex domains (e.g. computer vision), the universal approximation capabilities of neural networks are required. Inspired by linear models and the Kolmogorov-Arnold representation theorem, we propose a novel class of structurally-constrained neural networks, which we call FLANs (Feature-wise Latent Additive Networks). Crucially, FLANs process each input feature separately, computing for each of them a representation in a common latent space. These feature-wise latent representations are then simply summed, and the aggregated representation is used for prediction. These constraints (which are at the core of the interpretability of linear models) allow a user to estimate the effect of each individual feature independently from the others, enhancing interpretability. In a set of experiments across different domains, we show how without compromising excessively the test performance, the structural constraints proposed in FLANs indeed facilitates the interpretability of deep learning models. We quantitatively compare FLANs interpretability to post-hoc methods using recently introduced metrics, discussing the advantages of natively interpretable models over a post-hoc analysis. △ Less

Submitted 20 December, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

arXiv:2007.07591 [pdf, other]

Learning Invariances for Interpretability using Supervised VAE

Authors: An-phi Nguyen, María Rodríguez Martínez

Abstract: We propose to learn model invariances as a means of interpreting a model. This is motivated by a reverse engineering principle. If we understand a problem, we may introduce inductive biases in our model in the form of invariances. Conversely, when interpreting a complex supervised model, we can study its invariances to understand how that model solves a problem. To this end we propose a supervised… ▽ More We propose to learn model invariances as a means of interpreting a model. This is motivated by a reverse engineering principle. If we understand a problem, we may introduce inductive biases in our model in the form of invariances. Conversely, when interpreting a complex supervised model, we can study its invariances to understand how that model solves a problem. To this end we propose a supervised form of variational auto-encoders (VAEs). Crucially, only a subset of the dimensions in the latent space contributes to the supervised task, allowing the remaining dimensions to act as nuisance parameters. By sampling solely the nuisance dimensions, we are able to generate samples that have undergone transformations that leave the classification unchanged, revealing the invariances of the model. Our experimental results show the capability of our proposed model both in terms of classification, and generation of invariantly transformed samples. Finally we show how combining our model with feature attribution methods it is possible to reach a more fine-grained understanding about the decision process of the model. △ Less

Submitted 15 July, 2020; originally announced July 2020.

arXiv:2007.07584 [pdf, other]

On quantitative aspects of model interpretability

Authors: An-phi Nguyen, María Rodríguez Martínez

Abstract: Despite the growing body of work in interpretable machine learning, it remains unclear how to evaluate different explainability methods without resorting to qualitative assessment and user-studies. While interpretability is an inherently subjective matter, previous works in cognitive science and epistemology have shown that good explanations do possess aspects that can be objectively judged apart… ▽ More Despite the growing body of work in interpretable machine learning, it remains unclear how to evaluate different explainability methods without resorting to qualitative assessment and user-studies. While interpretability is an inherently subjective matter, previous works in cognitive science and epistemology have shown that good explanations do possess aspects that can be objectively judged apart from fidelity), such assimplicity and broadness. In this paper we propose a set of metrics to programmatically evaluate interpretability methods along these dimensions. In particular, we argue that the performance of methods along these dimensions can be orthogonally imputed to two conceptual parts, namely the feature extractor and the actual explainability method. We experimentally validate our metrics on different benchmark tasks and show how they can be used to guide a practitioner in the selection of the most appropriate method for the task at hand. △ Less

Submitted 15 July, 2020; originally announced July 2020.

arXiv:2005.13285 [pdf, other]

PaccMann$^{RL}$ on SARS-CoV-2: Designing antiviral candidates with conditional generative models

Authors: Jannis Born, Matteo Manica, Joris Cadow, Greta Markert, Nil Adell Mill, Modestas Filipavicius, María Rodríguez Martínez

Abstract: With the fast development of COVID-19 into a global pandemic, scientists around the globe are desperately searching for effective antiviral therapeutic agents. Bridging systems biology and drug discovery, we propose a deep learning framework for conditional de novo design of antiviral candidate drugs tailored against given protein targets. First, we train a multimodal ligand--protein binding affin… ▽ More With the fast development of COVID-19 into a global pandemic, scientists around the globe are desperately searching for effective antiviral therapeutic agents. Bridging systems biology and drug discovery, we propose a deep learning framework for conditional de novo design of antiviral candidate drugs tailored against given protein targets. First, we train a multimodal ligand--protein binding affinity model on predicting affinities of antiviral compounds to target proteins and couple this model with pharmacological toxicity predictors. Exploiting this multi-objective as a reward function of a conditional molecular generator (consisting of two VAEs), we showcase a framework that navigates the chemical space toward regions with more antiviral molecules. Specifically, we explore a challenging setting of generating ligands against unseen protein targets by performing a leave-one-out-cross-validation on 41 SARS-CoV-2-related target proteins. Using deep RL, it is demonstrated that in 35 out of 41 cases, the generation is biased towards sampling more binding ligands, with an average increase of 83% comparing to an unbiased VAE. We present a case-study on a potential Envelope-protein inhibitor and perform a synthetic accessibility assessment of the best generated molecules is performed that resembles a viable roadmap towards a rapid in-vitro evaluation of potential SARS-CoV-2 inhibitors. △ Less

Submitted 6 July, 2020; v1 submitted 27 May, 2020; originally announced May 2020.

Comments: 5 pages, 6 figures

Journal ref: ICML Workshop on Computational Biology 2020

arXiv:1911.13213 [pdf]

DeStress: Deep Learning for Unsupervised Identification of Mental Stress in Firefighters from Heart-rate Variability (HRV) Data

Authors: Ali Oskooei, Sophie Mai Chau, Jonas Weiss, Arvind Sridhar, María Rodríguez Martínez, Bruno Michel

Abstract: In this work we perform a study of various unsupervised methods to identify mental stress in firefighter trainees based on unlabeled heart rate variability data. We collect RR interval time series data from nearly 100 firefighter trainees that participated in a drill. We explore and compare three methods in order to perform unsupervised stress detection: 1) traditional K-Means clustering with engi… ▽ More In this work we perform a study of various unsupervised methods to identify mental stress in firefighter trainees based on unlabeled heart rate variability data. We collect RR interval time series data from nearly 100 firefighter trainees that participated in a drill. We explore and compare three methods in order to perform unsupervised stress detection: 1) traditional K-Means clustering with engineered time and frequency domain features 2) convolutional autoencoders and 3) long short-term memory (LSTM) autoencoders, both trained on the raw RRI measurements combined with DBSCAN clustering and K-Nearest-Neighbors classification. We demonstrate that K-Means combined with engineered features is unable to capture meaningful structure within the data. On the other hand, convolutional and LSTM autoencoders tend to extract varying structure from the data pointing to different clusters with different sizes of clusters. We attempt at identifying the true stressed and normal clusters using the HRV markers of mental stress reported in the literature. We demonstrate that the clusters produced by the convolutional autoencoders consistently and successfully stratify stressed versus normal samples, as validated by several established physiological stress markers such as RMSSD, Max-HR, Mean-HR and LF-HF ratio. △ Less

Submitted 18 November, 2019; originally announced November 2019.

arXiv:1909.13611 [pdf, other]

MonoNet: Towards Interpretable Models by Learning Monotonic Features

Authors: An-phi Nguyen, María Rodríguez Martínez

Abstract: Being able to interpret, or explain, the predictions made by a machine learning model is of fundamental importance. This is especially true when there is interest in deploying data-driven models to make high-stakes decisions, e.g. in healthcare. While recent years have seen an increasing interest in interpretable machine learning research, this field is currently lacking an agreed-upon definition… ▽ More Being able to interpret, or explain, the predictions made by a machine learning model is of fundamental importance. This is especially true when there is interest in deploying data-driven models to make high-stakes decisions, e.g. in healthcare. While recent years have seen an increasing interest in interpretable machine learning research, this field is currently lacking an agreed-upon definition of interpretability, and some researchers have called for a more active conversation towards a rigorous approach to interpretability. Joining this conversation, we claim in this paper that the difficulty of interpreting a complex model stems from the existing interactions among features. We argue that by enforcing monotonicity between features and outputs, we are able to reason about the effect of a single feature on an output independently from other features, and consequently better understand the model. We show how to structurally introduce this constraint in deep learning models by adding new simple layers. We validate our model on benchmark datasets, and compare our results with previously proposed interpretable models. △ Less

Submitted 30 September, 2019; originally announced September 2019.

arXiv:1909.05114 [pdf, other]

doi 10.1007/978-3-030-45257-5_18

PaccMann$^{RL}$: Designing anticancer drugs from transcriptomic data via reinforcement learning

Authors: Jannis Born, Matteo Manica, Ali Oskooei, Joris Cadow, Karsten Borgwardt, María Rodríguez Martínez

Abstract: With the advent of deep generative models in computational chemistry, in silico anticancer drug design has undergone an unprecedented transformation. While state-of-the-art deep learning approaches have shown potential in generating compounds with desired chemical properties, they disregard the genetic profile and properties of the target disease. Here, we introduce the first generative model capa… ▽ More With the advent of deep generative models in computational chemistry, in silico anticancer drug design has undergone an unprecedented transformation. While state-of-the-art deep learning approaches have shown potential in generating compounds with desired chemical properties, they disregard the genetic profile and properties of the target disease. Here, we introduce the first generative model capable of tailoring anticancer compounds for a specific biomolecular profile. Using a RL framework, the transcriptomic profiles of cancer cells are used as a context for the generation of candidate molecules. Our molecule generator combines two separately pretrained variational autoencoders (VAEs) - the first VAE encodes transcriptomic profiles into a smooth, latent space which in turn is used to condition a second VAE to generate novel molecular structures on the given transcriptomic profile. The generative process is optimized through PaccMann, a previously developed drug sensitivity prediction model to obtain effective anticancer compounds for the given context (i.e., transcriptomic profile). We demonstrate how the molecule generation can be biased towards compounds with high predicted inhibitory effect against individual cell lines or specific cancer sites. We verify our approach by investigating candidate drugs generated against specific cancer types and find the highest structural similarity to existing compounds with known efficacy against these cancer types. We envision our approach to transform in silico anticancer drug design by leveraging the biomolecular characteristics of the disease in order to increase success rates in lead compound discovery. △ Less

Submitted 16 April, 2020; v1 submitted 29 August, 2019; originally announced September 2019.

Comments: 18 pages total (12 pages main text, 4 pages references, 11 pages appendix) 8 figures

Journal ref: International Conference on Research in Computational Molecular Biology 2020

arXiv:1904.11223 [pdf, other]

doi 10.1021/acs.molpharmaceut.9b00520

Towards Explainable Anticancer Compound Sensitivity Prediction via Multimodal Attention-based Convolutional Encoders

Authors: Matteo Manica, Ali Oskooei, Jannis Born, Vigneshwari Subramanian, Julio Sáez-Rodríguez, María Rodríguez Martínez

Abstract: In line with recent advances in neural drug design and sensitivity prediction, we propose a novel architecture for interpretable prediction of anticancer compound sensitivity using a multimodal attention-based convolutional encoder. Our model is based on the three key pillars of drug sensitivity: compounds' structure in the form of a SMILES sequence, gene expression profiles of tumors and prior kn… ▽ More In line with recent advances in neural drug design and sensitivity prediction, we propose a novel architecture for interpretable prediction of anticancer compound sensitivity using a multimodal attention-based convolutional encoder. Our model is based on the three key pillars of drug sensitivity: compounds' structure in the form of a SMILES sequence, gene expression profiles of tumors and prior knowledge on intracellular interactions from protein-protein interaction networks. We demonstrate that our multiscale convolutional attention-based (MCA) encoder significantly outperforms a baseline model trained on Morgan fingerprints, a selection of encoders based on SMILES as well as previously reported state of the art for multimodal drug sensitivity prediction (R2 = 0.86 and RMSE = 0.89). Moreover, the explainability of our approach is demonstrated by a thorough analysis of the attention weights. We show that the attended genes significantly enrich apoptotic processes and that the drug attention is strongly correlated with a standard chemical structure similarity index. Finally, we report a case study of two receptor tyrosine kinase (RTK) inhibitors acting on a leukemia cell line, showcasing the ability of the model to focus on informative genes and submolecular regions of the two compounds. The demonstrated generalizability and the interpretability of our model testify its potential for in-silico prediction of anticancer compound efficacy on unseen cancer cells, positioning it as a valid solution for the development of personalized therapies as well as for the evaluation of candidate compounds in de novo drug design. △ Less

Submitted 14 July, 2019; v1 submitted 25 April, 2019; originally announced April 2019.

Comments: 11 pages, 5 figures, 1 table, Workshop on Computational Biology at the International Conference on Machine Learning (ICML), Long Beach, CA, 2019

Journal ref: Mol. Pharmaceutics 2019

arXiv:1904.08745 [pdf, ps, other]

edGNN: a Simple and Powerful GNN for Directed Labeled Graphs

Authors: Guillaume Jaume, An-phi Nguyen, María Rodríguez Martínez, Jean-Philippe Thiran, Maria Gabrani

Abstract: The ability of a graph neural network (GNN) to leverage both the graph topology and graph labels is fundamental to building discriminative node and graph embeddings. Building on previous work, we theoretically show that edGNN, our model for directed labeled graphs, is as powerful as the Weisfeiler-Lehman algorithm for graph isomorphism. Our experiments support our theoretical findings, confirming… ▽ More The ability of a graph neural network (GNN) to leverage both the graph topology and graph labels is fundamental to building discriminative node and graph embeddings. Building on previous work, we theoretically show that edGNN, our model for directed labeled graphs, is as powerful as the Weisfeiler-Lehman algorithm for graph isomorphism. Our experiments support our theoretical findings, confirming that graph neural networks can be used effectively for inference problems on directed graphs with both node and edge labels. Code available at https://github.com/guillaumejaume/edGNN. △ Less

Submitted 4 December, 2019; v1 submitted 18 April, 2019; originally announced April 2019.

Comments: Representation Learning on Graphs and Manifolds @ ICLR19

arXiv:1811.09619 [pdf, other]

Inference of the three-dimensional chromatin structure and its temporal behavior

Authors: Bianca-Cristina Cristescu, Zalán Borsos, John Lygeros, María Rodríguez Martínez, Maria Anna Rapsomaniki

Abstract: Understanding the three-dimensional (3D) structure of the genome is essential for elucidating vital biological processes and their links to human disease. To determine how the genome folds within the nucleus, chromosome conformation capture methods such as HiC have recently been employed. However, computational methods that exploit the resulting high-throughput, high-resolution data are still suff… ▽ More Understanding the three-dimensional (3D) structure of the genome is essential for elucidating vital biological processes and their links to human disease. To determine how the genome folds within the nucleus, chromosome conformation capture methods such as HiC have recently been employed. However, computational methods that exploit the resulting high-throughput, high-resolution data are still suffering from important limitations. In this work, we explore the idea of manifold learning for the 3D chromatin structure inference and present a novel method, REcurrent Autoencoders for CHromatin 3D structure prediction (REACH-3D). Our framework employs autoencoders with recurrent neural units to reconstruct the chromatin structure. In comparison to existing methods, REACH-3D makes no transfer function assumption and permits dynamic analysis. Evaluating REACH-3D on synthetic data indicated high agreement with the ground truth. When tested on real experimental HiC data, REACH-3D recovered most faithfully the expected biological properties and obtained the highest correlation coefficient with microscopy measurements. Last, REACH-3D was applied to dynamic HiC data, where it successfully modeled chromatin conformation during the cell cycle. △ Less

Submitted 22 November, 2018; originally announced November 2018.

Comments: 10 pages, 7 figures, 1 algorithm. Neural Information Processing Systems, Machine Learning for Molecules and Materials, 2018

arXiv:1808.06603 [pdf]

Network-based Biased Tree Ensembles (NetBiTE) for Drug Sensitivity Prediction and Drug Sensitivity Biomarker Identification in Cancer

Authors: Ali Oskooei, Matteo Manica, Roland Mathis, Maria Rodriguez Martinez

Abstract: We present the Network-based Biased Tree Ensembles (NetBiTE) method for drug sensitivity prediction and drug sensitivity biomarker identification in cancer using a combination of prior knowledge and gene expression data. Our devised method consists of a biased tree ensemble that is built according to a probabilistic bias weight distribution. The bias weight distribution is obtained from the assign… ▽ More We present the Network-based Biased Tree Ensembles (NetBiTE) method for drug sensitivity prediction and drug sensitivity biomarker identification in cancer using a combination of prior knowledge and gene expression data. Our devised method consists of a biased tree ensemble that is built according to a probabilistic bias weight distribution. The bias weight distribution is obtained from the assignment of high weights to the drug targets and propagating the assigned weights over a protein-protein interaction network such as STRING. The propagation of weights, defines neighborhoods of influence around the drug targets and as such simulates the spread of perturbations within the cell, following drug administration. Using a synthetic dataset, we showcase how application of biased tree ensembles (BiTE) results in significant accuracy gains at a much lower computational cost compared to the unbiased random forests (RF) algorithm. We then apply NetBiTE to the Genomics of Drug Sensitivity in Cancer (GDSC) dataset and demonstrate that NetBiTE outperforms RF in predicting IC50 drug sensitivity, only for drugs that target membrane receptor pathways (MRPs): RTK, EGFR and IGFR signaling pathways. We propose based on the NetBiTE results, that for drugs that inhibit MRPs, the expression of target genes prior to drug administration is a biomarker for IC50 drug sensitivity following drug administration. We further verify and reinforce this proposition through control studies on, PI3K/MTOR signaling pathway inhibitors, a drug category that does not target MRPs, and through assignment of dummy targets to MRP inhibiting drugs and investigating the variation in NetBiTE accuracy. △ Less

Submitted 26 April, 2019; v1 submitted 18 August, 2018; originally announced August 2018.

Comments: 36 pages, 5 figures, 3 supplementary figures

arXiv:1807.00692 [pdf, other]

Grapevine: A Wine Prediction Algorithm Using Multi-dimensional Clustering Methods

Authors: Richard Diehl Martinez, Geoffrey Angus, Rooz Mahdavian

Abstract: We present a method for a wine recommendation system that employs multidimensional clustering and unsupervised learning methods. Our algorithm first performs clustering on a large corpus of wine reviews. It then uses the resulting wine clusters as an approximation of the most common flavor palates, recommending a user a wine by optimizing over a price-quality ratio within clusters that they demons… ▽ More We present a method for a wine recommendation system that employs multidimensional clustering and unsupervised learning methods. Our algorithm first performs clustering on a large corpus of wine reviews. It then uses the resulting wine clusters as an approximation of the most common flavor palates, recommending a user a wine by optimizing over a price-quality ratio within clusters that they demonstrated a preference for. △ Less

Submitted 29 June, 2018; originally announced July 2018.

arXiv:1803.11274 [pdf, other]

doi 10.1038/s41540-019-0086-3

PIMKL: Pathway Induced Multiple Kernel Learning

Authors: Matteo Manica, Joris Cadow, Roland Mathis, María Rodríguez Martínez

Abstract: Reliable identification of molecular biomarkers is essential for accurate patient stratification. While state-of-the-art machine learning approaches for sample classification continue to push boundaries in terms of performance, most of these methods are not able to integrate different data types and lack generalization power, limiting their application in a clinical setting. Furthermore, many meth… ▽ More Reliable identification of molecular biomarkers is essential for accurate patient stratification. While state-of-the-art machine learning approaches for sample classification continue to push boundaries in terms of performance, most of these methods are not able to integrate different data types and lack generalization power, limiting their application in a clinical setting. Furthermore, many methods behave as black boxes, and we have very little understanding about the mechanisms that lead to the prediction. While opaqueness concerning machine behaviour might not be a problem in deterministic domains, in health care, providing explanations about the molecular factors and phenotypes that are driving the classification is crucial to build trust in the performance of the predictive system. We propose Pathway Induced Multiple Kernel Learning (PIMKL), a novel methodology to reliably classify samples that can also help gain insights into the molecular mechanisms that underlie the classification. PIMKL exploits prior knowledge in the form of a molecular interaction network and annotated gene sets, by optimizing a mixture of pathway-induced kernels using a Multiple Kernel Learning (MKL) algorithm, an approach that has demonstrated excellent performance in different machine learning applications. After optimizing the combination of kernels for prediction of a specific phenotype, the model provides a stable molecular signature that can be interpreted in the light of the ingested prior knowledge and that can be used in transfer learning tasks. △ Less

Submitted 5 July, 2018; v1 submitted 29 March, 2018; originally announced March 2018.

Journal ref: npj Systems Biology and Applications (2019)

arXiv:1803.04235 [pdf, ps, other]

Approximate Bayesian Computation in controlled branching processes: the role of summary statistics

Authors: M. González, R. Martínez, C. Minuesa, I. del Puerto

Abstract: Controlled branching processes are stochastic growth population models in which the number of individuals with reproductive capacity in each generation is controlled by a random control function. The purpose of this work is to examine the Approximate Bayesian Computation (ABC) methods and to propose appropriate summary statistics for them in the context of these processes. This methodology enables… ▽ More Controlled branching processes are stochastic growth population models in which the number of individuals with reproductive capacity in each generation is controlled by a random control function. The purpose of this work is to examine the Approximate Bayesian Computation (ABC) methods and to propose appropriate summary statistics for them in the context of these processes. This methodology enables to approximate the posterior distribution of the parameters of interest satisfactorily without explicit likelihood calculations and under a minimal set of assumptions. In particular, the tolerance rejection algorithm, the sequential Monte Carlo ABC algorithm, and a post-sampling correction method based on local-linear regression are provided. The accuracy of the proposed methods are illustrated and compared with a "likelihood free" Markov chain Monte Carlo technique by the way of a simulated example developed with the statistical software R. △ Less

Submitted 1 July, 2019; v1 submitted 12 March, 2018; originally announced March 2018.

arXiv:1801.09064 [pdf, ps, other]

Bayesian inference in Y-linked two-sex branching processes with mutations: ABC approach

Authors: Miguel González, Rodrigo Martínez, Cristina Gutiérrez

Abstract: A Y-linked two-sex branching process with mutations and blind choice of males is a suitable model for analyzing the evolution of the number of carriers of an allele and its mutations of a Y-linked gene. Considering a two-sex monogamous population, in this model each female chooses her partner from among the male population without caring about his type (i.e., the allele he carries). In this work,… ▽ More A Y-linked two-sex branching process with mutations and blind choice of males is a suitable model for analyzing the evolution of the number of carriers of an allele and its mutations of a Y-linked gene. Considering a two-sex monogamous population, in this model each female chooses her partner from among the male population without caring about his type (i.e., the allele he carries). In this work, we deal with the problem of estimating the main parameters of such model develo** the Bayesian inference in a parametric framework. Firstly, we consider, as sample scheme, the observation of the total number of females and males up to some generation as well as the number of males of each genotype at last generation. Later, we introduce the information of the mutated males only in the last generation obtaining in this way a second sample scheme. For both samples, we apply the Approximate Bayesian Computation (ABC) methodology to approximate the posterior distributions of the main parameters of this model. The accuracy of the procedure based on these samples is illustrated and discussed by way of simulated examples. △ Less

Submitted 27 January, 2018; originally announced January 2018.

arXiv:1711.09196 [pdf]

The Impact of an AirBnb Host's Listing Description 'Sentiment' and Length On Occupancy Rates

Authors: Richard Diehl Martinez, Anthony Carrington, Tiffany Kuo, Lena Tarhuni, Nour Adel Zaki Abdel-Motaal

Abstract: There has been significant literature regarding the way product review sentiment affects brand loyalty. Intrigued by how natural language influences consumer choice, we were motivated to examine whether an AirBnb host's occupancy rate (how often their listing is booked out of the days they indicated their listing was available) can be determined by the perceived sentiment and length of their descr… ▽ More There has been significant literature regarding the way product review sentiment affects brand loyalty. Intrigued by how natural language influences consumer choice, we were motivated to examine whether an AirBnb host's occupancy rate (how often their listing is booked out of the days they indicated their listing was available) can be determined by the perceived sentiment and length of their description summary. Our main goal, more generally, was to determine which features, including (but not limited to) sentiment and description length, most influence a host's occupancy rate. We define sentiment score through a natural language algorithm process, based on the AFINN dictionary. Using AirBnB data on New York City, our hypothesis is that higher sentiment scores (more positive descriptions) and longer summary length lead to higher occupancy rates. Our results show that while longer summary length may positively influence occupancy rates, more positive summary descriptions have no effect. Instead, we find that other factors such as number of reviews and number of amenities, in addition to summary length, are better indicators of occupancy rate. △ Less

Submitted 25 November, 2017; originally announced November 2017.

arXiv:1707.00705 [pdf]

The Nu Class of Low-Degree-Truncated, Rational, Generalized Functions. Ib. Integrals of Matern-correlation functions for all odd-half-integer class parameters

Authors: Selden Crary, Richard Diehl Martinez, Michael Saunders

Abstract: This paper is an extension of Parts I and Ia of a series about Nu-class generalized functions. We provide hand-generated algebraic expressions for integrals of single Matern-covariance functions, as well as for products of two Matern-covariance functions, for all odd-half-integer class parameters. These are useful both for IMSPE-optimal design software and for testing universality of Nu-class gene… ▽ More This paper is an extension of Parts I and Ia of a series about Nu-class generalized functions. We provide hand-generated algebraic expressions for integrals of single Matern-covariance functions, as well as for products of two Matern-covariance functions, for all odd-half-integer class parameters. These are useful both for IMSPE-optimal design software and for testing universality of Nu-class generalized-function properties, across covariance classes. △ Less

Submitted 22 May, 2019; v1 submitted 3 July, 2017; originally announced July 2017.

Comments: 30 pages, 3 tables, 1 appendix

arXiv:0712.4323 [pdf, ps, other]

Dispersion Models for Extremes

Authors: Bent Jørgensen, Yuri Goegebeur, José Raúl Martínez

Abstract: We propose extreme value analogues of natural exponential families and exponential dispersion models, and introduce the slope function as an analogue of the variance function. The set of quadratic and power slope functions characterize well-known families such as the Rayleigh, Gumbel, power, Pareto, logistic, negative exponential, Weibull and Fréchet. We show a convergence theorem for slope func… ▽ More We propose extreme value analogues of natural exponential families and exponential dispersion models, and introduce the slope function as an analogue of the variance function. The set of quadratic and power slope functions characterize well-known families such as the Rayleigh, Gumbel, power, Pareto, logistic, negative exponential, Weibull and Fréchet. We show a convergence theorem for slope functions, by which we may express the classical extreme value convergence results in terms of asymptotics for extreme dispersion models. The main idea is to explore the parallels between location families and natural exponential families, and between the convolution and minimum operations. △ Less

Submitted 28 December, 2007; originally announced December 2007.

Comments: 23 pages. Abstract submitted to the 56th Session of the ISI, Lisboa, 2007

Showing 1–20 of 20 results for author: Martínez, R