Skip to main content

Showing 1–40 of 40 results for author: Oliva, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.00148  [pdf, other

    cs.CV cs.LG

    Localizing Anomalies via Multiscale Score Matching Analysis

    Authors: Ahsan Mahmood, Junier Oliva, Martin Styner

    Abstract: Anomaly detection and localization in medical imaging remain critical challenges in healthcare. This paper introduces Spatial-MSMA (Multiscale Score Matching Analysis), a novel unsupervised method for anomaly localization in volumetric brain MRIs. Building upon the MSMA framework, our approach incorporates spatial information and conditional likelihoods to enhance anomaly detection capabilities. W… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  2. arXiv:2406.01825  [pdf, other

    cs.LG cs.AI

    EMOE: Expansive Matching of Experts for Robust Uncertainty Based Rejection

    Authors: Yunni Qu, James Wellnitz, Alexander Tropsha, Junier Oliva

    Abstract: Expansive Matching of Experts (EMOE) is a novel method that utilizes support-expanding, extrapolatory pseudo-labeling to improve prediction and uncertainty based rejection on out-of-distribution (OOD) points. We propose an expansive data augmentation technique that generates OOD instances in a latent space, and an empirical trial based approach to filter out augmented expansive points for pseudo-l… ▽ More

    Submitted 4 June, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

  3. arXiv:2403.12211  [pdf, other

    cs.CV cs.AI

    A Unified Model for Longitudinal Multi-Modal Multi-View Prediction with Missingness

    Authors: Boqi Chen, Junier Oliva, Marc Niethammer

    Abstract: Medical records often consist of different modalities, such as images, text, and tabular information. Integrating all modalities offers a holistic view of a patient's condition, while analyzing them longitudinally provides a better understanding of disease progression. However, real-world longitudinal medical records present challenges: 1) patients may lack some or all of the data for a specific t… ▽ More

    Submitted 21 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  4. arXiv:2308.06382  [pdf, other

    cs.SD cs.LG eess.AS

    Phoneme Hallucinator: One-shot Voice Conversion via Set Expansion

    Authors: Siyuan Shan, Yang Li, Amartya Banerjee, Junier B. Oliva

    Abstract: Voice conversion (VC) aims at altering a person's voice to make it sound similar to the voice of another person while preserving linguistic content. Existing methods suffer from a dilemma between content intelligibility and speaker similarity; i.e., methods with higher intelligibility usually have a lower speaker similarity, while methods with higher speaker similarity usually require plenty of ta… ▽ More

    Submitted 30 December, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: AAAI 2024 Demo, Codes: https://phonemehallucinator.github.io/

  5. arXiv:2304.03220  [pdf, other

    cs.LG cs.AI cs.CV

    Anomaly Detection via Gumbel Noise Score Matching

    Authors: Ahsan Mahmood, Junier Oliva, Martin Styner

    Abstract: We propose Gumbel Noise Score Matching (GNSM), a novel unsupervised method to detect anomalies in categorical data. GNSM accomplishes this by estimating the scores, i.e. the gradients of log likelihoods w.r.t.~inputs, of continuously relaxed categorical distributions. We test our method on a suite of anomaly detection tabular datasets. GNSM achieves a consistently high performance across all exper… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

  6. arXiv:2302.13960  [pdf, other

    cs.LG

    Acquisition Conditioned Oracle for Nongreedy Active Feature Acquisition

    Authors: Michael Valancius, Max Lennon, Junier Oliva

    Abstract: We develop novel methodology for active feature acquisition (AFA), the study of how to sequentially acquire a dynamic (on a per instance basis) subset of features that minimizes acquisition costs whilst still yielding accurate predictions. The AFA framework can be useful in a myriad of domains, including health care applications where the cost of acquiring additional features for a patient (in ter… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

  7. arXiv:2207.08911  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Deeply-Learned Generalized Linear Models with Missing Data

    Authors: David K Lim, Naim U Rashid, Junier B Oliva, Joseph G Ibrahim

    Abstract: Deep Learning (DL) methods have dramatically increased in popularity in recent years, with significant growth in their application to supervised learning problems in the biomedical sciences. However, the greater prevalence and complexity of missing data in modern biomedical datasets present significant challenges for DL methods. Here, we provide a formal treatment of missing data in the context of… ▽ More

    Submitted 26 October, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Journal ref: Journal of Computational and Graphical Statistics, 2023

  8. Distribution-based Sketching of Single-Cell Samples

    Authors: Vishal Athreya Baskaran, Jolene Ranek, Siyuan Shan, Natalie Stanley, Junier B. Oliva

    Abstract: Modern high-throughput single-cell immune profiling technologies, such as flow and mass cytometry and single-cell RNA sequencing can readily measure the expression of a large number of protein or gene features across the millions of cells in a multi-patient cohort. While bioinformatics approaches can be used to link immune cell heterogeneity to external variables of interest, such as, clinical out… ▽ More

    Submitted 30 June, 2022; originally announced July 2022.

    Comments: Accepted by ACM-BCB 2022

  9. arXiv:2205.05739  [pdf, other

    cs.CV cs.AI cs.CL cs.HC cs.MA

    Learning to Retrieve Videos by Asking Questions

    Authors: Avinash Madasu, Junier Oliva, Gedas Bertasius

    Abstract: The majority of traditional text-to-video retrieval systems operate in static environments, i.e., there is no interaction between the user and the agent beyond the initial textual query provided by the user. This can be sub-optimal if the initial query has ambiguities, which would lead to many falsely retrieved videos. To overcome this limitation, we propose a novel framework for Video Retrieval u… ▽ More

    Submitted 16 July, 2022; v1 submitted 11 May, 2022; originally announced May 2022.

    Journal ref: ACM Multimedia 2022

  10. arXiv:2201.12414  [pdf, other

    cs.LG stat.ML

    Posterior Matching for Arbitrary Conditioning

    Authors: Ryan R. Strauss, Junier B. Oliva

    Abstract: Arbitrary conditioning is an important problem in unsupervised learning, where we seek to model the conditional densities $p(\mathbf{x}_u \mid \mathbf{x}_o)$ that underly some data, for all possible non-intersecting subsets $o, u \subset \{1, \dots , d\}$. However, the vast majority of density estimation only focuses on modeling the joint distribution $p(\mathbf{x})$, in which important conditiona… ▽ More

    Submitted 11 November, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: Accepted at NeurIPS 2022

  11. Transparent Single-Cell Set Classification with Kernel Mean Embeddings

    Authors: Siyuan Shan, Vishal Baskaran, Haidong Yi, Jolene Ranek, Natalie Stanley, Junier Oliva

    Abstract: Modern single-cell flow and mass cytometry technologies measure the expression of several proteins of the individual cells within a blood or tissue sample. Each profiled biological sample is thus represented by a set of hundreds of thousands of multidimensional cell feature vectors, which incurs a high computational cost to predict each biological sample's associated phenotype with machine learnin… ▽ More

    Submitted 28 June, 2022; v1 submitted 18 January, 2022; originally announced January 2022.

    Comments: Accepted as Oral by ACM-BCB 2022. Codes are avialbe at https://github.com/shansiliu95/CKME

  12. arXiv:2109.08613  [pdf, other

    cs.CL

    Adversarial Scrubbing of Demographic Information for Text Classification

    Authors: Somnath Basu Roy Chowdhury, Sayan Ghosh, Yiyuan Li, Junier B. Oliva, Shashank Srivastava, Snigdha Chaturvedi

    Abstract: Contextual representations learned by language models can often encode undesirable attributes, like demographic associations of the users, while being trained for an unrelated target task. We aim to scrub such undesirable attributes and learn fair representations while maintaining performance on the target task. In this paper, we present an adversarial learning framework "Adversarial Scrubber" (AD… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP 2021

  13. arXiv:2107.04163  [pdf, other

    cs.LG

    Towards Robust Active Feature Acquisition

    Authors: Yang Li, Siyuan Shan, Qin Liu, Junier B. Oliva

    Abstract: Truly intelligent systems are expected to make critical decisions with incomplete and uncertain data. Active feature acquisition (AFA), where features are sequentially acquired to improve the prediction, is a step towards this goal. However, current AFA models all deal with a small set of candidate features and have difficulty scaling to a large feature space. Moreover, they are ignorant about the… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  14. arXiv:2102.06083  [pdf, other

    cs.LG

    Partially Observed Exchangeable Modeling

    Authors: Yang Li, Junier B. Oliva

    Abstract: Modeling dependencies among features is fundamental for many machine learning tasks. Although there are often multiple related instances that may be leveraged to inform conditional dependencies, typical approaches only model conditional dependencies over individual instances. In this work, we propose a novel framework, partially observed exchangeable modeling (POEx) that takes in a set of related… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  15. arXiv:2102.04426  [pdf, other

    cs.LG

    Arbitrary Conditional Distributions with Energy

    Authors: Ryan R. Strauss, Junier B. Oliva

    Abstract: Modeling distributions of covariates, or density estimation, is a core challenge in unsupervised learning. However, the majority of work only considers the joint distribution, which has limited utility in practical situations. A more general and useful problem is arbitrary conditional density estimation, which aims to model any possible conditional distribution over a set of covariates, reflecting… ▽ More

    Submitted 26 October, 2021; v1 submitted 8 February, 2021; originally announced February 2021.

    Comments: Accepted at NeurIPS 2021

  16. arXiv:2102.03340  [pdf, other

    cs.LG

    NRTSI: Non-Recurrent Time Series Imputation

    Authors: Siyuan Shan, Yang Li, Junier B. Oliva

    Abstract: Time series imputation is a fundamental task for understanding time series with missing data. Existing methods either do not directly handle irregularly-sampled data or degrade severely with sparsely observed data. In this work, we reformulate time series as permutation-equivariant sets and propose a novel imputation model NRTSI that does not impose any recurrent structures. Taking advantage of th… ▽ More

    Submitted 27 May, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: Codes available at https://github.com/lupalab/NRTSI

  17. arXiv:2101.07357  [pdf, other

    cs.LG stat.AP

    Unsupervised Imputation of Non-ignorably Missing Data Using Importance-Weighted Autoencoders

    Authors: David K. Lim, Naim U. Rashid, Junier B. Oliva, Joseph G. Ibrahim

    Abstract: Deep Learning (DL) methods have dramatically increased in popularity in recent years. While its initial success was demonstrated in the classification and manipulation of image data, there has been significant growth in the application of DL methods to problems in the biomedical sciences. However, the greater prevalence and complexity of missing data in biomedical datasets present significant chal… ▽ More

    Submitted 17 June, 2022; v1 submitted 18 January, 2021; originally announced January 2021.

    Comments: 31 pages, 4 figures, 2 tables, under review (Biometrics Methodology)

  18. arXiv:2010.13132  [pdf, other

    cs.LG

    Multiscale Score Matching for Out-of-Distribution Detection

    Authors: Ahsan Mahmood, Junier Oliva, Martin Styner

    Abstract: We present a new methodology for detecting out-of-distribution (OOD) images by utilizing norms of the score estimates at multiple noise scales. A score is defined to be the gradient of the log density with respect to the input data. Our methodology is completely unsupervised and follows a straight forward training scheme. First, we train a deep network to estimate scores for levels of noise. Once… ▽ More

    Submitted 23 March, 2021; v1 submitted 25 October, 2020; originally announced October 2020.

  19. arXiv:2010.02433  [pdf, other

    cs.LG cs.AI

    Active Feature Acquisition with Generative Surrogate Models

    Authors: Yang Li, Junier B. Oliva

    Abstract: Many real-world situations allow for the acquisition of additional relevant information when making an assessment with limited or uncertain data. However, traditional ML approaches either require all features to be acquired beforehand or regard part of them as missing data that cannot be acquired. In this work, we consider models that perform active feature acquisition (AFA) and query the environm… ▽ More

    Submitted 11 February, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

  20. arXiv:2008.02676  [pdf, other

    cs.LG stat.ML

    Exchangeable Neural ODE for Set Modeling

    Authors: Yang Li, Haidong Yi, Christopher M. Bender, Siyuan Shan, Junier B. Oliva

    Abstract: Reasoning over an instance composed of a set of vectors, like a point cloud, requires that one accounts for intra-set dependent features among elements. However, since such instances are unordered, the elements' features should remain unchanged when the input's order is permuted. This property, permutation equivariance, is a challenging constraint for most neural architectures. While recent work h… ▽ More

    Submitted 6 August, 2020; originally announced August 2020.

  21. arXiv:2006.07701  [pdf, other

    cs.LG stat.ML

    Dynamic Feature Acquisition with Arbitrary Conditional Flows

    Authors: Yang Li, Junier B. Oliva

    Abstract: Many real-world situations allow for the acquisition of additional relevant information when making an assessment with limited or uncertain data. However, traditional ML approaches either require all features to be acquired beforehand or regard part of them as missing data that cannot be acquired. In this work, we propose models that dynamically acquire new features to further improve the predicti… ▽ More

    Submitted 12 March, 2021; v1 submitted 13 June, 2020; originally announced June 2020.

  22. arXiv:2006.04259  [pdf, other

    cs.LG stat.ML

    Deep Goal-Oriented Clustering

    Authors: Yifeng Shi, Christopher M. Bender, Junier B. Oliva, Marc Niethammer

    Abstract: Clustering and prediction are two primary tasks in the fields of unsupervised and supervised learning, respectively. Although much of the recent advances in machine learning have been centered around those two tasks, the interdependent, mutually beneficial relationship between them is rarely explored. One could reasonably expect appropriately clustering the data would aid the downstream prediction… ▽ More

    Submitted 15 June, 2020; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: 15 pages

  23. arXiv:2004.03456  [pdf, other

    cs.LG eess.SP stat.ML

    Binary and Multiclass Classifiers based on Multitaper Spectral Features for Epilepsy Detection

    Authors: Jefferson Tales Oliva, João Luís Garcia Rosa

    Abstract: Epilepsy is one of the most common neurological disorders that can be diagnosed through electroencephalogram (EEG), in which the following epileptic events can be observed: pre-ictal, ictal, post-ictal, and interictal. In this paper, we present a novel method for epilepsy detection into two differentiation contexts: binary and multiclass classification. For feature extraction, a total of 105 measu… ▽ More

    Submitted 2 April, 2020; originally announced April 2020.

    Comments: 19 pages, 6 figures, 10 tables. Obs.: in the text, English editing is required. A new version of this text will be available once we have completed their review

  24. arXiv:2003.10602  [pdf, other

    cs.LG stat.ML

    Defense Through Diverse Directions

    Authors: Christopher M. Bender, Yang Li, Yifeng Shi, Michael K. Reiter, Junier B. Oliva

    Abstract: In this work we develop a novel Bayesian neural network methodology to achieve strong adversarial robustness without the need for online adversarial training. Unlike previous efforts in this direction, we do not rely solely on the stochasticity of network weights by minimizing the divergence between the learned parameter distribution and a prior. Instead, we additionally require that the model mai… ▽ More

    Submitted 23 March, 2020; originally announced March 2020.

  25. arXiv:1909.09877  [pdf, other

    cs.LG stat.ML

    Deep Message Passing on Sets

    Authors: Yifeng Shi, Junier Oliva, Marc Niethammer

    Abstract: Modern methods for learning over graph input data have shown the fruitfulness of accounting for relationships among elements in a collection. However, most methods that learn over set input data use only rudimentary approaches to exploit intra-collection relationships. In this work we introduce Deep Message Passing on Sets (DMPS), a novel method that incorporates relational learning for sets. DMPS… ▽ More

    Submitted 21 September, 2019; originally announced September 2019.

    Comments: 11 pages, 8 figures

  26. arXiv:1909.09140  [pdf, other

    cs.LG stat.ML

    Meta-Neighborhoods

    Authors: Siyuan Shan, Yang Li, Junier Oliva

    Abstract: Making an adaptive prediction based on one's input is an important ability for general artificial intelligence. In this work, we step forward in this direction and propose a semi-parametric method, Meta-Neighborhoods, where predictions are made adaptively to the neighborhood of the input. We show that Meta-Neighborhoods is a generalization of $k$-nearest-neighbors. Due to the simpler manifold stru… ▽ More

    Submitted 13 October, 2020; v1 submitted 18 September, 2019; originally announced September 2019.

    Comments: To appear in NeurIPS 2020

  27. arXiv:1909.06319  [pdf, other

    cs.LG cs.CV stat.ML

    Flow Models for Arbitrary Conditional Likelihoods

    Authors: Yang Li, Shoaib Akbar, Junier B. Oliva

    Abstract: Understanding the dependencies among features of a dataset is at the core of most unsupervised learning tasks. However, a majority of generative modeling approaches are focused solely on the joint distribution $p(x)$ and utilize models where it is intractable to obtain the conditional distribution of some arbitrary subset of features $x_u$ given the rest of the observed covariates $x_o$:… ▽ More

    Submitted 6 August, 2020; v1 submitted 13 September, 2019; originally announced September 2019.

  28. arXiv:1905.13372  [pdf, other

    cs.LG cs.AI q-bio.MN q-bio.QM stat.ML

    MolecularRNN: Generating realistic molecular graphs with optimized properties

    Authors: Mariya Popova, Mykhailo Shvets, Junier Oliva, Olexandr Isayev

    Abstract: Designing new molecules with a set of predefined properties is a core problem in modern drug discovery and development. There is a growing need for de-novo design methods that would address this problem. We present MolecularRNN, the graph recurrent generative model for molecular structures. Our model generates diverse realistic molecular graphs after likelihood pretraining on a big database of mol… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

  29. arXiv:1902.03356  [pdf, other

    cs.LG stat.ML

    Meta-Curvature

    Authors: Eunbyung Park, Junier B. Oliva

    Abstract: We propose meta-curvature (MC), a framework to learn curvature information for better generalization and fast model adaptation. MC expands on the model-agnostic meta-learner (MAML) by learning to transform the gradients in the inner optimization such that the transformed gradients achieve better generalization performance to a new task. For training large scale neural networks, we decompose the cu… ▽ More

    Submitted 9 January, 2020; v1 submitted 8 February, 2019; originally announced February 2019.

    Comments: To appear in NeurIPS 2019

  30. arXiv:1902.01967  [pdf, other

    cs.LG stat.ML

    Exchangeable Generative Models with Flow Scans

    Authors: Christopher Bender, Kevin O'Connor, Yang Li, Juan Jose Garcia, Manzil Zaheer, Junier Oliva

    Abstract: In this work, we develop a new approach to generative density estimation for exchangeable, non-i.i.d. data. The proposed framework, FlowScan, combines invertible flow transformations with a sorted scan to flexibly model the data while preserving exchangeability. Unlike most existing methods, FlowScan exploits the intradependencies within sets to learn both global and local structure. FlowScan repr… ▽ More

    Submitted 18 September, 2019; v1 submitted 5 February, 2019; originally announced February 2019.

  31. arXiv:1902.01435  [pdf, other

    stat.ML cs.LG

    A Forest from the Trees: Generation through Neighborhoods

    Authors: Yang Li, Tianxiang Gao, Junier B. Oliva

    Abstract: In this work, we propose to learn a generative model using both learned features (through a latent space) and memories (through neighbors). Although human learning makes seamless use of both learned perceptual features and instance recall, current generative learning paradigms only make use of one of these two components. Take, for instance, flow models, which learn a latent space of invertible fe… ▽ More

    Submitted 19 November, 2019; v1 submitted 4 February, 2019; originally announced February 2019.

  32. arXiv:1711.02033  [pdf, other

    astro-ph.CO cs.LG stat.ML

    Estimating Cosmological Parameters from the Dark Matter Distribution

    Authors: Siamak Ravanbakhsh, Junier Oliva, Sebastien Fromenteau, Layne C. Price, Shirley Ho, Jeff Schneider, Barnabas Poczos

    Abstract: A grand challenge of the 21st century cosmology is to accurately estimate the cosmological parameters of our Universe. A major approach to estimating the cosmological parameters is to use the large-scale matter distribution of the Universe. Galaxy surveys provide the means to map out cosmic large-scale structure in three dimensions. Information about galaxy locations is typically summarized in a "… ▽ More

    Submitted 6 November, 2017; originally announced November 2017.

    Comments: ICML 2016

  33. arXiv:1705.10750  [pdf, other

    cs.LG stat.ML

    Recurrent Estimation of Distributions

    Authors: Junier B. Oliva, Kumar Avinava Dubey, Barnabas Poczos, Eric Xing, Jeff Schneider

    Abstract: This paper presents the recurrent estimation of distributions (RED) for modeling real-valued data in a semiparametric fashion. RED models make two novel uses of recurrent neural networks (RNNs) for density estimation of general real-valued data. First, RNNs are used to transform input covariates into a latent space to better capture conditional dependencies in inputs. After, an RNN is used to comp… ▽ More

    Submitted 30 May, 2017; originally announced May 2017.

  34. arXiv:1703.00381  [pdf, other

    cs.LG cs.AI stat.ML

    The Statistical Recurrent Unit

    Authors: Junier B. Oliva, Barnabas Poczos, Jeff Schneider

    Abstract: Sophisticated gated recurrent neural network architectures like LSTMs and GRUs have been shown to be highly effective in a myriad of applications. We develop an un-gated unit, the statistical recurrent unit (SRU), that is able to learn long term dependencies in data by only kee** moving averages of statistics. The SRU's architecture is simple, un-gated, and contains a comparable number of parame… ▽ More

    Submitted 1 March, 2017; originally announced March 2017.

  35. arXiv:1603.06288  [pdf, other

    stat.ML cs.AI cs.LG

    Multi-fidelity Gaussian Process Bandit Optimisation

    Authors: Kirthevasan Kandasamy, Gautam Dasarathy, Junier B. Oliva, Jeff Schneider, Barnabas Poczos

    Abstract: In many scientific and engineering applications, we are tasked with the maximisation of an expensive to evaluate black box function $f$. Traditional settings for this problem assume just the availability of this single function. However, in many cases, cheap approximations to $f$ may be obtainable. For example, the expensive real world behaviour of a robot can be approximated by a cheap computer s… ▽ More

    Submitted 15 March, 2019; v1 submitted 20 March, 2016; originally announced March 2016.

    Comments: Preliminary version appeared at NIPS 2016

  36. arXiv:1511.04150  [pdf, other

    stat.ML cs.CV cs.LG

    Deep Mean Maps

    Authors: Junier B. Oliva, Danica J. Sutherland, Barnabás Póczos, Jeff Schneider

    Abstract: The use of distributions and high-level features from deep architecture has become commonplace in modern computer vision. Both of these methodologies have separately achieved a great deal of success in many computer vision tasks. However, there has been little work attempting to leverage the power of these to methodologies jointly. To this end, this paper presents the Deep Mean Maps (DMMs) framewo… ▽ More

    Submitted 14 January, 2021; v1 submitted 12 November, 2015; originally announced November 2015.

  37. arXiv:1509.07553  [pdf, other

    stat.ML cs.LG

    Linear-time Learning on Distributions with Approximate Kernel Embeddings

    Authors: Danica J. Sutherland, Junier B. Oliva, Barnabás Póczos, Jeff Schneider

    Abstract: Many interesting machine learning problems are best posed by considering instances that are distributions, or sample sets drawn from distributions. Previous work devoted to machine learning tasks with distributional inputs has done so through pairwise kernel evaluations between pdfs (or sample sets). While such an approach is fine for smaller datasets, the computation of an $N \times N$ Gram matri… ▽ More

    Submitted 14 January, 2021; v1 submitted 24 September, 2015; originally announced September 2015.

    Journal ref: AAAI'16: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, February 2016, 2073-2079

  38. arXiv:1410.7414  [pdf, other

    stat.ML cs.LG

    Fast Function to Function Regression

    Authors: Junier Oliva, Willie Neiswanger, Barnabas Poczos, Eric Xing, Jeff Schneider

    Abstract: We analyze the problem of regression when both input covariates and output responses are functions from a nonparametric function class. Function to function regression (FFR) covers a large range of interesting applications including time-series prediction problems, and also more general tasks like studying a map** between two separate types of distributions. However, previous nonparametric estim… ▽ More

    Submitted 27 October, 2014; originally announced October 2014.

  39. arXiv:1311.2236  [pdf, other

    stat.ML cs.LG math.ST

    Fast Distribution To Real Regression

    Authors: Junier B. Oliva, Willie Neiswanger, Barnabas Poczos, Jeff Schneider, Eric Xing

    Abstract: We study the problem of distribution to real-value regression, where one aims to regress a map** $f$ that takes in a distribution input covariate $P\in \mathcal{I}$ (for a non-parametric family of distributions $\mathcal{I}$) and outputs a real-valued response $Y=f(P) + ε$. This setting was recently studied, and a "Kernel-Kernel" estimator was introduced and shown to have a polynomial rate of co… ▽ More

    Submitted 8 March, 2014; v1 submitted 9 November, 2013; originally announced November 2013.

  40. arXiv:1311.2234  [pdf, other

    stat.ML cs.LG math.ST

    FuSSO: Functional Shrinkage and Selection Operator

    Authors: Junier B. Oliva, Barnabas Poczos, Timothy Verstynen, Aarti Singh, Jeff Schneider, Fang-Cheng Yeh, Wen-Yih Tseng

    Abstract: We present the FuSSO, a functional analogue to the LASSO, that efficiently finds a sparse set of functional input covariates to regress a real-valued response against. The FuSSO does so in a semi-parametric fashion, making no parametric assumptions about the nature of input functional covariates and assuming a linear form to the map** of functional covariates to the response. We provide a statis… ▽ More

    Submitted 8 March, 2014; v1 submitted 9 November, 2013; originally announced November 2013.