Skip to main content

Showing 1–25 of 25 results for author: Oliva, J B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.06382  [pdf, other

    cs.SD cs.LG eess.AS

    Phoneme Hallucinator: One-shot Voice Conversion via Set Expansion

    Authors: Siyuan Shan, Yang Li, Amartya Banerjee, Junier B. Oliva

    Abstract: Voice conversion (VC) aims at altering a person's voice to make it sound similar to the voice of another person while preserving linguistic content. Existing methods suffer from a dilemma between content intelligibility and speaker similarity; i.e., methods with higher intelligibility usually have a lower speaker similarity, while methods with higher speaker similarity usually require plenty of ta… ▽ More

    Submitted 30 December, 2023; v1 submitted 11 August, 2023; originally announced August 2023.

    Comments: AAAI 2024 Demo, Codes: https://phonemehallucinator.github.io/

  2. arXiv:2207.08911  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Deeply-Learned Generalized Linear Models with Missing Data

    Authors: David K Lim, Naim U Rashid, Junier B Oliva, Joseph G Ibrahim

    Abstract: Deep Learning (DL) methods have dramatically increased in popularity in recent years, with significant growth in their application to supervised learning problems in the biomedical sciences. However, the greater prevalence and complexity of missing data in modern biomedical datasets present significant challenges for DL methods. Here, we provide a formal treatment of missing data in the context of… ▽ More

    Submitted 26 October, 2023; v1 submitted 18 July, 2022; originally announced July 2022.

    Journal ref: Journal of Computational and Graphical Statistics, 2023

  3. Distribution-based Sketching of Single-Cell Samples

    Authors: Vishal Athreya Baskaran, Jolene Ranek, Siyuan Shan, Natalie Stanley, Junier B. Oliva

    Abstract: Modern high-throughput single-cell immune profiling technologies, such as flow and mass cytometry and single-cell RNA sequencing can readily measure the expression of a large number of protein or gene features across the millions of cells in a multi-patient cohort. While bioinformatics approaches can be used to link immune cell heterogeneity to external variables of interest, such as, clinical out… ▽ More

    Submitted 30 June, 2022; originally announced July 2022.

    Comments: Accepted by ACM-BCB 2022

  4. arXiv:2201.12414  [pdf, other

    cs.LG stat.ML

    Posterior Matching for Arbitrary Conditioning

    Authors: Ryan R. Strauss, Junier B. Oliva

    Abstract: Arbitrary conditioning is an important problem in unsupervised learning, where we seek to model the conditional densities $p(\mathbf{x}_u \mid \mathbf{x}_o)$ that underly some data, for all possible non-intersecting subsets $o, u \subset \{1, \dots , d\}$. However, the vast majority of density estimation only focuses on modeling the joint distribution $p(\mathbf{x})$, in which important conditiona… ▽ More

    Submitted 11 November, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: Accepted at NeurIPS 2022

  5. arXiv:2109.08613  [pdf, other

    cs.CL

    Adversarial Scrubbing of Demographic Information for Text Classification

    Authors: Somnath Basu Roy Chowdhury, Sayan Ghosh, Yiyuan Li, Junier B. Oliva, Shashank Srivastava, Snigdha Chaturvedi

    Abstract: Contextual representations learned by language models can often encode undesirable attributes, like demographic associations of the users, while being trained for an unrelated target task. We aim to scrub such undesirable attributes and learn fair representations while maintaining performance on the target task. In this paper, we present an adversarial learning framework "Adversarial Scrubber" (AD… ▽ More

    Submitted 17 September, 2021; originally announced September 2021.

    Comments: Accepted at EMNLP 2021

  6. arXiv:2107.04163  [pdf, other

    cs.LG

    Towards Robust Active Feature Acquisition

    Authors: Yang Li, Siyuan Shan, Qin Liu, Junier B. Oliva

    Abstract: Truly intelligent systems are expected to make critical decisions with incomplete and uncertain data. Active feature acquisition (AFA), where features are sequentially acquired to improve the prediction, is a step towards this goal. However, current AFA models all deal with a small set of candidate features and have difficulty scaling to a large feature space. Moreover, they are ignorant about the… ▽ More

    Submitted 8 July, 2021; originally announced July 2021.

  7. arXiv:2102.06083  [pdf, other

    cs.LG

    Partially Observed Exchangeable Modeling

    Authors: Yang Li, Junier B. Oliva

    Abstract: Modeling dependencies among features is fundamental for many machine learning tasks. Although there are often multiple related instances that may be leveraged to inform conditional dependencies, typical approaches only model conditional dependencies over individual instances. In this work, we propose a novel framework, partially observed exchangeable modeling (POEx) that takes in a set of related… ▽ More

    Submitted 11 February, 2021; originally announced February 2021.

  8. arXiv:2102.04426  [pdf, other

    cs.LG

    Arbitrary Conditional Distributions with Energy

    Authors: Ryan R. Strauss, Junier B. Oliva

    Abstract: Modeling distributions of covariates, or density estimation, is a core challenge in unsupervised learning. However, the majority of work only considers the joint distribution, which has limited utility in practical situations. A more general and useful problem is arbitrary conditional density estimation, which aims to model any possible conditional distribution over a set of covariates, reflecting… ▽ More

    Submitted 26 October, 2021; v1 submitted 8 February, 2021; originally announced February 2021.

    Comments: Accepted at NeurIPS 2021

  9. arXiv:2102.03340  [pdf, other

    cs.LG

    NRTSI: Non-Recurrent Time Series Imputation

    Authors: Siyuan Shan, Yang Li, Junier B. Oliva

    Abstract: Time series imputation is a fundamental task for understanding time series with missing data. Existing methods either do not directly handle irregularly-sampled data or degrade severely with sparsely observed data. In this work, we reformulate time series as permutation-equivariant sets and propose a novel imputation model NRTSI that does not impose any recurrent structures. Taking advantage of th… ▽ More

    Submitted 27 May, 2021; v1 submitted 5 February, 2021; originally announced February 2021.

    Comments: Codes available at https://github.com/lupalab/NRTSI

  10. arXiv:2101.07357  [pdf, other

    cs.LG stat.AP

    Unsupervised Imputation of Non-ignorably Missing Data Using Importance-Weighted Autoencoders

    Authors: David K. Lim, Naim U. Rashid, Junier B. Oliva, Joseph G. Ibrahim

    Abstract: Deep Learning (DL) methods have dramatically increased in popularity in recent years. While its initial success was demonstrated in the classification and manipulation of image data, there has been significant growth in the application of DL methods to problems in the biomedical sciences. However, the greater prevalence and complexity of missing data in biomedical datasets present significant chal… ▽ More

    Submitted 17 June, 2022; v1 submitted 18 January, 2021; originally announced January 2021.

    Comments: 31 pages, 4 figures, 2 tables, under review (Biometrics Methodology)

  11. arXiv:2010.02433  [pdf, other

    cs.LG cs.AI

    Active Feature Acquisition with Generative Surrogate Models

    Authors: Yang Li, Junier B. Oliva

    Abstract: Many real-world situations allow for the acquisition of additional relevant information when making an assessment with limited or uncertain data. However, traditional ML approaches either require all features to be acquired beforehand or regard part of them as missing data that cannot be acquired. In this work, we consider models that perform active feature acquisition (AFA) and query the environm… ▽ More

    Submitted 11 February, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

  12. arXiv:2008.02676  [pdf, other

    cs.LG stat.ML

    Exchangeable Neural ODE for Set Modeling

    Authors: Yang Li, Haidong Yi, Christopher M. Bender, Siyuan Shan, Junier B. Oliva

    Abstract: Reasoning over an instance composed of a set of vectors, like a point cloud, requires that one accounts for intra-set dependent features among elements. However, since such instances are unordered, the elements' features should remain unchanged when the input's order is permuted. This property, permutation equivariance, is a challenging constraint for most neural architectures. While recent work h… ▽ More

    Submitted 6 August, 2020; originally announced August 2020.

  13. arXiv:2006.07701  [pdf, other

    cs.LG stat.ML

    Dynamic Feature Acquisition with Arbitrary Conditional Flows

    Authors: Yang Li, Junier B. Oliva

    Abstract: Many real-world situations allow for the acquisition of additional relevant information when making an assessment with limited or uncertain data. However, traditional ML approaches either require all features to be acquired beforehand or regard part of them as missing data that cannot be acquired. In this work, we propose models that dynamically acquire new features to further improve the predicti… ▽ More

    Submitted 12 March, 2021; v1 submitted 13 June, 2020; originally announced June 2020.

  14. arXiv:2006.04259  [pdf, other

    cs.LG stat.ML

    Deep Goal-Oriented Clustering

    Authors: Yifeng Shi, Christopher M. Bender, Junier B. Oliva, Marc Niethammer

    Abstract: Clustering and prediction are two primary tasks in the fields of unsupervised and supervised learning, respectively. Although much of the recent advances in machine learning have been centered around those two tasks, the interdependent, mutually beneficial relationship between them is rarely explored. One could reasonably expect appropriately clustering the data would aid the downstream prediction… ▽ More

    Submitted 15 June, 2020; v1 submitted 7 June, 2020; originally announced June 2020.

    Comments: 15 pages

  15. arXiv:2003.10602  [pdf, other

    cs.LG stat.ML

    Defense Through Diverse Directions

    Authors: Christopher M. Bender, Yang Li, Yifeng Shi, Michael K. Reiter, Junier B. Oliva

    Abstract: In this work we develop a novel Bayesian neural network methodology to achieve strong adversarial robustness without the need for online adversarial training. Unlike previous efforts in this direction, we do not rely solely on the stochasticity of network weights by minimizing the divergence between the learned parameter distribution and a prior. Instead, we additionally require that the model mai… ▽ More

    Submitted 23 March, 2020; originally announced March 2020.

  16. arXiv:1909.06319  [pdf, other

    cs.LG cs.CV stat.ML

    Flow Models for Arbitrary Conditional Likelihoods

    Authors: Yang Li, Shoaib Akbar, Junier B. Oliva

    Abstract: Understanding the dependencies among features of a dataset is at the core of most unsupervised learning tasks. However, a majority of generative modeling approaches are focused solely on the joint distribution $p(x)$ and utilize models where it is intractable to obtain the conditional distribution of some arbitrary subset of features $x_u$ given the rest of the observed covariates $x_o$:… ▽ More

    Submitted 6 August, 2020; v1 submitted 13 September, 2019; originally announced September 2019.

  17. arXiv:1902.03356  [pdf, other

    cs.LG stat.ML

    Meta-Curvature

    Authors: Eunbyung Park, Junier B. Oliva

    Abstract: We propose meta-curvature (MC), a framework to learn curvature information for better generalization and fast model adaptation. MC expands on the model-agnostic meta-learner (MAML) by learning to transform the gradients in the inner optimization such that the transformed gradients achieve better generalization performance to a new task. For training large scale neural networks, we decompose the cu… ▽ More

    Submitted 9 January, 2020; v1 submitted 8 February, 2019; originally announced February 2019.

    Comments: To appear in NeurIPS 2019

  18. arXiv:1902.01435  [pdf, other

    stat.ML cs.LG

    A Forest from the Trees: Generation through Neighborhoods

    Authors: Yang Li, Tianxiang Gao, Junier B. Oliva

    Abstract: In this work, we propose to learn a generative model using both learned features (through a latent space) and memories (through neighbors). Although human learning makes seamless use of both learned perceptual features and instance recall, current generative learning paradigms only make use of one of these two components. Take, for instance, flow models, which learn a latent space of invertible fe… ▽ More

    Submitted 19 November, 2019; v1 submitted 4 February, 2019; originally announced February 2019.

  19. arXiv:1705.10750  [pdf, other

    cs.LG stat.ML

    Recurrent Estimation of Distributions

    Authors: Junier B. Oliva, Kumar Avinava Dubey, Barnabas Poczos, Eric Xing, Jeff Schneider

    Abstract: This paper presents the recurrent estimation of distributions (RED) for modeling real-valued data in a semiparametric fashion. RED models make two novel uses of recurrent neural networks (RNNs) for density estimation of general real-valued data. First, RNNs are used to transform input covariates into a latent space to better capture conditional dependencies in inputs. After, an RNN is used to comp… ▽ More

    Submitted 30 May, 2017; originally announced May 2017.

  20. arXiv:1703.00381  [pdf, other

    cs.LG cs.AI stat.ML

    The Statistical Recurrent Unit

    Authors: Junier B. Oliva, Barnabas Poczos, Jeff Schneider

    Abstract: Sophisticated gated recurrent neural network architectures like LSTMs and GRUs have been shown to be highly effective in a myriad of applications. We develop an un-gated unit, the statistical recurrent unit (SRU), that is able to learn long term dependencies in data by only kee** moving averages of statistics. The SRU's architecture is simple, un-gated, and contains a comparable number of parame… ▽ More

    Submitted 1 March, 2017; originally announced March 2017.

  21. arXiv:1603.06288  [pdf, other

    stat.ML cs.AI cs.LG

    Multi-fidelity Gaussian Process Bandit Optimisation

    Authors: Kirthevasan Kandasamy, Gautam Dasarathy, Junier B. Oliva, Jeff Schneider, Barnabas Poczos

    Abstract: In many scientific and engineering applications, we are tasked with the maximisation of an expensive to evaluate black box function $f$. Traditional settings for this problem assume just the availability of this single function. However, in many cases, cheap approximations to $f$ may be obtainable. For example, the expensive real world behaviour of a robot can be approximated by a cheap computer s… ▽ More

    Submitted 15 March, 2019; v1 submitted 20 March, 2016; originally announced March 2016.

    Comments: Preliminary version appeared at NIPS 2016

  22. arXiv:1511.04150  [pdf, other

    stat.ML cs.CV cs.LG

    Deep Mean Maps

    Authors: Junier B. Oliva, Danica J. Sutherland, Barnabás Póczos, Jeff Schneider

    Abstract: The use of distributions and high-level features from deep architecture has become commonplace in modern computer vision. Both of these methodologies have separately achieved a great deal of success in many computer vision tasks. However, there has been little work attempting to leverage the power of these to methodologies jointly. To this end, this paper presents the Deep Mean Maps (DMMs) framewo… ▽ More

    Submitted 14 January, 2021; v1 submitted 12 November, 2015; originally announced November 2015.

  23. arXiv:1509.07553  [pdf, other

    stat.ML cs.LG

    Linear-time Learning on Distributions with Approximate Kernel Embeddings

    Authors: Danica J. Sutherland, Junier B. Oliva, Barnabás Póczos, Jeff Schneider

    Abstract: Many interesting machine learning problems are best posed by considering instances that are distributions, or sample sets drawn from distributions. Previous work devoted to machine learning tasks with distributional inputs has done so through pairwise kernel evaluations between pdfs (or sample sets). While such an approach is fine for smaller datasets, the computation of an $N \times N$ Gram matri… ▽ More

    Submitted 14 January, 2021; v1 submitted 24 September, 2015; originally announced September 2015.

    Journal ref: AAAI'16: Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, February 2016, 2073-2079

  24. arXiv:1311.2236  [pdf, other

    stat.ML cs.LG math.ST

    Fast Distribution To Real Regression

    Authors: Junier B. Oliva, Willie Neiswanger, Barnabas Poczos, Jeff Schneider, Eric Xing

    Abstract: We study the problem of distribution to real-value regression, where one aims to regress a map** $f$ that takes in a distribution input covariate $P\in \mathcal{I}$ (for a non-parametric family of distributions $\mathcal{I}$) and outputs a real-valued response $Y=f(P) + ε$. This setting was recently studied, and a "Kernel-Kernel" estimator was introduced and shown to have a polynomial rate of co… ▽ More

    Submitted 8 March, 2014; v1 submitted 9 November, 2013; originally announced November 2013.

  25. arXiv:1311.2234  [pdf, other

    stat.ML cs.LG math.ST

    FuSSO: Functional Shrinkage and Selection Operator

    Authors: Junier B. Oliva, Barnabas Poczos, Timothy Verstynen, Aarti Singh, Jeff Schneider, Fang-Cheng Yeh, Wen-Yih Tseng

    Abstract: We present the FuSSO, a functional analogue to the LASSO, that efficiently finds a sparse set of functional input covariates to regress a real-valued response against. The FuSSO does so in a semi-parametric fashion, making no parametric assumptions about the nature of input functional covariates and assuming a linear form to the map** of functional covariates to the response. We provide a statis… ▽ More

    Submitted 8 March, 2014; v1 submitted 9 November, 2013; originally announced November 2013.