Showing 1–50 of 129 results for author: Póczos, B

  1. arXiv:2407.05649  [pdf, other]

    cs.LG cs.AI cs.NE

    Graph Attention with Random Rewiring

    Authors: Tongzhou Liao, Barnabás Póczos

    Abstract: Graph Neural Networks (GNNs) have become fundamental in graph-structured deep learning. Key paradigms of modern GNNs include message passing, graph rewiring, and Graph Transformers. This paper introduces Graph-Rewiring Attention with Stochastic Structures (GRASS), a novel GNN architecture that combines the advantages of these three paradigms. GRASS rewires the input graph by superimposing a random…

    Submitted 8 July, 2024; originally announced July 2024.

  2. arXiv:2406.08511  [pdf, other]

    physics.chem-ph cs.LG

    Diffusion Models in $\textit{De Novo}$ Drug Design

    Authors: Amira Alakhdar, Barnabas Poczos, Newell Washburn

    Abstract: Diffusion models have emerged as powerful tools for molecular generation, particularly in the context of 3D molecular structures. Inspired by non-equilibrium statistical physics, these models can generate 3D molecular structures with specific properties or requirements crucial to drug discovery. Diffusion models were particularly successful at learning 3D molecular geometries' complex probability…

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2405.01490  [pdf, other]

    cs.CL cs.AI

    Controllable Text Generation in the Instruction-Tuning Era

    Authors: Dhananjay Ashok, Barnabas Poczos

    Abstract: While most research on controllable text generation has focused on steering base Language Models, the emerging instruction-tuning and prompting paradigm offers an alternate approach to controllability. We compile and release ConGenBench, a testbed of 17 different controllable generation tasks, using a subset of it to benchmark the performance of 9 different baselines and methods on Instruction-tun…

    Submitted 2 May, 2024; originally announced May 2024.

  4. arXiv:2308.15772  [pdf, other]

    cs.CL

    Task-Based MoE for Multitask Multilingual Machine Translation

    Authors: Hai Pham, Young Jin Kim, Subhabrata Mukherjee, David P. Woodruff, Barnabas Poczos, Hany Hassan Awadalla

    Abstract: Mixture-of-experts (MoE) architecture has been proven a powerful method for diverse tasks in training deep models in many applications. However, current MoE implementations are task agnostic, treating all tokens from different tasks in the same manner. In this work, we instead design a novel method that incorporates task information into MoE models at different granular levels with shared dynamic…

    Submitted 24 October, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

  5. arXiv:2308.13066  [pdf, other]

    cs.LG q-bio.QM

    Objective-Agnostic Enhancement of Molecule Properties via Multi-Stage VAE

    Authors: Chenghui Zhou, Barnabas Poczos

    Abstract: Variational autoencoder (VAE) is a popular method for drug discovery and various architectures and pipelines have been proposed to improve its performance. However, VAE approaches are known to suffer from poor manifold recovery when the data lie on a low-dimensional manifold embedded in a higher dimensional ambient space [Dai and Wipf, 2019]. The consequences of it in drug discovery are somewhat u…

    Submitted 9 September, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: arXiv admin note: text overlap with arXiv:2212.02750

  6. arXiv:2305.14707  [pdf, other]

    cs.CL cs.AI cs.LG

    SciFix: Outperforming GPT3 on Scientific Factual Error Correction

    Authors: Dhananjay Ashok, Atharva Kulkarni, Hai Pham, Barnabás Póczos

    Abstract: Due to the prohibitively high cost of creating error correction datasets, most Factual Claim Correction methods rely on a powerful verification model to guide the correction process. This leads to a significant drop in performance in domains like scientific claims, where good verification models do not always exist. In this work, we introduce SciFix, a scientific claim correction system that does…

    Submitted 12 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: To appear in proceedings of EMNLP2023 (findings)

  7. arXiv:2212.02750  [pdf, other]

    cs.LG q-bio.QM

    Improving Molecule Properties Through 2-Stage VAE

    Authors: Chenghui Zhou, Barnabas Poczos

    Abstract: Variational autoencoder (VAE) is a popular method for drug discovery, and a great many architectures and pipelines have been proposed to improve its performance. But the VAE model itself suffers from deficiencies, such as poor manifold recovery when data lie on a low-dimensional manifold embedded in a higher-dimensional ambient space, and they manifest themselves differently in each application.…

    Submitted 5 December, 2022; originally announced December 2022.

  8. arXiv:2211.03970  [pdf, other]

    cs.LG math.OC

    On the Algorithmic Stability and Generalization of Adaptive Optimization Methods

    Authors: Han Nguyen, Hai Pham, Sashank J. Reddi, Barnabás Póczos

    Abstract: Despite their popularity in deep learning and machine learning in general, the theoretical properties of adaptive optimizers such as Adagrad, RMSProp, Adam or AdamW are not yet fully understood. In this paper, we develop a novel framework to study the stability and generalization of these optimization methods. Based on this framework, we show provable guarantees about such properties that depend h…

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: 21 pages including appendix

  9. arXiv:2106.04072  [pdf, ps, other]

    cs.AI cs.LG

    Coarse-to-Fine Curriculum Learning

    Authors: Otilia Stretcu, Emmanouil Antonios Platanios, Tom M. Mitchell, Barnabás Póczos

    Abstract: When faced with learning challenging new tasks, humans often follow sequences of steps that allow them to incrementally build up the necessary skills for performing these new tasks. However, in machine learning, models are most often trained to solve the target tasks directly. Inspired by human learning, we propose a novel curriculum learning approach which decomposes challenging tasks into sequenc…

    Submitted 7 June, 2021; originally announced June 2021.

  10. arXiv:2104.08398  [pdf, ps, other]

    cs.CL

    Re-TACRED: Addressing Shortcomings of the TACRED Dataset

    Authors: George Stoica, Emmanouil Antonios Platanios, Barnabás Póczos

    Abstract: TACRED is one of the largest and most widely used sentence-level relation extraction datasets. Proposed models that are evaluated using this dataset consistently set new state-of-the-art performance. However, they still exhibit large error rates despite leveraging external knowledge and unsupervised pretraining on large text corpora. A recent study suggested that this may be due to poor dataset qu…

    Submitted 16 April, 2021; originally announced April 2021.

  11. arXiv:2104.05196  [pdf, other]

    cs.CL cs.AI cs.LG

    StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style Transfer

    Authors: Yiwei Lyu, Paul Pu Liang, Hai Pham, Eduard Hovy, Barnabás Póczos, Ruslan Salakhutdinov, Louis-Philippe Morency

    Abstract: Text style transfer aims to controllably generate text with targeted stylistic changes while maintaining core meaning from the source sentence constant. Many of the existing style transfer benchmarks primarily focus on individual high-level semantic changes (e.g. positive to negative), which enable controllability at a high level but do not offer fine-grained control involving sentence structure,…

    Submitted 12 April, 2021; originally announced April 2021.

    Comments: NAACL 2021, code available at https://github.com/lvyiwei1/StylePTB/

  12. arXiv:2012.04812  [pdf, other]

    cs.CL

    Improving Relation Extraction by Leveraging Knowledge Graph Link Prediction

    Authors: George Stoica, Emmanouil Antonios Platanios, Barnabás Póczos

    Abstract: Relation extraction (RE) aims to predict a relation between a subject and an object in a sentence, while knowledge graph link prediction (KGLP) aims to predict a set of objects, O, given a subject and a relation from a knowledge graph. These two problems are closely related as their respective objectives are intertwined: given a sentence containing a subject and an object o, a RE model predicts a…

    Submitted 8 December, 2020; originally announced December 2020.

  13. arXiv:2010.02500  [pdf, other]

    cs.CL cs.LG

    Efficient Meta Lifelong-Learning with Limited Memory

    Authors: Zirui Wang, Sanket Vaibhav Mehta, Barnabás Póczos, Jaime Carbonell

    Abstract: Current natural language processing models work well on a single task, yet they often fail to continuously learn new tasks without forgetting previous ones as they are re-trained throughout their lifetime, a challenge known as lifelong learning. State-of-the-art lifelong language learning methods store past examples in episodic memory and replay them at both training and inference time. However, a…

    Submitted 6 October, 2020; originally announced October 2020.

    Comments: Published as a main conference paper at EMNLP 2020

  14. arXiv:2009.08424  [pdf, other]

    cs.CL cs.AI cs.LG

    Modeling Task Effects on Meaning Representation in the Brain via Zero-Shot MEG Prediction

    Authors: Mariya Toneva, Otilia Stretcu, Barnabas Poczos, Leila Wehbe, Tom M. Mitchell

    Abstract: How meaning is represented in the brain is still one of the big open questions in neuroscience. Does a word (e.g., bird) always have the same representation, or does the task under which the word is processed alter its representation (answering "can you eat it?" versus "can it fly?")? The brain activity of subjects who read the same word while performing different semantic tasks has been shown to…

    Submitted 15 November, 2020; v1 submitted 17 September, 2020; originally announced September 2020.

    Comments: accepted at NeurIPS 2020

  15. arXiv:2008.08148  [pdf, other]

    cs.CV cs.LG

    Robust Handwriting Recognition with Limited and Noisy Data

    Authors: Hai Pham, Amrith Setlur, Saket Dingliwal, Tzu-Hsiang Lin, Barnabas Poczos, Kang Huang, Zhuo Li, Jae Lim, Collin McCormack, Tam Vu

    Abstract: Despite the advent of deep learning in computer vision, the general handwriting recognition problem is far from solved. Most existing approaches focus on handwriting datasets that have clearly written text and carefully segmented labels. In this paper, we instead focus on learning handwritten characters from maintenance logs, a constrained setting where data is very limited and noisy. We break the…

    Submitted 18 August, 2020; originally announced August 2020.

    Comments: icfhr2020

  16. Deep Generative Models for Galaxy Image Simulations

    Authors: Francois Lanusse, Rachel Mandelbaum, Siamak Ravanbakhsh, Chun-Liang Li, Peter Freeman, Barnabas Poczos

    Abstract: Image simulations are essential tools for preparing and validating the analysis of current and future wide-field optical surveys. However, the galaxy models used as the basis for these simulations are typically limited to simple parametric light profiles, or use a fairly limited amount of available space-based data. In this work, we propose a methodology based on Deep Generative Models to create c…

    Submitted 9 August, 2020; originally announced August 2020.

    Comments: 14 pages, submitted to MNRAS. Comments most welcome

  17. arXiv:2007.12948  [pdf, ps, other]

    eess.AS cs.LG cs.SD stat.ML

    Nonlinear ISA with Auxiliary Variables for Learning Speech Representations

    Authors: Amrith Setlur, Barnabas Poczos, Alan W Black

    Abstract: This paper extends recent work on nonlinear Independent Component Analysis (ICA) by introducing a theoretical framework for nonlinear Independent Subspace Analysis (ISA) in the presence of auxiliary variables. Observed high dimensional acoustic features like log Mel spectrograms can be considered as surface level manifestations of nonlinear transformations over individual multivariate sources of i…

    Submitted 25 July, 2020; originally announced July 2020.

    Comments: To be presented at Interspeech 2020

  18. arXiv:2007.02523  [pdf, other]

    cs.LG stat.ML

    Covariate Distribution Aware Meta-learning

    Authors: Amrith Setlur, Saket Dingliwal, Barnabas Poczos

    Abstract: Meta-learning has proven to be successful for few-shot learning across the regression, classification, and reinforcement learning paradigms. Recent approaches have adopted Bayesian interpretations to improve gradient-based meta-learners by quantifying the uncertainty of the post-adaptation estimates. Most of these works almost completely ignore the latent relationship between the covariate distrib…

    Submitted 27 November, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Journal ref: ICML 2020 Lifelong Learning Workshop

  19. arXiv:2004.14257  [pdf, other]

    cs.CL

    Politeness Transfer: A Tag and Generate Approach

    Authors: Aman Madaan, Amrith Setlur, Tanmay Parekh, Barnabas Poczos, Graham Neubig, Yiming Yang, Ruslan Salakhutdinov, Alan W Black, Shrimai Prabhumoye

    Abstract: This paper introduces a new task of politeness transfer which involves converting non-polite sentences to polite sentences while preserving the meaning. We also provide a dataset of more than 1.39 million instances automatically labeled for politeness to encourage benchmark evaluations on this new task. We design a tag and generate pipeline that identifies stylistic attributes and subsequently generates a…

    Submitted 1 May, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

    Comments: To appear at ACL 2020

  20. arXiv:2004.08597  [pdf, other]

    math.ST cs.LG stat.ML

    Robust Density Estimation under Besov IPM Losses

    Authors: Ananya Uppal, Shashank Singh, Barnabas Poczos

    Abstract: We study minimax convergence rates of nonparametric density estimation in the Huber contamination model, in which a proportion of the data comes from an unknown outlier distribution. We provide the first results for this problem under a large family of losses, called Besov integral probability metrics (IPMs), that includes $\mathcal{L}^p$, Wasserstein, Kolmogorov-Smirnov, and other common distance…

    Submitted 6 September, 2021; v1 submitted 18 April, 2020; originally announced April 2020.

  21. arXiv:2004.05665  [pdf, other]

    cs.LG stat.ML

    Minimizing FLOPs to Learn Efficient Sparse Representations

    Authors: Biswajit Paria, Chih-Kuan Yeh, Ian E. H. Yen, Ning Xu, Pradeep Ravikumar, Barnabás Póczos

    Abstract: Deep representation learning has become one of the most widely adopted approaches for visual search, recommendation, and identification. Retrieval of such representations from a large database is, however, computationally challenging. Approximate methods based on learning compact representations have been widely explored for this problem, such as locality sensitive hashing, product quantization, an…

    Submitted 12 April, 2020; originally announced April 2020.

    Comments: Published at ICLR 2020

  22. arXiv:2002.08528  [pdf, other]

    cs.LG math.OC stat.ML

    Adaptive Sampling Distributed Stochastic Variance Reduced Gradient for Heterogeneous Distributed Datasets

    Authors: Ilqar Ramazanli, Han Nguyen, Hai Pham, Sashank J. Reddi, Barnabas Poczos

    Abstract: We study distributed optimization algorithms for minimizing the average of \emph{heterogeneous} functions distributed across several machines with a focus on communication efficiency. In such settings, naively using the classical stochastic gradient descent (SGD) or its variants (e.g., SVRG) with a uniform sampling of machines typically yields poor performance. It often leads to the dependence of…

    Submitted 17 November, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

  23. arXiv:2002.02431  [pdf, other]

    cs.LG stat.ML

    Optimal Exact Matrix Completion Under new Parametrization

    Authors: Ilqar Ramazanli, Barnabas Poczos

    Abstract: We study the problem of exact completion for an $m \times n$ matrix of rank $r$ with the adaptive sampling method. We introduce a relation of the exact completion problem with the sparsest vector of column and row spaces (which we call \textit{sparsity-number} here). Using this relation, we propose matrix completion algorithms that exactly recover the target matrix. These algorithms are supe…

    Submitted 4 March, 2022; v1 submitted 6 February, 2020; originally announced February 2020.

    Comments: It has been decided that different sections of this work will become part of different projects

  24. arXiv:2001.10119  [pdf, other]

    cs.LG stat.ML

    Unsupervised Program Synthesis for Images By Sampling Without Replacement

    Authors: Chenghui Zhou, Chun-Liang Li, Barnabas Poczos

    Abstract: Program synthesis has emerged as a successful approach to the image parsing task. Most prior works rely on a two-step scheme involving supervised pretraining of a Seq2Seq model with synthetic programs followed by reinforcement learning (RL) for fine-tuning with real reference images. Fully unsupervised approaches promise to train the model directly on the target images without requiring curated pr…

    Submitted 14 June, 2021; v1 submitted 27 January, 2020; originally announced January 2020.

    Comments: Accepted to UAI 2021

    Journal ref: UAI 2021

  25. arXiv:2001.09938  [pdf, other]

    physics.app-ph cs.LG

    Autonomous discovery of battery electrolytes with robotic experimentation and machine-learning

    Authors: Adarsh Dave, Jared Mitchell, Kirthevasan Kandasamy, Sven Burke, Biswajit Paria, Barnabas Poczos, Jay Whitacre, Venkatasubramanian Viswanathan

    Abstract: Innovations in batteries take years to formulate and commercialize, requiring extensive experimentation during the design and optimization phases. We approached the design and selection of a battery electrolyte through a black-box optimization algorithm directly integrated into a robotic test-stand. We report here the discovery of a novel battery electrolyte by this experiment completely guided by…

    Submitted 22 October, 2019; originally announced January 2020.

    Comments: 23 pages, 4 figures, 10 pages of Extended Data

    Journal ref: Cell Reports Physical Science, 1, (2020) 100264

  26. arXiv:1912.10787  [pdf, other]

    cs.GR cs.LG stat.ML

    Learned Interpolation for 3D Generation

    Authors: Austin Dill, Songwei Ge, Eunsu Kang, Chun-Liang Li, Barnabas Poczos

    Abstract: In order to generate novel 3D shapes with machine learning, one must allow for interpolation. The typical approach for incorporating this creative process is to interpolate in a learned latent space so as to avoid the problem of generating unrealistic instances by exploiting the model's learned structure. The process of the interpolation is supposed to form a semantically smooth morphing. While th…

    Submitted 24 January, 2020; v1 submitted 8 December, 2019; originally announced December 2019.

    Comments: Creativity and Design Workshop at NeurIPS 2019

  27. arXiv:1911.11960  [pdf, other]

    cs.CV cs.LG

    LucidDream: Controlled Temporally-Consistent DeepDream on Videos

    Authors: Joel Ruben Antony Moniz, Eunsu Kang, Barnabás Póczos

    Abstract: In this work, we aim to propose a set of techniques to improve the controllability and aesthetic appeal when DeepDream, which uses a pre-trained neural network to modify images by hallucinating objects into them, is applied to videos. In particular, we demonstrate a simple modification that improves control over the class of object that DeepDream is induced to hallucinate. We also show that the fl…

    Submitted 27 November, 2019; originally announced November 2019.

    Comments: Workshop on Machine Learning for Creativity and Design, NeurIPS 2019

  28. arXiv:1911.07427  [pdf, other]

    cs.LG stat.ML

    RotationOut as a Regularization Method for Neural Network

    Authors: Kai Hu, Barnabas Poczos

    Abstract: In this paper, we propose a novel regularization method, RotationOut, for neural networks. Different from Dropout that handles each neuron/channel independently, RotationOut regards its input layer as an entire vector and introduces regularization by randomly rotating the vector. RotationOut can also be used in convolutional layers and recurrent layers with small modifications. We further use a no…

    Submitted 17 November, 2019; originally announced November 2019.

    Comments: 20 pages, 8 figures

  29. arXiv:1910.10211  [pdf, other]

    cs.LG stat.ML

    Better Approximate Inference for Partial Likelihood Models with a Latent Structure

    Authors: Amrith Setlur, Barnabás Póczos

    Abstract: Temporal Point Processes (TPP) with partial likelihoods involving a latent structure often entail an intractable marginalization, thus making inference hard. We propose a novel approach to Maximum Likelihood Estimation (MLE) involving approximate inference over the latent variables by minimizing a tight upper bound on the approximation gap. Given a discrete latent variable $Z$, the proposed approx…

    Submitted 19 December, 2019; v1 submitted 22 October, 2019; originally announced October 2019.

    Journal ref: NeurIPS 2019 Workshop on Learning with Temporal Point Processes

  30. arXiv:1910.07029  [pdf, other]

    hep-ex physics.ins-det

    End-to-end particle and event identification at the Large Hadron Collider with CMS Open Data

    Authors: John Alison, Sitong An, Michael Andrews, Patrick Bryant, Bjorn Burkle, Sergei Gleyzer, Ulrich Heintz, Meenakshi Narain, Manfred Paulini, Barnabas Poczos, Emanuele Usai

    Abstract: From particle identification to the discovery of the Higgs boson, deep learning algorithms have become an increasingly important tool for data analysis at the Large Hadron Collider (LHC). We present an innovative end-to-end deep learning approach for jet identification at the Compact Muon Solenoid (CMS) experiment at the LHC. The method combines deep neural networks with low-level detector informa…

    Submitted 15 October, 2019; originally announced October 2019.

    Comments: Talk presented at the 2019 Meeting of the Division of Particles and Fields of the American Physical Society (DPF2019), July 29 - August 2, 2019, Northeastern University, Boston, C1907293

  31. arXiv:1908.07587  [pdf, other]

    cs.LG cs.AI cs.GR stat.ML

    Developing Creative AI to Generate Sculptural Objects

    Authors: Songwei Ge, Austin Dill, Eunsu Kang, Chun-Liang Li, Lingyao Zhang, Manzil Zaheer, Barnabas Poczos

    Abstract: We explore the intersection of human and machine creativity by generating sculptural objects through machine learning. This research raises questions about both the technical details of automatic art generation and the interaction between AI and people, as both artists and the audience of art. We introduce two algorithms for generating 3D point clouds and then discuss their actualization as sculpt…

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: In the Proceedings of International Symposium on Electronic Art (ISEA 2019)

  32. arXiv:1908.01425  [pdf, other]

    cs.LG physics.chem-ph stat.ML

    ChemBO: Bayesian Optimization of Small Organic Molecules with Synthesizable Recommendations

    Authors: Ksenia Korovina, Sailun Xu, Kirthevasan Kandasamy, Willie Neiswanger, Barnabas Poczos, Jeff Schneider, Eric P. Xing

    Abstract: In applications such as molecule design or drug discovery, it is desirable to have an algorithm which recommends new candidate molecules based on the results of past tests. These molecules first need to be synthesized and then tested for objective properties. We describe ChemBO, a Bayesian optimization framework for generating and optimizing organic molecules for desired molecular properties. Whil…

    Submitted 21 October, 2019; v1 submitted 4 August, 2019; originally announced August 2019.

  33. arXiv:1906.08809  [pdf, other]

    cs.LG cs.AI stat.ML

    A Deep Reinforcement Learning Approach for Global Routing

    Authors: Haiguang Liao, Wentai Zhang, Xuliang Dong, Barnabas Poczos, Kenji Shimada, Levent Burak Kara

    Abstract: Global routing has been a historically challenging problem in electronic circuit design, where the challenge is to connect a large and arbitrary number of circuit components with wires without violating the design rules for the printed circuit boards or integrated circuits. Similar routing problems also exist in the design of complex hydraulic systems, pipe systems and logistic networks. Existing…

    Submitted 20 June, 2019; originally announced June 2019.

    Comments: Preprint submitted to ASME JMD

  34. arXiv:1905.13192  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Graph Neural Tangent Kernel: Fusing Graph Neural Networks with Graph Kernels

    Authors: Simon S. Du, Kangcheng Hou, Barnabás Póczos, Ruslan Salakhutdinov, Ruosong Wang, Keyulu Xu

    Abstract: While graph kernels (GKs) are easy to train and enjoy provable theoretical guarantees, their practical performances are limited by their expressive power, as the kernel function often depends on hand-crafted combinatorial features of graphs. Compared to graph kernels, graph neural networks (GNNs) usually achieve better practical performance, as GNNs use multi-layer architectures and non-linear act…

    Submitted 4 November, 2019; v1 submitted 30 May, 2019; originally announced May 2019.

    Comments: In NeurIPS 2019. Code available: https://github.com/KangchengHou/gntk

  35. arXiv:1904.10037  [pdf, other]

    cs.CV cs.LG

    LBS Autoencoder: Self-supervised Fitting of Articulated Meshes to Point Clouds

    Authors: Chun-Liang Li, Tomas Simon, Jason Saragih, Barnabás Póczos, Yaser Sheikh

    Abstract: We present LBS-AE; a self-supervised autoencoding algorithm for fitting articulated mesh models to point clouds. As input, we take a sequence of point clouds to be registered as well as an artist-rigged mesh, i.e. a template mesh equipped with a linear-blend skinning (LBS) deformation space parameterized by a skeleton hierarchy. As output, we learn an LBS-based autoencoder that produces registered…

    Submitted 22 April, 2019; originally announced April 2019.

    Comments: In the Proceedings of IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2019)

  36. arXiv:1903.09848  [pdf, other]

    cs.CL cs.LG stat.ML

    Competence-based Curriculum Learning for Neural Machine Translation

    Authors: Emmanouil Antonios Platanios, Otilia Stretcu, Graham Neubig, Barnabas Poczos, Tom M. Mitchell

    Abstract: Current state-of-the-art NMT systems use large neural networks that are not only slow to train, but also often require many heuristics and optimization tricks, such as specialized learning rate schedules and large batch sizes. This is undesirable as it requires extensive hyperparameter tuning. In this paper, we propose a curriculum learning framework for NMT that reduces training time, reduces the…

    Submitted 26 March, 2019; v1 submitted 23 March, 2019; originally announced March 2019.

    Journal ref: NAACL 2019

  37. arXiv:1903.06694  [pdf, other]

    stat.ML cs.AI cs.LG

    Tuning Hyperparameters without Grad Students: Scalable and Robust Bayesian Optimisation with Dragonfly

    Authors: Kirthevasan Kandasamy, Karun Raju Vysyaraju, Willie Neiswanger, Biswajit Paria, Christopher R. Collins, Jeff Schneider, Barnabas Poczos, Eric P. Xing

    Abstract: Bayesian Optimisation (BO) refers to a suite of techniques for global optimisation of expensive black box functions, which use introspective Bayesian models of the function to efficiently search for the optimum. While BO has been applied successfully in many applications, modern optimisation tasks usher in new challenges where conventional methods fail spectacularly. In this work, we present Drago…

    Submitted 19 April, 2020; v1 submitted 15 March, 2019; originally announced March 2019.

    Comments: Journal of Machine Learning Research 2020, Special Issue on Bayesian Optimization

  38. arXiv:1902.10214  [pdf, other]

    stat.ML cs.AI cs.LG

    Implicit Kernel Learning

    Authors: Chun-Liang Li, Wei-Cheng Chang, Youssef Mroueh, Yiming Yang, Barnabás Póczos

    Abstract: Kernels are powerful and versatile tools in machine learning and statistics. Although the notion of universal kernels and characteristic kernels has been studied, kernel selection still greatly influences the empirical performance. While learning the kernel in a data driven way has been investigated, in this paper we explore learning the spectral distribution of kernel via implicit generative mode…

    Submitted 26 February, 2019; originally announced February 2019.

    Comments: In the Proceedings of the 22nd International Conference on Artificial Intelligence and Statistics (AISTATS 2019)

  39. arXiv:1902.10159  [pdf, ps, other]

    astro-ph.IM astro-ph.CO

    The Role of Machine Learning in the Next Decade of Cosmology

    Authors: Michelle Ntampaka, Camille Avestruz, Steven Boada, Joao Caldeira, Jessi Cisewski-Kehe, Rosanne Di Stefano, Cora Dvorkin, August E. Evrard, Arya Farahi, Doug Finkbeiner, Shy Genel, Alyssa Goodman, Andy Goulding, Shirley Ho, Arthur Kosowsky, Paul La Plante, Francois Lanusse, Michelle Lochner, Rachel Mandelbaum, Daisuke Nagai, Jeffrey A. Newman, Brian Nord, J. E. G. Peek, Austin Peel, Barnabas Poczos, et al. (5 additional authors not shown)

    Abstract: In recent years, machine learning (ML) methods have remarkably improved how cosmologists can interpret data. The next decade will bring new opportunities for data-driven cosmological discovery, but will also present new challenges for adopting ML methodologies and understanding the results. ML could transform our field, but this transformation will require the astronomy community to both foster an…

    Submitted 14 January, 2021; v1 submitted 26 February, 2019; originally announced February 2019.

    Comments: Submitted to the Astro2020 call for science white papers

  40. arXiv:1902.08276  [pdf, ps, other]

    hep-ex cs.CV cs.LG physics.data-an

    End-to-End Jet Classification of Quarks and Gluons with the CMS Open Data

    Authors: Michael Andrews, John Alison, Sitong An, Patrick Bryant, Bjorn Burkle, Sergei Gleyzer, Meenakshi Narain, Manfred Paulini, Barnabas Poczos, Emanuele Usai

    Abstract: We describe the construction of end-to-end jet image classifiers based on simulated low-level detector data to discriminate quark- vs. gluon-initiated jets with high-fidelity simulated CMS Open Data. We highlight the importance of precise spatial information and demonstrate competitive performance to existing state-of-the-art jet classifiers. We further generalize the end-to-end approach to event-…

    Submitted 23 October, 2020; v1 submitted 21 February, 2019; originally announced February 2019.

    Comments: 10 pages, 5 figures, 7 tables; v2: published version

    Journal ref: Nucl. Instrum. Methods Phys. Res. A 977, 164304 (2020)

  41. A Robust and Efficient Deep Learning Method for Dynamical Mass Measurements of Galaxy Clusters

    Authors: Matthew Ho, Markus Michael Rau, Michelle Ntampaka, Arya Farahi, Hy Trac, Barnabas Poczos

    Abstract: We demonstrate the ability of convolutional neural networks (CNNs) to mitigate systematics in the virial scaling relation and produce dynamical mass estimates of galaxy clusters with remarkably low bias and scatter. We present two models, CNN$_\mathrm{1D}$ and CNN$_\mathrm{2D}$, which leverage this deep learning tool to infer cluster masses from distributions of member galaxy dynamics. Our first m…

    Submitted 22 December, 2020; v1 submitted 15 February, 2019; originally announced February 2019.

    Comments: 22 pages, 10 figures, 4 tables, accepted for publication at ApJ

    Journal ref: 2019 ApJ, 887, 25

  42. arXiv:1902.03511  [pdf, other]

    math.ST cs.IT cs.LG stat.ML

    Nonparametric Density Estimation & Convergence Rates for GANs under Besov IPM Losses

    Authors: Ananya Uppal, Shashank Singh, Barnabás Póczos

    Abstract: We study the problem of estimating a nonparametric probability density under a large family of losses called Besov IPMs, which include, for example, $\mathcal{L}^p$ distances, total variation distance, and generalizations of both Wasserstein and Kolmogorov-Smirnov distances. For a wide variety of settings, we provide both lower and upper bounds, identifying precisely how the choice of loss functio…

    Submitted 13 January, 2020; v1 submitted 9 February, 2019; originally announced February 2019.

    Comments: Advances in Neural Information Processing Systems. 2019

  43. arXiv:1901.11515  [pdf, other]

    cs.LG cs.AI stat.ML

    ProBO: Versatile Bayesian Optimization Using Any Probabilistic Programming Language

    Authors: Willie Neiswanger, Kirthevasan Kandasamy, Barnabas Poczos, Jeff Schneider, Eric Xing

    Abstract: Optimizing an expensive-to-query function is a common task in science and engineering, where it is beneficial to keep the number of queries to a minimum. A popular strategy is Bayesian optimization (BO), which leverages probabilistic models for this task. Most BO today uses Gaussian processes (GPs), or a few other surrogate models. However, there is a broad set of Bayesian modeling techniques that…

    Submitted 4 July, 2019; v1 submitted 31 January, 2019; originally announced January 2019.

  44. arXiv:1901.06077  [pdf, other]

    stat.ML cs.LG

    Kernel Change-point Detection with Auxiliary Deep Generative Models

    Authors: Wei-Cheng Chang, Chun-Liang Li, Yiming Yang, Barnabás Póczos

    Abstract: Detecting the emergence of abrupt property changes in time series is a challenging problem. Kernel two-sample test has been studied for this task which makes fewer assumptions on the distributions than traditional parametric approaches. However, selecting kernels is non-trivial in practice. Although kernel selection for two-sample test has been studied, the insufficient samples in change point det…

    Submitted 17 January, 2019; originally announced January 2019.

    Comments: To appear in ICLR 2019

  45. arXiv:1812.07809  [pdf, other]

    cs.LG cs.CL cs.CV cs.HC stat.ML

    Found in Translation: Learning Robust Joint Representations by Cyclic Translations Between Modalities

    Authors: Hai Pham, Paul Pu Liang, Thomas Manzini, Louis-Philippe Morency, Barnabas Poczos

    Abstract: Multimodal sentiment analysis is a core research area that studies speaker sentiment expressed from the language, visual, and acoustic modalities. The central challenge in multimodal learning involves inferring joint representations that can process and relate information from these modalities. However, existing work learns joint representations by requiring all modalities as input and as a result…

    Submitted 28 February, 2020; v1 submitted 19 December, 2018; originally announced December 2018.

    Comments: AAAI 2019, code available at https://github.com/hainow/MCTN

  46. arXiv:1811.09751  [pdf, other]

    cs.LG stat.ML

    Characterizing and Avoiding Negative Transfer

    Authors: Zirui Wang, Zihang Dai, Barnabás Póczos, Jaime Carbonell

    Abstract: When labeled data is scarce for a specific target task, transfer learning often offers an effective solution by utilizing data from a related source task. However, when transferring knowledge from a less related source, it may inversely hurt the target performance, a phenomenon known as negative transfer. Despite its pervasiveness, negative transfer is usually described in an informal manner, lack…

    Submitted 4 October, 2019; v1 submitted 23 November, 2018; originally announced November 2018.

    Comments: Published at CVPR 2019

  47. arXiv:1811.06533  [pdf, other]

    astro-ph.CO cs.AI cs.LG

    Learning to Predict the Cosmological Structure Formation

    Authors: Siyu He, Yin Li, Yu Feng, Shirley Ho, Siamak Ravanbakhsh, Wei Chen, Barnabás Póczos

    Abstract: Matter evolved under influence of gravity from minuscule density fluctuations. Non-perturbative structure formed hierarchically over all scales, and developed non-Gaussian features in the Universe, known as the Cosmic Web. To fully understand the structure formation of the Universe is one of the holy grails of modern astrophysics. Astrophysicists survey large volumes of the Universe and employ a l…

    Submitted 31 July, 2019; v1 submitted 15 November, 2018; originally announced November 2018.

    Comments: 8 pages, 5 figures, 1 table

    Journal ref: PNAS July 9, 2019 116 (28) 13825-13832

  48. arXiv:1811.05389  [pdf, other]

    cs.AI cs.CV

    Hallucinating Point Cloud into 3D Sculptural Object

    Authors: Chun-Liang Li, Eunsu Kang, Songwei Ge, Lingyao Zhang, Austin Dill, Manzil Zaheer, Barnabas Poczos

    Abstract: Our team of artists and machine learning researchers designed a creative algorithm that can generate authentic sculptural artworks. These artworks do not mimic any given forms and cannot be easily categorized into the dataset categories. Our approach extends DeepDream from images to 3D point clouds. The proposed algorithm, Amalgamated DeepDream (ADD), leverages the properties of point clouds to cr…

    Submitted 28 November, 2018; v1 submitted 13 November, 2018; originally announced November 2018.

    Comments: Accepted by Second Workshop on Machine Learning for Creativity and Design, NIPS 2018

  49. arXiv:1810.05795  [pdf, other]

    cs.LG stat.ML

    Point Cloud GAN

    Authors: Chun-Liang Li, Manzil Zaheer, Yang Zhang, Barnabas Poczos, Ruslan Salakhutdinov

    Abstract: Generative Adversarial Networks (GAN) can achieve promising performance on learning complex data distributions on different types of data. In this paper, we first show a straightforward extension of existing GAN algorithm is not applicable to point clouds, because the constraint required for discriminators is undefined for set data. We propose a two fold modification to GAN algorithm for learning…

    Submitted 13 October, 2018; originally announced October 2018.

  50. arXiv:1810.02054  [pdf, other]

    cs.LG math.OC stat.ML

    Gradient Descent Provably Optimizes Over-parameterized Neural Networks

    Authors: Simon S. Du, Xiyu Zhai, Barnabas Poczos, Aarti Singh

    Abstract: One of the mysteries in the success of neural networks is randomly initialized first order methods like gradient descent can achieve zero training loss even though the objective function is non-convex and non-smooth. This paper demystifies this surprising phenomenon for two-layer fully connected ReLU activated neural networks. For an $m$ hidden node shallow neural network with ReLU activation and…

    Submitted 4 February, 2019; v1 submitted 4 October, 2018; originally announced October 2018.

    Comments: ICLR 2019