Skip to main content

Showing 1–50 of 148 results for author: Nowak, R

.
  1. arXiv:2406.10522  [pdf, other

    cs.LG cs.AI cs.CL

    Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning

    Authors: Jifan Zhang, Lalit Jain, Yang Guo, Jiayi Chen, Kuan Lok Zhou, Siddharth Suresh, Andrew Wagenmaker, Scott Sievert, Timothy Rogers, Kevin Jamieson, Robert Mankoff, Robert Nowak

    Abstract: We present a novel multimodal preference dataset for creative tasks, consisting of over 250 million human ratings on more than 2.2 million captions, collected through crowdsourcing rating data for The New Yorker's weekly cartoon caption contest over the past eight years. This unique dataset supports the development and evaluation of multimodal large language models and preference-based fine-tuning… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  2. arXiv:2406.05064  [pdf, other

    cs.LG

    Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning

    Authors: Subhojyoti Mukherjee, Josiah P. Hanna, Qiaomin Xie, Robert Nowak

    Abstract: In this paper, we study multi-task structured bandit problem where the goal is to learn a near-optimal algorithm that minimizes cumulative regret. The tasks share a common structure and the algorithm exploits the shared structure to minimize the cumulative regret for an unseen but related test task. We use a transformer as a decision-making algorithm to learn this shared structure so as to general… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  3. arXiv:2406.02529  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    ReLUs Are Sufficient for Learning Implicit Neural Representations

    Authors: Joseph Shenouda, Yamin Zhou, Robert D. Nowak

    Abstract: Motivated by the growing theoretical understanding of neural networks that employ the Rectified Linear Unit (ReLU) as their activation function, we revisit the use of ReLU activation functions for learning implicit neural representations (INRs). Inspired by second order B-spline wavelets, we incorporate a set of simple constraints to the ReLU neurons in each layer of a deep neural network (DNN) to… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted to ICML 2024

  4. arXiv:2406.02165  [pdf, other

    cs.LG

    SaVeR: Optimal Data Collection Strategy for Safe Policy Evaluation in Tabular MDP

    Authors: Subhojyoti Mukherjee, Josiah P. Hanna, Robert Nowak

    Abstract: In this paper, we study safe data collection for the purpose of policy evaluation in tabular Markov decision processes (MDPs). In policy evaluation, we are given a \textit{target} policy and asked to estimate the expected cumulative reward it will obtain. Policy evaluation requires data and we are interested in the question of what \textit{behavior} policy should collect the data for the most accu… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

  5. arXiv:2404.10784  [pdf, other

    cs.SI cs.LG

    Graph Vertex Embeddings: Distance, Regularization and Community Detection

    Authors: Radosław Nowak, Adam Małkowski, Daniel Cieślak, Piotr Sokół, Paweł Wawrzyński

    Abstract: Graph embeddings have emerged as a powerful tool for representing complex network structures in a low-dimensional space, enabling the use of efficient methods that employ the metric structure in the embedding space as a proxy for the topological structure of the data. In this paper, we explore several aspects that affect the quality of a vertex embedding of graph-structured data. To this effect, w… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    MSC Class: I.2.6

  6. arXiv:2402.07102  [pdf, other

    cs.LG cs.AI

    Future Prediction Can be a Strong Evidence of Good History Representation in Partially Observable Environments

    Authors: Jeongyeol Kwon, Liu Yang, Robert Nowak, Josiah Hanna

    Abstract: Learning a good history representation is one of the core challenges of reinforcement learning (RL) in partially observable environments. Recent works have shown the advantages of various auxiliary tasks for facilitating representation learning. However, the effectiveness of such auxiliary tasks has not been fully convincing, especially in partially observable environments that require long-term m… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

  7. arXiv:2402.04896  [pdf, other

    cs.NI cs.LG

    Learning from the Best: Active Learning for Wireless Communications

    Authors: Nasim Soltani, Jifan Zhang, Batool Salehi, Debashri Roy, Robert Nowak, Kaushik Chowdhury

    Abstract: Collecting an over-the-air wireless communications training dataset for deep learning-based communication tasks is relatively simple. However, labeling the dataset requires expert involvement and domain knowledge, may involve private intellectual properties, and is often computationally and financially expensive. Active learning is an emerging area of research in machine learning that aims to redu… ▽ More

    Submitted 23 January, 2024; originally announced February 2024.

  8. arXiv:2401.06692  [pdf, other

    cs.CL cs.AI cs.LG

    An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models

    Authors: Gantavya Bhatt, Yifang Chen, Arnav M. Das, Jifan Zhang, Sang T. Truong, Stephen Mussmann, Yinglun Zhu, Jeffrey Bilmes, Simon S. Du, Kevin Jamieson, Jordan T. Ash, Robert D. Nowak

    Abstract: Supervised finetuning (SFT) on instruction datasets has played a crucial role in achieving the remarkable zero-shot generalization capabilities observed in modern large language models (LLMs). However, the annotation efforts required to produce high quality responses for instructions are becoming prohibitively expensive, especially as the number of tasks spanned by instruction datasets continues t… ▽ More

    Submitted 6 May, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  9. arXiv:2401.02756  [pdf, other

    cs.DC q-bio.GN

    Lock-free de Bruijn graph

    Authors: Daniel Górniak, Robert Nowak

    Abstract: De Bruijn graph is one of the most important data structures used in de-novo genome assembly algorithms, especially for NGS data. There is a growing need for parallel data structures and algorithms due to the increasing number of cores in modern computers. The assembly task is an indispensable step in sequencing genomes of new organisms and studying structural genomic changes. In recent years, the… ▽ More

    Submitted 5 January, 2024; originally announced January 2024.

    ACM Class: H.4

  10. arXiv:2312.09196  [pdf, other

    cs.LG cs.AI cs.CV

    DIRECT: Deep Active Learning under Imbalance and Label Noise

    Authors: Shyam Nuggehalli, Jifan Zhang, Lalit Jain, Robert Nowak

    Abstract: Class imbalance is a prevalent issue in real world machine learning applications, often leading to poor performance in rare and minority classes. With an abundance of wild unlabeled data, active learning is perhaps the most effective technique in solving the problem at its root -- collecting a more balanced and informative set of labeled examples during annotation. Label noise is another common is… ▽ More

    Submitted 20 May, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

  11. arXiv:2311.12424  [pdf, other

    cs.LG cs.NE

    Looped Transformers are Better at Learning Learning Algorithms

    Authors: Liu Yang, Kangwook Lee, Robert Nowak, Dimitris Papailiopoulos

    Abstract: Transformers have demonstrated effectiveness in in-context solving data-fitting problems from various (latent) models, as reported by Garg et al. However, the absence of an inherent iterative structure in the transformer architecture presents a challenge in emulating the iterative algorithms, which are commonly employed in traditional machine learning methods. To address this, we propose the utili… ▽ More

    Submitted 16 March, 2024; v1 submitted 21 November, 2023; originally announced November 2023.

    Comments: Accepted for publication at ICLR 2024

  12. arXiv:2311.00327  [pdf, other

    cs.LG

    Multi-task Representation Learning for Pure Exploration in Bilinear Bandits

    Authors: Subhojyoti Mukherjee, Qiaomin Xie, Josiah P. Hanna, Robert Nowak

    Abstract: We study multi-task representation learning for the problem of pure exploration in bilinear bandits. In bilinear bandits, an action takes the form of a pair of arms from two different entity types and the reward is a bilinear function of the known feature vectors of the arms. In the \textit{multi-task bilinear bandit problem}, we aim to find optimal actions for multiple tasks that share a common l… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Accepted in 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  13. arXiv:2309.12535  [pdf, other

    cond-mat.supr-con cond-mat.str-el

    Superconductivity in Compositionally-Complex Cuprates with the YBa$_2$Cu$_3$O$_{7-x}$ Structure

    Authors: Aditya Raghavan, Nathan Arndt, Nayelie Morales-Colón, Eli Wennen, Megan Wolfe, Carolina Oliveira Gandin, Kade Nelson, Robert Nowak, Sam Dillon, Keon Sahebkar, Ryan F. Need

    Abstract: High-temperature superconductivity is reported in a series of compositionally-complex cuprates with varying degrees of size and spin disorder. Three compositions of Y-site alloyed YBa$_2$Cu$_3$O$_{7-x}$, i.e., (5Y)BCO, were prepared using solid-state methods with different sets of rare earth ions on the Y-site. Synchrotron X-ray diffraction and energy-dispersive X-ray spectroscopy confirm these sa… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 6 pages, 3 figures, 1 table

    Journal ref: Phys. Rev. Mater. 8, 024801 (2024)

  14. arXiv:2309.01753  [pdf, other

    math.OC cs.LG

    On Penalty Methods for Nonconvex Bilevel Optimization and First-Order Stochastic Approximation

    Authors: Jeongyeol Kwon, Dohyun Kwon, Stephen Wright, Robert Nowak

    Abstract: In this work, we study first-order algorithms for solving Bilevel Optimization (BO) where the objective functions are smooth but possibly nonconvex in both levels and the variables are restricted to closed convex sets. As a first step, we study the landscape of BO through the lens of penalty methods, in which the upper- and lower-level objectives are combined in a weighted sum with penalty paramet… ▽ More

    Submitted 11 February, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: ICLR 2024

  15. arXiv:2307.15772  [pdf, ps, other

    stat.ML cs.LG math.NA

    Weighted variation spaces and approximation by shallow ReLU networks

    Authors: Ronald DeVore, Robert D. Nowak, Rahul Parhi, Jonathan W. Siegel

    Abstract: We investigate the approximation of functions $f$ on a bounded domain $Ω\subset \mathbb{R}^d$ by the outputs of single-hidden-layer ReLU neural networks of width $n$. This form of nonlinear $n$-term dictionary approximation has been intensely studied since it is the simplest case of neural network approximation (NNA). There are several celebrated approximation results for this form of NNA that int… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  16. arXiv:2306.09910  [pdf, other

    cs.LG cs.AI cs.CV

    LabelBench: A Comprehensive Framework for Benchmarking Adaptive Label-Efficient Learning

    Authors: Jifan Zhang, Yifang Chen, Gregory Canal, Stephen Mussmann, Arnav M. Das, Gantavya Bhatt, Yinglun Zhu, Jeffrey Bilmes, Simon Shaolei Du, Kevin Jamieson, Robert D Nowak

    Abstract: Labeled data are critical to modern machine learning applications, but obtaining labels can be expensive. To mitigate this cost, machine learning methods, such as transfer learning, semi-supervised learning and active learning, aim to be label-efficient: achieving high predictive performance from relatively few labeled examples. While obtaining the best label-efficiency in practice often requires… ▽ More

    Submitted 1 March, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

  17. arXiv:2306.09158  [pdf, other

    cs.LG

    Feed Two Birds with One Scone: Exploiting Wild Data for Both Out-of-Distribution Generalization and Detection

    Authors: Haoyue Bai, Gregory Canal, Xuefeng Du, Jeongyeol Kwon, Robert Nowak, Yixuan Li

    Abstract: Modern machine learning models deployed in the wild can encounter both covariate and semantic shifts, giving rise to the problems of out-of-distribution (OOD) generalization and OOD detection respectively. While both problems have received significant research attention lately, they have been pursued independently. This may not be surprising, since the two tasks have seemingly conflicting goals. T… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: ICML 2023

  18. arXiv:2305.16534  [pdf, other

    stat.ML cs.LG

    Variation Spaces for Multi-Output Neural Networks: Insights on Multi-Task Learning and Network Compression

    Authors: Joseph Shenouda, Rahul Parhi, Kangwook Lee, Robert D. Nowak

    Abstract: This paper introduces a novel theoretical framework for the analysis of vector-valued neural networks through the development of vector-valued variation spaces, a new class of reproducing kernel Banach spaces. These spaces emerge from studying the regularization effect of weight decay in training networks with activations like the rectified linear unit (ReLU). This framework offers a deeper unders… ▽ More

    Submitted 9 March, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

  19. arXiv:2302.07972  [pdf, other

    eess.IV

    Filtered Iterative Denoising for Linear Inverse Problems

    Authors: Danica Fliss, Willem Marais, Robert D. Nowak

    Abstract: Iterative denoising algorithms (IDAs) have been tremendously successful in a range of linear inverse problems arising in signal and image processing. The classic instance of this is the famous Iterative Soft-Thresholding Algorithm (ISTA), based on soft-thresholding of wavelet coefficients. More modern approaches to IDAs replace soft-thresholding with a black-box denoiser, such as BM3D or a learned… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

  20. arXiv:2302.07317  [pdf, other

    cs.LG cs.AI cs.CV

    Algorithm Selection for Deep Active Learning with Imbalanced Datasets

    Authors: Jifan Zhang, Shuai Shao, Saurabh Verma, Robert Nowak

    Abstract: Label efficiency has become an increasingly important objective in deep learning applications. Active learning aims to reduce the number of labeled examples needed to train deep networks, but the empirical performance of active learning algorithms can vary dramatically across datasets and applications. It is difficult to know in advance which active learning strategy will perform well or best in a… ▽ More

    Submitted 2 November, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

  21. arXiv:2301.12357  [pdf, other

    stat.ML cs.LG

    SPEED: Experimental Design for Policy Evaluation in Linear Heteroscedastic Bandits

    Authors: Subhojyoti Mukherjee, Qiaomin Xie, Josiah Hanna, Robert Nowak

    Abstract: In this paper, we study the problem of optimal data collection for policy evaluation in linear bandits. In policy evaluation, we are given a target policy and asked to estimate the expected reward it will obtain when executed in a multi-armed bandit environment. Our work is the first work that focuses on such optimal data collection strategy for policy evaluation involving heteroscedastic reward n… ▽ More

    Submitted 29 February, 2024; v1 submitted 28 January, 2023; originally announced January 2023.

  22. arXiv:2301.10945  [pdf, other

    math.OC cs.AI cs.LG

    A Fully First-Order Method for Stochastic Bilevel Optimization

    Authors: Jeongyeol Kwon, Dohyun Kwon, Stephen Wright, Robert Nowak

    Abstract: We consider stochastic unconstrained bilevel optimization problems when only the first-order gradient oracles are available. While numerous optimization methods have been proposed for tackling bilevel problems, existing methods either tend to require possibly expensive calculations regarding Hessians of lower-level objectives, or lack rigorous finite-time performance guarantees. In this work, we p… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

  23. arXiv:2301.09554  [pdf, other

    stat.ML cs.LG eess.SP

    Deep Learning Meets Sparse Regularization: A Signal Processing Perspective

    Authors: Rahul Parhi, Robert D. Nowak

    Abstract: Deep learning has been wildly successful in practice and most state-of-the-art machine learning methods are based on neural networks. Lacking, however, is a rigorous mathematical theory that adequately explains the amazing performance of deep neural networks. In this article, we present a relatively new mathematical framework that provides the beginning of a deeper understanding of deep learning.… ▽ More

    Submitted 8 June, 2023; v1 submitted 23 January, 2023; originally announced January 2023.

    Journal ref: IEEE Signal Processing Magazine, vol. 40, no. 6, pp. 63-74, Sept. 2023

  24. arXiv:2210.16727  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Spin-polarized transport in magnetic tunnel junctions with ZnTe barriers

    Authors: W. G. Wang, C. Ni, A. Ozbay, L. R. Shah, X. Fan, X. M. Kou, E. R. Nowak, J. Q. Xiao

    Abstract: Magnetic tunnel junctions with wide band gap semiconductor ZnTe barrier were fabricated. A very low barrier height and sizable magnetoresistance were observed in the Fe/ZnTe/Fe junctions at room temperature. The nonlinear I-V characteristic curve confirmed the observed magnetoresistance is due to spin-dependent tunneling effect. Temperature dependent study indicated that the total conductance of t… ▽ More

    Submitted 29 October, 2022; originally announced October 2022.

  25. arXiv:2210.09691  [pdf, other

    cs.PF

    OpenStack and Google Cloud performance comparison in Infrastructure as a Service model

    Authors: Michał Łątkowski, Robert Nowak

    Abstract: Cloud computing is becoming common, and the choice of proper infrastructure is essential. One of main issues is choosing between private and public clound, between commercial and non-commercial solutions. This paper aims to compare the parameters of OpenStack and Google Cloud systems. Both systems deliver a computing cloud service, enabling the user to use the infrastructure as a service (IaaS) mo… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: 22 pages, 10 figures

    ACM Class: C.4

  26. arXiv:2210.08367  [pdf, ps, other

    cs.LG stat.ML

    Active Learning with Neural Networks: Insights from Nonparametric Statistics

    Authors: Yinglun Zhu, Robert Nowak

    Abstract: Deep neural networks have great representation power, but typically require large numbers of training examples. This motivates deep active learning methods that can significantly reduce the amount of labeled training data. Empirical successes of deep active learning have been recently reported in the literature, however, rigorous label complexity guarantees of deep active learning have remained el… ▽ More

    Submitted 15 October, 2022; originally announced October 2022.

    Comments: To appear at NeurIPS 2022

  27. arXiv:2210.06865  [pdf, other

    q-bio.GN cs.CE

    Fast genomic optical map assembly algorithm using binary representation

    Authors: Przemysław Stawczyk, Robert Nowak

    Abstract: Reducing the cost of sequencing genomes provided by next-generation sequencing technologies has greatly increased the number of genomic projects. As a result, there is a growing need for better assembly and assembly validation methods. One promising idea is to use heterogeneous data in assembly projects. Optical Map** (OM) is beneficial in validating genomic assemblies, correction and scaffoldin… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    ACM Class: H.4; I.7

  28. arXiv:2210.03069  [pdf, other

    cs.LG

    PathProx: A Proximal Gradient Algorithm for Weight Decay Regularized Deep Neural Networks

    Authors: Liu Yang, Jifan Zhang, Joseph Shenouda, Dimitris Papailiopoulos, Kangwook Lee, Robert D. Nowak

    Abstract: Weight decay is one of the most widely used forms of regularization in deep learning, and has been shown to improve generalization and robustness. The optimization objective driving weight decay is a sum of losses plus a term proportional to the sum of squared weights. This paper argues that stochastic gradient descent (SGD) may be an inefficient algorithm for this objective. For neural networks w… ▽ More

    Submitted 5 July, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

  29. arXiv:2207.08102  [pdf

    cond-mat.mtrl-sci

    Comprehensive structural changes in nanoscale-deformed silicon modelled with an integrated atomic potential

    Authors: Rafał Abram, Dariusz Chrobak, Jesper Byggmästar, Kai H. Nordlund, Roman Nowak

    Abstract: In spite of remarkable developments in the field of advanced materials, silicon remains one of the foremost semiconductors of the day. Of enduring relevance to science and technology is silicon's nanomechanical behaviour including phase transformation, amorphization and dislocations generation, particularly in the context of molecular dynamics and materials research. So far, comprehensive modellin… ▽ More

    Submitted 8 August, 2022; v1 submitted 17 July, 2022; originally announced July 2022.

    Comments: 35 pages, 10 figures

  30. arXiv:2207.03609  [pdf, other

    stat.ML cs.AI cs.LG

    One for All: Simultaneous Metric and Preference Learning over Multiple Users

    Authors: Gregory Canal, Blake Mason, Ramya Korlakai Vinayak, Robert Nowak

    Abstract: This paper investigates simultaneous preference and metric learning from a crowd of respondents. A set of items represented by $d$-dimensional feature vectors and paired comparisons of the form ``item $i$ is preferable to item $j$'' made by each user is given. Our model jointly learns a distance metric that characterizes the crowd's general measure of item similarities along with a latent ideal po… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  31. arXiv:2206.09464  [pdf

    cond-mat.mtrl-sci

    Mo-Si-B alloys for ultra-high temperature space and ground applications: liquid assisted fabrication under various temperature and time conditions

    Authors: G. Bruzda, W. Polkowski, R. Nowak, A. Polkowska, S. Lech, K. Karczewski, M. Książek, D Giuranno

    Abstract: Boron-doped molybdenum silicides have been already recognized as attractive candidates for space and ground ultra-high temperature applications far beyond limits of state-of-the-art nickel based superalloys. In this work, we are exploring a new method for fabricating Mo-Si-B alloys (as coatings or small bulk components) by utilizing a pressure-less reactive melt infiltration approach. The basic as… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

    Journal ref: Journal of Materials Science (2022)

  32. arXiv:2204.00043  [pdf, other

    stat.ML cs.LG

    Efficient Active Learning with Abstention

    Authors: Yinglun Zhu, Robert Nowak

    Abstract: The goal of active learning is to achieve the same accuracy achievable by passive learning, while using much fewer labels. Exponential savings in terms of label complexity have been proved in very special cases, but fundamental lower bounds show that such improvements are impossible in general. This suggests a need to explore alternative goals for active learning. Learning with abstention is one s… ▽ More

    Submitted 15 October, 2022; v1 submitted 31 March, 2022; originally announced April 2022.

    Comments: To appear at NeurIPS 2022

  33. arXiv:2203.04510  [pdf, other

    cs.LG

    ReVar: Strengthening Policy Evaluation via Reduced Variance Sampling

    Authors: Subhojyoti Mukherjee, Josiah P. Hanna, Robert Nowak

    Abstract: This paper studies the problem of data collection for policy evaluation in Markov decision processes (MDPs). In policy evaluation, we are given a target policy and asked to estimate the expected cumulative reward it will obtain in an environment formalized as an MDP. We develop theory for optimal data collection within the class of tree-structured MDPs by first deriving an oracle data collection s… ▽ More

    Submitted 17 June, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: Accepted for the $38^{\text {th }}$ Conference on Uncertainty in Artificial Intelligence (UAI 2022)

  34. arXiv:2202.03299  [pdf, other

    cs.LG cs.AI

    Training OOD Detectors in their Natural Habitats

    Authors: Julian Katz-Samuels, Julia Nakhleh, Robert Nowak, Yixuan Li

    Abstract: Out-of-distribution (OOD) detection is important for machine learning models deployed in the wild. Recent methods use auxiliary outlier data to regularize the model for improved OOD detection. However, these approaches make a strong distributional assumption that the auxiliary outlier data is completely separable from the in-distribution (ID) data. In this paper, we propose a novel framework that… ▽ More

    Submitted 28 June, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

    Comments: Accepted to International Conference on Machine Learning (ICML) 2022

  35. arXiv:2202.01402  [pdf, other

    cs.LG cs.AI cs.CV

    GALAXY: Graph-based Active Learning at the Extreme

    Authors: Jifan Zhang, Julian Katz-Samuels, Robert Nowak

    Abstract: Active learning is a label-efficient approach to train highly effective models while interactively selecting only small subsets of unlabelled data for labelling and training. In "open world" settings, the classes of interest can make up a small fraction of the overall dataset -- most of the data may be viewed as an out-of-distribution or irrelevant class. This leads to extreme class-imbalance, and… ▽ More

    Submitted 26 May, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

  36. arXiv:2111.04915  [pdf, other

    cs.LG stat.ML

    Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers

    Authors: Julian Katz-Samuels, Blake Mason, Kevin Jamieson, Rob Nowak

    Abstract: We consider interactive learning in the realizable setting and develop a general framework to handle problems ranging from best arm identification to active classification. We begin our investigation with the observation that agnostic algorithms \emph{cannot} be minimax-optimal in the realizable setting. Hence, we design novel computationally efficient algorithms for the realizable setting that ma… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

  37. arXiv:2111.01768  [pdf, other

    stat.ML cs.LG

    Nearly Optimal Algorithms for Level Set Estimation

    Authors: Blake Mason, Romain Camilleri, Subhojyoti Mukherjee, Kevin Jamieson, Robert Nowak, Lalit Jain

    Abstract: The level set estimation problem seeks to find all points in a domain ${\cal X}$ where the value of an unknown function $f:{\cal X}\rightarrow \mathbb{R}$ exceeds a threshold $α$. The estimation is based on noisy function evaluations that may be acquired at sequentially and adaptively chosen locations in ${\cal X}$. The threshold value $α$ can either be \emph{explicit} and provided a priori, or \e… ▽ More

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: 9 pages + appendices. 6 Figures

  38. arXiv:2109.08844  [pdf, other

    stat.ML cs.LG math.ST

    Near-Minimax Optimal Estimation With Shallow ReLU Neural Networks

    Authors: Rahul Parhi, Robert D. Nowak

    Abstract: We study the problem of estimating an unknown function from noisy data using shallow ReLU neural networks. The estimators we study minimize the sum of squared data-fitting errors plus a regularization term proportional to the squared Euclidean norm of the network weights. This minimization corresponds to the common approach of training a neural network with weight decay. We quantify the performanc… ▽ More

    Submitted 12 October, 2022; v1 submitted 18 September, 2021; originally announced September 2021.

    Comments: IEEE Transactions on Information Theory (in press)

    Journal ref: IEEE Transactions on Information Theory, vol. 69, no. 2, pp. 1125-1140, Feb. 2023

  39. arXiv:2109.05131  [pdf, other

    stat.ML cs.LG

    Near Instance Optimal Model Selection for Pure Exploration Linear Bandits

    Authors: Yinglun Zhu, Julian Katz-Samuels, Robert Nowak

    Abstract: We introduce the model selection problem in pure exploration linear bandits, where the learner needs to adapt to the instance-dependent complexity measure of the smallest hypothesis class containing the true model. We design algorithms in both fixed confidence and fixed budget settings with near instance optimal guarantees. The core of our algorithms is a new optimization problem based on experime… ▽ More

    Submitted 17 March, 2022; v1 submitted 10 September, 2021; originally announced September 2021.

  40. arXiv:2106.12034  [pdf, other

    stat.ML cs.LG

    Pure Exploration in Kernel and Neural Bandits

    Authors: Yinglun Zhu, Dongruo Zhou, Ruoxi Jiang, Quanquan Gu, Rebecca Willett, Robert Nowak

    Abstract: We study pure exploration in bandits, where the dimension of the feature representation can be much larger than the number of arms. To overcome the curse of dimensionality, we propose to adaptively embed the feature representation of each arm into a lower-dimensional space and carefully deal with the induced model misspecification. Our approach is conceptually very different from existing works th… ▽ More

    Submitted 17 March, 2022; v1 submitted 22 June, 2021; originally announced June 2021.

  41. arXiv:2106.07335  [pdf, other

    quant-ph cond-mat.quant-gas cond-mat.stat-mech

    Quantum Kibble-Zurek mechanism: Kink correlations after a quench in the quantum Ising chain

    Authors: Radosław J. Nowak, Jacek Dziarmaga

    Abstract: The transverse field in the quantum Ising chain is linearly ramped from the para- to the ferromagnetic phase across the quantum critical point at a rate characterized by a quench time $τ_Q$. We calculate a connected kink-kink correlator in the final state at zero transverse field. The correlator is a sum of two terms: a negative (anti-bunching) Gaussian that depends on the Kibble-Zurek (KZ) correl… ▽ More

    Submitted 27 August, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: 10 pages; last two paragraphs added in Sec. IX; version to appear in PRB

    Journal ref: Phys. Rev. B 104, 075448 (2021)

  42. arXiv:2105.03361  [pdf, other

    stat.ML cs.LG

    What Kinds of Functions do Deep Neural Networks Learn? Insights from Variational Spline Theory

    Authors: Rahul Parhi, Robert D. Nowak

    Abstract: We develop a variational framework to understand the properties of functions learned by fitting deep neural networks with rectified linear unit activations to data. We propose a new function space, which is reminiscent of classical bounded variation-type spaces, that captures the compositional structure associated with deep neural networks. We derive a representer theorem showing that deep ReLU ne… ▽ More

    Submitted 26 September, 2021; v1 submitted 7 May, 2021; originally announced May 2021.

    Journal ref: SIAM Journal on Mathematics of Data Science, vol. 4, no. 2, pp. 464-489, 2022

  43. arXiv:2103.05057  [pdf, ps, other

    stat.ML cs.LG

    Nearest Neighbor Search Under Uncertainty

    Authors: Blake Mason, Ardhendu Tripathy, Robert Nowak

    Abstract: Nearest Neighbor Search (NNS) is a central task in knowledge representation, learning, and reasoning. There is vast literature on efficient algorithms for constructing data structures and performing exact and approximate NNS. This paper studies NNS under Uncertainty (NNSU). Specifically, consider the setting in which an NNS algorithm has access only to a stochastic distance oracle that provides a… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: 22 pages

  44. arXiv:2102.06593  [pdf, other

    stat.ML cs.LG

    Pareto Optimal Model Selection in Linear Bandits

    Authors: Yinglun Zhu, Robert Nowak

    Abstract: We study model selection in linear bandits, where the learner must adapt to the dimension (denoted by $d_\star$) of the smallest hypothesis class containing the true linear model while balancing exploration and exploitation. Previous papers provide various guarantees for this model selection problem, but have limitations; i.e., the analysis requires favorable conditions that allow for inexpensive… ▽ More

    Submitted 16 March, 2022; v1 submitted 12 February, 2021; originally announced February 2021.

  45. arXiv:2012.08073  [pdf, other

    stat.ML cs.LG

    Chernoff Sampling for Active Testing and Extension to Active Regression

    Authors: Subhojyoti Mukherjee, Ardhendu Tripathy, Robert Nowak

    Abstract: Active learning can reduce the number of samples needed to perform a hypothesis test and to estimate the parameters of a model. In this paper, we revisit the work of Chernoff that described an asymptotically optimal algorithm for performing a hypothesis test. We obtain a novel sample complexity bound for Chernoff's algorithm, with a non-asymptotic term that characterizes its performance at a fixed… ▽ More

    Submitted 10 March, 2022; v1 submitted 14 December, 2020; originally announced December 2020.

    Comments: 47 pages, 9 figures

  46. arXiv:2009.09988  [pdf, other

    stat.ML cs.LG

    Robust Outlier Arm Identification

    Authors: Yinglun Zhu, Sumeet Katariya, Robert Nowak

    Abstract: We study the problem of Robust Outlier Arm Identification (ROAI), where the goal is to identify arms whose expected rewards deviate substantially from the majority, by adaptively sampling from their reward distributions. We compute the outlier threshold using the median and median absolute deviation of the expected rewards. This is a robust choice for the threshold compared to using the mean and s… ▽ More

    Submitted 21 September, 2020; originally announced September 2020.

    Comments: Full version of our ICML 2020 paper

  47. arXiv:2007.11512  [pdf, other

    cs.GR cs.HC

    InCorr: Interactive Data-Driven Correlation Panels for Digital Outcrop Analysis

    Authors: Thomas Ortner, Andreas Walch, Rebecca Nowak, Robert Barnes, Thomas Höllt, Eduard Gröller

    Abstract: Geological analysis of 3D Digital Outcrop Models (DOMs) for reconstruction of ancient habitable environments is a key aspect of the upcoming ESA ExoMars 2022 Rosalind Franklin Rover and the NASA 2020 Rover Perseverance missions in seeking signs of past life on Mars. Geologists measure and interpret 3D DOMs, create sedimentary logs and combine them in `correlation panels' to map the extents of key… ▽ More

    Submitted 8 November, 2020; v1 submitted 22 July, 2020; originally announced July 2020.

  48. arXiv:2007.00077  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Similarity Search for Efficient Active Learning and Search of Rare Concepts

    Authors: Cody Coleman, Edward Chou, Julian Katz-Samuels, Sean Culatana, Peter Bailis, Alexander C. Berg, Robert Nowak, Roshan Sumbaly, Matei Zaharia, I. Zeki Yalniz

    Abstract: Many active learning and search approaches are intractable for large-scale industrial settings with billions of unlabeled examples. Existing approaches search globally for the optimal examples to label, scaling linearly or even quadratically with the unlabeled data. In this paper, we improve the computational efficiency of active learning and search methods by restricting the candidate pool for la… ▽ More

    Submitted 22 July, 2021; v1 submitted 30 June, 2020; originally announced July 2020.

  49. arXiv:2006.14785  [pdf, other

    stat.ML cs.LG

    On Regret with Multiple Best Arms

    Authors: Yinglun Zhu, Robert Nowak

    Abstract: We study a regret minimization problem with the existence of multiple best/near-optimal arms in the multi-armed bandit setting. We consider the case when the number of arms/actions is comparable or much larger than the time horizon, and make no assumptions about the structure of the bandit instance. Our goal is to design algorithms that can automatically adapt to the unknown hardness of the proble… ▽ More

    Submitted 22 October, 2020; v1 submitted 26 June, 2020; originally announced June 2020.

  50. arXiv:2006.08850  [pdf, other

    stat.ML cs.LG

    Finding All ε-Good Arms in Stochastic Bandits

    Authors: Blake Mason, Lalit Jain, Ardhendu Tripathy, Robert Nowak

    Abstract: The pure-exploration problem in stochastic multi-armed bandits aims to find one or more arms with the largest (or near largest) means. Examples include finding an ε-good arm, best-arm identification, top-k arm identification, and finding all arms with means above a specified threshold. However, the problem of finding all ε-good arms has been overlooked in past work, although arguably this may be t… ▽ More

    Submitted 11 September, 2020; v1 submitted 15 June, 2020; originally announced June 2020.

    Comments: 93 total pages (8 main pages + appendices), 12 figures, submitted to NeurIPS 2020