
Showing 1–50 of 97 results for author: Jamieson, K

Searching in archive cs.
  1. arXiv:2406.17339  [pdf, other]

    cs.IT eess.SP

    Optimizing Configuration Selection in Reconfigurable-Antenna MIMO Systems: Physics-Inspired Heuristic Solvers

    Authors: I. Krikidis, C. Psomas, A. K. Singh, K. Jamieson

    Abstract: Reconfigurable antenna multiple-input multiple-output (MIMO) is a foundational technology for the continuing evolution of cellular systems, including upcoming 6G communication systems. In this paper, we address the problem of flexible/reconfigurable antenna configuration selection for point-to-point MIMO antenna systems by using physics-inspired heuristics. Firstly, we optimize the antenna configu…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2403.12571

    Journal ref: IEEE Transactions on Communications, 2024

  2. arXiv:2406.10522  [pdf, other]

    cs.LG cs.AI cs.CL

    Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning

    Authors: Jifan Zhang, Lalit Jain, Yang Guo, Jiayi Chen, Kuan Lok Zhou, Siddharth Suresh, Andrew Wagenmaker, Scott Sievert, Timothy Rogers, Kevin Jamieson, Robert Mankoff, Robert Nowak

    Abstract: We present a novel multimodal preference dataset for creative tasks, consisting of over 250 million human ratings on more than 2.2 million captions, collected through crowdsourcing rating data for The New Yorker's weekly cartoon caption contest over the past eight years. This unique dataset supports the development and evaluation of multimodal large language models and preference-based fine-tuning…

    Submitted 15 June, 2024; originally announced June 2024.

  3. arXiv:2406.06856  [pdf, ps, other]

    cs.LG cs.AI

    Sample Complexity Reduction via Policy Difference Estimation in Tabular Reinforcement Learning

    Authors: Adhyyan Narang, Andrew Wagenmaker, Lillian Ratliff, Kevin Jamieson

    Abstract: In this paper, we study the non-asymptotic sample complexity for the pure exploration problem in contextual bandits and tabular reinforcement learning (RL): identifying an epsilon-optimal policy from a set of policies with high probability. Existing work in bandits has shown that it is possible to identify the best policy by estimating only the difference between the behaviors of individual polici…

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 59 pages, 2 Figures

  4. arXiv:2405.19547  [pdf, other]

    cs.LG cs.CV

    CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning

    Authors: Yiping Wang, Yifang Chen, Wendan Yan, Alex Fang, Wenjing Zhou, Kevin Jamieson, Simon Shaolei Du

    Abstract: Data selection has emerged as a core issue for large-scale visual-language model pretraining (e.g., CLIP), particularly with noisy web-curated datasets. Three main data selection approaches are: (1) leveraging external non-CLIP models to aid data selection, (2) training new CLIP-style embedding models that are more effective at selecting high-quality data than the original OpenAI CLIP model, and (3…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: This paper supersedes our previous VAS paper (arXiv:2402.02055)

  5. arXiv:2405.06754  [pdf, other]

    cs.NI eess.SP

    Wall-Street: Smart Surface-Enabled 5G mmWave for Roadside Networking

    Authors: Kun Woo Cho, Prasanthi Maddala, Ivan Seskar, Kyle Jamieson

    Abstract: 5G mmWave roadside networks promise high-speed wireless connectivity, but face significant challenges in maintaining reliable connections for users moving at high speed. Frequent handovers, complex beam alignment, and signal attenuation due to obstacles like car bodies lead to service interruptions and degraded performance. We present Wall-Street, a smart surface installed on vehicles to enhance 5…

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 15 pages, 22 figures, under submission

  6. arXiv:2404.05631  [pdf, other]

    cs.ET

    Multi Digit Ising Mapping for Low Precision Ising Solvers

    Authors: Abhishek Kumar Singh, Kyle Jamieson

    Abstract: The last couple of years have seen an ever-increasing interest in using different Ising solvers, like Quantum annealers, Coherent Ising machines, and Oscillator-based Ising machines, for solving tough computational problems in various domains. Although the simulations predict massive performance improvements for several tough computational problems, the real implementations of the Ising solvers te…

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: version 1.0

  7. arXiv:2403.12571  [pdf, other]

    cs.IT eess.SP

    Optimizing Reconfigurable Antenna MIMO Systems with Coherent Ising Machines

    Authors: Ioannis Krikidis, Abhishek Kumar Singh, Kyle Jamieson

    Abstract: Reconfigurable antenna multiple-input multiple-output (MIMO) is a promising technology for upcoming 6G communication systems. In this paper, we deal with the problem of configuration selection for reconfigurable antenna MIMO by leveraging Coherent Ising Machines (CIMs). By adopting the CIM as a heuristic solver for the Ising problem, the optimal antenna configuration that maximizes the received si…

    Submitted 19 March, 2024; originally announced March 2024.

    Journal ref: IEEE International Conference on Communications (ICC), June 2024

  8. arXiv:2402.18778  [pdf, other]

    cs.NI quant-ph

    X-ResQ: Reverse Annealing for Quantum MIMO Detection with Flexible Parallelism

    Authors: Minsung Kim, Abhishek Kumar Singh, Davide Venturelli, John Kaewell, Kyle Jamieson

    Abstract: Quantum Annealing (QA)-accelerated MIMO detection is an emerging research approach in the context of NextG wireless networks. The opportunity is to enable large MIMO systems and thus improve wireless performance. The approach aims to leverage QA to expedite the computation required for theoretically optimal but computationally-demanding Maximum Likelihood detection to overcome the limitations of t…

    Submitted 9 March, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 22 pages

  9. arXiv:2402.04454  [pdf, other]

    cs.NI

    Evolving Mobile Cloud Gaming with 5G Standalone Network Telemetry

    Authors: Haoran Wan, Kyle Jamieson

    Abstract: Mobile cloud gaming places the simultaneous demands of high capacity and low latency on the wireless network, demands that Private and Metropolitan-Area Standalone 5G networks are poised to meet. However, lacking introspection into the 5G Radio Access Network (RAN), cloud gaming servers are ill-poised to cope with the vagaries of the wireless last hop to a mobile client, while 5G network operators…

    Submitted 6 February, 2024; originally announced February 2024.

  10. arXiv:2402.02055  [pdf, other]

    cs.LG cs.AI

    Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning

    Authors: Yiping Wang, Yifang Chen, Wendan Yan, Kevin Jamieson, Simon Shaolei Du

    Abstract: In recent years, data selection has emerged as a core issue for large-scale visual-language model pretraining, especially on noisy web-curated datasets. One widely adopted strategy assigns quality scores such as CLIP similarity for each sample and retains the data pairs with the highest scores. However, these approaches are agnostic of data distribution and always fail to select the most informati…

    Submitted 3 February, 2024; originally announced February 2024.

    Comments: 17 pages, 4 figures

  11. arXiv:2401.06692  [pdf, other]

    cs.CL cs.AI cs.LG

    An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models

    Authors: Gantavya Bhatt, Yifang Chen, Arnav M. Das, Jifan Zhang, Sang T. Truong, Stephen Mussmann, Yinglun Zhu, Jeffrey Bilmes, Simon S. Du, Kevin Jamieson, Jordan T. Ash, Robert D. Nowak

    Abstract: Supervised finetuning (SFT) on instruction datasets has played a crucial role in achieving the remarkable zero-shot generalization capabilities observed in modern large language models (LLMs). However, the annotation efforts required to produce high quality responses for instructions are becoming prohibitively expensive, especially as the number of tasks spanned by instruction datasets continues t…

    Submitted 6 May, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

  12. arXiv:2312.12642  [pdf, other]

    cs.NI

    LoLa: Low-Latency Realtime Video Conferencing over Multiple Cellular Carriers

    Authors: Sara Ayoubi, Giulio Grassi, Giovanni Pau, Kyle Jamieson, Renata Teixeira

    Abstract: LoLa is a novel multi-path system for video conferencing applications over cellular networks. It provides significant gains over single link solutions when the link quality over different cellular networks fluctuates dramatically and independently over time, or when aggregating the throughput across different cellular links improves the perceived video quality. LoLa achieves this by continuously es…

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: 9 pages, 9 figures

  13. arXiv:2312.08559  [pdf, other]

    cs.LG cs.CY stat.ML

    Fair Active Learning in Low-Data Regimes

    Authors: Romain Camilleri, Andrew Wagenmaker, Jamie Morgenstern, Lalit Jain, Kevin Jamieson

    Abstract: In critical machine learning applications, ensuring fairness is essential to avoid perpetuating social inequities. In this work, we address the challenges of reducing bias and improving accuracy in data-scarce environments, where the cost of collecting labeled data prohibits the use of large, labeled datasets. In such settings, active learning promises to maximize marginal accuracy gains of small…

    Submitted 13 December, 2023; originally announced December 2023.

  14. arXiv:2311.16128  [pdf, other]

    cs.IT cs.ET

    Physics-Inspired Discrete-Phase Optimization for 3D Beamforming with PIN-Diode Extra-Large Antenna Arrays

    Authors: Minsung Kim, Annalise Stockley, Keith Briggs, Kyle Jamieson

    Abstract: Large antenna arrays can steer narrow beams towards a target area, and thus improve the communications capacity of wireless channels and the fidelity of radio sensing. Hardware that is capable of continuously-variable phase shifts is expensive, presenting scaling challenges. PIN diodes that apply only discrete phase shifts are promising and cost-effective; however, unlike continuous phase shifters…

    Submitted 3 January, 2024; v1 submitted 30 October, 2023; originally announced November 2023.

  15. arXiv:2310.18465  [pdf, other]

    cs.LG stat.ML

    Minimax Optimal Submodular Optimization with Bandit Feedback

    Authors: Artin Tajdini, Lalit Jain, Kevin Jamieson

    Abstract: We consider maximizing a monotonic, submodular set function $f: 2^{[n]} \rightarrow [0,1]$ under stochastic bandit feedback. Specifically, $f$ is unknown to the learner but at each time $t=1,\dots,T$ the learner chooses a set $S_t \subset [n]$ with $|S_t| \leq k$ and receives reward $f(S_t) + η_t$ where $η_t$ is mean-zero sub-Gaussian noise. The objective is to minimize the learner's regret over…

    Submitted 27 October, 2023; originally announced October 2023.

  16. arXiv:2310.16252  [pdf, other]

    cs.LG cs.GT

    Near-Optimal Pure Exploration in Matrix Games: A Generalization of Stochastic Bandits & Dueling Bandits

    Authors: Arnab Maiti, Ross Boczar, Kevin Jamieson, Lillian J. Ratliff

    Abstract: We study the sample complexity of identifying the pure strategy Nash equilibrium (PSNE) in a two-player zero-sum matrix game with noise. Formally, we are given a stochastic model where any learner can sample an entry $(i,j)$ of the input matrix $A\in[-1,1]^{n\times m}$ and observe $A_{i,j}+η$ where $η$ is a zero-mean 1-sub-Gaussian noise. The aim of the learner is to identify the PSNE of $A$, when…

    Submitted 27 November, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 22 pages, 5 figures

  17. arXiv:2310.16236  [pdf, other]

    cs.GT

    Query-Efficient Algorithms to Find the Unique Nash Equilibrium in a Two-Player Zero-Sum Matrix Game

    Authors: Arnab Maiti, Ross Boczar, Kevin Jamieson, Lillian J. Ratliff

    Abstract: We study the query complexity of identifying Nash equilibria in two-player zero-sum matrix games. Grigoriadis and Khachiyan (1995) showed that any deterministic algorithm needs to query $Ω(n^2)$ entries in worst case from an $n\times n$ input matrix in order to compute an $\varepsilon$-approximate Nash equilibrium, where $\varepsilon<\frac{1}{2}$. Moreover, they designed a randomized algorithm tha…

    Submitted 27 November, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 17 pages

  18. arXiv:2310.11551  [pdf, other]

    cs.NI eess.SP

    WaveFlex: A Smart Surface for Private CBRS Wireless Cellular Networks

    Authors: Fan Yi, Kun Woo Cho, Yaxiong Xie, Kyle Jamieson

    Abstract: We present the design and implementation of WaveFlex, the first smart surface that enhances Private LTE/5G networks operating under the shared-license framework in the Citizens Broadband Radio Service frequency band. WaveFlex works in the presence of frequency diversity: multiple nearby base stations operating on different frequencies, as dictated by a Spectrum Access System coordinator. It also h…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: 15 pages

  19. arXiv:2310.06069  [pdf, other]

    stat.ML cs.LG

    Optimal Exploration is no harder than Thompson Sampling

    Authors: Zhaoqi Li, Kevin Jamieson, Lalit Jain

    Abstract: Given a set of arms $\mathcal{Z}\subset \mathbb{R}^d$ and an unknown parameter vector $θ_\ast\in\mathbb{R}^d$, the pure exploration linear bandit problem aims to return $\arg\max_{z\in \mathcal{Z}} z^{\top}θ_{\ast}$, with high probability through noisy measurements of $x^{\top}θ_{\ast}$ with $x\in \mathcal{X}\subset \mathbb{R}^d$. Existing (asymptotically) optimal methods require either a) potenti…

    Submitted 24 October, 2023; v1 submitted 9 October, 2023; originally announced October 2023.

  20. arXiv:2309.13224  [pdf, other]

    cs.RO cs.AI cs.LG

    Pick Planning Strategies for Large-Scale Package Manipulation

    Authors: Shuai Li, Azarakhsh Keipour, Kevin Jamieson, Nicolas Hudson, Sicong Zhao, Charles Swan, Kostas Bekris

    Abstract: Automating warehouse operations can reduce logistics overhead costs, ultimately driving down the final price for consumers, increasing the speed of delivery, and enhancing the resiliency to market fluctuations. This extended abstract showcases a large-scale package manipulation from unstructured piles in Amazon Robotics' Robot Induction (Robin) fleet, which is used for picking and singulating up…

    Submitted 8 October, 2023; v1 submitted 22 September, 2023; originally announced September 2023.

    Comments: 2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Learning Meets Model-based Methods for Manipulation and Grasping Workshop

  21. arXiv:2307.15154  [pdf, other]

    cs.LG stat.ML

    A/B Testing and Best-arm Identification for Linear Bandits with Robustness to Non-stationarity

    Authors: Zhihan Xiong, Romain Camilleri, Maryam Fazel, Lalit Jain, Kevin Jamieson

    Abstract: We investigate the fixed-budget best-arm identification (BAI) problem for linear bandits in a potentially non-stationary environment. Given a finite arm set $\mathcal{X}\subset\mathbb{R}^d$, a fixed budget $T$, and an unpredictable sequence of parameters $\left\lbraceθ_t\right\rbrace_{t=1}^{T}$, an algorithm will aim to correctly identify the best arm…

    Submitted 15 February, 2024; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: 25 pages, 6 figures

  22. arXiv:2306.13233  [pdf, other]

    cs.LG cs.GT

    Logarithmic Regret for Matrix Games against an Adversary with Noisy Bandit Feedback

    Authors: Arnab Maiti, Kevin Jamieson, Lillian J. Ratliff

    Abstract: This paper considers a variant of zero-sum matrix games where at each timestep the row player chooses row $i$, the column player chooses column $j$, and the row player receives a noisy reward with mean $A_{i,j}$. The objective of the row player is to accumulate as much reward as possible, even against an adversarial column player. If the row player uses the EXP3 strategy, an algorithm known for ob…

    Submitted 24 October, 2023; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: 68 pages, 2 figures

  23. arXiv:2306.09910  [pdf, other]

    cs.LG cs.AI cs.CV

    LabelBench: A Comprehensive Framework for Benchmarking Adaptive Label-Efficient Learning

    Authors: Jifan Zhang, Yifang Chen, Gregory Canal, Stephen Mussmann, Arnav M. Das, Gantavya Bhatt, Yinglun Zhu, Jeffrey Bilmes, Simon Shaolei Du, Kevin Jamieson, Robert D Nowak

    Abstract: Labeled data are critical to modern machine learning applications, but obtaining labels can be expensive. To mitigate this cost, machine learning methods, such as transfer learning, semi-supervised learning and active learning, aim to be label-efficient: achieving high predictive performance from relatively few labeled examples. While obtaining the best label-efficiency in practice often requires…

    Submitted 1 March, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

  24. arXiv:2306.09210  [pdf, other]

    cs.LG cs.RO eess.SY math.OC stat.ML

    Optimal Exploration for Model-Based RL in Nonlinear Systems

    Authors: Andrew Wagenmaker, Guanya Shi, Kevin Jamieson

    Abstract: Learning to control unknown nonlinear dynamical systems is a fundamental problem in reinforcement learning and control theory. A commonly applied approach is to first explore the environment (exploration), learn an accurate model of it (system identification), and then compute an optimal controller with the minimum cost on this estimated system (policy optimization). While existing work has shown…

    Submitted 15 June, 2023; originally announced June 2023.

  25. arXiv:2306.08942  [pdf, other]

    cs.LG cs.RO

    Active Representation Learning for General Task Space with Applications in Robotics

    Authors: Yifang Chen, Yingbing Huang, Simon S. Du, Kevin Jamieson, Guanya Shi

    Abstract: Representation learning based on multi-task pretraining has become a powerful approach in many domains. In particular, task-aware representation learning aims to learn an optimal representation for a specific target task by sampling data from a set of source tasks, while task-agnostic representation learning seeks to learn a universal representation for a class of tasks. In this paper, we propose…

    Submitted 15 June, 2023; originally announced June 2023.

  26. arXiv:2306.02556  [pdf, other]

    cs.LG cs.AI

    Improved Active Multi-Task Representation Learning via Lasso

    Authors: Yiping Wang, Yifang Chen, Kevin Jamieson, Simon S. Du

    Abstract: To leverage the copious amount of data from source tasks and overcome the scarcity of the target task samples, representation learning based on multi-task pretraining has become a standard approach in many applications. However, up until now, most existing works design a source task selection strategy from a purely empirical perspective. Recently, Chen et al. (2022) gave the first active multi…

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted by ICML 2023

  27. arXiv:2305.10272  [pdf, other]

    cs.RO cs.AI cs.LG

    Demonstrating Large-Scale Package Manipulation via Learned Metrics of Pick Success

    Authors: Shuai Li, Azarakhsh Keipour, Kevin Jamieson, Nicolas Hudson, Charles Swan, Kostas Bekris

    Abstract: Automating warehouse operations can reduce logistics overhead costs, ultimately driving down the final price for consumers, increasing the speed of delivery, and enhancing the resiliency to workforce fluctuations. The past few years have seen increased interest in automating such repeated tasks but mostly in controlled settings. Tasks such as picking objects from unstructured, cluttered piles have…

    Submitted 27 June, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Robotics: Science and Systems (RSS 2023) conference, July 10 - 14, 2023, Daegu, Republic of Korea

  28. arXiv:2304.12830  [pdf, other]

    cs.NI cs.IT eess.SP

    Uplink MIMO Detection using Ising Machines: A Multi-Stage Ising Approach

    Authors: Abhishek Kumar Singh, Ari Kapelyan, Davide Venturelli, Kyle Jamieson

    Abstract: Multiple-Input-Multiple-Output (MIMO) signal detection is central to every state-of-the-art communication system, and enhancements in error performance and computation complexity of MIMO detection would significantly enhance data rate and latency experienced by the users. Theoretically, the optimal MIMO detector is the maximum-likelihood (ML) MIMO detector; however, due to its extremely high compl…

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Initial draft. arXiv admin note: text overlap with arXiv:2205.05020

  29. arXiv:2303.10565  [pdf, other]

    cs.GT cs.LG

    Instance-dependent Sample Complexity Bounds for Zero-sum Matrix Games

    Authors: Arnab Maiti, Kevin Jamieson, Lillian J. Ratliff

    Abstract: We study the sample complexity of identifying an approximate equilibrium for two-player zero-sum $n\times 2$ matrix games. That is, in a sequence of repeated game plays, how many rounds must the two players play before reaching an approximate equilibrium (e.g., Nash)? We derive instance-dependent bounds that define an ordering over game matrices that captures the intuition that the dynamics of som…

    Submitted 19 March, 2023; originally announced March 2023.

    Comments: 53 pages, Accepted at AISTATS 2023

  30. arXiv:2212.07663  [pdf, other]

    cs.NI eess.SP

    Cross-Link Channel Prediction for Massive IoT Networks

    Authors: Kun Woo Cho, Marco Cominelli, Francesco Gringoli, Joerg Widmer, Kyle Jamieson

    Abstract: Tomorrow's massive-scale IoT sensor networks are poised to drive uplink traffic demand, especially in areas of dense deployment. To meet this demand, however, network designers leverage tools that often require accurate estimates of Channel State Information (CSI), which incurs a high overhead and thus reduces network throughput. Furthermore, the overhead generally scales with the number of client…

    Submitted 15 December, 2022; originally announced December 2022.

    Comments: 13 pages, 12 figures

  31. arXiv:2210.10854  [pdf, other]

    cs.NI quant-ph

    Decoding Polar Codes via Noisy Quantum Gates: Quantum Circuits and Insights

    Authors: Srikar Kasi, John Kaewell, Shahab Hamidi-Rad, Kyle Jamieson

    Abstract: The use of quantum computation for wireless network applications is emerging as a promising paradigm to bridge the performance gap between in-practice and optimal wireless algorithms. While today's quantum technology offers limited number of qubits and low fidelity gates, application-based quantum solutions help us to understand and improve the performance of such technology even further. This pap…

    Submitted 19 October, 2022; originally announced October 2022.

  32. arXiv:2209.11554  [pdf, other]

    cs.NI

    mmWall: A Transflective Metamaterial Surface for mmWave Networks

    Authors: Kun Woo Cho, Mohammad H. Mazaheri, Jeremy Gummeson, Omid Abari, Kyle Jamieson

    Abstract: Mobile operators are poised to leverage millimeter wave technology as 5G evolves, but despite efforts to bolster their reliability indoors and outdoors, mmWave links remain vulnerable to blockage by walls, people, and obstacles. Further, there is significant interest in bringing outdoor mmWave coverage indoors, which for similar reasons remains challenging today. This paper presents the design, ha…

    Submitted 25 September, 2022; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: 18 pages, 18 figures

  33. arXiv:2207.02575  [pdf, other]

    cs.LG stat.ML

    Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design

    Authors: Andrew Wagenmaker, Kevin Jamieson

    Abstract: While much progress has been made in understanding the minimax sample complexity of reinforcement learning (RL) -- the complexity of learning on the "worst-case" instance -- such measures of complexity often do not capture the true difficulty of learning. In practice, on an "easy" instance, we might hope to achieve a complexity far better than that achievable on the worst-case instance. In this wo…

    Submitted 20 July, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

  34. arXiv:2207.02357  [pdf, ps, other]

    stat.ML cs.LG

    Instance-optimal PAC Algorithms for Contextual Bandits

    Authors: Zhaoqi Li, Lillian Ratliff, Houssam Nassif, Kevin Jamieson, Lalit Jain

    Abstract: In the stochastic contextual bandit setting, regret-minimizing algorithms have been extensively researched, but their instance-minimizing best-arm identification counterparts remain seldom studied. In this work, we focus on the stochastic bandit problem in the $(ε,δ)$-PAC setting: given a policy class $Π$ the goal of the learner is to return a policy $π\in Π$ whose expected reward is wi…

    Submitted 5 July, 2022; originally announced July 2022.

    Journal ref: Conference on Neural Information Processing Systems (NeurIPS'22), New Orleans, pp. 37590-37603, 2022

  35. arXiv:2206.14939  [pdf, other]

    cs.NI

    Towards Dual-band Reconfigurable Metamaterial Surfaces for Satellite Networking

    Authors: Kun Woo Cho, Yasaman Ghasempour, Kyle Jamieson

    Abstract: The first low earth orbit satellite networks for internet service have recently been deployed and are growing in size, yet will face deployment challenges in many practical circumstances of interest. This paper explores how a dual-band, electronically tunable smart surface can enable dynamic beam alignment between the satellite and mobile users, make service possible in urban canyons, and improve…

    Submitted 29 June, 2022; originally announced June 2022.

    Comments: 9 pages including references, 9 figures

    ACM Class: C.2.5; C.3

  36. arXiv:2206.11183  [pdf, other]

    cs.LG stat.ML

    Active Learning with Safety Constraints

    Authors: Romain Camilleri, Andrew Wagenmaker, Jamie Morgenstern, Lalit Jain, Kevin Jamieson

    Abstract: Active learning methods have shown great promise in reducing the number of samples necessary for learning. As automated learning systems are adopted into real-time, real-world decision-making pipelines, it is increasingly important that such algorithms are designed with safety in mind. In this work we investigate the complexity of learning the best safe decision in interactive environments. We red…

    Submitted 22 June, 2022; originally announced June 2022.

  37. A Finite-Range Search Formulation of Maximum Likelihood MIMO Detection for Coherent Ising Machines

    Authors: Abhishek Kumar Singh, Davide Venturelli, Kyle Jamieson

    Abstract: The last couple of years have seen an emergence of physics-inspired computing for maximum likelihood MIMO detection. These methods involve transforming the MIMO detection problem into an Ising minimization problem, which can then be solved on an Ising Machine. Recent works have shown promising projections for MIMO wireless detection using Quantum Annealing optimizers and Coherent Ising Machines. W…

    Submitted 10 May, 2022; originally announced May 2022.

    Comments: Article under review for IEEE Globecom 2022

  38. arXiv:2204.12954  [pdf, other]

    cs.NI

    Dashlet: Taming Swipe Uncertainty for Robust Short Video Streaming

    Authors: Zhuqi Li, Yaxiong Xie, Ravi Netravali, Kyle Jamieson

    Abstract: Short video streaming applications have recently gained substantial traction, but the non-linear video presentation they afford swiping users fundamentally changes the problem of maximizing user quality of experience in the face of the vagaries of network throughput and user swipe timing. This paper describes the design and implementation of Dashlet, a system tailored for high quality of experienc…

    Submitted 27 April, 2022; originally announced April 2022.

  39. arXiv:2204.07570  [pdf, other]

    cs.NI

    TreeStep: Tree Search for Vector Perturbation Precoding under per-Antenna Power Constraint

    Authors: Abhishek Kumar Singh, Kyle Jamieson

    Abstract: Vector Perturbation Precoding (VPP) can speed up downlink data transmissions in Large and Massive Multi-User MIMO systems but is known to be NP-hard. While there are several algorithms in the literature for VPP under total power constraint, they are not applicable for VPP under per-antenna power constraint. This paper proposes a novel, parallel tree search algorithm for VPP under per-antenna power…

    Submitted 26 April, 2022; v1 submitted 15 April, 2022; originally announced April 2022.

    Comments: Article under review for IEEE Globecom 22

  40. arXiv:2202.00911  [pdf, other]

    cs.LG cs.AI

    Active Multi-Task Representation Learning

    Authors: Yifang Chen, Simon S. Du, Kevin Jamieson

    Abstract: To leverage the power of big data from source tasks and overcome the scarcity of the target task samples, representation learning based on multi-task pretraining has become a standard approach in many applications. However, up until now, choosing which source tasks to include in the multi-task learning has been more art than science. In this paper, we give the first formal study on resource task s…

    Submitted 2 February, 2022; originally announced February 2022.

  41. arXiv:2201.11206  [pdf, other]

    cs.LG stat.ML

    Reward-Free RL is No Harder Than Reward-Aware RL in Linear Markov Decision Processes

    Authors: Andrew Wagenmaker, Yifang Chen, Max Simchowitz, Simon S. Du, Kevin Jamieson

    Abstract: Reward-free reinforcement learning (RL) considers the setting where the agent does not have access to a reward function during exploration, but must propose a near-optimal policy for an arbitrary reward function revealed only after exploring. In the tabular setting, it is well known that this is a more difficult problem than reward-aware (PAC) RL -- where the agent has access to the reward fun…

    Submitted 18 June, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

  42. NG-Scope: Fine-Grained Telemetry for NextG Cellular Networks

    Authors: Yaxiong Xie, Kyle Jamieson

    Abstract: Accurate and highly-granular channel capacity telemetry of the cellular last hop is crucial for the effective operation of transport layer protocols and cutting-edge applications, such as video on demand and videotelephony. This paper presents the design, implementation, and experimental performance evaluation of NG-Scope, the first such telemetry tool able to fuse physical-layer channel occupancy…

    Submitted 18 January, 2022; v1 submitted 13 January, 2022; originally announced January 2022.

  43. arXiv:2201.05179  [pdf, other]

    cs.NI

    CurvingLoRa to Boost LoRa Network Capacity via Concurrent Transmission

    Authors: Chenning Li, Xiuzhen Guo, Longfei Shangguan, Zhichao Cao, Kyle Jamieson

    Abstract: LoRaWAN has emerged as an appealing technology to connect IoT devices but it functions without explicit coordination among transmitters, which can lead to many packet collisions as the network scales. State-of-the-art work proposes various approaches to deal with these collisions, but most functions only in high signal-to-interference ratio (SIR) conditions and thus does not scale to many scenario…

    Submitted 13 January, 2022; originally announced January 2022.

  44. arXiv:2112.03432  [pdf, other

    cs.LG stat.ML

    First-Order Regret in Reinforcement Learning with Linear Function Approximation: A Robust Estimation Approach

    Authors: Andrew Wagenmaker, Yifang Chen, Max Simchowitz, Simon S. Du, Kevin Jamieson

    Abstract: Obtaining first-order regret bounds -- regret bounds scaling not as the worst-case but with some measure of the performance of the optimal policy on a given instance -- is a core question in sequential decision-making. While such bounds exist in many settings, they have proven elusive in reinforcement learning with large state spaces. In this work we address this gap, and show that it is possible…

    Submitted 20 October, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

  45. arXiv:2111.12151  [pdf, other

    cs.LG stat.ML

    Best Arm Identification with Safety Constraints

    Authors: Zhenlin Wang, Andrew Wagenmaker, Kevin Jamieson

    Abstract: The best arm identification problem in the multi-armed bandit setting is an excellent model of many real-world decision-making problems, yet it fails to capture the fact that in the real world, safety constraints often must be met while learning. In this work we study the question of best-arm identification in safety-critical settings, where the goal of the agent is to find the best safe option ou…

    Submitted 23 November, 2021; originally announced November 2021.

  46. arXiv:2111.04915  [pdf, other

    cs.LG stat.ML

    Practical, Provably-Correct Interactive Learning in the Realizable Setting: The Power of True Believers

    Authors: Julian Katz-Samuels, Blake Mason, Kevin Jamieson, Rob Nowak

    Abstract: We consider interactive learning in the realizable setting and develop a general framework to handle problems ranging from best arm identification to active classification. We begin our investigation with the observation that agnostic algorithms \emph{cannot} be minimax-optimal in the realizable setting. Hence, we design novel computationally efficient algorithms for the realizable setting that ma…

    Submitted 8 November, 2021; originally announced November 2021.

  47. arXiv:2111.01768  [pdf, other

    stat.ML cs.LG

    Nearly Optimal Algorithms for Level Set Estimation

    Authors: Blake Mason, Romain Camilleri, Subhojyoti Mukherjee, Kevin Jamieson, Robert Nowak, Lalit Jain

    Abstract: The level set estimation problem seeks to find all points in a domain ${\cal X}$ where the value of an unknown function $f:{\cal X}\rightarrow \mathbb{R}$ exceeds a threshold $α$. The estimation is based on noisy function evaluations that may be acquired at sequentially and adaptively chosen locations in ${\cal X}$. The threshold value $α$ can either be \emph{explicit} and provided a priori, or \e…

    Submitted 2 November, 2021; originally announced November 2021.

    Comments: 9 pages + appendices. 6 Figures

  48. arXiv:2110.14864  [pdf, other

    cs.LG

    Selective Sampling for Online Best-arm Identification

    Authors: Romain Camilleri, Zhihan Xiong, Maryam Fazel, Lalit Jain, Kevin Jamieson

    Abstract: This work considers the problem of selective-sampling for best-arm identification. Given a set of potential options $\mathcal{Z}\subset\mathbb{R}^d$, a learner aims to compute with probability greater than $1-δ$, $\arg\max_{z\in \mathcal{Z}} z^{\top}θ_{\ast}$ where $θ_{\ast}$ is unknown. At each time step, a potential measurement $x_t\in \mathcal{X}\subset\mathbb{R}^d$ is drawn IID and the learner…

    Submitted 1 November, 2021; v1 submitted 27 October, 2021; originally announced October 2021.

    Comments: 42 pages, 2 figures, Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021)

  49. arXiv:2109.01465  [pdf, other

    cs.NI quant-ph

    A Cost and Power Feasibility Analysis of Quantum Annealing for NextG Cellular Wireless Networks

    Authors: Srikar Kasi, P. A. Warburton, John Kaewell, Kyle Jamieson

    Abstract: In order to meet mobile cellular users' ever-increasing data demands, today's 4G and 5G networks are designed mainly with the goal of maximizing spectral efficiency. While they have made progress in this regard, controlling the carbon footprint and operational costs of such networks remains a long-standing problem among network designers. This paper takes a long view on this problem, envisioning a…

    Submitted 14 January, 2022; v1 submitted 3 September, 2021; originally announced September 2021.

  50. arXiv:2108.02717  [pdf, other

    cs.LG stat.ML

    Beyond No Regret: Instance-Dependent PAC Reinforcement Learning

    Authors: Andrew Wagenmaker, Max Simchowitz, Kevin Jamieson

    Abstract: The theory of reinforcement learning has focused on two fundamental problems: achieving low regret, and identifying $ε$-optimal policies. While a simple reduction allows one to apply a low-regret algorithm to obtain an $ε$-optimal policy and achieve the worst-case optimal rate, it is unknown whether low-regret algorithms can obtain the instance-optimal rate for policy identification. We show this…

    Submitted 21 June, 2022; v1 submitted 5 August, 2021; originally announced August 2021.