Showing 1–50 of 56 results for author: Sadler, B

Searching in archive cs.
  1. arXiv:2406.10892  [pdf, other]

    cs.LG

    DIPPER: Direct Preference Optimization to Accelerate Primitive-Enabled Hierarchical Reinforcement Learning

    Authors: Utsav Singh, Souradip Chakraborty, Wesley A. Suttle, Brian M. Sadler, Vinay P Namboodiri, Amrit Singh Bedi

    Abstract: Learning control policies to perform complex robotics tasks from human preference data presents significant challenges. On the one hand, the complexity of such tasks typically requires learning policies to perform a variety of subtasks, then combining them to achieve the overall goal. At the same time, comprehensive, well-engineered reward functions are typically unavailable in such problems, whil…

    Submitted 16 June, 2024; originally announced June 2024.

  2. arXiv:2404.13423  [pdf, other]

    cs.LG

    PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling

    Authors: Utsav Singh, Wesley A. Suttle, Brian M. Sadler, Vinay P. Namboodiri, Amrit Singh Bedi

    Abstract: In this work, we introduce PIPER: Primitive-Informed Preference-based Hierarchical reinforcement learning via Hindsight Relabeling, a novel approach that leverages preference-based learning to learn a reward model, and subsequently uses this reward model to relabel higher-level replay buffers. Since this reward is unaffected by lower primitive behavior, our relabeling-based approach is able to mit…

    Submitted 16 June, 2024; v1 submitted 20 April, 2024; originally announced April 2024.

  3. arXiv:2404.00538  [pdf, ps, other]

    cs.CR stat.AP

    Eclipse Attack Detection on a Blockchain Network as a Non-Parametric Change Detection Problem

    Authors: Anurag Gupta, Vikram Krishnamurthy, Brian M. Sadler

    Abstract: This paper introduces a novel non-parametric change detection algorithm to identify eclipse attacks on a blockchain network; the non-parametric algorithm relies only on the empirical mean and variance of the dataset, making it highly adaptable. An eclipse attack occurs when malicious actors isolate blockchain users, disrupting their ability to reach consensus with the broader network, thereby dist…

    Submitted 30 May, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

  4. arXiv:2403.11925  [pdf, other]

    cs.LG

    Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles

    Authors: Bhrij Patel, Wesley A. Suttle, Alec Koppel, Vaneet Aggarwal, Brian M. Sadler, Amrit Singh Bedi, Dinesh Manocha

    Abstract: In the context of average-reward reinforcement learning, the requirement for oracle knowledge of the mixing time, a measure of the duration a Markov chain under a fixed policy needs to achieve its stationary distribution, poses a significant challenge for the global convergence of policy gradient methods. This requirement is particularly problematic due to the difficulty and expense of estimating…

    Submitted 20 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: 26 Pages, 2 Figures

  5. arXiv:2403.04007  [pdf, other]

    cs.LG math.OC

    Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical Systems

    Authors: Wesley A. Suttle, Vipul K. Sharma, Krishna C. Kosaraju, S. Sivaranjani, Ji Liu, Vijay Gupta, Brian M. Sadler

    Abstract: We develop provably safe and convergent reinforcement learning (RL) algorithms for control of nonlinear dynamical systems, bridging the gap between the hard safety guarantees of control theory and the convergence guarantees of RL theory. Recent advances at the intersection of control and RL follow a two-stage, safety filter approach to enforcing hard safety constraints: model-free RL is used to le…

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: 20 pages, 7 figures

  6. arXiv:2402.10340  [pdf, other]

    cs.RO cs.AI

    Highlighting the Safety Concerns of Deploying LLMs/VLMs in Robotics

    Authors: Xiyang Wu, Souradip Chakraborty, Ruiqi Xian, Jing Liang, Tianrui Guan, Fuxiao Liu, Brian M. Sadler, Dinesh Manocha, Amrit Singh Bedi

    Abstract: In this paper, we highlight the critical issues of robustness and safety associated with integrating large language models (LLMs) and vision-language models (VLMs) into robotics applications. Recent works focus on using LLMs and VLMs to improve the performance of robotics tasks, such as manipulation and navigation. Despite these improvements, analyzing the safety of such systems remains underexplo…

    Submitted 16 June, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  7. arXiv:2402.06552  [pdf, other]

    cs.LG

    Deceptive Path Planning via Reinforcement Learning with Graph Neural Networks

    Authors: Michael Y. Fatemi, Wesley A. Suttle, Brian M. Sadler

    Abstract: Deceptive path planning (DPP) is the problem of designing a path that hides its true goal from an outside observer. Existing methods for DPP rely on unrealistic assumptions, such as global state observability and perfect model knowledge, and are typically problem-specific, meaning that even minor changes to a previously solved problem can force expensive computation of an entirely new solution. Gi…

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: 11 pages, 14 figures

    MSC Class: 68T05

  8. arXiv:2310.17660  [pdf, other]

    eess.SP cs.IT eess.IV

    An Invitation to Hypercomplex Phase Retrieval: Theory and Applications

    Authors: Roman Jacome, Kumar Vijay Mishra, Brian M. Sadler, Henry Arguello

    Abstract: Hypercomplex signal processing (HSP) provides state-of-the-art tools to handle multidimensional signals by harnessing intrinsic correlation of the signal dimensions through Clifford algebra. Recently, the hypercomplex representation of the phase retrieval (PR) problem, wherein a complex-valued signal is estimated through its intensity-only projections, has attracted significant interest. The hyper…

    Submitted 22 April, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

    Comments: 10 pages, 4 figures, 2 tables

  9. arXiv:2310.14167  [pdf, other]

    cs.IT eess.SP

    Factor Graph Processing for Dual-Blind Deconvolution at ISAC Receiver

    Authors: Roman Jacome, Edwin Vargas, Kumar Vijay Mishra, Brian M. Sadler, Henry Arguello

    Abstract: Integrated sensing and communications (ISAC) systems have gained significant interest because of their ability to jointly and efficiently access, utilize, and manage the scarce electromagnetic spectrum. The co-existence approach toward ISAC focuses on the receiver processing of overlaid radar and communications signals coming from independent transmitters. A specific ISAC coexistence problem is du…

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: 13 pages, 4 figures

  10. arXiv:2308.15784  [pdf, other]

    cs.IT eess.IV

    Octonion Phase Retrieval

    Authors: Roman Jacome, Kumar Vijay Mishra, Brian M. Sadler, Henry Arguello

    Abstract: Signal processing over hypercomplex numbers arises in many optical imaging applications. In particular, spectral image or color stereo data are often processed using octonion algebra. Recently, the eight-band multispectral image phase recovery has gained salience, wherein it is desired to recover the eight bands from the phaseless measurements. In this paper, we tackle this hitherto unaddressed hy…

    Submitted 1 June, 2024; v1 submitted 30 August, 2023; originally announced August 2023.

    Comments: 5 pages, 3 figures

  11. arXiv:2306.06192  [pdf, other]

    cs.RO cs.AI cs.LG

    Ada-NAV: Adaptive Trajectory Length-Based Sample Efficient Policy Learning for Robotic Navigation

    Authors: Bhrij Patel, Kasun Weerakoon, Wesley A. Suttle, Alec Koppel, Brian M. Sadler, Tianyi Zhou, Amrit Singh Bedi, Dinesh Manocha

    Abstract: Trajectory length stands as a crucial hyperparameter within reinforcement learning (RL) algorithms, significantly contributing to the sample inefficiency in robotics applications. Motivated by the pivotal role trajectory length plays in the training process, we introduce Ada-NAV, a novel adaptive trajectory length scheme designed to enhance the training sample efficiency of RL algorithms in roboti…

    Submitted 20 March, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: 11 pages, 9 figures, 2 tables

  12. arXiv:2303.13609  [pdf, other]

    cs.IT eess.SP math.FA stat.ML

    Multi-Antenna Dual-Blind Deconvolution for Joint Radar-Communications via SoMAN Minimization

    Authors: Roman Jacome, Edwin Vargas, Kumar Vijay Mishra, Brian M. Sadler, Henry Arguello

    Abstract: In joint radar-communications (JRC) applications such as secure military receivers, often the radar and communications signals are overlaid in the received signal. In these passive listening outposts, the signals and channels of both radar and communications are unknown to the receiver. The ill-posed problem of recovering all signal and channel parameters from the overlaid signal is termed as \tex…

    Submitted 28 March, 2024; v1 submitted 23 March, 2023; originally announced March 2023.

    Comments: 30 pages, 7 figures

  13. arXiv:2301.12083  [pdf, other]

    cs.LG math.OC stat.ML

    Beyond Exponentially Fast Mixing in Average-Reward Reinforcement Learning via Multi-Level Monte Carlo Actor-Critic

    Authors: Wesley A. Suttle, Amrit Singh Bedi, Bhrij Patel, Brian M. Sadler, Alec Koppel, Dinesh Manocha

    Abstract: Many existing reinforcement learning (RL) methods employ stochastic gradient iteration on the back end, whose stability hinges upon a hypothesis that the data-generating process mixes exponentially fast with a rate parameter that appears in the step-size selection. Unfortunately, this assumption is violated for large state spaces or settings with sparse rewards, and the mixing time is unknown, mak…

    Submitted 1 February, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  14. arXiv:2212.04088  [pdf, other]

    cs.AI cs.CL cs.CV cs.LG cs.RO

    LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

    Authors: Chan Hee Song, Jiaman Wu, Clayton Washington, Brian M. Sadler, Wei-Lun Chao, Yu Su

    Abstract: This study focuses on using large language models (LLMs) as a planner for embodied agents that can follow natural language instructions to complete complex tasks in a visually-perceived environment. The high data cost and poor sample efficiency of existing methods hinder the development of versatile agents that are capable of many tasks and can learn new tasks quickly. In this work, we propose a…

    Submitted 30 March, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

    Comments: 14 pages, 5 figures

    Report number: ICCV 2023

  15. arXiv:2211.09253  [pdf, other]

    cs.IT eess.SP math.FA stat.ML

    Beurling-Selberg Extremization for Dual-Blind Deconvolution Recovery in Joint Radar-Communications

    Authors: Jonathan Monsalve, Edwin Vargas, Kumar Vijay Mishra, Brian M. Sadler, Henry Arguello

    Abstract: Recent interest in integrated sensing and communications has led to the design of novel signal processing techniques to recover information from an overlaid radar-communications signal. Here, we focus on a spectral coexistence scenario, wherein the channels and transmit signals of both radar and communications systems are unknown to the common receiver. In this dual-blind deconvolution (DBD) probl…

    Submitted 27 October, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Comments: 5 pages, 3 figures

  16. arXiv:2211.06967  [pdf, ps, other]

    eess.SP cs.LG

    Identifying Coordination in a Cognitive Radar Network -- A Multi-Objective Inverse Reinforcement Learning Approach

    Authors: Luke Snow, Vikram Krishnamurthy, Brian M. Sadler

    Abstract: Consider a target being tracked by a cognitive radar network. If the target can intercept some radar network emissions, how can it detect coordination among the radars? By 'coordination' we mean that the radar emissions satisfy Pareto optimality with respect to multi-objective optimization over each radar's utility. This paper provides a novel multi-objective inverse reinforcement learning approac…

    Submitted 13 November, 2022; originally announced November 2022.

  17. arXiv:2209.11944  [pdf, other]

    cs.LG cs.AI

    Communication-Efficient Federated Learning Using Censored Heavy Ball Descent

    Authors: Yicheng Chen, Rick S. Blum, Brian M. Sadler

    Abstract: Distributed machine learning enables scalability and computational offloading, but requires significant levels of communication. Consequently, communication efficiency in distributed learning settings is an important consideration, especially when the communications are wireless and battery-driven devices are employed. In this paper we develop a censoring-based heavy ball (CHB) method for distribu…

    Submitted 24 September, 2022; originally announced September 2022.

  18. arXiv:2208.04381  [pdf, other]

    cs.IT

    Dual-Blind Deconvolution for Overlaid Radar-Communications Systems

    Authors: Edwin Vargas, Kumar Vijay Mishra, Roman Jacome, Brian M. Sadler, Henry Arguello

    Abstract: The increasingly crowded spectrum has spurred the design of joint radar-communications systems that share hardware resources and efficiently use the radio frequency spectrum. We study a general spectral coexistence scenario, wherein the channels and transmit signals of both radar and communications systems are unknown at the receiver. In this dual-blind deconvolution (DBD) problem, a common receiv…

    Submitted 19 June, 2023; v1 submitted 8 August, 2022; originally announced August 2022.

    Comments: 26 pages, 13 figures, 1 table

  19. arXiv:2206.10815  [pdf, other]

    cs.LG cs.DC math.OC

    FedBC: Calibrating Global and Local Models via Federated Learning Beyond Consensus

    Authors: Amrit Singh Bedi, Chen Fan, Alec Koppel, Anit Kumar Sahu, Brian M. Sadler, Furong Huang, Dinesh Manocha

    Abstract: In this work, we quantitatively calibrate the performance of global and local models in federated learning through a multi-criterion optimization-based framework, which we cast as a constrained program. The objective of a device is its local objective, which it seeks to minimize while satisfying nonlinear constraints that quantify the proximity between the local and the global model. By considerin…

    Submitted 1 February, 2023; v1 submitted 21 June, 2022; originally announced June 2022.

  20. arXiv:2206.05166  [pdf, other]

    cs.IT eess.SP

    Multi-dimensional dual-blind deconvolution approach toward joint radar-communications

    Authors: Roman Jacome, Kumar Vijay Mishra, Edwin Vargas, Brian M. Sadler, Henry Arguello

    Abstract: We consider a joint multiple-antenna radar-communications system in a co-existence scenario. Contrary to conventional applications, wherein at least the radar waveform and communications channel are known or estimated a priori, we investigate the case when the channels and transmit signals of both systems are unknown. In radar applications, this problem arises in multistatic or passive sy…

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: 5 pages, 3 figures

  21. arXiv:2206.01306  [pdf, ps, other]

    math.OC cs.AI

    Deceptive Planning for Resource Allocation

    Authors: Shenghui Chen, Yagiz Savas, Mustafa O. Karabag, Brian M. Sadler, Ufuk Topcu

    Abstract: We consider a team of autonomous agents that navigate in an adversarial environment and aim to achieve a task by allocating their resources over a set of target locations. An adversary in the environment observes the autonomous team's behavior to infer their objective and responds against the team. In this setting, we propose strategies for controlling the density of the autonomous team so that th…

    Submitted 5 October, 2023; v1 submitted 2 June, 2022; originally announced June 2022.

  22. arXiv:2206.01162  [pdf, other]

    cs.LG math.OC stat.ML

    Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning

    Authors: Souradip Chakraborty, Amrit Singh Bedi, Alec Koppel, Brian M. Sadler, Furong Huang, Pratap Tokekar, Dinesh Manocha

    Abstract: Model-based approaches to reinforcement learning (MBRL) exhibit favorable performance in practice, but their theoretical guarantees in large spaces are mostly restricted to the setting where the transition model is Gaussian or Lipschitz, and demand a posterior estimate whose representational complexity grows unbounded with time. In this work, we develop a novel MBRL method (i) which relaxes the assump…

    Submitted 4 May, 2023; v1 submitted 2 June, 2022; originally announced June 2022.

  23. arXiv:2202.07028  [pdf, other]

    cs.AI cs.CL cs.CV cs.LG cs.RO

    One Step at a Time: Long-Horizon Vision-and-Language Navigation with Milestones

    Authors: Chan Hee Song, Jihyung Kil, Tai-Yu Pan, Brian M. Sadler, Wei-Lun Chao, Yu Su

    Abstract: We study the problem of developing autonomous agents that can follow human instructions to infer and perform a sequence of actions to complete the underlying task. Significant progress has been made in recent years, especially for tasks with short horizons. However, when it comes to long-horizon tasks with extended sequences of actions, an agent can easily ignore some instructions or get stuck in…

    Submitted 10 June, 2022; v1 submitted 14 February, 2022; originally announced February 2022.

    Comments: 10 pages, 5 figures. Accepted to CVPR 2022

    Journal ref: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 15482-15491

  24. arXiv:2202.02580  [pdf, ps, other]

    cs.LG cs.AI

    Communication Efficient Federated Learning via Ordered ADMM in a Fully Decentralized Setting

    Authors: Yicheng Chen, Rick S. Blum, Brian M. Sadler

    Abstract: The challenge of communication-efficient distributed optimization has attracted attention in recent years. In this paper, a communication-efficient algorithm, called ordering-based alternating direction method of multipliers (OADMM), is devised in a general fully decentralized network setting where a worker can only exchange messages with neighbors. Compared to the classical ADMM, a key feature of…

    Submitted 5 February, 2022; originally announced February 2022.

  25. arXiv:2202.02491  [pdf, ps, other]

    cs.LG cs.AI

    Distributed Learning With Sparsified Gradient Differences

    Authors: Yicheng Chen, Rick S. Blum, Martin Takac, Brian M. Sadler

    Abstract: A very large number of communications are typically required to solve distributed learning tasks, and this critically limits scalability and convergence speed in wireless communications applications. In this paper, we devise a Gradient Descent method with Sparsification and Error Correction (GD-SEC) to improve the communications efficiency in a general worker-server architecture. Motivated by a va…

    Submitted 4 February, 2022; originally announced February 2022.

  26. arXiv:2201.12332  [pdf, other]

    cs.LG cs.AI math.OC

    On the Hidden Biases of Policy Mirror Ascent in Continuous Action Spaces

    Authors: Amrit Singh Bedi, Souradip Chakraborty, Anjaly Parayil, Brian Sadler, Pratap Tokekar, Alec Koppel

    Abstract: We focus on parameterized policy search for reinforcement learning over continuous action spaces. Typically, one assumes the score function associated with a policy is bounded, which fails to hold even for Gaussian policies. To properly address this issue, one must introduce an exploration tolerance parameter to quantify the region in which it is bounded. Doing so incurs a persistent bias that app…

    Submitted 30 January, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

  27. arXiv:2201.11384  [pdf, other]

    eess.SP cs.IR

    Phase Retrieval for Radar Waveform Design

    Authors: Samuel Pinilla, Kumar Vijay Mishra, Brian M. Sadler, Henry Arguello

    Abstract: The ability of a radar to discriminate in both range and Doppler velocity is completely characterized by the ambiguity function (AF) of its transmit waveform. Mathematically, it is obtained by correlating the waveform with its Doppler-shifted and delayed replicas. We consider the inverse problem of designing a radar transmit waveform that satisfies the specified AF magnitude. This process may be v…

    Submitted 9 June, 2024; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: 40 pages, 13 figures, 1 table

  28. arXiv:2111.13670  [pdf, other]

    cs.IT eess.SP

    Non-Convex Recovery from Phaseless Low-Resolution Blind Deconvolution Measurements using Noisy Masked Patterns

    Authors: Samuel Pinilla, Kumar Vijay Mishra, Brian M. Sadler

    Abstract: This paper addresses recovery of a kernel $\boldsymbol{h}\in \mathbb{C}^{n}$ and a signal $\boldsymbol{x}\in \mathbb{C}^{n}$ from the low-resolution phaseless measurements of their noisy circular convolution $\boldsymbol{y} = \left\lvert \boldsymbol{F}_{lo}(\boldsymbol{x}\circledast \boldsymbol{h}) \right\rvert^{2} + \boldsymbol{\eta}$, where $\boldsymbol{F}_{lo}\in \mathbb{C}^{m\times n}$ stands fo…

    Submitted 4 December, 2021; v1 submitted 26 November, 2021; originally announced November 2021.

    Comments: 5 pages, 4 figures

  29. arXiv:2111.06304  [pdf, other]

    eess.SP cs.IT

    Joint Radar-Communications Processing from a Dual-Blind Deconvolution Perspective

    Authors: Edwin Vargas, Kumar Vijay Mishra, Roman Jacome, Brian M. Sadler, Henry Arguello

    Abstract: We consider a general spectral coexistence scenario, wherein the channels and transmit signals of both radar and communications systems are unknown at the receiver. In this dual-blind deconvolution (DBD) problem, a common receiver admits the multi-carrier wireless communications signal that is overlaid with the radar signal reflected off multiple targets. When the radar receiver is not co…

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: 5 pages, 2 figures, submitted to ICASSP 2022

  30. arXiv:2109.12343  [pdf, other]

    cs.RO cs.LG cs.MA eess.SY

    Beyond Robustness: A Taxonomy of Approaches towards Resilient Multi-Robot Systems

    Authors: Amanda Prorok, Matthew Malencia, Luca Carlone, Gaurav S. Sukhatme, Brian M. Sadler, Vijay Kumar

    Abstract: Robustness is key to engineering, automation, and science as a whole. However, the property of robustness is often underpinned by costly requirements such as over-provisioning, known uncertainty and predictive models, and known adversaries. These conditions are idealistic, and often not satisfiable. Resilience on the other hand is the capability to endure unexpected disruptions, to recover swiftly…

    Submitted 25 September, 2021; originally announced September 2021.

  31. arXiv:2106.13358  [pdf, other]

    cs.RO cs.LG cs.MA eess.SP eess.SY

    Scalable Perception-Action-Communication Loops with Convolutional and Graph Neural Networks

    Authors: Ting-Kuei Hu, Fernando Gama, Tianlong Chen, Wenqing Zheng, Zhangyang Wang, Alejandro Ribeiro, Brian M. Sadler

    Abstract: In this paper, we present a perception-action-communication loop design using Vision-based Graph Aggregation and Inference (VGAI). This multi-agent decentralized learning-to-control framework maps raw visual observations to agent actions, aided by local communication among neighboring agents. Our framework is implemented by a cascade of a convolutional and a graph neural network (CNN / GNN), addre…

    Submitted 5 November, 2021; v1 submitted 24 June, 2021; originally announced June 2021.

  32. arXiv:2106.02892  [pdf, other]

    cs.LG eess.SP

    Training Robust Graph Neural Networks with Topology Adaptive Edge Dropping

    Authors: Zhan Gao, Subhrajit Bhattacharya, Leiming Zhang, Rick S. Blum, Alejandro Ribeiro, Brian M. Sadler

    Abstract: Graph neural networks (GNNs) are processing architectures that exploit graph structural information to model representations from network data. Despite their success, GNNs suffer from sub-optimal generalization performance given limited training data, referred to as over-fitting. This paper proposes Topology Adaptive Edge Dropping (TADropEdge) method as an adaptive data augmentation technique to i…

    Submitted 5 June, 2021; originally announced June 2021.

  33. Secrecy of Multi-Antenna Transmission with Full-Duplex User in the Presence of Randomly Located Eavesdroppers

    Authors: Ishmam Zabir, Ahmed Maksud, Gaojie Chen, Brian M. Sadler, Yingbo Hua

    Abstract: This paper considers the secrecy performance of several schemes for multi-antenna transmission to single-antenna users with full-duplex (FD) capability against randomly distributed single-antenna eavesdroppers (EDs). These schemes and related scenarios include transmit antenna selection (TAS), transmit antenna beamforming (TAB), artificial noise (AN) from the transmitter, user selection based thei…

    Submitted 17 December, 2020; originally announced December 2020.

    Comments: Paper accepted for publication in IEEE Transactions on Information Forensics and Security

  34. arXiv:2011.07743  [pdf, other]

    cs.CL cs.AI cs.LG

    Beyond I.I.D.: Three Levels of Generalization for Question Answering on Knowledge Bases

    Authors: Yu Gu, Sue Kase, Michelle Vanni, Brian Sadler, Percy Liang, Xifeng Yan, Yu Su

    Abstract: Existing studies on question answering on knowledge bases (KBQA) mainly operate with the standard i.i.d. assumption, i.e., training distribution over questions is the same as the test distribution. However, i.i.d. may be neither reasonably achievable nor desirable on large-scale KBs because 1) true user distribution is hard to capture and 2) randomly sampling training examples from the enormous space…

    Submitted 22 February, 2021; v1 submitted 16 November, 2020; originally announced November 2020.

    Comments: Accepted to TheWebConf 2021 (previously WWW)

    ACM Class: I.2.7

  35. arXiv:2010.04790  [pdf, other]

    cs.SI

    Inter-cluster Transmission Control Using Graph Modal Barriers

    Authors: Leiming Zhang, Brian M. Sadler, Rick S. Blum, Subhrajit Bhattacharya

    Abstract: In this paper we consider the problem of transmission across a graph and how to effectively control/restrict it with limited resources. Transmission can represent information transfer across a social network, spread of a malicious virus across a computer network, or spread of an infectious disease across communities. The key insight is to assign proper weights to bottleneck edges of the graph base…

    Submitted 9 October, 2020; originally announced October 2020.

    Comments: 16 pages

  36. arXiv:2006.09310  [pdf, other]

    cs.CV cs.LG

    Deep Multimodal Transfer-Learned Regression in Data-Poor Domains

    Authors: Levi McClenny, Mulugeta Haile, Vahid Attari, Brian Sadler, Ulisses Braga-Neto, Raymundo Arroyave

    Abstract: In many real-world applications of deep learning, estimation of a target may rely on various types of input data modes, such as audio-video, image-text, etc. This task can be further complicated by a lack of sufficient data. Here we propose a Deep Multimodal Transfer-Learned Regressor (DMTL-R) for multimodal learning of image and feature data in a deep regression architecture effective at predicti…

    Submitted 16 June, 2020; originally announced June 2020.

  37. arXiv:2003.12637  [pdf, other]

    eess.SP cs.IT

    Collaborative Beamforming Under Localization Errors: A Discrete Optimization Approach

    Authors: Erfaun Noorani, Yagiz Savas, Alec Koppel, John Baras, Ufuk Topcu, Brian M. Sadler

    Abstract: We consider a network of agents that locate themselves in an environment through sensor measurements and aim to transmit a message signal to a base station via collaborative beamforming. The agents' sensor measurements result in localization errors, which degrade the quality of service at the base station due to unknown phase offsets that arise in the agents' communication channels. Assuming that…

    Submitted 17 March, 2021; v1 submitted 27 March, 2020; originally announced March 2020.

  38. arXiv:2003.10550  [pdf, other]

    cs.LG cs.IT stat.ML

    Regret and Belief Complexity Trade-off in Gaussian Process Bandits via Information Thresholding

    Authors: Amrit Singh Bedi, Dheeraj Peddireddy, Vaneet Aggarwal, Brian M. Sadler, Alec Koppel

    Abstract: Bayesian optimization is a framework for global search via maximum a posteriori updates rather than simulated annealing, and has gained prominence for decision-making under uncertainty. In this work, we cast Bayesian optimization as a multi-armed bandit problem, where the payoff function is sampled from a Gaussian process (GP). Further, we focus on action selections via upper confidence bound (UCB…

    Submitted 21 March, 2022; v1 submitted 23 March, 2020; originally announced March 2020.

  39. arXiv:2002.02308  [pdf, other]

    eess.SY cs.CV

    VGAI: End-to-End Learning of Vision-Based Decentralized Controllers for Robot Swarms

    Authors: Ting-Kuei Hu, Fernando Gama, Tianlong Chen, Zhangyang Wang, Alejandro Ribeiro, Brian M. Sadler

    Abstract: Decentralized coordination of a robot swarm requires addressing the tension between local perceptions and actions, and the accomplishment of a global objective. In this work, we propose to learn decentralized controllers based solely on raw visual inputs. For the first time, this integrates the learning of two key components, communication and visual perception, in one end-to-end framework. More s…

    Submitted 10 December, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

  40. arXiv:1910.08194  [pdf, other]

    cs.CL cs.AI

    HiExpan: Task-Guided Taxonomy Construction by Hierarchical Tree Expansion

    Authors: Jiaming Shen, Zeqiu Wu, Dongming Lei, Chao Zhang, Xiang Ren, Michelle T. Vanni, Brian M. Sadler, Jiawei Han

    Abstract: Taxonomies are of great value to many knowledge-rich applications. As manual taxonomy curation costs enormous human effort, automatic taxonomy construction is in great demand. However, most existing automatic taxonomy construction methods can only build hypernymy taxonomies wherein each edge is limited to expressing the "is-a" relation. Such a restriction limits their applicability to more di…

    Submitted 17 October, 2019; originally announced October 2019.

    Comments: KDD 2018 accepted

  41. arXiv:1909.11555  [pdf, other]

    eess.SP cs.LG math.OC

    Optimally Compressed Nonparametric Online Learning

    Authors: Alec Koppel, Amrit Singh Bedi, Ketan Rajawat, Brian M. Sadler

    Abstract: Batch training of machine learning models based on neural networks is now well established, whereas to date streaming methods are largely based on linear models. To go beyond linear in the online setting, nonparametric methods are of interest due to their universality and ability to stably incorporate new information via convexity or Bayes' Rule. Unfortunately, when used online, nonparametric meth…

    Submitted 17 January, 2020; v1 submitted 25 September, 2019; originally announced September 2019.

  42. arXiv:1909.10279  [pdf, other]

    math.ST cs.CC stat.CO

    Nearly Consistent Finite Particle Estimates in Streaming Importance Sampling

    Authors: Alec Koppel, Amrit Singh Bedi, Brian M. Sadler, Victor Elvira

    Abstract: In Bayesian inference, we seek to compute information about random variables such as moments or quantiles on the basis of available data and prior information. When the distribution of random variables is intractable, Monte Carlo (MC) sampling is usually required. Importance sampling is a standard MC tool that approximates this unavailable distribution with a set of weighted samples. This pr…

    Submitted 5 April, 2021; v1 submitted 23 September, 2019; originally announced September 2019.

  43. arXiv:1909.05442  [pdf, other]

    math.OC cs.LG eess.SP

    Nonstationary Nonparametric Online Learning: Balancing Dynamic Regret and Model Parsimony

    Authors: Amrit Singh Bedi, Alec Koppel, Ketan Rajawat, Brian M. Sadler

    Abstract: An open challenge in supervised learning is conceptual drift: a data point begins as classified according to one label, but over time the notion of that label changes. Beyond linear autoregressive models, transfer and meta learning address drift, but require data that is representative of disparate domains at the outset of training. To relax this requirement, we propose a memory-efficient \…

    Submitted 11 September, 2019; originally announced September 2019.

  44. arXiv:1812.09551  [pdf, other]

    cs.DB

    TaxoGen: Unsupervised Topic Taxonomy Construction by Adaptive Term Embedding and Clustering

    Authors: Chao Zhang, Fangbo Tao, Xiusi Chen, Jiaming Shen, Meng Jiang, Brian Sadler, Michelle Vanni, Jiawei Han

    Abstract: Taxonomy construction is not only a fundamental task for semantic analysis of text corpora, but also an important step for applications such as information filtering, recommendation, and Web search. Existing pattern-based methods extract hypernym-hyponym term pairs and then organize these pairs into a taxonomy. However, by considering each term as an independent concept node, they overlook the top…

    Submitted 22 December, 2018; originally announced December 2018.

  45. arXiv:1811.07032  [pdf, other]

    cs.CL

    Mining Entity Synonyms with Efficient Neural Set Generation

    Authors: Jiaming Shen, Ruiliang Lyu, Xiang Ren, Michelle Vanni, Brian Sadler, Jiawei Han

    Abstract: Mining entity synonym sets (i.e., sets of terms referring to the same entity) is an important task for many entity-leveraging applications. Previous work either ranks terms based on their similarity to a given query term, or treats the problem as a two-phase task (i.e., detecting synonymy pairs, followed by organizing these pairs into synonym sets). However, these approaches fail to model the holis…

    Submitted 16 November, 2018; originally announced November 2018.

    Comments: AAAI 2019 camera-ready version

  46. arXiv:1808.06740  [pdf, other]

    cs.CL cs.AI

    Interactive Semantic Parsing for If-Then Recipes via Hierarchical Reinforcement Learning

    Authors: Ziyu Yao, Xiujun Li, Jianfeng Gao, Brian Sadler, Huan Sun

    Abstract: Given a text description, most existing semantic parsers synthesize a program in one shot. However, it is quite challenging to produce a correct program solely based on the description, which in reality is often ambiguous or incomplete. In this paper, we investigate interactive semantic parsing, where the agent can ask the user clarification questions to resolve ambiguities via a multi-turn dialog…

    Submitted 14 November, 2018; v1 submitted 20 August, 2018; originally announced August 2018.

    Comments: 13 pages, 2 figures, accepted by AAAI 2019

  47. arXiv:1808.05933  [pdf, other]

    math.OC cs.DC cs.LG

    Decentralized Dictionary Learning Over Time-Varying Digraphs

    Authors: Amir Daneshmand, Ying Sun, Gesualdo Scutari, Francisco Facchinei, Brian M. Sadler

    Abstract: This paper studies Dictionary Learning problems wherein the learning task is distributed over a multi-agent network, modeled as a time-varying directed graph. This formulation is relevant, for instance, in Big Data scenarios where massive amounts of data are collected/stored in different locations (e.g., sensors, clouds) and aggregating and/or processing all data in a fusion center might be ineffi…

    Submitted 5 March, 2019; v1 submitted 17 August, 2018; originally announced August 2018.

  48. arXiv:1707.05851  [pdf, other]

    cs.SI physics.soc-ph

    Graph Filters and the Z-Laplacian

    Authors: Xiaoran Yan, Brian M. Sadler, Robert J. Drost, Paul L. Yu, Kristina Lerman

    Abstract: In network science, the interplay between dynamical processes and the underlying topologies of complex systems has led to a diverse family of models with different interpretations. In graph signal processing, this is manifested in the form of different graph shifts and their induced algebraic systems. In this paper, we propose the unifying Z-Laplacian framework, whose instances can act as graph sh…

    Submitted 18 July, 2017; originally announced July 2017.

    Comments: IEEE Journal of Selected Topics in Signal Processing (September 2017 special issue on Graph Signal Processing)

  49. Unequal Error Protection Querying Policies for the Noisy 20 Questions Problem

    Authors: Hye Won Chung, Brian M. Sadler, Lizhong Zheng, Alfred O. Hero

    Abstract: In this paper, we propose an open-loop unequal-error-protection querying policy based on superposition coding for the noisy 20 questions problem. In this problem, a player wishes to successively refine an estimate of the value of a continuous random variable by posing binary queries and receiving noisy responses. When the queries are designed non-adaptively as a single block and the noisy response…

    Submitted 28 September, 2017; v1 submitted 29 June, 2016; originally announced June 2016.

    Comments: To appear in IEEE Transactions on Information Theory

  50. arXiv:1606.05578  [pdf, other]

    cs.MA eess.SY stat.CO

    Proximity Without Consensus in Online Multi-Agent Optimization

    Authors: Alec Koppel, Brian M. Sadler, Alejandro Ribeiro

    Abstract: We consider stochastic optimization problems in multi-agent settings, where a network of agents aims to learn parameters which are optimal in terms of a global objective, while giving preference to locally observed streaming information. To do so, we depart from the canonical decentralized optimization framework where agreement constraints are enforced, and instead formulate a problem where each a…

    Submitted 17 June, 2016; originally announced June 2016.