Search | arXiv e-print repository

Capture Point Control in Thruster-Assisted Bipedal Locomotion

Authors: Shreyansh Pitroda, Aditya Bondada, Kaushik Venkatesh Krishnamurthy, Adarsh Salagame, Chenghao Wang, Taoran Liu, Bibek Gupta, Eric Sihite, Reza Nemovi, Alireza Ramezani, Morteza Gharib

Abstract: Despite major advancements in control design that are robust to unplanned disturbances, bipedal robots are still susceptible to falling over and struggle to negotiate rough terrains. By utilizing thrusters in our bipedal robot, we can perform additional posture manipulation and expand the modes of locomotion to enhance the robot's stability and ability to negotiate rough and difficult-to-navigate… ▽ More Despite major advancements in control design that are robust to unplanned disturbances, bipedal robots are still susceptible to falling over and struggle to negotiate rough terrains. By utilizing thrusters in our bipedal robot, we can perform additional posture manipulation and expand the modes of locomotion to enhance the robot's stability and ability to negotiate rough and difficult-to-navigate terrains. In this paper, we present our efforts in designing a controller based on capture point control for our thruster-assisted walking model named Harpy and explore its control design possibilities. While capture point control based on centroidal models for bipedal systems has been extensively studied, the incorporation of external forces that can influence the dynamics of linear inverted pendulum models, often used in capture point-based works, has not been explored before. The inclusion of these external forces can lead to interesting interpretations of locomotion, such as virtual buoyancy studied in aquatic-legged locomotion. This paper outlines the dynamical model of our robot, the capture point method we use to assist the upper body stabilization, and the simulation work done to show the controller's feasibility. △ Less

Submitted 20 June, 2024; originally announced June 2024.

Comments: Submitted and to be presented at IEEE AIM 2024. arXiv admin note: substantial text overlap with arXiv:2103.15952

arXiv:2406.13118 [pdf, other]

Thruster-Assisted Incline Walking

Authors: Kaushik Venkatesh Krishnamurthy, Chenghao Wang, Shreyansh Pitroda, Adarsh Salagame, Eric Sihite, Reza Nemovi, Alireza Ramezani, Morteza Gharib

Abstract: In this study, our aim is to evaluate the effectiveness of thruster-assisted steep slope walking for the Husky Carbon, a quadrupedal robot equipped with custom-designed actuators and plural electric ducted fans, through simulation prior to conducting experimental trials. Thruster-assisted steep slope walking draws inspiration from wing-assisted incline running (WAIR) observed in birds, and intrigu… ▽ More In this study, our aim is to evaluate the effectiveness of thruster-assisted steep slope walking for the Husky Carbon, a quadrupedal robot equipped with custom-designed actuators and plural electric ducted fans, through simulation prior to conducting experimental trials. Thruster-assisted steep slope walking draws inspiration from wing-assisted incline running (WAIR) observed in birds, and intriguingly incorporates posture manipulation and thrust vectoring, a locomotion technique not previously explored in the animal kingdom. Our approach involves develo** a reduced-order model of the Husky robot, followed by the application of an optimization-based controller utilizing collocation methods and dynamics interpolation to determine control actions. Through simulation testing, we demonstrate the feasibility of hardware implementation of our controller. △ Less

Submitted 18 June, 2024; originally announced June 2024.

Comments: 7 pages, 7 figures, submitted to CDC 2024 conference. arXiv admin note: text overlap with arXiv:2405.06070

arXiv:2405.06070 [pdf, other]

Narrow-Path, Dynamic Walking Using Integrated Posture Manipulation and Thrust Vectoring

Authors: Kaushik Venkatesh Krishnamurthy, Chenghao Wang, Shreyansh Pitroda, Adarsh Salagame, Eric Sihite, Reza Nemovi, Alireza Ramezani, Morteza Gharib

Abstract: This research concentrates on enhancing the navigational capabilities of Northeastern Universitys Husky, a multi-modal quadrupedal robot, that can integrate posture manipulation and thrust vectoring, to traverse through narrow pathways such as walking over pipes and slacklining. The Husky is outfitted with thrusters designed to stabilize its body during dynamic walking over these narrow paths. The… ▽ More This research concentrates on enhancing the navigational capabilities of Northeastern Universitys Husky, a multi-modal quadrupedal robot, that can integrate posture manipulation and thrust vectoring, to traverse through narrow pathways such as walking over pipes and slacklining. The Husky is outfitted with thrusters designed to stabilize its body during dynamic walking over these narrow paths. The project involves modeling the robot using the HROM (Husky Reduced Order Model) and develo** an optimal control framework. This framework is based on polynomial approximation of the HROM and a collocation approach to derive optimal thruster commands necessary for achieving dynamic walking on narrow paths. The effectiveness of the modeling and control design approach is validated through simulations conducted using Matlab. △ Less

Submitted 9 May, 2024; originally announced May 2024.

Comments: arXiv admin note: text overlap with arXiv:2312.12586

arXiv:2405.04636 [pdf, ps, other]

Data-driven Error Estimation: Upper Bounding Multiple Errors with No Technical Debt

Authors: Sanath Kumar Krishnamurthy, Susan Athey, Emma Brunskill

Abstract: We formulate the problem of constructing multiple simultaneously valid confidence intervals (CIs) as estimating a high probability upper bound on the maximum error for a class/set of estimate-estimand-error tuples, and refer to this as the error estimation problem. For a single such tuple, data-driven confidence intervals can often be used to bound the error in our estimate. However, for a class o… ▽ More We formulate the problem of constructing multiple simultaneously valid confidence intervals (CIs) as estimating a high probability upper bound on the maximum error for a class/set of estimate-estimand-error tuples, and refer to this as the error estimation problem. For a single such tuple, data-driven confidence intervals can often be used to bound the error in our estimate. However, for a class of estimate-estimand-error tuples, nontrivial high probability upper bounds on the maximum error often require class complexity as input -- limiting the practicality of such methods and often resulting in loose bounds. Rather than deriving theoretical class complexity-based bounds, we propose a completely data-driven approach to estimate an upper bound on the maximum error. The simple and general nature of our solution to this fundamental challenge lends itself to several applications including: multiple CI construction, multiple hypothesis testing, estimating excess risk bounds (a fundamental measure of uncertainty in machine learning) for any training/fine-tuning algorithm, and enabling the development of a contextual bandit pipeline that can leverage any reward model estimation procedure as input (without additional mathematical analysis). △ Less

Submitted 7 May, 2024; originally announced May 2024.

arXiv:2404.11058 [pdf, other]

Multimodal Fusion of Echocardiography and Electronic Health Records for the Detection of Cardiac Amyloidosis

Authors: Zishun Feng, Joseph A. Sivak, Ashok K. Krishnamurthy

Abstract: Cardiac amyloidosis, a rare and highly morbid condition, presents significant challenges for detection through echocardiography. Recently, there has been a surge in proposing machine-learning algorithms to identify cardiac amyloidosis, with the majority being imaging-based deep-learning approaches that require extensive data. In this study, we introduce a novel transformer-based multimodal fusion… ▽ More Cardiac amyloidosis, a rare and highly morbid condition, presents significant challenges for detection through echocardiography. Recently, there has been a surge in proposing machine-learning algorithms to identify cardiac amyloidosis, with the majority being imaging-based deep-learning approaches that require extensive data. In this study, we introduce a novel transformer-based multimodal fusion algorithm that leverages information from both imaging and electronic health records. Specifically, our approach utilizes echocardiography videos from both the parasternal long-axis (PLAX) view and the apical 4-chamber (A4C) view along with patients' demographic data, laboratory tests, and cardiac metrics to predict the probability of cardiac amyloidosis. We evaluated our method using 5-fold cross-validation on a dataset comprising 41 patients and achieved an Area Under the Receiver Operating Characteristic curve (AUROC) of 0.94. The experimental results demonstrate that our approach can achieve competitive results with a significantly smaller dataset compared to prior imaging-based methods that required data from thousands of patients. This underscores the potential of leveraging multimodal data to enhance diagnostic accuracy in the identification of complex cardiac conditions such as cardiac amyloidosis. △ Less

Submitted 7 June, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

arXiv:2401.07123 [pdf, other]

One Agent Too Many: User Perspectives on Approaches to Multi-agent Conversational AI

Authors: Christopher Clarke, Karthik Krishnamurthy, Walter Talamonti, Yi** Kang, Lingjia Tang, Jason Mars

Abstract: Conversational agents have been gaining increasing popularity in recent years. Influenced by the widespread adoption of task-oriented agents such as Apple Siri and Amazon Alexa, these agents are being deployed into various applications to enhance user experience. Although these agents promote "ask me anything" functionality, they are typically built to focus on a single or finite set of expertise.… ▽ More Conversational agents have been gaining increasing popularity in recent years. Influenced by the widespread adoption of task-oriented agents such as Apple Siri and Amazon Alexa, these agents are being deployed into various applications to enhance user experience. Although these agents promote "ask me anything" functionality, they are typically built to focus on a single or finite set of expertise. Given that complex tasks often require more than one expertise, this results in the users needing to learn and adopt multiple agents. One approach to alleviate this is to abstract the orchestration of agents in the background. However, this removes the option of choice and flexibility, potentially harming the ability to complete tasks. In this paper, we explore these different interaction experiences (one agent for all) vs (user choice of agents) for conversational AI. We design prototypes for each, systematically evaluating their ability to facilitate task completion. Through a series of conducted user studies, we show that users have a significant preference for abstracting agent orchestration in both system usability and system performance. Additionally, we demonstrate that this mode of interaction is able to provide quality responses that are rated within 1% of human-selected answers. △ Less

Submitted 13 January, 2024; originally announced January 2024.

arXiv:2312.12586 [pdf]

Towards dynamic Narrow path walking on NU's Husky

Authors: Kaushik Venkatesh Krishnamurthy

Abstract: This research focuses on enabling Northeastern University's Husky, a multi-modal quadrupedal robot, to navigate narrow paths akin to various animals in nature. The Husky is equipped with thrusters to stabilize its body during dynamic maneuvers, addressing challenges inherent in aerial-legged systems. The approach involves modeling the robot as HROM (Husky Reduced Model) and creating an optimal con… ▽ More This research focuses on enabling Northeastern University's Husky, a multi-modal quadrupedal robot, to navigate narrow paths akin to various animals in nature. The Husky is equipped with thrusters to stabilize its body during dynamic maneuvers, addressing challenges inherent in aerial-legged systems. The approach involves modeling the robot as HROM (Husky Reduced Model) and creating an optimal control framework using linearized dynamics for narrow path walking. The thesis introduces a gait scheduling method to generate an open-loop walking gait and validates these gaits through a high-fidelity Simscape simulation. Experimental results of the open-loop walking are presented, accompanied by potential directions for advancing this robotic system. △ Less

Submitted 19 December, 2023; originally announced December 2023.

Comments: 60 pages, 27 figures

arXiv:2309.06629 [pdf, other]

The Relational Bottleneck as an Inductive Bias for Efficient Abstraction

Authors: Taylor W. Webb, Steven M. Frankland, Awni Altabaa, Simon Segert, Kamesh Krishnamurthy, Declan Campbell, Jacob Russin, Tyler Giallanza, Zack Dulberg, Randall O'Reilly, John Lafferty, Jonathan D. Cohen

Abstract: A central challenge for cognitive science is to explain how abstract concepts are acquired from limited experience. This has often been framed in terms of a dichotomy between connectionist and symbolic cognitive models. Here, we highlight a recently emerging line of work that suggests a novel reconciliation of these approaches, by exploiting an inductive bias that we term the relational bottleneck… ▽ More A central challenge for cognitive science is to explain how abstract concepts are acquired from limited experience. This has often been framed in terms of a dichotomy between connectionist and symbolic cognitive models. Here, we highlight a recently emerging line of work that suggests a novel reconciliation of these approaches, by exploiting an inductive bias that we term the relational bottleneck. In that approach, neural networks are constrained via their architecture to focus on relations between perceptual inputs, rather than the attributes of individual inputs. We review a family of models that employ this approach to induce abstractions in a data-efficient manner, emphasizing their potential as candidate models for the acquisition of abstract concepts in the human mind and brain. △ Less

Submitted 1 May, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

arXiv:2308.03754 [pdf, other]

High-Dimensional Non-Convex Landscapes and Gradient Descent Dynamics

Authors: Tony Bonnaire, Davide Ghio, Kamesh Krishnamurthy, Francesca Mignacco, Atsushi Yamamura, Giulio Biroli

Abstract: In these lecture notes we present different methods and concepts developed in statistical physics to analyze gradient descent dynamics in high-dimensional non-convex landscapes. Our aim is to show how approaches developed in physics, mainly statistical physics of disordered systems, can be used to tackle open questions on high-dimensional dynamics in Machine Learning. In these lecture notes we present different methods and concepts developed in statistical physics to analyze gradient descent dynamics in high-dimensional non-convex landscapes. Our aim is to show how approaches developed in physics, mainly statistical physics of disordered systems, can be used to tackle open questions on high-dimensional dynamics in Machine Learning. △ Less

Submitted 10 November, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

Comments: Lectures given by G. Biroli at the 2022 Les Houches Summer School "Statistical Physics and Machine Learning"

arXiv:2307.06398 [pdf, other]

Trainability, Expressivity and Interpretability in Gated Neural ODEs

Authors: Timothy Doyeon Kim, Tankut Can, Kamesh Krishnamurthy

Abstract: Understanding how the dynamics in biological and artificial neural networks implement the computations required for a task is a salient open question in machine learning and neuroscience. In particular, computations requiring complex memory storage and retrieval pose a significant challenge for these networks to implement or learn. Recently, a family of models described by neural ordinary differen… ▽ More Understanding how the dynamics in biological and artificial neural networks implement the computations required for a task is a salient open question in machine learning and neuroscience. In particular, computations requiring complex memory storage and retrieval pose a significant challenge for these networks to implement or learn. Recently, a family of models described by neural ordinary differential equations (nODEs) has emerged as powerful dynamical neural network models capable of capturing complex dynamics. Here, we extend nODEs by endowing them with adaptive timescales using gating interactions. We refer to these as gated neural ODEs (gnODEs). Using a task that requires memory of continuous quantities, we demonstrate the inductive bias of the gnODEs to learn (approximate) continuous attractors. We further show how reduced-dimensional gnODEs retain their modeling power while greatly improving interpretability, even allowing explicit visualization of the structure of learned attractors. We introduce a novel measure of expressivity which probes the capacity of a neural network to generate complex trajectories. Using this measure, we explore how the phase-space dimension of the nODEs and the complexity of the function modeling the flow field contribute to expressivity. We see that a more complex function for modeling the flow field allows a lower-dimensional nODE to capture a given target dynamics. Finally, we demonstrate the benefit of gating in nODEs on several real-world tasks. △ Less

Submitted 12 July, 2023; originally announced July 2023.

arXiv:2307.02108 [pdf, other]

Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization

Authors: Sanath Kumar Krishnamurthy, Ruohan Zhan, Susan Athey, Emma Brunskill

Abstract: In many applications, e.g. in healthcare and e-commerce, the goal of a contextual bandit may be to learn an optimal treatment assignment policy at the end of the experiment. That is, to minimize simple regret. However, this objective remains understudied. We propose a new family of computationally efficient bandit algorithms for the stochastic contextual bandit setting, where a tuning parameter de… ▽ More In many applications, e.g. in healthcare and e-commerce, the goal of a contextual bandit may be to learn an optimal treatment assignment policy at the end of the experiment. That is, to minimize simple regret. However, this objective remains understudied. We propose a new family of computationally efficient bandit algorithms for the stochastic contextual bandit setting, where a tuning parameter determines the weight placed on cumulative regret minimization (where we establish near-optimal minimax guarantees) versus simple regret minimization (where we establish state-of-the-art guarantees). Our algorithms work with any function class, are robust to model misspecification, and can be used in continuous arm settings. This flexibility comes from constructing and relying on "conformal arm sets" (CASs). CASs provide a set of arms for every context, encompassing the context-specific optimal arm with a certain probability across the context distribution. Our positive results on simple and cumulative regret guarantees are contrasted with a negative result, which shows that no algorithm can achieve instance-dependent simple regret guarantees while simultaneously achieving minimax optimal cumulative regret guarantees. △ Less

Submitted 2 November, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

arXiv:2302.00284 [pdf, other]

Selective Uncertainty Propagation in Offline RL

Authors: Sanath Kumar Krishnamurthy, Shrey Modi, Tanmay Gangwani, Sumeet Katariya, Branislav Kveton, Anshuka Rangi

Abstract: We consider the finite-horizon offline reinforcement learning (RL) setting, and are motivated by the challenge of learning the policy at any step h in dynamic programming (DP) algorithms. To learn this, it is sufficient to evaluate the treatment effect of deviating from the behavioral policy at step h after having optimized the policy for all future steps. Since the policy at any step can affect n… ▽ More We consider the finite-horizon offline reinforcement learning (RL) setting, and are motivated by the challenge of learning the policy at any step h in dynamic programming (DP) algorithms. To learn this, it is sufficient to evaluate the treatment effect of deviating from the behavioral policy at step h after having optimized the policy for all future steps. Since the policy at any step can affect next-state distributions, the related distributional shift challenges can make this problem far more statistically hard than estimating such treatment effects in the stochastic contextual bandit setting. However, the hardness of many real-world RL instances lies between the two regimes. We develop a flexible and general method called selective uncertainty propagation for confidence interval construction that adapts to the hardness of the associated distribution shift challenges. We show benefits of our approach on toy environments and demonstrate the benefits of these techniques for offline policy learning. △ Less

Submitted 12 February, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

arXiv:2211.12004 [pdf, other]

Contextual Bandits in a Survey Experiment on Charitable Giving: Within-Experiment Outcomes versus Policy Learning

Authors: Susan Athey, Undral Byambadalai, Vitor Hadad, Sanath Kumar Krishnamurthy, Weiwen Leung, Joseph Jay Williams

Abstract: We design and implement an adaptive experiment (a ``contextual bandit'') to learn a targeted treatment assignment policy, where the goal is to use a participant's survey responses to determine which charity to expose them to in a donation solicitation. The design balances two competing objectives: optimizing the outcomes for the subjects in the experiment (``cumulative regret minimization'') and g… ▽ More We design and implement an adaptive experiment (a ``contextual bandit'') to learn a targeted treatment assignment policy, where the goal is to use a participant's survey responses to determine which charity to expose them to in a donation solicitation. The design balances two competing objectives: optimizing the outcomes for the subjects in the experiment (``cumulative regret minimization'') and gathering data that will be most useful for policy learning, that is, for learning an assignment rule that will maximize welfare if used after the experiment (``simple regret minimization''). We evaluate alternative experimental designs by collecting pilot data and then conducting a simulation study. Next, we implement our selected algorithm. Finally, we perform a second simulation study anchored to the collected data that evaluates the benefits of the algorithm we chose. Our first result is that the value of a learned policy in this setting is higher when data is collected via a uniform randomization rather than collected adaptively using standard cumulative regret minimization or policy learning algorithms. We propose a simple heuristic for adaptive experimentation that improves upon uniform randomization from the perspective of policy learning at the expense of increasing cumulative regret relative to alternative bandit algorithms. The heuristic modifies an existing contextual bandit algorithm by (i) imposing a lower bound on assignment probabilities that decay slowly so that no arm is discarded too quickly, and (ii) after adaptively collecting data, restricting policy learning to select from arms where sufficient data has been gathered. △ Less

Submitted 21 November, 2022; originally announced November 2022.

ACM Class: G.3; I.2.6

arXiv:2208.06900 [pdf, other]

Convolutional Spiking Neural Networks for Detecting Anticipatory Brain Potentials Using Electroencephalogram

Authors: Nathan Lutes, Venkata Sriram Siddhardh Nadendla, K. Krishnamurthy

Abstract: Spiking neural networks (SNNs) are receiving increased attention because they mimic synaptic connections in biological systems and produce spike trains, which can be approximated by binary values for computational efficiency. Recently, the addition of convolutional layers to combine the feature extraction power of convolutional networks with the computational efficiency of SNNs has been introduced… ▽ More Spiking neural networks (SNNs) are receiving increased attention because they mimic synaptic connections in biological systems and produce spike trains, which can be approximated by binary values for computational efficiency. Recently, the addition of convolutional layers to combine the feature extraction power of convolutional networks with the computational efficiency of SNNs has been introduced. This paper studies the feasibility of using a convolutional spiking neural network (CSNN) to detect anticipatory slow cortical potentials (SCPs) related to braking intention in human participants using an electroencephalogram (EEG). Data was collected during an experiment wherein participants operated a remote-controlled vehicle on a testbed designed to simulate an urban environment. Participants were alerted to an incoming braking event via an audio countdown to elicit anticipatory potentials that were measured using an EEG. The CSNN's performance was compared to a standard CNN, EEGNet and three graph neural networks via 10-fold cross-validation. The CSNN outperformed all the other neural networks, and had a predictive accuracy of 99.06 percent with a true positive rate of 98.50 percent, a true negative rate of 99.20 percent and an F1-score of 0.98. Performance of the CSNN was comparable to the CNN in an ablation study using a subset of EEG channels that localized SCPs. Classification performance of the CSNN degraded only slightly when the floating-point EEG data were converted into spike trains via delta modulation to mimic synaptic connections. △ Less

Submitted 24 March, 2024; v1 submitted 14 August, 2022; originally announced August 2022.

Comments: 16 pages, 6 figures, Scientific Reports submission

arXiv:2207.12254 [pdf, other]

A Letter on Progress Made on Husky Carbon: A Legged-Aerial, Multi-modal Platform

Authors: Adarsh Salagame, Shoghair Manjikian, Chenghao Wang, Kaushik Venkatesh Krishnamurthy, Shreyansh Pitroda, Bibek Gupta, Tobias Jacob, Benjamin Mottis, Eric Sihite, Milad Ramezani, Alireza Ramezani

Abstract: Animals, such as birds, widely use multi-modal locomotion by combining legged and aerial mobility with dominant inertial effects. The robotic biomimicry of this multi-modal locomotion feat can yield ultra-flexible systems in terms of their ability to negotiate their task spaces. The main objective of this paper is to discuss the challenges in achieving multi-modal locomotion, and to report our pro… ▽ More Animals, such as birds, widely use multi-modal locomotion by combining legged and aerial mobility with dominant inertial effects. The robotic biomimicry of this multi-modal locomotion feat can yield ultra-flexible systems in terms of their ability to negotiate their task spaces. The main objective of this paper is to discuss the challenges in achieving multi-modal locomotion, and to report our progress in develo** our quadrupedal robot capable of multi-modal locomotion (legged and aerial locomotion), the Husky Carbon. We report the mechanical and electrical components utilized in our robot, in addition to the simulation and experimentation done to achieve our goal in develo** a versatile multi-modal robotic platform. △ Less

Submitted 25 July, 2022; originally announced July 2022.

Comments: arXiv admin note: text overlap with arXiv:2104.05834, arXiv:2205.06392

arXiv:2205.09732 [pdf, other]

Enhancing Slot Tagging with Intent Features for Task Oriented Natural Language Understanding using BERT

Authors: Shruthi Hariharan, Vignesh Kumar Krishnamurthy, Utkarsh, Jayantha Gowda Sarapanahalli

Abstract: Recent joint intent detection and slot tagging models have seen improved performance when compared to individual models. In many real-world datasets, the slot labels and values have a strong correlation with their intent labels. In such cases, the intent label information may act as a useful feature to the slot tagging model. In this paper, we examine the effect of leveraging intent label features… ▽ More Recent joint intent detection and slot tagging models have seen improved performance when compared to individual models. In many real-world datasets, the slot labels and values have a strong correlation with their intent labels. In such cases, the intent label information may act as a useful feature to the slot tagging model. In this paper, we examine the effect of leveraging intent label features through 3 techniques in the slot tagging task of joint intent and slot detection models. We evaluate our techniques on benchmark spoken language datasets SNIPS and ATIS, as well as over a large private Bixby dataset and observe an improved slot-tagging performance over state-of-the-art models. △ Less

Submitted 23 May, 2022; v1 submitted 19 May, 2022; originally announced May 2022.

Comments: 11 pages, 1 figure

arXiv:2203.16668 [pdf, other]

Flexible and Efficient Contextual Bandits with Heterogeneous Treatment Effect Oracles

Authors: Aldo Gael Carranza, Sanath Kumar Krishnamurthy, Susan Athey

Abstract: Contextual bandit algorithms often estimate reward models to inform decision-making. However, true rewards can contain action-independent redundancies that are not relevant for decision-making. We show it is more data-efficient to estimate any function that explains the reward differences between actions, that is, the treatment effects. Motivated by this observation, building on recent work on ora… ▽ More Contextual bandit algorithms often estimate reward models to inform decision-making. However, true rewards can contain action-independent redundancies that are not relevant for decision-making. We show it is more data-efficient to estimate any function that explains the reward differences between actions, that is, the treatment effects. Motivated by this observation, building on recent work on oracle-based bandit algorithms, we provide the first reduction of contextual bandits to general-purpose heterogeneous treatment effect estimation, and we design a simple and computationally efficient algorithm based on this reduction. Our theoretical and experimental results demonstrate that heterogeneous treatment effect estimation in contextual bandits offers practical advantages over reward estimation, including more efficient model estimation and greater flexibility to model misspecification. △ Less

Submitted 24 February, 2023; v1 submitted 30 March, 2022; originally announced March 2022.

arXiv:2203.07665 [pdf, other]

One Agent To Rule Them All: Towards Multi-agent Conversational AI

Authors: Christopher Clarke, Joseph Joshua Peper, Karthik Krishnamurthy, Walter Talamonti, Kevin Leach, Walter Lasecki, Yi** Kang, Lingjia Tang, Jason Mars

Abstract: The increasing volume of commercially available conversational agents (CAs) on the market has resulted in users being burdened with learning and adopting multiple agents to accomplish their tasks. Though prior work has explored supporting a multitude of domains within the design of a single agent, the interaction experience suffers due to the large action space of desired capabilities. To address… ▽ More The increasing volume of commercially available conversational agents (CAs) on the market has resulted in users being burdened with learning and adopting multiple agents to accomplish their tasks. Though prior work has explored supporting a multitude of domains within the design of a single agent, the interaction experience suffers due to the large action space of desired capabilities. To address these problems, we introduce a new task BBAI: Black-Box Agent Integration, focusing on combining the capabilities of multiple black-box CAs at scale. We explore two techniques: question agent pairing and question response pairing aimed at resolving this task. Leveraging these techniques, we design One For All (OFA), a scalable system that provides a unified interface to interact with multiple CAs. Additionally, we introduce MARS: Multi-Agent Response Selection, a new encoder model for question response pairing that jointly encodes user question and agent response pairs. We demonstrate that OFA is able to automatically and accurately integrate an ensemble of commercially available CAs spanning disparate domains. Specifically, using the MARS encoder we achieve the highest accuracy on our BBAI task, outperforming strong baselines. △ Less

Submitted 15 March, 2022; originally announced March 2022.

arXiv:2109.03879 [pdf, other]

Emergence of memory manifolds

Authors: Tankut Can, Kamesh Krishnamurthy

Abstract: The ability to store continuous variables in the state of a biological system (e.g. a neural network) is critical for many behaviours. Most models for implementing such a memory manifold require hand-crafted symmetries in the interactions or precise fine-tuning of parameters. We present a general principle that we refer to as {\it frozen stabilisation} (FS), which allows a family of neural network… ▽ More The ability to store continuous variables in the state of a biological system (e.g. a neural network) is critical for many behaviours. Most models for implementing such a memory manifold require hand-crafted symmetries in the interactions or precise fine-tuning of parameters. We present a general principle that we refer to as {\it frozen stabilisation} (FS), which allows a family of neural networks to self-organise to a critical state exhibiting multiple memory manifolds without parameter fine-tuning or symmetries. Memory manifolds arising from FS exhibit a wide range of emergent relaxational timescales and can be used as general purpose integrators for inputs aligned with the manifold. Moreover, FS allows robust memory manifolds in small networks, and this is relevant to debates of implementing continuous attractors with a small number of neurons in light of recent experimental discoveries. △ Less

Submitted 5 December, 2021; v1 submitted 8 September, 2021; originally announced September 2021.

Comments: 18 pages, 9 figures

arXiv:2107.10383 [pdf, ps, other]

Online-Learning Deep Neuro-Adaptive Dynamic Inversion Controller for Model Free Control

Authors: Nathan Lutes, K. Krishnamurthy, Venkata Sriram Siddhardh Nadendla, S. N. Balakrishnan

Abstract: Adaptive methods are popular within the control literature due to the flexibility and forgiveness they offer in the area of modelling. Neural network adaptive control is favorable specifically for the powerful nature of the machine learning algorithm to approximate unknown functions and for the ability to relax certain constraints within traditional adaptive control. Deep neural networks are large… ▽ More Adaptive methods are popular within the control literature due to the flexibility and forgiveness they offer in the area of modelling. Neural network adaptive control is favorable specifically for the powerful nature of the machine learning algorithm to approximate unknown functions and for the ability to relax certain constraints within traditional adaptive control. Deep neural networks are large framework networks with vastly superior approximation characteristics than their shallow counterparts. However, implementing a deep neural network can be difficult due to size specific complications such as vanishing/exploding gradients in training. In this paper, a neuro-adaptive controller is implemented featuring a deep neural network trained on a new weight update law that escapes the vanishing/exploding gradient problem by only incorporating the sign of the gradient. The type of controller designed is an adaptive dynamic inversion controller utilizing a modified state observer in a secondary estimation loop to train the network. The deep neural network learns the entire plant model on-line, creating a controller that is completely model free. The controller design is tested in simulation on a 2 link planar robot arm. The controller is able to learn the nonlinear plant quickly and displays good performance in the tracking control problem. △ Less

Submitted 21 July, 2021; originally announced July 2021.

Comments: 8 pages, 4 fugures, manuscript under review for CDC'2021

arXiv:2106.06483 [pdf, ps, other]

Towards Costless Model Selection in Contextual Bandits: A Bias-Variance Perspective

Authors: Sanath Kumar Krishnamurthy, Adrienne Margaret Propp, Susan Athey

Abstract: Model selection in supervised learning provides costless guarantees as if the model that best balances bias and variance was known a priori. We study the feasibility of similar guarantees for cumulative regret minimization in the stochastic contextual bandit setting. Recent work [Marinov and Zimmert, 2021] identifies instances where no algorithm can guarantee costless regret bounds. Nevertheless,… ▽ More Model selection in supervised learning provides costless guarantees as if the model that best balances bias and variance was known a priori. We study the feasibility of similar guarantees for cumulative regret minimization in the stochastic contextual bandit setting. Recent work [Marinov and Zimmert, 2021] identifies instances where no algorithm can guarantee costless regret bounds. Nevertheless, we identify benign conditions where costless model selection is feasible: gradually increasing class complexity, and diminishing marginal returns for best-in-class policy value with increasing class complexity. Our algorithm is based on a novel misspecification test, and our analysis demonstrates the benefits of using model selection for reward estimation. Unlike prior work on model selection in contextual bandits, our algorithm carefully adapts to the evolving bias-variance trade-off as more data is collected. In particular, our algorithm and analysis go beyond adapting to the complexity of the simplest realizable class and instead adapt to the complexity of the simplest class whose estimation variance dominates the bias. For short horizons, this provides improved regret guarantees that depend on the complexity of simpler classes. △ Less

Submitted 23 October, 2023; v1 submitted 11 June, 2021; originally announced June 2021.

arXiv:2102.13240 [pdf, other]

Adapting to Misspecification in Contextual Bandits with Offline Regression Oracles

Authors: Sanath Kumar Krishnamurthy, Vitor Hadad, Susan Athey

Abstract: Computationally efficient contextual bandits are often based on estimating a predictive model of rewards given contexts and arms using past data. However, when the reward model is not well-specified, the bandit algorithm may incur unexpected regret, so recent work has focused on algorithms that are robust to misspecification. We propose a simple family of contextual bandit algorithms that adapt to… ▽ More Computationally efficient contextual bandits are often based on estimating a predictive model of rewards given contexts and arms using past data. However, when the reward model is not well-specified, the bandit algorithm may incur unexpected regret, so recent work has focused on algorithms that are robust to misspecification. We propose a simple family of contextual bandit algorithms that adapt to misspecification error by reverting to a good safe policy when there is evidence that misspecification is causing a regret increase. Our algorithm requires only an offline regression oracle to ensure regret guarantees that gracefully degrade in terms of a measure of the average misspecification level. Compared to prior work, we attain similar regret guarantees, but we do no rely on a master algorithm, and do not require more robust oracles like online or constrained regression oracles (e.g., Foster et al. (2020a); Krishnamurthy et al. (2020)). This allows us to design algorithms for more general function approximation classes. △ Less

Submitted 11 June, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

Comments: ICML 2021

arXiv:2010.13013 [pdf, other]

Tractable contextual bandits beyond realizability

Authors: Sanath Kumar Krishnamurthy, Vitor Hadad, Susan Athey

Abstract: Tractable contextual bandit algorithms often rely on the realizability assumption - i.e., that the true expected reward model belongs to a known class, such as linear functions. In this work, we present a tractable bandit algorithm that is not sensitive to the realizability assumption and computationally reduces to solving a constrained regression problem in every epoch. When realizability does no… ▽ More Tractable contextual bandit algorithms often rely on the realizability assumption - i.e., that the true expected reward model belongs to a known class, such as linear functions. In this work, we present a tractable bandit algorithm that is not sensitive to the realizability assumption and computationally reduces to solving a constrained regression problem in every epoch. When realizability does not hold, our algorithm ensures the same guarantees on regret achieved by realizability-based algorithms under realizability, up to an additive term that accounts for the misspecification error. This extra term is proportional to T times a function of the mean squared error between the best model in the class and the true model, where T is the total number of time-steps. Our work sheds light on the bias-variance trade-off for tractable contextual bandits. This trade-off is not captured by algorithms that assume realizability, since under this assumption there exists an estimator in the class that attains zero bias. △ Less

Submitted 25 February, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

Comments: 35 pages, 6 figures

arXiv:2007.14823 [pdf, other]

Theory of gating in recurrent neural networks

Authors: Kamesh Krishnamurthy, Tankut Can, David J. Schwab

Abstract: Recurrent neural networks (RNNs) are powerful dynamical models, widely used in machine learning (ML) and neuroscience. Prior theoretical work has focused on RNNs with additive interactions. However, gating - i.e. multiplicative - interactions are ubiquitous in real neurons and also the central feature of the best-performing RNNs in ML. Here, we show that gating offers flexible control of two salie… ▽ More Recurrent neural networks (RNNs) are powerful dynamical models, widely used in machine learning (ML) and neuroscience. Prior theoretical work has focused on RNNs with additive interactions. However, gating - i.e. multiplicative - interactions are ubiquitous in real neurons and also the central feature of the best-performing RNNs in ML. Here, we show that gating offers flexible control of two salient features of the collective dynamics: i) timescales and ii) dimensionality. The gate controlling timescales leads to a novel, marginally stable state, where the network functions as a flexible integrator. Unlike previous approaches, gating permits this important function without parameter fine-tuning or special symmetries. Gates also provide a flexible, context-dependent mechanism to reset the memory trace, thus complementing the memory function. The gate modulating the dimensionality can induce a novel, discontinuous chaotic transition, where inputs push a stable system to strong chaotic activity, in contrast to the typically stabilizing effect of inputs. At this transition, unlike additive RNNs, the proliferation of critical points (topological complexity) is decoupled from the appearance of chaotic dynamics (dynamical complexity). The rich dynamics are summarized in phase diagrams, thus providing a map for principled parameter initialization choices to ML practitioners. △ Less

Submitted 1 December, 2021; v1 submitted 29 July, 2020; originally announced July 2020.

Comments: 13 figures

arXiv:2002.09814 [pdf, other]

Survey Bandits with Regret Guarantees

Authors: Sanath Kumar Krishnamurthy, Susan Athey

Abstract: We consider a variant of the contextual bandit problem. In standard contextual bandits, when a user arrives we get the user's complete feature vector and then assign a treatment (arm) to that user. In a number of applications (like healthcare), collecting features from users can be costly. To address this issue, we propose algorithms that avoid needless feature collection while maintaining strong… ▽ More We consider a variant of the contextual bandit problem. In standard contextual bandits, when a user arrives we get the user's complete feature vector and then assign a treatment (arm) to that user. In a number of applications (like healthcare), collecting features from users can be costly. To address this issue, we propose algorithms that avoid needless feature collection while maintaining strong regret guarantees. △ Less

Submitted 22 February, 2020; originally announced February 2020.

Comments: 17 pages, 10 figures

arXiv:2002.00025 [pdf, other]

Gating creates slow modes and controls phase-space complexity in GRUs and LSTMs

Authors: Tankut Can, Kamesh Krishnamurthy, David J. Schwab

Abstract: Recurrent neural networks (RNNs) are powerful dynamical models for data with complex temporal structure. However, training RNNs has traditionally proved challenging due to exploding or vanishing of gradients. RNN models such as LSTMs and GRUs (and their variants) significantly mitigate these issues associated with training by introducing various types of gating units into the architecture. While t… ▽ More Recurrent neural networks (RNNs) are powerful dynamical models for data with complex temporal structure. However, training RNNs has traditionally proved challenging due to exploding or vanishing of gradients. RNN models such as LSTMs and GRUs (and their variants) significantly mitigate these issues associated with training by introducing various types of gating units into the architecture. While these gates empirically improve performance, how the addition of gates influences the dynamics and trainability of GRUs and LSTMs is not well understood. Here, we take the perspective of studying randomly initialized LSTMs and GRUs as dynamical systems, and ask how the salient dynamical properties are shaped by the gates. We leverage tools from random matrix theory and mean-field theory to study the state-to-state Jacobians of GRUs and LSTMs. We show that the update gate in the GRU and the forget gate in the LSTM can lead to an accumulation of slow modes in the dynamics. Moreover, the GRU update gate can poise the system at a marginally stable point. The reset gate in the GRU and the output and input gates in the LSTM control the spectral radius of the Jacobian, and the GRU reset gate also modulates the complexity of the landscape of fixed-points. Furthermore, for the GRU we obtain a phase diagram describing the statistical properties of fixed-points. We also provide a preliminary comparison of training performance to the various dynamical regimes realized by varying hyperparameters. Looking to the future, we have introduced a powerful set of techniques which can be adapted to a broad class of RNNs, to study the influence of various architectural choices on dynamics, and potentially motivate the principled discovery of novel architectures. △ Less

Submitted 15 June, 2020; v1 submitted 31 January, 2020; originally announced February 2020.

Comments: 18+18 pages, 4 figures, to appear in Proceedings of Machine Learning Research Vol. 107, 2020, 1st Annual Conference on Mathematical and Scientific Machine Learning

arXiv:1811.08673 [pdf, ps, other]

On the Proximity of Markets with Integral Equilibria

Authors: Siddharth Barman, Sanath Kumar Krishnamurthy

Abstract: We study Fisher markets that admit equilibria wherein each good is integrally assigned to some agent. While strong existence and computational guarantees are known for equilibria of Fisher markets with additive valuations, such equilibria, in general, assign goods fractionally to agents. Hence, Fisher markets are not directly applicable in the context of indivisible goods. In this work we show tha… ▽ More We study Fisher markets that admit equilibria wherein each good is integrally assigned to some agent. While strong existence and computational guarantees are known for equilibria of Fisher markets with additive valuations, such equilibria, in general, assign goods fractionally to agents. Hence, Fisher markets are not directly applicable in the context of indivisible goods. In this work we show that one can always bypass this hurdle and, up to a bounded change in agents' budgets, obtain markets that admit an integral equilibrium. We refer to such markets as pure markets and show that, for any given Fisher market (with additive valuations), one can efficiently compute a "near-by," pure market with an accompanying integral equilibrium. Our work on pure markets leads to novel algorithmic results for fair division of indivisible goods. Prior work in discrete fair division has shown that, under additive valuations, there always exist allocations that simultaneously achieve the seemingly incompatible properties of fairness and efficiency; here fairness refers to envy-freeness up to one good (EF1) and efficiency corresponds to Pareto efficiency. However, polynomial-time algorithms are not known for finding such allocations. Considering relaxations of proportionality and EF1, respectively, as our notions of fairness, we show that fair and Pareto efficient allocations can be computed in strongly polynomial time. △ Less

Submitted 21 November, 2018; originally announced November 2018.

Comments: 17 pages

arXiv:1801.09046 [pdf, ps, other]

Greedy Algorithms for Maximizing Nash Social Welfare

Authors: Siddharth Barman, Sanath Kumar Krishnamurthy, Rohit Vaish

Abstract: We study the problem of fairly allocating a set of indivisible goods among agents with additive valuations. The extent of fairness of an allocation is measured by its Nash social welfare, which is the geometric mean of the valuations of the agents for their bundles. While the problem of maximizing Nash social welfare is known to be APX-hard in general, we study the effectiveness of simple, greedy… ▽ More We study the problem of fairly allocating a set of indivisible goods among agents with additive valuations. The extent of fairness of an allocation is measured by its Nash social welfare, which is the geometric mean of the valuations of the agents for their bundles. While the problem of maximizing Nash social welfare is known to be APX-hard in general, we study the effectiveness of simple, greedy algorithms in solving this problem in two interesting special cases. First, we show that a simple, greedy algorithm provides a 1.061-approximation guarantee when agents have identical valuations, even though the problem of maximizing Nash social welfare remains NP-hard for this setting. Second, we show that when agents have binary valuations over the goods, an exact solution (i.e., a Nash optimal allocation) can be found in polynomial time via a greedy algorithm. Our results in the binary setting extend to provide novel, exact algorithms for optimizing Nash social welfare under concave valuations. Notably, for the above mentioned scenarios, our techniques provide a simple alternative to several of the existing, more sophisticated techniques for this problem such as constructing equilibria of Fisher markets or using real stable polynomials. △ Less

Submitted 27 January, 2018; originally announced January 2018.

Comments: 13 pages

arXiv:1711.07621 [pdf, other]

Groupwise Maximin Fair Allocation of Indivisible Goods

Authors: Siddharth Barman, Arpita Biswas, Sanath Kumar Krishnamurthy, Y. Narahari

Abstract: We study the problem of allocating indivisible goods among n agents in a fair manner. For this problem, maximin share (MMS) is a well-studied solution concept which provides a fairness threshold. Specifically, maximin share is defined as the minimum utility that an agent can guarantee for herself when asked to partition the set of goods into n bundles such that the remaining (n-1) agents pick thei… ▽ More We study the problem of allocating indivisible goods among n agents in a fair manner. For this problem, maximin share (MMS) is a well-studied solution concept which provides a fairness threshold. Specifically, maximin share is defined as the minimum utility that an agent can guarantee for herself when asked to partition the set of goods into n bundles such that the remaining (n-1) agents pick their bundles adversarially. An allocation is deemed to be fair if every agent gets a bundle whose valuation is at least her maximin share. Even though maximin shares provide a natural benchmark for fairness, it has its own drawbacks and, in particular, it is not sufficient to rule out unsatisfactory allocations. Motivated by these considerations, in this work we define a stronger notion of fairness, called groupwise maximin share guarantee (GMMS). In GMMS, we require that the maximin share guarantee is achieved not just with respect to the grand bundle, but also among all the subgroups of agents. Hence, this solution concept strengthens MMS and provides an ex-post fairness guarantee. We show that in specific settings, GMMS allocations always exist. We also establish the existence of approximate GMMS allocations under additive valuations, and develop a polynomial-time algorithm to find such allocations. Moreover, we establish a scale of fairness wherein we show that GMMS implies approximate envy freeness. Finally, we empirically demonstrate the existence of GMMS allocations in a large set of randomly generated instances. For the same set of instances, we additionally show that our algorithm achieves an approximation factor better than the established, worst-case bound. △ Less

Submitted 20 November, 2017; originally announced November 2017.

Comments: 19 pages

arXiv:1707.04731 [pdf, ps, other]

Finding Fair and Efficient Allocations

Authors: Siddharth Barman, Sanath Kumar Krishnamurthy, Rohit Vaish

Abstract: We study the problem of allocating a set of indivisible goods among a set of agents in a fair and efficient manner. An allocation is said to be fair if it is envy-free up to one good (EF1), which means that each agent prefers its own bundle over the bundle of any other agent up to the removal of one good. In addition, an allocation is deemed efficient if it satisfies Pareto optimality (PO). While… ▽ More We study the problem of allocating a set of indivisible goods among a set of agents in a fair and efficient manner. An allocation is said to be fair if it is envy-free up to one good (EF1), which means that each agent prefers its own bundle over the bundle of any other agent up to the removal of one good. In addition, an allocation is deemed efficient if it satisfies Pareto optimality (PO). While each of these well-studied properties is easy to achieve separately, achieving them together is far from obvious. Recently, Caragiannis et al. (2016) established the surprising result that when agents have additive valuations for the goods, there always exists an allocation that simultaneously satisfies these two seemingly incompatible properties. Specifically, they showed that an allocation that maximizes the Nash social welfare (NSW) objective is both EF1 and PO. However, the problem of maximizing NSW is NP-hard. As a result, this approach does not provide an efficient algorithm for finding a fair and efficient allocation. In this paper, we bypass this barrier, and develop a pseudopolynomial time algorithm for finding allocations that are EF1 and PO; in particular, when the valuations are bounded, our algorithm finds such an allocation in polynomial time. Furthermore, we establish a stronger existence result compared to Caragiannis et al. (2016): For additive valuations, there always exists an allocation that is EF1 and fractionally PO. Another contribution of our work is to show that our algorithm provides a polynomial-time 1.45-approximation to the NSW objective. This improves upon the best known approximation ratio for this problem (namely, the 2-approximation algorithm of Cole et al. (2017)). Unlike many of the existing approaches, our algorithm is completely combinatorial. △ Less

Submitted 11 May, 2018; v1 submitted 15 July, 2017; originally announced July 2017.

Comments: 40 pages. Updated version

arXiv:1707.01962 [pdf, other]

Disorder and the neural representation of complex odors: smelling in the real world

Authors: Kamesh Krishnamurthy, Ann M Hermundstad, Thierry Mora, Aleksandra M Walczak, Vijay Balasubramanian

Abstract: Animals smelling in the real world use a small number of receptors to sense a vast number of natural molecular mixtures, and proceed to learn arbitrary associations between odors and valences. Here, we propose a new interpretation of how the architecture of olfactory circuits is adapted to meet these immense complementary challenges. First, the diffuse binding of receptors to many molecules compre… ▽ More Animals smelling in the real world use a small number of receptors to sense a vast number of natural molecular mixtures, and proceed to learn arbitrary associations between odors and valences. Here, we propose a new interpretation of how the architecture of olfactory circuits is adapted to meet these immense complementary challenges. First, the diffuse binding of receptors to many molecules compresses a vast odor space into a tiny receptor space, while preserving similarity. Next, lateral interactions "densify" and decorrelate the response, enhancing robustness to noise. Finally, disordered projections from the periphery to the central brain reconfigure the densely packed information into a format suitable for flexible learning of associations and valences. We test our theory empirically using data from Drosophila. Our theory suggests that the neural processing of olfactory information differs from the other senses in its fundamental use of disorder. △ Less

Submitted 6 July, 2017; originally announced July 2017.

arXiv:1703.01851 [pdf, other]

Approximation Algorithms for Maximin Fair Division

Authors: Siddharth Barman, Sanath Kumar Krishnamurthy

Abstract: We consider the problem of allocating indivisible goods fairly among n agents who have additive and submodular valuations for the goods. Our fairness guarantees are in terms of the maximin share, that is defined to be the maximum value that an agent can ensure for herself, if she were to partition the goods into n bundles, and then receive a minimum valued bundle. Since maximin fair allocations (i… ▽ More We consider the problem of allocating indivisible goods fairly among n agents who have additive and submodular valuations for the goods. Our fairness guarantees are in terms of the maximin share, that is defined to be the maximum value that an agent can ensure for herself, if she were to partition the goods into n bundles, and then receive a minimum valued bundle. Since maximin fair allocations (i.e., allocations in which each agent gets at least her maximin share) do not always exist, prior work has focused on approximation results that aim to find allocations in which the value of the bundle allocated to each agent is (multiplicatively) as close to her maximin share as possible. In particular, Procaccia and Wang (2014) along with Amanatidis et al. (2015) have shown that under additive valuations a 2/3-approximate maximin fair allocation always exists and can be found in polynomial time. We complement these results by develo** a simple and efficient algorithm that achieves the same approximation guarantee. Furthermore, we initiate the study of approximate maximin fair division under submodular valuations. Specifically, we show that when the valuations of the agents are nonnegative, monotone, and submodular, then a 0.21-approximate maximin fair allocation is guaranteed to exist. In fact, we show that such an allocation can be efficiently found by using a simple round-robin algorithm. A technical contribution of the paper is to analyze the performance of this combinatorial algorithm by employing the concept of multilinear extensions. △ Less

Submitted 6 April, 2020; v1 submitted 6 March, 2017; originally announced March 2017.

Comments: 35 pages

arXiv:1603.06400 [pdf, ps, other]

Joint System and Algorithm Design for Computationally Efficient Fan Beam Coded Aperture X-ray Coherent Scatter Imaging

Authors: Ikenna Odinaka, Joseph A. O'Sullivan, David G. Politte, Kenneth P. MacCabe, Yan Kaganovsky, Joel A. Greenberg, Manu Lakshmanan, Kalyani Krishnamurthy, Anuj Kapadia, Lawrence Carin, David J. Brady

Abstract: In x-ray coherent scatter tomography, tomographic measurements of the forward scatter distribution are used to infer scatter densities within a volume. A radiopaque 2D pattern placed between the object and the detector array enables the disambiguation between different scatter events. The use of a fan beam source illumination to speed up data acquisition relative to a pencil beam presents computat… ▽ More In x-ray coherent scatter tomography, tomographic measurements of the forward scatter distribution are used to infer scatter densities within a volume. A radiopaque 2D pattern placed between the object and the detector array enables the disambiguation between different scatter events. The use of a fan beam source illumination to speed up data acquisition relative to a pencil beam presents computational challenges. To facilitate the use of iterative algorithms based on a penalized Poisson log-likelihood function, efficient computational implementation of the forward and backward models are needed. Our proposed implementation exploits physical symmetries and structural properties of the system and suggests a joint system-algorithm design, where the system design choices are influenced by computational considerations, and in turn lead to reduced reconstruction time. Computational-time speedups of approximately 146 and 32 are achieved in the computation of the forward and backward models, respectively. Results validating the forward model and reconstruction algorithm are presented on simulated analytic and Monte Carlo data. △ Less

Submitted 29 January, 2016; originally announced March 2016.

Comments: This paper has been submitted to IEEE Transactions on Computational Imaging for consideration. 18 pages, 6 figures

arXiv:1602.07577 [pdf, other]

Interfacial and morphological features of a twist-bend nematic drop

Authors: Kanakapura S. Krishnamurthy, Pramoda Kumar, Nani B. Palakurthy, Channabasaveshwar V. Yelamaggad, Epifanio G. Virga

Abstract: In this experimental and theoretical study, we examine the equilibrium shapes of quasitwo-dimensional twist-bend nematic (Ntb) drops formed within a planarly aligned nematic layer of the liquid crystal CB7CB. Initially, at the setting point of the Ntb phase, the drops assume a nonequilibrium cusped elliptical geometry with the major axis orthogonal to the director of the surrounding nematic fluid;… ▽ More In this experimental and theoretical study, we examine the equilibrium shapes of quasitwo-dimensional twist-bend nematic (Ntb) drops formed within a planarly aligned nematic layer of the liquid crystal CB7CB. Initially, at the setting point of the Ntb phase, the drops assume a nonequilibrium cusped elliptical geometry with the major axis orthogonal to the director of the surrounding nematic fluid; this growth is governed principally by anisotropic heat diffusion. The drops attain equilibrium through thermally driven dynamical evolutions close to their melting temperature. They are associated with a characteristic twin-striped morphology that transforms into the familiar focal conic texture as the temperature is lowered. At equilibrium, large millimetric drops are tactoidlike, elongated along the director of the surrounding nematic fluid. This geometry is explained by a mathematical model that features two dimensionless parameters, of which one is the structural cone angle of the Ntb phase and the other, the relative strength of mismatch elastic energy at the drop's interface. Both parameters are extracted from the observations by measuring the aspect ratio of the equilibrium shapes and the inner corner angle of the cusps. △ Less

Submitted 24 February, 2016; originally announced February 2016.

arXiv:1406.2287 [pdf, ps, other]

doi 10.1088/2041-8205/790/2/L18

Nonuniform Expansion of the Youngest Galactic Supernova Remnant G1.9+0.3

Authors: K. J. Borkowski, S. P. Reynolds, D. A. Green, U. Hwang, R. Petre, K. Krishnamurthy, R. Willett

Abstract: We report measurements of X-ray expansion of the youngest Galactic supernova remnant, G1.9+0.3, using Chandra observations in 2007, 2009, and 2011. The measured rates strongly deviate from uniform expansion, decreasing radially by about 60% along the X-ray bright SE-NW axis from 0.84% +/- 0.06% per yr to 0.52% +/- 0.03% per yr. This corresponds to undecelerated ages of 120-190 yr, confirming the y… ▽ More We report measurements of X-ray expansion of the youngest Galactic supernova remnant, G1.9+0.3, using Chandra observations in 2007, 2009, and 2011. The measured rates strongly deviate from uniform expansion, decreasing radially by about 60% along the X-ray bright SE-NW axis from 0.84% +/- 0.06% per yr to 0.52% +/- 0.03% per yr. This corresponds to undecelerated ages of 120-190 yr, confirming the young age of G1.9+0.3, and implying a significant deceleration of the blast wave. The synchrotron-dominated X-ray emission brightens at a rate of 1.9% +/- 0.4% per yr. We identify bright outer and inner rims with the blast wave and reverse shock, respectively. Sharp density gradients in either ejecta or ambient medium are required to produce the sudden deceleration of the reverse shock or the blast wave implied by the large spread in expansion ages. The blast wave could have been decelerated recently by an encounter with a modest density discontinuity in the ambient medium, such as found at a wind termination shock, requiring strong mass loss in the progenitor. Alternatively, the reverse shock might have encountered an order-of-magnitude density discontinuity within the ejecta, such as found in pulsating delayed-detonation Type Ia models. We demonstrate that the blast wave is much more decelerated than the reverse shock in these models for remnants at ages similar to G1.9+0.3. Similar effects may also be produced by dense shells possibly associated with high-velocity features in Type Ia spectra. Accounting for the asymmetry of G1.9+0.3 will require more realistic 3D Type Ia models. △ Less

Submitted 1 July, 2014; v1 submitted 9 June, 2014; originally announced June 2014.

Comments: 6 pages, 4 figures, accepted for publication in ApJ Letters, minor revisions

arXiv:1305.7399 [pdf, ps, other]

doi 10.1088/2041-8205/771/1/L9

Supernova Ejecta in the Youngest Galactic Supernova Remnant G1.9+0.3

Authors: K. J. Borkowski, S. P. Reynolds, U. Hwang, D. A. Green, R. Petre, K. Krishnamurthy, R. Willett

Abstract: G1.9+0.3 is the youngest known Galactic supernova remnant (SNR), with an estimated supernova (SN) explosion date of about 1900, and most likely located near the Galactic Center. Only the outermost ejecta layers with free-expansion velocities larger than about 18,000 km/s have been shocked so far in this dynamically young, likely Type Ia SNR. A long (980 ks) Chandra observation in 2011 allowed spat… ▽ More G1.9+0.3 is the youngest known Galactic supernova remnant (SNR), with an estimated supernova (SN) explosion date of about 1900, and most likely located near the Galactic Center. Only the outermost ejecta layers with free-expansion velocities larger than about 18,000 km/s have been shocked so far in this dynamically young, likely Type Ia SNR. A long (980 ks) Chandra observation in 2011 allowed spatially-resolved spectroscopy of heavy-element ejecta. We denoised Chandra data with the spatio-spectral method of Krishnamurthy et al., and used a wavelet-based technique to spatially localize thermal emission produced by intermediate-mass elements (IMEs: Si and S) and iron. The spatial distribution of both IMEs and Fe is extremely asymmetric, with the strongest ejecta emission in the northern rim. Fe Kalpha emission is particularly prominent there, and fits with thermal models indicate strongly oversolar Fe abundances. In a localized, outlying region in the northern rim, IMEs are less abundant than Fe, indicating that undiluted Fe-group elements (including 56Ni) with velocities larger than 18,000 km/s were ejected by this SN. But in the inner west rim, we find Si- and S-rich ejecta without any traces of Fe, so high-velocity products of O-burning were also ejected. G1.9+0.3 appears similar to energetic Type Ia SNe such as SN 2010jn where iron-group elements at such high free-expansion velocities have been recently detected. The pronounced asymmetry in the ejecta distribution and abundance inhomogeneities are best explained by a strongly asymmetric SN explosion, similar to those produced in some recent 3D delayed-detonation Type Ia models. △ Less

Submitted 31 May, 2013; originally announced May 2013.

Comments: 6 pages, 3 figures, submitted to ApJ Letters

arXiv:1209.3990 [pdf, other]

doi 10.1137/120891927

Level set estimation from projection measurements: Performance guarantees and fast computation

Authors: Kalyani Krishnamurthy, Waheed U. Bajwa, Rebecca Willett

Abstract: Estimation of the level set of a function (i.e., regions where the function exceeds some value) is an important problem with applications in digital elevation map**, medical imaging, astronomy, etc. In many applications, the function of interest is not observed directly. Rather, it is acquired through (linear) projection measurements, such as tomographic projections, interferometric measurements… ▽ More Estimation of the level set of a function (i.e., regions where the function exceeds some value) is an important problem with applications in digital elevation map**, medical imaging, astronomy, etc. In many applications, the function of interest is not observed directly. Rather, it is acquired through (linear) projection measurements, such as tomographic projections, interferometric measurements, coded-aperture measurements, and random projections associated with compressed sensing. This paper describes a new methodology for rapid and accurate estimation of the level set from such projection measurements. The key defining characteristic of the proposed method, called the projective level set estimator, is its ability to estimate the level set from projection measurements without an intermediate reconstruction step. This leads to significantly faster computation relative to heuristic "plug-in" methods that first estimate the function, typically with an iterative algorithm, and then threshold the result. The paper also includes a rigorous theoretical analysis of the proposed method, which utilizes the recent results from the non-asymptotic theory of random matrices results from the literature on concentration of measure and characterizes the estimator's performance in terms of geometry of the measurement operator and 1-norm of the discretized function. △ Less

Submitted 2 May, 2013; v1 submitted 18 September, 2012; originally announced September 2012.

Comments: 23 pages, 20 figures

MSC Class: 62; 68

Journal ref: SIAM J. Imaging Sciences, vol. 6, no. 4, pp. 2047-2074, Oct. 2013

arXiv:1112.0504 [pdf, other]

doi 10.1186/1687-6180-2012-205

Target Detection Performance Bounds in Compressive Imaging

Authors: Kalyani Krishnamurthy, Rebecca Willett, Maxim Raginsky

Abstract: This paper describes computationally efficient approaches and associated theoretical performance guarantees for the detection of known targets and anomalies from few projection measurements of the underlying signals. The proposed approaches accommodate signals of different strengths contaminated by a colored Gaussian background, and perform detection without reconstructing the underlying signals f… ▽ More This paper describes computationally efficient approaches and associated theoretical performance guarantees for the detection of known targets and anomalies from few projection measurements of the underlying signals. The proposed approaches accommodate signals of different strengths contaminated by a colored Gaussian background, and perform detection without reconstructing the underlying signals from the observations. The theoretical performance bounds of the target detector highlight fundamental tradeoffs among the number of measurements collected, amount of background signal present, signal-to-noise ratio, and similarity among potential targets coming from a known dictionary. The anomaly detector is designed to control the number of false discoveries. The proposed approach does not depend on a known sparse representation of targets; rather, the theoretical performance bounds exploit the structure of a known dictionary of targets and the distance preservation property of the measurement matrix. Simulation experiments illustrate the practicality and effectiveness of the proposed approaches. △ Less

Submitted 14 August, 2012; v1 submitted 2 December, 2011; originally announced December 2011.

Comments: Submitted to the EURASIP Journal on Advances in Signal Processing

arXiv:1106.4498 [pdf, ps, other]

doi 10.1088/2041-8205/737/1/L22

Expansion of the Youngest Galactic Supernova Remnant G1.9+0.3

Authors: A. K. Carlton, K. J. Borkowski, S. P. Reynolds, U. Hwang, R. Petre, D. A. Green, K. Krishnamurthy, R. Willett

Abstract: We present a measurement of the expansion and brightening of G1.9+0.3, the youngest Galactic supernova remnant, comparing Chandra X-ray images obtained in 2007 and 2009. A simple uniform expansion model describes the data well, giving an expansion rate of 0.642 +/- 0.049 % yr^-1, and a flux increase of 1.7 +/- 1.0 % yr^-1. Without deceleration, the remnant age would then be 156 +/- 11 yr, consiste… ▽ More We present a measurement of the expansion and brightening of G1.9+0.3, the youngest Galactic supernova remnant, comparing Chandra X-ray images obtained in 2007 and 2009. A simple uniform expansion model describes the data well, giving an expansion rate of 0.642 +/- 0.049 % yr^-1, and a flux increase of 1.7 +/- 1.0 % yr^-1. Without deceleration, the remnant age would then be 156 +/- 11 yr, consistent with earlier results. Since deceleration must have occurred, this age is an upper limit; we estimate an age of about 110 yr, or an explosion date of about 1900. The flux increase is comparable to reported increases at radio wavelengths. G1.9+0.3 is the only Galactic supernova remnant increasing in flux, with implications for the physics of electron acceleration in shock waves △ Less

Submitted 22 June, 2011; originally announced June 2011.

Comments: 14 pages, 3 figures. Accepted by ApJ (Letters)

arXiv:1006.3552 [pdf, other]

doi 10.1088/2041-8205/724/2/L161

Radioactive Scandium in the Youngest Galactic Supernova Remnant G1.9+0.3

Authors: Kazimierz J. Borkowski, Stephen P. Reynolds, David A. Green, Una Hwang, Robert Petre, Kalyani Krishnamurthy, Rebecca Willett

Abstract: We report the discovery of thermal X-ray emission from the youngest Galactic supernova remnant G1.9+0.3, from a 237-ks Chandra observation. We detect strong K-shell lines of Si, S, Ar, Ca, and Fe. In addition, we detect a 4.1 keV line with 99.971% confidence which we attribute to 44Sc, produced by electron capture from 44Ti. Combining the data with our earlier Chandra observation allows us to dete… ▽ More We report the discovery of thermal X-ray emission from the youngest Galactic supernova remnant G1.9+0.3, from a 237-ks Chandra observation. We detect strong K-shell lines of Si, S, Ar, Ca, and Fe. In addition, we detect a 4.1 keV line with 99.971% confidence which we attribute to 44Sc, produced by electron capture from 44Ti. Combining the data with our earlier Chandra observation allows us to detect the line in two regions independently. For a remnant age of 100 yr, our measured total line strength indicates synthesis of $(1 - 7) \times 10^{-5}$ solar masses of 44Ti, in the range predicted for both Type Ia and core-collapse supernovae, but somewhat smaller than the $2 \times 10^{-4}$ solar masses reported for Cas A. The line spectrum indicates supersolar abundances. The Fe emission has a width of about 28,000 km/s, consistent with an age of about 100 yr and with the inferred mean shock velocity of 14,000 km/s deduced assuming a distance of 8.5 kpc. Most thermal emission comes from regions of lower X-ray but higher radio surface brightness. Deeper observations should allow more detailed spatial map** of scandium, with significant implications for models of nucleosynthesis in Type Ia supernovae. △ Less

Submitted 21 October, 2010; v1 submitted 17 June, 2010; originally announced June 2010.

Comments: 5 pages, 3 figures, accepted for publication in ApJL

Showing 1–40 of 40 results for author: Krishnamurthy, K