Showing 101–150 of 835 results for author: Krishnamurthy

  1. arXiv:2308.03754  [pdf, other]

    cond-mat.dis-nn cond-mat.stat-mech

    High-Dimensional Non-Convex Landscapes and Gradient Descent Dynamics

    Authors: Tony Bonnaire, Davide Ghio, Kamesh Krishnamurthy, Francesca Mignacco, Atsushi Yamamura, Giulio Biroli

    Abstract: In these lecture notes we present different methods and concepts developed in statistical physics to analyze gradient descent dynamics in high-dimensional non-convex landscapes. Our aim is to show how approaches developed in physics, mainly statistical physics of disordered systems, can be used to tackle open questions on high-dimensional dynamics in Machine Learning.

    Submitted 10 November, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: Lectures given by G. Biroli at the 2022 Les Houches Summer School "Statistical Physics and Machine Learning"

  2. arXiv:2307.15157  [pdf, other]

    cs.CV cs.LG eess.IV

    R-LPIPS: An Adversarially Robust Perceptual Similarity Metric

    Authors: Sara Ghazanfari, Siddharth Garg, Prashanth Krishnamurthy, Farshad Khorrami, Alexandre Araujo

    Abstract: Similarity metrics have played a significant role in computer vision to capture the underlying semantics of images. In recent years, advanced similarity metrics, such as the Learned Perceptual Image Patch Similarity (LPIPS), have emerged. These metrics leverage deep features extracted from trained neural networks and have demonstrated a remarkable ability to closely align with human perception whe…

    Submitted 31 July, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

  3. arXiv:2307.12093  [pdf, other]

    cond-mat.str-el cond-mat.mtrl-sci

    Spinon continuum in the Heisenberg quantum chain compound Sr$_2$V$_3$O$_9$

    Authors: Shang Gao, Ling-Fang Lin, Pontus Laurell, Qiang Chen, Qing Huang, Clarina dela Cruz, Krishnamurthy V. Vemuru, Mark D. Lumsden, Stephen E. Nagler, Gonzalo Alvarez, Elbio Dagotto, Haidong Zhou, Andrew D. Christianson, Matthew B. Stone

    Abstract: Magnetic excitations in the spin chain candidate Sr$_2$V$_3$O$_9$ have been investigated by inelastic neutron scattering on a single crystal sample. A spinon continuum with a bandwidth of $\sim22$ meV is observed along the chain formed by alternating magnetic V$^{4+}$ and nonmagnetic V$^{5+}$ ions. Incipient magnetic Bragg peaks due to weak ferromagnetic interchain couplings emerge when approachin…

    Submitted 25 July, 2023; v1 submitted 22 July, 2023; originally announced July 2023.

    Comments: 19 pages, 9 figures

    Journal ref: Phys. Rev. B 109, L020402 (2024)

  4. arXiv:2307.09901  [pdf, other]

    eess.SY

    Using Circulation to Mitigate Spurious Equilibria in Control Barrier Function -- Extended Version

    Authors: Vinicius Mariano Goncalves, Prashanth Krishnamurthy, Anthony Tzes, Farshad Khorrami

    Abstract: Control Barrier Functions and Quadratic Programming are increasingly used for designing controllers that consider critical safety constraints. However, like Artificial Potential Fields, they can suffer from the stable spurious equilibrium point problem, which can result in the controller failing to reach the goal. To address this issue, we propose introducing circulation inequalities as a constrai…

    Submitted 19 July, 2023; originally announced July 2023.

  5. arXiv:2307.06398  [pdf, other]

    cs.LG q-bio.NC

    Trainability, Expressivity and Interpretability in Gated Neural ODEs

    Authors: Timothy Doyeon Kim, Tankut Can, Kamesh Krishnamurthy

    Abstract: Understanding how the dynamics in biological and artificial neural networks implement the computations required for a task is a salient open question in machine learning and neuroscience. In particular, computations requiring complex memory storage and retrieval pose a significant challenge for these networks to implement or learn. Recently, a family of models described by neural ordinary differen…

    Submitted 12 July, 2023; originally announced July 2023.

  6. arXiv:2307.05422  [pdf, other]

    cs.CR cs.LG

    Differential Analysis of Triggers and Benign Features for Black-Box DNN Backdoor Detection

    Authors: Hao Fu, Prashanth Krishnamurthy, Siddharth Garg, Farshad Khorrami

    Abstract: This paper proposes a data-efficient detection method for deep neural networks against backdoor attacks under a black-box scenario. The proposed approach is motivated by the intuition that features corresponding to triggers have a higher influence in determining the backdoored network output than any other benign features. To quantitatively measure the effects of triggers and benign features on de…

    Submitted 14 July, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

    Comments: Published in the IEEE Transactions on Information Forensics and Security

    Journal ref: IEEE Transactions on Information Forensics and Security 2023

  7. arXiv:2307.04392  [pdf, other]

    cs.CV

    FODVid: Flow-guided Object Discovery in Videos

    Authors: Silky Singh, Shripad Deshmukh, Mausoom Sarkar, Rishabh Jain, Mayur Hemani, Balaji Krishnamurthy

    Abstract: Segmentation of objects in a video is challenging due to the nuances such as motion blurring, parallax, occlusions, changes in illumination, etc. Instead of addressing these nuances separately, we focus on building a generalizable solution that avoids overfitting to the individual intricacies. Such a solution would also help us save enormous resources involved in human annotation of video corpora.…

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: CVPR 2023 (L3D-IVU workshop)

  8. arXiv:2307.03181  [pdf, ps, other]

    cs.GT econ.TH

    Markov Persuasion Processes with Endogenous Agent Beliefs

    Authors: Krishnamurthy Iyer, Haifeng Xu, You Zu

    Abstract: We consider a dynamic Bayesian persuasion setting where a single long-lived sender persuades a stream of ``short-lived'' agents (receivers) by sharing information about a payoff-relevant state. The state transitions are Markovian and the sender seeks to maximize the long-run average reward by committing to a (possibly history-dependent) signaling mechanism. While most previous studies of Markov pe…

    Submitted 13 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Minor revisions

    MSC Class: 91A28; 90C40; 60J20 ACM Class: F.2; G.3

  9. arXiv:2307.02108  [pdf, other]

    cs.LG stat.ML

    Proportional Response: Contextual Bandits for Simple and Cumulative Regret Minimization

    Authors: Sanath Kumar Krishnamurthy, Ruohan Zhan, Susan Athey, Emma Brunskill

    Abstract: In many applications, e.g. in healthcare and e-commerce, the goal of a contextual bandit may be to learn an optimal treatment assignment policy at the end of the experiment. That is, to minimize simple regret. However, this objective remains understudied. We propose a new family of computationally efficient bandit algorithms for the stochastic contextual bandit setting, where a tuning parameter de…

    Submitted 2 November, 2023; v1 submitted 5 July, 2023; originally announced July 2023.

  10. arXiv:2307.01479  [pdf, other]

    math.NA

    Optimal Surrogate Boundary Selection and Scalability Studies for the Shifted Boundary Method on Octree Meshes

    Authors: Cheng-Hau Yang, Kumar Saurabh, Guglielmo Scovazzi, Claudio Canuto, Adarsh Krishnamurthy, Baskar Ganapathysubramanian

    Abstract: The accurate and efficient simulation of Partial Differential Equations (PDEs) in and around arbitrarily defined geometries is critical for many application domains. Immersed boundary methods (IBMs) alleviate the usually laborious and time-consuming process of creating body-fitted meshes around complex geometry models (described by CAD or other representations, e.g., STL, point clouds), especially…

    Submitted 4 July, 2023; originally announced July 2023.

  11. Screening Mixed-Metal Sn$_2$M(III)Ch$_2$X$_3$ Chalcohalides for Photovoltaic Applications

    Authors: Pascal Henkel, Jingrui Li, G. Krishnamurthy Grandhi, Paola Vivo, Patrick Rinke

    Abstract: Quaternary mixed-metal chalcohalides (Sn$_2$BCh$_2$X$_3$) are emerging as promising lead-free perovskite-inspired photovoltaic absorbers. Motivated by recent developments of a first Sn$_2$BCh$_2$X$_3$-based device, we used density functional theory to identify lead-free Sn$_2$BCh$_2$X$_3$ materials that are structurally and energetically stable within Cmcm, Cmc2$_1$ and P2$_1$/c space groups and h…

    Submitted 12 September, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

  12. arXiv:2306.16503  [pdf, other]

    cs.LG cs.AI

    SARC: Soft Actor Retrospective Critic

    Authors: Sukriti Verma, Ayush Chopra, Jayakumar Subramanian, Mausoom Sarkar, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy

    Abstract: The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not converged for the actor at any given time, but since the critic learns faster than the actor, it ensures eventual consistency between the two. Various strategies have been introduced in literature to learn better gradient estimates to help achieve better convergence.…

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Accepted at RLDM 2022

  13. arXiv:2306.10740  [pdf, other]

    math.NA

    A semi-implicit finite volume scheme for dissipative measure-valued solutions to the barotropic Euler system

    Authors: K. R. Arun, Amogh Krishnamurthy

    Abstract: A semi-implicit in time, entropy stable finite volume scheme for the compressible barotropic Euler system is designed and analyzed and its weak convergence to a dissipative measure-valued (DMV) solution [E. Feireisl et al., Dissipative measure-valued solutions to the compressible Navier-Stokes system, Calc. Var. Partial Differential Equations, 2016] of the Euler system is shown. The entropy stabil…

    Submitted 6 December, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    MSC Class: Primary 35L45; 35L60; 35L65; 35L67; Secondary 35D99; 35R06; 65M08

  14. arXiv:2306.08424  [pdf, other]

    cs.HC cs.AI cs.LG

    Selective Concept Models: Permitting Stakeholder Customisation at Test-Time

    Authors: Matthew Barker, Katherine M. Collins, Krishnamurthy Dvijotham, Adrian Weller, Umang Bhatt

    Abstract: Concept-based models perform prediction using a set of concepts that are interpretable to stakeholders. However, such models often involve a fixed, large number of concepts, which may place a substantial cognitive load on stakeholders. We propose Selective COncept Models (SCOMs) which make predictions using only a subset of concepts and can be customised by stakeholders at test-time according to t…

    Submitted 14 June, 2023; originally announced June 2023.

  15. arXiv:2306.08183  [pdf, other]

    cs.CV

    ZeroForge: Feedforward Text-to-Shape Without 3D Supervision

    Authors: Kelly O. Marshall, Minh Pham, Ameya Joshi, Anushrut Jignasu, Aditya Balu, Adarsh Krishnamurthy, Chinmay Hegde

    Abstract: Current state-of-the-art methods for text-to-shape generation either require supervised training using a labeled dataset of pre-defined 3D shapes, or perform expensive inference-time optimization of implicit neural representations. In this work, we present ZeroForge, an approach for zero-shot text-to-shape generation that avoids both pitfalls. To achieve open-vocabulary shape generation, we requir…

    Submitted 15 June, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

    Comments: 19 pages, High resolution figures needed to demonstrate 3D results

  16. arXiv:2306.07923  [pdf, other]

    cs.LG

    Oracle-Efficient Pessimism: Offline Policy Optimization in Contextual Bandits

    Authors: Lequn Wang, Akshay Krishnamurthy, Aleksandrs Slivkins

    Abstract: We consider offline policy optimization (OPO) in contextual bandits, where one is given a fixed dataset of logged interactions. While pessimistic regularizers are typically used to mitigate distribution shift, prior implementations thereof are either specialized or computationally inefficient. We present the first general oracle-efficient algorithm for pessimistic OPO: it reduces to supervised lea…

    Submitted 25 October, 2023; v1 submitted 13 June, 2023; originally announced June 2023.

  17. arXiv:2306.04431  [pdf, other]

    cs.LG

    Faithful Knowledge Distillation

    Authors: Tom A. Lamb, Rudy Bunel, Krishnamurthy DJ Dvijotham, M. Pawan Kumar, Philip H. S. Torr, Francisco Eiras

    Abstract: Knowledge distillation (KD) has received much attention due to its success in compressing networks to allow for their deployment in resource-constrained systems. While the problem of adversarial robustness has been studied before in the KD setting, previous works overlook what we term the relative calibration of the student network with respect to its teacher in terms of soft confidences. In parti…

    Submitted 11 August, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 7pgs (main content), 4 figures

  18. arXiv:2306.00946  [pdf, other]

    cs.LG cs.CL

    Exposing Attention Glitches with Flip-Flop Language Modeling

    Authors: Bingbin Liu, Jordan T. Ash, Surbhi Goel, Akshay Krishnamurthy, Cyril Zhang

    Abstract: Why do large language models sometimes output factual inaccuracies and exhibit erroneous reasoning? The brittleness of these models, particularly when executing long chains of reasoning, currently seems to be an inevitable price to pay for their advanced capabilities of coherently synthesizing knowledge, pragmatics, and abstract thought. Towards making sense of this fundamentally unsolved problem,…

    Submitted 30 October, 2023; v1 submitted 1 June, 2023; originally announced June 2023.

    Comments: v2: NeurIPS 2023 camera-ready + data release

  19. arXiv:2305.18461  [pdf, ps, other]

    cs.NI cs.DC cs.DM cs.LG

    Bandwidth Optimal Pipeline Schedule for Collective Communication

    Authors: Liangyu Zhao, Arvind Krishnamurthy

    Abstract: We present a strongly polynomial-time algorithm to generate bandwidth optimal allgather/reduce-scatter on any network topology, with or without switches. Our algorithm constructs pipeline schedules achieving provably the best possible bandwidth performance on a given topology. To provide a universal solution, we model the network topology as a directed graph with heterogeneous link capacities and…

    Submitted 31 May, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

  20. arXiv:2305.18393  [pdf, other]

    cs.LG cs.CR

    Training Private Models That Know What They Don't Know

    Authors: Stephan Rabanser, Anvith Thudi, Abhradeep Thakurta, Krishnamurthy Dvijotham, Nicolas Papernot

    Abstract: Training reliable deep learning models which avoid making overconfident but incorrect predictions is a longstanding challenge. This challenge is further exacerbated when learning has to be differentially private: protection provided to sensitive data comes at the price of injecting additional randomness into the learning process. In this work, we conduct a thorough empirical investigation of selec…

    Submitted 28 May, 2023; originally announced May 2023.

  21. arXiv:2305.13991  [pdf, other]

    cs.LG cs.CR stat.ML

    Expressive Losses for Verified Robustness via Convex Combinations

    Authors: Alessandro De Palma, Rudy Bunel, Krishnamurthy Dvijotham, M. Pawan Kumar, Robert Stanforth, Alessio Lomuscio

    Abstract: In order to train networks for verified adversarial robustness, it is common to over-approximate the worst-case loss over perturbation regions, resulting in networks that attain verifiability at the expense of standard performance. As shown in recent work, better trade-offs between accuracy and robustness can be obtained by carefully coupling adversarial training with over-approximations. We hypot…

    Submitted 18 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: ICLR 2024

  22. arXiv:2305.10621  [pdf, other]

    cs.NI cs.DC

    TSoR: TCP Socket over RDMA Container Network for Cloud Native Computing

    Authors: Yulin Sun, Qingming Qu, Chenxingyu Zhao, Arvind Krishnamurthy, Hong Chang, Ying Xiong

    Abstract: Cloud-native containerized applications constantly seek high-performance and easy-to-operate container network solutions. RDMA network is a potential enabler with higher throughput and lower latency than the standard TCP/IP network stack. However, several challenges remain in equipping containerized applications with RDMA network: 1) How to deliver transparent improvements without modifying applic…

    Submitted 17 May, 2023; originally announced May 2023.

  23. arXiv:2305.10157  [pdf, other]

    cs.LG math-ph

    Efficient Error Certification for Physics-Informed Neural Networks

    Authors: Francisco Eiras, Adel Bibi, Rudy Bunel, Krishnamurthy Dj Dvijotham, Philip Torr, M. Pawan Kumar

    Abstract: Recent work provides promising evidence that Physics-Informed Neural Networks (PINN) can efficiently solve partial differential equations (PDE). However, previous works have failed to provide guarantees on the worst-case residual error of a PINN across the spatio-temporal domain - a measure akin to the tolerance of numerical solvers - focusing instead on point-wise comparisons between their soluti…

    Submitted 29 May, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted to ICML'24

  24. arXiv:2305.09758  [pdf, other]

    cs.CV cs.CL

    A Video Is Worth 4096 Tokens: Verbalize Videos To Understand Them In Zero Shot

    Authors: Aanisha Bhattacharya, Yaman K Singla, Balaji Krishnamurthy, Rajiv Ratn Shah, Changyou Chen

    Abstract: Multimedia content, such as advertisements and story videos, exhibit a rich blend of creativity and multiple modalities. They incorporate elements like text, visuals, audio, and storytelling techniques, employing devices like emotions, symbolism, and slogans to convey meaning. There is a dearth of large annotated training datasets in the multimedia domain hindering the development of supervised le…

    Submitted 26 October, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted to EMNLP-23 TL;DR: Video understanding lags far behind NLP; LLMs excel in zero-shot. Our approach utilizes LLMs to verbalize videos, creating stories for zero-shot video understanding. This yields state-of-the-art results across five datasets, covering fifteen tasks

  25. arXiv:2305.09258  [pdf, other]

    cs.IR cs.CL

    HyHTM: Hyperbolic Geometry based Hierarchical Topic Models

    Authors: Simra Shahid, Tanay Anand, Nikitha Srikanth, Sumit Bhatia, Balaji Krishnamurthy, Nikaash Puri

    Abstract: Hierarchical Topic Models (HTMs) are useful for discovering topic hierarchies in a collection of documents. However, traditional HTMs often produce hierarchies where lower-level topics are unrelated and not specific enough to their higher-level topics. Additionally, these methods can be computationally expensive. We present HyHTM - a Hyperbolic geometry based Hierarchical Topic Models - that addres…

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: This paper is accepted in Findings of the Association for Computational Linguistics (2023)

  26. arXiv:2305.07120  [pdf, other]

    cs.GR

    Geometric Modeling and Physics Simulation Framework for Building a Digital Twin of Extrusion-based Additive Manufacturing

    Authors: Dhruv Gamdha, Kumar Saurabh, Baskar Ganapathysubramanian, Adarsh Krishnamurthy

    Abstract: Accurate simulation of the printing process is essential for improving print quality, reducing waste, and optimizing the printing parameters of extrusion-based additive manufacturing. Traditional additive manufacturing simulations are very compute-intensive and are not scalable to simulate even moderately-sized geometries. In this paper, we propose a general framework for creating a digital twin o…

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 13 pages

  27. arXiv:2305.06902  [pdf, other]

    cs.CR

    REMaQE: Reverse Engineering Math Equations from Executables

    Authors: Meet Udeshi, Prashanth Krishnamurthy, Hammond Pearce, Ramesh Karri, Farshad Khorrami

    Abstract: Cybersecurity attacks on embedded devices for industrial control systems and cyber-physical systems may cause catastrophic physical damage as well as economic loss. This could be achieved by infecting device binaries with malware that modifies the physical characteristics of the system operation. Mitigating such attacks benefits from reverse engineering tools that recover sufficient semantic knowl…

    Submitted 11 April, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

    ACM Class: C.3; D.2.5

  28. arXiv:2305.06677  [pdf, other]

    cs.CL cs.AI cs.LG

    INGENIOUS: Using Informative Data Subsets for Efficient Pre-Training of Language Models

    Authors: H S V N S Kowndinya Renduchintala, Krishnateja Killamsetty, Sumit Bhatia, Milan Aggarwal, Ganesh Ramakrishnan, Rishabh Iyer, Balaji Krishnamurthy

    Abstract: A salient characteristic of pre-trained language models (PTLMs) is a remarkable improvement in their generalization capability and emergence of new capabilities with increasing model capacity and pre-training dataset size. Consequently, we are witnessing the development of enormous models pushing the state-of-the-art. It is, however, imperative to realize that this inevitably leads to prohibitivel…

    Submitted 19 October, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  29. arXiv:2305.06499  [pdf, other]

    eess.SY

    State Constrained Stochastic Optimal Control for Continuous and Hybrid Dynamical Systems Using DFBSDE

    Authors: Bolun Dai, Prashanth Krishnamurthy, Andrew Papanicolaou, Farshad Khorrami

    Abstract: We develop a computationally efficient learning-based forward-backward stochastic differential equations (FBSDE) controller for both continuous and hybrid dynamical (HD) systems subject to stochastic noise and state constraints. Solutions to stochastic optimal control (SOC) problems satisfy the Hamilton-Jacobi-Bellman (HJB) equation. Using current FBSDE-based solutions, the optimal control can be…

    Submitted 10 May, 2023; originally announced May 2023.

  30. arXiv:2305.05479  [pdf, other]

    cs.CR cs.DC eess.SP eess.SY

    Multiple-stopping time Sequential Detection for Energy Efficient Mining in Blockchain-Enabled IoT

    Authors: Anurag Gupta, Vikram Krishnamurthy

    Abstract: What are the optimal times for an Internet of Things (IoT) device to act as a blockchain miner? The aim is to minimize the energy consumed by low-power IoT devices that log their data into a secure (tamper-proof) distributed ledger. We formulate a multiple stopping time Bayesian sequential detection problem to address energy-efficient blockchain mining for IoT devices. The objective is to identify…

    Submitted 17 August, 2023; v1 submitted 9 May, 2023; originally announced May 2023.

  31. arXiv:2305.04073  [pdf, other]

    cs.AI cs.LG

    Explaining RL Decisions with Trajectories

    Authors: Shripad Vilasrao Deshmukh, Arpan Dasgupta, Balaji Krishnamurthy, Nan Jiang, Chirag Agarwal, Georgios Theocharous, Jayakumar Subramanian

    Abstract: Explanation is a key component for the adoption of reinforcement learning (RL) in many real-world decision-making problems. In the literature, the explanation is often provided by saliency attribution to the features of the RL agent's state. In this work, we propose a complementary approach to these explanations, particularly for offline RL, where we attribute the policy decisions of a trained RL…

    Submitted 22 January, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: Published at International Conference on Learning Representations (ICLR), 2023

  32. The mass determination of TOI-519 b: a close-in giant planet transiting a metal-rich mid-M dwarf

    Authors: Taiki Kagetani, Norio Narita, Tadahiro Kimura, Teruyuki Hirano, Masahiro Ikoma, Hiroyuki Tako Ishikawa, Steven Giacalone, Akihiko Fukui, Takanori Kodama, Rebecca Gore, Ashley Schroeder, Yasunori Hori, Kiyoe Kawauchi, Noriharu Watanabe, Mayuko Mori, Yujie Zou, Kai Ikuta, Vigneshwaran Krishnamurthy, Jon Zink, Kevin Hardegree-Ullman, Hiroki Harakawa, Tomoyuki Kudo, Takayuki Kotani, Takashi Kurokawa, Nobuhiko Kusakabe, et al. (11 additional authors not shown)

    Abstract: We report the mass determination of TOI-519 b, a transiting substellar object around a mid-M dwarf. We carried out radial velocity measurements using Subaru / InfraRed Doppler (IRD), revealing that TOI-519 b is a planet with a mass of $0.463^{+0.082}_{-0.088}~M_{\rm Jup}$. We also find that the host star is metal rich ($\rm [Fe/H] = 0.27 \pm 0.09$ dex) and has the lowest effective temperature (…

    Submitted 1 May, 2023; v1 submitted 28 April, 2023; originally announced April 2023.

    Comments: 10 pages, 5 figures. Accepted for publication in PASJ

  33. arXiv:2304.09125  [pdf, other]

    eess.SP

    Statistical Detection of Coordination in a Cognitive Radar Network through Inverse Multi-objective Optimization

    Authors: Luke Snow, Vikram Krishnamurthy

    Abstract: Consider a target being tracked by a cognitive radar network. If the target can intercept noisy radar emissions, how can it detect coordination in the radar network? By 'coordination' we mean that the radar emissions satisfy Pareto optimality with respect to multi-objective optimization over the objective functions of each radar and a constraint on total network power output. This paper provides a…

    Submitted 18 April, 2023; originally announced April 2023.

  34. arXiv:2304.09123  [pdf, other]

    cs.LG stat.ML

    Finite-Sample Bounds for Adaptive Inverse Reinforcement Learning using Passive Langevin Dynamics

    Authors: Luke Snow, Vikram Krishnamurthy

    Abstract: This paper provides a finite-sample analysis of a passive stochastic gradient Langevin dynamics algorithm (PSGLD) designed to achieve adaptive inverse reinforcement learning (IRL). By passive, we mean that the noisy gradients available to the PSGLD algorithm (inverse learning process) are evaluated at randomly chosen points by an external stochastic gradient algorithm (forward learner) that aims t…

    Submitted 27 September, 2023; v1 submitted 18 April, 2023; originally announced April 2023.

  35. Safe Navigation and Obstacle Avoidance Using Differentiable Optimization Based Control Barrier Functions

    Authors: Bolun Dai, Rooholla Khorrambakht, Prashanth Krishnamurthy, Vinícius Gonçalves, Anthony Tzes, Farshad Khorrami

    Abstract: Control barrier functions (CBFs) have been widely applied to safety-critical robotic applications. However, the construction of control barrier functions for robotic systems remains a challenging task. Recently, collision detection using differentiable optimization has provided a way to compute the minimum uniform scaling factor that results in an intersection between two convex shapes and to also…

    Submitted 21 November, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

  36. arXiv:2304.01720  [pdf, other]

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.str-el

    Higher-order Bragg gaps in the electronic band structure of bilayer graphene renormalized by recursive supermoiré potential

    Authors: Mohit Kumar Jat, Priya Tiwari, Robin Bajaj, Ishita Shitut, Shinjan Mandal, Kenji Watanabe, Takashi Taniguchi, H. R. Krishnamurthy, Manish Jain, Aveek Bid

    Abstract: This letter presents our findings on the recursive band gap engineering of chiral fermions in bilayer graphene doubly aligned with hBN. By utilizing two interfering moiré potentials, we generate a supermoiré pattern which renormalizes the electronic bands of the pristine bilayer graphene, resulting in higher-order fractal gaps even at very low energies. These Bragg gaps can be mapped using a uniqu…

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: 29 pages (including Supplementary Materials)

  37. arXiv:2303.16741  [pdf, other]

    cs.LG cs.SI

    Who You Play Affects How You Play: Predicting Sports Performance Using Graph Attention Networks With Temporal Convolution

    Authors: Rui Luo, Vikram Krishnamurthy

    Abstract: This study presents a novel deep learning method, called GATv2-GCN, for predicting player performance in sports. To construct a dynamic player interaction graph, we leverage player statistics and their interactions during gameplay. We use a graph attention network to capture the attention that each player pays to each other, allowing for more accurate modeling of the dynamic player interactions. T…

    Submitted 29 March, 2023; originally announced March 2023.

  38. arXiv:2303.15122  [pdf, other]

    cs.CV

    Parameter Efficient Local Implicit Image Function Network for Face Segmentation

    Authors: Mausoom Sarkar, Nikitha SR, Mayur Hemani, Rishabh Jain, Balaji Krishnamurthy

    Abstract: Face parsing is defined as the per-pixel labeling of images containing human faces. The labels are defined to identify key facial regions like eyes, lips, nose, hair, etc. In this work, we make use of the structural consistency of the human face to propose a lightweight face-parsing method using a Local Implicit Function network, FP-LIIF. We propose a simple architecture having a convolutional enc…

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: Accepted at CVPR 2023

  39. arXiv:2303.13588  [pdf, ps, other]

    cs.AI cs.LG cs.SC

    Efficient Symbolic Reasoning for Neural-Network Verification

    Authors: Zi Wang, Somesh Jha, Krishnamurthy Dvijotham

    Abstract: The neural network has become an integral part of modern software systems. However, they still suffer from various problems, in particular, vulnerability to adversarial attacks. In this work, we present a novel program reasoning framework for neural-network verification, which we refer to as symbolic reasoning. The key components of our framework are the use of the symbolic domain and the quadrati…

    Submitted 23 March, 2023; originally announced March 2023.

  40. arXiv:2303.12872  [pdf, other]

    cs.HC cs.AI cs.LG

    Human Uncertainty in Concept-Based AI Systems

    Authors: Katherine M. Collins, Matthew Barker, Mateo Espinosa Zarlenga, Naveen Raman, Umang Bhatt, Mateja Jamnik, Ilia Sucholutsky, Adrian Weller, Krishnamurthy Dvijotham

    Abstract: Placing a human in the loop may abate the risks of deploying AI systems in safety-critical settings (e.g., a clinician working with a medical AI system). However, mitigating risks arising from human error and uncertainty within such human-AI interactions is an important and understudied issue. In this work, we study human uncertainty in the context of concept-based models, a family of AI systems t…

    Submitted 22 March, 2023; originally announced March 2023.

  41. arXiv:2303.11676  [pdf]

    cs.CV

    Deep Learning Pipeline for Preprocessing and Segmenting Cardiac Magnetic Resonance of Single Ventricle Patients from an Image Registry

    Authors: Tina Yao, Nicole St. Clair, Gabriel F. Miller, Adam L. Dorfman, Mark A. Fogel, Sunil Ghelani, Rajesh Krishnamurthy, Christopher Z. Lam, Joshua D. Robinson, David Schidlow, Timothy C. Slesnick, Justin Weigand, Michael Quail, Rahul Rathod, Jennifer A. Steeden, Vivek Muthurangu

    Abstract: Purpose: To develop and evaluate an end-to-end deep learning pipeline for segmentation and analysis of cardiac magnetic resonance images to provide core-lab processing for a multi-centre registry of Fontan patients. Materials and Methods: This retrospective study used training (n = 175), validation (n = 25) and testing (n = 50) cardiac magnetic resonance image exams collected from 13 institution…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 17 pages, 6 figures

  42. arXiv:2303.10753  [pdf, other

    cs.SI eess.SP

    Fréchet Statistics Based Change Point Detection in Dynamic Social Networks

    Authors: Rui Luo, Vikram Krishnamurthy

    Abstract: This paper proposes a method to detect change points in dynamic social networks using Fréchet statistics. We address two main questions: (1) what metric can quantify the distances between graph Laplacians in a dynamic network and enable efficient computation, and (2) how can the Fréchet statistics be extended to detect multiple change points while maintaining the significance level of the hypothes…

    Submitted 19 March, 2023; originally announced March 2023.

  43. arXiv:2303.09990  [pdf, other

    cs.SI

    Mutual Information Measure for Glass Ceiling Effect in Preferential Attachment Models

    Authors: Rui Luo, Buddhika Nettasinghe, Vikram Krishnamurthy

    Abstract: We propose a new way to measure inequalities such as the glass ceiling effect in attributed networks. Existing measures typically rely solely on node degree distribution or degree assortativity, but our approach goes beyond these measures by using mutual information (based on Shannon and more generally, Renyi entropy) between the conditional probability distributions of node attributes given node…

    Submitted 17 March, 2023; originally announced March 2023.

  44. arXiv:2303.09678  [pdf, other

    eess.SY

    Neural Lyapunov Control for Nonlinear Systems with Unstructured Uncertainties

    Authors: Shiqing Wei, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: Stabilizing controller design and region of attraction (RoA) estimation are essential in nonlinear control. Moreover, it is challenging to implement a control Lyapunov function (CLF) in practice when only partial knowledge of the system is available. We propose a learning framework that can synthesize state-feedback controllers and a CLF for control-affine nonlinear systems with unstructured uncer…

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: Accepted at the 2023 American Control Conference (ACC)

  45. arXiv:2303.08926  [pdf, other

    eess.SY

    Data-Driven Deep Learning Based Feedback Linearization of Systems with Unknown Dynamics

    Authors: Raktim Gautam Goswami, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: A methodology is developed to learn a feedback linearization (i.e., nonlinear change of coordinates and input transformation) using a data-driven approach for a single input control-affine nonlinear system with unknown dynamics. We employ deep neural networks to learn the feedback law (input transformation) in conjunction with an extension of invertible neural networks to learn the nonlinear chang…

    Submitted 21 May, 2023; v1 submitted 15 March, 2023; originally announced March 2023.

  46. arXiv:2303.05973  [pdf, other

    eess.SY

    Data-Efficient Control Barrier Function Refinement

    Authors: Bolun Dai, Heming Huang, Prashanth Krishnamurthy, Farshad Khorrami

    Abstract: Control barrier functions (CBFs) have been widely used for synthesizing controllers in safety-critical applications. When used as a safety filter, it provides a simple and computationally efficient way to obtain safe controls from a possibly unsafe performance controller. Despite its conceptual simplicity, constructing a valid CBF is well known to be challenging, especially for high-relative degre…

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted at 2023 American Control Conference

  47. arXiv:2303.02535  [pdf, other

    cs.LG

    Streaming Active Learning with Deep Neural Networks

    Authors: Akanksha Saran, Safoora Yousefi, Akshay Krishnamurthy, John Langford, Jordan T. Ash

    Abstract: Active learning is perhaps most naturally posed as an online learning problem. However, prior active learning approaches with deep neural networks assume offline access to the entire dataset ahead of time. This paper proposes VeSSAL, a new algorithm for batch active learning with deep neural networks in streaming settings, which samples groups of points to query for labels at the moment they are e…

    Submitted 6 June, 2023; v1 submitted 4 March, 2023; originally announced March 2023.

    Comments: ICML 2023

  48. arXiv:2302.14753  [pdf, other

    cs.LG cs.AI stat.ML

    Learning Hidden Markov Models Using Conditional Samples

    Authors: Sham M. Kakade, Akshay Krishnamurthy, Gaurav Mahajan, Cyril Zhang

    Abstract: This paper is concerned with the computational complexity of learning the Hidden Markov Model (HMM). Although HMMs are some of the most widely used tools in sequential and time series modeling, they are cryptographically hard to learn in the standard setting where one has access to i.i.d. samples of observation sequences. In this paper, we depart from this setup and consider an interactive access…

    Submitted 24 February, 2024; v1 submitted 28 February, 2023; originally announced February 2023.

  49. arXiv:2302.14703  [pdf, other

    cs.LG cs.AI cs.NE

    Improving Expert Specialization in Mixture of Experts

    Authors: Yamuna Krishnamurthy, Chris Watkins, Thomas Gaertner

    Abstract: Mixture of experts (MoE), introduced over 20 years ago, is the simplest gated modular neural network architecture. There is renewed interest in MoE because the conditional computation allows only parts of the network to be used during each inference, as was recently demonstrated in large scale natural language processing models. MoE is also of potential interest for continual learning, as experts…

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: 14 pages including appendix

  50. arXiv:2302.13934  [pdf, other

    cs.LG stat.ML

    Statistical Learning under Heterogeneous Distribution Shift

    Authors: Max Simchowitz, Anurag Ajay, Pulkit Agrawal, Akshay Krishnamurthy

    Abstract: This paper studies the prediction of a target $\mathbf{z}$ from a pair of random variables $(\mathbf{x},\mathbf{y})$, where the ground-truth predictor is additive $\mathbb{E}[\mathbf{z} \mid \mathbf{x},\mathbf{y}] = f_\star(\mathbf{x}) +g_{\star}(\mathbf{y})$. We study the performance of empirical risk minimization (ERM) over functions $f+g$, $f \in F$ and $g \in G$, fit on a given training distri…

    Submitted 27 October, 2023; v1 submitted 27 February, 2023; originally announced February 2023.