Skip to main content

Showing 1–41 of 41 results for author: Agarwal, C

.
  1. arXiv:2406.10625  [pdf, other

    cs.CL

    On the Hardness of Faithful Chain-of-Thought Reasoning in Large Language Models

    Authors: Sree Harsha Tanneru, Dan Ley, Chirag Agarwal, Himabindu Lakkaraju

    Abstract: As Large Language Models (LLMs) are increasingly being employed in real-world applications in critical domains such as healthcare, it is important to ensure that the Chain-of-Thought (CoT) reasoning generated by these models faithfully captures their underlying behavior. While LLMs are known to generate CoT reasoning that is appealing to humans, prior studies have shown that these explanations d… ▽ More

    Submitted 1 July, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

  2. arXiv:2403.03744  [pdf, other

    cs.AI

    MedSafetyBench: Evaluating and Improving the Medical Safety of Large Language Models

    Authors: Tessa Han, Aounon Kumar, Chirag Agarwal, Himabindu Lakkaraju

    Abstract: As large language models (LLMs) develop increasingly sophisticated capabilities and find applications in medical settings, it becomes important to assess their medical safety due to their far-reaching implications for personal and public health, patient safety, and human rights. However, there is little to no understanding of the notion of medical safety in the context of LLMs, let alone how to ev… ▽ More

    Submitted 13 June, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  3. arXiv:2402.14145  [pdf, other

    stat.ML cs.LG stat.ME

    Multiply Robust Estimation for Local Distribution Shifts with Multiple Domains

    Authors: Steven Wilkins-Reeves, Xu Chen, Qi Ma, Christine Agarwal, Aude Hofleitner

    Abstract: Distribution shifts are ubiquitous in real-world machine learning applications, posing a challenge to the generalization of models trained on one data distribution to another. We focus on scenarios where data distributions vary across multiple segments of the entire population and only make local assumptions about the differences between training and test (deployment) distributions within each seg… ▽ More

    Submitted 3 June, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 9 pages, 4 figures

  4. arXiv:2402.06625  [pdf, other

    cs.CL

    Understanding the Effects of Iterative Prompting on Truthfulness

    Authors: Satyapriya Krishna, Chirag Agarwal, Himabindu Lakkaraju

    Abstract: The development of Large Language Models (LLMs) has notably transformed numerous sectors, offering impressive text generation capabilities. Yet, the reliability and truthfulness of these models remain pressing concerns. To this end, we investigate iterative prompting, a strategy hypothesized to refine LLM responses, assessing its impact on LLM truthfulness, an area which has not been thoroughly ex… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  5. arXiv:2402.04614  [pdf, other

    cs.CL

    Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from Large Language Models

    Authors: Chirag Agarwal, Sree Harsha Tanneru, Himabindu Lakkaraju

    Abstract: Large Language Models (LLMs) are deployed as powerful tools for several natural language processing (NLP) applications. Recent works show that modern LLMs can generate self-explanations (SEs), which elicit their intermediate reasoning steps for explaining their behavior. Self-explanations have seen widespread adoption owing to their conversational and plausible nature. However, there is little to… ▽ More

    Submitted 13 March, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

  6. arXiv:2311.03533  [pdf, other

    cs.CL

    Quantifying Uncertainty in Natural Language Explanations of Large Language Models

    Authors: Sree Harsha Tanneru, Chirag Agarwal, Himabindu Lakkaraju

    Abstract: Large Language Models (LLMs) are increasingly used as powerful tools for several high-stakes natural language processing (NLP) applications. Recent prompting works claim to elicit intermediate reasoning steps and key tokens that serve as proxy explanations for LLM predictions. However, there is no certainty whether these explanations are reliable and reflect the LLMs behavior. In this work, we mak… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  7. arXiv:2310.05797  [pdf, other

    cs.CL cs.AI cs.LG

    Are Large Language Models Post Hoc Explainers?

    Authors: Nicholas Kroeger, Dan Ley, Satyapriya Krishna, Chirag Agarwal, Himabindu Lakkaraju

    Abstract: The increasing use of predictive models in high-stakes settings highlights the need for ensuring that relevant stakeholders understand and trust the decisions made by these models. To this end, several approaches have been proposed in recent literature to explain the behavior of complex predictive models in a post hoc fashion. However, despite the growing number of such post hoc explanation techni… ▽ More

    Submitted 26 February, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

  8. arXiv:2309.16452  [pdf, other

    cs.LG

    On the Trade-offs between Adversarial Robustness and Actionable Explanations

    Authors: Satyapriya Krishna, Chirag Agarwal, Himabindu Lakkaraju

    Abstract: As machine learning models are increasingly being employed in various high-stakes settings, it becomes important to ensure that predictions of these models are not only adversarially robust, but also readily explainable to relevant stakeholders. However, it is unclear if these two notions can be simultaneously achieved or if there exist trade-offs between them. In this work, we make one of the fir… ▽ More

    Submitted 28 September, 2023; originally announced September 2023.

  9. arXiv:2309.02705  [pdf, other

    cs.CL cs.AI cs.CR cs.LG

    Certifying LLM Safety against Adversarial Prompting

    Authors: Aounon Kumar, Chirag Agarwal, Suraj Srinivas, Aaron Jiaxun Li, Soheil Feizi, Himabindu Lakkaraju

    Abstract: Large language models (LLMs) are vulnerable to adversarial attacks that add malicious tokens to an input prompt to bypass the safety guardrails of an LLM and cause it to produce harmful content. In this work, we introduce erase-and-check, the first framework for defending against adversarial prompts with certifiable safety guarantees. Given a prompt, our procedure erases tokens individually and in… ▽ More

    Submitted 12 February, 2024; v1 submitted 6 September, 2023; originally announced September 2023.

  10. arXiv:2307.13192  [pdf, other

    cs.AI cs.LG

    Counterfactual Explanation Policies in RL

    Authors: Shripad V. Deshmukh, Srivatsan R, Supriti Vijay, Jayakumar Subramanian, Chirag Agarwal

    Abstract: As Reinforcement Learning (RL) agents are increasingly employed in diverse decision-making problems using reward preferences, it becomes important to ensure that policies learned by these frameworks in map** observations to a probability distribution of the possible actions are explainable. However, there is little to no work in the systematic understanding of these complex policies in a contras… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: ICML Workshop on Counterfactuals in Minds and Machines, 2023

  11. arXiv:2305.04073  [pdf, other

    cs.AI cs.LG

    Explaining RL Decisions with Trajectories

    Authors: Shripad Vilasrao Deshmukh, Arpan Dasgupta, Balaji Krishnamurthy, Nan Jiang, Chirag Agarwal, Georgios Theocharous, Jayakumar Subramanian

    Abstract: Explanation is a key component for the adoption of reinforcement learning (RL) in many real-world decision-making problems. In the literature, the explanation is often provided by saliency attribution to the features of the RL agent's state. In this work, we propose a complementary approach to these explanations, particularly for offline RL, where we attribute the policy decisions of a trained RL… ▽ More

    Submitted 22 January, 2024; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: Published at International Conference on Learning Representations (ICLR), 2023

  12. arXiv:2304.12631  [pdf, other

    cs.IR cs.CL

    Explain like I am BM25: Interpreting a Dense Model's Ranked-List with a Sparse Approximation

    Authors: Michael Llordes, Debasis Ganguly, Sumit Bhatia, Chirag Agarwal

    Abstract: Neural retrieval models (NRMs) have been shown to outperform their statistical counterparts owing to their ability to capture semantic meaning via dense document representations. These models, however, suffer from poor interpretability as they do not rely on explicit term matching. As a form of local per-query explanations, we introduce the notion of equivalent queries that are generated by maximi… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted at SIGIR 2023

  13. arXiv:2303.10431  [pdf, other

    cs.CV

    DeAR: Debiasing Vision-Language Models with Additive Residuals

    Authors: Ashish Seth, Mayur Hemani, Chirag Agarwal

    Abstract: Large pre-trained vision-language models (VLMs) reduce the time for develo** predictive models for various vision-grounded language downstream tasks by providing rich, adaptable image and text representations. However, these models suffer from societal biases owing to the skewed distribution of various identity groups in the training data. These biases manifest as the skewed similarity between t… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

    Comments: Accepted to CVPR'23. Codes and dataset will be released soon

  14. arXiv:2302.13406  [pdf, other

    cs.LG cs.AI

    GNNDelete: A General Strategy for Unlearning in Graph Neural Networks

    Authors: Jiali Cheng, George Dasoulas, Huan He, Chirag Agarwal, Marinka Zitnik

    Abstract: Graph unlearning, which involves deleting graph elements such as nodes, node labels, and relationships from a trained graph neural network (GNN) model, is crucial for real-world applications where data elements may become irrelevant, inaccurate, or privacy-sensitive. However, existing methods for graph unlearning either deteriorate model weights shared across all nodes or fail to effectively delet… ▽ More

    Submitted 26 February, 2023; originally announced February 2023.

    Comments: Accepted to ICLR2023

  15. arXiv:2301.06928  [pdf, other

    cs.LG cs.AI

    Towards Estimating Transferability using Hard Subsets

    Authors: Tarun Ram Menta, Surgan Jandial, Akash Patil, Vimal KB, Saketh Bachu, Balaji Krishnamurthy, Vineeth N. Balasubramanian, Chirag Agarwal, Mausoom Sarkar

    Abstract: As transfer learning techniques are increasingly used to transfer knowledge from the source model to the target task, it becomes important to quantify which source models are suitable for a given target task without performing computationally expensive fine tuning. In this work, we propose HASTE (HArd Subset TransfErability), a new strategy to estimate the transferability of a source model to a pa… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: First three authors contributed equally

  16. arXiv:2211.16731  [pdf, other

    cs.LG cs.AI

    Towards Training GNNs using Explanation Directed Message Passing

    Authors: Valentina Giunchiglia, Chirag Varun Shukla, Guadalupe Gonzalez, Chirag Agarwal

    Abstract: With the increasing use of Graph Neural Networks (GNNs) in critical real-world applications, several post hoc explanation methods have been proposed to understand their predictions. However, there has been no work in generating explanations on the fly during model training and utilizing them to improve the expressive power of the underlying GNN models. In this work, we introduce a novel explanatio… ▽ More

    Submitted 1 December, 2022; v1 submitted 29 November, 2022; originally announced November 2022.

    Comments: Accepted to the proceedings of the First Learning on Graphs Conference (LoG 2022)

  17. arXiv:2208.09339  [pdf, other

    cs.LG cs.AI

    Evaluating Explainability for Graph Neural Networks

    Authors: Chirag Agarwal, Owen Queen, Himabindu Lakkaraju, Marinka Zitnik

    Abstract: As post hoc explanations are increasingly used to understand the behavior of graph neural networks (GNNs), it becomes crucial to evaluate the quality and reliability of GNN explanations. However, assessing the quality of GNN explanations is challenging as existing graph datasets have no or unreliable ground-truth explanations for a given task. Here, we introduce a synthetic graph data generator, S… ▽ More

    Submitted 16 January, 2023; v1 submitted 19 August, 2022; originally announced August 2022.

  18. arXiv:2206.11104  [pdf, other

    cs.LG cs.AI

    OpenXAI: Towards a Transparent Evaluation of Model Explanations

    Authors: Chirag Agarwal, Dan Ley, Satyapriya Krishna, Eshika Saxena, Martin Pawelczyk, Nari Johnson, Isha Puri, Marinka Zitnik, Himabindu Lakkaraju

    Abstract: While several types of post hoc explanation methods have been proposed in recent literature, there is very little work on systematically benchmarking these methods. Here, we introduce OpenXAI, a comprehensive and extensible open-source framework for evaluating and benchmarking post hoc explanation methods. OpenXAI comprises of the following key components: (i) a flexible synthetic data generator a… ▽ More

    Submitted 13 March, 2024; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: Newer version with updated results and code

  19. arXiv:2206.01465  [pdf, other

    eess.SY cs.LG

    PAC Statistical Model Checking of Mean Payoff in Discrete- and Continuous-Time MDP

    Authors: Chaitanya Agarwal, Shibashis Guha, Jan Křetínský, M. Pazhamalai

    Abstract: Markov decision processes (MDP) and continuous-time MDP (CTMDP) are the fundamental models for non-deterministic systems with probabilistic uncertainty. Mean payoff (a.k.a. long-run average reward) is one of the most classic objectives considered in their context. We provide the first algorithm to compute mean payoff probably approximately correctly in unknown MDP; further, we extend it to unknown… ▽ More

    Submitted 3 June, 2022; originally announced June 2022.

    Comments: Full version of CAV 2022 paper, 57 pages

  20. arXiv:2203.06877  [pdf, other

    cs.LG

    Rethinking Stability for Attribution-based Explanations

    Authors: Chirag Agarwal, Nari Johnson, Martin Pawelczyk, Satyapriya Krishna, Eshika Saxena, Marinka Zitnik, Himabindu Lakkaraju

    Abstract: As attribution-based explanation methods are increasingly used to establish model trustworthiness in high-stakes situations, it is critical to ensure that these explanations are stable, e.g., robust to infinitesimal perturbations to an input. However, previous works have shown that state-of-the-art explanation methods generate unstable explanations. Here, we introduce metrics to quantify the stabi… ▽ More

    Submitted 14 March, 2022; originally announced March 2022.

  21. arXiv:2107.13098  [pdf, other

    cs.CV cs.LG

    A Tale Of Two Long Tails

    Authors: Daniel D'souza, Zach Nussbaum, Chirag Agarwal, Sara Hooker

    Abstract: As machine learning models are increasingly employed to assist human decision-makers, it becomes critical to communicate the uncertainty associated with these model predictions. However, the majority of work on uncertainty has focused on traditional probabilistic or ranking approaches - where the model assigns low probabilities or scores to uncertain examples. While this captures what examples are… ▽ More

    Submitted 27 July, 2021; originally announced July 2021.

    Comments: Preliminary results accepted to Workshop on Uncertainty and Robustness in Deep Learning (UDL), ICML, 2021

  22. arXiv:2106.09992  [pdf, other

    cs.LG cs.AI

    Exploring Counterfactual Explanations Through the Lens of Adversarial Examples: A Theoretical and Empirical Analysis

    Authors: Martin Pawelczyk, Chirag Agarwal, Shalmali Joshi, Sohini Upadhyay, Himabindu Lakkaraju

    Abstract: As machine learning (ML) models become more widely deployed in high-stakes applications, counterfactual explanations have emerged as key tools for providing actionable model explanations in practice. Despite the growing popularity of counterfactual explanations, a deeper understanding of these explanations is still lacking. In this work, we systematically analyze counterfactual explanations throug… ▽ More

    Submitted 19 October, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

    Journal ref: International Conference on Artificial Intelligence and Statistics (AISTATS), 28-30 March 2022

  23. arXiv:2106.09078  [pdf, other

    cs.LG

    Probing GNN Explainers: A Rigorous Theoretical and Empirical Analysis of GNN Explanation Methods

    Authors: Chirag Agarwal, Marinka Zitnik, Himabindu Lakkaraju

    Abstract: As Graph Neural Networks (GNNs) are increasingly being employed in critical real-world applications, several methods have been proposed in recent literature to explain the predictions of these models. However, there has been little to no work on systematically analyzing the reliability of these methods. Here, we introduce the first-ever theoretical analysis of the reliability of state-of-the-art G… ▽ More

    Submitted 22 February, 2022; v1 submitted 16 June, 2021; originally announced June 2021.

    Comments: Accepted to AISTATS 2022

  24. arXiv:2102.13186  [pdf, other

    cs.LG

    Towards a Unified Framework for Fair and Stable Graph Representation Learning

    Authors: Chirag Agarwal, Himabindu Lakkaraju, Marinka Zitnik

    Abstract: As the representations output by Graph Neural Networks (GNNs) are increasingly employed in real-world applications, it becomes important to ensure that these representations are fair and stable. In this work, we establish a key connection between counterfactual fairness and stability and leverage it to propose a novel framework, NIFTY (uNIfying Fairness and stabiliTY), which can be used with any G… ▽ More

    Submitted 16 June, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

    Comments: Accepted to UAI'21

    Report number: Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence (UAI 2021),

    Journal ref: PMLR 161:2114-2124, 2021

  25. arXiv:2102.10618  [pdf, other

    cs.LG

    Towards the Unification and Robustness of Perturbation and Gradient Based Explanations

    Authors: Sushant Agarwal, Shahin Jabbari, Chirag Agarwal, Sohini Upadhyay, Zhiwei Steven Wu, Himabindu Lakkaraju

    Abstract: As machine learning black boxes are increasingly being deployed in critical domains such as healthcare and criminal justice, there has been a growing emphasis on develo** techniques for explaining these black boxes in a post hoc manner. In this work, we analyze two popular post hoc interpretation techniques: SmoothGrad which is a gradient based method, and a variant of LIME which is a perturbati… ▽ More

    Submitted 19 July, 2021; v1 submitted 21 February, 2021; originally announced February 2021.

    Comments: The short version of this paper appears in the proceedings of ICML-21

  26. arXiv:2008.11600  [pdf, other

    cs.CV cs.LG

    Estimating Example Difficulty Using Variance of Gradients

    Authors: Chirag Agarwal, Daniel D'souza, Sara Hooker

    Abstract: In machine learning, a question of great interest is understanding what examples are challenging for a model to classify. Identifying atypical examples ensures the safe deployment of models, isolates samples that require further human inspection and provides interpretability into model behavior. In this work, we propose Variance of Gradients (VoG) as a valuable and efficient metric to rank data by… ▽ More

    Submitted 21 June, 2022; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: Accepted to CVPR 2022

  27. arXiv:2007.04663  [pdf, other

    cs.AI

    Automation Strategies for Unconstrained Crossword Puzzle Generation

    Authors: Charu Agarwal, Rushikesh K. Joshi

    Abstract: An unconstrained crossword puzzle is a generalization of the constrained crossword problem. In this problem, only the word vocabulary, and optionally the grid dimensions are known. Hence, it not only requires the algorithm to determine the word locations, but it also needs to come up with the grid geometry. This paper discusses algorithmic strategies for automatic crossword puzzle generation in su… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: 28 pages, 28 figures, category: cs, preprint

  28. arXiv:2006.09373  [pdf, other

    cs.CV cs.LG

    The shape and simplicity biases of adversarially robust ImageNet-trained CNNs

    Authors: Peijie Chen, Chirag Agarwal, Anh Nguyen

    Abstract: Increasingly more similarities between human vision and convolutional neural networks (CNNs) have been revealed in the past few years. Yet, vanilla CNNs often fall short in generalizing to adversarial or out-of-distribution (OOD) examples which humans demonstrate superior performance. Adversarial training is a leading learning algorithm for improving the robustness of CNNs on adversarial and OOD d… ▽ More

    Submitted 12 September, 2022; v1 submitted 16 June, 2020; originally announced June 2020.

  29. arXiv:2003.08754  [pdf, other

    cs.CV cs.LG

    SAM: The Sensitivity of Attribution Methods to Hyperparameters

    Authors: Naman Bansal, Chirag Agarwal, Anh Nguyen

    Abstract: Attribution methods can provide powerful insights into the reasons for a classifier's decision. We argue that a key desideratum of an explanation method is its robustness to input hyperparameters which are often randomly set or empirically tuned. High sensitivity to arbitrary hyperparameter choices does not only impede reproducibility but also questions the correctness of an explanation and impair… ▽ More

    Submitted 12 April, 2020; v1 submitted 4 March, 2020; originally announced March 2020.

    Comments: Oral paper at CVPR 2020

  30. Deep-URL: A Model-Aware Approach To Blind Deconvolution Based On Deep Unfolded Richardson-Lucy Network

    Authors: Chirag Agarwal, Shahin Khobahi, Arindam Bose, Mojtaba Soltanalian, Dan Schonfeld

    Abstract: The lack of interpretability in current deep learning models causes serious concerns as they are extensively used for various life-critical applications. Hence, it is of paramount importance to develop interpretable deep learning models. In this paper, we consider the problem of blind deconvolution and propose a novel model-aware deep architecture that allows for the recovery of both the blur kern… ▽ More

    Submitted 7 June, 2020; v1 submitted 3 February, 2020; originally announced February 2020.

    Comments: Accepted. 27th IEEE International Conference on Image Processing (ICIP), 2020

  31. arXiv:1910.04256  [pdf, other

    cs.LG cs.CV stat.ML

    Explaining image classifiers by removing input features using generative models

    Authors: Chirag Agarwal, Anh Nguyen

    Abstract: Perturbation-based explanation methods often measure the contribution of an input feature to an image classifier's outputs by heuristically removing it via e.g. blurring, adding noise, or graying out, which often produce unrealistic, out-of-samples. Instead, we propose to integrate a generative inpainter into three representative attribution methods to remove an input feature. Our proposed change… ▽ More

    Submitted 6 October, 2020; v1 submitted 9 October, 2019; originally announced October 2019.

    Comments: Accepted to Asian Conference on Computer Vision (ACCV), 2020

  32. arXiv:1811.00621  [pdf, ps, other

    cs.CR cs.LG

    Improving Adversarial Robustness by Encouraging Discriminative Features

    Authors: Chirag Agarwal, Anh Nguyen, Dan Schonfeld

    Abstract: Deep neural networks (DNNs) have achieved state-of-the-art results in various pattern recognition tasks. However, they perform poorly on out-of-distribution adversarial examples i.e. inputs that are specifically crafted by an adversary to cause DNNs to misbehave, questioning the security and reliability of applications. In this paper, we encourage DNN classifiers to learn more discriminative featu… ▽ More

    Submitted 8 May, 2019; v1 submitted 1 November, 2018; originally announced November 2018.

    Comments: This article corresponds to the accepted version at IEEE ICIP 2019. We will link the DOI as soon as it is available

    Journal ref: 2019 26th IEEE International Conference on Image Processing (ICIP)

  33. arXiv:1806.01477  [pdf, other

    cs.CR cs.LG stat.ML

    An Explainable Adversarial Robustness Metric for Deep Learning Neural Networks

    Authors: Chirag Agarwal, Bo Dong, Dan Schonfeld, Anthony Hoogs

    Abstract: Deep Neural Networks(DNN) have excessively advanced the field of computer vision by achieving state of the art performance in various vision tasks. These results are not limited to the field of vision but can also be seen in speech recognition and machine translation tasks. Recently, DNNs are found to poorly fail when tested with samples that are crafted by making imperceptible changes to the orig… ▽ More

    Submitted 6 June, 2018; v1 submitted 4 June, 2018; originally announced June 2018.

  34. arXiv:1711.02581  [pdf, ps, other

    cs.MM

    Convolutional Neural Network Steganalysis's Application to Steganography

    Authors: Mehdi Sharifzadeh, Chirag Agarwal, Mohammed Aloraini, Dan Schonfeld

    Abstract: This paper presents a novel approach to increase the performance bounds of image steganography under the criteria of minimizing distortion. The proposed approach utilizes a steganalysis convolutional neural network (CNN) framework to understand an image's model and embed in less detectable regions to preserve the model. In other word, the trained steganalysis CNN is used to calculate derivatives o… ▽ More

    Submitted 6 November, 2017; originally announced November 2017.

    Comments: arXiv admin note: substantial text overlap with arXiv:1705.08616

  35. arXiv:1705.08616  [pdf, ps, other

    cs.MM cs.CR

    A New Parallel Message-distribution Technique for Cost-based Steganography

    Authors: Mehdi Sharifzadeh, Chirag Agarwal, Mahdi Salarian, Dan Schonfeld

    Abstract: This paper presents two novel approaches to increase performance bounds of image steganography under the criteria of minimizing distortion. First, in order to efficiently use the images' capacities, we propose using parallel images in the embedding stage. The result is then used to prove sub-optimality of the message distribution technique used by all cost based algorithms including HUGO, S-UNIWAR… ▽ More

    Submitted 7 July, 2019; v1 submitted 24 May, 2017; originally announced May 2017.

  36. arXiv:1705.07404  [pdf, other

    cs.CV cs.LG

    Convergence of backpropagation with momentum for network architectures with skip connections

    Authors: Chirag Agarwal, Joe Klobusicky, Dan Schonfeld

    Abstract: We study a class of deep neural networks with networks that form a directed acyclic graph (DAG). For backpropagation defined by gradient descent with adaptive momentum, we show weights converge for a large class of nonlinear activation functions. The proof generalizes the results of Wu et al. (2008) who showed convergence for a feed forward network with one hidden layer. For an example of the effe… ▽ More

    Submitted 18 January, 2020; v1 submitted 21 May, 2017; originally announced May 2017.

  37. arXiv:1209.2631  [pdf

    cond-mat.mtrl-sci cond-mat.dis-nn

    Role of Heterogeneities in Staebler-Wronski Effect

    Authors: S. C. Agarwal

    Abstract: The effect of light soaking (LS) on the properties of hydrogenated amorphous silicon presents many challenging puzzles. Some of them are discussed here, along with their present status. In particular the role of the heterogeneities in LS is examined. We find that for the majority of the solved as well unsolved puzzles the long range potential fluctuations arising from the heterogeneities in the fi… ▽ More

    Submitted 12 September, 2012; originally announced September 2012.

    Comments: 10 pages, 7 figures

    Report number: iitk/2012/5

  38. arXiv:1207.5426  [pdf

    cond-mat.mtrl-sci

    Ion beam generated surface ripples: new insight in the underlying mechanism

    Authors: Tanuj Kumar, Ashish Kumar, D. C. Agarwal, N. P. Lalla, D. Kanjilal

    Abstract: A new hydrodynamic mechanism is proposed for the ion beam induced surface patterning on solid surfaces. Unlike the standard mechanisms based on the ion beam impact generated erosion and mass redistribution at the free surface (proposed by Bradley-Harper (BH) and its extended theories), the new mechanism proposes that the ion beam induced saltation and creep processes, coupled with incompressible s… ▽ More

    Submitted 1 June, 2013; v1 submitted 23 July, 2012; originally announced July 2012.

    Comments: 12 pages, 6 figures. arXiv admin note: substantial text overlap with arXiv:1206.0821

  39. arXiv:1206.0821   

    cond-mat.mtrl-sci

    Hydro-dynamics of surface patterning by ion beam irradiation: an interface phenomenon

    Authors: Tanuj Kumar, D. C. Agarwal, S. A. Khan, N. P. Lalla, D. Kanjilal

    Abstract: We show that the ion beam induced incompressible amorphous solid flow in terms of advection transport mechanism leads to the erosion and deposition of atoms at the amorphous/crystalline (a/c) interface resulting in the formation of pattern at the a/c interface as well as at the free surface. The ion beam impact generated erosion and mass redistribution at the free surface are found to have insigni… ▽ More

    Submitted 21 February, 2014; v1 submitted 5 June, 2012; originally announced June 2012.

    Comments: This paper has been withdrawn by the author due to a crucial error in calculation in equation 1

  40. Origin of the anomalous magnetic circular dichroism spectral shape in ferromagnetic (Ga,Mn)As: Impurity bands inside the band gap

    Authors: K. Ando, H. Saito, K. C. Agarwal, M. C. Debnath, V. Zayets

    Abstract: The electronic structure of a prototype dilute magnetic semiconductor (DMS), Ga1-xMnxAs, is studied by magnetic circular dichroism (MCD) spectroscopy. We prove that the optical transitions originated from impurity bands cause the strong positive MCD background. The MCD signal due to the E0 transition from the valence band to the conduction band is negative indicating that the p-d exchange intera… ▽ More

    Submitted 13 October, 2007; originally announced October 2007.

    Comments: 13 pages, 3 figures

  41. arXiv:cond-mat/0505574  [pdf

    cond-mat.mtrl-sci

    Ion Beam Synthesis of embedded SiC

    Authors: Y. S. Katharria, V. BAranwal, D. C. Agarwal, R. Krishna, P. Kumar, F. Singh, D. Kanjilal

    Abstract: The synthesis of embedded silicon carbide was carried out in N type silicon samples having (100) and (111) orientations using high dose implantation of carbon ions at room temperature. The variation of dose was employed to get dose dependence of silicon carbide formation. Postimplant annealing at 1000 C in order to anneal out the implantion induced defects and to get silicon carbide precipitates… ▽ More

    Submitted 24 May, 2005; originally announced May 2005.

    Comments: 10 pages, 6 figures