Showing 1–50 of 68 results for author: Khanna, R

  1. arXiv:2401.12332  [pdf, other]

    cs.LG math.OC

    A Precise Characterization of SGD Stability Using Loss Surface Geometry

    Authors: Gregory Dexter, Borja Ocejo, Sathiya Keerthi, Aman Gupta, Ayan Acharya, Rajiv Khanna

    Abstract: Stochastic Gradient Descent (SGD) stands as a cornerstone optimization algorithm with proven real-world empirical successes but relatively limited theoretical understanding. Recent research has illuminated a key factor contributing to its practical efficacy: the implicit regularization it instigates. Several studies have investigated the linear stability property of SGD in the vicinity of a statio…

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: To appear at ICLR 2024

  2. arXiv:2312.10049  [pdf]

    cs.IR

    Knowledge Graph Reasoning Based on Attention GCN

    Authors: Meera Gupta, Ravi Khanna, Divya Choudhary, Nandini Rao

    Abstract: We propose a novel technique to enhance Knowledge Graph Reasoning by combining Graph Convolution Neural Network (GCN) with the Attention Mechanism. This approach utilizes the Attention Mechanism to examine the relationships between entities and their neighboring nodes, which helps to develop detailed feature vectors for each entity. The GCN uses shared parameters to effectively represent the chara…

    Submitted 27 January, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

  3. arXiv:2310.00488  [pdf, other]

    cs.LG cs.AI

    On Memorization and Privacy Risks of Sharpness Aware Minimization

    Authors: Young In Kim, Pratiksha Agrawal, Johannes O. Royset, Rajiv Khanna

    Abstract: In many recent works, there is an increased focus on designing algorithms that seek flatter optima for neural network loss optimization as there is empirical evidence that it leads to better generalization performance in many datasets. In this work, we dissect these performance gains through the lens of data memorization in overparameterized models. We define a new metric that helps us identify wh…

    Submitted 3 January, 2024; v1 submitted 30 September, 2023; originally announced October 2023.

  4. arXiv:2308.05221  [pdf, other]

    cs.HC cs.AI cs.RO

    Alexa, play with robot: Introducing the First Alexa Prize SimBot Challenge on Embodied AI

    Authors: Hangjie Shi, Leslie Ball, Govind Thattai, Desheng Zhang, Lucy Hu, Qiaozi Gao, Suhaila Shakiah, Xiaofeng Gao, Aishwarya Padmakumar, Bofei Yang, Cadence Chung, Dinakar Guthy, Gaurav Sukhatme, Karthika Arumugam, Matthew Wen, Osman Ipek, Patrick Lange, Rohan Khanna, Shreyas Pansare, Vasu Sharma, Chao Zhang, Cris Flagg, Daniel Pressel, Lavina Vaz, Luke Dai, et al. (17 additional authors not shown)

    Abstract: The Alexa Prize program has empowered numerous university students to explore, experiment, and showcase their talents in building conversational agents through challenges like the SocialBot Grand Challenge and the TaskBot Challenge. As conversational agents increasingly appear in multimodal and embodied contexts, it is important to explore the affordances of conversational interaction augmented wi…

    Submitted 9 August, 2023; originally announced August 2023.

  5. arXiv:2307.02501  [pdf, ps, other]

    stat.ML cs.LG

    Generalization Guarantees via Algorithm-dependent Rademacher Complexity

    Authors: Sarah Sachs, Tim van Erven, Liam Hodgkinson, Rajiv Khanna, Umut Simsekli

    Abstract: Algorithm- and data-dependent generalization bounds are required to explain the generalization behavior of modern machine learning algorithms. In this context, there exists information theoretic generalization bounds that involve (various forms of) mutual information, as well as bounds based on hypothesis set stability. We propose a conceptually related, but technically distinct complexity measure…

    Submitted 4 July, 2023; originally announced July 2023.

  6. arXiv:2303.14284  [pdf, ps, other]

    cs.LG stat.ML

    Feature Space Sketching for Logistic Regression

    Authors: Gregory Dexter, Rajiv Khanna, Jawad Raheel, Petros Drineas

    Abstract: We present novel bounds for coreset construction, feature selection, and dimensionality reduction for logistic regression. All three approaches can be thought of as sketching the logistic regression inputs. On the coreset construction front, we resolve open problems from prior work and present novel bounds for the complexity of coreset construction methods. On the feature selection and dimensional…

    Submitted 24 March, 2023; originally announced March 2023.

  7. arXiv:2303.01586  [pdf, other]

    cs.HC cs.AI cs.RO

    Alexa Arena: A User-Centric Interactive Platform for Embodied AI

    Authors: Qiaozi Gao, Govind Thattai, Suhaila Shakiah, Xiaofeng Gao, Shreyas Pansare, Vasu Sharma, Gaurav Sukhatme, Hangjie Shi, Bofei Yang, Desheng Zheng, Lucy Hu, Karthika Arumugam, Shui Hu, Matthew Wen, Dinakar Guthy, Cadence Chung, Rohan Khanna, Osman Ipek, Leslie Ball, Kate Bland, Heather Rocker, Yadunandana Rao, Michael Johnston, Reza Ghanadan, Arindam Mandal, et al. (2 additional authors not shown)

    Abstract: We introduce Alexa Arena, a user-centric simulation platform for Embodied AI (EAI) research. Alexa Arena provides a variety of multi-room layouts and interactable objects, for the creation of human-robot interaction (HRI) missions. With user-friendly graphics and control mechanisms, Alexa Arena supports the development of gamified robotic tasks readily accessible to general human users, thus openi…

    Submitted 7 June, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

  8. arXiv:2302.09693  [pdf, other]

    stat.ML cs.LG

    mSAM: Micro-Batch-Averaged Sharpness-Aware Minimization

    Authors: Kayhan Behdin, Qingquan Song, Aman Gupta, Sathiya Keerthi, Ayan Acharya, Borja Ocejo, Gregory Dexter, Rajiv Khanna, David Durfee, Rahul Mazumder

    Abstract: Modern deep learning models are over-parameterized, where different optima can result in widely varying generalization performance. The Sharpness-Aware Minimization (SAM) technique modifies the fundamental loss function that steers gradient descent methods toward flatter minima, which are believed to exhibit enhanced generalization prowess. Our study delves into a specific variant of SAM known as…

    Submitted 30 September, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2212.04343

  9. arXiv:2202.13718  [pdf, other]

    cs.LG cs.CY

    Fast Feature Selection with Fairness Constraints

    Authors: Francesco Quinzan, Rajiv Khanna, Moshik Hershcovitch, Sarel Cohen, Daniel G. Waddington, Tobias Friedrich, Michael W. Mahoney

    Abstract: We study the fundamental problem of selecting optimal features for model construction. This problem is computationally challenging on large datasets, even with the use of greedy algorithm variants. To address this challenge, we extend the adaptive query model, recently proposed for the greedy forward selection for submodular functions, to the faster paradigm of Orthogonal Matching Pursuit for non-…

    Submitted 3 February, 2023; v1 submitted 28 February, 2022; originally announced February 2022.

  10. arXiv:2109.13978  [pdf, other]

    cs.AI

    Identifying Reasoning Flaws in Planning-Based RL Using Tree Explanations

    Authors: Kin-Ho Lam, Zhengxian Lin, Jed Irvine, Jonathan Dodge, Zeyad T Shureih, Roli Khanna, Minsuk Kahng, Alan Fern

    Abstract: Enabling humans to identify potential flaws in an agent's decision making is an important Explainable AI application. We consider identifying such flaws in a planning-based deep reinforcement learning (RL) agent for a complex real-time strategy game. In particular, the agent makes decisions via tree search using a learned model and evaluation function over interpretable states and actions. This gi…

    Submitted 28 September, 2021; originally announced September 2021.

  11. arXiv:2108.00781  [pdf, other]

    stat.ML cs.LG

    Generalization Bounds using Lower Tail Exponents in Stochastic Optimizers

    Authors: Liam Hodgkinson, Umut Şimşekli, Rajiv Khanna, Michael W. Mahoney

    Abstract: Despite the ubiquitous use of stochastic optimization algorithms in machine learning, the precise impact of these algorithms and their dynamics on generalization performance in realistic non-convex settings is still poorly understood. While recent work has revealed connections between generalization and heavy-tailed behavior in stochastic optimization, this work mainly relied on continuous-time ap…

    Submitted 11 July, 2022; v1 submitted 2 August, 2021; originally announced August 2021.

    Comments: 22 pages, 6 figures

  12. arXiv:2105.07320  [pdf, other]

    cs.DC stat.ML

    LocalNewton: Reducing Communication Bottleneck for Distributed Learning

    Authors: Vipul Gupta, Avishek Ghosh, Michal Derezinski, Rajiv Khanna, Kannan Ramchandran, Michael Mahoney

    Abstract: To address the communication bottleneck problem in distributed optimization within a master-worker framework, we propose LocalNewton, a distributed second-order algorithm with local averaging. In LocalNewton, the worker machines update their model in every iteration by finding a suitable second-order descent direction using only the data and model stored in their own local memory. We let the worke…

    Submitted 15 May, 2021; originally announced May 2021.

    Comments: To be published in Uncertainty in Artificial Intelligence (UAI) 2021

  13. Counterfactual State Explanations for Reinforcement Learning Agents via Generative Deep Learning

    Authors: Matthew L. Olson, Roli Khanna, Lawrence Neal, Fuxin Li, Weng-Keen Wong

    Abstract: Counterfactual explanations, which deal with "why not?" scenarios, can provide insightful explanations to an AI agent's behavior. In this work, we focus on generating counterfactual explanations for deep reinforcement learning (RL) agents which operate in visual input environments like Atari. We introduce counterfactual state explanations, a novel example-based approach to counterfactual explanati…

    Submitted 29 January, 2021; originally announced January 2021.

    Comments: Full source code available at https://github.com/mattolson93/counterfactual-state-explanations

    Journal ref: Artificial Intelligence, 2021, 103455, ISSN 0004-3702

  14. arXiv:2007.05869  [pdf, other]

    cs.LG stat.ML

    Adversarially-Trained Deep Nets Transfer Better: Illustration on Image Classification

    Authors: Francisco Utrera, Evan Kravitz, N. Benjamin Erichson, Rajiv Khanna, Michael W. Mahoney

    Abstract: Transfer learning has emerged as a powerful methodology for adapting pre-trained deep neural networks on image recognition tasks to new domains. This process consists of taking a neural network pre-trained on a large feature-rich source dataset, freezing the early layers that encode essential generic image properties, and then fine-tuning the last few layers in order to capture specific informatio…

    Submitted 23 April, 2021; v1 submitted 11 July, 2020; originally announced July 2020.

    Comments: Published as a conference paper at ICLR 2021

  15. arXiv:2007.05086  [pdf, other]

    cs.LG stat.ML

    Boundary thickness and robustness in learning models

    Authors: Yaoqing Yang, Rajiv Khanna, Yaodong Yu, Amir Gholami, Kurt Keutzer, Joseph E. Gonzalez, Kannan Ramchandran, Michael W. Mahoney

    Abstract: Robustness of machine learning models to various adversarial and non-adversarial corruptions continues to be of interest. In this paper, we introduce the notion of the boundary thickness of a classifier, and we describe its connection with and usefulness for model robustness. Thick decision boundaries lead to improved performance, while thin decision boundaries lead to overfitting (e.g., measured…

    Submitted 12 January, 2021; v1 submitted 9 July, 2020; originally announced July 2020.

    Journal ref: NeurIPS 2020

  16. arXiv:2007.00715  [pdf, other]

    stat.ML cs.LG stat.CO

    Bayesian Coresets: Revisiting the Nonconvex Optimization Perspective

    Authors: Jacky Y. Zhang, Rajiv Khanna, Anastasios Kyrillidis, Oluwasanmi Koyejo

    Abstract: Bayesian coresets have emerged as a promising approach for implementing scalable Bayesian inference. The Bayesian coreset problem involves selecting a (weighted) subset of the data samples, such that the posterior inference using the selected subset closely approximates the posterior inference using the full dataset. This manuscript revisits Bayesian coresets through the lens of sparsity constrain…

    Submitted 25 February, 2021; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: AISTATS 2021 (Oral)

  17. arXiv:2005.00792  [pdf, other]

    cs.LG stat.ML

    ForecastQA: A Question Answering Challenge for Event Forecasting with Temporal Text Data

    Authors: Woojeong Jin, Rahul Khanna, Suji Kim, Dong-Ho Lee, Fred Morstatter, Aram Galstyan, Xiang Ren

    Abstract: Event forecasting is a challenging, yet important task, as humans seek to constantly plan for the future. Existing automated forecasting studies rely mostly on structured data, such as time-series or event-based knowledge graphs, to help predict future events. In this work, we aim to formulate a task, construct a dataset, and provide benchmarks for developing methods for event forecasting with lar…

    Submitted 7 June, 2021; v1 submitted 2 May, 2020; originally announced May 2020.

    Comments: Accepted to ACL 2021. Project page: https://inklab.usc.edu/ForecastQA/

  18. arXiv:2005.00782  [pdf, other]

    cs.CL cs.AI cs.LO

    RICA: Evaluating Robust Inference Capabilities Based on Commonsense Axioms

    Authors: Pei Zhou, Rahul Khanna, Seyeon Lee, Bill Yuchen Lin, Daniel Ho, Jay Pujara, Xiang Ren

    Abstract: Pre-trained language models (PTLMs) have achieved impressive performance on commonsense inference benchmarks, but their ability to employ commonsense to make robust inferences, which is crucial for effective communications with humans, is debated. In the pursuit of advancing fluid human-AI communication, we propose a new challenge, RICA: Robust Inference capability based on Commonsense Axioms, tha…

    Submitted 9 September, 2021; v1 submitted 2 May, 2020; originally announced May 2020.

    Comments: Accepted in EMNLP 2021 main conference. 20 pages, 8 figures

  19. arXiv:2005.00683  [pdf, other]

    cs.CL cs.AI

    Birds have four legs?! NumerSense: Probing Numerical Commonsense Knowledge of Pre-trained Language Models

    Authors: Bill Yuchen Lin, Seyeon Lee, Rahul Khanna, Xiang Ren

    Abstract: Recent works show that pre-trained language models (PTLMs), such as BERT, possess certain commonsense and factual knowledge. They suggest that it is promising to use PTLMs as "neural knowledge bases" via predicting masked words. Surprisingly, we find that this may not work for numerical commonsense knowledge (e.g., a bird usually has two legs). In this paper, we investigate whether and to what ext…

    Submitted 17 September, 2020; v1 submitted 1 May, 2020; originally announced May 2020.

    Comments: To appear in Proceedings of EMNLP 2020. Project page: http://inklab.usc.edu/NumerSense/

  20. arXiv:2004.07499  [pdf, other]

    cs.CL cs.AI cs.LG

    LEAN-LIFE: A Label-Efficient Annotation Framework Towards Learning from Explanation

    Authors: Dong-Ho Lee, Rahul Khanna, Bill Yuchen Lin, Jamin Chen, Seyeon Lee, Qinyuan Ye, Elizabeth Boschee, Leonardo Neves, Xiang Ren

    Abstract: Successfully training a deep neural network demands a huge corpus of labeled data. However, each label only provides limited information to learn from and collecting the requisite number of labels involves massive human effort. In this work, we introduce LEAN-LIFE, a web-based, Label-Efficient AnnotatioN framework for sequence labeling and classification tasks, with an easy-to-use UI that not only…

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: Accepted to the ACL 2020 (demo). The first two authors contributed equally. Project page: http://inklab.usc.edu/leanlife/

  21. A1: A Distributed In-Memory Graph Database

    Authors: Chiranjeeb Buragohain, Knut Magne Risvik, Paul Brett, Miguel Castro, Wonhee Cho, Joshua Cowhig, Nikolas Gloy, Karthik Kalyanaraman, Richendra Khanna, John Pao, Matthew Renzelmann, Alex Shamis, Timothy Tan, Shuheng Zheng

    Abstract: A1 is an in-memory distributed database used by the Bing search engine to support complex queries over structured data. The key enablers for A1 are availability of cheap DRAM and high speed RDMA (Remote Direct Memory Access) networking in commodity hardware. A1 uses FaRM as its underlying storage layer and builds the graph abstraction and query engine on top. The combination of in-memory storage a…

    Submitted 12 April, 2020; originally announced April 2020.

  22. arXiv:2002.09073  [pdf, other]

    cs.LG stat.ML

    Improved guarantees and a multiple-descent curve for Column Subset Selection and the Nyström method

    Authors: Michał Dereziński, Rajiv Khanna, Michael W. Mahoney

    Abstract: The Column Subset Selection Problem (CSSP) and the Nyström method are among the leading tools for constructing small low-rank approximations of large datasets in machine learning and scientific computing. A fundamental question in this area is: how well can a data subset of size k compete with the best rank k approximation? We develop techniques which exploit spectral properties of the data matrix…

    Submitted 18 December, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

    Comments: Minor typo corrections and clarifications; slight change in the title; moved part of the related work and background discussion to the appendix

  23. The quotient Unimodular Vector group is nilpotent

    Authors: Reema Khanna, Selby Jose, Sampat Sharma, Ravi A. Rao

    Abstract: Jose-Rao introduced and studied the Special Unimodular Vector group $SUm_r(R)$ and $EUm_r(R)$, its Elementary Unimodular Vector subgroup. They proved that for $r \geq 2$, $EUm_r(R)$ is a normal subgroup of $SUm_r(R)$. The Jose-Rao theorem says that the quotient Unimodular Vector group, $SUm_r(R)/EUm_r(R)$, for $r \geq 2$, is a subgroup of the orthogonal quotient group…

    Submitted 20 January, 2020; originally announced January 2020.

    Journal ref: Leavitt Path algebra and Classical K-Theory, (2020) 225-240. Indian statistical institute series. Springer, Singapore

  24. Building an Aerial-Ground Robotics System for Precision Farming: An Adaptable Solution

    Authors: Alberto Pretto, Stéphanie Aravecchia, Wolfram Burgard, Nived Chebrolu, Christian Dornhege, Tillmann Falck, Freya Fleckenstein, Alessandra Fontenla, Marco Imperoli, Raghav Khanna, Frank Liebisch, Philipp Lottes, Andres Milioto, Daniele Nardi, Sandro Nardi, Johannes Pfeifer, Marija Popović, Ciro Potena, Cédric Pradalier, Elisa Rothacker-Feder, Inkyu Sa, Alexander Schaefer, Roland Siegwart, Cyrill Stachniss, Achim Walter, et al. (3 additional authors not shown)

    Abstract: The application of autonomous robots in agriculture is gaining increasing popularity thanks to the high impact it may have on food security, sustainability, resource use efficiency, reduction of chemical treatments, and the optimization of human effort and yield. With this vision, the Flourish research project aimed to develop an adaptable robotic solution for precision farming that combines the a…

    Submitted 7 June, 2022; v1 submitted 8 November, 2019; originally announced November 2019.

    Comments: Published in IEEE Robotics & Automation Magazine, vol. 28, no. 3, pp. 29-49, Sept. 2021

    Journal ref: IEEE Robotics & Automation Magazine, vol. 28, no. 3, pp. 29-49, Sept. 2021

  25. arXiv:1910.13389  [pdf, ps, other]

    stat.ML cs.LG math.OC

    Learning Sparse Distributions using Iterative Hard Thresholding

    Authors: Jacky Y. Zhang, Rajiv Khanna, Anastasios Kyrillidis, Oluwasanmi Koyejo

    Abstract: Iterative hard thresholding (IHT) is a projected gradient descent algorithm, known to achieve state of the art performance for a wide range of structured estimation problems, such as sparse inference. In this work, we consider IHT as a solution to the problem of learning sparse discrete distributions. We study the hardness of using IHT on the space of measures. As a practical alternative, we propo…

    Submitted 30 January, 2020; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019

  26. An Efficient Sampling-based Method for Online Informative Path Planning in Unknown Environments

    Authors: Lukas Schmid, Michael Pantic, Raghav Khanna, Lionel Ott, Roland Siegwart, Juan Nieto

    Abstract: The ability to plan informative paths online is essential to robot autonomy. In particular, sampling-based approaches are often used as they are capable of using arbitrary information gain formulations. However, they are prone to local minima, resulting in sub-optimal trajectories, and sometimes do not reach global coverage. In this paper, we present a new RRT*-inspired online informative path pla…

    Submitted 14 January, 2020; v1 submitted 20 September, 2019; originally announced September 2019.

    Comments: 8 pages, 6 figures, video: https://youtu.be/lEadqJ1_8Do, framework: https://github.com/ethz-asl/mav_active_3d_planning

    Journal ref: IEEE Robotics and Automation Letters, Vol. 5, Iss. 2, April 2020

  27. arXiv:1907.08410  [pdf, other]

    stat.ML cs.LG

    Geometric Rates of Convergence for Kernel-based Sampling Algorithms

    Authors: Rajiv Khanna, Liam Hodgkinson, Michael W. Mahoney

    Abstract: The rate of convergence of weighted kernel herding (WKH) and sequential Bayesian quadrature (SBQ), two kernel-based sampling algorithms for estimating integrals with respect to some target probability measure, is investigated. Under verifiable conditions on the chosen kernel and target measure, we establish a near-geometric rate of convergence for target measures that are nearly atomic. Furthermor…

    Submitted 31 October, 2021; v1 submitted 19 July, 2019; originally announced July 2019.

    Comments: Accepted to UAI 2021 (Oral)

  28. arXiv:1810.10118  [pdf, other]

    cs.LG stat.ML

    Interpreting Black Box Predictions using Fisher Kernels

    Authors: Rajiv Khanna, Been Kim, Joydeep Ghosh, Oluwasanmi Koyejo

    Abstract: Research in both machine learning and psychology suggests that salient examples can help humans to interpret learning models. To this end, we take a novel look at black box interpretation of test predictions in terms of training examples. Our goal is to ask `which training examples are most responsible for a given set of predictions'? To answer this question, we make use of Fisher kernels as the d…

    Submitted 23 October, 2018; originally announced October 2018.

  29. arXiv:1810.03329  [pdf, ps, other]

    math.KT

    The Pillars of Relative Quillen--Suslin Theory

    Authors: Rabeya Basu, Reema Khanna, Ravi A. Rao

    Abstract: We deduce the relative version of the equivalences relating the relative Local Global Principle and the Normality of the relative Elementary subgroups of the traditional classical groups, viz. general linear, symplectic and orthogonal groups. This generalizes our previous result for the absolute case.

    Submitted 8 October, 2018; originally announced October 2018.

    Comments: 12 pages

    MSC Class: 13C10; 11E57; 11E70; 15A63; 19B10; 19B14

  30. AgriColMap: Aerial-Ground Collaborative 3D Mapping for Precision Farming

    Authors: Ciro Potena, Raghav Khanna, Juan Nieto, Roland Siegwart, Daniele Nardi, Alberto Pretto

    Abstract: The combination of aerial survey capabilities of Unmanned Aerial Vehicles with targeted intervention abilities of agricultural Unmanned Ground Vehicles can significantly improve the effectiveness of robotic systems applied to precision agriculture. In this context, building and updating a common map of the field is an essential but challenging task. The maps built using robots of different types s…

    Submitted 14 March, 2019; v1 submitted 30 September, 2018; originally announced October 2018.

    Comments: Published in IEEE Robotics and Automation Letters, 2019

    Journal ref: IEEE Robotics and Automation Letters, Vol: 4, Issue: 2, April 2019, pages 1085-1092

  31. arXiv:1808.00100  [pdf, other]

    cs.RO

    WeedMap: A large-scale semantic weed mapping framework using aerial multispectral imaging and deep neural network for precision farming

    Authors: Inkyu Sa, Marija Popovic, Raghav Khanna, Zetao Chen, Philipp Lottes, Frank Liebisch, Juan Nieto, Cyrill Stachniss, Achim Walter, Roland Siegwart

    Abstract: We present a novel weed segmentation and mapping framework that processes multispectral images obtained from an unmanned aerial vehicle (UAV) using a deep neural network (DNN). Most studies on crop/weed semantic segmentation only consider single images for processing and classification. Images taken by UAVs often cover only a few hundred square meters with either color only or color and near-infra…

    Submitted 6 September, 2018; v1 submitted 31 July, 2018; originally announced August 2018.

    Comments: 25 pages, 14 figures, MDPI Remote Sensing

  32. arXiv:1806.02185  [pdf, other]

    stat.ML cs.LG

    Boosting Black Box Variational Inference

    Authors: Francesco Locatello, Gideon Dresdner, Rajiv Khanna, Isabel Valera, Gunnar Rätsch

    Abstract: Approximating a probability density in a tractable manner is a central task in Bayesian statistics. Variational Inference (VI) is a popular technique that achieves tractability by choosing a relatively simple variational family. Borrowing ideas from the classic boosting framework, recent approaches attempt to \emph{boost} VI by replacing the selection of a single density with a greedily constructe…

    Submitted 28 November, 2018; v1 submitted 6 June, 2018; originally announced June 2018.

  33. arXiv:1712.09379  [pdf, other]

    math.OC cs.DS cs.LG math.NA stat.ML

    IHT dies hard: Provable accelerated Iterative Hard Thresholding

    Authors: Rajiv Khanna, Anastasios Kyrillidis

    Abstract: We study --both in theory and practice-- the use of momentum motions in classic iterative hard thresholding (IHT) methods. By simply modifying plain IHT, we investigate its convergence behavior on convex optimization criteria with non-convex constraints, under standard assumptions. In diverse scenaria, we observe that acceleration in IHT leads to significant improvements, compared to state of the…

    Submitted 13 September, 2019; v1 submitted 26 December, 2017; originally announced December 2017.

    Comments: accepted to AISTATS 2018

  34. arXiv:1711.00548  [pdf, other]

    cs.RO

    Autonomous Electric Race Car Design

    Authors: Niklas Funk, Nikhilesh Alatur, Robin Deuber, Frederick Gonon, Nico Messikommer, Julian Nubert, Moritz Patriarca, Simon Schaefer, Dominic Scotoni, Nicholas Bünger, Renaud Dube, Raghav Khanna, Mark Pfeiffer, Erik Wilhelm, Roland Siegwart

    Abstract: Autonomous driving and electric vehicles are nowadays very active research and development areas. In this paper we present the conversion of a standard Kyburz eRod into an autonomous vehicle that can be operated in challenging environments such as Swiss mountain passes. The overall hardware and software architectures are described in detail with a special emphasis on the sensor requirements for au…

    Submitted 1 November, 2017; originally announced November 2017.

    Comments: Paper submitted and accepted to the EVS30 Symposium, from October 9-11, 2017 in Stuttgart, Germany

  35. arXiv:1709.03329  [pdf, other]

    cs.CV cs.RO

    weedNet: Dense Semantic Weed Classification Using Multispectral Images and MAV for Smart Farming

    Authors: Inkyu Sa, Zetao Chen, Marija Popovic, Raghav Khanna, Frank Liebisch, Juan Nieto, Roland Siegwart

    Abstract: Selective weed treatment is a critical step in autonomous crop management as related to crop health and yield. However, a key challenge is reliable, and accurate weed detection to minimize damage to surrounding plants. In this paper, we present an approach for dense semantic weed classification with multispectral images collected by a micro aerial vehicle (MAV). We use the recently developed encod…

    Submitted 11 September, 2017; originally announced September 2017.

  36. Build Your Own Visual-Inertial Drone: A Cost-Effective and Open-Source Autonomous Drone

    Authors: Inkyu Sa, Mina Kamel, Michael Burri, Michael Bloesch, Raghav Khanna, Marija Popovic, Juan Nieto, Roland Siegwart

    Abstract: This paper describes an approach to building a cost-effective and research grade visual-inertial odometry aided vertical taking-off and landing (VTOL) platform. We utilize an off-the-shelf visual-inertial sensor, an onboard computer, and a quadrotor platform that are factory-calibrated and mass-produced, thereby sharing similar hardware and sensor specifications (e.g., mass, dimensions, intrinsic…

    Submitted 6 September, 2018; v1 submitted 22 August, 2017; originally announced August 2017.

    Comments: 21 pages, 10 figures, accepted to IEEE Robotics & Automation Magazine

    Journal ref: IEEE Robotics & Automation Magazine 2017

  37. arXiv:1708.01733  [pdf, other]

    cs.LG cs.AI stat.ML

    Boosting Variational Inference: an Optimization Perspective

    Authors: Francesco Locatello, Rajiv Khanna, Joydeep Ghosh, Gunnar Rätsch

    Abstract: Variational inference is a popular technique to approximate a possibly intractable Bayesian posterior with a more tractable one. Recently, boosting variational inference has been proposed as a new paradigm to approximate the posterior by a mixture of densities by greedily adding components to the mixture. However, as is the case with many other variational inference algorithms, its theoretical pro…

    Submitted 7 March, 2018; v1 submitted 5 August, 2017; originally announced August 2017.

    Journal ref: AISTATS 2018

  38. arXiv:1703.02723  [pdf, other]

    stat.ML cs.IT cs.LG

    Scalable Greedy Feature Selection via Weak Submodularity

    Authors: Rajiv Khanna, Ethan Elenberg, Alexandros G. Dimakis, Sahand Negahban, Joydeep Ghosh

    Abstract: Greedy algorithms are widely used for problems in machine learning such as feature selection and set function optimization. Unfortunately, for large datasets, the running time of even greedy algorithms can be quite high. This is because for each greedy step we need to refit a model or calculate a function using the previously selected choices and the new candidate. Two algorithms that are faster…

    Submitted 8 March, 2017; originally announced March 2017.

    Comments: To appear in AISTATS 2017

  39. arXiv:1703.02721  [pdf, other]

    stat.ML cs.IT cs.LG

    On Approximation Guarantees for Greedy Low Rank Optimization

    Authors: Rajiv Khanna, Ethan Elenberg, Alexandros G. Dimakis, Sahand Negahban

    Abstract: We provide new approximation guarantees for greedy low rank matrix estimation under standard assumptions of restricted strong convexity and smoothness. Our novel analysis also uncovers previously unknown connections between the low rank estimation and combinatorial optimization, so much so that our bounds are reminiscent of corresponding approximation bounds in submodular maximization. Additionall…

    Submitted 8 March, 2017; originally announced March 2017.

  40. arXiv:1702.06457  [pdf, other]

    cs.LG stat.ML

    A Unified Optimization View on Generalized Matching Pursuit and Frank-Wolfe

    Authors: Francesco Locatello, Rajiv Khanna, Michael Tschannen, Martin Jaggi

    Abstract: Two of the most fundamental prototypes of greedy optimization are the matching pursuit and Frank-Wolfe algorithms. In this paper, we take a unified view on both classes of methods, leading to the first explicit convergence rates of matching pursuit methods in an optimization sense, for general sets of atoms. We derive sublinear ($1/t$) convergence for both classes on general smooth objectives, and…

    Submitted 7 March, 2017; v1 submitted 21 February, 2017; originally announced February 2017.

    Comments: appearing at AISTATS 2017

  41. arXiv:1701.08623  [pdf, other]

    cs.RO

    Dynamic System Identification, and Control for a cost effective open-source VTOL MAV

    Authors: Inkyu Sa, Mina Kamel, Raghav Khanna, Marija Popovic, Juan Nieto, Roland Siegwart

    Abstract: This paper describes dynamic system identification, and full control of a cost-effective vertical take-off and landing (VTOL) multi-rotor micro-aerial vehicle (MAV) --- DJI Matrice 100. The dynamics of the vehicle and autopilot controllers are identified using only a built-in IMU and utilized to design a subsequent model predictive controller (MPC). Experimental results for the control performance…

    Submitted 9 March, 2017; v1 submitted 30 January, 2017; originally announced January 2017.

    Comments: 8 pages, 12 figures

  42. arXiv:1612.00804  [pdf, other]

    stat.ML cs.IT cs.LG

    Restricted Strong Convexity Implies Weak Submodularity

    Authors: Ethan R. Elenberg, Rajiv Khanna, Alexandros G. Dimakis, Sahand Negahban

    Abstract: We connect high-dimensional subset selection and submodular maximization. Our results extend the work of Das and Kempe (2011) from the setting of linear regression to arbitrary objective functions. For greedy feature selection, this connection allows us to obtain strong multiplicative performance bounds on several methods without statistical modeling assumptions. We also derive recovery guarantees…

    Submitted 12 October, 2017; v1 submitted 2 December, 2016; originally announced December 2016.

  43. AstroSat CZT Imager observations of GRB 151006A: timing, spectroscopy, and polarisation study

    Authors: A. R. Rao, Vikas Chand, M. K. Hingar, S. Iyyani, Rakesh Khanna, A. P. K. Kutty, J. P. Malkar, D. Paul, V. B. Bhalerao, D. Bhattacharya, G. C. Dewangan, Pramod Pawar, A. M. Vibhute, T. Chattopadhyay, N. P. S. Mithun, S. V. Vadawale, N. Vagshette, R. Basak, P. Pradeep, Essy Samuel, S. Sreekumar, P. Vinod, K. H. Navalgund, R. Pandiyan, K. S. Sarma, et al. (2 additional authors not shown)

    Abstract: AstroSat is a multi-wavelength satellite launched on 2015 September 28. The CZT Imager of AstroSat on its very first day of operation detected a long duration gamma-ray burst (GRB) namely GRB 151006A. Using the off-axis imaging and spectral response of the instrument, we demonstrate that CZT Imager can localise this GRB correct to about a few degrees and it can provide, in conjunction with Swift,…

    Submitted 26 August, 2016; originally announced August 2016.

    Comments: Submitted to ApJ

  44. arXiv:1608.06038  [pdf, other]

    astro-ph.IM physics.ins-det

    Charged Particle Monitor on the AstroSat mission

    Authors: A. R. Rao, M. H. Patil, Yash Bhargava, Rakesh Khanna, M. K. Hingar, A. P. K. Kutty, J. P. Malkar, Rupal Basak, S. Sreekumar, Essy Samuel, P. Priya, P. Vinod, D. Bhattacharya, V. Bhalerao, S. V. Vadawale, N. P. S. Mithun, R. Pandiyan, K. Subbarao, S. Seetha, K. Suryanarayana Sarma

    Abstract: Charged Particle Monitor (CPM) on-board the AstroSat satellite is an instrument designed to detect the flux of charged particles at the satellite location. A Cesium Iodide Thallium (CsI(Tl)) crystal is used with a Kapton window to detect protons with energies greater than 1 MeV. The ground calibration of CPM was done using gamma-rays from radioactive sources and protons from particle accelerators.…

    Submitted 21 August, 2016; originally announced August 2016.

    Comments: 9 pages, 8 figures, 5 tables. To appear in JAA special issue on AstroSat

  45. arXiv:1608.03408  [pdf, other]

    astro-ph.IM astro-ph.HE

    The Cadmium Zinc Telluride Imager on AstroSat

    Authors: V. Bhalerao, D. Bhattacharya, A. Vibhute, P. Pawar, A. R. Rao, M. K. Hingar, Rakesh Khanna, A. P. K. Kutty, J. P. Malkar, M. H. Patil, Y. K. Arora, S. Sinha, P. Priya, Essy Samuel, S. Sreekumar, P. Vinod, N. P. S. Mithun, S. V. Vadawale, N. Vagshette, K. H. Navalgund, K. S. Sarma, R. Pandiyan, S. Seetha, K. Subbarao

    Abstract: The Cadmium Zinc Telluride Imager (CZTI) is a high energy, wide-field imaging instrument on AstroSat. CZT's namesake Cadmium Zinc Telluride detectors cover an energy range from 20 keV to > 200 keV, with 11% energy resolution at 60 keV. The coded aperture mask attains an angular resolution of 17' over a 4.6 deg x 4.6 deg (FWHM) field of view. CZTI functions as an open detector above 100 keV, contin…

    Submitted 11 August, 2016; originally announced August 2016.

    Comments: 9 pages, 6 figures, 1 table. To appear in Astrosat special issue of the Journal of Astronomy and Astrophysics

  46. arXiv:1607.03204  [pdf, other]

    stat.ML cs.LG

    Information Projection and Approximate Inference for Structured Sparse Variables

    Authors: Rajiv Khanna, Joydeep Ghosh, Russell Poldrack, Oluwasanmi Koyejo

    Abstract: Approximate inference via information projection has been recently introduced as a general-purpose approach for efficient probabilistic inference given sparse variables. This manuscript goes beyond classical sparsity by proposing efficient algorithms for approximate inference via information projection that are applicable to any structure on the set of variables that admits enumeration using a \em…

    Submitted 11 July, 2016; originally announced July 2016.

  47. arXiv:1602.04208  [pdf, other]

    cs.LG stat.ML

    Pursuits in Structured Non-Convex Matrix Factorizations

    Authors: Rajiv Khanna, Michael Tschannen, Martin Jaggi

    Abstract: Efficiently representing real world data in a succinct and parsimonious manner is of central importance in many fields. We present a generalized greedy pursuit framework, allowing us to efficiently solve structured matrix factorization problems, where the factors are allowed to be from arbitrary sets of structured vectors. Such structure may include sparsity, non-negativeness, order, or a combinat…

    Submitted 12 February, 2016; originally announced February 2016.

  48. arXiv:1511.02024  [pdf, other]

    cs.LG cs.CL

    Towards a Better Understanding of Predict and Count Models

    Authors: S. Sathiya Keerthi, Tobias Schnabel, Rajiv Khanna

    Abstract: In a recent paper, Levy and Goldberg pointed out an interesting connection between prediction-based word embedding models and count models based on pointwise mutual information. Under certain conditions, they showed that both models end up optimizing equivalent objective functions. This paper explores this connection in more detail and lays out the factors leading to differences between these mode…

    Submitted 6 November, 2015; originally announced November 2015.

    Comments: 17 pages

  49. arXiv:1507.01135  [pdf, other]

    stat.AP

    DPM: A State Space Model for Large-Scale Direct Marketing

    Authors: Yubin Park, Rajiv Khanna, Joydeep Ghosh, Daniel Mihalko

    Abstract: We propose a novel statistical model to answer three challenges in direct marketing: which channel to use, which offer to make, and when to offer. There are several potential applications for the proposed model, for example, developing personalized marketing strategies and monitoring members' needs. Furthermore, the results from the model can complement and can be integrated with other existing mo…

    Submitted 4 July, 2015; originally announced July 2015.

  50. Kinetically engendered sub-spinodal length scales in spontaneous dewetting of thin liquid films

    Authors: TirumalaRao Kotni, Jayati Sarkar, Rajesh Khanna

    Abstract: Numerical simulations of spontaneous dewetting of non-slipping, variable viscosity unstable thin liquid films on homogeneous substrates reveal the existence of sub-spinodal lengthscales through formation of satellite holes, a marker of nucleated dewetting and/or heterogeneous substrates, in the late stages of dewetting if the liquid viscosity decreases continually with decreasing film thickness. T…

    Submitted 11 March, 2014; originally announced March 2014.

    Comments: 5 pages, to be submitted to PRL