Showing 1–50 of 54 results for author: Sankaranarayanan, S

  1. arXiv:2406.18562  [pdf, other]

    cs.CV cs.LG

    Views Can Be Deceiving: Improved SSL Through Feature Space Augmentation

    Authors: Kimia Hamidieh, Haoran Zhang, Swami Sankaranarayanan, Marzyeh Ghassemi

    Abstract: Supervised learning methods have been found to exhibit inductive biases favoring simpler features. When such features are spuriously correlated with the label, this can result in suboptimal performance on minority subgroups. Despite the growing popularity of methods which learn from unlabeled data, the extent to which these representations rely on spurious features for prediction is unclear. In th…

    Submitted 28 May, 2024; originally announced June 2024.

  2. arXiv:2405.16344  [pdf, other]

    cs.RO

    Large Language Models Enable Automated Formative Feedback in Human-Robot Interaction Tasks

    Authors: Emily Jensen, Sriram Sankaranarayanan, Bradley Hayes

    Abstract: We claim that LLMs can be paired with formal analysis methods to provide accessible, relevant feedback for HRI tasks. While logic specifications are useful for defining and assessing a task, these representations are not easily interpreted by non-experts. Luckily, LLMs are adept at generating easy-to-understand text that explains difficult concepts. By integrating task assessment outcomes and othe…

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: Presented at Human-LLM Interaction Workshop at HRI 2024

  3. arXiv:2405.15982  [pdf, other]

    cs.RO cs.HC

    Automated Assessment and Adaptive Multimodal Formative Feedback Improves Psychomotor Skills Training Outcomes in Quadrotor Teleoperation

    Authors: Emily Jensen, Sriram Sankaranarayanan, Bradley Hayes

    Abstract: The workforce will need to continually upskill in order to meet the evolving demands of industry, especially working with robotic and autonomous systems. Current training methods are not scalable and do not adapt to the skills that learners already possess. In this work, we develop a system that automatically assesses learner skill in a quadrotor teleoperation task using temporal logic task specif…

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: Under review at Human-Agent Interaction 2024 conference

  4. arXiv:2405.07119  [pdf, ps, other]

    cs.GT

    Best-response Algorithms for Integer Convex Quadratic Simultaneous Games

    Authors: Sriram Sankaranarayanan

    Abstract: We evaluate the best-response algorithm in the context of pure-integer convex quadratic games. We provide a sufficient condition that if certain interaction matrices (the product of the inverse of the positive definite matrix defining the convex quadratic terms and the matrix that connects one player's problem to another's) have all their singular values less than 1, then finite termination of the…

    Submitted 11 May, 2024; originally announced May 2024.

  5. arXiv:2405.00687  [pdf, other]

    cs.RO cs.LO

    Optimal Planning for Timed Partial Order Specifications

    Authors: Kandai Watanabe, Georgios Fainekos, Bardh Hoxha, Morteza Lahijanian, Hideki Okamoto, Sriram Sankaranarayanan

    Abstract: This paper addresses the challenge of planning a sequence of tasks to be performed by multiple robots while minimizing the overall completion time subject to timing and precedence constraints. Our approach uses the Timed Partial Orders (TPO) model to specify these constraints. We translate this problem into a Traveling Salesman Problem (TSP) variant with timing and precedence constraints, and we so…

    Submitted 8 March, 2024; originally announced May 2024.

    Comments: 2024 IEEE International Conference on Robotics and Automation

  6. arXiv:2404.07170  [pdf, other]

    cs.SE cs.AI cs.LG cs.PF cs.PL

    Worst-Case Convergence Time of ML Algorithms via Extreme Value Theory

    Authors: Saeid Tizpaz-Niari, Sriram Sankaranarayanan

    Abstract: This paper leverages the statistics of extreme values to predict the worst-case convergence times of machine learning algorithms. Timing is a critical non-functional property of ML systems, and providing the worst-case convergence times is essential to guarantee the availability of ML and its services. However, timing properties such as worst-case convergence times (WCCT) are difficult to verify sinc…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: In 3rd International Conference on AI Engineering: Software Engineering for AI (CAIN 2024)

  7. arXiv:2401.09456  [pdf, ps, other]

    cs.CY cs.LG stat.ML

    Parametric Constraints for Bayesian Knowledge Tracing from First Principles

    Authors: Denis Shchepakin, Sreecharan Sankaranarayanan, Dawn Zimmaro

    Abstract: Bayesian Knowledge Tracing (BKT) is a probabilistic model of a learner's state of mastery corresponding to a knowledge component. It considers the learner's state of mastery as a "hidden" or latent binary variable and updates this state based on the observed correctness of the learner's response using parameters that represent transition probabilities between states. BKT is often represented as a…

    Submitted 22 December, 2023; originally announced January 2024.

    MSC Class: 62F15 (Primary) 62M05; 60J20; 68T30; 91E40 (Secondary)

  8. arXiv:2311.08594  [pdf, other]

    cs.LG stat.ML

    Variational Temporal IRT: Fast, Accurate, and Explainable Inference of Dynamic Learner Proficiency

    Authors: Yunsung Kim, Sreechan Sankaranarayanan, Chris Piech, Candace Thille

    Abstract: Dynamic Item Response Models extend the standard Item Response Theory (IRT) to capture temporal dynamics in learner ability. While these models have the potential to allow instructional systems to actively monitor the evolution of learner proficiency in real time, existing dynamic item response models rely on expensive inference algorithms that scale poorly to massive datasets. In this work, we pr…

    Submitted 14 November, 2023; originally announced November 2023.

    Comments: 9 pages, 16th International Conference on Educational Data Mining (EDM'23)

  9. arXiv:2306.02817  [pdf, other]

    math.OC cs.GT

    Integer Programming Games: A Gentle Computational Overview

    Authors: Margarida Carvalho, Gabriele Dragotto, Andrea Lodi, Sriram Sankaranarayanan

    Abstract: In this tutorial, we present a computational overview on computing Nash equilibria in Integer Programming Games ($IPG$s), $i.e.$, how to compute solutions for a class of non-cooperative and nonconvex games where each player solves a mixed-integer optimization problem. $IPG$s are a broad class of games extending the modeling power of mixed-integer optimization to multi-agent settings. This class of…

    Submitted 12 June, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: To appear in INFORMS TutORials in Operations Research 2023

  10. arXiv:2303.06582  [pdf, other]

    cs.RO

    Certifiably-correct Control Policies for Safe Learning and Adaptation in Assistive Robotics

    Authors: Keyvan Majd, Geoffrey Clark, Tanmay Khandait, Siyu Zhou, Sriram Sankaranarayanan, Georgios Fainekos, Heni Ben Amor

    Abstract: Guaranteeing safety in human-centric applications is critical in robot learning as the learned policies may demonstrate unsafe behaviors in formerly unseen scenarios. We present a framework to locally repair an erroneous policy network to satisfy a set of formal safety constraints using Mixed Integer Quadratic Programming (MIQP). Our MIQP formulation explicitly imposes the safety constraints to th…

    Submitted 12 March, 2023; originally announced March 2023.

    Comments: Appeared in the 36th Conference on Neural Information Processing Systems (NeurIPS) - Robot Learning Workshop. arXiv admin note: substantial text overlap with arXiv:2303.04431

  11. arXiv:2303.04431  [pdf, other]

    cs.RO

    Safe Robot Learning in Assistive Devices through Neural Network Repair

    Authors: Keyvan Majd, Geoffrey Clark, Tanmay Khandait, Siyu Zhou, Sriram Sankaranarayanan, Georgios Fainekos, Heni Ben Amor

    Abstract: Assistive robotic devices are a particularly promising field of application for neural networks (NN) due to the need for personalization and hard-to-model human-machine interaction dynamics. However, NN based estimators and controllers may produce potentially unsafe outputs over previously unseen data points. In this paper, we introduce an algorithm for updating NN control policies to satisfy a gi…

    Submitted 8 March, 2023; originally announced March 2023.

    Journal ref: PMLR 205:2148-2158, 2023

  12. MERLIN: Multi-agent offline and transfer learning for occupant-centric energy flexible operation of grid-interactive communities using smart meter data and CityLearn

    Authors: Kingsley Nweye, Siva Sankaranarayanan, Zoltan Nagy

    Abstract: The decarbonization of buildings presents new challenges for the reliability of the electrical grid as a result of the intermittency of renewable energy sources and increase in grid load brought about by end-use electrification. To restore reliability, grid-interactive efficient buildings can provide flexibility services to the grid through demand response. Residential demand response programs are…

    Submitted 31 December, 2022; originally announced January 2023.

    Comments: under review

  13. arXiv:2211.11031  [pdf, other]

    cs.LG

    Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors

    Authors: Thomas Hartvigsen, Swami Sankaranarayanan, Hamid Palangi, Yoon Kim, Marzyeh Ghassemi

    Abstract: Deployed language models decay over time due to shifting inputs, changing user needs, or emergent world-knowledge gaps. When such problems are identified, we want to make targeted edits while avoiding expensive retraining. However, current model editors, which modify such behaviors of pre-trained models, degrade model performance quickly across multiple, sequential edits. We propose GRACE, a lifel…

    Submitted 17 October, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: Accepted to NeurIPS 2023

  14. arXiv:2211.08194  [pdf]

    cond-mat.mtrl-sci cs.CV cs.LG

    Machine learning for classifying and interpreting coherent X-ray speckle patterns

    Authors: Mingren Shen, Dina Sheyfer, Troy David Loeffler, Subramanian K. R. S. Sankaranarayanan, G. Brian Stephenson, Maria K. Y. Chan, Dane Morgan

    Abstract: Speckle patterns produced by coherent X-ray have a close relationship with the internal structure of materials but quantitative inversion of the relationship to determine structure from speckle patterns is challenging. Here, we investigate the link between coherent X-ray speckle patterns and sample structures using a model 2D disk system and explore the ability of machine learning to learn aspects…

    Submitted 1 September, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  15. MLExchange: A web-based platform enabling exchangeable machine learning workflows for scientific studies

    Authors: Zhuowen Zhao, Tanny Chavez, Elizabeth A. Holman, Guanhua Hao, Adam Green, Harinarayan Krishnan, Dylan McReynolds, Ronald Pandolfi, Eric J. Roberts, Petrus H. Zwart, Howard Yanxon, Nicholas Schwarz, Subramanian Sankaranarayanan, Sergei V. Kalinin, Apurva Mehta, Stuart Campbell, Alexander Hexemer

    Abstract: Machine learning (ML) algorithms are showing a growing trend in helping the scientific communities across different disciplines and institutions to address large and diverse data problems. However, many available ML tools are programmatically demanding and computationally costly. The MLExchange project aims to build a collaborative platform equipped with enabling tools that allow scientists and fa…

    Submitted 26 January, 2023; v1 submitted 20 August, 2022; originally announced August 2022.

    Comments: The accepted version with DOI and IEEE copyright notice in the first page

    Journal ref: 2022 4th IEEE/ACM Annual Workshop on Extreme-scale Experiment-in-the-Loop Computing (XLOOP)

  16. arXiv:2207.10074  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Semantic uncertainty intervals for disentangled latent spaces

    Authors: Swami Sankaranarayanan, Anastasios N. Angelopoulos, Stephen Bates, Yaniv Romano, Phillip Isola

    Abstract: Meaningful uncertainty quantification in computer vision requires reasoning about semantic information -- say, the hair color of the person in a photo or the location of a car on the street. To this end, recent breakthroughs in generative modeling allow us to represent semantic information in disentangled latent spaces, but providing uncertainties on the semantic latent variables has remained chal…

    Submitted 30 November, 2022; v1 submitted 20 July, 2022; originally announced July 2022.

    Comments: Accepted to NeurIPS 2022. Project page: https://swamiviv.github.io/semantic_uncertainty_intervals/

  17. arXiv:2205.12722  [pdf, other]

    cs.LG cs.HC cs.RO

    Mathematical Models of Human Drivers Using Artificial Risk Fields

    Authors: Emily Jensen, Maya Luster, Hansol Yoon, Brandon Pitts, Sriram Sankaranarayanan

    Abstract: In this paper, we use the concept of artificial risk fields to predict how human operators control a vehicle in response to upcoming road situations. A risk field assigns a non-negative risk measure to the state of the system in order to model how close that state is to violating a safety property, such as hitting an obstacle or exiting the road. Using risk fields, we construct a stochastic model…

    Submitted 31 August, 2022; v1 submitted 24 May, 2022; originally announced May 2022.

    Comments: 8 pages, 4 figures, accepted to Intelligent Transportation Systems Conference

  18. arXiv:2203.17274  [pdf, other]

    cs.CV

    Exploring Visual Prompts for Adapting Large-Scale Models

    Authors: Hyojin Bahng, Ali Jahanian, Swami Sankaranarayanan, Phillip Isola

    Abstract: We investigate the efficacy of visual prompting to adapt large-scale models in vision. Following the recent approach from prompt tuning and adversarial reprogramming, we learn a single image perturbation such that a frozen model prompted with this perturbation performs a new task. Through comprehensive experiments, we demonstrate that visual prompting is particularly effective for CLIP and robust…

    Submitted 3 June, 2022; v1 submitted 31 March, 2022; originally announced March 2022.

    Comments: 16 pages, 10 figures

  19. arXiv:2201.04308  [pdf, other]

    cs.GT math.OC

    Cooperative Security Against Interdependent Risks

    Authors: Sanjith Gopalakrishnan, Sriram Sankaranarayanan

    Abstract: Firms in inter-organizational networks such as supply chains or strategic alliances are exposed to interdependent risks. These are risks that are transferable across partner firms. They can be decomposed into intrinsic risks a firm faces from its own operations and extrinsic risks transferred from its partners. Firms broadly have access to two security strategies: either they can independently eli…

    Submitted 8 May, 2023; v1 submitted 12 January, 2022; originally announced January 2022.

  20. arXiv:2111.07932  [pdf, other]

    cs.GT math.OC

    ZERO: Playing Mathematical Programming Games

    Authors: Gabriele Dragotto, Sriram Sankaranarayanan, Margarida Carvalho, Andrea Lodi

    Abstract: We present ZERO, a modular and extensible C++ library interfacing Mathematical Programming and Game Theory. ZERO provides a comprehensive toolkit of modeling interfaces and algorithms for Reciprocally Bilinear Games (RBGs), i.e., simultaneous non-cooperative games where each player solves a mathematical program with a linear objective in the player's variable and bilinear in its opponents' variabl…

    Submitted 12 December, 2021; v1 submitted 15 November, 2021; originally announced November 2021.

  21. arXiv:2111.05726  [pdf, other]

    math.OC cs.GT

    The Cut-and-Play Algorithm: Computing Nash Equilibria via Outer Approximations

    Authors: Margarida Carvalho, Gabriele Dragotto, Andrea Lodi, Sriram Sankaranarayanan

    Abstract: We introduce Cut-and-Play, a practically-efficient algorithm for computing Nash equilibria in simultaneous non-cooperative games where players decide via nonconvex and possibly unbounded optimization problems with separable payoff functions. Our algorithm exploits an intrinsic relationship between the equilibria of the original nonconvex game and the ones of a convexified counterpart. In practice,…

    Submitted 3 May, 2024; v1 submitted 10 November, 2021; originally announced November 2021.

  22. arXiv:2109.14053  [pdf, other]

    physics.app-ph cond-mat.mtrl-sci cs.AI cs.CV

    AutoPhaseNN: Unsupervised Physics-aware Deep Learning of 3D Nanoscale Bragg Coherent Diffraction Imaging

    Authors: Yudong Yao, Henry Chan, Subramanian Sankaranarayanan, Prasanna Balaprakash, Ross J. Harder, Mathew J. Cherukara

    Abstract: The problem of phase retrieval, or the algorithmic recovery of lost phase information from measured intensity alone, underlies various imaging methods from astronomy to nanoscale imaging. Traditional methods of phase retrieval are iterative in nature, and are therefore computationally expensive and time consuming. More recently, deep learning (DL) models have been developed to either provide learn…

    Submitted 4 April, 2022; v1 submitted 28 September, 2021; originally announced September 2021.

    MSC Class: 68T07; 00A79

  23. arXiv:2109.14041  [pdf, other]

    cs.LG eess.SY

    Local Repair of Neural Networks Using Optimization

    Authors: Keyvan Majd, Siyu Zhou, Heni Ben Amor, Georgios Fainekos, Sriram Sankaranarayanan

    Abstract: In this paper, we propose a framework to repair a pre-trained feed-forward neural network (NN) to satisfy a set of properties. We formulate the properties as a set of predicates that impose constraints on the output of NN over the target input domain. We define the NN repair problem as a Mixed Integer Quadratic Program (MIQP) to adjust the weights of a single layer subject to the given predicates…

    Submitted 28 September, 2021; originally announced September 2021.

  24. arXiv:2108.01227  [pdf, other]

    cs.RO

    Predictive Runtime Monitoring for Mobile Robots using Logic-Based Bayesian Intent Inference

    Authors: Hansol Yoon, Sriram Sankaranarayanan

    Abstract: We propose a predictive runtime monitoring framework that forecasts the distribution of future positions of mobile robots in order to detect and avoid impending property violations such as collisions with obstacles or other agents. Our approach uses a restricted class of temporal logic formulas to represent the likely intentions of the agents along with a combination of temporal logic-based optima…

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: Presented at ICRA 2021

  25. arXiv:2108.00893  [pdf, other]

    cs.LG cs.AI

    Static analysis of ReLU neural networks with tropical polyhedra

    Authors: Eric Goubault, Sébastien Palumby, Sylvie Putot, Louis Rustenholz, Sriram Sankaranarayanan

    Abstract: This paper studies the problem of range analysis for feedforward neural networks, which is a basic primitive for applications such as robustness of neural networks, compliance to specifications and reachability analysis of neural-network feedback systems. Our approach focuses on ReLU (rectified linear unit) feedforward neural nets that present specific difficulties: approaches that exploit derivat…

    Submitted 23 August, 2021; v1 submitted 30 July, 2021; originally announced August 2021.

    MSC Class: 68T01; 68N30 ACM Class: F.3.1; I.2.0

  26. arXiv:2107.00218  [pdf]

    cs.SE

    Comparing Example-Based Collaborative Reflection to Problem Solving Practice for Learning during Team-Based Software Engineering Projects

    Authors: Sreecharan Sankaranarayanan, Siddharth Reddy Kandimalla, Christopher Bogart, R. Charles Murray, Haokang An, Michael Hilton, Majd Sakr, Carolyn Rosé

    Abstract: Contributing to the literature on aptitude-treatment interactions between worked examples and problem-solving, this paper addresses differential learning from the two approaches when students are positioned as domain experts learning new concepts. Our evaluation is situated in a team project that is part of an advanced software engineering course. In this course, students who possess foundational…

    Submitted 1 July, 2021; originally announced July 2021.

    Comments: 4 pages, 1 image, 1 table, 14th Computer Supported Collaborative Learning (CSCL) Proceedings at the Annual Meeting of the International Society of the Learning Sciences (ISLS)

    Journal ref: 14th Computer-Supported Collaborative Learning Proceedings at the Annual Meeting of the International Society of the Learning Sciences 2021, pp. 213-216

  27. arXiv:2006.09441  [pdf]

    eess.IV cond-mat.mtrl-sci cs.LG physics.app-ph

    Real-time 3D Nanoscale Coherent Imaging via Physics-aware Deep Learning

    Authors: Henry Chan, Youssef S. G. Nashed, Saugat Kandel, Stephan Hruszkewycz, Subramanian Sankaranarayanan, Ross J. Harder, Mathew J. Cherukara

    Abstract: Phase retrieval, the problem of recovering lost phase information from measured intensity alone, is an inverse problem that is widely faced in various imaging modalities ranging from astronomy to nanoscale imaging. The current process of phase recovery is iterative in nature. As a result, the image formation is time-consuming and computationally expensive, precluding real-time imaging. Here, we us…

    Submitted 16 June, 2020; originally announced June 2020.

  28. arXiv:2006.03963  [pdf, other]

    cs.LG stat.ML

    Combinatorial Black-Box Optimization with Expert Advice

    Authors: Hamid Dadkhahi, Karthikeyan Shanmugam, Jesus Rios, Payel Das, Samuel Hoffman, Troy David Loeffler, Subramanian Sankaranarayanan

    Abstract: We consider the problem of black-box function optimization over the boolean hypercube. Despite the vast literature on black-box function optimization over continuous domains, not much attention has been paid to learning models for optimization over combinatorial domains until recently. However, the computational complexity of the recently devised algorithms is prohibitive even for moderate number…

    Submitted 13 October, 2020; v1 submitted 6 June, 2020; originally announced June 2020.

    Journal ref: KDD 2020

  29. arXiv:2002.10401  [pdf]

    cs.CE cond-mat.mes-hall cond-mat.mtrl-sci

    BLAST: Bridging Length/time scales via Atomistic Simulation Toolkit

    Authors: Henry Chan, Badri Narayanan, Mathew Cherukara, Troy D. Loeffler, Michael G. Sternberg, Anthony Avarca, Subramanian K. R. S. Sankaranarayanan

    Abstract: The ever-increasing power of supercomputers coupled with highly scalable simulation codes has made molecular dynamics an indispensable tool in applications ranging from predictive modeling of materials to computational design and discovery of new materials for a broad range of applications. Multi-fidelity scale bridging between the various flavors of molecular dynamics, i.e., ab-initio, classical a…

    Submitted 21 February, 2020; originally announced February 2020.

  30. arXiv:2001.08088  [pdf, other]

    math.OC cs.LG eess.SY stat.ML

    Training Neural Network Controllers Using Control Barrier Functions in the Presence of Disturbances

    Authors: Shakiba Yaghoubi, Georgios Fainekos, Sriram Sankaranarayanan

    Abstract: Control Barrier Functions (CBF) have been recently utilized in the design of provably safe feedback control laws for nonlinear systems. These feedback control methods typically compute the next control input by solving an online Quadratic Program (QP). Solving a QP in real time can be a computationally expensive process for resource-constrained systems. In this work, we propose to use imitation learn…

    Submitted 18 January, 2020; originally announced January 2020.

  31. arXiv:1912.08112  [pdf, other]

    math.OC cs.LG

    A learning-based algorithm to quickly compute good primal solutions for Stochastic Integer Programs

    Authors: Yoshua Bengio, Emma Frejinger, Andrea Lodi, Rahul Patel, Sriram Sankaranarayanan

    Abstract: We propose a novel approach using supervised learning to obtain near-optimal primal solutions for two-stage stochastic integer programming (2SIP) problems with constraints in the first and second stages. The goal of the algorithm is to predict a "representative scenario" (RS) for the problem such that, deterministically solving the 2SIP with the random realization equal to the RS, gives a near-opt…

    Submitted 17 December, 2019; originally announced December 2019.

  32. arXiv:1910.06452  [pdf, other]

    cs.GT math.OC

    When Nash Meets Stackelberg

    Authors: Margarida Carvalho, Gabriele Dragotto, Felipe Feijoo, Andrea Lodi, Sriram Sankaranarayanan

    Abstract: This article introduces a class of $Nash$ games among $Stackelberg$ players ($NASPs$), namely, a class of simultaneous non-cooperative games where the players solve sequential Stackelberg games. Specifically, each player solves a Stackelberg game where a leader optimizes a (parametrized) linear objective function subject to linear constraints while its followers solve convex quadratic problems sub…

    Submitted 2 November, 2022; v1 submitted 14 October, 2019; originally announced October 2019.

  33. arXiv:1907.10159  [pdf, other]

    cs.CR cs.LG cs.SE

    Efficient Detection and Quantification of Timing Leaks with Neural Networks

    Authors: Saeid Tizpaz-Niari, Pavol Cerny, Sriram Sankaranarayanan, Ashutosh Trivedi

    Abstract: Detection and quantification of information leaks through timing side channels are important to guarantee confidentiality. Although static analysis remains the prevalent approach for detecting timing side channels, it is computationally challenging for real-world applications. In addition, the detection techniques are usually restricted to 'yes' or 'no' answers. In practice, real-world application…

    Submitted 23 July, 2019; originally announced July 2019.

    Comments: To Appear in RV'19

  34. arXiv:1902.03680  [pdf, other]

    cs.LG cs.CV stat.ML

    Learning From Noisy Labels By Regularized Estimation Of Annotator Confusion

    Authors: Ryutaro Tanno, Ardavan Saeedi, Swami Sankaranarayanan, Daniel C. Alexander, Nathan Silberman

    Abstract: The predictive performance of supervised learning algorithms depends on the quality of labels. In a typical label collection process, multiple annotators provide subjective noisy estimates of the "truth" under the influence of their varying skill-levels and biases. Blindly treating these noisy labels as the ground truth limits the accuracy of learning algorithms in the presence of strong disagreem…

    Submitted 17 June, 2019; v1 submitted 10 February, 2019; originally announced February 2019.

    Comments: CVPR 2019, code snippets included

  35. arXiv:1806.04552  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Combining Model-Free Q-Ensembles and Model-Based Approaches for Informed Exploration

    Authors: Sreecharan Sankaranarayanan, Raghuram Mandyam Annasamy, Katia Sycara, Carolyn Penstein Rosé

    Abstract: Q-Ensembles are a model-free approach where input images are fed into different Q-networks and exploration is driven by the assumption that uncertainty is proportional to the variance of the output Q-values obtained. They have been shown to perform relatively well compared to other exploration strategies. Further, model-based approaches, such as encoder-decoder models have been used successfully f…

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: Submitted to the Thirty-Second Annual Conference on Neural Information Processing Systems (NIPS 2018)

  36. arXiv:1804.05288  [pdf, other]

    cs.RO

    Path-Following through Control Funnel Functions

    Authors: Hadi Ravanbakhsh, Sina Aghli, Christoffer Heckman, Sriram Sankaranarayanan

    Abstract: We present an approach to path following using so-called control funnel functions. Synthesizing controllers to "robustly" follow a reference trajectory is a fundamental problem for autonomous vehicles. Robustness, in this context, requires our controllers to handle a specified amount of deviation from the desired trajectory. Our approach considers a timing law that describes how fast to move along…

    Submitted 2 August, 2018; v1 submitted 14 April, 2018; originally announced April 2018.

  37. arXiv:1804.01159  [pdf, other]

    cs.CV

    Crystal Loss and Quality Pooling for Unconstrained Face Verification and Recognition

    Authors: Rajeev Ranjan, Ankan Bansal, Hongyu Xu, Swami Sankaranarayanan, Jun-Cheng Chen, Carlos D. Castillo, Rama Chellappa

    Abstract: In recent years, the performance of face verification and recognition systems based on deep convolutional neural networks (DCNNs) has significantly improved. A typical pipeline for face verification includes training a deep network for subject classification with softmax loss, using the penultimate layer output as the feature descriptor, and generating a cosine similarity score given a pair of fac…

    Submitted 3 February, 2019; v1 submitted 3 April, 2018; originally announced April 2018.

    Comments: Previously portions of this work appeared in arXiv:1703.09507, which was a conference version. This version is an extended journal version of it

  38. arXiv:1712.00699  [pdf, other]

    cs.LG cs.CR stat.ML

    Improving Network Robustness against Adversarial Attacks with Compact Convolution

    Authors: Rajeev Ranjan, Swami Sankaranarayanan, Carlos D. Castillo, Rama Chellappa

    Abstract: Though Convolutional Neural Networks (CNNs) have surpassed human-level performance on tasks such as object classification and face verification, they can easily be fooled by adversarial attacks. These attacks add a small perturbation to the input image that causes the network to misclassify the sample. In this paper, we focus on neutralizing adversarial attacks by compact feature learning. In part…

    Submitted 22 March, 2018; v1 submitted 2 December, 2017; originally announced December 2017.

  39. A Class of Control Certificates to Ensure Reach-While-Stay for Switched Systems

    Authors: Hadi Ravanbakhsh, Sriram Sankaranarayanan

    Abstract: In this article, we consider the problem of synthesizing switching controllers for temporal properties through the composition of simple primitive reach-while-stay (RWS) properties. Reach-while-stay properties specify that the system states starting from an initial set I, must reach a goal (target) set G in finite time, while remaining inside a safe set S. Our approach synthesizes switched control…

    Submitted 28 November, 2017; originally announced November 2017.

    Comments: In Proceedings SYNT 2017, arXiv:1711.10224

    Journal ref: EPTCS 260, 2017, pp. 44-61

  40. arXiv:1711.06969  [pdf, other]

    cs.CV cs.LG stat.ML

    Learning from Synthetic Data: Addressing Domain Shift for Semantic Segmentation

    Authors: Swami Sankaranarayanan, Yogesh Balaji, Arpit Jain, Ser Nam Lim, Rama Chellappa

    Abstract: Visual Domain Adaptation is a problem of immense importance in computer vision. Previous approaches showcase the inability of even deep neural networks to learn informative representations across domain shift. This problem is more severe for tasks where acquiring hand labeled data is extremely hard and tedious. In this work, we focus on adapting the representations learned by segmentation networks…

    Submitted 1 April, 2018; v1 submitted 19 November, 2017; originally announced November 2017.

    Comments: Accepted as spotlight talk at CVPR 2018. Code available here: https://github.com/swamiviv/LSD-seg

  41. arXiv:1705.07819  [pdf, other]

    cs.CV cs.LG stat.ML

    Regularizing deep networks using efficient layerwise adversarial training

    Authors: Swami Sankaranarayanan, Arpit Jain, Rama Chellappa, Ser Nam Lim

    Abstract: Adversarial training has been shown to regularize deep neural networks in addition to increasing their robustness to adversarial examples. However, its impact on very deep state of the art networks has not been fully investigated. In this paper, we present an efficient approach to perform adversarial training by perturbing intermediate layer activations and study the use of such perturbations as a…

    Submitted 28 May, 2018; v1 submitted 22 May, 2017; originally announced May 2017.

    Comments: Published at the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18). Official link: https://www.aaai.org/ocs/index.php/AAAI/AAAI18/paper/view/16634

  42. arXiv:1704.05543  [pdf]

    cs.CY cs.AI cs.CL cs.HC

    Coordinating Collaborative Chat in Massive Open Online Courses

    Authors: Gaurav Singh Tomar, Sreecharan Sankaranarayanan, Xu Wang, Carolyn Penstein Rosé

    Abstract: An earlier study of a collaborative chat intervention in a Massive Open Online Course (MOOC) identified negative effects on attrition stemming from a requirement for students to be matched with exactly one partner prior to beginning the activity. That study raised questions about how to orchestrate a collaborative chat intervention in a MOOC context in order to provide the benefit of synchronous s…

    Submitted 18 April, 2017; originally announced April 2017.

    Comments: 8 pages

    Journal ref: Proceedings of the International Conference of the Learning Sciences 2016, Volume 1, pp 607-614

  43. arXiv:1704.01705  [pdf, other]

    cs.CV

    Generate To Adapt: Aligning Domains using Generative Adversarial Networks

    Authors: Swami Sankaranarayanan, Yogesh Balaji, Carlos D. Castillo, Rama Chellappa

    Abstract: Domain Adaptation is an actively researched problem in Computer Vision. In this work, we propose an approach that leverages unsupervised data to bring the source and target distributions closer in a learned joint feature space. We accomplish this by inducing a symbiotic relationship between the learned embedding and a generative adversarial network. This is in contrast to methods which use the adv…

    Submitted 12 April, 2018; v1 submitted 6 April, 2017; originally announced April 2017.

    Comments: Accepted as spotlight talk at CVPR 2018. Code available here: https://github.com/yogeshbalaji/Generate_To_Adapt

  44. arXiv:1703.07928  [pdf, other]

    cs.CV cs.AI stat.ML

    Self corrective Perturbations for Semantic Segmentation and Classification

    Authors: Swami Sankaranarayanan, Arpit Jain, Ser Nam Lim

    Abstract: Convolutional Neural Networks have been a subject of great importance over the past decade and great strides have been made in their utility for producing state of the art performance in many computer vision problems. However, the behavior of deep networks is yet to be fully understood and is still an active area of research. In this work, we present an intriguing behavior: pre-trained CNNs can be…

    Submitted 3 August, 2017; v1 submitted 23 March, 2017; originally announced March 2017.

    Comments: Accepted to ICCV 2017

  45. arXiv:1702.07103  [pdf, other]

    cs.PL cs.CR cs.FL cs.LG cs.SE

    Discriminating Traces with Time

    Authors: Saeid Tizpaz-Niari, Pavol Cerny, Bor-Yuh Evan Chang, Sriram Sankaranarayanan, Ashutosh Trivedi

    Abstract: What properties about the internals of a program explain the possible differences in its overall running time for different inputs? In this paper, we propose a formal framework for considering this question we dub trace-set discrimination. We show that even though the algorithmic problem of computing maximum likelihood discriminants is NP-hard, approaches based on integer linear programming (ILP)…

    Submitted 23 February, 2017; originally announced February 2017.

    Comments: Published in TACAS 2017

  46. Deep Convolutional Neural Network Features and the Original Image

    Authors: Connor J. Parde, Carlos Castillo, Matthew Q. Hill, Y. Ivette Colon, Swami Sankaranarayanan, Jun-Cheng Chen, Alice J. O'Toole

    Abstract: Face recognition algorithms based on deep convolutional neural networks (DCNNs) have made progress on the task of recognizing faces in unconstrained viewing conditions. These networks operate with compact feature-based face representations derived from learning a very large number of face images. While the learned features produced by DCNNs can be highly robust to changes in viewpoint, illuminatio…

    Submitted 6 November, 2016; originally announced November 2016.

    Comments: Submitted to Face and Gesture Conference, 2017

  47. arXiv:1611.00851  [pdf, other]

    cs.CV

    An All-In-One Convolutional Neural Network for Face Analysis

    Authors: Rajeev Ranjan, Swami Sankaranarayanan, Carlos D. Castillo, Rama Chellappa

    Abstract: We present a multi-purpose algorithm for simultaneous face detection, face alignment, pose estimation, gender recognition, smile detection, age estimation and face recognition using a single deep convolutional neural network (CNN). The proposed method employs a multi-task learning framework that regularizes the shared parameters of CNN and builds a synergy among different domains and tasks. Extens…

    Submitted 2 November, 2016; originally announced November 2016.

  48. arXiv:1605.02686  [pdf, other]

    cs.CV

    Unconstrained Still/Video-Based Face Verification with Deep Convolutional Neural Networks

    Authors: Jun-Cheng Chen, Rajeev Ranjan, Swami Sankaranarayanan, Amit Kumar, Ching-Hui Chen, Vishal M. Patel, Carlos D. Castillo, Rama Chellappa

    Abstract: Over the last five years, methods based on Deep Convolutional Neural Networks (DCNNs) have shown impressive performance improvements for object detection and recognition problems. This has been made possible due to the availability of large annotated datasets, a better understanding of the non-linear mapping between input images and class labels as well as the affordability of GPUs. In this paper,…

    Submitted 17 July, 2017; v1 submitted 9 May, 2016; originally announced May 2016.

    Comments: Accepted by IJCV

  49. arXiv:1604.05417  [pdf, other]

    cs.CV cs.LG stat.ML

    Triplet Probabilistic Embedding for Face Verification and Clustering

    Authors: Swami Sankaranarayanan, Azadeh Alavi, Carlos Castillo, Rama Chellappa

    Abstract: Despite significant progress made over the past twenty five years, unconstrained face verification remains a challenging problem. This paper proposes an approach that couples a deep CNN-based approach with a low-dimensional discriminative embedding learned using triplet probability constraints to solve the unconstrained face verification problem. Aside from yielding performance improvements, this…

    Submitted 17 January, 2017; v1 submitted 18 April, 2016; originally announced April 2016.

    Comments: Oral Paper in BTAS 2016; NVIDIA Best paper Award (http://ieee-biometrics.org/btas2016/awards.html)

  50. arXiv:1602.03418  [pdf, ps, other]

    cs.CV

    Triplet Similarity Embedding for Face Verification

    Authors: Swami Sankaranarayanan, Azadeh Alavi, Rama Chellappa

    Abstract: In this work, we present an unconstrained face verification algorithm and evaluate it on the recently released IJB-A dataset that aims to push the boundaries of face verification methods. The proposed algorithm couples a deep CNN-based approach with a low-dimensional discriminative embedding learnt using triplet similarity constraints in a large margin fashion. Aside from yielding performance impr…

    Submitted 13 March, 2016; v1 submitted 10 February, 2016; originally announced February 2016.