-
TsSHAP: Robust model agnostic feature-based explainability for time series forecasting
Authors:
Vikas C. Raykar,
Arindam Jati,
Sumanta Mukherjee,
Nupur Aggarwal,
Kanthi Sarpatwar,
Giridhar Ganapavarapu,
Roman Vaculin
Abstract:
A trustworthy machine learning model should be accurate as well as explainable. Understanding why a model makes a certain decision defines the notion of explainability. While various flavors of explainability have been well-studied in supervised learning paradigms like classification and regression, literature on explainability for time series forecasting is relatively scarce.
In this paper, we…
▽ More
A trustworthy machine learning model should be accurate as well as explainable. Understanding why a model makes a certain decision defines the notion of explainability. While various flavors of explainability have been well-studied in supervised learning paradigms like classification and regression, literature on explainability for time series forecasting is relatively scarce.
In this paper, we propose a feature-based explainability algorithm, TsSHAP, that can explain the forecast of any black-box forecasting model. The method is agnostic of the forecasting model and can provide explanations for a forecast in terms of interpretable features defined by the user a prior.
The explanations are in terms of the SHAP values obtained by applying the TreeSHAP algorithm on a surrogate model that learns a map** between the interpretable feature space and the forecast of the black-box model.
Moreover, we formalize the notion of local, semi-local, and global explanations in the context of time series forecasting, which can be useful in several scenarios. We validate the efficacy and robustness of TsSHAP through extensive experiments on multiple datasets.
△ Less
Submitted 22 March, 2023;
originally announced March 2023.
-
Modeling and Analysis of Unmanned Remote Guided Vehicle on Rough and Loose Snow Terrain
Authors:
Abhishek D. Patange,
Sharad S. Mulik,
R. Jegadeeshwaran,
Dhananjay R. Jadhav,
Prateek J. Ghatage,
Gaurav R. Doshi,
Rushikesh V Raykar
Abstract:
Survival in remote snow bounded areas is unsafe and risky for mankind. Many problems like arthritis, frostbite, asthma, starvation can caused and lead to death. Indian Military provides transportation vehicles which are heavily built and needs manpower for monitoring. Hence it necessitates facilitating compact transportation to fulfill all requirements. This research aimed at design and analysis o…
▽ More
Survival in remote snow bounded areas is unsafe and risky for mankind. Many problems like arthritis, frostbite, asthma, starvation can caused and lead to death. Indian Military provides transportation vehicles which are heavily built and needs manpower for monitoring. Hence it necessitates facilitating compact transportation to fulfill all requirements. This research aimed at design and analysis of mobile unmanned vehicle for transportation & providing medical help, food and other essential things necessary for surviving in such areas. This can also be used for military services to save the life of solider with less risk. It is typical medium weight, high speed vehicle which carries up to 35 kg load and can negotiate through loose snow, rough terrain with use of caterpillar track. The noteworthy feature of the vehicle is that it constitutes of spiral blades and V shape snowplow to make its way through snow. Hence it will repel the snow in outward direction for self-extraction. It also incorporates skis and hubs for changing the direction and smooth suspension. 3D model of the vehicle is drafted in CATIA and structural analysis is carried out in ANSYS. Control system design and mechatronics integration is proposed to develop the prototype by assembling various components.
△ Less
Submitted 13 January, 2021;
originally announced January 2021.
-
Explainable AI based Interventions for Pre-season Decision Making in Fashion Retail
Authors:
Shravan Sajja,
Nupur Aggarwal,
Sumanta Mukherjee,
Kushagra Manglik,
Satyam Dwivedi,
Vikas Raykar
Abstract:
Future of sustainable fashion lies in adoption of AI for a better understanding of consumer shop** behaviour and using this understanding to further optimize product design, development and sourcing to finally reduce the probability of overproducing inventory. Explainability and interpretability are highly effective in increasing the adoption of AI based tools in creative domains like fashion. I…
▽ More
Future of sustainable fashion lies in adoption of AI for a better understanding of consumer shop** behaviour and using this understanding to further optimize product design, development and sourcing to finally reduce the probability of overproducing inventory. Explainability and interpretability are highly effective in increasing the adoption of AI based tools in creative domains like fashion. In a fashion house, stakeholders like buyers, merchandisers and financial planners have a more quantitative approach towards decision making with primary goals of high sales and reduced dead inventory. Whereas, designers have a more intuitive approach based on observing market trends, social media and runways shows. Our goal is to build an explainable new product forecasting tool with capabilities of interventional analysis such that all the stakeholders (with competing goals) can participate in collaborative decision making process of new product design, development and launch.
△ Less
Submitted 27 July, 2020;
originally announced August 2020.
-
Hyper-local sustainable assortment planning
Authors:
Nupur Aggarwal,
Abhishek Bansal,
Kushagra Manglik,
Kedar Kulkarni,
Vikas Raykar
Abstract:
Assortment planning, an important seasonal activity for any retailer, involves choosing the right subset of products to stock in each store.While existing approaches only maximize the expected revenue, we propose including the environmental impact too, through the Higg Material Sustainability Index. The trade-off between revenue and environmental impact is balanced through a multi-objective optimi…
▽ More
Assortment planning, an important seasonal activity for any retailer, involves choosing the right subset of products to stock in each store.While existing approaches only maximize the expected revenue, we propose including the environmental impact too, through the Higg Material Sustainability Index. The trade-off between revenue and environmental impact is balanced through a multi-objective optimization approach, that yields a Pareto-front of optimal assortments for merchandisers to choose from. Using the proposed approach on a few product categories of a leading fashion retailer shows that choosing assortments with lower environmental impact with a minimal impact on revenue is possible.
△ Less
Submitted 27 July, 2020;
originally announced July 2020.
-
Multi-modal dialog for browsing large visual catalogs using exploration-exploitation paradigm in a joint embedding space
Authors:
Indrani Bhattacharya,
Arkabandhu Chowdhury,
Vikas Raykar
Abstract:
We present a multi-modal dialog system to assist online shoppers in visually browsing through large catalogs. Visual browsing is different from visual search in that it allows the user to explore the wide range of products in a catalog, beyond the exact search matches. We focus on a slightly asymmetric version of the complete multi-modal dialog where the system can understand both text and image q…
▽ More
We present a multi-modal dialog system to assist online shoppers in visually browsing through large catalogs. Visual browsing is different from visual search in that it allows the user to explore the wide range of products in a catalog, beyond the exact search matches. We focus on a slightly asymmetric version of the complete multi-modal dialog where the system can understand both text and image queries but responds only in images. We formulate our problem of "showing $k$ best images to a user" based on the dialog context so far, as sampling from a Gaussian Mixture Model in a high dimensional joint multi-modal embedding space, that embed both the text and the image queries. Our system remembers the context of the dialog and uses an exploration-exploitation paradigm to assist in visual browsing. We train and evaluate the system on a multi-modal dialog dataset that we generate from large catalog data. Our experiments are promising and show that the agent is capable of learning and can display relevant results with an average cosine similarity of 0.85 to the ground truth. Our preliminary human evaluation also corroborates the fact that such a multi-modal dialog system for visual browsing is well-received and is capable of engaging human users.
△ Less
Submitted 29 January, 2019; v1 submitted 28 January, 2019;
originally announced January 2019.
-
Styling with Attention to Details
Authors:
Ayushi Dalmia,
Sachindra Joshi,
Raghavendra Singh,
Vikas Raykar
Abstract:
Fashion as characterized by its nature, is driven by style. In this paper, we propose a method that takes into account the style information to complete a given set of selected fashion items with a complementary fashion item. Complementary items are those items that can be worn along with the selected items according to the style. Addressing this problem facilitates in automatically generating sty…
▽ More
Fashion as characterized by its nature, is driven by style. In this paper, we propose a method that takes into account the style information to complete a given set of selected fashion items with a complementary fashion item. Complementary items are those items that can be worn along with the selected items according to the style. Addressing this problem facilitates in automatically generating stylish fashion ensembles leading to a richer shop** experience for users.
Recently, there has been a surge of online social websites where fashion enthusiasts post the outfit of the day and other users can like and comment on them. These posts contain a gold-mine of information about style. In this paper, we exploit these posts to train a deep neural network which captures style in an automated manner. We pose the problem of predicting complementary fashion items as a sequence to sequence problem where the input is the selected set of fashion items and the output is a complementary fashion item based on the style information learned by the model. We use the encoder decoder architecture to solve this problem of completing the set of fashion items. We evaluate the goodness of the proposed model through a variety of experiments. We empirically observe that our proposed model outperforms competitive baseline like apriori algorithm by ~28 in terms of accuracy for top-1 recommendation to complete the fashion ensemble. We also perform retrieval based experiments to understand the ability of the model to learn style and rank the complementary fashion items and find that using attention in our encoder decoder model helps in improving the mean reciprocal rank by ~24. Qualitatively we find the complementary fashion items generated by our proposed model are richer than the apriori algorithm.
△ Less
Submitted 3 July, 2018;
originally announced July 2018.
-
DeepSolarEye: Power Loss Prediction and Weakly Supervised Soiling Localization via Fully Convolutional Networks for Solar Panels
Authors:
Sachin Mehta,
Amar P. Azad,
Saneem A. Chemmengath,
Vikas Raykar,
Shivkumar Kalyanaraman
Abstract:
The impact of soiling on solar panels is an important and well-studied problem in renewable energy sector. In this paper, we present the first convolutional neural network (CNN) based approach for solar panel soiling and defect analysis. Our approach takes an RGB image of solar panel and environmental factors as inputs to predict power loss, soiling localization, and soiling type. In computer visi…
▽ More
The impact of soiling on solar panels is an important and well-studied problem in renewable energy sector. In this paper, we present the first convolutional neural network (CNN) based approach for solar panel soiling and defect analysis. Our approach takes an RGB image of solar panel and environmental factors as inputs to predict power loss, soiling localization, and soiling type. In computer vision, localization is a complex task which typically requires manually labeled training data such as bounding boxes or segmentation masks. Our proposed approach consists of specialized four stages which completely avoids localization ground truth and only needs panel images with power loss labels for training. The region of impact area obtained from the predicted localization masks are classified into soiling types using the webly supervised learning. For improving localization capabilities of CNNs, we introduce a novel bi-directional input-aware fusion (BiDIAF) block that reinforces the input at different levels of CNN to learn input-specific feature maps. Our empirical study shows that BiDIAF improves the power loss prediction accuracy by about 3% and localization accuracy by about 4%. Our end-to-end model yields further improvement of about 24% on localization when learned in a weakly supervised manner. Our approach is generalizable and showed promising results on web crawled solar panel images. Our system has a frame rate of 22 fps (including all steps) on a NVIDIA TitanX GPU. Additionally, we collected first of it's kind dataset for solar panel image analysis consisting 45,000+ images.
△ Less
Submitted 18 March, 2018; v1 submitted 10 October, 2017;
originally announced October 2017.
-
Joint Learning of Correlated Sequence Labelling Tasks Using Bidirectional Recurrent Neural Networks
Authors:
Vardaan Pahuja,
Anirban Laha,
Shachar Mirkin,
Vikas Raykar,
Lili Kotlerman,
Guy Lev
Abstract:
The stream of words produced by Automatic Speech Recognition (ASR) systems is typically devoid of punctuations and formatting. Most natural language processing applications expect segmented and well-formatted texts as input, which is not available in ASR output. This paper proposes a novel technique of jointly modeling multiple correlated tasks such as punctuation and capitalization using bidirect…
▽ More
The stream of words produced by Automatic Speech Recognition (ASR) systems is typically devoid of punctuations and formatting. Most natural language processing applications expect segmented and well-formatted texts as input, which is not available in ASR output. This paper proposes a novel technique of jointly modeling multiple correlated tasks such as punctuation and capitalization using bidirectional recurrent neural networks, which leads to improved performance for each of these tasks. This method could be extended for joint modeling of any other correlated sequence labeling tasks.
△ Less
Submitted 18 July, 2017; v1 submitted 14 March, 2017;
originally announced March 2017.
-
An Empirical Evaluation of various Deep Learning Architectures for Bi-Sequence Classification Tasks
Authors:
Anirban Laha,
Vikas Raykar
Abstract:
Several tasks in argumentation mining and debating, question-answering, and natural language inference involve classifying a sequence in the context of another sequence (referred as bi-sequence classification). For several single sequence classification tasks, the current state-of-the-art approaches are based on recurrent and convolutional neural networks. On the other hand, for bi-sequence classi…
▽ More
Several tasks in argumentation mining and debating, question-answering, and natural language inference involve classifying a sequence in the context of another sequence (referred as bi-sequence classification). For several single sequence classification tasks, the current state-of-the-art approaches are based on recurrent and convolutional neural networks. On the other hand, for bi-sequence classification problems, there is not much understanding as to the best deep learning architecture. In this paper, we attempt to get an understanding of this category of problems by extensive empirical evaluation of 19 different deep learning architectures (specifically on different ways of handling context) for various problems originating in natural language processing like debating, textual entailment and question-answering. Following the empirical evaluation, we offer our insights and conclusions regarding the architectures we have considered. We also establish the first deep learning baselines for three argumentation mining tasks.
△ Less
Submitted 2 October, 2016; v1 submitted 17 July, 2016;
originally announced July 2016.
-
Taxonomy grounded aggregation of classifiers with different label sets
Authors:
Amrita Saha,
Sathish Indurthi,
Shantanu Godbole,
Subendhu Rongali,
Vikas C. Raykar
Abstract:
We describe the problem of aggregating the label predictions of diverse classifiers using a class taxonomy. Such a taxonomy may not have been available or referenced when the individual classifiers were designed and trained, yet map** the output labels into the taxonomy is desirable to integrate the effort spent in training the constituent classifiers. A hierarchical taxonomy representing some d…
▽ More
We describe the problem of aggregating the label predictions of diverse classifiers using a class taxonomy. Such a taxonomy may not have been available or referenced when the individual classifiers were designed and trained, yet map** the output labels into the taxonomy is desirable to integrate the effort spent in training the constituent classifiers. A hierarchical taxonomy representing some domain knowledge may be different from, but partially mappable to, the label sets of the individual classifiers. We present a heuristic approach and a principled graphical model to aggregate the label predictions by grounding them into the available taxonomy. Our model aggregates the labels using the taxonomy structure as constraints to find the most likely hierarchically consistent class. We experimentally validate our proposed method on image and text classification tasks.
△ Less
Submitted 1 December, 2015;
originally announced December 2015.
-
An Autoencoder Approach to Learning Bilingual Word Representations
Authors:
Sarath Chandar A P,
Stanislas Lauly,
Hugo Larochelle,
Mitesh M. Khapra,
Balaraman Ravindran,
Vikas Raykar,
Amrita Saha
Abstract:
Cross-language learning allows us to use training data from one language to build models for a different language. Many approaches to bilingual learning require that we have word-level alignment of sentences from parallel corpora. In this work we explore the use of autoencoder-based methods for cross-language learning of vectorial word representations that are aligned between two languages, while…
▽ More
Cross-language learning allows us to use training data from one language to build models for a different language. Many approaches to bilingual learning require that we have word-level alignment of sentences from parallel corpora. In this work we explore the use of autoencoder-based methods for cross-language learning of vectorial word representations that are aligned between two languages, while not relying on word-level alignments. We show that by simply learning to reconstruct the bag-of-words representations of aligned sentences, within and between languages, we can in fact learn high-quality representations and do without word alignments. Since training autoencoders on word observations presents certain computational issues, we propose and compare different variations adapted to this setting. We also propose an explicit correlation maximizing regularizer that leads to significant improvement in the performance. We empirically investigate the success of our approach on the problem of cross-language test classification, where a classifier trained on a given language (e.g., English) must learn to generalize to a different language (e.g., German). These experiments demonstrate that our approaches are competitive with the state-of-the-art, achieving up to 10-14 percentage point improvements over the best reported results on this task.
△ Less
Submitted 6 February, 2014;
originally announced February 2014.