Skip to main content

Showing 1–50 of 81 results for author: Sahay, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.12241  [pdf, other

    cs.CL cs.AI

    Introducing v0.5 of the AI Safety Benchmark from MLCommons

    Authors: Bertie Vidgen, Adarsh Agrawal, Ahmed M. Ahmed, Victor Akinwande, Namir Al-Nuaimi, Najla Alfaraj, Elie Alhajjar, Lora Aroyo, Trupti Bavalatti, Max Bartolo, Borhane Blili-Hamelin, Kurt Bollacker, Rishi Bomassani, Marisa Ferrara Boston, Siméon Campos, Kal Chakra, Canyu Chen, Cody Coleman, Zacharie Delpierre Coudert, Leon Derczynski, Debojyoti Dutta, Ian Eisenberg, James Ezick, Heather Frase, Brian Fuller , et al. (75 additional authors not shown)

    Abstract: This paper introduces v0.5 of the AI Safety Benchmark, which has been created by the MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess the safety risks of AI systems that use chat-tuned language models. We introduce a principled approach to specifying and constructing the benchmark, which for v0.5 covers only a single use case (an adult chatting to a general-pu… ▽ More

    Submitted 13 May, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

  2. arXiv:2311.18041  [pdf, other

    cs.CL

    Zero-shot Conversational Summarization Evaluations with small Large Language Models

    Authors: Ramesh Manuvinakurike, Saurav Sahay, Sangeeta Manepalli, Lama Nachman

    Abstract: Large Language Models (LLMs) exhibit powerful summarization abilities. However, their capabilities on conversational summarization remains under explored. In this work we evaluate LLMs (approx. 10 billion parameters) on conversational summarization and showcase their performance on various prompts. We show that the summaries generated by models depend on the instructions and the performance of LLM… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Accepted at RoF0Mo workshop at Neurips 2023

  3. Design Theory for Societal Digital Transformation: The Case of Digital Global Health

    Authors: Jorn Braa, Sundeep Sahay, Eric Monteiro

    Abstract: With societal challenges, including but not limited to human development, equity, social justice, and climate change, societal-level digital transformation (SDT) is of imminent relevance and theoretical interest. While building on local-level efforts, societal-level transformation is a nonlinear extension of the local level. Unfortunately, academic discourse on digital transformation has largely l… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Journal ref: Journal of the AIS, 24(6), 2023

  4. arXiv:2310.11079  [pdf, other

    cs.CL cs.AI

    Learning from Red Teaming: Gender Bias Provocation and Mitigation in Large Language Models

    Authors: Hsuan Su, Cheng-Chu Cheng, Hua Farn, Shachi H Kumar, Saurav Sahay, Shang-Tse Chen, Hung-yi Lee

    Abstract: Recently, researchers have made considerable improvements in dialogue systems with the progress of large language models (LLMs) such as ChatGPT and GPT-4. These LLM-based chatbots encode the potential biases while retaining disparities that can harm humans during interactions. The traditional biases investigation methods often rely on human-written test cases. However, these test cases are usually… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  5. arXiv:2306.00482  [pdf, other

    cs.CY cs.CL cs.SD eess.AS math.HO

    Inspecting Spoken Language Understanding from Kids for Basic Math Learning at Home

    Authors: Eda Okur, Roddy Fuentes Alba, Saurav Sahay, Lama Nachman

    Abstract: Enriching the quality of early childhood education with interactive math learning at home systems, empowered by recent advances in conversational AI technologies, is slowly becoming a reality. With this motivation, we implement a multimodal dialogue system to support play-based learning experiences at home, guiding kids to master basic math concepts. This work explores Spoken Language Understandin… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: Proceedings of the 18th Workshop on Innovative Use of NLP for Building Educational Applications (BEA) at ACL 2023

  6. arXiv:2303.04361  [pdf, other

    cs.CL cs.CV

    Sample Efficient Multimodal Semantic Augmentation for Incremental Summarization

    Authors: Sumanta Bhattacharyya, Ramesh Manuvinakurike, Sahisnu Mazumder, Saurav Sahay

    Abstract: In this work, we develop a prompting approach for incremental summarization of task videos. We develop a sample-efficient few-shot approach for extracting semantic concepts as an intermediate step. We leverage an existing model for extracting the concepts from the images and extend it to videos and introduce a clustering and querying approach for sample efficiency, motivated by the recent advances… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

  7. arXiv:2302.05888  [pdf, other

    cs.CL cs.AI cs.LG

    Position Matters! Empirical Study of Order Effect in Knowledge-grounded Dialogue

    Authors: Hsuan Su, Shachi H Kumar, Sahisnu Mazumder, Wenda Chen, Ramesh Manuvinakurike, Eda Okur, Saurav Sahay, Lama Nachman, Shang-Tse Chen, Hung-yi Lee

    Abstract: With the power of large pretrained language models, various research works have integrated knowledge into dialogue systems. The traditional techniques treat knowledge as part of the input sequence for the dialogue system, prepending a set of knowledge statements in front of dialogue history. However, such a mechanism forces knowledge sets to be concatenated in an ordered manner, making models impl… ▽ More

    Submitted 12 February, 2023; originally announced February 2023.

  8. arXiv:2212.01032  [pdf, other

    cs.CL cs.AI

    Systematic Analysis for Pretrained Language Model Priming for Parameter-Efficient Fine-tuning

    Authors: Shih-Cheng Huang, Shih-Heng Wang, Min-Han Shih, Saurav Sahay, Hung-yi Lee

    Abstract: Parameter-efficient (PE) methods (like Prompts or Adapters) for adapting pre-trained language models (PLM) to downstream tasks have been popular recently. However, hindrances still prevent these methods from reaching their full potential. For example, two significant challenges are few-shot adaptation and cross-task generalization. To tackle these issues, we propose a general PE priming framework… ▽ More

    Submitted 30 May, 2024; v1 submitted 2 December, 2022; originally announced December 2022.

  9. arXiv:2211.03511  [pdf, other

    cs.CL

    End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics

    Authors: Eda Okur, Saurav Sahay, Roddy Fuentes Alba, Lama Nachman

    Abstract: The advances in language-based Artificial Intelligence (AI) technologies applied to build educational applications can present AI for social-good opportunities with a broader positive impact. Across many disciplines, enhancing the quality of mathematics education is crucial in building critical thinking and problem-solving skills at younger ages. Conversational AI systems have started maturing to… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: Proceedings of the 1st Workshop on Mathematical Natural Language Processing (MathNLP) at EMNLP 2022

  10. arXiv:2211.01824  [pdf, other

    cs.CL

    Human in the loop approaches in multi-modal conversational task guidance system development

    Authors: Ramesh Manuvinakurike, Sovan Biswas, Giuseppe Raffa, Richard Beckwith, Anthony Rhodes, Meng Shi, Gesem Gudino Mejia, Saurav Sahay, Lama Nachman

    Abstract: Development of task guidance systems for aiding humans in a situated task remains a challenging problem. The role of search (information retrieval) and conversational systems for task guidance has immense potential to help the task performers achieve various goals. However, there are several technical challenges that need to be addressed to deliver such conversational systems, where common supervi… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: SCAI @ SIGIR

  11. arXiv:2206.03931  [pdf, other

    cs.CL cs.AI cs.LG

    Learning to Generate Prompts for Dialogue Generation through Reinforcement Learning

    Authors: Hsuan Su, Pohan Chi, Shih-Cheng Huang, Chung Ho Lam, Saurav Sahay, Shang-Tse Chen, Hung-yi Lee

    Abstract: Much literature has shown that prompt-based learning is an efficient method to make use of the large pre-trained language model. Recent works also exhibit the possibility of steering a chatbot's output by plugging in an appropriate prompt. Gradient-based methods are often used to perturb the prompts. However, some language models are not even available to the public. In this work, we first explore… ▽ More

    Submitted 13 October, 2022; v1 submitted 8 June, 2022; originally announced June 2022.

  12. Deep Reinforcement Learning for Cybersecurity Threat Detection and Protection: A Review

    Authors: Mohit Sewak, Sanjay K. Sahay, Hemant Rathore

    Abstract: The cybersecurity threat landscape has lately become overly complex. Threat actors leverage weaknesses in the network and endpoint security in a very coordinated manner to perpetuate sophisticated attacks that could bring down the entire network and many critical hosts in the network. Increasingly advanced deep and machine learning-based solutions have been used in threat detection and protection.… ▽ More

    Submitted 6 June, 2022; originally announced June 2022.

    Journal ref: International Conference On Secure Knowledge Management In Artificial Intelligence Era. Springer, Cham, 2021

  13. arXiv:2205.13754  [pdf, other

    cs.CL cs.HC

    NLU for Game-based Learning in Real: Initial Evaluations

    Authors: Eda Okur, Saurav Sahay, Lama Nachman

    Abstract: Intelligent systems designed for play-based interactions should be contextually aware of the users and their surroundings. Spoken Dialogue Systems (SDS) are critical for these interactive agents to carry out effective goal-oriented communication with users in real-time. For the real-world (i.e., in-the-wild) deployment of such conversational agents, improving the Natural Language Understanding (NL… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Comments: Proceedings of the Games and Natural Language Processing Workshop at LREC 2022

  14. arXiv:2205.04006  [pdf, other

    cs.CL cs.AI

    Data Augmentation with Paraphrase Generation and Entity Extraction for Multimodal Dialogue System

    Authors: Eda Okur, Saurav Sahay, Lama Nachman

    Abstract: Contextually aware intelligent agents are often required to understand the users and their surroundings in real-time. Our goal is to build Artificial Intelligence (AI) systems that can assist children in their learning process. Within such complex frameworks, Spoken Dialogue Systems (SDS) are crucial building blocks to handle efficient task-oriented communication with children in game-based learni… ▽ More

    Submitted 8 May, 2022; originally announced May 2022.

    Comments: Proceedings of the 13th International Conference on Language Resources and Evaluation (LREC 2022)

  15. arXiv:2203.07657  [pdf, other

    cs.CL cs.AI cs.CY

    Seamlessly Integrating Factual Information and Social Content with Persuasive Dialogue

    Authors: Maximillian Chen, Weiyan Shi, Feifan Yan, Ryan Hou, **gwen Zhang, Saurav Sahay, Zhou Yu

    Abstract: Complex conversation settings such as persuasion involve communicating changes in attitude or behavior, so users' perspectives need to be addressed, even when not directly related to the topic. In this work, we contribute a novel modular dialogue system framework that seamlessly integrates factual information and social content into persuasive dialogue. Our framework is generalizable to any dialog… ▽ More

    Submitted 23 September, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

    Comments: To appear in Proceedings of AACL-IJCNLP 2022; 16 pages, 4 figures, 7 tables

  16. arXiv:2112.02246  [pdf, other

    cs.CL

    Controllable Response Generation for Assistive Use-cases

    Authors: Shachi H Kumar, Hsuan Su, Ramesh Manuvinakurike, Saurav Sahay, Lama Nachman

    Abstract: Conversational agents have become an integral part of the general population for simple task enabling situations. However, these systems are yet to have any social impact on the diverse and minority population, for example, hel** people with neurological disorders, for example ALS, and people with speech, language and social communication disorders. Language model technology can play a huge role… ▽ More

    Submitted 4 December, 2021; originally announced December 2021.

  17. arXiv:2111.14484  [pdf, other

    cs.ET

    Energy-Efficient Implementation of Generative Adversarial Networks on Passive RRAM Crossbar Arrays

    Authors: Siddharth Satyam, Honey Nikam, Shubham Sahay

    Abstract: Generative algorithms such as GANs are at the cusp of next revolution in the field of unsupervised learning and large-scale artificial data generation. However, the adversarial (competitive) co-training of the discriminative and generative networks in GAN makes them computationally intensive and hinders their deployment on the resource-constrained IoT edge devices. Moreover, the frequent data tran… ▽ More

    Submitted 19 April, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

  18. Long Short-Term Memory Implementation Exploiting Passive RRAM Crossbar Array

    Authors: Honey Nikam, Siddharth Satyam, Shubham Sahay

    Abstract: The ever-increasing demand to extract temporal correlations across sequential data and perform context-based learning in this era of big data has led to the development of long short-term memory (LSTM) networks. Furthermore, there is an urgent need to perform these time-series data-dependent applications including speech/video processing and recognition, language modelling and translation, etc. on… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

  19. arXiv:2110.09654  [pdf, other

    cs.CR

    Privacy-Preserving Mutual Authentication and Key Agreement Scheme for Multi-Server Healthcare System

    Authors: Trupil Limbasiya, Sanjay K. Sahay, Bharath Sridharan

    Abstract: The usage of different technologies and smart devices helps people to get medical services remotely for multiple benefits. Thus, critical and sensitive data is exchanged between a user and a doctor. When health data is transmitted over a common channel, it becomes essential to preserve various privacy and security properties in the system. Further, the number of users for remote services is increa… ▽ More

    Submitted 13 October, 2021; originally announced October 2021.

    Comments: 22 Pages

    Journal ref: Information Systems Frontiers, Vol. 23, No. 4, p. 835, 2021

  20. ADVERSARIALuscator: An Adversarial-DRL Based Obfuscator and Metamorphic Malware SwarmGenerator

    Authors: Mohit Sewak, Sanjay K. Sahay, Hemant Rathore

    Abstract: Advanced metamorphic malware and ransomware, by using obfuscation, could alter their internal structure with every attack. If such malware could intrude even into any of the IoT networks, then even if the original malware instance gets detected, by that time it can still infect the entire network. It is challenging to obtain training data for such evasive malware. Therefore, in this paper, we pres… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Journal ref: 2021 International Joint Conference on Neural Networks (IJCNN), 2021, pp. 1-9

  21. LSTM Hyper-Parameter Selection for Malware Detection: Interaction Effects and Hierarchical Selection Approach

    Authors: Mohit Sewak, Sanjay K. Sahay, Hemant Rathore

    Abstract: Long-Short-Term-Memory (LSTM) networks have shown great promise in artificial intelligence (AI) based language modeling. Recently, LSTM networks have also become popular for designing AI-based Intrusion Detection Systems (IDS). However, its applicability in IDS is studied largely in the default settings as used in language models. Whereas security applications offer distinct conditions and hence w… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Journal ref: 2021 International Joint Conference on Neural Networks (IJCNN), 2021, pp. 1-9

  22. DRo: A data-scarce mechanism to revolutionize the performance of Deep Learning based Security Systems

    Authors: Mohit Sewak, Sanjay K. Sahay, Hemant Rathore

    Abstract: Supervised Deep Learning requires plenty of labeled data to converge, and hence perform optimally for task-specific learning. Therefore, we propose a novel mechanism named DRo (for Deep Routing) for data-scarce domains like security. The DRo approach builds upon some of the recent developments in Deep-Clustering. In particular, it exploits the self-augmented training mechanism using synthetically… ▽ More

    Submitted 12 September, 2021; originally announced September 2021.

    Journal ref: 2021 IEEE 46th Conference on Local Computer Networks (LCN), 2021, pp. 581-588

  23. arXiv:2108.09950  [pdf

    cs.CY

    Digital Resilience for What? Case Study of South Korea

    Authors: Kyung Ryul Park, Sundeep Sahay, Jørn Braa, Pamod Amarakoon

    Abstract: Resilience has become an emerging topic in various fields of academic research. In spite of its widespread use, there remains conceptual confusion over what resilience means particularly in multi-disciplinary studies including the field of ICT and Development. With the potential of digital technology, research is needed to critically question what key socio-institutional values related to resilien… ▽ More

    Submitted 23 August, 2021; originally announced August 2021.

    Comments: In proceedings of the 1st Virtual Conference on Implications of Information and Digital Technologies for Development, 2021

  24. arXiv:2108.09731  [pdf

    cs.CY

    Reflections, Learnings and Proposed Interventions on Data Validation and Data Use for Action in Health: A Case of Mozambique

    Authors: Nilza Collinson, Zeferino Saugene, Jørn Braa, Sundeep Sahay, Emilio Mosse

    Abstract: The ideal of a country's health information system (HIS) is to develop processes that ensure easy collection of relevant data and enable their conversion to useful health indicators, which guide decision making and support health interventions. In many Low- and Middle-Income Countries (LMICs), actively engaged in health reform efforts, the role of HIS is crucial, particularly in terms of quality o… ▽ More

    Submitted 22 August, 2021; originally announced August 2021.

    Comments: In proceedings of the 1st Virtual Conference on Implications of Information and Digital Technologies for Development, 2021

  25. arXiv:2108.09727  [pdf

    cs.CY

    Building Agility in COVID-19 Information Systems Response in Sri Lanka: Recommendations for Practice

    Authors: Pamod Amarakoon, Jorn Braa, Sundeep Sahay

    Abstract: COVID-19 pandemic tested the capacity of information systems in countries on the ability to rapidly respond to requirements which were not anticipated. This article analyzes the socio-technical determinants of agility in building the IS response to the COVID-19 pandemic in Sri Lanka. We deploy qualitative research methods to explore the case study of implementation of COVID-19 surveillance system… ▽ More

    Submitted 22 August, 2021; originally announced August 2021.

    Comments: In proceedings of the 1st Virtual Conference on Implications of Information and Digital Technologies for Development, 2021

  26. arXiv:2108.09726  [pdf

    cs.CY

    Building Resilient Information Systems for Child Nutrition in Post-conflict Sri Lanka during COVID-19 Pandemic

    Authors: Pamod Amarakoon, Jørn Braa, Sundeep Sahay, Lakmini Magodarathna, Rajeev Moorthy

    Abstract: Post-conflict, low-resource settings are menaced with challenges related to low-resources, economic and social instability. The objective of the study is to understand the socio-technical determinants of resilience of resilience of routine information systems a backdrop of an implementation of a mobile-based nutrition information system in a post-conflict district in Sri Lanka. The longitudinal ev… ▽ More

    Submitted 22 August, 2021; originally announced August 2021.

    Comments: In proceedings of the 1st Virtual Conference on Implications of Information and Digital Technologies for Development, 2021

  27. arXiv:2108.09718  [pdf

    cs.CY

    Digital Global Public Goods

    Authors: Johan Ivar Sæbø, Brian Nicholson, Petter Nielsen, Sundeep Sahay

    Abstract: The purpose of this paper is to define and conceptualize digital global public goods (DGPGs) and illustrate the importance of contextual relevance in ICT4D projects. Recent studies have examined the importance of digital artefacts with public goods traits, emphasizing the significant potential for socio-economic development. However, we know little about the theoretical and practical dimensions of… ▽ More

    Submitted 22 August, 2021; originally announced August 2021.

    Comments: In proceedings of the 1st Virtual Conference on Implications of Information and Digital Technologies for Development, 2021

  28. arXiv:2104.13406  [pdf, other

    cs.CL cs.HC

    Semi-supervised Interactive Intent Labeling

    Authors: Saurav Sahay, Eda Okur, Nagib Hakim, Lama Nachman

    Abstract: Building the Natural Language Understanding (NLU) modules of task-oriented Spoken Dialogue Systems (SDS) involves a definition of intents and entities, collection of task-relevant data, annotating the data with intents and entities, and then repeating the same process over and over again for adding any functionality/enhancement to the SDS. In this work, we showcase an Intent Bulk Labeling system w… ▽ More

    Submitted 11 May, 2021; v1 submitted 27 April, 2021; originally announced April 2021.

    Comments: NAACL 2021 - Workshop on Data Science with Human-in-the-loop: Language Advances (DaSH-LA)

  29. arXiv:2103.16429  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Put Chatbot into Its Interlocutor's Shoes: New Framework to Learn Chatbot Responding with Intention

    Authors: Hsuan Su, Jiun-Hao Jhan, Fan-yun Sun, Saurav Sahay, Hung-yi Lee

    Abstract: Most chatbot literature that focuses on improving the fluency and coherence of a chatbot, is dedicated to making chatbots more human-like. However, very little work delves into what really separates humans from chatbots -- humans intrinsically understand the effect their responses have on the interlocutor and often respond with an intention such as proposing an optimistic view to make the interloc… ▽ More

    Submitted 23 April, 2021; v1 submitted 30 March, 2021; originally announced March 2021.

    Comments: Accepted at NAACL-HLT 2021

  30. arXiv:2103.00643  [pdf, other

    cs.CR cs.LG

    Identification of Significant Permissions for Efficient Android Malware Detection

    Authors: Hemant Rathore, Sanjay K. Sahay, Ritvik Rajvanshi, Mohit Sewak

    Abstract: Since Google unveiled Android OS for smartphones, malware are thriving with 3Vs, i.e. volume, velocity, and variety. A recent report indicates that one out of every five business/industry mobile application leaks sensitive personal data. Traditional signature/heuristic-based malware detection systems are unable to cope up with current malware challenges and thus threaten the Android ecosystem. The… ▽ More

    Submitted 28 February, 2021; originally announced March 2021.

    Comments: BROADNETS, 2020

  31. arXiv:2103.00637  [pdf, other

    cs.CR cs.LG

    Detection of Malicious Android Applications: Classical Machine Learning vs. Deep Neural Network Integrated with Clustering

    Authors: Hemant Rathore, Sanjay K. Sahay, Shivin Thukral, Mohit Sewak

    Abstract: Today anti-malware community is facing challenges due to the ever-increasing sophistication and volume of malware attacks developed by adversaries. Traditional malware detection mechanisms are not able to cope-up with next-generation malware attacks. Therefore in this paper, we propose effective and efficient Android malware detection models based on machine learning and deep learning integrated w… ▽ More

    Submitted 28 February, 2021; originally announced March 2021.

    Comments: BROADNETS, 2020

  32. arXiv:2102.00898  [pdf, other

    cs.CR cs.AI cs.LG

    DRLDO: A novel DRL based De-ObfuscationSystem for Defense against Metamorphic Malware

    Authors: Mohit Sewak, Sanjay K. Sahay, Hemant Rathore

    Abstract: In this paper, we propose a novel mechanism to normalize metamorphic and obfuscated malware down at the opcode level and hence create an advanced metamorphic malware de-obfuscation and defense system. We name this system DRLDO, for Deep Reinforcement Learning based De-Obfuscator. With the inclusion of the DRLDO as a sub-component, an existing Intrusion Detection System could be augmented with defe… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Journal ref: Defence Science Journal, 71(1), 55-65

  33. Robust Android Malware Detection System against Adversarial Attacks using Q-Learning

    Authors: Hemant Rathore, Sanjay K. Sahay, Piyush Nikam, Mohit Sewak

    Abstract: The current state-of-the-art Android malware detection systems are based on machine learning and deep learning models. Despite having superior performance, these models are susceptible to adversarial attacks. Therefore in this paper, we developed eight Android malware detection models based on machine learning and deep neural network and investigated their robustness against adversarial attacks. F… ▽ More

    Submitted 27 January, 2021; originally announced January 2021.

    Comments: Inf Syst Front (2020)

  34. arXiv:2012.15375  [pdf, other

    cs.CL cs.AI

    Refine and Imitate: Reducing Repetition and Inconsistency in Persuasion Dialogues via Reinforcement Learning and Human Demonstration

    Authors: Weiyan Shi, Yu Li, Saurav Sahay, Zhou Yu

    Abstract: Persuasion dialogue systems reflect the machine's ability to make strategic moves beyond verbal communication, and therefore differentiate themselves from task-oriented or open-domain dialogue systems and have their own unique values. However, the repetition and inconsistency problems still persist in dialogue response generation and could substantially impact user experience and impede the persua… ▽ More

    Submitted 22 October, 2022; v1 submitted 30 December, 2020; originally announced December 2020.

    Comments: EMNLP 2021 Findings

  35. Assessment of the Relative Importance of different hyper-parameters of LSTM for an IDS

    Authors: Mohit Sewak, Sanjay K. Sahay, Hemant Rathore

    Abstract: Recurrent deep learning language models like the LSTM are often used to provide advanced cyber-defense for high-value assets. The underlying assumption for using LSTM networks for malware-detection is that the op-code sequence of malware could be treated as a (spoken) language representation. There are differences between any spoken-language (sequence of words/sentences) and the machine-language (… ▽ More

    Submitted 26 December, 2020; originally announced December 2020.

    Journal ref: 2020 IEEE REGION 10 CONFERENCE (TENCON), Osaka, Japan, 2020, pp. 414-419

  36. arXiv:2010.08608  [pdf, other

    cs.CR cs.AI cs.LG

    DOOM: A Novel Adversarial-DRL-Based Op-Code Level Metamorphic Malware Obfuscator for the Enhancement of IDS

    Authors: Mohit Sewak, Sanjay K. Sahay, Hemant Rathore

    Abstract: We designed and developed DOOM (Adversarial-DRL based Opcode level Obfuscator to generate Metamorphic malware), a novel system that uses adversarial deep reinforcement learning to obfuscate malware at the op-code level for the enhancement of IDS. The ultimate goal of DOOM is not to give a potent weapon in the hands of cyber-attackers, but to create defensive-mechanisms against advanced zero-day at… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

  37. DeepIntent: ImplicitIntent based Android IDS with E2E Deep Learning architecture

    Authors: Mohit Sewak, Sanjay K. Sahay, Hemant Rathore

    Abstract: The Intent in Android plays an important role in inter-process and intra-process communications. The implicit Intent that an application could accept are declared in its manifest and are amongst the easiest feature to extract from an apk. Implicit Intents could even be extracted online and in real-time. So far neither the feasibility of develo** an Intrusion Detection System solely on implicit I… ▽ More

    Submitted 16 October, 2020; originally announced October 2020.

  38. arXiv:2008.02797  [pdf, other

    cs.CV eess.IV

    A Novel Spatial-Spectral Framework for the Classification of Hyperspectral Satellite Imagery

    Authors: Shriya TP Gupta, Sanjay K Sahay

    Abstract: Hyper-spectral satellite imagery is now widely being used for accurate disaster prediction and terrain feature classification. However, in such classification tasks, most of the present approaches use only the spectral information contained in the images. Therefore, in this paper, we present a novel framework that takes into account both the spectral and spatial information contained in the data f… ▽ More

    Submitted 22 July, 2020; originally announced August 2020.

    Comments: 13 Pages, 15 Figures, EANN-2020

    Journal ref: Springer, INNS, Vol. 2, pp 227-239, 2020

  39. arXiv:2007.03876  [pdf, other

    cs.CL

    Audio-Visual Understanding of Passenger Intents for In-Cabin Conversational Agents

    Authors: Eda Okur, Shachi H Kumar, Saurav Sahay, Lama Nachman

    Abstract: Building multimodal dialogue understanding capabilities situated in the in-cabin context is crucial to enhance passenger comfort in autonomous vehicle (AV) interaction systems. To this end, understanding passenger intents from spoken interactions and vehicle vision systems is a crucial component for develo** contextual and visually grounded conversational agents for AV. Towards this goal, we exp… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: ACL 2020 - Second Grand-Challenge and Workshop on Multimodal Language (Challenge-HML)

  40. arXiv:2007.02038  [pdf, other

    cs.CL

    Low Rank Fusion based Transformers for Multimodal Sequences

    Authors: Saurav Sahay, Eda Okur, Shachi H Kumar, Lama Nachman

    Abstract: Our senses individually work in a coordinated fashion to express our emotional intentions. In this work, we experiment with modeling modality-specific sensory signals to attend to our latent multimodal emotional intentions and vice versa expressed via low-rank multimodal fusion and multimodal transformers. The low-rank factorization of multimodal fusion amongst the modalities helps represent appro… ▽ More

    Submitted 4 July, 2020; originally announced July 2020.

    Comments: ACL 2020 workshop on Second Grand Challenge and Workshop on Multimodal Language

  41. arXiv:2004.10010  [pdf, other

    cs.CR

    Secure and Energy-Efficient Key-Agreement Protocol for Multi-Server Architecture

    Authors: Trupil Limbasiya, Sanjay K. Sahay

    Abstract: Authentication schemes are practised globally to verify the legitimacy of users and servers for the exchange of data in different facilities. Generally, the server verifies a user to provide resources for different purposes. But due to the large network system, the authentication process has become complex and therefore, time-to-time different authentication protocols have been proposed for the mu… ▽ More

    Submitted 19 April, 2020; originally announced April 2020.

    Comments: 17 Pages, SKM-2019

    Journal ref: Springer, CCIS, Vol. 1186, pp. 82-97, 2020

  42. arXiv:2002.05839  [pdf, other

    cs.CR

    LinkedIn's Audience Engagements API: A Privacy Preserving Data Analytics System at Scale

    Authors: Ryan Rogers, Subbu Subramaniam, Sean Peng, David Durfee, Seunghyun Lee, Santosh Kumar Kancha, Shraddha Sahay, Parvez Ahammad

    Abstract: We present a privacy system that leverages differential privacy to protect LinkedIn members' data while also providing audience engagement insights to enable marketing analytics related applications. We detail the differentially private algorithms and other privacy safeguards used to provide results that can be used with existing real-time data analytics platforms, specifically with the open sourc… ▽ More

    Submitted 16 November, 2020; v1 submitted 13 February, 2020; originally announced February 2020.

  43. Effects of Persuasive Dialogues: Testing Bot Identities and Inquiry Strategies

    Authors: Weiyan Shi, Xuewei Wang, Yoo Jung Oh, **gwen Zhang, Saurav Sahay, Zhou Yu

    Abstract: Intelligent conversational agents, or chatbots, can take on various identities and are increasingly engaging in more human-centered conversations with persuasive goals. However, little is known about how identities and inquiry strategies influence the conversation's effectiveness. We conducted an online study involving 790 participants to be persuaded by a chatbot for charity donation. We designed… ▽ More

    Submitted 18 January, 2020; v1 submitted 13 January, 2020; originally announced January 2020.

    Comments: 15 pages, 10 figures. Full paper to appear at ACM CHI 2020

  44. arXiv:1912.12884  [pdf, other

    cs.CR

    Secure Communication Protocol for Smart Transportation Based on Vehicular Cloud

    Authors: Trupil Limbasiya, Debasis Das, Sanjay K. Sahay

    Abstract: The pioneering concept of connected vehicles has transformed the way of thinking for researchers and entrepreneurs by collecting relevant data from nearby objects. However, this data is useful for a specific vehicle only. Moreover, vehicles get a high amount of data (e.g., traffic, safety, and multimedia infotainment) on the road. Thus, vehicles expect adequate storage devices for this data, but i… ▽ More

    Submitted 4 January, 2020; v1 submitted 30 December, 2019; originally announced December 2019.

    Comments: 10 Pages, 1 figure, Conference

    Journal ref: ACM Proceedings of the 2019 ACM International Joint Conference on Pervasive and Ubiquitous Computing and Proceedings of the 2019 ACM International Symposium on Wearable Computers, pp. 372-376

  45. arXiv:1912.10132  [pdf, ps, other

    cs.CL

    Exploring Context, Attention and Audio Features for Audio Visual Scene-Aware Dialog

    Authors: Shachi H Kumar, Eda Okur, Saurav Sahay, Jonathan Huang, Lama Nachman

    Abstract: We are witnessing a confluence of vision, speech and dialog system technologies that are enabling the IVAs to learn audio-visual groundings of utterances and have conversations with users about the objects, activities and events surrounding them. Recent progress in visual grounding techniques and Audio Understanding are enabling machines to understand shared semantic concepts and listen to the var… ▽ More

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: Presented at the Visual Question Answering and Dialog Workshop, CVPR 2019, Long Beach, USA. arXiv admin note: substantial text overlap with arXiv:1912.10131

  46. arXiv:1912.10131  [pdf, other

    cs.MM cs.CL cs.SD eess.AS

    Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog

    Authors: Shachi H Kumar, Eda Okur, Saurav Sahay, Jonathan Huang, Lama Nachman

    Abstract: With the recent advancements in Artificial Intelligence (AI), Intelligent Virtual Assistants (IVA) such as Alexa, Google Home, etc., have become a ubiquitous part of many homes. Currently, such IVAs are mostly audio-based, but going forward, we are witnessing a confluence of vision, speech and dialog system technologies that are enabling the IVAs to learn audio-visual groundings of utterances. Thi… ▽ More

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: Presented at the 3rd Visually Grounded Interaction and Language (ViGIL) Workshop, NeurIPS 2019, Vancouver, Canada. arXiv admin note: substantial text overlap with arXiv:1812.08407, arXiv:1912.10132

  47. arXiv:1912.10130  [pdf, other

    cs.CL

    Modeling Intent, Dialog Policies and Response Adaptation for Goal-Oriented Interactions

    Authors: Saurav Sahay, Shachi H Kumar, Eda Okur, Haroon Syed, Lama Nachman

    Abstract: Building a machine learning driven spoken dialog system for goal-oriented interactions involves careful design of intents and data collection along with development of intent recognition models and dialog policy learning algorithms. The models should be robust enough to handle various user distractions during the interaction flow and should steer the user back into an engaging interaction for succ… ▽ More

    Submitted 20 December, 2019; originally announced December 2019.

    Comments: Presented as a full-paper at the 23rd Workshop on the Semantics and Pragmatics of Dialogue (SemDial 2019 - LondonLogue), Sep 4-6, 2019, London, UK

    Journal ref: Proceedings of the 23rd Workshop on the Semantics and Pragmatics of Dialogue (SEMDIAL), pp. 146-155, London, United Kingdom, September 2019

  48. arXiv:1909.13714  [pdf, ps, other

    cs.MM cs.CL cs.CV

    Towards Multimodal Understanding of Passenger-Vehicle Interactions in Autonomous Vehicles: Intent/Slot Recognition Utilizing Audio-Visual Data

    Authors: Eda Okur, Shachi H Kumar, Saurav Sahay, Lama Nachman

    Abstract: Understanding passenger intents from spoken interactions and car's vision (both inside and outside the vehicle) are important building blocks towards develo** contextual dialog systems for natural interactions in autonomous vehicles (AV). In this study, we continued exploring AMIE (Automated-vehicle Multimodal In-cabin Experience), the in-cabin agent responsible for handling certain multimodal p… ▽ More

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: Presented as a short-paper at the 23rd Workshop on the Semantics and Pragmatics of Dialogue (SemDial 2019 - LondonLogue), Sep 4-6, 2019, London, UK

    Journal ref: Proceedings of the 23rd Workshop on the Semantics and Pragmatics of Dialogue (SEMDIAL), pp. 213-215, London, United Kingdom, September 2019

  49. arXiv:1908.02472  [pdf

    cs.ET cs.AR cs.NE

    3D-aCortex: An Ultra-Compact Energy-Efficient Neurocomputing Platform Based on Commercial 3D-NAND Flash Memories

    Authors: Mohammad Bavandpour, Shubham Sahay, Mohammad Reza Mahmoodi, Dmitri B. Strukov

    Abstract: The first contribution of this paper is the development of extremely dense, energy-efficient mixed-signal vector-by-matrix-multiplication (VMM) circuits based on the existing 3D-NAND flash memory blocks, without any need for their modification. Such compatibility is achieved using time-domain-encoded VMM design. Our detailed simulations have shown that, for example, the 5-bit VMM of 200-element ve… ▽ More

    Submitted 7 August, 2019; originally announced August 2019.

    Comments: 14 pages, 9 figures, 2 tables

  50. arXiv:1905.13747  [pdf, ps, other

    cs.CR

    A Survey on the Detection of Android Malicious Apps

    Authors: Sanjay K. Sahay, Ashu Sharma

    Abstract: Android-based smart devices are exponentially growing, and due to the ubiquity of the Internet, these devices are globally connected to the different devices/networks. Its popularity, attractive features, and mobility make malware creator to put a number of malicious apps in the market to disrupt and annoy the victims. Although to identify the malicious apps, time-to-time various techniques are pr… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: Conference paper, 11 pages

    Journal ref: Springer, Advances in Computer Communication and Computational Sciences, pp 437-446, 2019