Skip to main content

Showing 1–15 of 15 results for author: Deshpande, V

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.18558  [pdf, other

    cs.RO eess.SY

    "Golden Ratio Yoshimura" for Meta-Stable and Massively Reconfigurable Deployment

    Authors: Vishrut Deshpande, Yogesh Phalak, Ziyang Zhou, Ian Walker, Suyi Li

    Abstract: Yoshimura origami is a classical folding pattern that has inspired many deployable structure designs. Its applications span from space exploration, kinetic architectures, and soft robots to even everyday household items. However, despite its wide usage, Yoshimura has been fixated on a set of design constraints to ensure its flat-foldability. Through extensive kinematic analysis and prototype tests… ▽ More

    Submitted 30 May, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  2. arXiv:2404.02204  [pdf, other

    cs.CL cs.LG

    Emergent Abilities in Reduced-Scale Generative Language Models

    Authors: Sherin Muckatira, Vijeta Deshpande, Vladislav Lialin, Anna Rumshisky

    Abstract: Large language models can solve new tasks without task-specific fine-tuning. This ability, also known as in-context learning (ICL), is considered an emergent ability and is primarily seen in large language models with billions of parameters. This study investigates if such emergent properties are strictly tied to model size or can be demonstrated by smaller models trained on reduced-scale data. To… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 16 pages, 4 figures. Accepted to NAACL 2024 Findings

  3. arXiv:2402.13452  [pdf, other

    cs.SI cs.CL cs.LG

    LocalTweets to LocalHealth: A Mental Health Surveillance Framework Based on Twitter Data

    Authors: Vijeta Deshpande, Minhwa Lee, Zonghai Yao, Zihao Zhang, Jason Brian Gibbons, Hong Yu

    Abstract: Prior research on Twitter (now X) data has provided positive evidence of its utility in develo** supplementary health surveillance systems. In this study, we present a new framework to surveil public health, focusing on mental health (MH) outcomes. We hypothesize that locally posted tweets are indicative of local MH outcomes and collect tweets posted from 765 neighborhoods (census block groups)… ▽ More

    Submitted 26 March, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

    Journal ref: LREC-COLING 2024

  4. arXiv:2401.16914  [pdf, other

    cs.LG cond-mat.mtrl-sci

    Energy-conserving equivariant GNN for elasticity of lattice architected metamaterials

    Authors: Ivan Grega, Ilyes Batatia, Gábor Csányi, Sri Karlapati, Vikram S. Deshpande

    Abstract: Lattices are architected metamaterials whose properties strongly depend on their geometrical design. The analogy between lattices and graphs enables the use of graph neural networks (GNNs) as a faster surrogate model compared to traditional methods such as finite element modelling. In this work, we generate a big dataset of structure-property relationships for strut-based lattices. The dataset is… ▽ More

    Submitted 20 March, 2024; v1 submitted 30 January, 2024; originally announced January 2024.

    Comments: International Conference on Learning Representations 2024

  5. arXiv:2312.01642  [pdf, other

    cs.CL

    Voice-Based Smart Assistant System for Vehicles using RASA

    Authors: Aditya Paranjape, Yash Patwardhan, Vedant Deshpande, Aniket Darp, Jayashree Jagdale

    Abstract: Conversational AIs, or chatbots, mimic human speech when conversing. Smart assistants facilitate the automation of several tasks that needed human intervention earlier. Because of their accuracy, absence of dependence on human resources, and accessibility around the clock, chatbots can be employed in vehicles too. Due to people's propensity to divert their attention away from the task of driving w… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: 6 pages, 4 figures, accepted at IEEE International Conference on Computational Intelligence, Networks and Security ICCINS-2023

  6. arXiv:2312.00348  [pdf, other

    cs.CV

    Student Activity Recognition in Classroom Environments using Transfer Learning

    Authors: Anagha Deshpande, Vedant Deshpande

    Abstract: The recent advances in artificial intelligence and deep learning facilitate automation in various applications including home automation, smart surveillance systems, and healthcare among others. Human Activity Recognition is one of its emerging applications, which can be implemented in a classroom environment to enhance safety, efficiency, and overall educational quality. This paper proposes a sys… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

    Comments: 6 pages, 12 figures, accepted at the IEEE International Conference on Computational Intelligence, Networks and Security (ICCINS) 2023

  7. arXiv:2311.18739  [pdf, other

    cs.CL

    Mavericks at NADI 2023 Shared Task: Unravelling Regional Nuances through Dialect Identification using Transformer-based Approach

    Authors: Vedant Deshpande, Yash Patwardhan, Kshitij Deshpande, Sudeep Mangalvedhekar, Ravindra Murumkar

    Abstract: In this paper, we present our approach for the "Nuanced Arabic Dialect Identification (NADI) Shared Task 2023". We highlight our methodology for subtask 1 which deals with country-level dialect identification. Recognizing dialects plays an instrumental role in enhancing the performance of various downstream NLP tasks such as speech recognition and translation. The task uses the Twitter dataset (TW… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 5 pages, 1 figure, accepted at the NADI ArabicNLP Workshop, EMNLP 2023

  8. arXiv:2311.18730  [pdf, other

    cs.CL

    Mavericks at ArAIEval Shared Task: Towards a Safer Digital Space -- Transformer Ensemble Models Tackling Deception and Persuasion

    Authors: Sudeep Mangalvedhekar, Kshitij Deshpande, Yash Patwardhan, Vedant Deshpande, Ravindra Murumkar

    Abstract: In this paper, we highlight our approach for the "Arabic AI Tasks Evaluation (ArAiEval) Shared Task 2023". We present our approaches for task 1-A and task 2-A of the shared task which focus on persuasion technique detection and disinformation detection respectively. Detection of persuasion techniques and disinformation has become imperative to avoid distortion of authentic information. The tasks u… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 6 pages, 1 figure, accepted at the ArAIEval ArabicNLP workshop, EMNLP conference 2023

  9. arXiv:2305.17266  [pdf, other

    cs.CL

    Honey, I Shrunk the Language: Language Model Behavior at Reduced Scale

    Authors: Vijeta Deshpande, Dan Pechi, Shree Thatte, Vladislav Lialin, Anna Rumshisky

    Abstract: In recent years, language models have drastically grown in size, and the abilities of these models have been shown to improve with scale. The majority of recent scaling laws studies focused on high-compute high-parameter count settings, leaving the question of when these abilities begin to emerge largely unanswered. In this paper, we investigate whether the effects of pre-training can be observed… ▽ More

    Submitted 30 May, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted to ACL 2023 Findings

  10. arXiv:2303.15647  [pdf, other

    cs.CL

    Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning

    Authors: Vladislav Lialin, Vijeta Deshpande, Anna Rumshisky

    Abstract: This paper presents a systematic overview and comparison of parameter-efficient fine-tuning methods covering over 40 papers published between February 2019 and February 2023. These methods aim to resolve the infeasibility and impracticality of fine-tuning large language models by only training a small set of parameters. We provide a taxonomy that covers a broad range of methods and present a detai… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

  11. arXiv:2301.07057  [pdf

    cs.CL cs.AI cs.LG

    Transformer Based Implementation for Automatic Book Summarization

    Authors: Siddhant Porwal, Laxmi Bewoor, Vivek Deshpande

    Abstract: Document Summarization is the procedure of generating a meaningful and concise summary of a given document with the inclusion of relevant and topic-important points. There are two approaches: one is picking up the most relevant statements from the document itself and adding it to the Summary known as Extractive and the other is generating sentences for the Summary known as Abstractive Summarizatio… ▽ More

    Submitted 17 January, 2023; originally announced January 2023.

    Comments: Published at - https://ijisae.org/index.php/IJISAE/article/view/2421

    Journal ref: IJISAE Vol 10 No. 3 (2022)

  12. A Ferroelectric Tunnel Junction-based Integrate-and-Fire Neuron

    Authors: Paolo Gibertini, Luca Fehlings, Suzanne Lancaster, Quang Duong, Thomas Mikolajick, Catherine Dubourdieu, Stefan Slesazeck, Erika Covi, Veeresh Deshpande

    Abstract: Event-based neuromorphic systems provide a low-power solution by using artificial neurons and synapses to process data asynchronously in the form of spikes. Ferroelectric Tunnel Junctions (FTJs) are ultra low-power memory devices and are well-suited to be integrated in these systems. Here, we present a hybrid FTJ-CMOS Integrate-and-Fire neuron which constitutes a fundamental building block for new… ▽ More

    Submitted 4 November, 2022; originally announced November 2022.

    Journal ref: 2022 29th IEEE International Conference on Electronics, Circuits and Systems (ICECS)

  13. arXiv:2209.07859  [pdf

    cs.IR cs.AI cs.LG

    Extracting Biomedical Factual Knowledge Using Pretrained Language Model and Electronic Health Record Context

    Authors: Zonghai Yao, Yi Cao, Zhichao Yang, Vijeta Deshpande, Hong Yu

    Abstract: Language Models (LMs) have performed well on biomedical natural language processing applications. In this study, we conducted some experiments to use prompt methods to extract knowledge from LMs as new knowledge Bases (LMs as KBs). However, prompting can only be used as a low bound for knowledge extraction, and perform particularly poorly on biomedical domain KBs. In order to make LMs as KBs more… ▽ More

    Submitted 20 October, 2022; v1 submitted 25 August, 2022; originally announced September 2022.

    Comments: Presented at the AMIA 2022 Annual Symposium as an oral paper. Revised some content in Introduction and Related Work section

  14. arXiv:1812.07782  [pdf

    cs.DC

    Decentralized Periodic Approach for Adaptive Fault Diagnosis in Distributed Systems

    Authors: Latika Sarna, Sumedha Shenolikar, Poorva Kulkarni, Varsha Deshpande, Supriya Kelkar

    Abstract: In this paper, Decentralized Periodic Approach for Adaptive Fault Diagnosis (DP-AFD) algorithm is proposed for fault diagnosis in distributed systems with arbitrary topology. Faulty nodes may be either unresponsive, may have either software or hardware faults. The proposed algorithm detects the faulty nodes situated in geographically distributed locations. This algorithm does not depend on a singl… ▽ More

    Submitted 19 December, 2018; originally announced December 2018.

    Comments: 19 pages, 13 figures, 1 table

    ACM Class: C.2.4; C.2.5; C.2.6

  15. arXiv:1812.07771  [pdf

    cs.DC

    Fault Diagnosis for Distributed Systems using Accuracy Technique

    Authors: Poorva Kulkarni, Varsha Deshpande, Latika Sarna, Sumedha Shenolikar, Supriya Kelkar

    Abstract: Distributed Systems involve two or more computer systems which may be situated at geographically distinct locations and are connected by a communication network. Due to failures in the communication link, faults arise which may make the entire system dysfunctional. To enable seamless operation of the distributed system, these faults need to be detected and located accurately. This paper examines v… ▽ More

    Submitted 19 December, 2018; originally announced December 2018.

    Comments: 13 pages, 10 figures, 3 tables

    ACM Class: C.2.4