Showing 1–50 of 93 results for author: Bhattacharjee, A

Searching in archive cs.
  1. arXiv:2406.17375  [pdf, other]

    cs.CL

    An Empirical Study on the Characteristics of Bias upon Context Length Variation for Bangla

    Authors: Jayanta Sadhu, Ayan Antik Khan, Abhik Bhattacharjee, Rifat Shahriyar

    Abstract: Pretrained language models inherently exhibit various social biases, prompting a crucial examination of their social impact across various linguistic contexts due to their widespread usage. Previous studies have provided numerous methods for intrinsic bias measurements, predominantly focused on high-resource languages. In this work, we aim to extend these investigations to Bangla, a low-resource l…

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: Accepted in Findings of ACL, 2024

  2. arXiv:2406.15117  [pdf, other]

    eess.IV cs.AI cs.CV

    FA-Net: A Fuzzy Attention-aided Deep Neural Network for Pneumonia Detection in Chest X-Rays

    Authors: Ayush Roy, Anurag Bhattacharjee, Diego Oliva, Oscar Ramos-Soto, Francisco J. Alvarez-Padilla, Ram Sarkar

    Abstract: Pneumonia is a respiratory infection caused by bacteria, fungi, or viruses. It affects many people, particularly those in developing or underdeveloped nations with high pollution levels, unhygienic living conditions, overcrowding, and insufficient medical infrastructure. Pneumonia can cause pleural effusion, where fluids fill the lungs, leading to respiratory difficulty. Early diagnosis is crucial…

    Submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.12263  [pdf, other]

    cs.CL

    Defending Against Social Engineering Attacks in the Age of LLMs

    Authors: Lin Ai, Tharindu Kumarage, Amrita Bhattacharjee, Zizhou Liu, Zheng Hui, Michael Davinroy, James Cook, Laura Cassani, Kirill Trapeznikov, Matthias Kirchner, Arslan Basharat, Anthony Hoogs, Joshua Garland, Huan Liu, Julia Hirschberg

    Abstract: The proliferation of Large Language Models (LLMs) poses challenges in detecting and mitigating digital deception, as these models can emulate human conversational patterns and facilitate chat-based social engineering (CSE) attacks. This study investigates the dual capabilities of LLMs as both facilitators and defenders against CSE threats. We develop a novel dataset, SEConvo, simulating CSE scenar…

    Submitted 18 June, 2024; originally announced June 2024.

  4. arXiv:2405.15194  [pdf, other]

    cs.LG cs.AI

    Efficient Reinforcement Learning via Large Language Model-based Search

    Authors: Siddhant Bhambri, Amrita Bhattacharjee, Huan Liu, Subbarao Kambhampati

    Abstract: Reinforcement Learning (RL) suffers from sample inefficiency in sparse reward domains, and the problem is pronounced if there are stochastic transitions. To improve the sample efficiency, reward shaping is a well-studied approach to introduce intrinsic rewards that can help the RL agent converge to an optimal policy faster. However, designing a useful reward shaping function specific to each probl…

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 9 pages + Appendix

  5. arXiv:2405.04793  [pdf, other]

    cs.CL cs.AI cs.LG

    Zero-shot LLM-guided Counterfactual Generation for Text

    Authors: Amrita Bhattacharjee, Raha Moraffah, Joshua Garland, Huan Liu

    Abstract: Counterfactual examples are frequently used for model development and evaluation in many natural language processing (NLP) tasks. Although methods for automated counterfactual generation have been explored, such methods depend on models such as pre-trained language models that are then fine-tuned on auxiliary, often task-specific datasets. Collecting and annotating such datasets for counterfactual…

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2309.13340

  6. arXiv:2404.17698  [pdf, other]

    cs.HC

    "Actually I Can Count My Blessings": User-Centered Design of an Application to Promote Gratitude Among Young Adults

    Authors: Ananya Bhattacharjee, Zichen Gong, Bingcheng Wang, Timothy James Luckcock, Emma Watson, Elena Allica Abellan, Leslie Gutman, Anne Hsu, Joseph Jay Williams

    Abstract: Regular practice of gratitude has the potential to enhance psychological wellbeing and foster stronger social connections among young adults. However, there is a lack of research investigating user needs and expectations regarding gratitude-promoting applications. To address this gap, we employed a user-centered design approach to develop a mobile application that facilitates gratitude practice. O…

    Submitted 26 April, 2024; originally announced April 2024.

  7. arXiv:2404.17284  [pdf]

    cs.LG

    Machine Learning based prediction of Vanadium Redox Flow Battery temperature rise under different charge-discharge conditions

    Authors: Anirudh Narayan D, Akshat Johar, Divye Kalra, Bhavya Ardeshna, Ankur Bhattacharjee

    Abstract: Accurate prediction of battery temperature rise is very essential for designing an efficient thermal management scheme. In this paper, machine learning (ML) based prediction of Vanadium Redox Flow Battery (VRFB) thermal behavior during charge-discharge operation has been demonstrated for the first time. Considering different currents with a specified electrolyte flow rate, the temperature of a kW…

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: 21 pages, 5 figures

  8. PIVOT- Input-aware Path Selection for Energy-efficient ViT Inference

    Authors: Abhishek Moitra, Abhiroop Bhattacharjee, Priyadarshini Panda

    Abstract: The attention module in vision transformers (ViTs) performs intricate spatial correlations, contributing significantly to accuracy and delay. It is thereby important to modulate the number of attentions according to the input feature complexity for optimal delay-accuracy tradeoffs. To this end, we propose PIVOT - a co-optimization framework which selectively performs attention skipping based on the…

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: Accepted to 61st ACM/IEEE Design Automation Conference (DAC '24), June 23--27, 2024, San Francisco, CA, USA (6 Pages)

  9. arXiv:2403.15952  [pdf, other]

    cs.CV cs.CL

    IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models

    Authors: Haz Sameen Shahgir, Khondker Salman Sayeed, Abhik Bhattacharjee, Wasi Uddin Ahmad, Yue Dong, Rifat Shahriyar

    Abstract: The advent of Vision Language Models (VLM) has allowed researchers to investigate the visual understanding of a neural network using natural language. Beyond object classification and detection, VLMs are capable of visual comprehension and common-sense reasoning. This naturally led to the question: How do VLMs respond when the image itself is inherently unreasonable? To this end, we present Illusi…

    Submitted 30 March, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

  10. arXiv:2403.15690  [pdf, other]

    cs.CL cs.AI cs.LG

    EAGLE: A Domain Generalization Framework for AI-generated Text Detection

    Authors: Amrita Bhattacharjee, Raha Moraffah, Joshua Garland, Huan Liu

    Abstract: With the advancement in capabilities of Large Language Models (LLMs), one major step in the responsible and safe use of such LLMs is to be able to detect text generated by these models. While supervised AI-generated text detectors perform well on text generated by older LLMs, with the frequent release of new LLMs, building supervised detectors for identifying text from such new models would requir…

    Submitted 22 March, 2024; originally announced March 2024.

  11. arXiv:2403.13704  [pdf, other]

    cs.CE cs.LG math.NA

    Improving the Adaptive Moment Estimation (ADAM) stochastic optimizer through an Implicit-Explicit (IMEX) time-stepping approach

    Authors: Abhinab Bhattacharjee, Andrey A. Popov, Arash Sarshar, Adrian Sandu

    Abstract: The Adam optimizer, often used in Machine Learning for neural network training, corresponds to an underlying ordinary differential equation (ODE) in the limit of very small learning rates. This work shows that the classical Adam algorithm is a first order implicit-explicit (IMEX) Euler discretization of the underlying ODE. Employing the time discretization point of view, we propose new extensions…

    Submitted 20 March, 2024; originally announced March 2024.

    Report number: CSL-TR-2024-2

  12. arXiv:2403.12403  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Interpretable Hate Speech Detection using Large Language Model-extracted Rationales

    Authors: Ayushi Nirmal, Amrita Bhattacharjee, Paras Sheth, Huan Liu

    Abstract: Although social media platforms are a prominent arena for users to engage in interpersonal discussions and express opinions, the facade and anonymity offered by social media may allow users to spew hate speech and offensive content. Given the massive scale of such platforms, there arises a need to automatically identify and flag instances of hate speech. Although several hate speech detection meth…

    Submitted 7 May, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Camera-ready for NAACL WOAH 2024 (Workshop on Online Abuse and Harms). First two authors contributed equally

  13. arXiv:2403.08035  [pdf, other]

    cs.CL cs.AI

    Harnessing Artificial Intelligence to Combat Online Hate: Exploring the Challenges and Opportunities of Large Language Models in Hate Speech Detection

    Authors: Tharindu Kumarage, Amrita Bhattacharjee, Joshua Garland

    Abstract: Large language models (LLMs) excel in many diverse applications beyond language generation, e.g., translation, summarization, and sentiment analysis. One intriguing application is in text classification. This becomes pertinent in the realm of identifying hateful or toxic speech -- a domain fraught with challenges and ethical dilemmas. In our study, we have two objectives: firstly, to offer a liter…

    Submitted 12 March, 2024; originally announced March 2024.

  14. arXiv:2402.13446  [pdf, other]

    cs.CL

    Large Language Models for Data Annotation: A Survey

    Authors: Zhen Tan, Dawei Li, Song Wang, Alimohammad Beigi, Bohan Jiang, Amrita Bhattacharjee, Mansooreh Karami, Jundong Li, Lu Cheng, Huan Liu

    Abstract: Data annotation generally refers to the labeling or generating of raw data with relevant information, which could be used for improving the efficacy of machine learning models. The process, however, is labor-intensive and costly. The emergence of advanced Large Language Models (LLMs), exemplified by GPT-4, presents an unprecedented opportunity to automate the complicated process of data annotation…

    Submitted 23 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  15. arXiv:2402.06655  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    Adversarial Text Purification: A Large Language Model Approach for Defense

    Authors: Raha Moraffah, Shubh Khandelwal, Amrita Bhattacharjee, Huan Liu

    Abstract: Adversarial purification is a defense mechanism for safeguarding classifiers against adversarial attacks without knowing the type of attacks or training of the classifier. These techniques characterize and eliminate adversarial perturbations from the attacked inputs, aiming to restore purified samples that retain similarity to the initially attacked ones and are correctly classified by the classif…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: PAKDD 2024

  16. arXiv:2402.06221  [pdf, other]

    cs.CL cs.IR

    ResumeFlow: An LLM-facilitated Pipeline for Personalized Resume Generation and Refinement

    Authors: Saurabh Bhausaheb Zinjad, Amrita Bhattacharjee, Amey Bhilegaonkar, Huan Liu

    Abstract: Crafting the ideal, job-specific resume is a challenging task for many job applicants, especially for early-career applicants. While it is highly recommended that applicants tailor their resume to the specific role they are applying for, manually tailoring resumes to job descriptions and role-specific requirements is often (1) extremely time-consuming, and (2) prone to human errors. Furthermore, p…

    Submitted 7 May, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: Accepted to SIGIR 2024 (Demo)

  17. arXiv:2402.02586  [pdf, other]

    cs.LG cs.ET

    ClipFormer: Key-Value Clipping of Transformers on Memristive Crossbars for Write Noise Mitigation

    Authors: Abhiroop Bhattacharjee, Abhishek Moitra, Priyadarshini Panda

    Abstract: Transformers have revolutionized various real-world applications from natural language processing to computer vision. However, traditional von-Neumann computing paradigm faces memory and bandwidth limitations in accelerating transformers owing to their massive model sizes. To this end, In-memory Computing (IMC) crossbars based on Non-volatile Memories (NVMs), due to their ability to perform highly…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: 9 pages, 10 figures, 3 tables, 1 appendix

  18. arXiv:2402.02018  [pdf, other]

    cs.LG

    The Landscape and Challenges of HPC Research and LLMs

    Authors: Le Chen, Nesreen K. Ahmed, Akash Dutta, Arijit Bhattacharjee, Sixing Yu, Quazi Ishtiaque Mahmud, Waqwoya Abebe, Hung Phan, Aishwarya Sarkar, Branden Butler, Niranjan Hasabnis, Gal Oren, Vy A. Vo, Juan Pablo Munoz, Theodore L. Willke, Tim Mattson, Ali Jannesari

    Abstract: Recently, language models (LMs), especially large language models (LLMs), have revolutionized the field of deep learning. Both encoder-decoder models and prompt-based techniques have shown immense potential for natural language processing and code-based tasks. Over the past several years, many research labs and institutions have invested heavily in high-performance computing, approaching or breach…

    Submitted 6 February, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  19. arXiv:2402.01049  [pdf, other]

    cs.CV

    IMUGPT 2.0: Language-Based Cross Modality Transfer for Sensor-Based Human Activity Recognition

    Authors: Zikang Leng, Amitrajit Bhattacharjee, Hrudhai Rajasekhar, Lizhe Zhang, Elizabeth Bruda, Hyeokhyen Kwon, Thomas Plötz

    Abstract: One of the primary challenges in the field of human activity recognition (HAR) is the lack of large labeled datasets. This hinders the development of robust and generalizable models. Recently, cross modality transfer approaches have been explored that can alleviate the problem of data scarcity. These approaches convert existing datasets from a source modality, such as video, to a target modality (…

    Submitted 1 February, 2024; originally announced February 2024.

  20. arXiv:2401.16445  [pdf, other]

    cs.SE cs.DC cs.LG

    OMPGPT: A Generative Pre-trained Transformer Model for OpenMP

    Authors: Le Chen, Arijit Bhattacharjee, Nesreen Ahmed, Niranjan Hasabnis, Gal Oren, Vy Vo, Ali Jannesari

    Abstract: Large language models (LLMs) such as ChatGPT have significantly advanced the field of Natural Language Processing (NLP). This trend led to the development of code-based large language models such as StarCoder, WizardCoder, and CodeLlama, which are trained extensively on vast repositories of code and programming languages. While the generic abilities of these code LLMs are useful for many programmer…

    Submitted 21 June, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

  21. arXiv:2312.13581  [pdf, other]

    cs.HC

    Understanding the Role of Large Language Models in Personalizing and Scaffolding Strategies to Combat Academic Procrastination

    Authors: Ananya Bhattacharjee, Yuchen Zeng, Sarah Yi Xu, Dana Kulzhabayeva, Minyi Ma, Rachel Kornfield, Syed Ishtiaque Ahmed, Alex Mariakakis, Mary P Czerwinski, Anastasia Kuzminykh, Michael Liut, Joseph Jay Williams

    Abstract: Traditional interventions for academic procrastination often fail to capture the nuanced, individual-specific factors that underlie them. Large language models (LLMs) hold immense potential for addressing this gap by permitting open-ended inputs, including the ability to customize interventions to individuals' unique needs. However, user expectations and potential limitations of LLMs in this conte…

    Submitted 21 December, 2023; originally announced December 2023.

  22. arXiv:2312.03559  [pdf, other]

    cs.AR

    MCAIMem: a Mixed SRAM and eDRAM Cell for Area and Energy-efficient on-chip AI Memory

    Authors: Duy-Thanh Nguyen, Abhiroop Bhattacharjee, Abhishek Moitra, Priyadarshini Panda

    Abstract: AI chips commonly employ SRAM memory as buffers for their reliability and speed, which contribute to high performance. However, SRAM is expensive and demands significant area and energy consumption. Previous studies have explored replacing SRAM with emerging technologies like non-volatile memory, which offers fast-read memory access and a small cell area. Despite these advantages, non-volatile mem…

    Submitted 6 December, 2023; originally announced December 2023.

  23. arXiv:2311.06505  [pdf, other]

    cs.LG

    CompCodeVet: A Compiler-guided Validation and Enhancement Approach for Code Dataset

    Authors: Le Chen, Arijit Bhattacharjee, Nesreen K. Ahmed, Niranjan Hasabnis, Gal Oren, Bin Lei, Ali Jannesari

    Abstract: Large language models (LLMs) have become increasingly prominent in academia and industry due to their remarkable performance in diverse applications. As these models evolve with increasing parameters, they excel in tasks like sentiment analysis and machine translation. However, even models with billions of parameters face challenges in tasks demanding multi-step reasoning. Code generation and comp…

    Submitted 11 November, 2023; originally announced November 2023.

  24. arXiv:2310.18326  [pdf, other]

    cs.AI cs.CY cs.HC cs.LG

    Using Adaptive Bandit Experiments to Increase and Investigate Engagement in Mental Health

    Authors: Harsh Kumar, Tong Li, Jiakai Shi, Ilya Musabirov, Rachel Kornfield, Jonah Meyerhoff, Ananya Bhattacharjee, Chris Karr, Theresa Nguyen, David Mohr, Anna Rafferty, Sofia Villar, Nina Deliu, Joseph Jay Williams

    Abstract: Digital mental health (DMH) interventions, such as text-message-based lessons and activities, offer immense potential for accessible mental health support. While these interventions can be effective, real-world experimental testing can further enhance their design and impact. Adaptive experimentation, utilizing algorithms like Thompson Sampling for (contextual) multi-armed bandit (MAB) problems, c…

    Submitted 13 October, 2023; originally announced October 2023.

    Report number: Volume 38, Issue 21

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence (IAAI) 2024

  25. arXiv:2310.06845  [pdf, other]

    cs.CR cs.AI cs.CV cs.LG

    RobustEdge: Low Power Adversarial Detection for Cloud-Edge Systems

    Authors: Abhishek Moitra, Abhiroop Bhattacharjee, Youngeun Kim, Priyadarshini Panda

    Abstract: In practical cloud-edge scenarios, where a resource constrained edge performs data acquisition and a cloud system (having sufficient resources) performs inference tasks with a deep neural network (DNN), adversarial robustness is critical for reliability and ubiquitous deployment. Adversarial detection is a prime adversarial defence technique used in prior literature. However, in prior detection wo…

    Submitted 5 September, 2023; originally announced October 2023.

    Comments: IEEE Transactions on Emerging Topics in Computational Intelligence (TETCI)

  26. arXiv:2309.13340  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards LLM-guided Causal Explainability for Black-box Text Classifiers

    Authors: Amrita Bhattacharjee, Raha Moraffah, Joshua Garland, Huan Liu

    Abstract: With the advent of larger and more complex deep learning models, such as in Natural Language Processing (NLP), model qualities like explainability and interpretability, albeit highly desirable, are becoming harder challenges to tackle and solve. For example, state-of-the-art models in text classification are black-box by design. Although standard explanation methods provide some degree of explaina…

    Submitted 29 January, 2024; v1 submitted 23 September, 2023; originally announced September 2023.

    Comments: Camera-ready for AAAI ReLM 2024

  27. arXiv:2309.03992  [pdf, other]

    cs.CL cs.AI cs.LG

    ConDA: Contrastive Domain Adaptation for AI-generated Text Detection

    Authors: Amrita Bhattacharjee, Tharindu Kumarage, Raha Moraffah, Huan Liu

    Abstract: Large language models (LLMs) are increasingly being used for generating text in a variety of use cases, including journalistic news articles. Given the potential malicious nature in which these LLMs can be used to generate disinformation at scale, it is important to build effective detectors for such AI-generated text. Given the surge in development of new LLMs, acquiring labeled training data for…

    Submitted 20 September, 2023; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Camera-ready for IJCNLP-AACL 2023 main track

  28. arXiv:2309.03683  [pdf]

    cs.RO eess.SY

    An anthropomorphic continuum robotic neck actuated by SMA spring-based multipennate muscle architecture

    Authors: Ratnangshu Das, Yashaswi Sinha, Anirudha Bhattacharjee, Bishakh Bhattacharya

    Abstract: This work presents a novel Shape Memory Alloy spring actuated continuum robotic neck that derives inspiration from pennate muscle architecture. The proposed design has 2DOF, and experimental studies reveal that the designed joint can replicate the human head's anthropomorphic range of motion. We enumerate the analytical modelling for SMA actuators and the kinematic model of the proposed design con…

    Submitted 7 September, 2023; originally announced September 2023.

  29. arXiv:2309.03388  [pdf, other]

    cs.NE

    Are SNNs Truly Energy-efficient? $-$ A Hardware Perspective

    Authors: Abhiroop Bhattacharjee, Ruokai Yin, Abhishek Moitra, Priyadarshini Panda

    Abstract: Spiking Neural Networks (SNNs) have gained attention for their energy-efficient machine learning capabilities, utilizing bio-inspired activation functions and sparse binary spike-data representations. While recent SNN algorithmic advances achieve high accuracy on large-scale computer vision tasks, their energy-efficiency claims rely on certain impractical estimation metrics. This work studies two…

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: 5 pages

  30. arXiv:2309.03164  [pdf, other]

    cs.CL cs.AI

    J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News

    Authors: Tharindu Kumarage, Amrita Bhattacharjee, Djordje Padejski, Kristy Roschke, Dan Gillmor, Scott Ruston, Huan Liu, Joshua Garland

    Abstract: The rapid proliferation of AI-generated text online is profoundly reshaping the information landscape. Among various types of AI-generated text, AI-generated news presents a significant threat as it can be a prominent source of misinformation online. While several recent efforts have focused on detecting AI-generated text in general, these methods require enhanced reliability, given concerns about…

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: This Paper is Accepted to The 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics (IJCNLP-AACL 2023)

  31. arXiv:2309.00597  [pdf, other]

    cs.CE cs.DC cs.ET q-bio.NC quant-ph

    The QUATRO Application Suite: Quantum Computing for Models of Human Cognition

    Authors: Raghavendra Pradyumna Pothukuchi, Leon Lufkin, Yu Jun Shen, Alejandro Simon, Rome Thorstenson, Bernardo Eilert Trevisan, Michael Tu, Mudi Yang, Ben Foxman, Viswanatha Srinivas Pothukuchi, Gunnar Epping, Thi Ha Kyaw, Bryant J Jongkees, Yongshan Ding, Jerome R Busemeyer, Jonathan D Cohen, Abhishek Bhattacharjee

    Abstract: Research progress in quantum computing has, thus far, focused on a narrow set of application domains. Expanding the suite of quantum application domains is vital for the discovery of new software toolchains and architectural abstractions. In this work, we unlock a new class of applications ripe for quantum computing research -- computational cognitive modeling. Cognitive models are critical to und…

    Submitted 8 December, 2023; v1 submitted 1 September, 2023; originally announced September 2023.

  32. arXiv:2308.01284  [pdf, other]

    cs.CL cs.AI

    Fighting Fire with Fire: Can ChatGPT Detect AI-generated Text?

    Authors: Amrita Bhattacharjee, Huan Liu

    Abstract: Large language models (LLMs) such as ChatGPT are increasingly being used for various use cases, including text content generation at scale. Although detection methods for such AI-generated text exist already, we investigate ChatGPT's performance as a detector on such AI-generated text, inspired by works that use ChatGPT as a data labeler or annotator. We evaluate the zero-shot performance of ChatG…

    Submitted 17 August, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

    Comments: to appear in SIGKDD Explorations (December 2023)

  33. HyDe: A Hybrid PCM/FeFET/SRAM Device-search for Optimizing Area and Energy-efficiencies in Analog IMC Platforms

    Authors: Abhiroop Bhattacharjee, Abhishek Moitra, Priyadarshini Panda

    Abstract: Today, there are a plethora of In-Memory Computing (IMC) devices- SRAMs, PCMs & FeFETs, that emulate convolutions on crossbar-arrays with high throughput. Each IMC device offers its own pros & cons during inference of Deep Neural Networks (DNNs) on crossbars in terms of area overhead, programming energy and non-idealities. A design-space exploration is, therefore, imperative to derive a hybrid-dev…

    Submitted 24 October, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

    Comments: Accepted to IEEE Journal on Emerging and Selected Topics in Circuits and Systems (JETCAS)

    Journal ref: IEEE Journal on Emerging and Selected Topics in Circuits and Systems (JETCAS), 2023

  34. arXiv:2307.13637  [pdf, other]

    cs.HC

    Cognitive Engagement for STEM+C Education: Investigating Serious Game Impact on Graph Structure Learning with fNIRS

    Authors: Shayla Sharmin, Reza Koiler, Rifat Sadik, Arpan Bhattacharjee, Priyanka Raju Patre, Pinar Kullu, Charles Hohensee, Nancy Getchell, Roghayeh Leila Barmaki

    Abstract: For serious games on education, understanding the effectiveness of different learning methods in influencing cognitive processes remains a significant challenge. This study investigates the impact of serious games on graph structure learning. For this, we compared our in-house game-based learning (GBL) and video-based learning (VBL) methodologies by evaluating their effectiveness on cognitive proc…

    Submitted 7 March, 2024; v1 submitted 25 July, 2023; originally announced July 2023.

    Comments: 19 pages, 9 figures

  35. arXiv:2306.14039  [pdf, other]

    eess.IV cs.CV

    Semantic Segmentation of Porosity in 4D Spatio-Temporal X-ray μCT of Titanium Coated Ni wires using Deep Learning

    Authors: Pradyumna Elavarthi, Arun Bhattacharjee, Ashley Paz y Puente, Anca Ralescu

    Abstract: A fully convolutional neural network was used to measure the evolution of the volume fraction of two different Kirkendall pores during the homogenization of Ti coated Ni wires. Traditional methods like Otsu's thresholding and the largest connected component analysis were used to obtain the masks for training the segmentation model. Once trained, the model was used to semantically segment the two ty…

    Submitted 24 June, 2023; originally announced June 2023.

  36. Examining the Role and Limits of Batchnorm Optimization to Mitigate Diverse Hardware-noise in In-memory Computing

    Authors: Abhiroop Bhattacharjee, Abhishek Moitra, Youngeun Kim, Yeshwanth Venkatesha, Priyadarshini Panda

    Abstract: In-Memory Computing (IMC) platforms such as analog crossbars are gaining focus as they facilitate the acceleration of low-precision Deep Neural Networks (DNNs) with high area- & compute-efficiencies. However, the intrinsic non-idealities in crossbars, which are often non-deterministic and non-linear, degrade the performance of the deployed DNNs. In addition to quantization errors, most frequently…

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: Accepted in Great Lakes Symposium on VLSI 2023 (GLSVLSI 2023) conference

    Journal ref: Great Lakes Symposium on VLSI 2023 (GLSVLSI 2023) conference

  37. arXiv:2305.17244  [pdf, other]

    cs.LG

    Mitigating Catastrophic Forgetting in Long Short-Term Memory Networks

    Authors: Ketaki Joshi, Raghavendra Pradyumna Pothukuchi, Andre Wibisono, Abhishek Bhattacharjee

    Abstract: Continual learning on sequential data is critical for many machine learning (ML) deployments. Unfortunately, LSTM networks, which are commonly used to learn on sequential data, suffer from catastrophic forgetting and are limited in their ability to learn multiple tasks continually. We discover that catastrophic forgetting in LSTM networks can be overcome in two novel and readily-implementable ways…

    Submitted 26 May, 2023; originally announced May 2023.

  38. arXiv:2303.17646  [pdf, other]

    cs.CV

    XPert: Peripheral Circuit & Neural Architecture Co-search for Area and Energy-efficient Xbar-based Computing

    Authors: Abhishek Moitra, Abhiroop Bhattacharjee, Youngeun Kim, Priyadarshini Panda

    Abstract: The hardware-efficiency and accuracy of Deep Neural Networks (DNNs) implemented on In-memory Computing (IMC) architectures primarily depend on the DNN architecture and the peripheral circuit parameters. It is therefore essential to holistically co-search the network and peripheral parameters to achieve optimal performance. To this end, we propose XPert, which co-searches network architecture in ta…

    Submitted 21 November, 2023; v1 submitted 30 March, 2023; originally announced March 2023.

    Comments: Accepted to Design and Automation Conference (DAC)

    Journal ref: 60th DAC, 2023

  39. arXiv:2303.09024  [pdf, other]

    cs.CR eess.SY

    DeeBBAA: A benchmark Deep Black Box Adversarial Attack against Cyber-Physical Power Systems

    Authors: Arnab Bhattacharjee, Tapan K. Saha, Ashu Verma, Sukumar Mishra

    Abstract: An increased energy demand, and environmental pressure to accommodate higher levels of renewable energy and flexible loads like electric vehicles have led to numerous smart transformations in the modern power systems. These transformations make the cyber-physical power system highly susceptible to cyber-adversaries targeting its numerous operations. In this work, a novel black box adversarial atta…

    Submitted 15 March, 2023; originally announced March 2023.

  40. arXiv:2303.07627  [pdf, ps, other]

    cs.LG stat.ML

    Best arm identification in rare events

    Authors: Anirban Bhattacharjee, Sushant Vijayan, Sandeep K Juneja

    Abstract: We consider the best arm identification problem in the stochastic multi-armed bandit framework where each arm has a tiny probability of realizing large rewards while with overwhelming probability the reward is zero. A key application of this framework is in online advertising where click rates of advertisements could be a fraction of a single percent and final conversion to sales, while highly pro…

    Submitted 14 March, 2023; originally announced March 2023.

    Comments: 32 pages

  41. arXiv:2303.03697  [pdf, other]

    cs.CL cs.LG

    Stylometric Detection of AI-Generated Text in Twitter Timelines

    Authors: Tharindu Kumarage, Joshua Garland, Amrita Bhattacharjee, Kirill Trapeznikov, Scott Ruston, Huan Liu

    Abstract: Recent advancements in pre-trained language models have enabled convenient methods for generating human-like text at a large scale. Though these generation capabilities hold great potential for breakthrough applications, it can also be a tool for an adversary to generate misinformation. In particular, social media platforms like Twitter are highly susceptible to AI-generated misinformation. A pote…

    Submitted 7 March, 2023; originally announced March 2023.

  42. arXiv:2302.07769  [pdf, other]

    cs.LG cs.ET

    XploreNAS: Explore Adversarially Robust & Hardware-efficient Neural Architectures for Non-ideal Xbars

    Authors: Abhiroop Bhattacharjee, Abhishek Moitra, Priyadarshini Panda

    Abstract: Compute In-Memory platforms such as memristive crossbars are gaining focus as they facilitate acceleration of Deep Neural Networks (DNNs) with high area and compute-efficiencies. However, the intrinsic non-idealities associated with the analog nature of computing in crossbars limits the performance of the deployed DNNs. Furthermore, DNNs are shown to be vulnerable to adversarial attacks leading to…

    Submitted 15 April, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: Accepted to ACM Transactions on Embedded Computing Systems in April 2023

    Journal ref: ACM Transactions on Embedded Computing Systems (2023)

  43. arXiv:2302.04712  [pdf, other]

    cs.LG cs.ET

    DeepCAM: A Fully CAM-based Inference Accelerator with Variable Hash Lengths for Energy-efficient Deep Neural Networks

    Authors: Duy-Thanh Nguyen, Abhiroop Bhattacharjee, Abhishek Moitra, Priyadarshini Panda

    Abstract: With ever increasing depth and width in deep neural networks to achieve state-of-the-art performance, deep learning computation has significantly grown, and dot-products remain dominant in overall computation time. Most prior works are built on conventional dot-product where weighted input summation is used to represent the neuron operation. However, another implementation of dot-product based on…

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: Accepted to Design, Automation and Test in Europe (DATE) Conference, 2023

    Journal ref: Design, Automation and Test in Europe (DATE) Conference, 2023

  44. arXiv:2302.00102  [pdf, other]

    cs.CL cs.LG

    Towards Detecting Harmful Agendas in News Articles

    Authors: Melanie Subbiah, Amrita Bhattacharjee, Yilun Hua, Tharindu Kumarage, Huan Liu, Kathleen McKeown

    Abstract: Manipulated news online is a growing problem which necessitates the use of automated systems to curtail its spread. We argue that while misinformation and disinformation detection have been studied, there has been a lack of investment in the important open challenge of detecting harmful agendas in news articles; identifying harmful agendas is critical to flag news campaigns with the greatest poten…

    Submitted 2 August, 2023; v1 submitted 31 January, 2023; originally announced February 2023.

    Comments: Camera-ready for ACL-WASSA 2023. First two authors contributed equally

  45. arXiv:2301.10159  [pdf]

    cs.LG cs.CE cs.CY eess.SP

    Computational Solar Energy -- Ensemble Learning Methods for Prediction of Solar Power Generation based on Meteorological Parameters in Eastern India

    Authors: Debojyoti Chakraborty, Jayeeta Mondal, Hrishav Bakul Barua, Ankur Bhattacharjee

    Abstract: The challenges in applications of solar energy lie in its intermittency and dependency on meteorological parameters such as solar radiation, ambient temperature, rainfall, wind-speed etc., and many other physical parameters like dust accumulation etc. Hence, it is important to estimate the amount of solar photovoltaic (PV) power generation for a specific geographical location. Machine learning (…

    Submitted 21 January, 2023; originally announced January 2023.

    Comments: Accepted in Renewable Energy Focus (Elsevier)

    Journal ref: Renewable Energy Focus Volume 44, March 2023, Pages 277-294

  46. arXiv:2301.03103  [pdf, other]

    cs.DC cs.AR

    A Multi-Site Accelerator-Rich Processing Fabric for Scalable Brain-Computer Interfacing

    Authors: Karthik Sriram, Raghavendra Pradyumna Pothukuchi, Michał Gerasimiuk, Oliver Ye, Muhammed Ugur, Rajit Manohar, Anurag Khandelwal, Abhishek Bhattacharjee

    Abstract: Hull is an accelerator-rich distributed implantable Brain-Computer Interface (BCI) that reads biological neurons at data rates that are 2-3 orders of magnitude higher than the prior state of art, while supporting many neuroscientific applications. Prior approaches have restricted brain interfacing to tens of megabits per second in order to meet two constraints necessary for effective operation and…

    Submitted 8 January, 2023; originally announced January 2023.

    Comments: 16 pages, 13 figures

  47. arXiv:2211.10298  [pdf, other]

    cs.AI

    Reinforcement Learning Methods for Wordle: A POMDP/Adaptive Control Approach

    Authors: Siddhant Bhambri, Amrita Bhattacharjee, Dimitri Bertsekas

    Abstract: In this paper we address the solution of the popular Wordle puzzle, using new reinforcement learning methods, which apply more generally to adaptive control of dynamic systems and to classes of Partially Observable Markov Decision Process (POMDP) problems. These methods are based on approximation in value space and the rollout approach, admit a straightforward implementation, and provide improved…

    Submitted 29 November, 2022; v1 submitted 14 November, 2022; originally announced November 2022.

  48. arXiv:2210.12899  [pdf, other]

    cs.NE cs.CV

    SpikeSim: An end-to-end Compute-in-Memory Hardware Evaluation Tool for Benchmarking Spiking Neural Networks

    Authors: Abhishek Moitra, Abhiroop Bhattacharjee, Runcong Kuang, Gokul Krishnan, Yu Cao, Priyadarshini Panda

    Abstract: SNNs are an active research domain towards energy efficient machine intelligence. Compared to conventional ANNs, SNNs use temporal spike data and bio-plausible neuronal activation functions such as Leaky-Integrate Fire/Integrate Fire (LIF/IF) for data processing. However, SNNs incur significant dot-product operations causing high memory and computation overhead in standard von-Neumann computing pl…

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: 14 pages, 22 figures

  49. arXiv:2210.05109  [pdf, other]

    cs.CL

    BanglaParaphrase: A High-Quality Bangla Paraphrase Dataset

    Authors: Ajwad Akil, Najrin Sultana, Abhik Bhattacharjee, Rifat Shahriyar

    Abstract: In this work, we present BanglaParaphrase, a high-quality synthetic Bangla Paraphrase dataset curated by a novel filtering pipeline. We aim to take a step towards alleviating the low resource status of the Bangla language in the NLP domain through the introduction of BanglaParaphrase, which ensures quality by preserving both semantics and diversity, making it particularly useful to enhance other B…

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: AACL 2022 (camera-ready)

  50. arXiv:2206.11249  [pdf, other]

    cs.CL cs.AI cs.LG

    GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

    Authors: Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di Jin, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter, et al. (52 additional authors not shown)

    Abstract: Evaluation in machine learning is usually informed by past choices, for example which datasets or metrics to use. This standardization enables the comparison on equal footing using leaderboards, but the evaluation choices become sub-optimal as better alternatives arise. This problem is especially pertinent in natural language generation which requires ever-improving suites of datasets, metrics, an…

    Submitted 24 June, 2022; v1 submitted 22 June, 2022; originally announced June 2022.