Showing 1–50 of 164 results for author: Pal, A

  1. arXiv:2406.19314  [pdf, other]

    cs.CL cs.AI cs.LG

    LiveBench: A Challenging, Contamination-Free LLM Benchmark

    Authors: Colin White, Samuel Dooley, Manley Roberts, Arka Pal, Ben Feuer, Siddhartha Jain, Ravid Shwartz-Ziv, Neel Jain, Khalid Saifullah, Siddartha Naidu, Chinmay Hegde, Yann LeCun, Tom Goldstein, Willie Neiswanger, Micah Goldblum

    Abstract: Test set contamination, wherein test data from a benchmark ends up in a newer model's training set, is a well-documented obstacle for fair LLM evaluation and can quickly render benchmarks obsolete. To mitigate this, many recent benchmarks crowdsource new prompts and evaluations from human or LLM judges; however, these can introduce significant biases, and break down when scoring hard questions. In… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.17437  [pdf, other]

    cs.CV

    Advancing Question Answering on Handwritten Documents: A State-of-the-Art Recognition-Based Model for HW-SQuAD

    Authors: Aniket Pal, Ajoy Mondal, C. V. Jawahar

    Abstract: Question-answering handwritten documents is a challenging task with numerous real-world applications. This paper proposes a novel recognition-based approach that improves upon the previous state-of-the-art on the HW-SQuAD and BenthamQA datasets. Our model incorporates transformer-based document retrieval and ensemble methods at the model level, achieving an Exact Match score of 82.02% and 92.55% i… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: 16 pages

  3. arXiv:2406.08391  [pdf, other]

    cs.LG cs.AI cs.CL stat.ML

    Large Language Models Must Be Taught to Know What They Don't Know

    Authors: Sanyam Kapoor, Nate Gruver, Manley Roberts, Katherine Collins, Arka Pal, Umang Bhatt, Adrian Weller, Samuel Dooley, Micah Goldblum, Andrew Gordon Wilson

    Abstract: When using large language models (LLMs) in high-stakes applications, we need to know when we can trust their predictions. Some works argue that prompting high-performance LLMs is sufficient to produce calibrated uncertainties, while others introduce sampling methods that can be prohibitively expensive. In this work, we first argue that prompting on its own is insufficient to achieve good calibrati… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Code available at: https://github.com/activatedgeek/calibration-tuning

  4. arXiv:2405.14490  [pdf]

    cs.CL

    Impact of Non-Standard Unicode Characters on Security and Comprehension in Large Language Models

    Authors: Johan S Daniel, Anand Pal

    Abstract: The advancement of large language models has significantly improved natural language processing. However, challenges such as jailbreaks (prompt injections that cause an LLM to follow instructions contrary to its intended use), hallucinations (generating incorrect or misleading information), and comprehension errors remain prevalent. In this report, we present a comparative analysis of the performa… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 46 pages

  5. arXiv:2405.14176  [pdf, other]

    cs.LG cs.AI

    Certified Robustness against Sparse Adversarial Perturbations via Data Localization

    Authors: Ambar Pal, René Vidal, Jeremias Sulam

    Abstract: Recent work in adversarial robustness suggests that natural data distributions are localized, i.e., they place high probability in small volume regions of the input space, and that this property can be utilized for designing classifiers with improved robustness guarantees for $\ell_2$-bounded perturbations. Yet, it is still unclear if this observation holds true for more general metrics. In this w… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  6. arXiv:2405.01592  [pdf]

    cs.CL cs.AI

    Text and Audio Simplification: Human vs. ChatGPT

    Authors: Gondy Leroy, David Kauchak, Philip Harber, Ankit Pal, Akash Shukla

    Abstract: Text and audio simplification to increase information comprehension are important in healthcare. With the introduction of ChatGPT, an evaluation of its simplification performance is needed. We provide a systematic comparison of human and ChatGPT simplified texts using fourteen metrics indicative of text difficulty. We briefly introduce our online editor where these simplification tools, including… ▽ More

    Submitted 29 April, 2024; originally announced May 2024.

    Comments: AMIA Summit, Boston, 2024

    ACM Class: H.4

  7. arXiv:2404.19203  [pdf]

    physics.app-ph cs.AR eess.SY

    Thermal Performance of a Liquid-cooling Assisted Thin Wickless Vapor Chamber

    Authors: Arani Mukhopadhyay, Anish Pal, Mohamad Jafari Gukeh, Constantine M. Megaridis

    Abstract: The ever-increasing need for power consumption in electronic devices, coupled with the requirement for thinner size, calls for the development of efficient heat spreading components. Vapor chambers (VCs), because of their ability to effectively spread heat over a large area by two-phase heat transfer, seem ideal for such applications. However, creating thin and efficient vapor chambers that work o… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Presented at IEEE ITherm (Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems) 2023, Orlando, FL, US.

  8. arXiv:2404.19195  [pdf]

    eess.SY cs.AR physics.app-ph

    Evaluation of Thermal Performance of a Wick-free Vapor Chamber in Power Electronics Cooling

    Authors: Arani Mukhopadhyay, Anish Pal, Congbo Bao, Mohamad Jafari Gukeh, Sudip K. Mazumder, Constantine M. Megaridis

    Abstract: Efficient thermal management in high-power electronics cooling can be achieved using phase-change heat transfer devices, such as vapor chambers. Traditional vapor chambers use wicks to transport condensate for efficient thermal exchange and to prevent "dry-out" of the evaporator. However, wicks in vapor chambers present significant design challenges arising out of large pressure drops across the w… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Presented at IEEE ITherm (Intersociety Conference on Thermal and Thermomechanical Phenomena in Electronic Systems) 2023, Orlando, FL.

  9. arXiv:2404.08961  [pdf, other]

    cond-mat.stat-mech cs.PF eess.SY math.OC math.PR

    Queues with resetting: a perspective

    Authors: Reshmi Roy, Arup Biswas, Arnab Pal

    Abstract: Performance modeling is a key issue in queuing theory and operation research. It is well-known that the length of a queue that awaits service or the time spent by a job in a queue depends not only on the service rate, but also crucially on the fluctuations in service time. The larger the fluctuations, the longer the delay becomes and hence, this is a major hindrance for the queue to operate effici… ▽ More

    Submitted 13 April, 2024; originally announced April 2024.

    Comments: Invited Perspective article to Journal of Physics: Complexity

  10. arXiv:2402.17925  [pdf, other]

    cs.IR

    [RE] Modeling Personalized Item Frequency Information for Next-basket Recommendation

    Authors: Sławomir Garcarz, Avik Pal, Pim Praat

    Abstract: This paper focuses on reproducing and extending the results of the paper: "Modeling Personalized Item Frequency Information for Next-basket Recommendation" which introduced the TIFU-KNN model and proposed to utilize Personalized Item Frequency (PIF) for Next Basket Recommendation (NBR). We utilized publicly available grocery shopping datasets used in the original paper and incorporated additional… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  11. arXiv:2402.16168  [pdf, other]

    cs.CL cs.AI

    Hitting "Probe"rty with Non-Linearity, and More

    Authors: Avik Pal, Madhura Pawar

    Abstract: Structural probes learn a linear transformation to find how dependency trees are embedded in the hidden states of language models. This simple design may not allow for full exploitation of the structure of the encoded information. Hence, to investigate the structure of the encoded information to its full extent, we incorporate non-linear structural probes. We reformulate the design of non-linear s… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

  12. arXiv:2402.13228  [pdf, other]

    cs.CL cs.AI cs.LG

    Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive

    Authors: Arka Pal, Deep Karkhanis, Samuel Dooley, Manley Roberts, Siddartha Naidu, Colin White

    Abstract: Direct Preference Optimisation (DPO) is effective at significantly improving the performance of large language models (LLMs) on downstream tasks such as reasoning, summarisation, and alignment. Using pairs of preferred and dispreferred data, DPO models the \textit{relative} probability of picking one response over another. In this work, first we show theoretically that the standard DPO loss can le… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

  13. arXiv:2402.07023  [pdf, other]

    cs.CL cs.AI cs.CV cs.HC cs.LG

    Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & Hallucinations

    Authors: Ankit Pal, Malaikannan Sankarasubbu

    Abstract: Large language models have the potential to be valuable in the healthcare industry, but it's crucial to verify their safety and effectiveness through rigorous evaluation. For this purpose, we comprehensively evaluated both open-source LLMs and Google's new multimodal LLM called Gemini across Medical reasoning, hallucination detection, and Medical Visual Question Answering tasks. While Gemini showe… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: Preprint version, Under Review

  14. arXiv:2312.06719  [pdf, other]

    cs.CV

    SkyScenes: A Synthetic Dataset for Aerial Scene Understanding

    Authors: Sahil Khose, Anisha Pal, Aayushi Agarwal, Deepanshi, Judy Hoffman, Prithvijit Chattopadhyay

    Abstract: Real-world aerial scene understanding is limited by a lack of datasets that contain densely annotated images curated under a diverse set of conditions. Due to inherent challenges in obtaining such images in controlled real-world settings, we present SkyScenes, a synthetic dataset of densely annotated aerial images captured from Unmanned Aerial Vehicle (UAV) perspectives. We carefully curate SkySce… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  15. arXiv:2312.06129  [pdf, other]

    cs.RO

    Household navigation and manipulation for everyday object rearrangement tasks

    Authors: Shrutheesh R. Iyer, Anwesan Pal, Jiaming Hu, Akanimoh Adeleye, Aditya Aggarwal, Henrik I. Christensen

    Abstract: We consider the problem of building an assistive robotic system that can help humans in daily household cleanup tasks. Creating such an autonomous system in real-world environments is inherently quite challenging, as a general solution may not suit the preferences of a particular customer. Moreover, such a system consists of multi-objective tasks comprising -- (i) Detection of misplaced objects an… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: Paper accepted at IEEE IRC-2023

  16. arXiv:2311.12166  [pdf]

    eess.SP cs.LG

    Creating Temporally Correlated High-Resolution Power Injection Profiles Using Physics-Aware GAN

    Authors: Hritik Gopal Shah, Behrouz Azimian, Anamitra Pal

    Abstract: Traditional smart meter measurements lack the granularity needed for real-time decision-making. To address this practical problem, we create a generative adversarial networks (GAN) model that enforces temporal consistency on its high-resolution outputs via hard inequality constraints using a convex optimization layer. A unique feature of our GAN model is that it is trained solely on slow timescale… ▽ More

    Submitted 21 November, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

    Comments: 5 pages

  17. arXiv:2311.10885  [pdf, ps, other]

    cs.CV cs.RO

    A Video-Based Activity Classification of Human Pickers in Agriculture

    Authors: Abhishesh Pal, Antonio C. Leite, Jon G. O. Gjevestad, Pål J. From

    Abstract: In farming systems, harvesting operations are tedious, time- and resource-consuming tasks. Based on this, deploying a fleet of autonomous robots to work alongside farmworkers may provide vast productivity and logistics benefits. Then, an intelligent robotic system should monitor human behavior, identify the ongoing activities and anticipate the worker's needs. In this work, the main contribution c… ▽ More

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: 4 pages, 6 figures, 3 tables

  18. Analytical Verification of Performance of Deep Neural Network Based Time-Synchronized Distribution System State Estimation

    Authors: Behrouz Azimian, Shiva Moshtagh, Anamitra Pal, Shanshan Ma

    Abstract: Recently, we demonstrated success of a time-synchronized state estimator using deep neural networks (DNNs) for real-time unobservable distribution systems. In this letter, we provide analytical bounds on the performance of that state estimator as a function of perturbations in the input measurements. It has already been shown that evaluating performance based on only the test dataset might not eff… ▽ More

    Submitted 22 February, 2024; v1 submitted 12 November, 2023; originally announced November 2023.

    Comments: 8 pages, in Journal of Modern Power Systems and Clean Energy, 2023

  19. arXiv:2310.14063  [pdf, other]

    cs.RO cs.CV

    Concept-based Anomaly Detection in Retail Stores for Automatic Correction using Mobile Robots

    Authors: Aditya Kapoor, Vartika Sengar, Nijil George, Vighnesh Vatsal, Jayavardhana Gubbi, Balamuralidhar P, Arpan Pal

    Abstract: Tracking of inventory and rearrangement of misplaced items are some of the most labor-intensive tasks in a retail environment. While there have been attempts at using vision-based techniques for these tasks, they mostly use planogram compliance for detection of any anomalies, a technique that has been found lacking in robustness and scalability. Moreover, existing systems rely on human interventio… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: 8 pages, 9 figures, 2 tables, IEEE Transactions on Systems, Man and Cybernetics

  20. arXiv:2310.13279  [pdf, other]

    eess.IV cs.CV

    Pathologist-Like Explanations Unveiled: an Explainable Deep Learning System for White Blood Cell Classification

    Authors: Aditya Shankar Pal, Debojyoti Biswas, Joy Mahapatra, Debasis Banerjee, Prantar Chakrabarti, Utpal Garain

    Abstract: White blood cells (WBCs) play a crucial role in safeguarding the human body against pathogens and foreign substances. Leveraging the abundance of WBC imaging data and the power of deep learning algorithms, automated WBC analysis has the potential for remarkable accuracy. However, the capability of deep learning models to explain their WBC classification remains largely unexplored. In this study, w… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 11 pages including supplementary material

  21. arXiv:2310.13146  [pdf, other]

    cs.CL cs.AI cs.CY cs.DC cs.LG

    CLIFT: Analysing Natural Distribution Shift on Question Answering Models in Clinical Domain

    Authors: Ankit Pal

    Abstract: This paper introduces a new testbed CLIFT (Clinical Shift) for the clinical domain Question-answering task. The testbed includes 7.5k high-quality question answering samples to provide a diverse and reliable benchmark. We performed a comprehensive experimental study and evaluated several QA deep-learning models under the proposed testbed. Despite impressive results on the original test set, the pe… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Accepted at NeurIPS 2022 (Robustness in Sequence Modeling)

    Journal ref: 36th Conference on Neural Information Processing Systems, Robustness in Sequence Modeling (NeurIPS 2022)

  22. arXiv:2309.16096  [pdf, other]

    cs.LG cs.AI

    Adversarial Examples Might be Avoidable: The Role of Data Concentration in Adversarial Robustness

    Authors: Ambar Pal, Jeremias Sulam, René Vidal

    Abstract: The susceptibility of modern machine learning classifiers to adversarial examples has motivated theoretical results suggesting that these might be unavoidable. However, these results can be too general to be applicable to natural data distributions. Indeed, humans are quite robust for tasks involving vision. This apparent conflict motivates a deeper dive into the question: Are adversarial examples… ▽ More

    Submitted 25 May, 2024; v1 submitted 27 September, 2023; originally announced September 2023.

    Comments: Accepted to Neural Information Processing Systems (NeurIPS) 2023

  23. arXiv:2309.00993   

    cs.LG

    A Boosted Machine Learning Framework for the Improvement of Phase and Crystal Structure Prediction of High Entropy Alloys Using Thermodynamic and Configurational Parameters

    Authors: Debsundar Dey, Suchandan Das, Anik Pal, Santanu Dey, Chandan Kumar Raul, Arghya Chatterjee

    Abstract: The reason behind the remarkable properties of High-Entropy Alloys (HEAs) is rooted in the diverse phases and the crystal structures they contain. In the realm of material informatics, employing machine learning (ML) techniques to classify phases and crystal structures of HEAs has gained considerable significance. In this study, we assembled a new collection of 1345 HEAs with varying compositions… ▽ More

    Submitted 31 December, 2023; v1 submitted 2 September, 2023; originally announced September 2023.

    Comments: We want to modify this paper and extend some parts of it

  24. arXiv:2308.10882  [pdf, other]

    cs.AI cs.CL

    Giraffe: Adventures in Expanding Context Lengths in LLMs

    Authors: Arka Pal, Deep Karkhanis, Manley Roberts, Samuel Dooley, Arvind Sundararajan, Siddartha Naidu

    Abstract: Modern large language models (LLMs) that rely on attention mechanisms are typically trained with fixed context lengths which enforce upper limits on the length of input sequences that they can handle at evaluation time. To use these models on sequences longer than the train-time context length, one might employ techniques from the growing family of context length extrapolation methods -- most of w… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

  25. arXiv:2308.10170  [pdf, other]

    cs.CV cs.CL

    FashionNTM: Multi-turn Fashion Image Retrieval via Cascaded Memory

    Authors: Anwesan Pal, Sahil Wadhwa, Ayush Jaiswal, Xu Zhang, Yue Wu, Rakesh Chada, Pradeep Natarajan, Henrik I. Christensen

    Abstract: Multi-turn textual feedback-based fashion image retrieval focuses on a real-world setting, where users can iteratively provide information to refine retrieval results until they find an item that fits all their requirements. In this work, we present a novel memory-based method, called FashionNTM, for such a multi-turn system. Our framework incorporates a new Cascaded Memory Neural Turing Machine (… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: Paper accepted at ICCV-2023

  26. arXiv:2308.05101  [pdf, other]

    cs.LG cs.AI

    DOST -- Domain Obedient Self-supervised Training for Multi Label Classification with Noisy Labels

    Authors: Soumadeep Saha, Utpal Garain, Arijit Ukil, Arpan Pal, Sundeep Khandelwal

    Abstract: The enormous demand for annotated data brought forth by deep learning techniques has been accompanied by the problem of annotation noise. Although this issue has been widely discussed in machine learning literature, it has been relatively unexplored in the context of "multi-label classification" (MLC) tasks which feature more complicated kinds of noise. Additionally, when the domain in question ha… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: Submitted to IEEE TNNLS on March 7th 2023. 8 pages, 4 figures

    ACM Class: I.2.6; I.2.0

  27. arXiv:2307.15343  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Med-HALT: Medical Domain Hallucination Test for Large Language Models

    Authors: Ankit Pal, Logesh Kumar Umapathi, Malaikannan Sankarasubbu

    Abstract: This research paper focuses on the challenges posed by hallucinations in large language models (LLMs), particularly in the context of the medical domain. Hallucination, wherein these models generate plausible yet unverified or incorrect information, can have serious consequences in healthcare applications. We propose a new benchmark and dataset, Med-HALT (Medical Domain Hallucination Test), design… ▽ More

    Submitted 14 October, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: Accepted at EMNLP 2023(The SIGNLL Conference on Computational Natural Language Learning)

  28. arXiv:2306.17427  [pdf]

    cs.RO

    Modeling and parametric optimization of 3D tendon-sheath actuator system for upper limb soft exosuit

    Authors: Amit Yadav, Nitesh Kumar, Shaurya Surana, Aravind Ramasamy, Abhishek Rudra Pal, Sushma Santapuri, Lalan Kumar, Suriya Prakash Muthukrishnan, Shubhendu Bhasin, Sitikantha Roy

    Abstract: This paper presents an analysis of parametric characterization of a motor driven tendon-sheath actuator system for use in upper limb augmentation for applications such as rehabilitation, therapy, and industrial automation. The double tendon sheath system, which uses two sets of cables (agonist and antagonist side) guided through a sheath, is considered to produce smooth and natural-looking movemen… ▽ More

    Submitted 10 September, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

  29. arXiv:2306.16482  [pdf, other]

    cs.CV

    DenseBAM-GI: Attention Augmented DeneseNet with momentum aided GRU for HMER

    Authors: Aniket Pal, Krishna Pratap Singh

    Abstract: The task of recognising Handwritten Mathematical Expressions (HMER) is crucial in the fields of digital education and scholarly research. However, it is difficult to accurately determine the length and complex spatial relationships among symbols in handwritten mathematical expressions. In this study, we present a novel encoder-decoder architecture (DenseBAM-GI) for HMER, where the encoder has a Bo… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

  30. arXiv:2306.03379  [pdf, other]

    cs.CR cs.DB

    OptimShare: A Unified Framework for Privacy Preserving Data Sharing -- Towards the Practical Utility of Data with Privacy

    Authors: M. A. P. Chamikara, Seung Ick Jang, Ian Oppermann, Dongxi Liu, Musotto Roberto, Sushmita Ruj, Arindam Pal, Meisam Mohammady, Seyit Camtepe, Sylvia Young, Chris Dorrian, Nasir David

    Abstract: Tabular data sharing serves as a common method for data exchange. However, sharing sensitive information without adequate privacy protection can compromise individual privacy. Thus, ensuring privacy-preserving data sharing is crucial. Differential privacy (DP) is regarded as the gold standard in data privacy. Despite this, current DP methods tend to generate privacy-preserving tabular datasets tha… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  31. arXiv:2306.00013  [pdf, other]

    cs.CL cs.LG

    Machine Learning Approach for Cancer Entities Association and Classification

    Authors: G. Jeyakodi, Arkadeep Pal, Debapratim Gupta, K. Sarukeswari, V. Amouda

    Abstract: According to the World Health Organization (WHO), cancer is the second leading cause of death globally. Scientific research on different types of cancers grows at an ever-increasing rate, publishing large volumes of research articles every year. The insight information and the knowledge of the drug, diagnostics, risk, symptoms, treatments, etc., related to genes are significant factors that help e… ▽ More

    Submitted 24 June, 2023; v1 submitted 30 May, 2023; originally announced June 2023.

    Comments: This paper got accepted for paper presentation at the International Conference on Knowledge Discoveries on Statistical Innovations and Recent Advances in Optimization (ICON-KSRAO) on 29th and 30th December 2022

  32. arXiv:2305.04746  [pdf, other]

    cs.LG cs.AI

    Understanding Noise-Augmented Training for Randomized Smoothing

    Authors: Ambar Pal, Jeremias Sulam

    Abstract: Randomized smoothing is a technique for providing provable robustness guarantees against adversarial attacks while making minimal assumptions about a classifier. This method relies on taking a majority vote of any base classifier over multiple noise-perturbed inputs to obtain a smoothed classifier, and it remains the tool of choice to certify deep and complex neural network models. Nonetheless, no… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Transactions on Machine Learning Research, 2023

  33. arXiv:2304.04265  [pdf, ps, other]

    cs.PL

    Dependently Typing R Vectors, Arrays, and Matrices

    Authors: John Wrenn, Anjali Pal, Alexa VanHattum, Shriram Krishnamurthi

    Abstract: The R programming language is widely used in large-scale data analyses. It contains especially rich built-in support for dealing with vectors, arrays, and matrices. These operations feature prominently in the applications that form R's raison d'être, making their behavior worth understanding. Furthermore, ostensibly for programmer convenience, their behavior in R is a notable extension over the co… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

    Comments: 10 pages

    ACM Class: D.3

  34. End-to-End Latency Optimization of Multi-view 3D Reconstruction for Disaster Response

    Authors: Xiaojie Zhang, Mingjun Li, Andrew Hilton, Amitangshu Pal, Soumyabrata Dey, Saptarshi Debroy

    Abstract: In order to plan rapid response during disasters, first responder agencies often adopt `bring your own device' (BYOD) model with inexpensive mobile edge devices (e.g., drones, robots, tablets) for complex video analytics applications, e.g., 3D reconstruction of a disaster scene. Unlike simpler video applications, widely used Multi-view Stereo (MVS) based 3D reconstruction applications (e.g., openM… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

    Comments: 2022 10th IEEE International Conference on Mobile Cloud Computing, Services, and Engineering (MobileCloud)

  35. arXiv:2303.10782  [pdf, ps, other]

    cs.CL cs.CV

    On the Importance of Signer Overlap for Sign Language Detection

    Authors: Abhilash Pal, Stephan Huber, Cyrine Chaabani, Alessandro Manzotti, Oscar Koller

    Abstract: Sign language detection, identifying if someone is signing or not, is becoming crucially important for its applications in remote conferencing software and for selecting useful sign data for training sign language recognition or translation tasks. We argue that the current benchmark data sets for sign language detection estimate overly positive results that do not generalize well due to signer ove… ▽ More

    Submitted 19 March, 2023; originally announced March 2023.

  36. arXiv:2303.02262  [pdf, other]

    cs.LG cs.AI math.NA

    Locally Regularized Neural Differential Equations: Some Black Boxes Were Meant to Remain Closed!

    Authors: Avik Pal, Alan Edelman, Chris Rackauckas

    Abstract: Implicit layer deep learning techniques, like Neural Differential Equations, have become an important modeling framework due to their ability to adapt to new problems automatically. Training a neural differential equation is effectively a search over a space of plausible dynamical systems. However, controlling the computational cost for these models is difficult since it relies on the number of st… ▽ More

    Submitted 2 June, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: Proceedings of the 40th International Conference on Machine Learning, Honolulu, Hawaii, USA. PMLR 202, 2023

  37. arXiv:2302.13125  [pdf, other]

    cs.CY cs.CV

    Non-Intrusive Driver Behavior Characterization From Road-Side Cameras

    Authors: Pavana Pradeep Kumar, Krishna Kant, Amitangshu Pal

    Abstract: In this paper, we demonstrate a proof of concept for characterizing vehicular behavior using only the roadside cameras of the ITS system. The essential advantage of this method is that it can be implemented in the roadside infrastructure transparently and inexpensively and can have a global view of each vehicle's behavior without any involvement of or awareness by the individual vehicles or driver… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

  38. arXiv:2212.12861  [pdf, other]

    quant-ph cs.GR

    An efficient quantum-classical hybrid algorithm for distorted alphanumeric character identification

    Authors: Ankur Pal, Abhishek Shukla, Anirban Pathak

    Abstract: An algorithm for image processing is proposed. The proposed algorithm, which can be viewed as a quantum-classical hybrid algorithm, can transform a low-resolution bitonal image of a character from the set of alphanumeric characters (A-Z, 0-9) into a high-resolution image. The quantum part of the proposed algorithm fruitfully utilizes a variant of Grover's search algorithm, known as the fixed point… ▽ More

    Submitted 25 December, 2022; originally announced December 2022.

    Comments: A quantum-assisted algorithm for optical character recognition (OCR) is proposed using fixed point Grover's algorithm

  39. arXiv:2212.03243  [pdf]

    cs.LG physics.optics

    Machine Learning Assisted Inverse Design of Microresonators

    Authors: Arghadeep Pal, Alekhya Ghosh, Shuangyou Zhang, Toby Bi, Pascal Del'Haye

    Abstract: The high demand for fabricating microresonators with desired optical properties has led to various techniques to optimize geometries, mode structures, nonlinearities and dispersion. Depending on applications, the dispersion in such resonators counters their optical nonlinearities and influences the intracavity optical dynamics. In this paper, we demonstrate the use of a machine learning (ML) algor… ▽ More

    Submitted 10 November, 2022; originally announced December 2022.

    Comments: 9 pages, 4 figures

  40. arXiv:2212.01729  [pdf]

    eess.SP cs.LG

    Time-Synchronized Full System State Estimation Considering Practical Implementation Challenges

    Authors: Antos Cheeramban Varghese, Hritik Shah, Behrouz Azimian, Anamitra Pal, Evangelos Farantatos

    Abstract: As the phasor measurement unit (PMU) placement problem involves a cost-benefit trade-off, more PMUs get placed on the higher voltage buses. However, this causes many of the lower voltage levels of the bulk power system to not be observed by PMUs. This lack of visibility then makes time-synchronized state estimation of the full system a challenging problem. We propose a Deep Neural network-based St… ▽ More

    Submitted 21 March, 2024; v1 submitted 3 December, 2022; originally announced December 2022.

  41. arXiv:2212.00770  [pdf, other]

    cs.CV cs.AI

    On Utilizing Relationships for Transferable Few-Shot Fine-Grained Object Detection

    Authors: Ambar Pal, Arnau Ramisa, Amit Kumar K C, René Vidal

    Abstract: State-of-the-art object detectors are fast and accurate, but they require a large amount of well annotated training data to obtain good performance. However, obtaining a large amount of training annotations specific to a particular task, i.e., fine-grained annotations, is costly in practice. In contrast, obtaining common-sense relationships from text, e.g., "a table-lamp is a lamp that sits on top… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 8 pages, 3 figures

  42. arXiv:2211.15424  [pdf, other

    cs.CL cs.IR cs.LG

    DeepParliament: A Legal domain Benchmark & Dataset for Parliament Bills Prediction

    Authors: Ankit Pal

    Abstract: This paper introduces DeepParliament, a legal domain Benchmark Dataset that gathers bill documents and metadata and performs various bill status classification tasks. The proposed dataset text covers a broad range of bills from 1986 to the present and contains richer information on parliament bill content. Data collection, detailed statistics and analyses are provided in the paper. Moreover, we ex…

    Submitted 14 November, 2022; originally announced November 2022.

    Comments: Accepted at EMNLP 2022 (UM-IoS)

  43. arXiv:2211.07893  [pdf, other

    cs.LG cs.AI cs.CR cs.DC

    Federated Learning for Healthcare Domain - Pipeline, Applications and Challenges

    Authors: Madhura Joshi, Ankit Pal, Malaikannan Sankarasubbu

    Abstract: Federated learning is the process of developing machine learning models over datasets distributed across data centers such as hospitals, clinical research labs, and mobile devices while preventing data leakage. This survey examines previous research and studies on federated learning in the healthcare sector across a range of use cases and applications. Our survey shows what challenges, methods, an…

    Submitted 19 November, 2022; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: ACM Transactions on Computing for Healthcare, Vol. 3, No. 4, Article 40. Publication date: October 2022

    Journal ref: ACM Transactions on Computing for Healthcare, Vol. 3, No. 4, Article 40. Publication date: October 2022

  44. arXiv:2211.06295  [pdf

    cs.CY physics.flu-dyn q-bio.PE q-bio.QM

    A novel approach to preventing SARS-CoV-2 transmission in classrooms: An OpenFOAM based CFD Study

    Authors: Anish Pal, Riddhideep Biswas, Ritam Pal, Sourav Sarkar, Achintya Mukhopadhyay

    Abstract: The education sector has suffered a catastrophic setback due to ongoing COVID-pandemic, with classrooms being closed indefinitely. The current study aims to solve the existing dilemma by examining COVID transmission inside a classroom and providing long-term sustainable solutions. In this work, a standard 5m x 3m x 5m classroom is considered where 24 students are seated, accompanied by a teacher.…

    Submitted 12 October, 2022; originally announced November 2022.

  45. arXiv:2210.13326  [pdf, other

    cs.CL cs.CV

    Clean Text and Full-Body Transformer: Microsoft's Submission to the WMT22 Shared Task on Sign Language Translation

    Authors: Subhadeep Dey, Abhilash Pal, Cyrine Chaabani, Oscar Koller

    Abstract: This paper describes Microsoft's submission to the first shared task on sign language translation at WMT 2022, a public competition tackling sign language to spoken language translation for Swiss German sign language. The task is very challenging due to data scarcity and an unprecedented vocabulary size of more than 20k words on the target side. Moreover, the data is taken from real broadcast news…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: accepted for publication at WMT2022

  46. arXiv:2210.12456  [pdf, other

    cs.LG

    Abstract Interpretation-Based Feature Importance for SVMs

    Authors: Abhinandan Pal, Francesco Ranzato, Caterina Urban, Marco Zanella

    Abstract: We propose a symbolic representation for support vector machines (SVMs) by means of abstract interpretation, a well-known and successful technique for designing and implementing static program analyses. We leverage this abstraction in two ways: (1) to enhance the interpretability of SVMs by deriving a novel feature importance measure, called abstract feature importance (AFI), that does not depend…

    Submitted 22 October, 2022; originally announced October 2022.

  47. arXiv:2210.11728  [pdf, other

    cs.CV

    Distilling the Undistillable: Learning from a Nasty Teacher

    Authors: Surgan Jandial, Yash Khasbage, Arghya Pal, Vineeth N Balasubramanian, Balaji Krishnamurthy

    Abstract: The inadvertent stealing of private/sensitive information using Knowledge Distillation (KD) has been getting significant attention recently and has guided subsequent defense efforts considering its critical nature. Recent work Nasty Teacher proposed to develop teachers which can not be distilled or imitated by models attacking it. However, the promise of confidentiality offered by a nasty teacher…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Published in main track of ECCV 2022, 17 pages with references, 5 figures, 6 tables

    Journal ref: ECCV 2022

  48. arXiv:2210.08780  [pdf, other

    cs.RO eess.SY

    Sample-efficient Model Predictive Control Design of Soft Robotics by Bayesian Optimization

    Authors: Anuj Pal, Tianyi He, Wenpeng Wei

    Abstract: This paper presents a sample-efficient data-driven method to design model predictive control (MPC) for cable-actuated soft robotics using Bayesian optimization. Instead of modeling the complex dynamics of the soft robots, the proposed approach uses Bayesian optimization to search the best-guessed low-dimensional prediction model and its associated controller to minimize the objective function of c…

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: submitted to ACC 2023

  49. arXiv:2209.12474  [pdf, other

    cs.IR

    Legal Case Document Similarity: You Need Both Network and Text

    Authors: Paheli Bhattacharya, Kripabandhu Ghosh, Arindam Pal, Saptarshi Ghosh

    Abstract: Estimating the similarity between two legal case documents is an important and challenging problem, having various downstream applications such as prior-case retrieval and citation recommendation. There are two broad approaches for the task -- citation network-based and text-based. Prior citation network-based approaches consider citations only to prior-cases (also called precedents) (PCNet). This…

    Submitted 26 September, 2022; originally announced September 2022.

    Comments: This work has been published in Information Processing and Management, Elsevier, vol. 59, issue 6, November 2022

  50. arXiv:2208.11472  [pdf, ps, other

    eess.IV cs.CV

    A Deep Learning Approach Using Masked Image Modeling for Reconstruction of Undersampled K-spaces

    Authors: Kyler Larsen, Arghya Pal, Yogesh Rathi

    Abstract: Magnetic Resonance Imaging (MRI) scans are time consuming and precarious, since the patients remain still in a confined space for extended periods of time. To reduce scanning time, some experts have experimented with undersampled k spaces, trying to use deep learning to predict the fully sampled result. These studies report that as many as 20 to 30 minutes could be saved off a scan that takes an h…

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: 15 pages, 13 figures

    ACM Class: J.3; I.2.10