Skip to main content

Showing 1–50 of 381 results for author: Ghosh, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.18700  [pdf, other

    cs.CC

    On Fourier analysis of sparse Boolean functions over certain Abelian groups

    Authors: Sourav Chakraborty, Swarnalipa Datta, Pranjal Dutta, Arijit Ghosh, Swagato Sanyal

    Abstract: Given an Abelian group G, a Boolean-valued function f: G -> {-1,+1}, is said to be s-sparse, if it has at most s-many non-zero Fourier coefficients over the domain G. In a seminal paper, Gopalan et al. proved "Granularity" for Fourier coefficients of Boolean valued functions over Z_2^n, that have found many diverse applications in theoretical computer science and combinatorics. They also studied s… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  2. FLea: Addressing Data Scarcity and Label Skew in Federated Learning via Privacy-preserving Feature Augmentation

    Authors: Tong Xia, Abhirup Ghosh, Xinchi Qiu, Cecilia Mascolo

    Abstract: Federated Learning (FL) enables model development by leveraging data distributed across numerous edge devices without transferring local data to a central server. However, existing FL methods still face challenges when dealing with scarce and label-skewed data across devices, resulting in local model overfitting and drift, consequently hindering the performance of the global model. In response to… ▽ More

    Submitted 18 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: This work was intended as a replacement of arXiv:2312.02327 and any subsequent updates will appear there

  3. arXiv:2406.07661  [pdf, other

    cs.CV cs.RO

    ROADWork Dataset: Learning to Recognize, Observe, Analyze and Drive Through Work Zones

    Authors: Anurag Ghosh, Robert Tamburo, Shen Zheng, Juan R. Alvarez-Padilla, Hailiang Zhu, Michael Cardei, Nicholas Dunn, Christoph Mertz, Srinivasa G. Narasimhan

    Abstract: Perceiving and navigating through work zones is challenging and under-explored, even with major strides in self-driving research. An important reason is the lack of open datasets for develo** new algorithms to address this long-tailed scenario. We propose the ROADWork dataset to learn how to recognize, observe and analyze and drive through work zones. We find that state-of-the-art foundation mod… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  4. arXiv:2406.05288  [pdf, other

    cs.CV cs.AI cs.LG

    Optimal Eye Surgeon: Finding Image Priors through Sparse Generators at Initialization

    Authors: Avrajit Ghosh, Xitong Zhang, Kenneth K. Sun, Qing Qu, Saiprasad Ravishankar, Rongrong Wang

    Abstract: We introduce Optimal Eye Surgeon (OES), a framework for pruning and training deep image generator networks. Typically, untrained deep convolutional networks, which include image sampling operations, serve as effective image priors (Ulyanov et al., 2018). However, they tend to overfit to noise in image restoration tasks due to being overparameterized. OES addresses this by adaptively pruning networ… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: Pruning image generator networks at initialization to alleviate overfitting

    Journal ref: International Conference on Machine Learning (ICML 2024)

  5. arXiv:2406.04231  [pdf, other

    cs.MA cs.AI cs.CY cs.GT

    Quantifying Misalignment Between Agents

    Authors: Aidan Kierans, Avijit Ghosh, Hananel Hazan, Shiri Dori-Hacohen

    Abstract: Growing concerns about the AI alignment problem have emerged in recent years, with previous work focusing mainly on (1) qualitative descriptions of the alignment problem; (2) attempting to align AI actions with human interests by focusing on value specification and learning; and/or (3) focusing on a single agent or on humanity as a singular unit. Recent work in sociotechnical AI alignment has made… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 10 pages, 2 figures, 4 tables, submitted to AIES-24

    ACM Class: I.2.11; K.4.m

  6. arXiv:2406.04146  [pdf, other

    cs.CL

    Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness

    Authors: Guangliang Liu, Milad Afshari, Xitong Zhang, Zhiyu Xue, Avrajit Ghosh, Bidhan Bashyal, Rongrong Wang, Kristen Johnson

    Abstract: While task-agnostic debiasing provides notable generalizability and reduced reliance on downstream data, its impact on language modeling ability and the risk of relearning social biases from downstream task-specific data remain as the two most significant challenges when debiasing Pretrained Language Models (PLMs). The impact on language modeling ability can be alleviated given a high-quality and… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  7. arXiv:2406.03864  [pdf, other

    cs.LG

    PairNet: Training with Observed Pairs to Estimate Individual Treatment Effect

    Authors: Lokesh Nagalapatti, Pranava Singhal, Avishek Ghosh, Sunita Sarawagi

    Abstract: Given a dataset of individuals each described by a covariate vector, a treatment, and an observed outcome on the treatment, the goal of the individual treatment effect (ITE) estimation task is to predict outcome changes resulting from a change in treatment. A fundamental challenge is that in the observational data, a covariate's outcome is observed only under one treatment, whereas we need to infe… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Lokesh and Pranava contributed equally. Accepted at ICML-24

  8. arXiv:2406.01149  [pdf, ps, other

    stat.ML cs.AI cs.IT cs.LG

    Agnostic Learning of Mixed Linear Regressions with EM and AM Algorithms

    Authors: Avishek Ghosh, Arya Mazumdar

    Abstract: Mixed linear regression is a well-studied problem in parametric statistics and machine learning. Given a set of samples, tuples of covariates and labels, the task of mixed linear regression is to find a small list of linear relationships that best fit the samples. Usually it is assumed that the label is generated stochastically by randomly selecting one of two or more linear functions, applying th… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: To appear in ICML 2024

  9. arXiv:2406.00808  [pdf, other

    cs.CV

    EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing

    Authors: Hadrien Reynaud, Qingjie Meng, Mischa Dombrowski, Arijit Ghosh, Thomas Day, Alberto Gomez, Paul Leeson, Bernhard Kainz

    Abstract: To make medical datasets accessible without sharing sensitive patient information, we introduce a novel end-to-end approach for generative de-identification of dynamic medical imaging data. Until now, generative methods have faced constraints in terms of fidelity, spatio-temporal coherence, and the length of generation, failing to capture the complete details of dataset distributions. We present a… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Accepted at MICCAI 2024

  10. arXiv:2405.20933  [pdf, ps, other

    cs.LG stat.ML

    Concentration Bounds for Optimized Certainty Equivalent Risk Estimation

    Authors: Ayon Ghosh, L. A. Prashanth, Krishna Jagannathan

    Abstract: We consider the problem of estimating the Optimized Certainty Equivalent (OCE) risk from independent and identically distributed (i.i.d.) samples. For the classic sample average approximation (SAA) of OCE, we derive mean-squared error as well as concentration bounds (assuming sub-Gaussianity). Further, we analyze an efficient stochastic approximation-based OCE estimator, and derive finite sample b… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  11. arXiv:2405.10167  [pdf, ps, other

    cs.DS

    Near Uniform Triangle Sampling Over Adjacency List Graph Streams

    Authors: Arijit Bishnu, Arijit Ghosh, Gopinath Mishra, Sayantan Sen

    Abstract: Triangle counting and sampling are two fundamental problems for streaming algorithms. Arguably, designing sampling algorithms is more challenging than their counting variants. It may be noted that triangle counting has received far greater attention in the literature than the sampling variant. In this work, we consider the problem of approximately sampling triangles in different models of streamin… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 26 pages

  12. arXiv:2405.09589  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.SD eess.AS

    Unveiling Hallucination in Text, Image, Video, and Audio Foundation Models: A Comprehensive Survey

    Authors: Pranab Sahoo, Prabhash Meharia, Akash Ghosh, Sriparna Saha, Vinija Jain, Aman Chadha

    Abstract: The rapid advancement of foundation models (FMs) across language, image, audio, and video domains has shown remarkable capabilities in diverse tasks. However, the proliferation of FMs brings forth a critical challenge: the potential to generate hallucinated outputs, particularly in high-stakes applications. The tendency of foundation models to produce hallucinated content arguably represents the b… ▽ More

    Submitted 20 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

  13. arXiv:2405.06671  [pdf, other

    cs.CL cs.CE cs.LG

    Parameter-Efficient Instruction Tuning of Large Language Models For Extreme Financial Numeral Labelling

    Authors: Subhendu Khatuya, Rajdeep Mukherjee, Akash Ghosh, Manjunath Hegde, Koustuv Dasgupta, Niloy Ganguly, Saptarshi Ghosh, Pawan Goyal

    Abstract: We study the problem of automatically annotating relevant numerals (GAAP metrics) occurring in the financial documents with their corresponding XBRL tags. Different from prior works, we investigate the feasibility of solving this extreme classification problem using a generative paradigm through instruction tuning of Large Language Models (LLMs). To this end, we leverage metric metadata informatio… ▽ More

    Submitted 15 May, 2024; v1 submitted 3 May, 2024; originally announced May 2024.

    Comments: This work has been accepted to appear at North American Chapter of the Association for Computational Linguistics (NAACL), 2024

  14. arXiv:2405.02173  [pdf, other

    cs.HC cs.CY

    Task Synthesis for Elementary Visual Programming in XLogoOnline Environment

    Authors: Chao Wen, Ahana Ghosh, Jacqueline Staub, Adish Singla

    Abstract: In recent years, the XLogoOnline programming platform has gained popularity among novice learners. It integrates the Logo programming language with visual programming, providing a visual interface for learning computing concepts. However, XLogoOnline offers only a limited set of tasks, which are inadequate for learners to master the computing concepts that require sufficient practice. To address t… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Accepted as a paper at the AIED'24 conference in the late-breaking results track

  15. arXiv:2404.16156  [pdf, other

    quant-ph cs.AR cs.CR cs.LG

    Guardians of the Quantum GAN

    Authors: Archisman Ghosh, Debarshi Kundu, Avimita Chatterjee, Swaroop Ghosh

    Abstract: Quantum Generative Adversarial Networks (qGANs) are at the forefront of image-generating quantum machine learning models. To accommodate the growing demand for Noisy Intermediate-Scale Quantum (NISQ) devices to train and infer quantum machine learning models, the number of third-party vendors offering quantum hardware as a service is expected to rise. This expansion introduces the risk of untruste… ▽ More

    Submitted 15 May, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 11 pages, 10 figures

  16. arXiv:2404.14332  [pdf, other

    hep-ex cs.AI cs.LG hep-ph

    Full Event Particle-Level Unfolding with Variable-Length Latent Variational Diffusion

    Authors: Alexander Shmakov, Kevin Greif, Michael James Fenton, Aishik Ghosh, Pierre Baldi, Daniel Whiteson

    Abstract: The measurements performed by particle physics experiments must account for the imperfect response of the detectors used to observe the interactions. One approach, unfolding, statistically adjusts the experimental data for detector effects. Recently, generative machine learning models have shown promise for performing unbinned unfolding in a high number of dimensions. However, all current generati… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: Submission to SciPost

  17. arXiv:2404.10875  [pdf, other

    cs.AR

    A Dataset for Large Language Model-Driven AI Accelerator Generation

    Authors: Mahmoud Nazzal, Deepak Vungarala, Mehrdad Morsali, Chao Zhang, Arnob Ghosh, Abdallah Khreishah, Shaahin Angizi

    Abstract: In the ever-evolving landscape of Deep Neural Networks (DNN) hardware acceleration, unlocking the true potential of systolic array accelerators has long been hindered by the daunting challenges of expertise and time investment. Large Language Models (LLMs) offer a promising solution for automating code generation which is key to unlocking unprecedented efficiency and performance in various domains… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 4 pages, 4 Figures

  18. arXiv:2404.07214  [pdf, other

    cs.CV cs.AI cs.CL

    Exploring the Frontier of Vision-Language Models: A Survey of Current Methodologies and Future Directions

    Authors: Akash Ghosh, Arkadeep Acharya, Sriparna Saha, Vinija Jain, Aman Chadha

    Abstract: The advent of Large Language Models (LLMs) has significantly reshaped the trajectory of the AI revolution. Nevertheless, these LLMs exhibit a notable limitation, as they are primarily adept at processing textual information. To address this constraint, researchers have endeavored to integrate visual capabilities with LLMs, resulting in the emergence of Vision-Language Models (VLMs). These advanced… ▽ More

    Submitted 12 April, 2024; v1 submitted 20 February, 2024; originally announced April 2024.

    Comments: The most extensive and up to date Survey on Visual Language Models covering 76 Visual Language Models

  19. arXiv:2404.04224  [pdf, other

    cs.LG physics.chem-ph physics.data-an q-bio.BM

    Active Causal Learning for Decoding Chemical Complexities with Targeted Interventions

    Authors: Zachary R. Fox, Ayana Ghosh

    Abstract: Predicting and enhancing inherent properties based on molecular structures is paramount to design tasks in medicine, materials science, and environmental management. Most of the current machine learning and deep learning approaches have become standard for predictions, but they face challenges when applied across different datasets due to reliance on correlations between molecular representation a… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  20. arXiv:2404.04125  [pdf, other

    cs.CV cs.CL cs.LG

    No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance

    Authors: Vishaal Udandarao, Ameya Prabhu, Adhiraj Ghosh, Yash Sharma, Philip H. S. Torr, Adel Bibi, Samuel Albanie, Matthias Bethge

    Abstract: Web-crawled pretraining datasets underlie the impressive "zero-shot" evaluation performance of multimodal models, such as CLIP for classification/retrieval and Stable-Diffusion for image generation. However, it is unclear how meaningful the notion of "zero-shot" generalization is for such multimodal models, as it is not known to what extent their pretraining datasets encompass the downstream conce… ▽ More

    Submitted 8 April, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: Extended version of the short paper accepted at DPFM, ICLR'24

  21. arXiv:2404.00401  [pdf, other

    cs.CL

    How Robust are the Tabular QA Models for Scientific Tables? A Study using Customized Dataset

    Authors: Akash Ghosh, B Venkata Sahith, Niloy Ganguly, Pawan Goyal, Mayank Singh

    Abstract: Question-answering (QA) on hybrid scientific tabular and textual data deals with scientific information, and relies on complex numerical reasoning. In recent years, while tabular QA has seen rapid progress, understanding their robustness on scientific information is lacking due to absence of any benchmark dataset. To investigate the robustness of the existing state-of-the-art QA models on scientif… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

  22. arXiv:2403.12712  [pdf, other

    cs.CV cs.LG

    Addressing Source Scale Bias via Image War** for Domain Adaptation

    Authors: Shen Zheng, Anurag Ghosh, Srinivasa G. Narasimhan

    Abstract: In visual recognition, scale bias is a key challenge due to the imbalance of object and image size distribution inherent in real scene datasets. Conventional solutions involve injecting scale invariance priors, oversampling the dataset at different scales during training, or adjusting scale at inference. While these strategies mitigate scale bias to some extent, their ability to adapt across diver… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

  23. arXiv:2403.12227  [pdf, other

    cs.CY

    Analyzing-Evaluating-Creating: Assessing Computational Thinking and Problem Solving in Visual Programming Domains

    Authors: Ahana Ghosh, Liina Malva, Adish Singla

    Abstract: Computational thinking (CT) and problem-solving skills are increasingly integrated into K-8 school curricula worldwide. Consequently, there is a growing need to develop reliable assessments for measuring students' proficiency in these skills. Recent works have proposed tests for assessing these skills across various CT concepts and practices, in particular, based on multi-choice items enabling psy… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: This extended version of the SIGCSE 2024 paper includes all 21 test items from ACE along with their answers in the appendix

  24. arXiv:2403.06890  [pdf, other

    quant-ph cs.LG q-bio.BM

    Application of Quantum Tensor Networks for Protein Classification

    Authors: Debarshi Kundu, Archisman Ghosh, Srinivasan Ekambaram, Jian Wang, Nikolay Dokholyan, Swaroop Ghosh

    Abstract: We show that protein sequences can be thought of as sentences in natural language processing and can be parsed using the existing Quantum Natural Language framework into parameterized quantum circuits of reasonable qubits, which can be trained to solve various protein-related machine-learning problems. We classify proteins based on their subcellular locations, a pivotal task in bioinformatics that… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: 7 pages, 8 figures

  25. arXiv:2403.01234  [pdf

    cs.LG physics.chem-ph physics.comp-ph physics.data-an

    Active Deep Kernel Learning of Molecular Functionalities: Realizing Dynamic Structural Embeddings

    Authors: Ayana Ghosh, Maxim Ziatdinov and, Sergei V. Kalinin

    Abstract: Exploring molecular spaces is crucial for advancing our understanding of chemical properties and reactions, leading to groundbreaking innovations in materials science, medicine, and energy. This paper explores an approach for active learning in molecular discovery using Deep Kernel Learning (DKL), a novel approach surpassing the limits of classical Variational Autoencoders (VAEs). Employing the QM… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  26. arXiv:2402.17604  [pdf, ps, other

    cs.LO cs.FL

    Equivariant ideals of polynomials

    Authors: Arka Ghosh, Sławomir Lasota

    Abstract: We study existence and computability of finite bases for ideals of polynomials over infinitely many variables. In our setting, variables come from a countable logical structure A, and embeddings from A to A act on polynomials by renaming variables. First, we give a sufficient and necessary condition for A to guarantee the following generalisation of Hilbert's Basis Theorem: every polynomial ideal… ▽ More

    Submitted 22 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    ACM Class: F.1.1; F.4.1

  27. arXiv:2402.10012  [pdf, other

    math.CO cs.CG

    Countably Colorful Hyperplane Transversal

    Authors: Sutanoya Chakraborty, Arijit Ghosh, Soumi Nandi

    Abstract: Let $\left\{ \mathcal{F}_{n}\right\}_{n \in \mathbb{N}}$ be an infinite sequence of families of compact connected sets in $\mathbb{R}^{d}$. An infinite sequence of compact connected sets $\left\{ B_{n} \right\}_{n\in \mathbb{N}}$ is called heterochromatic sequence from $\left\{ \mathcal{F}_{n}\right\}_{n \in \mathbb{N}}$ if there exists an infinite sequence… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: 23 pages, 7 figures

    MSC Class: 52A35

  28. arXiv:2402.09030  [pdf, other

    cs.RO

    Awareness in robotics: An early perspective from the viewpoint of the EIC Pathfinder Challenge "Awareness Inside''

    Authors: Cosimo Della Santina, Carlos Hernandez Corbato, Burak Sisman, Luis A. Leiva, Ioannis Arapakis, Michalis Vakalellis, Jean Vanderdonckt, Luis Fernando D'Haro, Guido Manzi, Cristina Becchio, Aïda Elamrani, Mohsen Alirezaei, Ginevra Castellano, Dimos V. Dimarogonas, Arabinda Ghosh, Sofie Haesaert, Sadegh Soudjani, Sybert Stroeve, Paul Verschure, Davide Bacciu, Ophelia Deroy, Bahador Bahrami, Claudio Gallicchio, Sabine Hauert, Ricardo Sanz , et al. (6 additional authors not shown)

    Abstract: Consciousness has been historically a heavily debated topic in engineering, science, and philosophy. On the contrary, awareness had less success in raising the interest of scholars in the past. However, things are changing as more and more researchers are getting interested in answering questions concerning what awareness is and how it can be artificially generated. The landscape is rapidly evolvi… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  29. arXiv:2402.08541  [pdf, other

    cs.GT econ.TH

    Continuous-Time Best-Response and Related Dynamics in Tullock Contests with Convex Costs

    Authors: Edith Elkind, Abheek Ghosh, Paul W. Goldberg

    Abstract: Tullock contests model real-life scenarios that range from competition among proof-of-work blockchain miners to rent-seeking and lobbying activities. We show that continuous-time best-response dynamics in Tullock contests with convex costs converges to the unique equilibrium using Lyapunov-style arguments. We then use this result to provide an algorithm for computing an approximate equilibrium. We… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  30. arXiv:2402.07039  [pdf, other

    cs.AI cs.CR cs.CY

    Coordinated Disclosure for AI: Beyond Security Vulnerabilities

    Authors: Sven Cattell, Avijit Ghosh, Lucie-Aimée Kaffee

    Abstract: Harm reporting in the field of Artificial Intelligence (AI) currently operates on an ad hoc basis, lacking a structured process for disclosing or addressing algorithmic flaws. In contrast, the Coordinated Vulnerability Disclosure (CVD) ethos and ecosystem play a pivotal role in software security and transparency. Globally, there are ongoing efforts to establish frameworks that promote transparency… ▽ More

    Submitted 24 May, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

  31. arXiv:2402.05592  [pdf, other

    cs.HC

    MERP: Metaverse Extended Realtiy Portal

    Authors: Anisha Ghosh, Aditya Mitra, Anik Saha, Sibi Chakkaravarthy Sethuraman, Anitha Subramanian

    Abstract: A standardized control system called Metaverse Extended Reality Portal (MERP) is presented as a solution to the issues with conventional VR eyewear. The MERP system improves user awareness of the physical world while offering an immersive 3D view of the metaverse by using a shouldermounted projector to display a Heads-Up Display (HUD) in a designated Metaverse Experience Room. To provide natural a… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  32. arXiv:2402.01033  [pdf, other

    cs.IT eess.SP

    End-to-End Deep Learning for TDD MIMO Systems in the 6G Upper Midbands

    Authors: Juseong Park, Foad Sohrabi, Amitava Ghosh, Jeffrey G. Andrews

    Abstract: This paper proposes and analyzes novel deep learning methods for downlink (DL) single-user multiple-input multiple-output (SU-MIMO) and multi-user MIMO (MU-MIMO) systems operating in time division duplex (TDD) mode. A motivating application is the 6G upper midbands (7-24 GHz), where the base station (BS) antenna arrays are large, user equipment (UE) array sizes are moderate, and theoretically opti… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  33. arXiv:2401.17594  [pdf, other

    cs.IT

    5G NR Positioning Enhancements in 3GPP Release-18

    Authors: Hyun-Su Cha, Gilsoo Lee, Amitava Ghosh, Matthew Baker, Sean Kelley, Juergen Hofmann

    Abstract: New radio (NR) positioning in the Third Generation Partnership Project (3GPP) Release 18 (Rel-18) enables 5G-advanced networks to achieve ultra-high accuracy positioning without dependence on global navigation satellite systems (GNSS) with key enablers such as the carrier phase positioning technique, standardized for the first time in a cellular communications standard and setting a new baseline f… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

  34. MatSciRE: Leveraging Pointer Networks to Automate Entity and Relation Extraction for Material Science Knowledge-base Construction

    Authors: Ankan Mullick, Akash Ghosh, G Sai Chaitanya, Samir Ghui, Tapas Nayak, Seung-Cheol Lee, Satadeep Bhattacharjee, Pawan Goyal

    Abstract: Material science literature is a rich source of factual information about various categories of entities (like materials and compositions) and various relations between these entities, such as conductivity, voltage, etc. Automatically extracting this information to generate a material science knowledge base is a challenging task. In this paper, we propose MatSciRE (Material Science Relation Extrac… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Journal ref: Computational Material Science 2023 (Elsevier)

  35. arXiv:2401.01596  [pdf, other

    cs.AI cs.CL

    MedSumm: A Multimodal Approach to Summarizing Code-Mixed Hindi-English Clinical Queries

    Authors: Akash Ghosh, Arkadeep Acharya, Prince Jha, Aniket Gaudgaul, Rajdeep Majumdar, Sriparna Saha, Aman Chadha, Raghav Jain, Setu Sinha, Shivani Agarwal

    Abstract: In the healthcare domain, summarizing medical questions posed by patients is critical for improving doctor-patient interactions and medical decision-making. Although medical data has grown in complexity and quantity, the current body of research in this domain has primarily concentrated on text-based methods, overlooking the integration of visual cues. Also prior works in the area of medical quest… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: ECIR 2024

  36. arXiv:2401.00629  [pdf, other

    cs.LG

    Adversarially Trained Actor Critic for offline CMDPs

    Authors: Honghao Wei, Xiyue Peng, Xin Liu, Arnob Ghosh

    Abstract: We propose a Safe Adversarial Trained Actor Critic (SATAC) algorithm for offline reinforcement learning (RL) with general function approximation in the presence of limited data coverage. SATAC operates as a two-player Stackelberg game featuring a refined objective function. The actor (leader player) optimizes the policy against two adversarially trained value critics (follower players), who focus… ▽ More

    Submitted 31 December, 2023; originally announced January 2024.

  37. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  38. arXiv:2312.11541  [pdf, other

    cs.AI cs.CL

    CLIPSyntel: CLIP and LLM Synergy for Multimodal Question Summarization in Healthcare

    Authors: Akash Ghosh, Arkadeep Acharya, Raghav Jain, Sriparna Saha, Aman Chadha, Setu Sinha

    Abstract: In the era of modern healthcare, swiftly generating medical question summaries is crucial for informed and timely patient care. Despite the increasing complexity and volume of medical data, existing studies have focused solely on text-based summarization, neglecting the integration of visual information. Recognizing the untapped potential of combining textual queries with visual representations of… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  39. arXiv:2312.10725  [pdf, other

    cs.LG cs.AI cs.CV

    Addressing Sample Inefficiency in Multi-View Representation Learning

    Authors: Kumar Krishna Agrawal, Arna Ghosh, Adam Oberman, Blake Richards

    Abstract: Non-contrastive self-supervised learning (NC-SSL) methods like BarlowTwins and VICReg have shown great promise for label-free representation learning in computer vision. Despite the apparent simplicity of these techniques, researchers must rely on several empirical heuristics to achieve competitive performance, most notably using high-dimensional projector heads and two augmentations of the same i… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

  40. arXiv:2312.02327  [pdf, other

    cs.LG cs.CR cs.DC

    FLea: Improving federated learning on scarce and label-skewed data via privacy-preserving feature augmentation

    Authors: Tong Xia, Abhirup Ghosh, Cecilia Mascolo

    Abstract: Learning a global model by abstracting the knowledge, distributed across multiple clients, without aggregating the raw data is the primary goal of Federated Learning (FL). Typically, this works in rounds alternating between parallel local training at several clients, followed by model aggregation at a server. We found that existing FL methods under-perform when local datasets are small and present… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  41. arXiv:2311.17057  [pdf, other

    cs.CV

    ReMoS: 3D Motion-Conditioned Reaction Synthesis for Two-Person Interactions

    Authors: Anindita Ghosh, Rishabh Dabral, Vladislav Golyanik, Christian Theobalt, Philipp Slusallek

    Abstract: Current approaches for 3D human motion synthesis generate high-quality animations of digital humans performing a wide variety of actions and gestures. However, a notable technological gap exists in addressing the complex dynamics of multi-human interactions within this paradigm. In this work, we present ReMoS, a denoising diffusion-based model that synthesizes full-body reactive motion of a person… ▽ More

    Submitted 26 March, 2024; v1 submitted 28 November, 2023; originally announced November 2023.

    Comments: 17 pages, 7 figures, 5 tables

  42. arXiv:2311.13177  [pdf, other

    physics.med-ph cs.CV

    Volumetric Reconstruction Resolves Off-Resonance Artifacts in Static and Dynamic PROPELLER MRI

    Authors: Annesha Ghosh, Gordon Wetzstein, Mert Pilanci, Sara Fridovich-Keil

    Abstract: Off-resonance artifacts in magnetic resonance imaging (MRI) are visual distortions that occur when the actual resonant frequencies of spins within the imaging volume differ from the expected frequencies used to encode spatial information. These discrepancies can be caused by a variety of factors, including magnetic field inhomogeneities, chemical shifts, or susceptibility differences within the ti… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: Code is available at https://github.com/sarafridov/volumetric-propeller

  43. arXiv:2310.15119  [pdf, other

    cs.LG cs.AI

    Compressed Sensing of Generative Sparse-latent (GSL) Signals

    Authors: Antoine Honoré, Anubhab Ghosh, Saikat Chatterjee

    Abstract: We consider reconstruction of an ambient signal in a compressed sensing (CS) setup where the ambient signal has a neural network based generative model. The generative model has a sparse-latent input and we refer to the generated ambient signal as generative sparse-latent signal (GSL). The proposed sparsity inducing reconstruction algorithm is inherently non-convex, and we show that a gradient bas… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted at 31st European Signal Processing Conference, EUSIPCO 2023

  44. arXiv:2310.10543  [pdf, other

    cs.CL cs.CV

    ViPE: Visualise Pretty-much Everything

    Authors: Hassan Shahmohammadi, Adhiraj Ghosh, Hendrik P. A. Lensch

    Abstract: Figurative and non-literal expressions are profoundly integrated in human communication. Visualising such expressions allow us to convey our creative thoughts, and evoke nuanced emotions. Recent text-to-image models like Stable Diffusion, on the other hand, struggle to depict non-literal expressions. Recent works primarily deal with this issue by compiling humanly annotated datasets on a small sca… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: To be presented in EMNLP2023 Main Conference

  45. arXiv:2310.08914  [pdf, ps, other

    cs.SD cs.NE eess.AS

    Differential Evolution Algorithm based Hyper-Parameters Selection of Convolutional Neural Network for Speech Command Recognition

    Authors: Sandipan Dhar, Anuvab Sen, Aritra Bandyopadhyay, Nanda Dulal Jana, Arjun Ghosh, Zahra Sarayloo

    Abstract: Speech Command Recognition (SCR), which deals with identification of short uttered speech commands, is crucial for various applications, including IoT devices and assistive technology. Despite the promise shown by Convolutional Neural Networks (CNNs) in SCR tasks, their efficacy relies heavily on hyper-parameter selection, which is typically laborious and time-consuming when done manually. This pa… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 8 Pages, 7 Figures, 4 Tables, Accepted by the 15th International Joint Conference on Computational Intelligence (IJCCI 2023), November 13-15, 2023, Rome, Italy

    Journal ref: Proceedings of the 15th International Joint Conference on Computational Intelligence (2023)

  46. arXiv:2310.03528  [pdf, ps, other

    cs.GT econ.TH

    Best-Response Dynamics in Tullock Contests with Convex Costs

    Authors: Abheek Ghosh

    Abstract: We study the convergence of best-response dynamics in Tullock contests with convex cost functions (these games always have a unique pure-strategy Nash equilibrium). We show that best-response dynamics rapidly converges to the equilibrium for homogeneous agents. For two homogeneous agents, we show convergence to an $ε$-approximate equilibrium in $Θ(\log\log(1/ε))$ steps. For $n \ge 3$ agents, the d… ▽ More

    Submitted 6 October, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: 43 pages. WINE '23 version

  47. arXiv:2308.14435  [pdf, other

    cs.DL physics.soc-ph

    Do Successful Researchers Reach the Self-Organized Critical Point?

    Authors: Asim Ghosh, Bikas K. Chakrabarti

    Abstract: The index of success of the researchers is now mostly measured using the Hirsch index ($h$). Our recent precise demonstration, that statistically $h \sim \sqrt {N_c} \sim \sqrt {N_p}$, where $N_p$ and $N_c$ denote respectively the total number of publications and total citations for the researcher, suggests that average number of citations per paper ($N_c/N_p$), and hence $h$, are statistical numb… ▽ More

    Submitted 4 December, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: Invited contribution to Galam Special Issue in Physics (MDPI, in press)

  48. arXiv:2308.12199  [pdf, other

    cs.CV

    Towards Real-Time Analysis of Broadcast Badminton Videos

    Authors: Nitin Nilesh, Tushar Sharma, Anurag Ghosh, C. V. Jawahar

    Abstract: Analysis of player movements is a crucial subset of sports analysis. Existing player movement analysis methods use recorded videos after the match is over. In this work, we propose an end-to-end framework for player movement analysis for badminton matches on live broadcast match videos. We only use the visual inputs from the match and, unlike other approaches which use multi-modal sensor data, our… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

  49. arXiv:2308.10480  [pdf, other

    math.CO cs.CG

    Dimension Independent Helly Theorem for Lines and Flats

    Authors: Sutanoya Chakraborty, Arijit Ghosh, Soumi Nandi

    Abstract: We give a generalization of dimension independent Helly Theorem of Adiprasito, Bárány, Mustafa, and Terpai (Discrete & Computational Geometry 2022) to higher dimensional transversal. We also prove some impossibility results that establish the tightness of our extension.

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 10 pages

    MSC Class: 52A35

  50. arXiv:2308.10479  [pdf, ps, other

    math.CO cs.CG

    Stabbing boxes with finitely many axis-parallel lines and flats

    Authors: Sutanoya Chakraborty, Arijit Ghosh, Soumi Nandi

    Abstract: We give necessary and sufficient condition for an infinite collection of axis-parallel boxes in $\mathbb{R}^{d}$ to be pierceable by finitely many axis-parallel $k$-flats, where $0 \leq k < d$. We also consider colorful generalizations of the above result and establish their feasibility. The problem considered in this paper is an infinite variant of the Hadwiger-Debrunner $(p,q)$-problem.

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: 13 pages

    MSC Class: 52A35