Showing 1–50 of 165 results for author: Gupta, H

  1. arXiv:2406.15444  [pdf, other]

    cs.CL

    Investigating the Robustness of LLMs on Math Word Problems

    Authors: Ujjwala Anantheswaran, Himanshu Gupta, Kevin Scaria, Shreyas Verma, Chitta Baral, Swaroop Mishra

    Abstract: Large Language Models (LLMs) excel at various tasks, including solving math word problems (MWPs), but struggle with real-world problems containing irrelevant information. To address this, we propose a prompting framework that generates adversarial variants of MWPs by adding irrelevant variables. We introduce a dataset, ProbleMATHIC, containing both adversarial and non-adversarial MWPs. Our experim…

    Submitted 30 May, 2024; originally announced June 2024.

  2. arXiv:2406.14236  [pdf, other]

    quant-ph cs.DC

    NAC-QFL: Noise Aware Clustered Quantum Federated Learning

    Authors: Himanshu Sahu, Hari Prabhat Gupta

    Abstract: Recent advancements in quantum computing, alongside successful deployments of quantum communication, hold promises for revolutionizing mobile networks. While Quantum Machine Learning (QML) presents opportunities, it contends with challenges like noise in quantum devices and scalability. Furthermore, the high cost of quantum communication constrains the practical application of QML in real-world sc…

    Submitted 20 June, 2024; originally announced June 2024.

  3. arXiv:2405.16129  [pdf, other]

    cs.CL

    iREL at SemEval-2024 Task 9: Improving Conventional Prompting Methods for Brain Teasers

    Authors: Harshit Gupta, Manav Chaudhary, Tathagata Raha, Shivansh Subramanian, Vasudeva Varma

    Abstract: This paper describes our approach for SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense. The BRAINTEASER task comprises multiple-choice Question Answering designed to evaluate the models' lateral thinking capabilities. It consists of Sentence Puzzle and Word Puzzle subtasks that require models to defy default common-sense associations and exhibit unconventional thinking. We propo…

    Submitted 25 May, 2024; originally announced May 2024.

  4. arXiv:2405.11192  [pdf, other]

    cs.CL cs.SI

    BrainStorm @ iREL at SMM4H 2024: Leveraging Translation and Topical Embeddings for Annotation Detection in Tweets

    Authors: Manav Chaudhary, Harshit Gupta, Vasudeva Varma

    Abstract: The proliferation of LLMs in various NLP tasks has sparked debates regarding their reliability, particularly in annotation tasks where biases and hallucinations may arise. In this shared task, we address the challenge of distinguishing annotations made by LLMs from those made by human domain experts in the context of COVID-19 symptom detection from tweets in Latin American Spanish. This paper pres…

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: Submitted to SMM4H, colocated at ACL 2024

  5. arXiv:2405.07501  [pdf, other]

    quant-ph

    Optimized Generation of Entanglement by Real-Time Ordering of Swapping Operations

    Authors: Ranjani G Sundaram, Himanshu Gupta

    Abstract: Long-distance quantum communication in quantum networks faces significant challenges due to the constraints imposed by the no-cloning theorem. Most existing quantum communication protocols rely on the a priori distribution of entanglement pairs (EPs), a process known to incur considerable latency due to its stochastic nature. In this work, we consider the problem of minimizing the latency of estab…

    Submitted 13 May, 2024; originally announced May 2024.

  6. arXiv:2405.07499  [pdf, other]

    quant-ph cs.ET

    Distributed Quantum Computation with Minimum Circuit Execution Time over Quantum Networks

    Authors: Ranjani G Sundaram, Himanshu Gupta, C. R. Ramakrishnan

    Abstract: Present quantum computers are constrained by limited qubit capacity and restricted physical connectivity, leading to challenges in large-scale quantum computations. Distributing quantum computations across a network of quantum computers is a promising way to circumvent these challenges and facilitate large quantum computations. However, distributed quantum computations require entanglements (to ex…

    Submitted 13 May, 2024; originally announced May 2024.

  7. arXiv:2405.06069  [pdf, ps, other]

    math.CO math.RA

    Sufficient conditions for total positivity, compounds, and Dodgson condensation

    Authors: Shaun Fallat, Himanshu Gupta, Charles R. Johnson

    Abstract: An $n$-by-$n$ matrix is called totally positive ($TP$) if all its minors are positive and $TP_k$ if all of its $k$-by-$k$ submatrices are $TP$. For an arbitrary totally positive matrix or $TP_k$ matrix, we investigate if the $r$th compound ($1<r<n$) is in turn $TP$ or $TP_k$, and demonstrate a strong negative resolution in general. Focus is then shifted to Dodgson's algorithm for calculating the de…

    Submitted 9 May, 2024; originally announced May 2024.

    Comments: 10 pages, 2 figures

    MSC Class: 15B48 (Primary); 15A15; 15A24 (Secondary)

  8. arXiv:2405.00222  [pdf, other]

    quant-ph cs.NI

    Optimized Distribution of Entanglement Graph States in Quantum Networks

    Authors: Xiaojie Fan, Caitao Zhan, Himanshu Gupta, C. R. Ramakrishnan

    Abstract: Building large-scale quantum computers, essential to demonstrating quantum advantage, is a key challenge. Quantum Networks (QNs) can help address this challenge by enabling the construction of large, robust, and more capable quantum computing platforms by connecting smaller quantum computers. Moreover, unlike classical systems, QNs can enable fully secured long-distance communication. Thus, quantu…

    Submitted 30 April, 2024; originally announced May 2024.

    Comments: 11 pages, 13 figures

  9. arXiv:2404.07164  [pdf, other]

    cs.AR cs.AI cs.DC cs.LG

    Analysis of Distributed Optimization Algorithms on a Real Processing-In-Memory System

    Authors: Steve Rhyner, Haocong Luo, Juan Gómez-Luna, Mohammad Sadrosadati, Jiawei Jiang, Ataberk Olgun, Harshita Gupta, Ce Zhang, Onur Mutlu

    Abstract: Machine Learning (ML) training on large-scale datasets is a very expensive and time-consuming workload. Processor-centric architectures (e.g., CPU, GPU) commonly used for modern ML training workloads are limited by the data movement bottleneck, i.e., due to repeatedly accessing the training dataset. As a result, processor-centric systems suffer from performance degradation and high energy consumpt…

    Submitted 10 April, 2024; originally announced April 2024.

  10. arXiv:2404.00222  [pdf, ps, other]

    math.CO math.AC

    Positivity preservers over finite fields

    Authors: Dominique Guillot, Himanshu Gupta, Prateek Kumar Vishwakarma

    Abstract: We resolve an algebraic version of Schoenberg's celebrated theorem [Duke Math. J., 1942] characterizing entrywise matrix transforms that preserve positive definiteness. Compared to the classical real and complex settings, we consider matrices with entries in a finite field and obtain a complete characterization of such preservers for matrices of a fixed dimension. When the dimension of the matrice…

    Submitted 25 April, 2024; v1 submitted 29 March, 2024; originally announced April 2024.

    Comments: 29 pages, LaTeX; fixed typos and made minor corrections to the proof of Theorem 5.3 and of Theorem C

    MSC Class: 15B48 (primary); 15B33; 11T06; 05E30 (secondary)

  11. arXiv:2403.16399  [pdf, other]

    math.PR

    Transient Waiting Time Distributions in Small Call Centres with Skills-Based Routing

    Authors: Mark Fackrell, Hritika Gupta, Peter G. Taylor

    Abstract: Many call centres are subject to service level agreements that stipulate that they must achieve targets in terms of the proportion of calls that are answered within a specified time. In order to manage a centre so that targets like these are met, we need to have a method of calculating the waiting time distributions experienced by customers. In this paper, we provide such a method for small call c…

    Submitted 24 March, 2024; originally announced March 2024.

  12. arXiv:2402.06689  [pdf, other]

    q-fin.ST cs.LG

    A Study on Stock Forecasting Using Deep Learning and Statistical Models

    Authors: Himanshu Gupta, Aditya Jaiswal

    Abstract: Predicting a fast and accurate model for stock price forecasting has been a challenging task, and this is an active area of research where the best way to forecast the stock price is yet to be found. Machine learning, deep learning and statistical analysis techniques are used here to get accurate results so that investors can see the future trend and maximize the return of investment i…

    Submitted 8 February, 2024; originally announced February 2024.

  13. arXiv:2402.03796  [pdf, other]

    cs.CV cs.AI cs.LG

    Face Detection: Present State and Research Directions

    Authors: Purnendu Prabhat, Himanshu Gupta, Ajeet Kumar Vishwakarma

    Abstract: The majority of computer vision applications that handle images featuring humans use face detection as a core component. Face detection still has issues, despite much research on the topic. Face detection's accuracy and speed might yet be increased. This review paper shows the progress made in this area as well as the substantial issues that still need to be tackled. The paper provides research di…

    Submitted 6 February, 2024; originally announced February 2024.

  14. arXiv:2401.14521  [pdf]

    cs.LG cs.AI

    Towards Interpretable Physical-Conceptual Catchment-Scale Hydrological Modeling using the Mass-Conserving-Perceptron

    Authors: Yuan-Heng Wang, Hoshin V. Gupta

    Abstract: We investigate the applicability of machine learning technologies to the development of parsimonious, interpretable, catchment-scale hydrologic models using directed-graph architectures based on the mass-conserving perceptron (MCP) as the fundamental computational unit. Here, we focus on architectural complexity (depth) at a single location, rather than universal applicability (breadth) across lar…

    Submitted 22 May, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 65 pages, 8 Figures, 4 Tables, 1 Supplementary Material

  15. arXiv:2312.13234  [pdf, other]

    cs.LG

    Position Paper: Bridging the Gap Between Machine Learning and Sensitivity Analysis

    Authors: Christian A. Scholbeck, Julia Moosbauer, Giuseppe Casalicchio, Hoshin Gupta, Bernd Bischl, Christian Heumann

    Abstract: We argue that interpretations of machine learning (ML) models or the model-building process can be seen as a form of sensitivity analysis (SA), a general methodology used to explain complex systems in many fields such as environmental modeling, engineering, or economics. We address both researchers and practitioners, calling attention to the benefits of a unified SA-based view of explanations in…

    Submitted 20 December, 2023; originally announced December 2023.

  16. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  17. arXiv:2312.03941  [pdf, other]

    math.PR

    Minimising Numbers of Losses and Abandonments in Small Call Centres Under a Transient Regime

    Authors: Hritika Gupta, Peter G. Taylor

    Abstract: In this paper, we show how to calculate transient performance measures in models for small call centres that employ skills-based routing. In particular, we calculate the expected number of customer losses and call abandonments in a fixed time. We use the results to compare how call allocation policies can minimise the expected numbers of losses and abandonments, and make recommendations about whic…

    Submitted 6 December, 2023; originally announced December 2023.

  18. arXiv:2311.09564  [pdf, other]

    cs.CL cs.AI

    LongBoX: Evaluating Transformers on Long-Sequence Clinical Tasks

    Authors: Mihir Parmar, Aakanksha Naik, Himanshu Gupta, Disha Agrawal, Chitta Baral

    Abstract: Many large language models (LLMs) for medicine have largely been evaluated on short texts, and their ability to handle longer sequences such as a complete electronic health record (EHR) has not been systematically explored. Assessing these models on long sequences is crucial since prior work in the general domain has demonstrated performance degradation of LLMs on longer texts. Motivated by this,…

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: 8 pages

  19. arXiv:2311.08085  [pdf, other]

    eess.SY

    Optimizing Electric Vehicle Efficiency with Real-Time Telemetry using Machine Learning

    Authors: Aryaman Rao, Harshit Gupta, Parth Singh, Shivam Mittal, Utkrash Singh, Dinesh Kumar Vishwakarma

    Abstract: In the contemporary world with degrading natural resources, the urgency of energy efficiency has become imperative due to the conservation and environmental safeguarding. Therefore, it's crucial to look for advanced technology to minimize energy consumption. This research focuses on the optimization of battery-electric city style vehicles through the use of a real-time in-car telemetry system that…

    Submitted 14 November, 2023; originally announced November 2023.

  20. arXiv:2311.06658  [pdf]

    eess.SY stat.AP

    Reconfigurable Inspection in Manufacturing: State of the Art and Taxonomy

    Authors: Harshit Gupta, Ashok Kumar Madan

    Abstract: This article provides an overview of the evolution of the product quality and measurement inspection procedure with emphasis on the Reconfigurable Inspection System and Machine. The major components of a reconfigurable manufacturing system have been examined, and the evolution of manufacturing processes has been briefly discussed. Different Reconfigurable Inspection Machines (RIMs) and their arran…

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: 7th International Conference on Automation, Control and Robotics (ICACR) 2023

  21. arXiv:2311.04308  [pdf]

    physics.flu-dyn

    Application of Response Surface Method and Genetic Algorithm in the Design of High-Efficiency Prototype Vehicle

    Authors: Paras Singh, Harshit Gupta, Ojas Vinayak, Aryan Tyagi

    Abstract: Breakthroughs in aerodynamic optimization have made it possible to develop efficient modes of transport with lesser exploitation of valuable resources. This makes it crucial for technical professionals such as engineers and scientists to understand the methodologies behind carrying out such optimizations. A common approach towards improving the aerodynamic properties of a vehicle is to alter its p…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted at the 14th Asian Computational Fluid Dynamics Conference (ACFD 2023): 16 pages, 9 figures

  22. arXiv:2310.17876  [pdf, other]

    cs.CL

    TarGEN: Targeted Data Generation with Large Language Models

    Authors: Himanshu Gupta, Kevin Scaria, Ujjwala Anantheswaran, Shreyas Verma, Mihir Parmar, Saurabh Arjun Sawant, Chitta Baral, Swaroop Mishra

    Abstract: The rapid advancement of large language models (LLMs) has sparked interest in data synthesis techniques, aiming to generate diverse and high-quality synthetic datasets. However, these synthetic datasets often suffer from a lack of diversity and added noise. In this paper, we present TarGEN, a multi-step prompting strategy for generating high-quality synthetic datasets utilizing an LLM. An advantage…

    Submitted 30 October, 2023; v1 submitted 26 October, 2023; originally announced October 2023.

    Comments: 10 pages, 6 tables, 5 figures, 5 pages references, 17 pages appendix

  23. arXiv:2310.08644  [pdf]

    cs.LG cs.AI

    A Mass-Conserving-Perceptron for Machine Learning-Based Modeling of Geoscientific Systems

    Authors: Yuan-Heng Wang, Hoshin V. Gupta

    Abstract: Although decades of effort have been devoted to building Physical-Conceptual (PC) models for predicting the time-series evolution of geoscientific systems, recent work shows that Machine Learning (ML) based Gated Recurrent Neural Network technology can be used to develop models that are much more accurate. However, the difficulty of extracting physical understanding from ML-based models complicate…

    Submitted 12 May, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: 68 pages, 7 figures in the main text, 10 figures, and 10 tables in the supplementary materials

  24. arXiv:2309.12281  [pdf]

    physics.optics cond-mat.mtrl-sci

    Bound states in the continuum and long-range coupling of polaritons in hexagonal boron nitride nanoresonators

    Authors: Harsh Gupta, Giacomo Venturi, Tatiana Contino, Eli Janzen, James H. Edgar, Francesco de Angelis, Andrea Toma, Antonio Ambrosio, Michele Tamagnone

    Abstract: Bound states in the continuum (BICs) garnered significant attention for their potential to create new types of nanophotonic devices. Most prior demonstrations were based on arrays of dielectric resonators, which cannot be miniaturized beyond the diffraction limit, reducing the applicability of BICs for advanced functions. Here, we demonstrate BICs and quasi-BICs based on high-quality factor phonon-polariton…

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 13 pages, 5 figures, early version

  25. arXiv:2309.07330  [pdf, other]

    cs.CV

    Automated Assessment of Critical View of Safety in Laparoscopic Cholecystectomy

    Authors: Yunfan Li, Himanshu Gupta, Haibin Ling, IV Ramakrishnan, Prateek Prasanna, Georgios Georgakis, Aaron Sasson

    Abstract: Cholecystectomy (gallbladder removal) is one of the most common procedures in the US, with more than 1.2M procedures annually. Compared with classical open cholecystectomy, laparoscopic cholecystectomy (LC) is associated with significantly shorter recovery period, and hence is the preferred method. However, LC is also associated with an increase in bile duct injuries (BDIs), resulting in significa…

    Submitted 13 September, 2023; originally announced September 2023.

  26. arXiv:2309.06545  [pdf, other]

    cs.CR cs.AR

    Evaluating Homomorphic Operations on a Real-World Processing-In-Memory System

    Authors: Harshita Gupta, Mayank Kabra, Juan Gómez-Luna, Konstantinos Kanellopoulos, Onur Mutlu

    Abstract: Computing on encrypted data is a promising approach to reduce data security and privacy risks, with homomorphic encryption serving as a facilitator in achieving this goal. In this work, we accelerate homomorphic operations using the Processing-in-Memory (PIM) paradigm to mitigate the large memory capacity and frequent data movement requirements. Using a real-world PIM system, we accelerate the Br…

    Submitted 3 October, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: This work will be presented at IISWC 2023

  27. A Dataset of Inertial Measurement Units for Handwritten English Alphabets

    Authors: Hari Prabhat Gupta, Rahul Mishra

    Abstract: This paper presents an end-to-end methodology for collecting datasets to recognize handwritten English alphabets by utilizing Inertial Measurement Units (IMUs) and leveraging the diversity present in the Indian writing style. The IMUs are utilized to capture the dynamic movement patterns associated with handwriting, enabling more accurate recognition of alphabets. The Indian context introduces var…

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: 10 pages, 12 figures

  28. arXiv:2307.02409  [pdf, other]

    cs.DC

    Utility-Aware Load Shedding for Real-time Video Analytics at the Edge

    Authors: Enrique Saurez, Harshit Gupta, Henriette Roger, Sukanya Bhowmik, Umakishore Ramachandran, Kurt Rothermel

    Abstract: Real-time video analytics typically require video frames to be processed by a query to identify objects or activities of interest while adhering to an end-to-end frame processing latency constraint. Such applications impose a continuous and heavy load on backend compute and network infrastructure because of the need to stream and process all video frames. Video data has inherent redundancy and doe…

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: This work was supported by the German Research Foundation (DFG) under the research grant "PRECEPT II" (RO 1086/19-2 and BH 154/1-2)

  29. Optimizing Initial State of Detector Sensors in Quantum Sensor Networks

    Authors: Caitao Zhan, Himanshu Gupta, Mark Hillery

    Abstract: In this paper, we consider a network of quantum sensors, where each sensor is a qubit detector that "fires," i.e., its state changes when an event occurs close by. The change in state due to the firing of a detector is given by a unitary operator which is the same for all sensors in the network. Such a network of detectors can be used to localize an event, using a protocol to determine the firing…

    Submitted 7 March, 2024; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: 25 pages (single column), 9 figures. Paper accepted at ACM Transactions on Quantum Computing

  30. arXiv:2306.08872  [pdf, other]

    cs.CL cs.AI

    Neural models for Factual Inconsistency Classification with Explanations

    Authors: Tathagata Raha, Mukund Choudhary, Abhinav Menon, Harshit Gupta, KV Aditya Srivatsa, Manish Gupta, Vasudeva Varma

    Abstract: Factual consistency is one of the most important requirements when editing high quality documents. It is extremely important for automatic text generation systems like summarization, question answering, dialog modeling, and language modeling. Still, automated factual inconsistency detection is rather under-studied. Existing work has focused on (a) finding fake news keeping a knowledge base in cont…

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: ECML-PKDD 2023

  31. arXiv:2306.05539  [pdf, other]

    cs.CL

    Instruction Tuned Models are Quick Learners

    Authors: Himanshu Gupta, Saurabh Arjun Sawant, Swaroop Mishra, Mutsumi Nakamura, Arindam Mitra, Santosh Mashetty, Chitta Baral

    Abstract: Instruction tuning of language models has demonstrated the ability to enhance model generalization to unseen tasks via in-context learning using a few examples. However, typical supervised learning still requires a plethora of downstream training data for finetuning. Often in real-world situations, there is a scarcity of data available for finetuning, falling somewhere between few shot inference a…

    Submitted 17 May, 2023; originally announced June 2023.

    Comments: 9 pages, 5 figures, 19 Tables (including appendix), 12 pages of Appendix

  32. arXiv:2306.04207  [pdf, ps, other]

    cs.DC

    Resource Aware Clustering for Tackling the Heterogeneity of Participants in Federated Learning

    Authors: Rahul Mishra, Hari Prabhat Gupta, Garvit Banga

    Abstract: Federated Learning is a training framework that enables multiple participants to collaboratively train a shared model while preserving data privacy and minimizing communication overhead. The heterogeneity of devices and networking resources of the participants delay the training and aggregation in federated learning. This paper proposes a federated learning approach to manoeuvre the heterogeneity…

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: 13 pages, 4 figures

  33. arXiv:2306.00195  [pdf, other]

    quant-ph cs.ET

    Distributing Quantum Circuits Using Teleportations

    Authors: Ranjani G Sundaram, Himanshu Gupta

    Abstract: Scalability is currently one of the most sought-after objectives in the field of quantum computing. Distributing a quantum circuit across a quantum network is one way to facilitate large computations using current quantum computers. In this paper, we consider the problem of distributing a quantum circuit across a network of heterogeneous quantum computers, while minimizing the number of teleportat…

    Submitted 31 May, 2023; originally announced June 2023.

  34. arXiv:2305.16357  [pdf, other]

    cs.CL

    EDM3: Event Detection as Multi-task Text Generation

    Authors: Ujjwala Anantheswaran, Himanshu Gupta, Mihir Parmar, Kuntal Kumar Pal, Chitta Baral

    Abstract: Event detection refers to identifying event occurrences in a text and comprises two subtasks: event identification and classification. We present EDM3, a novel approach for Event Detection that formulates three generative tasks: identification, classification, and combined detection. We show that EDM3 helps to learn transferable knowledge that can be leveraged to perform Event Detection and its…

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: 9 pages, 4 figures, 10 tables, 5 Page appendix

  35. arXiv:2305.05079  [pdf, other]

    cs.CL

    A Unified Evaluation Framework for Novelty Detection and Accommodation in NLP with an Instantiation in Authorship Attribution

    Authors: Neeraj Varshney, Himanshu Gupta, Eric Robertson, Bing Liu, Chitta Baral

    Abstract: State-of-the-art natural language processing models have been shown to achieve remarkable performance in 'closed-world' settings where all the labels in the evaluation set are known at training time. However, in real-world settings, 'novel' instances that do not belong to any known class are often observed. This renders the ability to deal with novelties crucial. To initiate a systematic research…

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  36. arXiv:2304.12483  [pdf, other]

    cs.CV

    Towards Realistic Generative 3D Face Models

    Authors: Aashish Rai, Hiresh Gupta, Ayush Pandey, Francisco Vicente Carrasco, Shingo Jason Takagi, Amaury Aubel, Daeil Kim, Aayush Prakash, Fernando de la Torre

    Abstract: In recent years, there has been significant progress in 2D generative face models fueled by applications such as animation, synthetic data generation, and digital avatars. However, due to the absence of 3D information, these 2D models often struggle to accurately disentangle facial attributes like pose, expression, and illumination, limiting their editing capabilities. To address this limitation,…

    Submitted 26 October, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: Preprint

  37. arXiv:2304.08302  [pdf]

    cs.NI

    Fog Computing & IoT: Overview, Architecture and Applications

    Authors: Harshit Gupta, Dr. Ajay Kumar Bharti

    Abstract: Fog computing is an emerging technology in the field of network services where data is transferred from one device to another to perform some kind of activity. Fog computing is an extended concept of cloud computing. It works in-between the Internet of Things (IoT) and cloud data centers and reduces the communication gaps. Fog computing has made it possible to have decreased latency and low network congest…

    Submitted 14 April, 2023; originally announced April 2023.

    Comments: 5 pages, 2 figures

  38. arXiv:2302.08884  [pdf, other]

    quant-ph

    Quantum Computing Toolkit From Nuts and Bolts to Sack of Tools

    Authors: Himanshu Sahu, Hari Prabhat Gupta

    Abstract: Quantum computing has the potential to provide exponential performance benefits in processing over classical computing. It utilizes quantum mechanics phenomena (such as superposition, entanglement, and interference) to solve a computational problem. It can explore atypical patterns over data that classical computers can't perform efficiently. Quantum computers are in the nascent stage of developme…

    Submitted 6 March, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

    Comments: 27 Pages, 29 figures

  39. arXiv:2302.08624  [pdf, other]

    cs.CL cs.LG

    InstructABSA: Instruction Learning for Aspect Based Sentiment Analysis

    Authors: Kevin Scaria, Himanshu Gupta, Siddharth Goyal, Saurabh Arjun Sawant, Swaroop Mishra, Chitta Baral

    Abstract: We introduce InstructABSA, an instruction learning paradigm for Aspect-Based Sentiment Analysis (ABSA) subtasks. Our method introduces positive, negative, and neutral examples to each training sample, and instruction-tunes the model (Tk-Instruct) for ABSA subtasks, yielding significant performance improvements. Experimental results on the SemEval 2014, 15, and 16 datasets demonstrate that Instruct…

    Submitted 13 November, 2023; v1 submitted 16 February, 2023; originally announced February 2023.

    Comments: 4 pages, 3 figures, 9 tables, 9 appendix pages

  40. arXiv:2301.07760  [pdf]

    astro-ph.GA astro-ph.IM

    Astronomical Detection of the Interstellar Anion C10H- towards TMC-1 from the GOTHAM Large Program on the GBT

    Authors: Anthony Remijan, Haley N. Scolati, Andrew M. Burkhardt, P. Bryan Changala, Steven B. Charnley, Ilsa R. Cooke, Martin A. Cordiner, Harshal Gupta, Eric Herbst, Kin Long Kelvin Lee, Ryan Loomis, Christopher N. Shingledecker, Mark A. Siebert, Ci Xue, Michael C. McCarthy, Brett A. McGuire

    Abstract: Using data from the GOTHAM (GBT Observations of TMC-1: Hunting for Aromatic Molecules) survey, we report the first astronomical detection of the C10H- anion. The astronomical observations also provided the necessary data to refine the spectroscopic parameters of C10H-. From the velocity stacked data and the matched filter response, C10H- is detected at >9σ confidence level at a column density of 4…

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: 38 Pages, 24 Figures, 12 Tables, 8 Appendices

    MSC Class: 85-11

    ACM Class: A.1

  41. arXiv:2301.04027  [pdf]

    cs.LG cs.CE physics.ao-ph physics.geo-ph

    Differentiable modeling to unify machine learning and physical models and advance Geosciences

    Authors: Chaopeng Shen, Alison P. Appling, Pierre Gentine, Toshiyuki Bandai, Hoshin Gupta, Alexandre Tartakovsky, Marco Baity-Jesi, Fabrizio Fenicia, Daniel Kifer, Li Li, Xiaofeng Liu, Wei Ren, Yi Zheng, Ciaran J. Harman, Martyn Clark, Matthew Farthing, Dapeng Feng, Praveen Kumar, Doaa Aboelyazeed, Farshid Rahmani, Hylke E. Beck, Tadd Bindas, Dipankar Dwivedi, Kuai Fang, Marvin Höge, et al. (5 additional authors not shown)

    Abstract: Process-Based Modeling (PBM) and Machine Learning (ML) are often perceived as distinct paradigms in the geosciences. Here we present differentiable geoscientific modeling as a powerful pathway toward dissolving the perceived barrier between them and ushering in a paradigm shift. For decades, PBM offered benefits in interpretability and physical consistency but struggled to efficiently leverage lar…

    Submitted 26 December, 2023; v1 submitted 10 January, 2023; originally announced January 2023.

    Journal ref: Nat Rev Earth Environ 4, 552-567 (2023)

  42. A Roadmap to Domain Knowledge Integration in Machine Learning

    Authors: Himel Das Gupta, Victor S. Sheng

    Abstract: Many machine learning algorithms have been developed in recent years to enhance the performance of a model in different aspects of artificial intelligence. But the problem persists due to inadequate data and resources. Integrating knowledge in a machine learning model can help to overcome these obstacles up to a certain degree. Incorporating knowledge is a complex task though because of various fo…

    Submitted 12 December, 2022; originally announced December 2022.

  43. arXiv:2211.08182  [pdf, other]

    cs.CV cs.RO

    Grasping the Inconspicuous

    Authors: Hrishikesh Gupta, Stefan Thalhammer, Markus Leitner, Markus Vincze

    Abstract: Transparent objects are common in day-to-day life and hence find many applications that require robot grasping. Many solutions toward object grasping exist for non-transparent objects. However, due to the unique visual properties of transparent objects, standard 3D sensors produce noisy or distorted measurements. Modern approaches tackle this problem by either refining the noisy depth measurements…

    Submitted 15 November, 2022; originally announced November 2022.

  44. Quantum Sensor Network Algorithms for Transmitter Localization

    Authors: Caitao Zhan, Himanshu Gupta

    Abstract: A quantum sensor (QS) is able to measure various physical phenomena with extreme sensitivity. QSs have been used in several applications such as atomic interferometers, but few applications of a quantum sensor network (QSN) have been proposed or developed. We look at a natural application of QSN -- localization of an event (in particular, of a wireless signal transmitter). In this paper, we develo…

    Submitted 31 July, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: 11 pages, 10 figures, IEEE QCE 2023

  45. arXiv:2210.17348  [pdf, other]

    astro-ph.SR astro-ph.GA physics.atm-clus physics.chem-ph

    Laboratory and astronomical discovery of magnesium dicarbide, MgC$_2$

    Authors: P. B. Changala, H. Gupta, J. Cernicharo, J. R. Pardo, M. Agúndez, C. Cabezas, B. Tercero, M. Guélin, M. C. McCarthy

    Abstract: We report the detection of magnesium dicarbide, MgC$_2$, in the laboratory at centimeter wavelengths and assign $^{24}$MgC$_2$, $^{25}$MgC$_2$, and $^{26}$MgC$_2$ to 14 unidentified lines in the radio spectrum of the circumstellar envelope of the evolved carbon star IRC+10216. The structure of MgC$_2$ is found to be T-shaped with a highly ionic bond between the metal atom and the C$_2$ unit, analo…

    Submitted 31 October, 2022; originally announced October 2022.

    Comments: 15 pages, 5 figures

  46. Discrete outcome quantum sensor networks

    Authors: Mark Hillery, Himanshu Gupta, Caitao Zhan

    Abstract: We model a quantum sensor network using techniques from quantum state discrimination. The interaction between a qubit detector and the environment is described by a unitary operator, and we will assume that at most one detector does interact. The task is to determine which one does or if none do. This involves choosing an initial state of the detectors and a measurement. We consider global measure…

    Submitted 30 May, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: Replaced with published version

    Journal ref: Physical Review A 107, 012435 (2023)

  47. arXiv:2210.11762  [pdf, other]

    cs.CL

    Detecting Unintended Social Bias in Toxic Language Datasets

    Authors: Nihar Sahoo, Himanshu Gupta, Pushpak Bhattacharyya

    Abstract: With the rise of online hate speech, automatic detection of Hate Speech, Offensive texts as a natural language processing task is getting popular. However, very little research has been done to detect unintended social bias from these toxic language datasets. This paper introduces a new dataset ToxicBias curated from the existing dataset of Kaggle competition named "Jigsaw Unintended Bias in Toxic…

    Submitted 21 October, 2022; originally announced October 2022.

  48. arXiv:2210.07471  [pdf, other]

    cs.CL

    "John is 50 years old, can his son be 65?" Evaluating NLP Models' Understanding of Feasibility

    Authors: Himanshu Gupta, Neeraj Varshney, Swaroop Mishra, Kuntal Kumar Pal, Saurabh Arjun Sawant, Kevin Scaria, Siddharth Goyal, Chitta Baral

    Abstract: In current NLP research, large-scale language models and their abilities are widely being discussed. Some recent works have also found notable failures of these models. Often these failure examples involve complex reasoning abilities. This work focuses on a simple commonsense ability, reasoning about when an action (or its effect) is feasible. To this end, we introduce FeasibilityQA, a question-an…

    Submitted 2 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: EACL 2023

  49. arXiv:2210.01588  [pdf, other]

    cs.CV

    Cross-Geography Generalization of Machine Learning Methods for Classification of Flooded Regions in Aerial Images

    Authors: Sushant Lenka, Pratyush Kerhalkar, Pranav Shetty, Harsh Gupta, Bhavam Vidyarthi, Ujjwal Verma

    Abstract: Identification of regions affected by floods is a crucial piece of information required for better planning and management of post-disaster relief and rescue efforts. Traditionally, remote sensing images are analysed to identify the extent of damage caused by flooding. The data acquired from sensors onboard earth observation satellites are analyzed to detect the flooded regions, which can be affec…

    Submitted 4 October, 2022; originally announced October 2022.

  50. arXiv:2209.15560  [pdf, ps, other]

    cs.LG cs.NE

    Designing and Training of Lightweight Neural Networks on Edge Devices using Early Halting in Knowledge Distillation

    Authors: Rahul Mishra, Hari Prabhat Gupta

    Abstract: Automated feature extraction capability and significant performance of Deep Neural Networks (DNN) make them suitable for Internet of Things (IoT) applications. However, deploying DNN on edge devices becomes prohibitive due to the colossal computation, energy, and storage requirements. This paper presents a novel approach for designing and training lightweight DNN using large-size DNN. The approach…

    Submitted 30 September, 2022; originally announced September 2022.

    Comments: 13 pages, 7 figures, 11 tables