Skip to main content

Showing 1–50 of 675 results for author: Agarwal, S

.
  1. arXiv:2407.00758  [pdf, other

    quant-ph cond-mat.mes-hall physics.optics

    Quantum noise induced nonreciprocity for single photon transport in parity-time symmetric systems

    Authors: Dibyendu Roy, G. S. Agarwal

    Abstract: We show nonreciprocal light propagation for single-photon inputs due to quantum noise in coupled optical systems with gain and loss. We consider two parity-time ($\mathcal{PT}$) symmetric linear optical systems consisting of either two directly coupled resonators or two finite-length waveguides evanescently coupled in parallel. One resonator or waveguide is filled with an active gain medium and th… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 12 pages, 4 figures

  2. arXiv:2406.12313  [pdf

    cs.DB

    A framework for develo** a knowledge management platform

    Authors: Marie Lisandra Zepeda Mendoza, Sonali Agarwal, James A. Blackshaw, Vanesa Bol, Audrey Fazzi, Filippo Fiorini, Amy Louise Foreman, Nancy George, Brett R. Johnson, Brian Martin, Dave McComb, Euphemia Mutasa-Gottgens, Helen Parkinson, Martin Romacker, Rolf Russell, Valérien Ségard, Shawn Zheng Kai Tan, Wei Kheng Teh, F. P. Winstanley, Benedict Wong, Adrian M. Smith

    Abstract: Knowledge management (KM) involves collecting, organizing, storing, and disseminating information to improve decision-making, innovation, and performance. Implementing KM at scale has become essential for organizations to effectively leverage vast accessible data. This paper is a compilation of concepts that emerged from KM workshops hosted by EMBL-EBI, attended by SMEs and industry. We provide gu… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 18 pages, 1 figure

  3. arXiv:2406.05276  [pdf, other

    cs.LG

    VTrans: Accelerating Transformer Compression with Variational Information Bottleneck based Pruning

    Authors: Oshin Dutta, Ritvik Gupta, Sumeet Agarwal

    Abstract: In recent years, there has been a growing emphasis on compressing large pre-trained transformer models for resource-constrained devices. However, traditional pruning methods often leave the embedding layer untouched, leading to model over-parameterization. Additionally, they require extensive compression time with large datasets to maintain performance in pruned models. To address these challenges… ▽ More

    Submitted 11 June, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  4. arXiv:2406.03142  [pdf, ps, other

    cs.LG

    On the Power of Randomization in Fair Classification and Representation

    Authors: Sushant Agarwal, Amit Deshpande

    Abstract: Fair classification and fair representation learning are two important problems in supervised and unsupervised fair machine learning, respectively. Fair classification asks for a classifier that maximizes accuracy on a given data distribution subject to fairness constraints. Fair representation maps a given data distribution over the original feature space to a distribution over a new representati… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Appeared in ACM FAccT 2022

  5. arXiv:2405.20405  [pdf, other

    cs.DS cs.CR cs.IT cs.LG stat.ML

    Private Mean Estimation with Person-Level Differential Privacy

    Authors: Sushant Agarwal, Gautam Kamath, Mahbod Majid, Argyris Mouzakis, Rose Silver, Jonathan Ullman

    Abstract: We study differentially private (DP) mean estimation in the case where each person holds multiple samples. Commonly referred to as the "user-level" setting, DP here requires the usual notion of distributional stability when all of a person's datapoints can be modified. Informally, if $n$ people each have $m$ samples from an unknown $d$-dimensional distribution with bounded $k$-th moments, we show… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: 67 pages, 3 figures

  6. arXiv:2405.16101  [pdf, other

    quant-ph physics.atom-ph

    Entanglement generation in weakly-driven arrays of multilevel atoms via dipolar interactions

    Authors: Sanaa Agarwal, A. Piñeiro Orioli, J. K. Thompson, A. M. Rey

    Abstract: We investigate the driven-dissipative dynamics of 1D and 2D arrays of multilevel atoms interacting via dipole-dipole interactions and trapped at subwavelength scales. Here we show that in the weakly driven low excitation regime, multilevel atoms, in contrast to two-level atoms, can become strongly entangled. The entanglement manifests as the growth of collective spin-waves in the ground state mani… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

    Comments: 7+11 pages, 4+14 figures

  7. arXiv:2405.15947  [pdf, other

    quant-ph

    Mitigating scattering in a quantum system using only an integrating sphere

    Authors: Zhenfei Jiang, Tian Li, Matthew L. Boone, Zhenhuan Yi, Alexei V. Sokolov, Girish S. Agarwal, Marlan O. Scully

    Abstract: Strong quantum-correlated sources are essential but delicate resources for quantum information science and engineering protocols. Decoherence and loss are the two main disruptive processes that lead to the loss of nonclassical behavior in quantum correlations. In quantum systems, scattering can contribute to both decoherence and loss. In this work, we present an experimental scheme capable of sign… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 7 pages, 4 figures

  8. arXiv:2405.15152  [pdf, other

    cs.CL cs.AI

    Machine Unlearning in Large Language Models

    Authors: Saaketh Koundinya Gundavarapu, Shreya Agarwal, Arushi Arora, Chandana Thimmalapura Jagadeeshaiah

    Abstract: Machine unlearning, a novel area within artificial intelligence, focuses on addressing the challenge of selectively forgetting or reducing undesirable knowledge or behaviors in machine learning models, particularly in the context of large language models (LLMs). This paper introduces a methodology to align LLMs, such as Open Pre-trained Transformer Language Models, with ethical, privacy, and safet… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 10 pages

  9. arXiv:2405.12433  [pdf, other

    cs.AI

    LLM+Reasoning+Planning for supporting incomplete user queries in presence of APIs

    Authors: Sudhir Agarwal, Anu Sreepathy, David H. Alonso, Prarit Lamba

    Abstract: Recent availability of Large Language Models (LLMs) has led to the development of numerous LLM-based approaches aimed at providing natural language interfaces for various end-user tasks. These end-user tasks in turn can typically be accomplished by orchestrating a given set of APIs. In practice, natural language task requests (user queries) are often incomplete, i.e., they may not contain all the… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: 9 pages main content, 2 pages references, 12 pages appendix, 5 figures

  10. arXiv:2405.12102  [pdf, other

    quant-ph physics.chem-ph

    Collective Quantum Entanglement in Molecular Cavity Optomechanics

    Authors: Jian Huang, Dangyuan Lei, Girish S. Agarwal, Zhedong Zhang

    Abstract: We propose an optomechanical scheme for reaching quantum entanglement in vibration polaritons. The system involves $N$ molecules, whose vibrations can be fairly entangled with plasmonic cavities. We find that the vibration-photon entanglement can exist at room temperature and is robust against thermal noise. We further demonstrate the quantum entanglement between the vibrational modes through the… ▽ More

    Submitted 25 May, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 10 pages, 4 figures

  11. arXiv:2405.11346  [pdf

    cs.AI

    Decision support system for Forest fire management using Ontology with Big Data and LLMs

    Authors: Ritesh Chandra, Shashi Shekhar Kumar, Rushil Patra, Sonali Agarwal

    Abstract: Forests are crucial for ecological balance, but wildfires, a major cause of forest loss, pose significant risks. Fire weather indices, which assess wildfire risk and predict resource demands, are vital. With the rise of sensor networks in fields like healthcare and environmental monitoring, semantic sensor networks are increasingly used to gather climatic data such as wind speed, temperature, and… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  12. arXiv:2405.11215  [pdf, other

    cs.CL cs.CY

    MemeMQA: Multimodal Question Answering for Memes via Rationale-Based Inferencing

    Authors: Siddhant Agarwal, Shivam Sharma, Preslav Nakov, Tanmoy Chakraborty

    Abstract: Memes have evolved as a prevalent medium for diverse communication, ranging from humour to propaganda. With the rising popularity of image-focused content, there is a growing need to explore its potential harm from different aspects. Previous studies have analyzed memes in closed settings - detecting harm, applying semantic labels, and offering natural language explanations. To extend this researc… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

    Comments: The paper has been accepted in ACL'24 (Findings)

  13. arXiv:2405.09612  [pdf, other

    astro-ph.HE

    Imprint of 'local opacity' effect in gamma-ray spectrum of blazar jet

    Authors: Sushmita Agarwal, Amit Shukla, Karl Mannheim, Bhargav Vaidya, Biswajit Banerjee

    Abstract: Relativistic jets from accreting supermassive black holes at cosmological distances can be powerful emitters of $γ$-rays. However, the precise mechanisms and locations responsible for the dissipation of energy within these jets, leading to observable $γ$-ray radiation, remain elusive. We detect evidence for an intrinsic absorption feature in the $γ$-ray spectrum at energies exceeding $10\,$GeV, pr… ▽ More

    Submitted 18 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

    Comments: 10 pages, 3 figures, 1 table, Accepted for publication in ApJL

  14. arXiv:2405.08015  [pdf, other

    cs.LG cs.AI

    A Methodology-Oriented Study of Catastrophic Forgetting in Incremental Deep Neural Networks

    Authors: Ashutosh Kumar, Sonali Agarwal, D Jude Hemanth

    Abstract: Human being and different species of animals having the skills to gather, transferring knowledge, processing, fine-tune and generating information throughout their lifetime. The ability of learning throughout their lifespan is referred as continuous learning which is using neurocognition mechanism. Consequently, in real world computational system of incremental learning autonomous agents also need… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  15. arXiv:2405.07284  [pdf

    cs.CV cs.AI

    Zero Shot Context-Based Object Segmentation using SLIP (SAM+CLIP)

    Authors: Saaketh Koundinya Gundavarapu, Arushi Arora, Shreya Agarwal

    Abstract: We present SLIP (SAM+CLIP), an enhanced architecture for zero-shot object segmentation. SLIP combines the Segment Anything Model (SAM) \cite{kirillov2023segment} with the Contrastive Language-Image Pretraining (CLIP) \cite{radford2021learning}. By incorporating text prompts into SAM using CLIP, SLIP enables object segmentation without prior training on specific classes or categories. We fine-tune… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 5 pages, 3 figures

  16. arXiv:2405.05658  [pdf

    eess.IV cs.CV

    Artificial intelligence for abnormality detection in high volume neuroimaging: a systematic review and meta-analysis

    Authors: Siddharth Agarwal, David A. Wood, Mariusz Grzeda, Chandhini Suresh, Munaib Din, James Cole, Marc Modat, Thomas C Booth

    Abstract: Purpose: Most studies evaluating artificial intelligence (AI) models that detect abnormalities in neuroimaging are either tested on unrepresentative patient cohorts or are insufficiently well-validated, leading to poor generalisability to real-world tasks. The aim was to determine the diagnostic test accuracy and summarise the evidence supporting the use of AI models performing first-line, high-vo… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  17. arXiv:2405.05647  [pdf

    cs.CV

    Letter to the Editor: What are the legal and ethical considerations of submitting radiology reports to ChatGPT?

    Authors: Siddharth Agarwal, David Wood, Robin Carpenter, Yiran Wei, Marc Modat, Thomas C Booth

    Abstract: This letter critically examines the recent article by Infante et al. assessing the utility of large language models (LLMs) like GPT-4, Perplexity, and Bard in identifying urgent findings in emergency radiology reports. While acknowledging the potential of LLMs in generating labels for computer vision, concerns are raised about the ethical implications of using patient data without explicit approva… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  18. arXiv:2405.03113  [pdf, other

    cs.RO cs.AI

    Robot Air Hockey: A Manipulation Testbed for Robot Learning with Reinforcement Learning

    Authors: Caleb Chuck, Carl Qi, Michael J. Munje, Shuozhe Li, Max Rudolph, Chang Shi, Siddhant Agarwal, Harshit Sikchi, Abhinav Peri, Sarthak Dayal, Evan Kuo, Kavan Mehta, Anthony Wang, Peter Stone, Amy Zhang, Scott Niekum

    Abstract: Reinforcement Learning is a promising tool for learning complex policies even in fast-moving and object-interactive domains where human teleoperation or hard-coded policies might fail. To effectively reflect this challenging category of tasks, we introduce a dynamic, interactive RL testbed based on robot air hockey. By augmenting air hockey with a large family of tasks ranging from easy tasks like… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

  19. arXiv:2405.02782  [pdf

    cs.CV

    A self-supervised text-vision framework for automated brain abnormality detection

    Authors: David A. Wood, Emily Guilhem, Sina Kafiabadi, Ayisha Al Busaidi, Kishan Dissanayake, Ahmed Hammam, Nina Mansoor, Matthew Townend, Siddharth Agarwal, Yiran Wei, Asif Mazumder, Gareth J. Barker, Peter Sasieni, Sebastien Ourselin, James H. Cole, Thomas C. Booth

    Abstract: Artificial neural networks trained on large, expert-labelled datasets are considered state-of-the-art for a range of medical image recognition tasks. However, categorically labelled datasets are time-consuming to generate and constrain classification to a pre-defined, fixed set of classes. For neuroradiological applications in particular, this represents a barrier to clinical adoption. To address… ▽ More

    Submitted 11 June, 2024; v1 submitted 4 May, 2024; originally announced May 2024.

    Comments: Under Review

  20. Reliable Student: Addressing Noise in Semi-Supervised 3D Object Detection

    Authors: Farzad Nozarian, Shashank Agarwal, Farzaneh Rezaeianaran, Danish Shahzad, Atanas Poibrenski, Christian Müller, Philipp Slusallek

    Abstract: Semi-supervised 3D object detection can benefit from the promising pseudo-labeling technique when labeled data is limited. However, recent approaches have overlooked the impact of noisy pseudo-labels during training, despite efforts to enhance pseudo-label quality through confidence-based filtering. In this paper, we examine the impact of noisy pseudo-labels on IoU-based target assignment and prop… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: Accepted at CVPR Workshop L3D-IVU 2023. Code: https://github.com/fnozarian/ReliableStudent

  21. arXiv:2404.17510  [pdf, other

    physics.optics

    Kerr Nonlinearity Induced Nonreciprocity in dissipatively coupled resonators

    Authors: Qingtian Miao, G. S. Agarwal

    Abstract: Nonlinearity induced nonreciprocity is studied in a system comprising two resonators coupled to a one-dimensional waveguide when the linear system does not exhibit nonreciprocity. The analysis is based on the Hamiltonian of the coupled system and includes the dissipative coupling between the waveguide and resonators, along with the input-output relations. We consider a large number of scenarios wh… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  22. arXiv:2404.16710  [pdf, other

    cs.CL cs.AI cs.LG

    LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding

    Authors: Mostafa Elhoushi, Akshat Shrivastava, Diana Liskovich, Basil Hosmer, Bram Wasti, Liangzhen Lai, Anas Mahmoud, Bilge Acun, Saurabh Agarwal, Ahmed Roman, Ahmed A Aly, Beidi Chen, Carole-Jean Wu

    Abstract: We present LayerSkip, an end-to-end solution to speed-up inference of large language models (LLMs). First, during training we apply layer dropout, with low dropout rates for earlier layers and higher dropout rates for later layers, and an early exit loss where all transformer layers share the same exit. Second, during inference, we show that this training recipe increases the accuracy of early exi… ▽ More

    Submitted 29 April, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Code open sourcing is in progress

  23. arXiv:2404.14995  [pdf, other

    astro-ph.SR astro-ph.HE

    Solar flare observations with the Radio Neutrino Observatory Greenland (RNO-G)

    Authors: S. Agarwal, J. A. Aguilar, S. Ali, P. Allison, M. Betts, D. Besson, A. Bishop, O. Botner, S. Bouma, S. Buitink, M. Cataldo, B. A. Clark, A. Coleman, K. Couberly, S. de Kockere, K. D. de Vries, C. Deaconu, M. A. DuVernois, C. Glaser, T. Glüsenkamp, A. Hallgren, S. Hallmann, J. C. Hanson, B. Hendricks, J. Henrichs , et al. (47 additional authors not shown)

    Abstract: The science program of the Radio Neutrino Observatory-Greenland (RNO-G) extends beyond particle astrophysics to include radioglaciology and, as we show herein, solar physics, as well. Impulsive solar flare observations not only permit direct measurements of light curves, spectral content, and polarization on time scales significantly shorter than most extant dedicated solar observatories, but also… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  24. arXiv:2404.04392  [pdf, other

    cs.CR cs.AI

    Increased LLM Vulnerabilities from Fine-tuning and Quantization

    Authors: Divyanshu Kumar, Anurakt Kumar, Sahil Agarwal, Prashanth Harshangi

    Abstract: Large Language Models (LLMs) have become very popular and have found use cases in many domains, such as chatbots, auto-task completion agents, and much more. However, LLMs are vulnerable to different types of attacks, such as jailbreaking, prompt injection attacks, and privacy leakage attacks. Foundational LLMs undergo adversarial and alignment training to learn not to generate malicious and toxic… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

  25. arXiv:2404.02912  [pdf, ps, other

    cs.CC cs.AI

    Probabilistic Generating Circuits -- Demystified

    Authors: Sanyam Agarwal, Markus Bläser

    Abstract: Zhang et al. (ICML 2021, PLMR 139, pp. 12447-1245) introduced probabilistic generating circuits (PGCs) as a probabilistic model to unify probabilistic circuits (PCs) and determinantal point processes (DPPs). At a first glance, PGCs store a distribution in a very different way, they compute the probability generating polynomial instead of the probability mass function and it seems that this is the… ▽ More

    Submitted 4 March, 2024; originally announced April 2024.

  26. arXiv:2404.02139  [pdf, other

    astro-ph.CO

    Lensed Type Ia Supernova "Encore" at z=2: The First Instance of Two Multiply-Imaged Supernovae in the Same Host Galaxy

    Authors: J. D. R. Pierel, A. B. Newman, S. Dhawan, M. Gu, B. A. Joshi, T. Li, S. Schuldt, L. G. Strolger, S. H. Suyu, G. B. Caminha, S. H. Cohen, J. M. Diego, J. C. J. Dsilva, S. Ertl, B. L. Frye, G. Granata, C. Grillo, A. M. Koekemoer, J. Li, A. Robotham, J. Summers, T. Treu, R. A. Windhorst, A. Zitrin, S. Agarwal , et al. (38 additional authors not shown)

    Abstract: A bright ($m_{\rm F150W,AB}$=24 mag), $z=1.95$ supernova (SN) candidate was discovered in JWST/NIRCam imaging acquired on 2023 November 17. The SN is quintuply-imaged as a result of strong gravitational lensing by a foreground galaxy cluster, detected in three locations, and remarkably is the second lensed SN found in the same host galaxy. The previous lensed SN was called "Requiem", and therefore… ▽ More

    Submitted 3 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Submitted to ApJL

  27. arXiv:2404.00871  [pdf, other

    quant-ph

    Quantum Metrology of Absorption and Gain Parameters using Two-Mode Bright Squeezed Light

    Authors: Mrunal Kamble, Jiaxuan Wang, Girish S. Agarwal

    Abstract: Absorption and gain processes are fundamental to any light-matter interaction and a precise measurement of these parameters is important for various scientific and technological applications. Quantum probes, specifically the squeezed states have proved very successful, particularly in the applications that deal with phase shift and force measurements. In this paper, we focus on improving the sensi… ▽ More

    Submitted 31 March, 2024; originally announced April 2024.

    Comments: 9 pages, 4 figures

  28. arXiv:2403.15556  [pdf, other

    cond-mat.quant-gas physics.atom-ph quant-ph

    Directional superradiance in a driven ultracold atomic gas in free-space

    Authors: Sanaa Agarwal, Edwin Chaparro, Diego Barberena, A. Piñeiro Orioli, G. Ferioli, S. Pancaldi, I. Ferrier-Barbut, A. Browaeys, A. M. Rey

    Abstract: Ultra-cold atomic systems are among the most promising platforms that have the potential to shed light on the complex behavior of many-body quantum systems. One prominent example is the case of a dense ensemble illuminated by a strong coherent drive while interacting via dipole-dipole interactions. Despite being subjected to intense investigations, this system retains many open questions. A recent… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 25 pages, 19 figures

  29. arXiv:2403.14701  [pdf

    cs.CY cs.DB

    Rule based Complex Event Processing for an Air Quality Monitoring System in Smart City

    Authors: Shashi Shekhar Kumar, Ritesh Chandra, Sonali Agarwal

    Abstract: In recent years, smart city-based development has gained momentum due to its versatile nature in architecture and planning for the systematic habitation of human beings. According to World Health Organization (WHO) report, air pollution causes serious respiratory diseases. Hence, it becomes necessary to real-time monitoring of air quality to minimize effect by taking time-bound decisions by the st… ▽ More

    Submitted 16 March, 2024; originally announced March 2024.

  30. arXiv:2403.10431  [pdf, other

    physics.plasm-ph physics.app-ph

    Spatial characterization of debris ejection from the interaction of a tightly focused PW-laser pulse with metal targets

    Authors: I. -M. Vladisavlevici, C. Vlachos, J. -L. Dubois, A. Huerta, S. Agarwal, H. Ahmed, J. I. Apiñaniz, M. Cernaianu, M. Gugiu, M. Krupka, R. Lera, A. Morabito, D. Sangwan, D. Ursescu, A. Curcio, N. Fefeu, J. A. Pérez-Hernández, T. Vacek, P. Vicente, N. Woolsey, G. Gatti, M. D. Rodríguez-Frías, J. J. Santos, P. W. Bradford, M. Ehret

    Abstract: We present a novel scheme for rapid quantitative analysis of debris generated during experiments with solid targets following relativistic laser-plasma interaction at high-power laser facilities. Experimental data indicates that predictions by available modelling for non-mass-limited targets are reasonable, with debris on the order of hundreds ug-per-shot. We detect for the first time that several… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

  31. arXiv:2403.09914  [pdf, other

    cs.CV

    ProMark: Proactive Diffusion Watermarking for Causal Attribution

    Authors: Vishal Asnani, John Collomosse, Tu Bui, Xiaoming Liu, Shruti Agarwal

    Abstract: Generative AI (GenAI) is transforming creative workflows through the capability to synthesize and manipulate images via high-level prompts. Yet creatives are not well supported to receive recognition or reward for the use of their content in GenAI training. To this end, we propose ProMark, a causal attribution technique to attribute a synthetically generated image to its training data concepts lik… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted to CVPR 2024

  32. arXiv:2403.08058  [pdf, other

    cs.LG cs.CL

    CHAI: Clustered Head Attention for Efficient LLM Inference

    Authors: Saurabh Agarwal, Bilge Acun, Basil Hosmer, Mostafa Elhoushi, Ye** Lee, Shivaram Venkataraman, Dimitris Papailiopoulos, Carole-Jean Wu

    Abstract: Large Language Models (LLMs) with hundreds of billions of parameters have transformed the field of machine learning. However, serving these models at inference time is both compute and memory intensive, where a single request can require multiple GPUs and tens of Gigabytes of memory. Multi-Head Attention is one of the key components of LLMs, which can account for over 50% of LLMs memory and comput… ▽ More

    Submitted 27 April, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  33. arXiv:2403.08043  [pdf, other

    cs.CL

    Authorship Style Transfer with Policy Optimization

    Authors: Shuai Liu, Shantanu Agarwal, Jonathan May

    Abstract: Authorship style transfer aims to rewrite a given text into a specified target while preserving the original meaning in the source. Existing approaches rely on the availability of a large number of target style exemplars for model training. However, these overlook cases where a limited number of target style examples are available. The development of parameter-efficient transfer learning technique… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  34. arXiv:2403.06938  [pdf, other

    cs.AR

    TCAM-SSD: A Framework for Search-Based Computing in Solid-State Drives

    Authors: Ryan Wong, Nikita Kim, Kevin Higgs, Sapan Agarwal, Engin Ipek, Saugata Ghose, Ben Feinberg

    Abstract: As the amount of data produced in society continues to grow at an exponential rate, modern applications are incurring significant performance and energy penalties due to high data movement between the CPU and memory/storage. While processing in main memory can alleviate these penalties, it is becoming increasingly difficult to keep large datasets entirely in main memory. This has led to a recent p… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  35. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  36. arXiv:2403.05513  [pdf, other

    cs.RO

    A Detection and Filtering Framework for Collaborative Localization

    Authors: Thirumalaesh Ashokkumar, Katherine A Skinner, Siddarth Agarwal, Ankit Vora, Ashutosh Bhown

    Abstract: Increasingly, autonomous vehicles (AVs) are becoming a reality, such as the Advanced Driver Assistance Systems (ADAS) in vehicles that assist drivers in driving and parking functions with vehicles today. The localization problem for AVs relies primarily on multiple sensors, including cameras, LiDARs, and radars. Manufacturing, installing, calibrating, and maintaining these sensors can be very expe… ▽ More

    Submitted 8 March, 2024; originally announced March 2024.

  37. arXiv:2403.04160  [pdf, other

    cs.IR cs.AI

    Improving Retrieval in Theme-specific Applications using a Corpus Topical Taxonomy

    Authors: SeongKu Kang, Shivam Agarwal, Bowen **, Dongha Lee, Hwanjo Yu, Jiawei Han

    Abstract: Document retrieval has greatly benefited from the advancements of large-scale pre-trained language models (PLMs). However, their effectiveness is often limited in theme-specific applications for specialized areas or industries, due to unique terminologies, incomplete contexts of user queries, and specialized search intents. To capture the theme-specific information and improve retrieval, we propos… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: TheWebConf'24

  38. arXiv:2403.03579  [pdf, other

    quant-ph

    Testing the unified bounds of quantum speed limit

    Authors: Yaozu Wu, Jiale Yuan, Chuanyu Zhang, Zitian Zhu, **feng Deng, Xu Zhang, Pengfei Zhang, Qiujiang Guo, Zhen Wang, Jiehui Huang, Chao Song, Hekang Li, Da-Wei Wang, H. Wang, Girish S. Agarwal

    Abstract: Quantum speed limits (QSLs) impose fundamental constraints on the evolution speed of quantum systems. Traditionally, the Mandelstam-Tamm (MT) and Margolus-Levitin (ML) bounds have been widely employed, relying on the standard deviation and mean of energy distribution to define the QSLs. However, these universal bounds only offer loose restrictions on the quantum evolution. Here we introduce the ge… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  39. arXiv:2403.02682  [pdf, other

    cs.LG eess.SP

    Time Weaver: A Conditional Time Series Generation Model

    Authors: Sai Shankar Narasimhan, Shubhankar Agarwal, Oguzhan Akcin, Sujay Sanghavi, Sandeep Chinchali

    Abstract: Imagine generating a city's electricity demand pattern based on weather, the presence of an electric vehicle, and location, which could be used for capacity planning during a winter freeze. Such real-world time series are often enriched with paired heterogeneous contextual metadata (weather, location, etc.). Current approaches to time series generation often ignore this paired metadata, and its he… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  40. arXiv:2402.18434  [pdf, other

    cs.LG cs.IR

    Graph Regularized Encoder Training for Extreme Classification

    Authors: Anshul Mittal, Shikhar Mohan, Deepak Saini, Suchith C. Prabhu, Jain jiao, Sumeet Agarwal, Soumen Chakrabarti, Purushottam Kar, Manik Varma

    Abstract: Deep extreme classification (XC) aims to train an encoder architecture and an accompanying classifier architecture to tag a data point with the most relevant subset of labels from a very large universe of labels. XC applications in ranking, recommendation and tagging routinely encounter tail labels for which the amount of training data is exceedingly small. Graph convolutional networks (GCN) prese… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  41. arXiv:2402.06608  [pdf, other

    cs.CL cs.AI

    TIC: Translate-Infer-Compile for accurate "text to plan" using LLMs and Logical Representations

    Authors: Sudhir Agarwal, Anu Sreepathy

    Abstract: We study the problem of generating plans for given natural language planning task requests. On one hand, LLMs excel at natural language processing but do not perform well on planning. On the other hand, classical planning tools excel at planning tasks but require input in a structured language such as the Planning Domain Definition Language (PDDL). We leverage the strengths of both the techniques… ▽ More

    Submitted 28 June, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    Comments: 20 pages (7 main + 2 references + 11 appendix), 4 figures, 2 tables

  42. arXiv:2402.05398  [pdf, other

    cs.CV

    On the Effect of Image Resolution on Semantic Segmentation

    Authors: Ritambhara Singh, Abhishek Jain, Pietro Perona, Shivani Agarwal, Junfeng Yang

    Abstract: High-resolution semantic segmentation requires substantial computational resources. Traditional approaches in the field typically downscale the input images before processing and then upscale the low-resolution outputs back to their original dimensions. While this strategy effectively identifies broad regions, it often misses finer details. In this study, we demonstrate that a streamlined model ca… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2209.08667 by other authors

  43. arXiv:2402.01829  [pdf, other

    q-bio.BM cs.CL cs.LG

    Predicting ATP binding sites in protein sequences using Deep Learning and Natural Language Processing

    Authors: Shreyas V, Swati Agarwal

    Abstract: Predicting ATP-Protein Binding sites in genes is of great significance in the field of Biology and Medicine. The majority of research in this field has been conducted through time- and resource-intensive 'wet experiments' in laboratories. Over the years, researchers have been investigating computational methods computational methods to accomplish the same goals, utilising the strength of advanced… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: Published at 3rd Annual AAAI Workshop on AI to Accelerate Science and Engineering (AI2ASE)

  44. arXiv:2402.01788  [pdf, other

    cs.CL cs.AI cs.IR

    LitLLM: A Toolkit for Scientific Literature Review

    Authors: Shubham Agarwal, Issam H. Laradji, Laurent Charlin, Christopher Pal

    Abstract: Conducting literature reviews for scientific papers is essential for understanding research, its limitations, and building on existing work. It is a tedious task which makes an automatic literature review generator appealing. Unfortunately, many existing works that generate such reviews using Large Language Models (LLMs) have significant limitations. They tend to hallucinate-generate non-actual in… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

  45. arXiv:2402.01528  [pdf, other

    cs.LG cs.CL

    Decoding Speculative Decoding

    Authors: Minghao Yan, Saurabh Agarwal, Shivaram Venkataraman

    Abstract: Speculative Decoding is a widely used technique to speed up inference for Large Language Models (LLMs) without sacrificing quality. When performing inference, speculative decoding uses a smaller draft model to generate speculative tokens and then uses the target LLM to verify those draft tokens. The speedup provided by speculative decoding heavily depends on the choice of the draft model. In this… ▽ More

    Submitted 26 April, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  46. arXiv:2402.01055  [pdf, other

    cs.LG stat.ML

    Multiclass Learning from Noisy Labels for Non-decomposable Performance Measures

    Authors: Mingyuan Zhang, Shivani Agarwal

    Abstract: There has been much interest in recent years in learning good classifiers from data with noisy labels. Most work on learning from noisy labels has focused on standard loss-based performance measures. However, many machine learning problems require using non-decomposable performance measures which cannot be expressed as the expectation or sum of a loss on individual examples; these include for exam… ▽ More

    Submitted 23 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  47. arXiv:2401.11417  [pdf, other

    astro-ph.SR physics.plasm-ph physics.space-ph

    Study of Reconnection Dynamics and Plasma Relaxation in MHD simulation of a Solar Flare

    Authors: Satyam Agarwal, Ramit Bhattacharyya, Shangbin Yang

    Abstract: Self-organization in continuous systems is associated with dissipative processes. In particular, for magnetized plasmas, it is known as magnetic relaxation, where the magnetic energy is converted into heat and kinetic energy of flow through the process of magnetic reconnection. An example of such a system is the solar corona, where reconnection manifests as solar transients like flares and jets. C… ▽ More

    Submitted 21 January, 2024; originally announced January 2024.

  48. arXiv:2401.07919  [pdf, ps, other

    math.AT

    Steenrod operations on polyhedral products

    Authors: Sanjana Agarwal, Jelena Grbić, Michele Intermont, Milica Jovanović, Evgeniya Lagoda, Sarah Whitehouse

    Abstract: We describe the action of the mod $2$ Steenrod algebra on the cohomology of various polyhedral products and related spaces. We carry this out for Davis-Januszkiewicz spaces and their generalizations, for moment-angle complexes as well as for certain polyhedral joins. By studying the combinatorics of underlying simplicial complexes, we deduce some consequences for the lowest cohomological dimension… ▽ More

    Submitted 20 June, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: 21 pages, v2 minor changes, accepted version to appear in special issue of Topology and its Applications dedicated to proceedings of the Women in Topology 4 workshop

    MSC Class: 55U10; 05E45; 13F55; 55S10

  49. arXiv:2401.04855  [pdf, other

    cs.RO cs.LG

    LPAC: Learnable Perception-Action-Communication Loops with Applications to Coverage Control

    Authors: Saurav Agarwal, Ramya Muthukrishnan, Walker Gosrich, Vijay Kumar, Alejandro Ribeiro

    Abstract: Coverage control is the problem of navigating a robot swarm to collaboratively monitor features or a phenomenon of interest not known a priori. The problem is challenging in decentralized settings with robots that have limited communication and sensing capabilities. We propose a learnable Perception-Action-Communication (LPAC) architecture for the problem, wherein a convolution neural network (CNN… ▽ More

    Submitted 8 February, 2024; v1 submitted 9 January, 2024; originally announced January 2024.

  50. arXiv:2401.01596  [pdf, other

    cs.AI cs.CL

    MedSumm: A Multimodal Approach to Summarizing Code-Mixed Hindi-English Clinical Queries

    Authors: Akash Ghosh, Arkadeep Acharya, Prince Jha, Aniket Gaudgaul, Rajdeep Majumdar, Sriparna Saha, Aman Chadha, Raghav Jain, Setu Sinha, Shivani Agarwal

    Abstract: In the healthcare domain, summarizing medical questions posed by patients is critical for improving doctor-patient interactions and medical decision-making. Although medical data has grown in complexity and quantity, the current body of research in this domain has primarily concentrated on text-based methods, overlooking the integration of visual cues. Also prior works in the area of medical quest… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: ECIR 2024