Skip to main content

Showing 1–26 of 26 results for author: Aggarwal, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.08555  [pdf, other

    cs.LG cs.AI cs.CL

    RLHF Deciphered: A Critical Analysis of Reinforcement Learning from Human Feedback for LLMs

    Authors: Shreyas Chaudhari, Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande, Bruno Castro da Silva

    Abstract: State-of-the-art large language models (LLMs) have become indispensable tools for various tasks. However, training LLMs to serve as effective assistants for humans requires careful consideration. A promising approach is reinforcement learning from human feedback (RLHF), which leverages human feedback to update the model in accordance with human preferences and mitigate issues like toxicity and hal… ▽ More

    Submitted 15 April, 2024; v1 submitted 12 April, 2024; originally announced April 2024.

  2. arXiv:2402.04967  [pdf, other

    cs.CL cs.AI cs.CV

    Text or Image? What is More Important in Cross-Domain Generalization Capabilities of Hate Meme Detection Models?

    Authors: Piush Aggarwal, Jawar Mehrabanian, Weigang Huang, Özge Alacam, Torsten Zesch

    Abstract: This paper delves into the formidable challenge of cross-domain generalization in multimodal hate meme detection, presenting compelling findings. We provide enough pieces of evidence supporting the hypothesis that only the textual component of hateful memes enables the existing multimodal classifier to generalize across different domains, while the image component proves highly sensitive to a spec… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: Accepted at EACL'2024 Findings

  3. arXiv:2311.09735  [pdf, other

    cs.LG cs.IR

    GEO: Generative Engine Optimization

    Authors: Pranjal Aggarwal, Vishvak Murahari, Tanmay Rajpurohit, Ashwin Kalyan, Karthik Narasimhan, Ameet Deshpande

    Abstract: The advent of large language models (LLMs) has ushered in a new paradigm of search engines that use generative models to gather and summarize information to answer user queries. This emerging technology, which we formalize under the unified framework of generative engines (GEs), can generate accurate and personalized responses, rapidly replacing traditional search engines like Google and Bing. Gen… ▽ More

    Submitted 28 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Accepted to KDD 2024

  4. arXiv:2311.02630  [pdf

    cs.CR cs.AI cs.CY cs.LG

    The New Frontier of Cybersecurity: Emerging Threats and Innovations

    Authors: Daksh Dave, Gauransh Sawhney, Pushkar Aggarwal, Nitish Silswal, Dhruv Khut

    Abstract: In today's digitally interconnected world, cybersecurity threats have reached unprecedented levels, presenting a pressing concern for individuals, organizations, and governments. This study employs a qualitative research approach to comprehensively examine the diverse threats of cybersecurity and their impacts across various sectors. Four primary categories of threats are identified and analyzed,… ▽ More

    Submitted 5 November, 2023; originally announced November 2023.

    Comments: 6 pages, 2 Tables

    Journal ref: 2023 29th International Conference on Telecommunications (ICT), pp. 1-6, 2023

  5. arXiv:2310.12963  [pdf, other

    cs.CL cs.AI

    AutoMix: Automatically Mixing Language Models

    Authors: Pranjal Aggarwal, Aman Madaan, Ankit Anand, Srividya Pranavi Potharaju, Swaroop Mishra, Pei Zhou, Aditya Gupta, Dheeraj Rajagopal, Karthik Kappaganthu, Yiming Yang, Shyam Upadhyay, Manaal Faruqui, Mausam

    Abstract: Large language models (LLMs) are now available from cloud API providers in various sizes and configurations. While this diversity offers a broad spectrum of choices, effectively leveraging the options to optimize computational cost and performance remains challenging. In this work, we present Automix, an approach that strategically routes queries to larger LMs, based on the approximate correctness… ▽ More

    Submitted 28 June, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

    Comments: The first two authors contributed equally. Work started and partly done during Aman's internship at Google. This version adds results on additional models and datasets

  6. arXiv:2310.10662  [pdf, other

    cs.CR

    Assessing the Influence of Different Types of Probing on Adversarial Decision-Making in a Deception Game

    Authors: Md Abu Sayed, Mohammad Ariful Islam Khan, Bryant A Allsup, Joshua Zamora, Palvi Aggarwal

    Abstract: Deception, which includes leading cyber-attackers astray with false information, has shown to be an effective method of thwarting cyber-attacks. There has been little investigation of the effect of probing action costs on adversarial decision-making, despite earlier studies on deception in cybersecurity focusing primarily on variables like network size and the percentage of honeypots utilized in g… ▽ More

    Submitted 8 January, 2024; v1 submitted 15 September, 2023; originally announced October 2023.

  7. arXiv:2308.11526  [pdf, other

    cs.CL cs.AI cs.SE

    Learning Representations on Logs for AIOps

    Authors: Pranjal Gupta, Harshit Kumar, Debanjana Kar, Karan Bhukar, Pooja Aggarwal, Prateeti Mohapatra

    Abstract: AI for IT Operations (AIOps) is a powerful platform that Site Reliability Engineers (SREs) use to automate and streamline operational workflows with minimal human intervention. Automated log analysis is a critical task in AIOps as it provides key insights for SREs to identify and address ongoing faults. Tasks such as log format detection, log classification, and log parsing are key components of a… ▽ More

    Submitted 18 August, 2023; originally announced August 2023.

    Comments: 11 pages, 2023 IEEE 16th International Conference on Cloud Computing (CLOUD)

  8. arXiv:2305.11860  [pdf, other

    cs.CL

    Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs

    Authors: Pranjal Aggarwal, Aman Madaan, Yiming Yang, Mausam

    Abstract: A popular approach for improving the correctness of output from large language models (LLMs) is Self-Consistency - poll the LLM multiple times and output the most frequent solution. Existing Self-Consistency techniques always generate a constant number of samples per question, where a better approach will be to non-uniformly distribute the available budget based on the amount of agreement in the s… ▽ More

    Submitted 16 November, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Published at EMNLP 2023

  9. arXiv:2304.03732  [pdf, other

    cs.NI cs.MM

    Enabling immersive experiences in challenging network conditions

    Authors: Pooja Aggarwal, Michael Luby, Lorenz Minder

    Abstract: Immersive experiences, such as remote collaboration and augmented and virtual reality, require delivery of large volumes of data with consistent ultra-low latency across wireless networks in fluctuating network conditions. We describe the high-level design behind a data delivery solution that meets these requirements and provide synthetic simulations and test results running in network conditions… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 6 pages, 8 figures

  10. arXiv:2302.11710  [pdf, other

    cs.CV

    Controlled and Conditional Text to Image Generation with Diffusion Prior

    Authors: Pranav Aggarwal, Hareesh Ravi, Naveen Marri, Sachin Kelkar, Fengbin Chen, Vinh Khuc, Midhun Harikumar, Ritiz Tambi, Sudharshan Reddy Kakumanu, Purvak Lapsiya, Alvin Ghouas, Sarah Saber, Malavika Ramprasad, Baldo Faieta, A**kya Kale

    Abstract: Denoising Diffusion models have shown remarkable performance in generating diverse, high quality images from text. Numerous techniques have been proposed on top of or in alignment with models like Stable Diffusion and Imagen that generate images directly from text. A lesser explored approach is DALLE-2's two step process comprising a Diffusion Prior that generates a CLIP image embedding from text… ▽ More

    Submitted 1 August, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

  11. HateProof: Are Hateful Meme Detection Systems really Robust?

    Authors: Piush Aggarwal, Pranit Chawla, Mithun Das, Punyajoy Saha, Binny Mathew, Torsten Zesch, Animesh Mukherjee

    Abstract: Exploiting social media to spread hate has tremendously increased over the years. Lately, multi-modal hateful content such as memes has drawn relatively more traction than uni-modal content. Moreover, the availability of implicit content payloads makes them fairly challenging to be detected by existing hateful meme detection systems. In this paper, we present a use case study to analyze such syste… ▽ More

    Submitted 11 February, 2023; originally announced February 2023.

    Comments: Accepted at TheWebConf'2023 (WWW'2023)

  12. arXiv:2301.11309  [pdf, other

    cs.CL

    SemSup-XC: Semantic Supervision for Zero and Few-shot Extreme Classification

    Authors: Pranjal Aggarwal, Ameet Deshpande, Karthik Narasimhan

    Abstract: Extreme classification (XC) involves predicting over large numbers of classes (thousands to millions), with real-world applications like news article classification and e-commerce product tagging. The zero-shot version of this task requires generalization to novel classes without additional supervision. In this paper, we develop SemSup-XC, a model that achieves state-of-the-art zero-shot and few-s… ▽ More

    Submitted 22 June, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: Published at ICML 2023. V2: camera ready version at ICML 2023

  13. arXiv:2212.07424  [pdf

    cs.CL cs.AI cs.LG

    Hope Speech Detection on Social Media Platforms

    Authors: Pranjal Aggarwal, Pasupuleti Chandana, Jagrut Nemade, Shubham Sharma, Sunil Saumya, Shankar Biradar

    Abstract: Since personal computers became widely available in the consumer market, the amount of harmful content on the internet has significantly expanded. In simple terms, harmful content is anything online which causes a person distress or harm. It may include hate speech, violent content, threats, non-hope speech, etc. The online content must be positive, uplifting and supportive. Over the past few year… ▽ More

    Submitted 14 November, 2022; originally announced December 2022.

    Comments: 14 pages, 05 figures. accepted for publication in the book chapter "Cyber Crime in Social Media: Theory and Solutions"

  14. arXiv:2203.05321  [pdf, other

    cs.CV cs.CL

    StyleBabel: Artistic Style Tagging and Captioning

    Authors: Dan Ruta, Andrew Gilbert, Pranav Aggarwal, Naveen Marri, A**kya Kale, Jo Briggs, Chris Speed, Hailin **, Baldo Faieta, Alex Filipkowski, Zhe Lin, John Collomosse

    Abstract: We present StyleBabel, a unique open access dataset of natural language captions and free-form tags describing the artistic style of over 135K digital artworks, collected via a novel participatory method from experts studying at specialist art and design schools. StyleBabel was collected via an iterative method, inspired by `Grounded Theory': a qualitative approach that enables annotation while co… ▽ More

    Submitted 11 March, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

  15. Rightsizing Clusters for Time-Limited Tasks

    Authors: Venkatesan T. Chakaravarthy, Padmanabha V. Seshadri, Pooja Aggarwal, Anamitra R. Choudhury, Ashok Pon Kumar, Yogish Sabharwal, Amith Singhee

    Abstract: In conventional public clouds, designing a suitable initial cluster for a given application workload is important in reducing the computational foot-print during run-time. In edge or on-premise clouds, cold-start rightsizing the cluster at the time of installation is crucial in avoiding the recurrent capital expenditure. In both these cases, rightsizing has to balance cost-performance trade-off fo… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: An abridged version appears in IEEE Cloud 2021

  16. arXiv:2109.07622  [pdf, other

    cs.LG

    Towards Zero-shot Cross-lingual Image Retrieval and Tagging

    Authors: Pranav Aggarwal, Ritiz Tambi, A**kya Kale

    Abstract: There has been a recent spike in interest in multi-modal Language and Vision problems. On the language side, most of these models primarily focus on English since most multi-modal datasets are monolingual. We try to bridge this gap with a zero-shot approach for learning multi-modal representations using cross-lingual pre-training on the text side. We present a simple yet practical approach for bui… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: Presented at Workshop on Multilingual Search, in conjunction with 30th The Web Conference 2021. arXiv admin note: substantial text overlap with arXiv:2012.05107

  17. arXiv:2108.11037  [pdf, other

    cs.CR

    Decoys in Cybersecurity: An Exploratory Study to Test the Effectiveness of 2-sided Deception

    Authors: Palvi Aggarwal, Yinuo Du, Kuldeep Singh, Cleotilde Gonzalez

    Abstract: One of the widely used cyber deception techniques is decoying, where defenders create fictitious machines (i.e., honeypots) to lure attackers. Honeypots are deployed to entice attackers, but their effectiveness depends on their configuration as that would influence whether attackers will judge them as "real" machines or not. In this work, we study two-sided deception, where we manipulate the obser… ▽ More

    Submitted 25 August, 2021; originally announced August 2021.

    Comments: Presented at 1st International Workshop on Adaptive Cyber Defense, 2021 (arXiv:2108.08476)

    Report number: IJCAI-ACD/2021/110

  18. arXiv:2103.10730  [pdf, other

    cs.CL

    MuRIL: Multilingual Representations for Indian Languages

    Authors: Simran Khanuja, Diksha Bansal, Sarvesh Mehtani, Savya Khosla, Atreyee Dey, Balaji Gopalan, Dilip Kumar Margam, Pooja Aggarwal, Rajiv Teja Nagipogu, Shachi Dave, Shruti Gupta, Subhash Chandra Bose Gali, Vish Subramanian, Partha Talukdar

    Abstract: India is a multilingual society with 1369 rationalized languages and dialects being spoken across the country (INDIA, 2011). Of these, the 22 scheduled languages have a staggering total of 1.17 billion speakers and 121 languages have more than 10,000 speakers (INDIA, 2011). India also has the second largest (and an ever growing) digital footprint (Statista, 2020). Despite this, today's state-of-th… ▽ More

    Submitted 2 April, 2021; v1 submitted 19 March, 2021; originally announced March 2021.

  19. arXiv:2012.05107  [pdf, other

    cs.CL cs.CV cs.LG

    Towards Zero-shot Cross-lingual Image Retrieval

    Authors: Pranav Aggarwal, A**kya Kale

    Abstract: There has been a recent spike in interest in multi-modal Language and Vision problems. On the language side, most of these models primarily focus on English since most multi-modal datasets are monolingual. We try to bridge this gap with a zero-shot approach for learning multi-modal representations using cross-lingual pre-training on the text side. We present a simple yet practical approach for bui… ▽ More

    Submitted 24 November, 2020; originally announced December 2020.

  20. arXiv:2010.01666  [pdf, other

    cs.IR cs.LG

    Multi-Modal Retrieval using Graph Neural Networks

    Authors: Aashish Kumar Misraa, A**kya Kale, Pranav Aggarwal, Ali Aminian

    Abstract: Most real world applications of image retrieval such as Adobe Stock, which is a marketplace for stock photography and illustrations, need a way for users to find images which are both visually (i.e. aesthetically) and conceptually (i.e. containing the same salient objects) as a query image. Learning visual-semantic representations from images is a well studied problem for image retrieval. Filterin… ▽ More

    Submitted 4 October, 2020; originally announced October 2020.

  21. An Environment for Sustainable Research Software in Germany and Beyond: Current State, Open Challenges, and Call for Action

    Authors: Hartwig Anzt, Felix Bach, Stephan Druskat, Frank Löffler, Axel Loewe, Bernhard Y. Renard, Gunnar Seemann, Alexander Struck, Elke Achhammer, Piush Aggarwal, Franziska Appel, Michael Bader, Lutz Brusch, Christian Busse, Gerasimos Chourdakis, Piotr W. Dabrowski, Peter Ebert, Bernd Flemisch, Sven Friedl, Bernadette Fritzsch, Maximilian D. Funk, Volker Gast, Florian Goth, Jean-Noël Grad, Sibylle Hermann , et al. (18 additional authors not shown)

    Abstract: Research software has become a central asset in academic research. It optimizes existing and enables new research methods, implements and embeds research knowledge, and constitutes an essential research product in itself. Research software must be sustainable in order to understand, replicate, reproduce, and build upon existing research or conduct new research effectively. In other words, software… ▽ More

    Submitted 5 May, 2020; v1 submitted 27 April, 2020; originally announced May 2020.

    Comments: Official position paper 001 of de-RSE e.V. - Society for Research Software (https://de-rse.org) --- 16 pages, 1 figure + 1 page supplementary material, 4 figures --- Submitted to the F1000 Research Science Policy Research Gateway on 2020-04-03

    Journal ref: F1000Research 2020

  22. arXiv:1912.00576  [pdf

    cs.CV

    Skeleton based Activity Recognition by Fusing Part-wise Spatio-temporal and Attention Driven Residues

    Authors: Chhavi Dhiman, Dinesh Kumar Vishwakarma, Paras Aggarwal

    Abstract: There exist a wide range of intra class variations of the same actions and inter class similarity among the actions, at the same time, which makes the action recognition in videos very challenging. In this paper, we present a novel skeleton-based part-wise Spatiotemporal CNN RIAC Network-based 3D human action recognition framework to visualise the action dynamics in part wise manner and utilise ea… ▽ More

    Submitted 1 December, 2019; originally announced December 2019.

    Comments: 20 pages, 9 figures

  23. arXiv:1905.13339  [pdf, other

    cs.CV cs.IR

    Multitask Text-to-Visual Embedding with Titles and Clickthrough Data

    Authors: Pranav Aggarwal, Zhe Lin, Baldo Faieta, Saeid Motiian

    Abstract: Text-visual (or called semantic-visual) embedding is a central problem in vision-language research. It typically involves map** of an image and a text description to a common feature space through a CNN image encoder and a RNN language encoder. In this paper, we propose a new method for learning text-visual embedding using both image titles and click-through data from an image search engine. We… ▽ More

    Submitted 30 May, 2019; originally announced May 2019.

    Comments: 4 pages. Language and Vision Workshop, in conjunction with CVPR 2019

  24. arXiv:1712.00863  [pdf, other

    cs.CV

    A Deep Learning Approach to Drone Monitoring

    Authors: Yueru Chen, Pranav Aggarwal, Jongmoo Choi, C. -C. Jay Kuo

    Abstract: A drone monitoring system that integrates deep-learning-based detection and tracking modules is proposed in this work. The biggest challenge in adopting deep learning methods for drone detection is the limited amount of training drone images. To address this issue, we develop a model-based drone augmentation technique that automatically generates drone images with a bounding box label on drone's l… ▽ More

    Submitted 3 December, 2017; originally announced December 2017.

  25. arXiv:1705.07983  [pdf, other

    cs.DC cs.PF eess.SY

    Liquid Cloud Storage

    Authors: Michael G. Luby, Roberto Padovani, Thomas J. Richardson, Lorenz Minder, Pooja Aggarwal

    Abstract: A liquid system provides durable object storage based on spreading redundantly generated data across a network of hundreds to thousands of potentially unreliable storage nodes. A liquid system uses a combination of a large code, lazy repair, and a flow storage organization. We show that a liquid system can be operated to enable flexible and essentially optimal combinations of storage durability, s… ▽ More

    Submitted 22 May, 2017; originally announced May 2017.

    Comments: 44 pages, 21 figures, 1 table

  26. arXiv:1510.00116  [pdf, other

    cs.DC

    A Wait-Free Stack

    Authors: Seep Goel, Pooja Aggarwal, Smruti R. Sarangi

    Abstract: In this paper, we describe a novel algorithm to create a con- current wait-free stack. To the best of our knowledge, this is the first wait-free algorithm for a general purpose stack. In the past, researchers have proposed restricted wait-free implementations of stacks, lock-free implementations, and efficient universal constructions that can support wait-free stacks. The crux of our wait-free imp… ▽ More

    Submitted 1 October, 2015; originally announced October 2015.

    Comments: 21 pages, 5 figures