Skip to main content

Showing 1–17 of 17 results for author: Sukriti

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.10328  [pdf, other

    cs.CV cs.CL cs.LG

    From Pixels to Prose: A Large Dataset of Dense Image Captions

    Authors: Vasu Singla, Kaiyu Yue, Sukriti Paul, Reza Shirkavand, Mayuka Jayawardhana, Alireza Ganjdanesh, Heng Huang, Abhinav Bhatele, Gowthami Somepalli, Tom Goldstein

    Abstract: Training large vision-language models requires extensive, high-quality image-text pairs. Existing web-scraped datasets, however, are noisy and lack detailed image descriptions. To bridge this gap, we introduce PixelProse, a comprehensive dataset of over 16M (million) synthetically generated captions, leveraging cutting-edge vision-language models for detailed and accurate descriptions. To ensure d… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: pixelprose 16M dataset

  2. arXiv:2406.10231  [pdf

    cs.CV eess.IV

    Sign Language Recognition based on YOLOv5 Algorithm for the Telugu Sign Language

    Authors: Vipul Reddy. P, Vishnu Vardhan Reddy. B, Sukriti

    Abstract: Sign language recognition (SLR) technology has enormous promise to improve communication and accessibility for the difficulty of hearing. This paper presents a novel approach for identifying gestures in TSL using the YOLOv5 object identification framework. The main goal is to create an accurate and successful method for identifying TSL gestures so that the deaf community can use slr. After that, a… ▽ More

    Submitted 24 April, 2024; originally announced June 2024.

    Comments: 11 pages, 9 figures

  3. arXiv:2403.00826  [pdf, other

    cs.CL cs.CR cs.LG

    LLMGuard: Guarding Against Unsafe LLM Behavior

    Authors: Shubh Goyal, Medha Hira, Shubham Mishra, Sukriti Goyal, Arnav Goel, Niharika Dadu, Kirushikesh DB, Sameep Mehta, Nishtha Madaan

    Abstract: Although the rise of Large Language Models (LLMs) in enterprise settings brings new opportunities and capabilities, it also brings challenges, such as the risk of generating inappropriate, biased, or misleading content that violates regulations and can have legal concerns. To alleviate this, we present "LLMGuard", a tool that monitors user interactions with an LLM application and flags content aga… ▽ More

    Submitted 27 February, 2024; originally announced March 2024.

    Comments: accepted in demonstration track of AAAI-24

  4. arXiv:2312.12464  [pdf, other

    cs.LG cs.AI cs.CL

    Towards Better Serialization of Tabular Data for Few-shot Classification with Large Language Models

    Authors: Sukriti Jaitly, Tanay Shah, Ashish Shugani, Razik Singh Grewal

    Abstract: We present a study on the integration of Large Language Models (LLMs) in tabular data classification, emphasizing an efficient framework. Building upon existing work done in TabLLM (arXiv:2210.10723), we introduce three novel serialization techniques, including the standout LaTeX serialization method. This method significantly boosts the performance of LLMs in processing domain-specific datasets,… ▽ More

    Submitted 20 December, 2023; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 4 pages, 2 figures

  5. arXiv:2312.08063  [pdf, other

    cs.LG cs.AI cs.CL

    Estimation of Concept Explanations Should be Uncertainty Aware

    Authors: Vihari Piratla, Juyeon Heo, Katherine M. Collins, Sukriti Singh, Adrian Weller

    Abstract: Model explanations can be valuable for interpreting and debugging predictive models. We study a specific kind called Concept Explanations, where the goal is to interpret a model using human-understandable concepts. Although popular for their easy interpretation, concept explanations are known to be noisy. We begin our work by identifying various sources of uncertainty in the estimation pipeline th… ▽ More

    Submitted 5 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

  6. arXiv:2306.16503  [pdf, other

    cs.LG cs.AI

    SARC: Soft Actor Retrospective Critic

    Authors: Sukriti Verma, Ayush Chopra, Jayakumar Subramanian, Mausoom Sarkar, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy

    Abstract: The two-time scale nature of SAC, which is an actor-critic algorithm, is characterised by the fact that the critic estimate has not converged for the actor at any given time, but since the critic learns faster than the actor, it ensures eventual consistency between the two. Various strategies have been introduced in literature to learn better gradient estimates to help achieve better convergence.… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Accepted at RLDM 2022

  7. arXiv:2306.14809  [pdf, other

    cs.LG

    Tanimoto Random Features for Scalable Molecular Machine Learning

    Authors: Austin Tripp, Sergio Bacallado, Sukriti Singh, José Miguel Hernández-Lobato

    Abstract: The Tanimoto coefficient is commonly used to measure the similarity between molecules represented as discrete fingerprints, either as a distance metric or a positive definite kernel. While many kernel methods can be accelerated using random feature approximations, at present there is a lack of such approximations for the Tanimoto kernel. In this paper we propose two kinds of novel random features… ▽ More

    Submitted 13 November, 2023; v1 submitted 26 June, 2023; originally announced June 2023.

    Comments: Camera-ready version presented at NeurIPS 2023. Updates include: notation changes, better description of features in section 4, updated experiments, link to code

  8. arXiv:2305.15288  [pdf, other

    cs.RO cs.MA

    Concurrent Constrained Optimization of Unknown Rewards for Multi-Robot Task Allocation

    Authors: Sukriti Singh, Anusha Srikanthan, Vivek Mallampati, Harish Ravichandar

    Abstract: Task allocation can enable effective coordination of multi-robot teams to accomplish tasks that are intractable for individual robots. However, existing approaches to task allocation often assume that task requirements or reward functions are known and explicitly specified by the user. In this work, we consider the challenge of forming effective coalitions for a given heterogeneous multi-robot tea… ▽ More

    Submitted 24 May, 2023; originally announced May 2023.

    Comments: 9 pages, 5 figures, to be published in RSS 2023

  9. arXiv:2212.00342  [pdf, other

    cs.AI

    xEM: Explainable Entity Matching in Customer 360

    Authors: Sukriti Jaitly, Deepa Mariam George, Balaji Ganesan, Muhammad Ameen, Srinivas Pusapati

    Abstract: Entity matching in Customer 360 is the task of determining if multiple records represent the same real world entity. Entities are typically people, organizations, locations, and events represented as attributed nodes in a graph, though they can also be represented as records in relational data. While probabilistic matching engines and artificial neural network models exist for this task, explainin… ▽ More

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 4 pages, 5 figures. CODS-COMAD 2023 Demo

  10. arXiv:2105.06956  [pdf, other

    cs.LG

    Information-theoretic Evolution of Model Agnostic Global Explanations

    Authors: Sukriti Verma, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy

    Abstract: Explaining the behavior of black box machine learning models through human interpretable rules is an important research area. Recent work has focused on explaining model behavior locally i.e. for specific predictions as well as globally across the fields of vision, natural language, reinforcement learning and data science. We present a novel model-agnostic approach that derives rules to globally e… ▽ More

    Submitted 14 May, 2021; originally announced May 2021.

  11. arXiv:2009.01571  [pdf, other

    cs.LG stat.ML

    MixBoost: Synthetic Oversampling with Boosted Mixup for Handling Extreme Imbalance

    Authors: Anubha Kabra, Ayush Chopra, Nikaash Puri, Pinkesh Badjatiya, Sukriti Verma, Piyush Gupta, Balaji K

    Abstract: Training a classification model on a dataset where the instances of one class outnumber those of the other class is a challenging problem. Such imbalanced datasets are standard in real-world situations such as fraud detection, medical diagnosis, and computational advertising. We propose an iterative data augmentation method, MixBoost, which intelligently selects (Boost) and then combines (Mix) ins… ▽ More

    Submitted 3 September, 2020; originally announced September 2020.

    Comments: Work done as part of internship at MDSR

  12. arXiv:1912.12191  [pdf, other

    cs.CV cs.AI

    Explain Your Move: Understanding Agent Actions Using Specific and Relevant Feature Attribution

    Authors: Nikaash Puri, Sukriti Verma, Piyush Gupta, Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh

    Abstract: As deep reinforcement learning (RL) is applied to more tasks, there is a need to visualize and understand the behavior of learned agents. Saliency maps explain agent behavior by highlighting the features of the input state that are most relevant for the agent in taking an action. Existing perturbation-based approaches to compute saliency often highlight regions of the input that are not relevant t… ▽ More

    Submitted 3 April, 2020; v1 submitted 23 December, 2019; originally announced December 2019.

    Comments: Accepted at the International Conference on Learning Representations (ICLR) 2020

  13. arXiv:1909.07806  [pdf, other

    cs.ET

    OpticalGAN : Generative Adversarial Networks for Continuous Variable Quantum Computation

    Authors: Nilay Shrivastava, Nikaash Puri, Piyush Gupta, Balaji Krishnamurthy, Sukriti Verma

    Abstract: We present OpticalGAN, an extension of quantum generative adversarial networks for continuous-variable quantum computation. OpticalGAN consists of photonic variational circuits comprising of optical Gaussian and Kerr gates. Photonic quantum computation is a realization of continuous variable quantum computing which involves encoding and processing information in the continuous quadrature amplitude… ▽ More

    Submitted 15 September, 2019; originally announced September 2019.

  14. arXiv:1712.06139  [pdf, other

    cs.DC cs.LG

    TensorFlow-Serving: Flexible, High-Performance ML Serving

    Authors: Christopher Olston, Noah Fiedel, Kiril Gorovoy, Jeremiah Harmsen, Li Lao, Fangwei Li, Vinu Rajashekhar, Sukriti Ramesh, Jordan Soyke

    Abstract: We describe TensorFlow-Serving, a system to serve machine learning models inside Google which is also available in the cloud and via open-source. It is extremely flexible in terms of the types of ML platforms it supports, and ways to integrate with systems that convey new models and updated versions from training to serving. At the same time, the core code paths around model lookup and inference h… ▽ More

    Submitted 27 December, 2017; v1 submitted 17 December, 2017; originally announced December 2017.

    Comments: Presented at NIPS 2017 Workshop on ML Systems (http://learningsys.org/nips17/acceptedpapers.html)

  15. arXiv:1708.04439  [pdf, other

    cs.CL cs.IR cs.LG

    Extractive Summarization using Deep Learning

    Authors: Sukriti Verma, Vagisha Nidhi

    Abstract: This paper proposes a text summarization approach for factual reports using a deep learning model. This approach consists of three phases: feature extraction, feature enhancement, and summary generation, which work together to assimilate core information and generate a coherent, understandable summary. We are exploring various features to improve the set of sentences selected for the summary, and… ▽ More

    Submitted 9 January, 2019; v1 submitted 15 August, 2017; originally announced August 2017.

    Comments: Accepted to 18th International Conference on Computational Linguistics and Intelligent Text Processing

  16. arXiv:1706.07160  [pdf, other

    cs.AI

    MAGIX: Model Agnostic Globally Interpretable Explanations

    Authors: Nikaash Puri, Piyush Gupta, Pratiksha Agarwal, Sukriti Verma, Balaji Krishnamurthy

    Abstract: Explaining the behavior of a black box machine learning model at the instance level is useful for building trust. However, it is also important to understand how the model behaves globally. Such an understanding provides insight into both the data on which the model was trained and the patterns that it learned. We present here an approach that learns if-then rules to globally explain the behavior… ▽ More

    Submitted 15 June, 2018; v1 submitted 21 June, 2017; originally announced June 2017.

  17. arXiv:1609.02404  [pdf, other

    cs.CR

    ITect: Scalable Information Theoretic Similarity for Malware Detection

    Authors: Sukriti Bhattacharya, Hector D. Menendez, Earl Barr, David Clark

    Abstract: Malware creators have been getting their way for too long now. String-based similarity measures can leverage ground truth in a scalable way and can operate at a level of abstraction that is difficult to combat from the code level. We introduce ITect, a scalable approach to malware similarity detection based on information theory. ITect targets file entropy patterns in different ways to achieve 100… ▽ More

    Submitted 8 September, 2016; originally announced September 2016.

    Comments: 14 pages