Showing 1–19 of 19 results for author: Manjunatha, V

Searching in archive cs.
  1. arXiv:2405.01008  [pdf, other]

    cs.CV

    On Mechanistic Knowledge Localization in Text-to-Image Generative Models

    Authors: Samyadeep Basu, Keivan Rezaei, Priyatham Kattakinda, Ryan Rossi, Cherry Zhao, Vlad Morariu, Varun Manjunatha, Soheil Feizi

    Abstract: Identifying layers within text-to-image models which control visual attributes can facilitate efficient model editing through closed-form updates. Recent work leveraging causal tracing shows that early Stable-Diffusion variants confine knowledge primarily to the first layer of the CLIP text-encoder, while it diffuses throughout the UNet. Extending this framework, we observe that for recent models (…

    Submitted 7 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: Appearing in ICML 2024

  2. arXiv:2404.01261  [pdf, other]

    cs.CL cs.AI

    FABLES: Evaluating faithfulness and content selection in book-length summarization

    Authors: Yekyung Kim, Yapei Chang, Marzena Karpinska, Aparna Garimella, Varun Manjunatha, Kyle Lo, Tanya Goyal, Mohit Iyyer

    Abstract: While long-context large language models (LLMs) can technically summarize book-length documents (>100K tokens), the length and complexity of the documents have so far prohibited evaluations of input-dependent aspects like faithfulness. In this paper, we conduct the first large-scale human evaluation of faithfulness and content selection on LLM-generated summaries of fictional books. Our study miti…

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: preprint - 39 pages

  3. arXiv:2310.13730  [pdf, other]

    cs.CV

    Localizing and Editing Knowledge in Text-to-Image Generative Models

    Authors: Samyadeep Basu, Nanxuan Zhao, Vlad Morariu, Soheil Feizi, Varun Manjunatha

    Abstract: Text-to-Image Diffusion Models such as Stable-Diffusion and Imagen have achieved unprecedented quality of photorealism with state-of-the-art FID scores on MS-COCO and other generation benchmarks. Given a caption, image generation requires fine-grained knowledge about attributes such as object structure, style, and viewpoint amongst others. Where does this information reside in text-to-image genera…

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: 61 pages

  4. arXiv:2305.14625  [pdf, other]

    cs.CL

    KNN-LM Does Not Improve Open-ended Text Generation

    Authors: Shufan Wang, Yixiao Song, Andrew Drozdov, Aparna Garimella, Varun Manjunatha, Mohit Iyyer

    Abstract: In this paper, we study the generation quality of interpolation-based retrieval-augmented language models (LMs). These methods, best exemplified by the KNN-LM, interpolate the LM's predicted distribution of the next word with a distribution formed from the most relevant retrievals for a given prefix. While the KNN-LM and related methods yield impressive decreases in perplexity, we discover that th…

    Submitted 23 May, 2023; originally announced May 2023.

  5. arXiv:2210.14177  [pdf, other]

    cs.CL cs.AI cs.LG stat.ML

    Influence Functions for Sequence Tagging Models

    Authors: Sarthak Jain, Varun Manjunatha, Byron C. Wallace, Ani Nenkova

    Abstract: Many language tasks (e.g., Named Entity Recognition, Part-of-Speech tagging, and Semantic Role Labeling) are naturally framed as sequence tagging problems. However, there has been comparatively little work on interpretability methods for sequence tagging models. In this paper, we extend influence functions - which aim to trace predictions back to the training points that informed them - to sequenc…

    Submitted 25 October, 2022; originally announced October 2022.

    Comments: Accepted to Findings of EMNLP 2022

  6. arXiv:2207.07972  [pdf, other]

    cs.LG cs.CR

    Certified Neural Network Watermarks with Randomized Smoothing

    Authors: Arpit Bansal, Ping-yeh Chiang, Michael Curry, Rajiv Jain, Curtis Wigington, Varun Manjunatha, John P Dickerson, Tom Goldstein

    Abstract: Watermarking is a commonly used strategy to protect creators' rights to digital images, videos and audio. Recently, watermarking methods have been extended to deep learning models -- in principle, the watermark should be preserved when an adversary tries to copy the model. However, in practice, watermarks can often be removed by an intelligent adversary. Several papers have proposed watermarking m…

    Submitted 16 July, 2022; originally announced July 2022.

    Comments: ICML 2022

    Journal ref: ICML 2022

  7. arXiv:2106.03331  [pdf, other]

    cs.CV cs.CL

    SelfDoc: Self-Supervised Document Representation Learning

    Authors: Peizhao Li, Jiuxiang Gu, Jason Kuen, Vlad I. Morariu, Handong Zhao, Rajiv Jain, Varun Manjunatha, Hongfu Liu

    Abstract: We propose SelfDoc, a task-agnostic pre-training framework for document image understanding. Because documents are multimodal and are intended for sequential reading, our framework exploits the positional, textual, and visual information of every semantically meaningful component in a document, and it models the contextualization between each block of content. Unlike existing document pre-training…

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: To appear in CVPR'2021

  8. arXiv:2105.02584  [pdf, other]

    cs.CL

    TABBIE: Pretrained Representations of Tabular Data

    Authors: Hiroshi Iida, Dung Thai, Varun Manjunatha, Mohit Iyyer

    Abstract: Existing work on tabular representation learning jointly models tables and associated text using self-supervised objective functions derived from pretrained language models such as BERT. While this joint pretraining improves tasks involving paired tables and text (e.g., answering questions about tables), we show that it underperforms on tasks that operate over tables without any associated text (e…

    Submitted 6 May, 2021; originally announced May 2021.

  9. arXiv:2104.08689  [pdf, other]

    cs.CV

    RPCL: A Framework for Improving Cross-Domain Detection with Auxiliary Tasks

    Authors: Kai Li, Curtis Wigington, Chris Tensmeyer, Vlad I. Morariu, Handong Zhao, Varun Manjunatha, Nikolaos Barmpalios, Yun Fu

    Abstract: Cross-Domain Detection (XDD) aims to train an object detector using labeled images from a source domain that still performs well in the target domain, where only unlabeled images are available. Existing approaches achieve this either by aligning the feature maps or the region proposals from the two domains, or by transferring the style of source images to that of target images. Contrasted with prior work, this pap…

    Submitted 17 April, 2021; originally announced April 2021.

    Comments: 10 pages, 5 figures

  10. arXiv:2104.07000  [pdf, other]

    cs.CL

    IGA : An Intent-Guided Authoring Assistant

    Authors: Simeng Sun, Wenlong Zhao, Varun Manjunatha, Rajiv Jain, Vlad Morariu, Franck Dernoncourt, Balaji Vasan Srinivasan, Mohit Iyyer

    Abstract: While large-scale pretrained language models have significantly improved writing assistance functionalities such as autocomplete, more complex and controllable writing assistants have yet to be explored. We leverage advances in language modeling to build an interactive writing assistant that generates and rephrases text according to fine-grained author specifications. Users provide input to our In…

    Submitted 19 September, 2021; v1 submitted 14 April, 2021; originally announced April 2021.

    Comments: EMNLP2021

  11. arXiv:2103.06922  [pdf, other]

    cs.CL cs.LG

    Towards Interpreting and Mitigating Shortcut Learning Behavior of NLU Models

    Authors: Mengnan Du, Varun Manjunatha, Rajiv Jain, Ruchi Deshpande, Franck Dernoncourt, Jiuxiang Gu, Tong Sun, Xia Hu

    Abstract: Recent studies indicate that NLU models are prone to rely on shortcut features for prediction, without achieving true language understanding. As a result, these models fail to generalize to real-world out-of-distribution data. In this work, we show that the words in the NLU training set can be modeled as a long-tailed distribution. There are two findings: 1) NLU models have strong preference for f…

    Submitted 13 April, 2021; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: Accepted by NAACL 2021

  12. arXiv:2006.03204  [pdf, other]

    cs.CV cs.AI cs.LG

    Black-box Explanation of Object Detectors via Saliency Maps

    Authors: Vitali Petsiuk, Rajiv Jain, Varun Manjunatha, Vlad I. Morariu, Ashutosh Mehra, Vicente Ordonez, Kate Saenko

    Abstract: We propose D-RISE, a method for generating visual explanations for the predictions of object detectors. Utilizing the proposed similarity metric that accounts for both localization and categorization aspects of object detection allows our method to produce saliency maps that show image areas that most affect the prediction. D-RISE can be considered "black-box" in the software testing sense, as it…

    Submitted 10 June, 2021; v1 submitted 4 June, 2020; originally announced June 2020.

    Comments: CVPR 2021 (oral). Project page https://cs-people.bu.edu/vpetsiuk/drise/

  13. arXiv:2003.13197  [pdf, other]

    cs.CV

    Cross-Domain Document Object Detection: Benchmark Suite and Method

    Authors: Kai Li, Curtis Wigington, Chris Tensmeyer, Handong Zhao, Nikolaos Barmpalios, Vlad I. Morariu, Varun Manjunatha, Tong Sun, Yun Fu

    Abstract: Decomposing images of document pages into high-level semantic regions (e.g., figures, tables, paragraphs), document object detection (DOD) is fundamental for downstream tasks like intelligent document editing and understanding. DOD remains a challenging problem as document objects vary significantly in layout, size, aspect ratio, texture, etc. An additional challenge arises in practice because lar…

    Submitted 29 March, 2020; originally announced March 2020.

    Comments: To appear in CVPR 2020

  14. arXiv:1811.07789  [pdf, other]

    cs.CV

    Explicit Bias Discovery in Visual Question Answering Models

    Authors: Varun Manjunatha, Nirat Saini, Larry S. Davis

    Abstract: Researchers have observed that Visual Question Answering (VQA) models tend to answer questions by learning statistical biases in the data. For example, their answer to the question "What is the color of the grass?" is usually "Green", whereas a question like "What is the title of the book?" cannot be answered by inferring statistical biases. It is of interest to the community to explicitly discove…

    Submitted 19 November, 2018; originally announced November 2018.

  15. arXiv:1804.06026  [pdf, other]

    cs.CV cs.CL

    Learning to Color from Language

    Authors: Varun Manjunatha, Mohit Iyyer, Jordan Boyd-Graber, Larry Davis

    Abstract: Automatic colorization is the process of adding color to greyscale images. We condition this process on language, allowing end users to manipulate a colorized image by feeding in different captions. We present two different architectures for language-conditioned colorization, both of which produce more accurate and plausible colorizations than a language-agnostic version. Through this language-bas…

    Submitted 16 April, 2018; originally announced April 2018.

    Comments: 6 pages

    Journal ref: North American Chapter of the Association for Computational Linguistics (NAACL), 2018

  16. arXiv:1804.00060  [pdf, other]

    cs.CV

    Class Subset Selection for Transfer Learning using Submodularity

    Authors: Varun Manjunatha, Srikumar Ramalingam, Tim K. Marks, Larry Davis

    Abstract: In recent years, it is common practice to extract fully-connected layer (fc) features that were learned while performing image classification on a source dataset, such as ImageNet, and apply them generally to a wide range of other tasks. The general usefulness of some large training datasets for transfer learning is not yet well understood, and raises a number of questions. For example, in the con…

    Submitted 30 March, 2018; originally announced April 2018.

  17. arXiv:1611.05118  [pdf, other]

    cs.CV cs.CL

    The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives

    Authors: Mohit Iyyer, Varun Manjunatha, Anupam Guha, Yogarshi Vyas, Jordan Boyd-Graber, Hal Daumé III, Larry Davis

    Abstract: Visual narrative is often a combination of explicit information and judicious omissions, relying on the viewer to supply missing details. In comics, most movements in time and space are hidden in the "gutters" between panels. To follow the story, readers logically connect panels together by inferring unseen actions through a process called "closure". While computers can now describe what is explic…

    Submitted 7 May, 2017; v1 submitted 15 November, 2016; originally announced November 2016.

  18. arXiv:1507.05215  [pdf, other]

    cs.HC

    MetroViz: Visual Analysis of Public Transportation Data

    Authors: Fan Du, Joshua Brulé, Peter Enns, Varun Manjunatha, Yoav Segev

    Abstract: Understanding the quality and usage of public transportation resources is important for schedule optimization and resource allocation. Ridership and adherence are the two main dimensions for evaluating the quality of service. Using Automatic Vehicle Location (AVL), Automatic Passenger Count (APC), and Global Positioning System (GPS) data, ridership data and adherence data of public transportation…

    Submitted 18 July, 2015; originally announced July 2015.

  19. arXiv:1502.00030  [pdf, other]

    cs.CV

    SHOE: Supervised Hashing with Output Embeddings

    Authors: Sravanthi Bondugula, Varun Manjunatha, Larry S. Davis, David Doermann

    Abstract: We present a supervised binary encoding scheme for image retrieval that learns projections by taking into account similarity between classes obtained from output embeddings. Our motivation is that binary hash codes learned in this way improve both the visual quality of retrieval results and existing supervised hashing schemes. We employ a sequential greedy optimization that learns relationship awa…

    Submitted 30 January, 2015; originally announced February 2015.