Showing 1–37 of 37 results for author: Kamath, A

  1. arXiv:2406.00542  [pdf]

    cs.AR

    An Automated Validation Framework for Power Management and Data Retention Logic Kits of Standard Cell Library

    Authors: Akshay Karkal Kamath, Bharath Kumar, Sunil Aggarwal, Subramanian Parameswaran, Parag Lonkar, Debi Prasanna, Somasunder Sreenath

    Abstract: The development of a standard cell library involves characterization of a number of gate-level circuits at various cell-level abstractions. Verifying the behavior of these cells largely depends on the manual skills of the circuit designers. Especially challenging are the power management and data retention cells which must be checked thoroughly for voltage and power configurations in addition to t…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 33rd Design and Verification Conference and Exhibition United States (DVCon U.S. 2021)

  2. arXiv:2405.19315  [pdf, other]

    cs.CV cs.CL cs.LG

    Matryoshka Query Transformer for Large Vision-Language Models

    Authors: Wenbo Hu, Zi-Yi Dou, Liunian Harold Li, Amita Kamath, Nanyun Peng, Kai-Wei Chang

    Abstract: Large Vision-Language Models (LVLMs) typically encode an image into a fixed number of visual tokens (e.g., 576) and process these tokens with a language model. Despite their strong performance, LVLMs face challenges in adapting to varying computational constraints. This raises the question: can we achieve flexibility in the number of visual tokens to suit different tasks and computational resource…

    Submitted 6 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: Preprint. Our code and model are publicly available at https://github.com/gordonhu608/MQT-LLaVA

  3. arXiv:2405.00029  [pdf, ps, other]

    cs.CV cs.IR

    Automatic Creative Selection with Cross-Modal Matching

    Authors: Alex Kim, Jia Huang, Rob Monarch, Jerry Kwac, Anikesh Kamath, Parmeshwar Khurd, Kailash Thiyagarajan, Goodman Gu

    Abstract: Application developers advertise their Apps by creating product pages with App images, and bidding on search terms. It is then crucial for App images to be highly relevant to the search terms. Solutions to this problem require an image-text matching model to predict the quality of the match between the chosen image and the search terms. In this work, we present a novel approach to matching an Ap…

    Submitted 28 February, 2024; originally announced May 2024.

  4. arXiv:2404.18416  [pdf, other]

    cs.AI cs.CL cs.CV cs.LG

    Capabilities of Gemini Models in Medicine

    Authors: Khaled Saab, Tao Tu, Wei-Hung Weng, Ryutaro Tanno, David Stutz, Ellery Wulczyn, Fan Zhang, Tim Strother, Chunjong Park, Elahe Vedadi, Juanma Zambrano Chaves, Szu-Yeu Hu, Mike Schaekermann, Aishwarya Kamath, Yong Cheng, David G. T. Barrett, Cathy Cheung, Basil Mustafa, Anil Palepu, Daniel McDuff, Le Hou, Tomer Golany, Luyang Liu, Jean-baptiste Alayrac, Neil Houlsby, et al. (42 additional authors not shown)

    Abstract: Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge and understanding of complex multimodal data. Gemini models, with strong general capabilities in multimodal and long-context reasoning, offer exciting possibilities in medicine. Building on these core strengths of Gemini, we introduce Med-G…

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  5. arXiv:2404.18262  [pdf, other]

    cs.AI

    Generating Situated Reflection Triggers about Alternative Solution Paths: A Case Study of Generative AI for Computer-Supported Collaborative Learning

    Authors: Atharva Naik, Jessica Ruhan Yin, Anusha Kamath, Qianou Ma, Sherry Tongshuang Wu, Charles Murray, Christopher Bogart, Majd Sakr, Carolyn P. Rose

    Abstract: An advantage of Large Language Models (LLMs) is their contextualization capability - providing different responses based on student inputs like solution strategy or prior discussion, to potentially better engage students than standard feedback. We present a design and evaluation of a proof-of-concept LLM application to offer students dynamic and contextualized feedback. Specifically, we augment an…

    Submitted 28 April, 2024; originally announced April 2024.

  6. arXiv:2404.09356  [pdf, other]

    cs.CY cs.AI cs.CL cs.ET

    LLeMpower: Understanding Disparities in the Control and Access of Large Language Models

    Authors: Vishwas Sathish, Hannah Lin, Aditya K Kamath, Anish Nyayachavadi

    Abstract: Large Language Models (LLMs) are a powerful technology that augments human skill to create new opportunities, akin to the development of steam engines and the internet. However, LLMs come with a high cost. They require significant computing resources and energy to train and serve. Inequity in their control and access has led to concentration of ownership and power to a small collection of corporati…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: 11 total pages, 7 page text, 4 page references, 3 figures (with subfigures), 1 table

    ACM Class: K.4.0; K.7.4

  7. arXiv:2403.05530  [pdf, other]

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love, et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February…

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  8. arXiv:2401.08908  [pdf, other]

    cs.OS cs.LG

    Herding LLaMaS: Using LLMs as an OS Module

    Authors: Aditya K Kamath, Sujay Yadalam

    Abstract: Computer systems are becoming increasingly heterogeneous with the emergence of new memory technologies and compute devices. GPUs alongside CPUs have become commonplace and CXL is poised to be a mainstay of cloud systems. The operating system is responsible for managing these hardware resources, requiring modification every time a new device is released. Years of research and development are sunk i…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: ASPLOS 2023, Wild and Crazy Ideas session

  9. arXiv:2312.00647  [pdf, other]

    cs.OS

    MaxMem: Colocation and Performance for Big Data Applications on Tiered Main Memory Servers

    Authors: Amanda Raybuck, Wei Zhang, Kayvan Mansoorshahi, Aditya K. Kamath, Mattan Erez, Simon Peter

    Abstract: We present MaxMem, a tiered main memory management system that aims to maximize Big Data application colocation and performance. MaxMem uses an application-agnostic and lightweight memory occupancy control mechanism based on fast memory miss ratios to provide application QoS under increasing colocation. By relying on memory access sampling and binning to quickly identify per-process memory heat gr…

    Submitted 1 December, 2023; originally announced December 2023.

    Comments: 12 pages, 10 figures

  10. arXiv:2311.07948  [pdf, other]

    cs.PL cs.LG

    Finding Inductive Loop Invariants using Large Language Models

    Authors: Adharsh Kamath, Aditya Senthilnathan, Saikat Chakraborty, Pantazis Deligiannis, Shuvendu K. Lahiri, Akash Lal, Aseem Rastogi, Subhajit Roy, Rahul Sharma

    Abstract: Loop invariants are fundamental to reasoning about programs with loops. They establish properties about a given loop's behavior. When they additionally are inductive, they become useful for the task of formal verification that seeks to establish strong mathematical guarantees about a program's runtime behavior. The inductiveness ensures that the invariants can be checked locally without consulting t…

    Submitted 14 November, 2023; originally announced November 2023.

  11. arXiv:2310.19785  [pdf, other]

    cs.CL cs.CV cs.LG

    What's "up" with vision-language models? Investigating their struggle with spatial reasoning

    Authors: Amita Kamath, Jack Hessel, Kai-Wei Chang

    Abstract: Recent vision-language (VL) models are powerful, but can they reliably distinguish "right" from "left"? We curate three new corpora to quantify model comprehension of such basic spatial relations. These tests isolate spatial reasoning more precisely than existing datasets like VQAv2, e.g., our What'sUp benchmark contains sets of photographs varying only the spatial relations of objects, keeping th…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023

  12. arXiv:2305.14897  [pdf, other]

    cs.CL cs.CV cs.LG

    Text encoders bottleneck compositionality in contrastive vision-language models

    Authors: Amita Kamath, Jack Hessel, Kai-Wei Chang

    Abstract: Performant vision-language (VL) models like CLIP represent captions using a single vector. How much information about language is lost in this bottleneck? We first curate CompPrompts, a set of increasingly compositional image captions that VL models should be able to capture (e.g., single object, to object+property, to multiple interacting objects). Then, we train text-only recovery probes that ai…

    Submitted 30 October, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  13. arXiv:2304.10946  [pdf, other]

    cs.CL cs.LG q-bio.BM

    CancerGPT: Few-shot Drug Pair Synergy Prediction using Large Pre-trained Language Models

    Authors: Tianhao Li, Sandesh Shetty, Advaith Kamath, Ajay Jaiswal, Xianqian Jiang, Ying Ding, Yejin Kim

    Abstract: Large pre-trained language models (LLMs) have been shown to have significant potential in few-shot learning across various fields, even with minimal training data. However, their ability to generalize to unseen tasks in more complex fields, such as biology, has yet to be fully evaluated. LLMs can offer a promising alternative approach for biological inference, particularly in cases where structure…

    Submitted 17 April, 2023; originally announced April 2023.

  14. arXiv:2303.16133  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Exposing and Addressing Cross-Task Inconsistency in Unified Vision-Language Models

    Authors: Adyasha Maharana, Amita Kamath, Christopher Clark, Mohit Bansal, Aniruddha Kembhavi

    Abstract: As general purpose vision models get increasingly effective at a wide set of tasks, it is imperative that they be consistent across the tasks they support. Inconsistent AI models are considered brittle and untrustworthy by human users and are more challenging to incorporate into larger systems that take dependencies on their outputs. Measuring consistency between very heterogeneous tasks that migh…

    Submitted 21 February, 2024; v1 submitted 28 March, 2023; originally announced March 2023.

    Comments: TMLR 2024; Project Website: https://adymaharana.github.io/cococon/

  15. Data-Model-Circuit Tri-Design for Ultra-Light Video Intelligence on Edge Devices

    Authors: Yimeng Zhang, Akshay Karkal Kamath, Qiucheng Wu, Zhiwen Fan, Wuyang Chen, Zhangyang Wang, Shiyu Chang, Sijia Liu, Cong Hao

    Abstract: In this paper, we propose a data-model-hardware tri-design framework for high-throughput, low-cost, and high-accuracy multi-object tracking (MOT) on High-Definition (HD) video stream. First, to enable ultra-light video intelligence, we propose temporal frame-filtering and spatial saliency-focusing approaches to reduce the complexity of massive video data. Second, we exploit structure-aware weight…

    Submitted 17 October, 2022; v1 submitted 16 October, 2022; originally announced October 2022.

    Comments: Accepted to ASP-DAC'23

  16. arXiv:2210.07472  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Robust Candidate Generation for Entity Linking on Short Social Media Texts

    Authors: Liam Hebert, Raheleh Makki, Shubhanshu Mishra, Hamidreza Saghir, Anusha Kamath, Yuval Merhav

    Abstract: Entity Linking (EL) is the gateway into Knowledge Bases. Recent advances in EL utilize dense retrieval approaches for Candidate Generation, which addresses some of the shortcomings of the Lookup based approach of matching NER mentions against pre-computed dictionaries. In this work, we show that in the domain of Tweets, such methods suffer as users often include informal spelling, limited context,…

    Submitted 13 October, 2022; originally announced October 2022.

    Comments: 7 pages, 2 figures. Accepted to Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). URL: https://aclanthology.org/2022.wnut-1.8

    MSC Class: 68T50; 68T07 ACM Class: I.2.7

    Journal ref: Proceedings of the Eighth Workshop on Noisy User-generated Text (W-NUT 2022). pages 83-89

  17. arXiv:2210.03112  [pdf, other]

    cs.LG cs.CL cs.CV cs.RO

    A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning

    Authors: Aishwarya Kamath, Peter Anderson, Su Wang, Jing Yu Koh, Alexander Ku, Austin Waters, Yinfei Yang, Jason Baldridge, Zarana Parekh

    Abstract: Recent studies in Vision-and-Language Navigation (VLN) train RL agents to execute natural-language navigation instructions in photorealistic environments, as a step towards robots that can follow human instructions. However, given the scarcity of human instruction data and limited diversity in the training environments, these agents still struggle with complex language grounding and spatial langua…

    Submitted 17 April, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: CVPR 2023

  18. arXiv:2210.01750  [pdf, other]

    cs.CL cs.AI

    Modular Approach to Machine Reading Comprehension: Mixture of Task-Aware Experts

    Authors: Anirudha Rayasam, Anusha Kamath, Gabriel Bayomi Tinoco Kalejaiye

    Abstract: In this work we present a Mixture of Task-Aware Experts Network for Machine Reading Comprehension on a relatively small dataset. We particularly focus on the issue of common-sense learning, enforcing the common ground knowledge by specifically training different expert networks to capture different kinds of relationships between each passage, question and choice triplet. Moreover, we take inspira…

    Submitted 4 October, 2022; originally announced October 2022.

  19. arXiv:2206.07643  [pdf, other]

    cs.CV cs.CL cs.LG

    Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

    Authors: Zi-Yi Dou, Aishwarya Kamath, Zhe Gan, Pengchuan Zhang, Jianfeng Wang, Linjie Li, Zicheng Liu, Ce Liu, Yann LeCun, Nanyun Peng, Jianfeng Gao, Lijuan Wang

    Abstract: Vision-language (VL) pre-training has recently received considerable attention. However, most existing end-to-end pre-training approaches either only aim to tackle VL tasks such as image-text retrieval, visual question answering (VQA) and image captioning that test high-level understanding of images, or only target region-level understanding for tasks such as phrase grounding and object detection.…

    Submitted 18 November, 2022; v1 submitted 15 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022. Project Website: https://ashkamath.github.io/FIBER_page

  20. arXiv:2203.14518  [pdf, other]

    cs.PL

    Certified Mergeable Replicated Data Types

    Authors: Vimala Soundarapandian, Adharsh Kamath, Kartik Nagar, KC Sivaramakrishnan

    Abstract: Replicated data types (RDTs) are data structures that permit concurrent modification of multiple, potentially geo-distributed, replicas without coordination between them. RDTs are designed in such a way that conflicting operations are eventually deterministically reconciled ensuring convergence. Constructing correct RDTs remains a difficult endeavour due to the complexity of reasoning about indepe…

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Conditionally accepted to PLDI 2022

  21. arXiv:2202.02317  [pdf, other]

    cs.CV cs.CL

    Webly Supervised Concept Expansion for General Purpose Vision Models

    Authors: Amita Kamath, Christopher Clark, Tanmay Gupta, Eric Kolve, Derek Hoiem, Aniruddha Kembhavi

    Abstract: General Purpose Vision (GPV) systems are models that are designed to solve a wide array of visual tasks without requiring architectural changes. Today, GPVs primarily learn both skills and concepts from large fully supervised datasets. Scaling GPVs to tens of thousands of concepts by acquiring data to learn each concept for every skill quickly becomes prohibitive. This work presents an effective a…

    Submitted 20 July, 2022; v1 submitted 4 February, 2022; originally announced February 2022.

    Comments: ECCV 2022

  22. arXiv:2109.06082  [pdf, other]

    cs.CL

    xGQA: Cross-Lingual Visual Question Answering

    Authors: Jonas Pfeiffer, Gregor Geigle, Aishwarya Kamath, Jan-Martin O. Steitz, Stefan Roth, Ivan Vulić, Iryna Gurevych

    Abstract: Recent advances in multimodal vision and language modeling have predominantly focused on the English language, mostly due to the lack of multilingual multimodal datasets to steer modeling efforts. In this work, we address this gap and provide xGQA, a new multilingual evaluation benchmark for the visual question answering task. We extend the established English GQA dataset to 7 typologically divers…

    Submitted 17 March, 2022; v1 submitted 13 September, 2021; originally announced September 2021.

    Comments: Findings of ACL 2022

  23. arXiv:2105.11338  [pdf, other]

    cs.DS

    A Simple Proof of a New Set Disjointness with Applications to Data Streams

    Authors: Akshay Kamath, Eric Price, David P. Woodruff

    Abstract: The multiplayer promise set disjointness is one of the most widely used problems from communication complexity in applications. In this problem there are $k$ players with subsets $S^1, \ldots, S^k$, each drawn from $\{1, 2, \ldots, n\}$, and we are promised that either the sets are (1) pairwise disjoint, or (2) there is a unique element $j$ occurring in all the sets, which are otherwise pairwise d…

    Submitted 24 May, 2021; originally announced May 2021.

    Comments: CCC 2021

  24. arXiv:2104.12763  [pdf, other]

    cs.CV cs.CL cs.LG

    MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding

    Authors: Aishwarya Kamath, Mannat Singh, Yann LeCun, Gabriel Synnaeve, Ishan Misra, Nicolas Carion

    Abstract: Multi-modal reasoning systems rely on a pre-trained object detector to extract regions of interest from the image. However, this crucial module is typically used as a black box, trained independently of the downstream task and on a fixed vocabulary of objects and attributes. This makes it challenging for such systems to capture the long tail of visual concepts expressed in free form text. In this…

    Submitted 11 October, 2021; v1 submitted 26 April, 2021; originally announced April 2021.

  25. arXiv:2104.00743  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Towards General Purpose Vision Systems

    Authors: Tanmay Gupta, Amita Kamath, Aniruddha Kembhavi, Derek Hoiem

    Abstract: Computer vision systems today are primarily N-purpose systems, designed and trained for a predefined set of tasks. Adapting such systems to new tasks is challenging and often requires non-trivial modifications to the network architecture (e.g. adding new output heads) or training process (e.g. adding new losses). To reduce the time and expertise required to develop new applications, we would like…

    Submitted 19 April, 2022; v1 submitted 1 April, 2021; originally announced April 2021.

    Comments: CVPR 2022 Oral; Project page: https://prior.allenai.org/projects/gpv

  26. arXiv:2010.14019  [pdf, other]

    cs.LG stat.ML

    Know Where To Drop Your Weights: Towards Faster Uncertainty Estimation

    Authors: Akshatha Kamath, Dwaraknath Gnaneshwar, Matias Valdenegro-Toro

    Abstract: Estimating epistemic uncertainty of models used in low-latency applications and Out-Of-Distribution samples detection is a challenge due to the computationally demanding nature of uncertainty estimation techniques. Estimating model uncertainty using approximation techniques like Monte Carlo Dropout (MCD), DropConnect (MCDC) requires a large number of forward passes through the network, rendering t…

    Submitted 26 October, 2020; originally announced October 2020.

    Comments: 8 pages, 6 figures, 1 table, with appendix, submitted to a NeurIPS workshop

  27. arXiv:2007.07779  [pdf, other]

    cs.CL

    AdapterHub: A Framework for Adapting Transformers

    Authors: Jonas Pfeiffer, Andreas Rücklé, Clifton Poth, Aishwarya Kamath, Ivan Vulić, Sebastian Ruder, Kyunghyun Cho, Iryna Gurevych

    Abstract: The current modus operandi in NLP involves downloading and fine-tuning pre-trained models consisting of millions or billions of parameters. Storing and sharing such large trained models is expensive, slow, and time-consuming, which impedes progress towards more general and versatile NLP methods that learn from and for many tasks. Adapters -- small learnt bottleneck layers inserted within each laye…

    Submitted 6 October, 2020; v1 submitted 15 July, 2020; originally announced July 2020.

    Comments: EMNLP 2020: Systems Demonstrations

  28. arXiv:2006.09462  [pdf, other]

    cs.CL cs.LG

    Selective Question Answering under Domain Shift

    Authors: Amita Kamath, Robin Jia, Percy Liang

    Abstract: To avoid giving wrong answers, question answering (QA) models need to know when to abstain from answering. Moreover, users often ask questions that diverge from the model's training data, making errors more likely and thus abstention more critical. In this work, we propose the setting of selective question answering under domain shift, in which a QA model is tested on a mixture of in-domain and ou…

    Submitted 16 June, 2020; originally announced June 2020.

    Comments: ACL 2020

  29. arXiv:2005.00247  [pdf, other]

    cs.CL

    AdapterFusion: Non-Destructive Task Composition for Transfer Learning

    Authors: Jonas Pfeiffer, Aishwarya Kamath, Andreas Rücklé, Kyunghyun Cho, Iryna Gurevych

    Abstract: Sequential fine-tuning and multi-task learning are methods aiming to incorporate knowledge from multiple tasks; however, they suffer from catastrophic forgetting and difficulties in dataset balancing. To address these shortcomings, we propose AdapterFusion, a new two stage learning algorithm that leverages knowledge from multiple tasks. First, in the knowledge extraction stage we learn task specif…

    Submitted 26 January, 2021; v1 submitted 1 May, 2020; originally announced May 2020.

    Journal ref: Proceedings of EACL 2021

  30. arXiv:1912.02938  [pdf, ps, other]

    cs.DS cs.IT cs.LG

    Lower Bounds for Compressed Sensing with Generative Models

    Authors: Akshay Kamath, Sushrut Karmalkar, Eric Price

    Abstract: The goal of compressed sensing is to learn a structured signal $x$ from a limited number of noisy linear measurements $y \approx Ax$. In traditional compressed sensing, "structure" is represented by sparsity in some known basis. Inspired by the success of deep learning in modeling images, recent work starting with \cite{BJPD17} has instead considered structure to come from a generative model…

    Submitted 5 December, 2019; originally announced December 2019.

  31. arXiv:1909.12221  [pdf, other]

    cs.AR

    Storage Class Memory: Principles, Problems, and Possibilities

    Authors: Aditya K Kamath, Leslie Monis, A Tarun Karthik, Basavaraj Talawar

    Abstract: Storage Class Memory (SCM) is a class of memory technology which has recently become viable for use. Their name arises from the fact that they exhibit non-volatility of data, similar to secondary storage while also having latencies comparable to primary memory and byte-addressability. In this area, Phase Change Memory (PCM), Spin-Transfer-Torque Random Access Memory (STT-RAM), and Resistive RAM (ReRA…

    Submitted 26 September, 2019; originally announced September 2019.

  32. arXiv:1909.04547  [pdf, other]

    cs.CL

    What do Deep Networks Like to Read?

    Authors: Jonas Pfeiffer, Aishwarya Kamath, Iryna Gurevych, Sebastian Ruder

    Abstract: Recent research towards understanding neural networks probes models in a top-down manner, but is only able to identify model tendencies that are known a priori. We propose Susceptibility Identification through Fine-Tuning (SIFT), a novel abstractive method that uncovers a model's preferences without imposing any prior. By fine-tuning an autoencoder with the gradients from a fixed classifier, we ar…

    Submitted 10 September, 2019; originally announced September 2019.

  33. arXiv:1906.07538  [pdf, other]

    cs.CV

    Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detection

    Authors: Deepak Babu Sam, Skand Vishwanath Peri, Mukuntha Narayanan Sundararaman, Amogh Kamath, R. Venkatesh Babu

    Abstract: We introduce a detection framework for dense crowd counting and eliminate the need for the prevalent density regression paradigm. Typical counting models predict crowd density for an image as opposed to detecting every person. These regression methods, in general, fail to localize persons accurately enough for most applications other than counting. Hence, we adopt an architecture that locates every…

    Submitted 15 February, 2020; v1 submitted 18 June, 2019; originally announced June 2019.

    Comments: Accepted in T-PAMI, 2020. Code available at: https://github.com/val-iisc/lsc-cnn

  34. arXiv:1812.00978  [pdf, other]

    cs.CL

    A Survey on Semantic Parsing

    Authors: Aishwarya Kamath, Rajarshi Das

    Abstract: A significant amount of information in today's world is stored in structured and semi-structured knowledge bases. Efficient and simple methods to query them are essential and must not be restricted to only those who have expertise in formal query languages. The field of semantic parsing deals with converting natural language utterances to logical forms that can be easily executed on a knowledge ba…

    Submitted 29 May, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

    Comments: AKBC 2019

  35. arXiv:1708.03951  [pdf, other]

    stat.ML cs.AI q-bio.QM

    Optimization of Ensemble Supervised Learning Algorithms for Increased Sensitivity, Specificity, and AUC of Population-Based Colorectal Cancer Screenings

    Authors: Anirudh Kamath, Aditya Singh, Raj Ramnani, Ayush Vyas, Jay Shenoy

    Abstract: Over 150,000 new people in the United States are diagnosed with colorectal cancer each year. Nearly a third die from it (American Cancer Society). The only approved noninvasive diagnosis tools currently involve fecal blood count tests (FOBTs) or stool DNA tests. Fecal blood count tests take only five minutes and are available over the counter for as low as \…

    Submitted 14 August, 2017; v1 submitted 13 August, 2017; originally announced August 2017.

    Comments: 7 pages, 3 figures

  36. Home Automation Using SSVEP & Eye-Blink Detection Based Brain-Computer Interface

    Authors: Kratarth Goel, Raunaq Vohra, Anant Kamath, Veeky Baths

    Abstract: In this paper, we present a novel brain computer interface based home automation system using two responses - Steady State Visually Evoked Potential (SSVEP) and the eye-blink artifact, which is augmented by a Bluetooth based indoor localization system, to greatly increase the number of controllable devices. The hardware implementation of this system to control a table lamp and table fan using brai…

    Submitted 26 December, 2014; originally announced December 2014.

    Comments: 2 pages, 1 table, published at IEEE SMC 2014

  37. arXiv:1302.5366  [pdf, ps, other]

    cs.DS cs.CC

    Testing Uniformity of Stationary Distribution

    Authors: Sourav Chakraborty, Akshay Kamath, Rameshwar Pratap

    Abstract: A random walk on a directed graph gives a Markov chain on the vertices of the graph. An important question that arises often in the context of Markov chains is whether the uniform distribution on the vertices of the graph is a stationary distribution of the Markov chain. Stationary distribution of a Markov chain is a global property of the graph. In this paper, we prove that for a regular directed…

    Submitted 10 March, 2016; v1 submitted 21 February, 2013; originally announced February 2013.