Skip to main content

Showing 1–50 of 188 results for author: Kishore

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.06386  [pdf, other

    cs.CV

    FPN-IAIA-BL: A Multi-Scale Interpretable Deep Learning Model for Classification of Mass Margins in Digital Mammography

    Authors: Julia Yang, Alina Jade Barnett, Jon Donnelly, Satvik Kishore, Jerry Fang, Fides Regina Schwartz, Chaofan Chen, Joseph Y. Lo, Cynthia Rudin

    Abstract: Digital mammography is essential to breast cancer detection, and deep learning offers promising tools for faster and more accurate mammogram analysis. In radiology and other high-stakes environments, uninterpretable ("black box") deep learning models are unsuitable and there is a call in these fields to make interpretable models. Recent work in interpretable computer vision provides transparency t… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 8 pages, 6 figures, Accepted for oral presentation at the 2024 CVPR Workshop on Domain adaptation, Explainability, Fairness in AI for Medical Image Analysis (DEF-AI-MIA)

  2. arXiv:2406.05967  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark

    Authors: David Romero, Chenyang Lyu, Haryo Akbarianto Wibowo, Teresa Lynn, Injy Hamed, Aditya Nanda Kishore, Aishik Mandal, Alina Dragonetti, Artem Abzaliev, Atnafu Lambebo Tonja, Bontu Fufa Balcha, Chenxi Whitehouse, Christian Salamea, Dan John Velasco, David Ifeoluwa Adelani, David Le Meur, Emilio Villa-Cueva, Fajri Koto, Fauzan Farooqui, Frederico Belcavello, Ganzorig Batnasan, Gisela Vallejo, Grainne Caulfield, Guido Ivetta, Haiyue Song , et al. (50 additional authors not shown)

    Abstract: Visual Question Answering (VQA) is an important task in multimodal AI, and it is often used to test the ability of vision-language models to understand and reason on knowledge present in both visual and textual data. However, most of the current VQA models use datasets that are primarily focused on English and a few major world languages, with images that are typically Western-centric. While recen… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  3. arXiv:2405.17393  [pdf, other

    cs.CV

    EASI-Tex: Edge-Aware Mesh Texturing from Single Image

    Authors: Sai Raj Kishore Perla, Yizhi Wang, Ali Mahdavi-Amiri, Hao Zhang

    Abstract: We present a novel approach for single-image mesh texturing, which employs a diffusion model with judicious conditioning to seamlessly transfer an object's texture from a single RGB image to a given 3D mesh object. We do not assume that the two objects belong to the same category, and even if they do, there can be significant discrepancies in their geometry and part proportions. Our method aims to… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: ACM Transactions on Graphics (Proceedings of SIGGRAPH), 2024. Project Page: https://sairajk.github.io/easi-tex/

  4. arXiv:2405.03725  [pdf, other

    cs.NE cs.AI cs.LG

    Deep Oscillatory Neural Network

    Authors: Nurani Rajagopal Rohan, Vigneswaran C, Sayan Ghosh, Kishore Rajendran, Gaurav A, V Srinivasa Chakravarthy

    Abstract: We propose a novel, brain-inspired deep neural network model known as the Deep Oscillatory Neural Network (DONN). Deep neural networks like the Recurrent Neural Networks indeed possess sequence processing capabilities but the internal states of the network are not designed to exhibit brain-like oscillatory activity. With this motivation, the DONN is designed to have oscillatory internal dynamics.… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

  5. arXiv:2405.00080  [pdf, other

    cs.LG cs.IR cs.NI

    Recommenadation aided Caching using Combinatorial Multi-armed Bandits

    Authors: Pavamana K J, Chandramani Kishore Singh

    Abstract: We study content caching with recommendations in a wireless network where the users are connected through a base station equipped with a finite-capacity cache. We assume a fixed set of contents with unknown user preferences and content popularities. We can recommend a subset of the contents to the users which encourages the users to request these contents. Recommendation can thus be used to increa… ▽ More

    Submitted 3 May, 2024; v1 submitted 30 April, 2024; originally announced May 2024.

  6. arXiv:2404.19369  [pdf, ps, other

    cs.CL cs.HC

    Evaluating Telugu Proficiency in Large Language Models_ A Comparative Analysis of ChatGPT and Gemini

    Authors: Katikela Sreeharsha Kishore, Rahimanuddin Shaik

    Abstract: The growing prominence of large language models (LLMs) necessitates the exploration of their capabilities beyond English. This research investigates the Telugu language proficiency of ChatGPT and Gemini, two leading LLMs. Through a designed set of 20 questions encompassing greetings, grammar, vocabulary, common phrases, task completion, and situational reasoning, the study delves into their streng… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

  7. arXiv:2404.09221  [pdf, other

    cs.CL cs.AI cs.LG

    Exploring and Improving Drafts in Blockwise Parallel Decoding

    Authors: Taehyeon Kim, Ananda Theertha Suresh, Kishore Papineni, Michael Riley, Sanjiv Kumar, Adrian Benton

    Abstract: Despite the remarkable strides made by autoregressive language models, their potential is often hampered by the slow inference speeds inherent in sequential token generation. Blockwise parallel decoding (BPD) was proposed by Stern et al. as a method to improve inference speed of language models by simultaneously predicting multiple future tokens, termed block drafts, which are subsequently verifie… ▽ More

    Submitted 5 June, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

  8. arXiv:2404.07437  [pdf, other

    cs.CR

    Privacy preserving layer partitioning for Deep Neural Network models

    Authors: Kishore Rajasekar, Randolph Loh, Kar Wai Fok, Vrizlynn L. L. Thing

    Abstract: MLaaS (Machine Learning as a Service) has become popular in the cloud computing domain, allowing users to leverage cloud resources for running private inference of ML models on their data. However, ensuring user input privacy and secure inference execution is essential. One of the approaches to protect data privacy and integrity is to use Trusted Execution Environments (TEEs) by enabling execution… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  9. arXiv:2404.05151  [pdf, other

    cs.RO

    STITCH: Augmented Dexterity for Suture Throws Including Thread Coordination and Handoffs

    Authors: Kush Hari, Hansoul Kim, Will Panitch, Kishore Srinivas, Vincent Schorp, Karthik Dharmarajan, Shreya Ganti, Tara Sadjadpour, Ken Goldberg

    Abstract: We present STITCH: an augmented dexterity pipeline that performs Suture Throws Including Thread Coordination and Handoffs. STITCH iteratively performs needle insertion, thread swee**, needle extraction, suture cinching, needle handover, and needle pose correction with failure recovery policies. We introduce a novel visual 6D needle pose estimation framework using a stereo camera pair and new sut… ▽ More

    Submitted 7 April, 2024; originally announced April 2024.

  10. arXiv:2404.03071  [pdf, other

    cs.SI cs.CY physics.soc-ph

    Human Mobility in the Metaverse

    Authors: Kishore Vasan, Marton Karsai, Albert-Laszlo Barabasi

    Abstract: The metaverse promises a shift in the way humans interact with each other, and with their digital and physical environments. The lack of geographical boundaries and travel costs in the metaverse prompts us to ask if the fundamental laws that govern human mobility in the physical world apply. We collected data on avatar movements, along with their network mobility extracted from NFT purchases. We f… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 4 figures

  11. arXiv:2404.02921  [pdf, other

    cs.DL cs.HC cs.IR

    Enhancing Research Information Systems with Identification of Domain Experts

    Authors: Gautam Kishore Shahi, Oliver Hummel

    Abstract: Research organisations and their research outputs have been growing considerably in the past decades. This large body of knowledge attracts various stakeholders, e.g., for knowledge sharing, technology transfer, or potential collaborations. However, due to the large amount of complex knowledge created, traditional methods of manually curating catalogues are often out of time, imprecise, and cumber… ▽ More

    Submitted 28 March, 2024; originally announced April 2024.

    Comments: 6 pages, 4 figures accepted paper at BIR 2024 Workshop

  12. arXiv:2404.01453  [pdf, other

    cs.CL cs.AI

    Unveiling Divergent Inductive Biases of LLMs on Temporal Data

    Authors: Sindhu Kishore, Hangfeng He

    Abstract: Unraveling the intricate details of events in natural language necessitates a subtle understanding of temporal dynamics. Despite the adeptness of Large Language Models (LLMs) in discerning patterns and relationships from data, their inherent comprehension of temporal dynamics remains a formidable challenge. This research meticulously explores these intrinsic challenges within LLMs, with a specific… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

  13. arXiv:2403.02909  [pdf, other

    cs.CV cs.HC eess.IV

    Gaze-Vector Estimation in the Dark with Temporally Encoded Event-driven Neural Networks

    Authors: Abeer Banerjee, Naval K. Mehta, Shyam S. Prasad, Himanshu, Sumeet Saurav, Sanjay Singh

    Abstract: In this paper, we address the intricate challenge of gaze vector prediction, a pivotal task with applications ranging from human-computer interaction to driver monitoring systems. Our innovative approach is designed for the demanding setting of extremely low-light conditions, leveraging a novel temporal event encoding scheme, and a dedicated neural network architecture. The temporal encoding metho… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  14. arXiv:2403.02833  [pdf, other

    cs.LG cs.NE

    SOFIM: Stochastic Optimization Using Regularized Fisher Information Matrix

    Authors: Mrinmay Sen, A. K. Qin, Gayathri C, Raghu Kishore N, Yen-Wei Chen, Balasubramanian Raman

    Abstract: This paper introduces a new stochastic optimization method based on the regularized Fisher information matrix (FIM), named SOFIM, which can efficiently utilize the FIM to approximate the Hessian matrix for finding Newton's gradient update in large-scale stochastic optimization of machine learning models. It can be viewed as a variant of natural gradient descent, where the challenge of storing and… ▽ More

    Submitted 1 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  15. arXiv:2403.01646  [pdf, other

    cs.SI cs.HC cs.IR

    TweetInfo: An Interactive System to Mitigate Online Harm

    Authors: Gautam Kishore Shahi

    Abstract: The increase in active users on social networking sites (SNSs) has also observed an increase in harmful content on social media sites. Harmful content is described as an inappropriate activity to harm or deceive an individual or a group of users. Alongside existing methods to detect misinformation and hate speech, users still need to be well-informed about the harmfulness of the content on SNSs. T… ▽ More

    Submitted 3 March, 2024; originally announced March 2024.

    Comments: 3 pages

  16. arXiv:2402.00879  [pdf, other

    cs.NI cs.LG eess.SP

    Graph Representation Learning for Contention and Interference Management in Wireless Networks

    Authors: Zhouyou Gu, Branka Vucetic, Kishore Chikkam, Pasquale Aliberti, Wibowo Hardjawana

    Abstract: Restricted access window (RAW) in Wi-Fi 802.11ah networks manages contention and interference by grou** users and allocating periodic time slots for each group's transmissions. We will find the optimal user grou** decisions in RAW to maximize the network's worst-case user throughput. We review existing user grou** approaches and highlight their performance limitations in the above problem. W… ▽ More

    Submitted 15 January, 2024; originally announced February 2024.

    Comments: This work has been accepted in the IEEE/ACM Transactions on Networking. Copyright may be transferred without notice, after which this version may no longer be accessible

  17. arXiv:2401.16625  [pdf, other

    cs.IR cs.SI

    FakeClaim: A Multiple Platform-driven Dataset for Identification of Fake News on 2023 Israel-Hamas War

    Authors: Gautam Kishore Shahi, Amit Kumar Jaiswal, Thomas Mandl

    Abstract: We contribute the first publicly available dataset of factual claims from different platforms and fake YouTube videos on the 2023 Israel-Hamas war for automatic fake YouTube video classification. The FakeClaim data is collected from 60 fact-checking organizations in 30 languages and enriched with metadata from the fact-checking organizations curated by trained journalists specialized in fact-check… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: Accepted in the IR4Good Track at the 46th European Conference on Information Retrieval (ECIR) 2024

  18. arXiv:2312.12143  [pdf, other

    cs.CV eess.IV

    Integrating Human Vision Perception in Vision Transformers for Classifying Waste Items

    Authors: Akshat Kishore Shrivastava, Tapan Kumar Gandhi

    Abstract: In this paper, we propose an novel methodology aimed at simulating the learning phenomenon of nystagmus through the application of differential blurring on datasets. Nystagmus is a biological phenomenon that influences human vision throughout life, notably by diminishing head shake from infancy to adulthood. Leveraging this concept, we address the issue of waste classification, a pressing global c… ▽ More

    Submitted 20 December, 2023; v1 submitted 19 December, 2023; originally announced December 2023.

    Comments: 16 pages, 4 figures

    MSC Class: 68T45 ACM Class: I.2; I.4

  19. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  20. arXiv:2312.05803  [pdf, other

    cs.CV

    Transformer-based Selective Super-Resolution for Efficient Image Refinement

    Authors: Tianyi Zhang, Kishore Kasichainula, Yaoxin Zhuo, Baoxin Li, Jae-sun Seo, Yu Cao

    Abstract: Conventional super-resolution methods suffer from two drawbacks: substantial computational cost in upscaling an entire large image, and the introduction of extraneous or potentially detrimental information for downstream computer vision tasks during the refinement of the background. To solve these issues, we propose a novel transformer-based algorithm, Selective Super-Resolution (SSR), which parti… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

  21. arXiv:2311.02274  [pdf, other

    cs.CV

    Patch-based Selection and Refinement for Early Object Detection

    Authors: Tianyi Zhang, Kishore Kasichainula, Yaoxin Zhuo, Baoxin Li, Jae-Sun Seo, Yu Cao

    Abstract: Early object detection (OD) is a crucial task for the safety of many dynamic systems. Current OD algorithms have limited success for small objects at a long distance. To improve the accuracy and efficiency of such a task, we propose a novel set of algorithms that divide the image into patches, select patches with objects at various scales, elaborate the details of a small object, and detect it as… ▽ More

    Submitted 3 November, 2023; originally announced November 2023.

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2024, pp. 729-738

  22. arXiv:2310.16176  [pdf, other

    cs.CL cs.AI

    Correction with Backtracking Reduces Hallucination in Summarization

    Authors: Zhenzhen Liu, Chao Wan, Varsha Kishore, ** Peng Zhou, Minmin Chen, Kilian Q. Weinberger

    Abstract: Abstractive summarization aims at generating natural language summaries of a source document that are succinct while preserving the important elements. Despite recent advances, neural text summarization models are known to be susceptible to hallucinating (or more correctly confabulating), that is to produce summaries with details that are not grounded in the source document. In this paper, we intr… ▽ More

    Submitted 31 October, 2023; v1 submitted 24 October, 2023; originally announced October 2023.

  23. arXiv:2307.10323  [pdf, other

    cs.IR cs.CL cs.LG

    IncDSI: Incrementally Updatable Document Retrieval

    Authors: Varsha Kishore, Chao Wan, Justin Lovelace, Yoav Artzi, Kilian Q. Weinberger

    Abstract: Differentiable Search Index is a recently proposed paradigm for document retrieval, that encodes information about a corpus of documents within the parameters of a neural network and directly maps queries to corresponding documents. These models have achieved state-of-the-art performances for document retrieval across many benchmarks. These kinds of models have a significant limitation: it is not… ▽ More

    Submitted 19 July, 2023; originally announced July 2023.

  24. arXiv:2307.09312  [pdf, other

    cs.CL cs.LG cs.MM cs.SI

    Multi-Modal Discussion Transformer: Integrating Text, Images and Graph Transformers to Detect Hate Speech on Social Media

    Authors: Liam Hebert, Gaurav Sahu, Yuxuan Guo, Nanda Kishore Sreenivas, Lukasz Golab, Robin Cohen

    Abstract: We present the Multi-Modal Discussion Transformer (mDT), a novel methodfor detecting hate speech in online social networks such as Reddit discussions. In contrast to traditional comment-only methods, our approach to labelling a comment as hate speech involves a holistic analysis of text and images grounded in the discussion context. This is done by leveraging graph transformers to capture the cont… ▽ More

    Submitted 22 February, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Accepted to AAAI 2024 (AI for Social Impact Track)

  25. arXiv:2307.03882  [pdf, other

    cs.RO

    The Busboy Problem: Efficient Tableware Decluttering Using Consolidation and Multi-Object Grasps

    Authors: Kishore Srinivas, Shreya Ganti, Rishi Parikh, Ayah Ahmad, Wisdom Agboh, Mehmet Dogar, Ken Goldberg

    Abstract: We present the "Busboy Problem": automating an efficient decluttering of cups, bowls, and silverware from a planar surface. As gras** and transporting individual items is highly inefficient, we propose policies to generate grasps for multiple items. We introduce the metric of Objects per Trip (OpT) carried by the robot to the collection bin to analyze the improvement seen as a result of our poli… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

  26. arXiv:2307.00324  [pdf, other

    cs.CV cs.LG

    DeepMediX: A Deep Learning-Driven Resource-Efficient Medical Diagnosis Across the Spectrum

    Authors: Kishore Babu Nampalle, Pradeep Singh, Uppala Vivek Narayan, Balasubramanian Raman

    Abstract: In the rapidly evolving landscape of medical imaging diagnostics, achieving high accuracy while preserving computational efficiency remains a formidable challenge. This work presents \texttt{DeepMediX}, a groundbreaking, resource-efficient model that significantly addresses this challenge. Built on top of the MobileNetV2 architecture, DeepMediX excels in classifying brain MRI scans and skin cancer… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: 23 pages, 3 figures, 4 tables, 1 algorithm

    ACM Class: I.2.1

  27. arXiv:2306.17794  [pdf, other

    cs.LG cs.CR

    Vision Through the Veil: Differential Privacy in Federated Learning for Medical Image Classification

    Authors: Kishore Babu Nampalle, Pradeep Singh, Uppala Vivek Narayan, Balasubramanian Raman

    Abstract: The proliferation of deep learning applications in healthcare calls for data aggregation across various institutions, a practice often associated with significant privacy concerns. This concern intensifies in medical image analysis, where privacy-preserving mechanisms are paramount due to the data being sensitive in nature. Federated learning, which enables cooperative model training without direc… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: 18 pages, 3 figures, 1 table, 1 algorithm

    MSC Class: 68U10 ACM Class: I.2.1

  28. arXiv:2306.15574  [pdf, other

    cs.CV cs.LG

    See Through the Fog: Curriculum Learning with Progressive Occlusion in Medical Imaging

    Authors: Pradeep Singh, Kishore Babu Nampalle, Uppala Vivek Narayan, Balasubramanian Raman

    Abstract: In recent years, deep learning models have revolutionized medical image interpretation, offering substantial improvements in diagnostic accuracy. However, these models often struggle with challenging images where critical features are partially or fully occluded, which is a common scenario in clinical practice. In this paper, we propose a novel curriculum learning-based approach to train deep lear… ▽ More

    Submitted 30 June, 2023; v1 submitted 27 June, 2023; originally announced June 2023.

    Comments: 25 pages, 3 figures, 1 table (supplementary section added)

    MSC Class: 68T05; 68T10; 92C55 ACM Class: I.5.1

  29. arXiv:2306.12392  [pdf, other

    cs.RO cs.LG

    One-shot Imitation Learning via Interaction War**

    Authors: Ondrej Biza, Skye Thompson, Kishore Reddy Pagidi, Abhinav Kumar, Elise van der Pol, Robin Walters, Thomas Kipf, Jan-Willem van de Meent, Lawson L. S. Wong, Robert Platt

    Abstract: Imitation learning of robot policies from few demonstrations is crucial in open-ended applications. We propose a new method, Interaction War**, for learning SE(3) robotic manipulation policies from a single demonstration. We infer the 3D mesh of each object in the environment using shape war**, a technique for aligning point clouds across object instances. Then, we represent manipulation actio… ▽ More

    Submitted 4 November, 2023; v1 submitted 21 June, 2023; originally announced June 2023.

    Comments: CoRL 2023

  30. arXiv:2306.07998  [pdf, other

    cs.CV cs.AI

    Contrastive Attention Networks for Attribution of Early Modern Print

    Authors: Nikolai Vogler, Kartik Goyal, Kishore PV Reddy, Elizaveta Pertseva, Samuel V. Lemley, Christopher N. Warren, Max G'Sell, Taylor Berg-Kirkpatrick

    Abstract: In this paper, we develop machine learning techniques to identify unknown printers in early modern (c.~1500--1800) English printed books. Specifically, we focus on matching uniquely damaged character type-imprints in anonymously printed books to works with known printers in order to provide evidence of their origins. Until now, this work has been limited to manual investigations by analytical bibl… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Proceedings of AAAI 2023

  31. arXiv:2306.06755  [pdf, other

    cs.PL cs.AI cs.SE

    CoTran: An LLM-based Code Translator using Reinforcement Learning with Feedback from Compiler and Symbolic Execution

    Authors: Prithwish Jana, Piyush Jha, Haoyang Ju, Gautham Kishore, Aryan Mahajan, Vijay Ganesh

    Abstract: In this paper, we present an LLM-based code translation method and an associated tool called CoTran, that translates whole-programs from one high-level programming language to another. Current LLM-based code translation methods lack a training approach to ensure that the translated code reliably compiles or bears substantial functional equivalence to the input code. In our work, we train an LLM vi… ▽ More

    Submitted 16 January, 2024; v1 submitted 11 June, 2023; originally announced June 2023.

    ACM Class: I.2.7; I.2.5; D.2

  32. arXiv:2305.15426  [pdf, other

    cs.CV cs.LG

    Transcending Grids: Point Clouds and Surface Representations Powering Neurological Processing

    Authors: Kishore Babu Nampalle, Pradeep Singh, Vivek Narayan Uppala, Sumit Gangwar, Rajesh Singh Negi, Balasubramanian Raman

    Abstract: In healthcare, accurately classifying medical images is vital, but conventional methods often hinge on medical data with a consistent grid structure, which may restrict their overall performance. Recent medical research has been focused on tweaking the architectures to attain better performance without giving due consideration to the representation of data. In this paper, we present a novel approa… ▽ More

    Submitted 2 June, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

  33. arXiv:2305.08970  [pdf, other

    cs.MA

    Deliberation and Voting in Approval-Based Multi-Winner Elections

    Authors: Kanav Mehra, Nanda Kishore Sreenivas, Kate Larson

    Abstract: Citizen-focused democratic processes where participants deliberate on alternatives and then vote to make the final decision are increasingly popular today. While the computational social choice literature has extensively investigated voting rules, there is limited work that explicitly looks at the interplay of the deliberative process and voting. In this paper, we build a deliberation model using… ▽ More

    Submitted 15 May, 2023; originally announced May 2023.

    Comments: Paper to appear in IJCAI 2023

  34. arXiv:2305.06942  [pdf, other

    cs.DC cs.AR

    Optimizing Distributed ML Communication with Fused Computation-Collective Operations

    Authors: Kishore Punniyamurthy, Khaled Hamidouche, Bradford M. Beckmann

    Abstract: In order to satisfy their ever increasing capacity and compute requirements, machine learning models are distributed across multiple nodes using numerous parallelism strategies. As a result, collective communications are often on the critical path, and hiding their latency by overlap** kernel-granular communication and computation is difficult due to the absence of independent computation. In th… ▽ More

    Submitted 23 April, 2024; v1 submitted 11 May, 2023; originally announced May 2023.

  35. arXiv:2304.08162  [pdf, other

    cs.NE

    Cardiac Arrhythmia Detection using Artificial Neural Network

    Authors: Prof Sangeetha R G, Kishore Anand K, Sreevatsan B, Vishal Kumar A

    Abstract: The prime purpose of this project is to develop a portable cardiac abnormality monitoring device which can drastically improvise the quality of the monitoring and the overall safety of the device. While a generic, low cost, wearable battery powered device for such applications may not yield sufficient performance, such devices combined with the capabilities of Artificial Neural Network algorithms… ▽ More

    Submitted 17 April, 2023; originally announced April 2023.

  36. A Neural Network Transformer Model for Composite Microstructure Homogenization

    Authors: Emil Pitz, Kishore Pochiraju

    Abstract: Heterogeneity and uncertainty in a composite microstructure lead to either computational bottlenecks if modeled rigorously or to solution inaccuracies in the stress field and failure predictions if approximated. Although methods suitable for analyzing arbitrary and non-linear microstructures exist, their computational cost makes them impractical to use in large-scale structural analysis. Surrogate… ▽ More

    Submitted 28 May, 2024; v1 submitted 16 April, 2023; originally announced April 2023.

    Comments: 27 pages, 18 Figures

  37. arXiv:2304.02583  [pdf, other

    cs.RO cs.LG eess.SP

    A force-sensing surgical drill for real-time force feedback in robotic mastoidectomy

    Authors: Yuxin Chen, Anna Goodridge, Manish Sahu, Aditi Kishore, Seena Vafaee, Harsha Mohan, Katherina Sapozhnikov, Francis Creighton, Russell Taylor, Deepa Galaiya

    Abstract: Purpose: Robotic assistance in otologic surgery can reduce the task load of operating surgeons during the removal of bone around the critical structures in the lateral skull base. However, safe deployment into the anatomical passageways necessitates the development of advanced sensing capabilities to actively limit the interaction forces between the surgical tools and critical anatomy. Methods:… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Accepted at IPCAI2023

  38. arXiv:2303.16206  [pdf, other

    eess.IV cs.CV cs.MM

    Learning Iterative Neural Optimizers for Image Steganography

    Authors: Xiangyu Chen, Varsha Kishore, Kilian Q Weinberger

    Abstract: Image steganography is the process of concealing secret information in images through imperceptible changes. Recent work has formulated this task as a classic constrained optimization problem. In this paper, we argue that image steganography is inherently performed on the (elusive) manifold of natural images, and propose an iterative neural network trained to perform the optimization steps. In con… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: International Conference on Learning Representations (ICLR) 2023

  39. Regret, Delete, (Do Not) Repeat: An Analysis of Self-Cleaning Practices on Twitter After the Outbreak of the COVID-19 Pandemic

    Authors: Nicolás E. Díaz Ferreyra, Gautam Kishore Shahi, Catherine Tony, Stefan Stieglitz, Riccardo Scandariato

    Abstract: During the outbreak of the COVID-19 pandemic, many people shared their symptoms across Online Social Networks (OSNs) like Twitter, ho** for others' advice or moral support. Prior studies have shown that those who disclose health-related information across OSNs often tend to regret it and delete their publications afterwards. Hence, deleted posts containing sensitive data can be seen as manifesta… ▽ More

    Submitted 16 March, 2023; originally announced March 2023.

    Comments: Accepted at CHI '23 Late Breaking Work (LBW)

  40. arXiv:2303.07405  [pdf, other

    cs.AR

    Word-Level Structure Identification In FPGA Designs Using Cell Proximity Information

    Authors: Aparajithan Nathamuni-Venkatesan, Ram-Venkat Narayanan, Kishore Pula, Sundarakumar Muthukumaran, Ranga Vemuri

    Abstract: Reverse engineering of FPGA based designs from the flattened LUT level netlist to high level RTL helps in verification of the design or in understanding legacy designs. We focus on flattened netlists for FPGA devices from ** algorithm that makes use of the location information of the elements on the physical device after place and ro… ▽ More

    Submitted 7 March, 2023; originally announced March 2023.

    Comments: Paper accepted into proceedings of VLSID2023 conference

  41. arXiv:2303.02762  [pdf, other

    cs.AR

    Reverse Engineering Word-Level Models from Look-Up Table Netlists

    Authors: Ram Venkat Narayanan, Aparajithan Nathamuni Venkatesan, Kishore Pula, Sundarakumar Muthukumaran, Ranga Vemuri

    Abstract: Reverse engineering of FPGA designs from bitstreams to RTL models aids in understanding the high level functionality of the design and for validating and reconstructing legacy designs. Fast carry-chains are commonly used in synthesis of operators in FPGA designs. We propose a method to detect word-level structures by analyzing these carry-chains in LUT (Look-Up Table) level netlists. We also prese… ▽ More

    Submitted 5 March, 2023; originally announced March 2023.

    Comments: 8 pages, 6 figures, Accepted to appear in ISQED 2023 conference

  42. arXiv:2302.07588  [pdf, other

    cs.CL cs.AI q-bio.NC

    Word class representations spontaneously emerge in a deep neural network trained on next word prediction

    Authors: Kishore Surendra, Achim Schilling, Paul Stoewer, Andreas Maier, Patrick Krauss

    Abstract: How do humans learn language, and can the first language be learned at all? These fundamental questions are still hotly debated. In contemporary linguistics, there are two major schools of thought that give completely opposite answers. According to Chomsky's theory of universal grammar, language cannot be learned because children are not exposed to sufficient data in their linguistic environment.… ▽ More

    Submitted 15 February, 2023; originally announced February 2023.

    Comments: arXiv admin note: text overlap with arXiv:2301.06755

  43. arXiv:2301.10709  [pdf, other

    q-bio.QM cs.SI

    The Clinical Trials Puzzle: How Network Effects Limit Drug Discovery

    Authors: Kishore Vasan, Deisy Gysi, Albert-Laszlo Barabasi

    Abstract: The depth of knowledge offered by post-genomic medicine has carried the promise of new drugs, and cures for multiple diseases. To explore the degree to which this capability has materialized, we extract meta-data from 356,403 clinical trials spanning four decades, aiming to offer mechanistic insights into the innovation practices in drug discovery. We find that convention dominates over innovation… ▽ More

    Submitted 25 January, 2023; originally announced January 2023.

    Comments: manuscript + SI

  44. arXiv:2212.10805  [pdf, other

    cs.SI cs.CV cs.IR

    Beyond Information Exchange: An Approach to Deploy Network Properties for Information Diffusion

    Authors: Soumita Das, Anupam Biswas, Ravi Kishore Devarapalli

    Abstract: Information diffusion in Online Social Networks is a new and crucial problem in social network analysis field and requires significant research attention. Efficient diffusion of information are of critical importance in diverse situations such as; pandemic prevention, advertising, marketing etc. Although several mathematical models have been developed till date, but previous works lacked systemati… ▽ More

    Submitted 21 December, 2022; originally announced December 2022.

    Comments: To be published in BigDML 2021

    ACM Class: J.4; G.4; I.6

  45. arXiv:2212.09462  [pdf, other

    cs.CL cs.LG

    Latent Diffusion for Language Generation

    Authors: Justin Lovelace, Varsha Kishore, Chao Wan, Eliot Shekhtman, Kilian Q. Weinberger

    Abstract: Diffusion models have achieved great success in modeling continuous data modalities such as images, audio, and video, but have seen limited use in discrete domains such as language. Recent attempts to adapt diffusion to language have presented diffusion as an alternative to existing pretrained language models. We view diffusion and existing language models as complementary. We demonstrate that enc… ▽ More

    Submitted 7 November, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: NeurIPS 2023

  46. arXiv:2212.03291   

    cs.NI cs.AI

    Caching Contents with Varying Popularity using Restless Bandits

    Authors: Pavamana K J, Chandramani Kishore Singh

    Abstract: Mobile networks are experiencing prodigious increase in data volume and user density , which exerts a great burden on mobile core networks and backhaul links. An efficient technique to lessen this problem is to use caching i.e. to bring the data closer to the users by making use of the caches of edge network nodes, such as fixed or mobile access points and even user devices. The performance of a c… ▽ More

    Submitted 20 June, 2023; v1 submitted 31 October, 2022; originally announced December 2022.

    Comments: There were a mistakes while submitting updated version. I have submitted a fresh new submissions arXiv:2304.12227

  47. arXiv:2211.02293  [pdf, other

    cs.RO

    Automating Vascular Shunt Insertion with the dVRK Surgical Robot

    Authors: Karthik Dharmarajan, Will Panitch, Muyan Jiang, Kishore Srinivas, Baiyu Shi, Yahav Avigal, Huang Huang, Thomas Low, Danyal Fer, Ken Goldberg

    Abstract: Vascular shunt insertion is a fundamental surgical procedure used to temporarily restore blood flow to tissues. It is often performed in the field after major trauma. We formulate a problem of automated vascular shunt insertion and propose a pipeline to perform Automated Vascular Shunt Insertion (AVSI) using a da Vinci Research Kit. The pipeline uses a learned visual model to estimate the locus of… ▽ More

    Submitted 8 March, 2023; v1 submitted 4 November, 2022; originally announced November 2022.

    Comments: Published in: IEEE International Conference on Robotics and Automation (ICRA) 2023

  48. arXiv:2210.07653  [pdf, other

    cs.RO cs.MA eess.SY

    A Non-iterative Spatio-temporal Multi-task Assignments based Collision-free Trajectories for Music Playing Robots

    Authors: Shridhar Velhal, Krishna Kishore VS, Suresh Sundaram

    Abstract: In this paper, a non-iterative spatio-temporal multi-task assignment approach is used for playing piano music by a team of robots. This paper considers the piano playing problem, in which an algorithm needs to compute the trajectories for a dynamically sized team of robots who will play the musical notes by traveling through the specific locations associated with musical notes at their respective… ▽ More

    Submitted 17 February, 2023; v1 submitted 14 October, 2022; originally announced October 2022.

  49. arXiv:2210.07420  [pdf, other

    cs.RO cs.AI cs.LG

    Learning to Efficiently Plan Robust Frictional Multi-Object Grasps

    Authors: Wisdom C. Agboh, Satvik Sharma, Kishore Srinivas, Mallika Parulekar, Gaurav Datta, Tianshuang Qiu, Jeffrey Ichnowski, Eugen Solowjow, Mehmet Dogar, Ken Goldberg

    Abstract: We consider a decluttering problem where multiple rigid convex polygonal objects rest in randomly placed positions and orientations on a planar surface and must be efficiently transported to a packing box using both single and multi-object grasps. Prior work considered frictionless multi-object gras**. In this paper, we introduce friction to increase the number of potential grasps for a given gr… ▽ More

    Submitted 2 August, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: IEEE IROS 2023

  50. arXiv:2210.06339  [pdf, other

    cs.CV

    Self-Attention Message Passing for Contrastive Few-Shot Learning

    Authors: Ojas Kishorkumar Shirekar, Anuj Singh, Hadi Jamali-Rad

    Abstract: Humans have a unique ability to learn new representations from just a handful of examples with little to no supervision. Deep learning models, however, require an abundance of data and supervision to perform at a satisfactory level. Unsupervised few-shot learning (U-FSL) is the pursuit of bridging this gap between machines and humans. Inspired by the capacity of graph neural networks (GNNs) in dis… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.