Showing 1–50 of 311 results for author: Anuj

Searching in archive cs.
  1. arXiv:2406.13608  [pdf, other]

    cs.IT cs.CR

    Wiretapped Commitment over Binary Channels

    Authors: Anuj Kumar Yadav, Manideep Mamindlapally, Amitalok J. Budkuley

    Abstract: We propose the problem of wiretapped commitment, where two parties, say committer Alice and receiver Bob, engage in a commitment protocol using a noisy channel as a resource, in the presence of an eavesdropper, say Eve. Noisy versions of Alice's transmission over the wiretap channel are received at both Bob and Eve. We seek to determine the maximum commitment throughput in the presence of an eaves…

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 13 Pages, 1 figure

  2. arXiv:2406.05599  [pdf, other]

    quant-ph cs.IT

    Reliable Quantum Memories with Unreliable Components

    Authors: Anuj K. Nayak, Eric Chitambar, Lav R. Varshney

    Abstract: Quantum memory systems are vital in quantum information processing for dependable storage and retrieval of quantum states. Inspired by classical reliability theories that synthesize reliable computing systems from unreliable components, we formalize the problem of reliable storage of quantum information using noisy components. We introduce the notion of stable quantum memories and define the stora…

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: 15 pages, 3 figures

  3. arXiv:2406.04744  [pdf, other]

    cs.CL

    CRAG -- Comprehensive RAG Benchmark

    Authors: Xiao Yang, Kai Sun, Hao Xin, Yushi Sun, Nikita Bhalla, Xiangsen Chen, Sajal Choudhary, Rongze Daniel Gui, Ziran Will Jiang, Ziyu Jiang, Lingkun Kong, Brian Moran, Jiaqi Wang, Yifan Ethan Xu, An Yan, Chenyu Yang, Eting Yuan, Hanwen Zha, Nan Tang, Lei Chen, Nicolas Scheffer, Yue Liu, Nirav Shah, Rakesh Wanga, Anuj Kumar, et al. (2 additional authors not shown)

    Abstract: Retrieval-Augmented Generation (RAG) has recently emerged as a promising solution to alleviate Large Language Model (LLM)'s deficiency in lack of knowledge. Existing RAG datasets, however, do not adequately represent the diverse and dynamic nature of real-world Question Answering (QA) tasks. To bridge this gap, we introduce the Comprehensive RAG Benchmark (CRAG), a factual question answering bench…

    Submitted 7 June, 2024; originally announced June 2024.

  4. arXiv:2406.03747  [pdf, other]

    cs.CV cs.AI cs.LG

    Instance Segmentation and Teeth Classification in Panoramic X-rays

    Authors: Devichand Budagam, Ayush Kumar, Sayan Ghosh, Anuj Shrivastav, Azamat Zhanatuly Imanbayev, Iskander Rafailovich Akhmetov, Dmitrii Kaplun, Sergey Antonov, Artem Rychenkov, Gleb Cyganov, Aleksandr Sinitca

    Abstract: Teeth segmentation and recognition are critical in various dental applications and dental diagnosis. Automatic and accurate segmentation approaches have been made possible by integrating deep learning models. Although teeth segmentation has been studied in the past, only some techniques were able to effectively classify and segment teeth simultaneously. This article offers a pipeline of two deep l…

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Submitted to Expert Systems with Applications Journal

  5. arXiv:2405.19147  [pdf, ps, other]

    cs.DM math.CO

    Homomorphism Counts to Trees

    Authors: Anuj Dawar

    Abstract: We construct a pair of non-isomorphic, bipartite graphs which are not distinguished by counting the number of homomorphisms to any tree. This answers a question raised by Atserias et al. (LICS 2021). In order to establish the construction, we analyse the equivalence relations induced by counting homomorphisms to trees of diameter two and three and obtain necessary and sufficient conditions for two…

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 13 pages

    MSC Class: 05C60 (primary) 68R10 (secondary); ACM Class: G.2.1; G.2.2

  6. arXiv:2405.14876  [pdf, other]

    cs.CV cs.AI

    Precise and Robust Sidewalk Detection: Leveraging Ensemble Learning to Surpass LLM Limitations in Urban Environments

    Authors: Ibne Farabi Shihab, Benjir Islam Alvee, Sudesh Ramesh Bhagat, Anuj Sharma

    Abstract: This study aims to compare the effectiveness of a robust ensemble model with the state-of-the-art ONE-PEACE Large Language Model (LLM) for accurate detection of sidewalks. Accurate sidewalk detection is crucial in improving road safety and urban planning. The study evaluated the model's performance on Cityscapes, Ade20k, and the Boston Dataset. The results showed that the ensemble model performed…

    Submitted 1 April, 2024; originally announced May 2024.

  7. arXiv:2405.14830  [pdf, other]

    hep-lat cond-mat.dis-nn cond-mat.str-el cs.LG hep-th

    Deep learning lattice gauge theories

    Authors: Anuj Apte, Anthony Ashmore, Clay Cordova, Tzu-Chen Huang

    Abstract: Monte Carlo methods have led to profound insights into the strong-coupling behaviour of lattice gauge theories and produced remarkable results such as first-principles computations of hadron masses. Despite tremendous progress over the last four decades, fundamental challenges such as the sign problem and the inability to simulate real-time dynamics remain. Neural network quantum states have emerg…

    Submitted 23 May, 2024; originally announced May 2024.

  8. arXiv:2405.10887  [pdf, ps, other]

    cs.LO math.LO

    Preservation theorems on sparse classes revisited

    Authors: Anuj Dawar, Ioannis Eleftheriadis

    Abstract: We revisit the work studying homomorphism preservation for first-order logic in sparse classes of structures initiated in [Atserias et al., JACM 2006] and [Dawar, JCSS 2010]. These established that first-order logic has the homomorphism preservation property in any sparse class that is monotone and addable. It turns out that the assumption of addability is not strong enough for the proofs given. W…

    Submitted 20 May, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: 16 pages

    MSC Class: 03B70; 03C13; 68Q19; 05C10

  9. arXiv:2405.09247  [pdf, other]

    cs.CV cs.LG

    Graph Neural Network based Handwritten Trajectories Recognition

    Authors: Anuj Sharma, Sukhdeep Singh, S Ratna

    Abstract: The graph neural networks has been proved to be an efficient machine learning technique in real life applications. The handwritten recognition is one of the useful area in real life use where both offline and online handwriting recognition are required. The chain code as feature extraction technique has shown significant results in literature and we have been able to use chain codes with graph neu…

    Submitted 15 May, 2024; originally announced May 2024.

  10. arXiv:2404.14062  [pdf, other]

    cs.CV cs.LG

    GatedLexiconNet: A Comprehensive End-to-End Handwritten Paragraph Text Recognition System

    Authors: Lalita Kumari, Sukhdeep Singh, Vaibhav Varish Singh Rathore, Anuj Sharma

    Abstract: The Handwritten Text Recognition problem has been a challenge for researchers for the last few decades, especially in the domain of computer vision, a subdomain of pattern recognition. Variability of texts amongst writers, cursiveness, and different font styles of handwritten texts with degradation of historical text images make it a challenging problem. Recognizing scanned document images in neur…

    Submitted 22 April, 2024; originally announced April 2024.

  11. arXiv:2404.13767  [pdf, other]

    cs.RO cs.AI cs.CV

    Autonomous Robot for Disaster Mapping and Victim Localization

    Authors: Michael Potter, Rahil Bhowal, Richard Zhao, Anuj Patel, **gming Cheng

    Abstract: In response to the critical need for effective reconnaissance in disaster scenarios, this research article presents the design and implementation of a complete autonomous robot system using the Turtlebot3 with Robotic Operating System (ROS) Noetic. Upon deployment in closed, initially unknown environments, the system aims to generate a comprehensive map and identify any present 'victims' using Apr…

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: Class final project for Northeastern University EECE 5550 Mobile Robotics Course

  12. arXiv:2404.12258  [pdf, ps, other]

    cs.CV

    DeepLocalization: Using change point detection for Temporal Action Localization

    Authors: Mohammed Shaiqur Rahman, Ibne Farabi Shihab, Lynna Chu, Anuj Sharma

    Abstract: In this study, we introduce DeepLocalization, an innovative framework devised for the real-time localization of actions tailored explicitly for monitoring driver behavior. Utilizing the power of advanced deep learning methodologies, our objective is to tackle the critical issue of distracted driving-a significant factor contributing to road accidents. Our strategy employs a dual approach: leveragi…

    Submitted 18 April, 2024; originally announced April 2024.

  13. arXiv:2404.09432  [pdf, other]

    cs.CV cs.AI cs.LG

    The 8th AI City Challenge

    Authors: Shuo Wang, David C. Anastasiu, Zheng Tang, Ming-Ching Chang, Yue Yao, Liang Zheng, Mohammed Shaiqur Rahman, Meenakshi S. Arya, Anuj Sharma, Pranamesh Chakraborty, Sanjita Prajapati, Quan Kong, Norimasa Kobori, Munkhjargal Gochoo, Munkh-Erdene Otgonbold, Fady Alnajjar, Ganzorig Batnasan, **-Yang Chen, Jun-Wei Hsieh, Xunlei Wu, Sameer Satish Pusegaonkar, Yizhou Wang, Sujit Biswas, Rama Chellappa

    Abstract: The eighth AI City Challenge highlighted the convergence of computer vision and artificial intelligence in areas like retail, warehouse settings, and Intelligent Traffic Systems (ITS), presenting significant research opportunities. The 2024 edition featured five tracks, attracting unprecedented interest from 726 teams in 47 countries and regions. Track 1 dealt with multi-target multi-camera (MTMC)…

    Submitted 14 April, 2024; originally announced April 2024.

    Comments: Summary of the 8th AI City Challenge Workshop in conjunction with CVPR 2024

  14. arXiv:2404.08744  [pdf, other]

    cs.NI cs.ET quant-ph

    Routing and Spectrum Allocation in Broadband Quantum Entanglement Distribution

    Authors: Rohan Bali, Ashley N. Tittelbaugh, Shelbi L. Jenkins, Anuj Agrawal, Jerry Horgan, Marco Ruffini, Daniel C. Kilper, Boulat A. Bash

    Abstract: We investigate resource allocation for quantum entanglement distribution over an optical network. We characterize and model a network architecture that employs a single quasi-deterministic time-frequency heralded Einstein-Podolsky-Rosen (EPR) pair source, and develop a routing scheme for distributing entangled photon pairs over such a network. We focus on max-min fairness in entanglement distribut…

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2311.14613

  15. arXiv:2404.08011  [pdf, other]

    cs.CV cs.LG

    An inclusive review on deep learning techniques and their scope in handwriting recognition

    Authors: Sukhdeep Singh, Sudhir Rohilla, Anuj Sharma

    Abstract: Deep learning expresses a category of machine learning algorithms that have the capability to combine raw inputs into intermediate features layers. These deep learning algorithms have demonstrated great results in different fields. Deep learning has particularly witnessed for a great achievement of human level performance across a number of domains in computer vision and pattern recognition. For t…

    Submitted 10 April, 2024; originally announced April 2024.

  16. arXiv:2403.16419  [pdf, other]

    cs.RO

    Terrain-Attentive Learning for Efficient 6-DoF Kinodynamic Modeling on Vertically Challenging Terrain

    Authors: Aniket Datar, Chenhui Pan, Mohammad Nazeri, Anuj Pokhrel, Xuesu Xiao

    Abstract: Wheeled robots have recently demonstrated superior mechanical capability to traverse vertically challenging terrain (e.g., extremely rugged boulders comparable in size to the vehicles themselves). Negotiating such terrain introduces significant variations of vehicle pose in all six Degrees-of-Freedom (DoFs), leading to imbalanced contact forces, varying momentum, and chassis deformation due to non…

    Submitted 25 March, 2024; originally announced March 2024.

  17. arXiv:2403.15989  [pdf, other]

    cs.LG cs.AI cs.CE

    Knowledge-guided Machine Learning: Current Trends and Future Prospects

    Authors: Anuj Karpatne, Xiaowei Jia, Vipin Kumar

    Abstract: This paper presents an overview of scientific modeling and discusses the complementary strengths and weaknesses of ML methods for scientific modeling in comparison to process-based models. It also provides an introduction to the current state of research in the emerging field of scientific knowledge-guided machine learning (KGML) that aims to use both scientific knowledge and data in ML frameworks…

    Submitted 1 May, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

  18. arXiv:2403.15637  [pdf, other]

    cs.RO

    CoNVOI: Context-aware Navigation using Vision Language Models in Outdoor and Indoor Environments

    Authors: Adarsh Jagan Sathyamoorthy, Kasun Weerakoon, Mohamed Elnoor, Anuj Zore, Brian Ichter, Fei Xia, Jie Tan, Wenhao Yu, Dinesh Manocha

    Abstract: We present ConVOI, a novel method for autonomous robot navigation in real-world indoor and outdoor environments using Vision Language Models (VLMs). We employ VLMs in two ways: first, we leverage their zero-shot image classification capability to identify the context or scenario (e.g., indoor corridor, outdoor terrain, crosswalk, etc) of the robot's surroundings, and formulate context-based naviga…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures

  19. arXiv:2403.15077  [pdf, other]

    cs.LG

    GTAGCN: Generalized Topology Adaptive Graph Convolutional Networks

    Authors: Sukhdeep Singh, Anuj Sharma, Vinod Kumar Chauhan

    Abstract: Graph Neural Networks (GNN) have emerged as a popular and standard approach for learning from graph-structured data. The literature on GNN highlights the potential of this evolving research area and its widespread adoption in real-life applications. However, most of the approaches are either new in concept or derived from specific techniques. Therefore, the potential of more than one approach in h…

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 2 figures, 3 tables and 26 pages

  20. arXiv:2403.12223  [pdf, ps, other]

    cs.RO cs.HC

    HRI in Indian Education: Challenges Opportunities

    Authors: Chinmaya Mishra, Anuj Nandanwar, Sashikala Mishra

    Abstract: With the recent advancements in the field of robotics and the increased focus on having general-purpose robots widely available to the general public, it has become increasingly necessary to pursue research into Human-robot interaction (HRI). While there have been a lot of works discussing frameworks for teaching HRI in educational institutions with a few institutions already offering courses to s…

    Submitted 18 March, 2024; originally announced March 2024.

    Comments: Presented at the Designing an Intro to HRI Course Workshop at HRI 2024 (arXiv:2403.05588)

    Report number: HRI101/2024/9

  21. arXiv:2403.08283  [pdf, other]

    cs.CV

    Optimized Detection and Classification on GTRSB: Advancing Traffic Sign Recognition with Convolutional Neural Networks

    Authors: Dhruv Toshniwal, Saurabh Loya, Anuj Khot, Yash Marda

    Abstract: In the rapidly evolving landscape of transportation, the proliferation of automobiles has made road traffic more complex, necessitating advanced vision-assisted technologies for enhanced safety and navigation. These technologies are imperative for providing critical traffic sign information, influencing driver behavior, and supporting vehicle control, especially for drivers with disabilities and i…

    Submitted 13 March, 2024; originally announced March 2024.

    Comments: 8 pages, 8 figures, 1 table

  22. arXiv:2403.07867  [pdf, other]

    cs.RO

    The Virtues of Laziness: Multi-Query Kinodynamic Motion Planning with Lazy Methods

    Authors: Anuj Pasricha, Alessandro Roncone

    Abstract: In this work, we introduce LazyBoE, a multi-query method for kinodynamic motion planning with forward propagation. This algorithm allows for the simultaneous exploration of a robot's state and control spaces, thereby enabling a wider suite of dynamic tasks in real-world applications. Our contributions are three-fold: i) a method for discretizing the state and control spaces to amortize planning ti…

    Submitted 4 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: Accepted to ICRA 2022 (International Conference on Robotics and Automation)

  23. arXiv:2403.07003  [pdf, other]

    cs.AI cs.CY cs.LG cs.NI

    Evacuation Management Framework towards Smart City-wide Intelligent Emergency Interactive Response System

    Authors: Anuj Abraham, Yi Zhang, Shitala Prasad

    Abstract: A smart city solution toward future 6G network deployment allows small and medium sized enterprises (SMEs), industry, and government entities to connect with the infrastructures and play a crucial role in enhancing emergency preparedness with advanced sensors. The objective of this work is to propose a set of coordinated technological solutions to transform an existing emergency response system in…

    Submitted 7 March, 2024; originally announced March 2024.

  24. arXiv:2403.02439  [pdf, other]

    cs.LG cs.AI

    Root Causing Prediction Anomalies Using Explainable AI

    Authors: Ramanathan Vishnampet, Rajesh Shenoy, Jianhui Chen, Anuj Gupta

    Abstract: This paper presents a novel application of explainable AI (XAI) for root-causing performance degradation in machine learning models that learn continuously from user engagement data. In such systems a single feature corruption can cause cascading feature, label and concept drifts. We have successfully applied this technique to improve the reliability of models used in personalized advertising. Per…

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Submitted to The 2nd World Conference on eXplainable Artificial Intelligence, 17-19 July, 2024, Malta, Valletta

  25. arXiv:2402.12937  [pdf, other]

    cs.LG cs.SI

    GRAPHGINI: Fostering Individual and Group Fairness in Graph Neural Networks

    Authors: Anuj Kumar Sirohi, Anjali Gupta, Sayan Ranu, Sandeep Kumar, Amitabha Bagchi

    Abstract: We address the growing apprehension that GNNs, in the absence of fairness constraints, might produce biased decisions that disproportionately affect underprivileged groups or individuals. Departing from previous work, we introduce for the first time a method for incorporating the Gini coefficient as a measure of fairness to be used within the GNN framework. Our proposal, GRAPHGINI, works with the…

    Submitted 20 February, 2024; originally announced February 2024.

  26. arXiv:2402.10600  [pdf, other]

    cs.NI

    Envisioning the Future Role of 3D Wireless Networks in Preventing and Managing Disasters and Emergency Situations

    Authors: Ahmed Alhammadi, Anuj Abraham, Aymen Fakhreddine, Yu Tian, Jun Du, Faouzi Bader

    Abstract: In an era marked by unprecedented climatic upheavals and evolving urban landscapes, the role of advanced communication networks in disaster prevention and management is becoming increasingly critical. This paper explores the transformative potential of 3D wireless networks, an innovative amalgamation of terrestrial, aerial, and satellite technologies, in enhancing disaster response mechanisms. We…

    Submitted 16 February, 2024; originally announced February 2024.

  27. arXiv:2402.10465  [pdf, ps, other]

    cs.IT

    Subfield codes of $C_D$-codes over $\mathbb{F}_2[x]/\langle x^3-x \rangle$ are really nice!

    Authors: Anuj Kumar Bhagat, Ritumoni Sarma, Vidya Sagar

    Abstract: A non-zero $\mathbb{F}$-linear map from a finite-dimensional commutative $\mathbb{F}$-algebra to $\mathbb{F}$ is called an $\mathbb{F}$-valued trace if its kernel does not contain any non-zero ideals. In this article, we utilize an $\mathbb{F}_2$-valued trace of the $\mathbb{F}_2$-algebra $\mathcal{R}_2:=\mathbb{F}_2[x]/\langle x^3-x\rangle$ to study binary subfield code $\mathcal{C}_D^{(2)}$ of…

    Submitted 16 February, 2024; originally announced February 2024.

  28. arXiv:2402.08017  [pdf, other]

    cs.CV cs.CL cs.LG

    Lumos : Empowering Multimodal LLMs with Scene Text Recognition

    Authors: Ashish Shenoy, Yichao Lu, Srihari Jayakumar, Debojeet Chatterjee, Mohsen Moslehpour, Pierce Chuang, Abhay Harpale, Vikas Bhardwaj, Di Xu, Shicong Zhao, Longfang Zhao, Ankit Ramchandani, Xin Luna Dong, Anuj Kumar

    Abstract: We introduce Lumos, the first end-to-end multimodal question-answering system with text understanding capabilities. At the core of Lumos is a Scene Text Recognition (STR) component that extracts text from first person point-of-view images, the output of which is used to augment input to a Multimodal Large Language Model (MM-LLM). While building Lumos, we encountered numerous challenges related to…

    Submitted 1 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted to KDD 2024 (ADS Track)

  29. arXiv:2402.07065  [pdf, other]

    cs.RO

    CAHSOR: Competence-Aware High-Speed Off-Road Ground Navigation in SE(3)

    Authors: Anuj Pokhrel, Aniket Datar, Mohammad Nazeri, Xuesu Xiao

    Abstract: While the workspace of traditional ground vehicles is usually assumed to be in a 2D plane, i.e., SE(2), such an assumption may not hold when they drive at high speeds on unstructured off-road terrain: High-speed sharp turns on high-friction surfaces may lead to vehicle rollover; Turning aggressively on loose gravel or grass may violate the non-holonomic constraint and cause significant lateral sli…

    Submitted 10 February, 2024; originally announced February 2024.

  30. arXiv:2402.06159  [pdf, other]

    cs.CR

    Passwords Are Meant to Be Secret: A Practical Secure Password Entry Channel for Web Browsers

    Authors: Anuj Gautam, Tarun Kumar Yadav, Kent Seamons, Scott Ruoti

    Abstract: Password-based authentication faces various security and usability issues. Password managers help alleviate some of these issues by enabling users to manage their passwords effectively. However, malicious client-side scripts and browser extensions can steal passwords after they have been autofilled by the manager into the web page. In this paper, we explore what role the password manager can take…

    Submitted 8 February, 2024; originally announced February 2024.

  31. arXiv:2402.03255  [pdf, other]

    cs.SI cs.CY cs.HC

    Security Advice for Parents and Children About Content Filtering and Circumvention as Found on YouTube and TikTok

    Authors: Ran Elgedawy, John Sadik, Anuj Gautam, Trinity Bissahoyo, Christopher Childress, Jacob Leonard, Clay Shubert, Scott Ruoti

    Abstract: In today's digital age, concerns about online security and privacy have become paramount. However, addressing these issues can be difficult, especially within the context of family relationships, wherein parents and children may have conflicting interests. In this environment, parents and children may turn to online security advice to determine how to proceed. In this paper, we examine the advice…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 15 pages, 5 figures, 8 tables

  32. arXiv:2402.00689  [pdf, other]

    cs.CR cs.AI

    Ocassionally Secure: A Comparative Analysis of Code Generation Assistants

    Authors: Ran Elgedawy, John Sadik, Senjuti Dutta, Anuj Gautam, Konstantinos Georgiou, Farzin Gholamrezae, Fujiao Ji, Kyungchan Lim, Qian Liu, Scott Ruoti

    Abstract: Large Language Models (LLMs) are being increasingly utilized in various applications, with code generations being a notable example. While previous research has shown that LLMs have the capability to generate both secure and insecure code, the literature does not take into account what factors help generate secure and effective code. Therefore in this paper we focus on identifying and understan…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 12 pages, 2 figures

  33. Assistant, Parrot, or Colonizing Loudspeaker? ChatGPT Metaphors for Developing Critical AI Literacies

    Authors: Anuj Gupta, Yasser Atef, Anna Mills, Maha Bali

    Abstract: This study explores how discussing metaphors for AI can help build awareness of the frames that shape our understanding of AI systems, particularly large language models (LLMs) like ChatGPT. Given the pressing need to teach "critical AI literacy", discussion of metaphor provides an opportunity for inquiry and dialogue with space for nuance, playfulness, and critique. Using a collaborative autoethn…

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: This is a preprint (accepted version) of an article that has been accepted for publication at the journal Open Praxis: https://openpraxis.org/

    ACM Class: I.2.0; K.3.0; K.3.1; K.4.0; K.4.2; J.4; J.5

  34. arXiv:2312.14919  [pdf, other]

    cs.CV cs.LG

    Lift-Attend-Splat: Bird's-eye-view camera-lidar fusion using transformers

    Authors: James Gunn, Zygmunt Lenyk, Anuj Sharma, Andrea Donati, Alexandru Buburuzan, John Redford, Romain Mueller

    Abstract: Combining complementary sensor modalities is crucial to providing robust perception for safety-critical robotics applications such as autonomous driving (AD). Recent state-of-the-art camera-lidar fusion methods for AD rely on monocular depth estimation which is a notoriously difficult task compared to using depth information from the lidar directly. Here, we find that this approach does not levera…

    Submitted 21 May, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: Updated method figure; camera ready

  35. arXiv:2312.11805  [pdf, other]

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee, et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr…

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  36. arXiv:2312.10437  [pdf]

    cs.CV

    Tender Notice Extraction from E-papers Using Neural Network

    Authors: Ashmin Bhattarai, Anuj Sedhai, Devraj Neupane, Manish Khadka, Rama Bastola

    Abstract: Tender notices are usually sought by most of the companies at regular intervals as a means for obtaining the contracts of various projects. These notices consist of all the required information like description of the work, period of construction, estimated amount of project, etc. In the context of Nepal, tender notices are usually published in national as well as local newspapers. The interested…

    Submitted 16 December, 2023; originally announced December 2023.

  37. arXiv:2312.10080  [pdf, ps, other]

    cs.IR cs.AI cs.LG

    No prejudice! Fair Federated Graph Neural Networks for Personalized Recommendation

    Authors: Nimesh Agrawal, Anuj Kumar Sirohi, Jayadeva, Sandeep Kumar

    Abstract: Ensuring fairness in Recommendation Systems (RSs) across demographic groups is critical due to the increased integration of RSs in applications such as personalized healthcare, finance, and e-commerce. Graph-based RSs play a crucial role in capturing intricate higher-order interactions among entities. However, integrating these graph models into the Federated Learning (FL) paradigm with fairness c…

    Submitted 20 December, 2023; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: To appear as a full paper in AAAI 2024

  38. arXiv:2312.02548  [pdf, other]

    cs.CV

    GeNIe: Generative Hard Negative Images Through Diffusion

    Authors: Soroush Abbasi Koohpayegani, Anuj Singh, K L Navaneet, Hadi Jamali-Rad, Hamed Pirsiavash

    Abstract: Data augmentation is crucial in training deep models, preventing them from overfitting to limited data. Recent advances in generative AI, e.g., diffusion models, have enabled more sophisticated augmentation techniques that produce data resembling natural images. We introduce GeNIe a novel augmentation method which leverages a latent diffusion model conditioned on a text prompt to merge contrasting…

    Submitted 23 March, 2024; v1 submitted 5 December, 2023; originally announced December 2023.

    Comments: Our code is available at https://github.com/UCDvision/GeNIe

  39. arXiv:2312.00796  [pdf]

    q-bio.BM cs.CE

    Multiple Protein Profiler 1.0 (MPP): A webserver for predicting and visualizing physiochemical properties of proteins at the proteome level

    Authors: Gustavo Sganzerla Martinez, Mansi Dutt, Anuj Kumar, David J Kelvin

    Abstract: Determining the physicochemical properties of a protein can reveal important insights in their structure, biological functions, stability, and interactions with other molecules. Although tools for computing properties of proteins already existed, we could not find a comprehensive tool that enables the calculations of multiple properties for multiple input proteins on the proteome level at once. Fa…

    Submitted 17 November, 2023; originally announced December 2023.

  40. arXiv:2312.00038  [pdf, other]

    physics.comp-ph cs.LG physics.chem-ph physics.flu-dyn

    A Posteriori Evaluation of a Physics-Constrained Neural Ordinary Differential Equations Approach Coupled with CFD Solver for Modeling Stiff Chemical Kinetics

    Authors: Tadbhagya Kumar, Anuj Kumar, Pinaki Pal

    Abstract: The high computational cost associated with solving for detailed chemistry poses a significant challenge for predictive computational fluid dynamics (CFD) simulations of turbulent reacting flows. These models often require solving a system of coupled stiff ordinary differential equations (ODEs). While deep learning techniques have been experimented with to develop faster surrogate models, they oft…

    Submitted 4 March, 2024; v1 submitted 22 November, 2023; originally announced December 2023.

  41. arXiv:2311.16766  [pdf, other]

    cs.CV cs.LG

    Rescuing referral failures during automated diagnosis of domain-shifted medical images

    Authors: Anuj Srivastava, Karm Patel, Pradeep Shenoy, Devarajan Sridharan

    Abstract: The success of deep learning models deployed in the real world depends critically on their ability to generalize well across diverse data domains. Here, we address a fundamental challenge with selective classification during automated diagnosis with domain-shifted medical images. In this scenario, models must learn to avoid making predictions when label confidence is low, especially when tested wi…

    Submitted 28 November, 2023; originally announced November 2023.

  42. arXiv:2311.14613  [pdf, other]

    cs.NI

    Routing and Spectrum Allocation in Broadband Degenerate EPR-Pair Distribution

    Authors: Rohan Bali, Ashley Tittelbaugh, Shelbi L. Jenkins, Anuj Agrawal, Jerry Horgan, Marco Ruffini, Daniel Kilper, Boulat A. Bash

    Abstract: We investigate resource allocation for quantum entanglement distribution over an optical network. We characterize and model a network architecture that employs a single quasideterministic time-frequency heralded EPR-pair source, and develop a routing scheme for distributing entangled photon pairs over such a network. We focus on fairness in entanglement distribution, and compare both the performan…

    Submitted 24 November, 2023; originally announced November 2023.

  43. arXiv:2311.04157  [pdf, other]

    cs.CV cs.AI

    A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis

    Authors: Dipanjyoti Paul, Arpita Chowdhury, Xinqi Xiong, Feng-Ju Chang, David Carlyn, Samuel Stevens, Kaiya L. Provost, Anuj Karpatne, Bryan Carstens, Daniel Rubenstein, Charles Stewart, Tanya Berger-Wolf, Yu Su, Wei-Lun Chao

    Abstract: We present a novel usage of Transformers to make image classification interpretable. Unlike mainstream classifiers that wait until the last fully connected layer to incorporate class information to make predictions, we investigate a proactive approach, asking each class to search for itself in an image. We realize this idea via a Transformer encoder-decoder inspired by DEtection TRansformer (DETR)…

    Submitted 14 June, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted to International Conference on Learning Representations 2024 (ICLR 2024)

  44. arXiv:2310.09441  [pdf, other]

    cs.CV physics.bio-ph q-bio.QM

    MEMTRACK: A Deep Learning-Based Approach to Microrobot Tracking in Dense and Low-Contrast Environments

    Authors: Medha Sawhney, Bhas Karmarkar, Eric J. Leaman, Arka Daw, Anuj Karpatne, Bahareh Behkam

    Abstract: Tracking microrobots is challenging, considering their minute size and high speed. As the field progresses towards developing microrobots for biomedical applications and conducting mechanistic studies in physiologically relevant media (e.g., collagen), this challenge is exacerbated by the dense surrounding environments with feature size and shape comparable to microrobots. Herein, we report Motion…

    Submitted 13 October, 2023; originally announced October 2023.

  45. arXiv:2310.02351  [pdf]

    stat.AP cs.LG

    Investigating Speed Deviation Patterns During Glucose Episodes: A Quantile Regression Approach

    Authors: Aparna Joshi, Jennifer Merickel, Cyrus V. Desouza, Matthew Rizzo, Pujitha Gunaratne, Anuj Sharma

    Abstract: Given the growing prevalence of diabetes, there has been significant interest in determining how diabetes affects instrumental daily functions, like driving. Complication of glucose control in diabetes includes hypoglycemic and hyperglycemic episodes, which may impair cognitive and psychomotor functions needed for safe driving. The goal of this paper was to determine patterns of diabetes speed beh…

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 6 pages, 2 figures, 5 Tables, Accepted and Presented at IEEE ITSC 2023 Conference in Bilbao Spain

  46. arXiv:2309.16058  [pdf, other]

    cs.LG cs.CL cs.CV

    AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model

    Authors: Seungwhan Moon, Andrea Madotto, Zhaojiang Lin, Tushar Nagarajan, Matt Smith, Shashank Jain, Chun-Fu Yeh, Prakash Murugesan, Peyman Heidari, Yue Liu, Kavya Srinet, Babak Damavandi, Anuj Kumar

    Abstract: We present Any-Modality Augmented Language Model (AnyMAL), a unified model that reasons over diverse input modality signals (i.e. text, image, video, audio, IMU motion sensor), and generates textual responses. AnyMAL inherits the powerful text-based reasoning abilities of the state-of-the-art LLMs including LLaMA-2 (70B), and converts modality-specific signals to the joint textual space through a…

    Submitted 27 September, 2023; originally announced September 2023.

  47. arXiv:2309.14601  [pdf, other]

    cs.LG cs.HC

    Neuro-Visualizer: An Auto-encoder-based Loss Landscape Visualization Method

    Authors: Mohannad Elhamod, Anuj Karpatne

    Abstract: In recent years, there has been a growing interest in visualizing the loss landscape of neural networks. Linear landscape visualization methods, such as principal component analysis, have become widely used as they intuitively help researchers study neural networks and their training process. However, these linear methods suffer from limitations and drawbacks due to their lack of flexibility and l…

    Submitted 25 September, 2023; originally announced September 2023.

  48. arXiv:2309.09595  [pdf, ps, other]

    cs.IT

    $\mathbb{F}$-valued trace of a finite-dimensional commutative $\mathbb{F}$-algebra

    Authors: Anuj Kr Bhagat, Ritumoni Sarma

    Abstract: A non-zero $\mathbb{F}$-valued $\mathbb{F}$-linear map on a finite dimensional $\mathbb{F}$-algebra is called an $\mathbb{F}$-valued trace if its kernel does not contain any non-zero ideals. However, given an $\mathbb{F}$-algebra such a map may not always exist. We find an infinite class of finite-dimensional commutative $\mathbb{F}$-algebras which admit an $\mathbb{F}$-valued trace. In fact, in t…

    Submitted 19 September, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

  49. Adapted Large Language Models Can Outperform Medical Experts in Clinical Text Summarization

    Authors: Dave Van Veen, Cara Van Uden, Louis Blankemeier, Jean-Benoit Delbrouck, Asad Aali, Christian Bluethgen, Anuj Pareek, Malgorzata Polacin, Eduardo Pontes Reis, Anna Seehofnerova, Nidhi Rohatgi, Poonam Hosamani, William Collins, Neera Ahuja, Curtis P. Langlotz, Jason Hom, Sergios Gatidis, John Pauly, Akshay S. Chaudhari

    Abstract: Analyzing vast textual data and summarizing key information from electronic health records imposes a substantial burden on how clinicians allocate their time. Although large language models (LLMs) have shown promise in natural language processing (NLP), their effectiveness on a diverse range of clinical summarization tasks remains unproven. In this study, we apply adaptation methods to eight LLMs,…

    Submitted 11 April, 2024; v1 submitted 14 September, 2023; originally announced September 2023.

    Comments: 27 pages, 19 figures

    Journal ref: Nature Medicine, 2024

  50. arXiv:2309.04892  [pdf, ps, other]

    math.CO cs.LO

    Descriptive complexity of controllable graphs

    Authors: Aida Abiad, Anuj Dawar, Octavio Zapata

    Abstract: Let $G$ be a graph on $n$ vertices with adjacency matrix $A$, and let $\mathbf{1}$ be the all-ones vector. We call $G$ controllable if the set of vectors $\mathbf{1}, A\mathbf{1}, \dots, A^{n-1}\mathbf{1}$ spans the whole space $\mathbb{R}^n$. We characterize the isomorphism problem of controllable graphs in terms of other combinatorial, geometric and logical problems. We also describe a polynomia…

    Submitted 9 September, 2023; originally announced September 2023.

    Comments: 14 pages