Showing 1–50 of 119 results for author: Rao, N

  1. arXiv:2406.19150  [pdf, other]

    cs.CV cs.AI cs.IR

    RAVEN: Multitask Retrieval Augmented Vision-Language Learning

    Authors: Varun Nagaraj Rao, Siddharth Choudhary, Aditya Deshpande, Ravi Kumar Satzoda, Srikar Appalaraju

    Abstract: The scaling of large language models to encode all the world's knowledge in model parameters is unsustainable and has exacerbated resource barriers. Retrieval-Augmented Generation (RAG) presents a potential solution, yet its application to vision-language models (VLMs) is under explored. Existing methods focus on models designed for single tasks. Furthermore, they're limited by the need for resour… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

  2. arXiv:2406.10768  [pdf, other]

    cs.CY cs.HC

    Rideshare Transparency: Translating Gig Worker Insights on AI Platform Design to Policy

    Authors: Varun Nagaraj Rao, Samantha Dalal, Eesha Agarwal, Dana Calacci, Andrés Monroy-Hernández

    Abstract: Rideshare platforms exert significant control over workers through algorithmic systems that can result in financial, emotional, and physical harm. What steps can platforms, designers, and practitioners take to mitigate these negative impacts and meet worker needs? In this paper, through a novel mixed methods study combining a LLM-based analysis of over 1 million comments posted to online platform… ▽ More

    Submitted 19 June, 2024; v1 submitted 15 June, 2024; originally announced June 2024.

  3. arXiv:2406.07667  [pdf, other]

    cs.CV

    PLT-D3: A High-fidelity Dynamic Driving Simulation Dataset for Stereo Depth and Scene Flow

    Authors: Joshua Tokarsky, Ibrahim Abdulhafiz, Satya Ayyalasomayajula, Mostafa Mohsen, Navya G. Rao, Adam Forbes

    Abstract: Autonomous driving has experienced remarkable progress, bolstered by innovations in computational hardware and sophisticated deep learning methodologies. The foundation of these advancements rests on the availability and quality of datasets, which are crucial for the development and refinement of dependable and versatile autonomous driving algorithms. While numerous datasets have been developed to… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  4. arXiv:2405.10391  [pdf, other]

    cs.RO cs.AI eess.IV

    Vision Transformers for End-to-End Vision-Based Quadrotor Obstacle Avoidance

    Authors: Anish Bhattacharya, Nishanth Rao, Dhruv Parikh, Pratik Kunapuli, Nikolai Matni, Vijay Kumar

    Abstract: We demonstrate the capabilities of an attention-based end-to-end approach for high-speed quadrotor obstacle avoidance in dense, cluttered environments, with comparison to various state-of-the-art architectures. Quadrotor unmanned aerial vehicles (UAVs) have tremendous maneuverability when flown fast; however, as flight speed increases, traditional vision-based navigation via independent mapping, p… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 8 pages, 10 figures, 3 tables

  5. MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels

    Authors: Qi Chen, Xiubo Geng, Corby Rosset, Carolyn Buractaon, Jingwen Lu, Tao Shen, Kun Zhou, Chenyan Xiong, Yeyun Gong, Paul Bennett, Nick Craswell, Xing Xie, Fan Yang, Bryan Tower, Nikhil Rao, Anlei Dong, Wenqi Jiang, Zheng Liu, Mingqin Li, Chuanjie Liu, Zengzhong Li, Rangan Majumder, Jennifer Neville, Andy Oakley, Knut Magne Risvik, et al. (6 additional authors not shown)

    Abstract: Recent breakthroughs in large models have highlighted the critical significance of data scale, labels and modals. In this paper, we introduce MS MARCO Web Search, the first large-scale information-rich web dataset, featuring millions of real clicked query-document labels. This dataset closely mimics real-world web document and query distribution, provides rich information for various kinds of down… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 10 pages, 6 figures, for associated dataset, see http://github.com/microsoft/MS-MARCO-Web-Search

  6. arXiv:2405.05345  [pdf, other]

    cs.CL cs.HC

    QuaLLM: An LLM-based Framework to Extract Quantitative Insights from Online Forums

    Authors: Varun Nagaraj Rao, Eesha Agarwal, Samantha Dalal, Dan Calacci, Andrés Monroy-Hernández

    Abstract: Online discussion forums provide crucial data to understand the concerns of a wide range of real-world communities. However, the typical qualitative and quantitative methods used to analyze those data, such as thematic analysis and topic modeling, are infeasible to scale or require significant human effort to translate outputs to human readable forms. This study introduces QuaLLM, a novel LLM-base… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: Accepted to CHI LLM as Research Tools Workshop (2024)

  7. arXiv:2405.00820  [pdf, other]

    cs.AR cs.LG

    HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond

    Authors: Stefan Abi-Karam, Rishov Sarkar, Allison Seigler, Sean Lowe, Zhigang Wei, Hanqiu Chen, Nanditha Rao, Lizy John, Aman Arora, Cong Hao

    Abstract: Machine learning (ML) techniques have been applied to high-level synthesis (HLS) flows for quality-of-result (QoR) prediction and design space exploration (DSE). Nevertheless, the scarcity of accessible high-quality HLS datasets and the complexity of building such datasets present challenges. Existing datasets have limitations in terms of benchmark coverage, design space enumeration, vendor extens… ▽ More

    Submitted 17 May, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: Edit to "Section V.E" for proper attribution of open-source HLSyn, AutoDSE, and the Merlin compiler

  8. arXiv:2402.17896  [pdf, other]

    cs.CL cs.AI

    Researchy Questions: A Dataset of Multi-Perspective, Decompositional Questions for LLM Web Agents

    Authors: Corby Rosset, Ho-Lam Chung, Guanghui Qin, Ethan C. Chau, Zhuo Feng, Ahmed Awadallah, Jennifer Neville, Nikhil Rao

    Abstract: Existing question answering (QA) datasets are no longer challenging to most powerful Large Language Models (LLMs). Traditional QA benchmarks like TriviaQA, NaturalQuestions, ELI5 and HotpotQA mainly study ``known unknowns'' with clear indications of both what information is missing, and how to find it to answer the question. Hence, good performance on these benchmarks provides a false sense of sec… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

  9. arXiv:2312.17479  [pdf, other]

    cs.AI cs.CY cs.HC cs.LG

    Culturally-Attuned Moral Machines: Implicit Learning of Human Value Systems by AI through Inverse Reinforcement Learning

    Authors: Nigini Oliveira, Jasmine Li, Koosha Khalvati, Rodolfo Cortes Barragan, Katharina Reinecke, Andrew N. Meltzoff, Rajesh P. N. Rao

    Abstract: Constructing a universal moral code for artificial intelligence (AI) is difficult or even impossible, given that different human cultures have different definitions of morality and different societal norms. We therefore argue that the value system of an AI should be culturally attuned: just as a child raised in a particular culture learns the specific values and norms of that culture, we propose t… ▽ More

    Submitted 29 December, 2023; originally announced December 2023.

  10. arXiv:2312.10049  [pdf]

    cs.IR

    Knowledge Graph Reasoning Based on Attention GCN

    Authors: Meera Gupta, Ravi Khanna, Divya Choudhary, Nandini Rao

    Abstract: We propose a novel technique to enhance Knowledge Graph Reasoning by combining Graph Convolution Neural Network (GCN) with the Attention Mechanism. This approach utilizes the Attention Mechanism to examine the relationships between entities and their neighboring nodes, which helps to develop detailed feature vectors for each entity. The GCN uses shared parameters to effectively represent the chara… ▽ More

    Submitted 27 January, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

  11. arXiv:2310.18918  [pdf, other]

    cs.LG cs.SI

    Hyperbolic Graph Neural Networks at Scale: A Meta Learning Approach

    Authors: Nurendra Choudhary, Nikhil Rao, Chandan K. Reddy

    Abstract: The progress in hyperbolic neural networks (HNNs) research is hindered by their absence of inductive bias mechanisms, which are essential for generalizing to new tasks and facilitating scalable learning over large datasets. In this paper, we aim to alleviate these issues by learning generalizable inductive biases from the nodes' local subgraph and transfer them for faster learning over new subgrap… ▽ More

    Submitted 29 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023. 14 pages of main paper, 5 pages of supplementary

  12. arXiv:2310.05972  [pdf, other]

    cs.ET

    Normality of I-V Measurements Using ML

    Authors: Anees Al-Najjar, Nageswara S. V. Rao, Craig A. Bridges, Sheng Dai

    Abstract: Electrochemistry ecosystems are promising for accelerating the design and discovery of electrochemical systems for energy storage and conversion, by automating significant parts of workflows that combine synthesis and characterization experiments with computations. They require the integration of flow controllers, solvent containers, pumps, fraction collectors, and potentiostats, all connected to… ▽ More

    Submitted 28 September, 2023; originally announced October 2023.

    Comments: published at eScience 2023

    Journal ref: in 2023 IEEE 19th International Conference on e-Science (e-Science), Limassol, Cyprus, 2023 pp. 1-2

  13. arXiv:2310.02409  [pdf, other]

    cs.CL cs.AI cs.LG

    Dodo: Dynamic Contextual Compression for Decoder-only LMs

    Authors: Guanghui Qin, Corby Rosset, Ethan C. Chau, Nikhil Rao, Benjamin Van Durme

    Abstract: Transformer-based language models (LMs) are inefficient in long contexts. We propose Dodo, a solution for context compression. Instead of one vector per token in a standard transformer model, Dodo represents text with a dynamic number of hidden states at each layer, reducing the cost of self-attention to a fraction of typical time and space. Moreover, off-the-shelf models such as LLaMA can be adap… ▽ More

    Submitted 13 June, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ACL 2024 camera-ready. 15 pages and 7 figures

    ACM Class: I.2.7; I.2.6

  14. arXiv:2310.02263  [pdf, other]

    cs.CL cs.AI cs.LG

    Automatic Pair Construction for Contrastive Post-training

    Authors: Canwen Xu, Corby Rosset, Ethan C. Chau, Luciano Del Corro, Shweti Mahajan, Julian McAuley, Jennifer Neville, Ahmed Hassan Awadallah, Nikhil Rao

    Abstract: Alignment serves as an important step to steer large language models (LLMs) towards human preferences. In this paper, we propose an automatic way to construct contrastive data for LLM, using preference pairs from multiple models of varying strengths (e.g., InstructGPT, ChatGPT and GPT-4). We compare the contrastive techniques of SLiC and DPO to SFT baselines and find that DPO provides a step-funct… ▽ More

    Submitted 2 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: NAACL 2024 (Findings)

  15. arXiv:2310.01602  [pdf, other]

    cs.SE cs.AI

    CAT-LM: Training Language Models on Aligned Code And Tests

    Authors: Nikitha Rao, Kush Jain, Uri Alon, Claire Le Goues, Vincent J. Hellendoorn

    Abstract: Testing is an integral part of the software development process. Yet, writing tests is time-consuming and therefore often neglected. Classical test generation tools such as EvoSuite generate behavioral test suites by optimizing for coverage, but tend to produce tests that are hard to understand. Language models trained on code can generate code that is highly similar to that written by humans, but… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  16. arXiv:2309.11512  [pdf, other]

    stat.AP cs.LG

    Multidimensional well-being of US households at a fine spatial scale using fused household surveys: fusionACS

    Authors: Kevin Ummel, Miguel Poblete-Cazenave, Karthik Akkiraju, Nick Graetz, Hero Ashman, Cora Kingdon, Steven Herrera Tenorio, Aaryaman "Sunny" Singhal, Daniel Aldana Cohen, Narasimha D. Rao

    Abstract: Social science often relies on surveys of households and individuals. Dozens of such surveys are regularly administered by the U.S. government. However, they field independent, unconnected samples with specialized questions, limiting research questions to those that can be answered by a single survey. The fusionACS project seeks to integrate data from multiple U.S. household surveys by statistical… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: 35 pages, 6 figures

  17. arXiv:2308.11809  [pdf, other]

    q-bio.NC cs.AI cs.NE

    Expressive probabilistic sampling in recurrent neural networks

    Authors: Shirui Chen, Linxing Preston Jiang, Rajesh P. N. Rao, Eric Shea-Brown

    Abstract: In sampling-based Bayesian models of brain function, neural activities are assumed to be samples from probability distributions that the brain uses for probabilistic computation. However, a comprehensive understanding of how mechanistic models of neural dynamics can sample from arbitrary distributions is still lacking. We use tools from functional analysis and stochastic differential equations to… ▽ More

    Submitted 14 November, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

  18. arXiv:2308.07870  [pdf, other]

    cs.AI cs.LG cs.NE

    Brain-Inspired Computational Intelligence via Predictive Coding

    Authors: Tommaso Salvatori, Ankur Mali, Christopher L. Buckley, Thomas Lukasiewicz, Rajesh P. N. Rao, Karl Friston, Alexander Ororbia

    Abstract: Artificial intelligence (AI) is rapidly becoming one of the key technologies of this century. The majority of results in AI thus far have been achieved using deep neural networks trained with the error backpropagation learning algorithm. However, the ubiquitous adoption of this approach has highlighted some important limitations such as substantial computational cost, difficulty in quantifying unc… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 37 Pages, 9 Figures

  19. arXiv:2307.06883  [pdf, other]

    cs.OH physics.ins-det

    Cyber Framework for Steering and Measurements Collection Over Instrument-Computing Ecosystems

    Authors: Anees Al-Najjar, Nageswara S. V. Rao, Ramanan Sankaran, Helia Zandi, Debangshu Mukherjee, Maxim Ziatdinov, Craig Bridges

    Abstract: We propose a framework to develop cyber solutions to support the remote steering of science instruments and measurements collection over instrument-computing ecosystems. It is based on provisioning separate data and control connections at the network level, and developing software modules consisting of Python wrappers for instrument commands and Pyro server-client codes that make them available ac… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Paper accepted for presentation at IEEE SMARTCOMP 2023

  20. Discrimination through Image Selection by Job Advertisers on Facebook

    Authors: Varun Nagaraj Rao, Aleksandra Korolova

    Abstract: Targeted advertising platforms are widely used by job advertisers to reach potential employees; thus issues of discrimination due to targeting that have surfaced have received widespread attention. Advertisers could misuse targeting tools to exclude people based on gender, race, location and other protected attributes from seeing their job ads. In response to legal actions, Facebook disabled the a… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: Published in FAccT 2023

  21. arXiv:2306.05912  [pdf, other]

    eess.IV cs.CV

    Single-Image-Based Deep Learning for Segmentation of Early Esophageal Cancer Lesions

    Authors: Haipeng Li, Dingrui Liu, Yu Zeng, Shuaicheng Liu, Tao Gan, Nini Rao, Jinlin Yang, Bing Zeng

    Abstract: Accurate segmentation of lesions is crucial for diagnosis and treatment of early esophageal cancer (EEC). However, neither traditional nor deep learning-based methods up to today can meet the clinical requirements, with the mean Dice score - the most important metric in medical image analysis - hardly exceeding 0.75. In this paper, we present a novel deep learning approach for segmenting EEC lesio… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

  22. arXiv:2305.20015  [pdf, other]

    cs.SE cs.AI

    AI for Low-Code for AI

    Authors: Nikitha Rao, Jason Tsay, Kiran Kate, Vincent J. Hellendoorn, Martin Hirzel

    Abstract: Low-code programming allows citizen developers to create programs with minimal coding effort, typically via visual (e.g. drag-and-drop) interfaces. In parallel, recent AI-powered tools such as Copilot and ChatGPT generate programs from natural language instructions. We argue that these modalities are complementary: tools like ChatGPT greatly reduce the need to memorize large APIs but still require… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

  23. arXiv:2305.09887  [pdf, other]

    cs.LG cs.DC

    Simplifying Distributed Neural Network Training on Massive Graphs: Randomized Partitions Improve Model Aggregation

    Authors: Jiong Zhu, Aishwarya Reganti, Edward Huang, Charles Dickens, Nikhil Rao, Karthik Subbian, Danai Koutra

    Abstract: Distributed training of GNNs enables learning on massive graphs (e.g., social and e-commerce networks) that exceed the storage and computational capacity of a single machine. To reach performance comparable to centralized training, distributed frameworks focus on maximally recovering cross-instance node dependencies with either communication across instances or periodic fallback to centralized tra… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

    Comments: 14 pages, 3 figures

  24. arXiv:2304.02048  [pdf]

    cond-mat.mtrl-sci cs.LG

    Deep Learning for Automated Experimentation in Scanning Transmission Electron Microscopy

    Authors: Sergei V. Kalinin, Debangshu Mukherjee, Kevin M. Roccapriore, Ben Blaiszik, Ayana Ghosh, Maxim A. Ziatdinov, A. Al-Najjar, Christina Doty, Sarah Akers, Nageswara S. Rao, Joshua C. Agar, Steven R. Spurgeon

    Abstract: Machine learning (ML) has become critical for post-acquisition data analysis in (scanning) transmission electron microscopy, (S)TEM, imaging and spectroscopy. An emerging trend is the transition to real-time analysis and closed-loop microscope operation. The effective use of ML in electron microscopy now requires the development of strategies for microscopy-centered experiment workflow design and… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: Review Article

  25. arXiv:2302.14189  [pdf, other]

    cs.LG cs.AI cs.SI

    You Only Transfer What You Share: Intersection-Induced Graph Transfer Learning for Link Prediction

    Authors: Wenqing Zheng, Edward W Huang, Nikhil Rao, Zhangyang Wang, Karthik Subbian

    Abstract: Link prediction is central to many real-world applications, but its performance may be hampered when the graph of interest is sparse. To alleviate issues caused by sparsity, we investigate a previously overlooked phenomenon: in many cases, a densely connected, complementary graph can be found for the original graph. The denser graph may share nodes with the original graph, which offers a natural b… ▽ More

    Submitted 18 June, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted in TMLR (https://openreview.net/forum?id=Nn71AdKyYH)

  26. arXiv:2211.14261  [pdf, ps, other]

    cs.RO

    Temporal Waypoint Navigation of Multi-UAV Payload System using Barrier Functions

    Authors: Nishanth Rao, Suresh Sundaram, Pushpak Jagtap

    Abstract: Aerial package transportation often requires complex spatial and temporal specifications to be satisfied in order to ensure safe and timely delivery from one point to another. It is usually efficient to transport versatile payloads using multiple UAVs that can work collaboratively to achieve the desired task. The complex temporal specifications can be handled coherently by applying Signal Temporal… ▽ More

    Submitted 25 November, 2022; originally announced November 2022.

    Comments: Submitted to ECC 2023

  27. arXiv:2211.13328  [pdf, other]

    cs.IR

    Search Behavior Prediction: A Hypergraph Perspective

    Authors: Yan Han, Edward W Huang, Wenqing Zheng, Nikhil Rao, Zhangyang Wang, Karthik Subbian

    Abstract: Although the bipartite shopping graphs are straightforward to model search behavior, they suffer from two challenges: 1) The majority of items are sporadically searched and hence have noisy/sparse query associations, leading to a \textit{long-tail} distribution. 2) Infrequent queries are more likely to link to popular items, leading to another hurdle known as \textit{disassortative mixing}. To add… ▽ More

    Submitted 28 November, 2022; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: WSDM 2023

  28. arXiv:2211.06548  [pdf, ps, other]

    cs.RO

    Computationally Light Spectrally Normalized Memory Neuron Network based Estimator for GPS-Denied operation of Micro UAV

    Authors: Nishanth Rao, Suresh Sundaram, Varun Raghavendra

    Abstract: This paper addresses the problem of position estimation in UAVs operating in a cluttered environment where GPS information is unavailable. A model learning-based approach is proposed that takes in the rotor RPMs and past state as input and predicts the one-step-ahead position of the UAV using a novel spectral-normalized memory neural network (SN-MNN). The spectral normalization guarantees stable a… ▽ More

    Submitted 3 December, 2022; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: Submitted to L4DC 2023

  29. arXiv:2210.13461  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE q-bio.NC

    Active Predictive Coding: A Unified Neural Framework for Learning Hierarchical World Models for Perception and Planning

    Authors: Rajesh P. N. Rao, Dimitrios C. Gklezakos, Vishwas Sathish

    Abstract: Predictive coding has emerged as a prominent model of how the brain learns through predictions, anticipating the importance accorded to predictive learning in recent AI architectures such as transformers. Here we propose a new framework for predictive coding called active predictive coding which can learn hierarchical world models and solve two radically different open problems in AI: (1) how do w… ▽ More

    Submitted 23 October, 2022; originally announced October 2022.

    Comments: 15 pages, 10 figures, 2 supplementary figures

  30. arXiv:2210.11753  [pdf, other]

    cs.CL

    TransLIST: A Transformer-Based Linguistically Informed Sanskrit Tokenizer

    Authors: Jivnesh Sandhan, Rathin Singha, Narein Rao, Suvendu Samanta, Laxmidhar Behera, Pawan Goyal

    Abstract: Sanskrit Word Segmentation (SWS) is essential in making digitized texts available and in deploying downstream tasks. It is, however, non-trivial because of the sandhi phenomenon that modifies the characters at the word boundaries, and needs special treatment. Existing lexicon driven approaches for SWS make use of Sanskrit Heritage Reader, a lexicon-driven shallow parser, to generate the complete c… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP22 (Findings)

  31. arXiv:2210.11478  [pdf, other]

    q-bio.NC cs.AI

    Neural Co-Processors for Restoring Brain Function: Results from a Cortical Model of Grasping

    Authors: Matthew J. Bryan, Linxing Preston Jiang, Rajesh P N Rao

    Abstract: Objective: A major challenge in designing closed-loop brain-computer interfaces is finding optimal stimulation patterns as a function of ongoing neural activity for different subjects and objectives. Approach: To achieve goal-directed closed-loop neurostimulation, we propose "neural co-processors" which use artificial neural networks and deep learning to learn optimal closed-loop stimulation polic… ▽ More

    Submitted 20 March, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: 45 pages, 19 figures. Submitted to the IOP Journal of Neural Engineering

  32. arXiv:2210.09791  [pdf, other]

    cs.DC

    Enabling Autonomous Electron Microscopy for Networked Computation and Steering

    Authors: Anees Al-Najjar, Nageswara S. V. Rao, Ramanan Sankaran, Maxim Ziatdinov, Debangshu Mukherjee, Olga Ovchinnikova, Kevin Roccapriore, Andrew R. Lupini, Sergei V. Kalinin

    Abstract: Advanced electron microscopy workflows require an ecosystem of microscope instruments and computing systems possibly located at different sites to conduct remotely steered and automated experiments. Current workflow executions involve manual operations for steering and measurement tasks, which are typically performed from control workstations co-located with microscopes; consequently, their operat… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: 11 pages, 16 figures, accepted at IEEE eScience 2022 conference

  33. arXiv:2208.13301  [pdf, other]

    cs.DC

    ECP SOLLVE: Validation and Verification Testsuite Status Update and Compiler Insight for OpenMP

    Authors: Thomas Huber, Swaroop Pophale, Nolan Baker, Michael Carr, Nikhil Rao, Jaydon Reap, Kristina Holsapple, Joshua Hoke Davis, Tobias Burnus, Seyong Lee, David E. Bernholdt, Sunita Chandrasekaran

    Abstract: The OpenMP language continues to evolve with every new specification release, as does the need to validate and verify the new features that have been introduced. With the release of OpenMP 5.0 and OpenMP 5.1, plenty of new target offload and host-based features have been introduced to the programming model. While OpenMP continues to grow in maturity, there is an observable growth in the number of… ▽ More

    Submitted 14 November, 2022; v1 submitted 28 August, 2022; originally announced August 2022.

  34. arXiv:2207.04375  [pdf, ps, other]

    cs.RO

    An Input-Output Feedback Linearization based Exponentially Stable Controller for Multi-UAV Payload Transport

    Authors: Nishanth Rao, Suresh Sundaram

    Abstract: In this paper, an exponentially stable trajectory tracking controller is proposed for multi-UAV payload transport. The multi-UAV payload system has a 2-DOF magnetic spherical joint between the UAVs and the vertical rigid links of the payload frame, so the UAVs can roll or pitch freely. These vertical links are rigidly attached to the payload and cannot move. An input-output feedback linearized mod… ▽ More

    Submitted 10 July, 2022; originally announced July 2022.

    Comments: Submitted to IEEE - Transactions on Robotics (IEEE - TRO)

  35. arXiv:2207.03593  [pdf, other]

    cs.LG

    Hyper-Universal Policy Approximation: Learning to Generate Actions from a Single Image using Hypernets

    Authors: Dimitrios C. Gklezakos, Rishi Jha, Rajesh P. N. Rao

    Abstract: Inspired by Gibson's notion of object affordances in human vision, we ask the question: how can an agent learn to predict an entire action policy for a novel object or environment given only a single glimpse? To tackle this problem, we introduce the concept of Universal Policy Functions (UPFs) which are state-to-action mappings that generalize not only to new goals but most importantly to novel, u… ▽ More

    Submitted 7 July, 2022; originally announced July 2022.

  36. arXiv:2207.02368  [pdf, other]

    cs.IR cs.LG cs.SI

    Text Enriched Sparse Hyperbolic Graph Convolutional Networks

    Authors: Nurendra Choudhary, Nikhil Rao, Karthik Subbian, Chandan K. Reddy

    Abstract: Heterogeneous networks, which connect informative nodes containing text with different edge types, are routinely used to store and process information in various real-world applications. Graph Neural Networks (GNNs) and their hyperbolic variants provide a promising approach to encode such networks in a low-dimensional latent space through neighborhood aggregation and hierarchical feature extractio… ▽ More

    Submitted 7 July, 2022; v1 submitted 5 July, 2022; originally announced July 2022.

    Comments: Preprint under review. 13 pages, 10 figures, 6 tables

    ACM Class: I.2.4; I.2.6; G.2.2; F.2.2

  37. arXiv:2206.08462  [pdf, other]

    cs.CV cs.LG

    Recursive Neural Programs: Variational Learning of Image Grammars and Part-Whole Hierarchies

    Authors: Ares Fisher, Rajesh P. N. Rao

    Abstract: Human vision involves parsing and representing objects and scenes using structured representations based on part-whole hierarchies. Computer vision and machine learning researchers have recently sought to emulate this capability using capsule networks, reference frames and active predictive coding, but a generative model formulation has been lacking. We introduce Recursive Neural Programs (RNPs),… ▽ More

    Submitted 25 June, 2022; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: 9 pages, 6 figures. fixed LaTeX typo for algorithm reference

  38. arXiv:2206.06588  [pdf, other]

    cs.IR cs.LG

    Shopping Queries Dataset: A Large-Scale ESCI Benchmark for Improving Product Search

    Authors: Chandan K. Reddy, Lluís Màrquez, Fran Valero, Nikhil Rao, Hugo Zaragoza, Sambaran Bandyopadhyay, Arnab Biswas, Anlu Xing, Karthik Subbian

    Abstract: Improving the quality of search results can significantly enhance users experience and engagement with search engines. In spite of several recent advancements in the fields of machine learning and data mining, correctly classifying items for a particular user search query has been a long-standing challenge, which still has a large room for improvement. This paper introduces the "Shopping Queries D… ▽ More

    Submitted 14 June, 2022; originally announced June 2022.

  39. arXiv:2206.04416  [pdf]

    cs.CY

    Analysis of Learner Independent Variables for Estimating Assessment Items Difficulty Level

    Authors: Shilpi Banerjee, N. J. Rao

    Abstract: The quality of assessment determines the quality of learning, and is characterized by validity, reliability and difficulty. Mastery of learning is generally represented by the difficulty levels of assessment items. A very large number of variables are identified in the literature to measure the difficulty level. These variables, which are not completely independent of one another, are categorized… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: 16 pages

  40. arXiv:2206.03040  [pdf, other]

    stat.ML cs.IR cs.LG

    Learning Backward Compatible Embeddings

    Authors: Weihua Hu, Rajas Bansal, Kaidi Cao, Nikhil Rao, Karthik Subbian, Jure Leskovec

    Abstract: Embeddings, low-dimensional vector representation of objects, are fundamental in building modern machine learning systems. In industrial settings, there is usually an embedding team that trains an embedding model to solve intended tasks (e.g., product recommendation). The produced embeddings are then widely consumed by consumer teams to solve their unintended tasks (e.g., fraud detection). However… ▽ More

    Submitted 7 June, 2022; originally announced June 2022.

    Comments: KDD 2022, Applied Data Science Track

  41. arXiv:2205.10092  [pdf, other]

    cs.RO

    An efficient Deep Spatio-Temporal Context Aware decision Network (DST-CAN) for Predictive Manoeuvre Planning

    Authors: Jayabrata Chowdhury, Suresh Sundaram, Nishant Rao, Narasimhan Sundararajan

    Abstract: To ensure the safety and efficiency of its maneuvers, an Autonomous Vehicle (AV) should anticipate the future intentions of surrounding vehicles using its sensor information. If an AV can predict its surrounding vehicles' future trajectories, it can make safe and efficient manoeuvre decisions. In this paper, we present such a Deep Spatio-Temporal Context-Aware decision Network (DST-CAN) model for…

    Submitted 20 May, 2022; originally announced May 2022.

    Comments: 11 pages, 9 figures

  42. Comments on Comments: Where Code Review and Documentation Meet

    Authors: Nikitha Rao, Jason Tsay, Martin Hirzel, Vincent J. Hellendoorn

    Abstract: A central function of code review is to increase understanding; helping reviewers understand a code change aids in knowledge transfer and finding bugs. Comments in code largely serve a similar purpose, helping future readers understand the program. It is thus natural to study what happens when these two forms of understanding collide. We ask: what documentation-related comments do reviewers make a…

    Submitted 31 March, 2022; originally announced April 2022.

  43. arXiv:2202.08335  [pdf, other

    cs.LG

    Task-Agnostic Graph Explanations

    Authors: Yaochen Xie, Sumeet Katariya, Xianfeng Tang, Edward Huang, Nikhil Rao, Karthik Subbian, Shuiwang Ji

    Abstract: Graph Neural Networks (GNNs) have emerged as powerful tools to encode graph-structured data. Due to their broad applications, there is an increasing need to develop tools to explain how GNNs make decisions given graph-structured data. Existing learning-based GNN explanation approaches are task-specific in training and hence suffer from crucial drawbacks. Specifically, they are incapable of produci…

    Submitted 23 September, 2022; v1 submitted 16 February, 2022; originally announced February 2022.

    Comments: Accepted by NeurIPS 2022

  44. arXiv:2201.13033  [pdf, ps, other

    cs.RO

    Integrated Decision Control Approach for Cooperative Safety-Critical Payload Transport in a Cluttered Environment

    Authors: Nishanth Rao, Suresh Sundaram

    Abstract: In this paper, the problem of coordinated transportation of heavy payload by a team of UAVs in a cluttered environment is addressed. The payload is modeled as a rigid body and is assumed to track a pre-computed global flight trajectory from a start point to a goal point. Due to the presence of local dynamic obstacles in the environment, the UAVs must ensure that there is no collision between the p…

    Submitted 31 January, 2022; originally announced January 2022.

    Comments: Submitted to IEEE Transactions on Intelligent Transportation Systems (IEEE T-ITS)

  45. arXiv:2201.08813  [pdf, other

    cs.CV cs.AI cs.LG

    Active Predictive Coding Networks: A Neural Solution to the Problem of Learning Reference Frames and Part-Whole Hierarchies

    Authors: Dimitrios C. Gklezakos, Rajesh P. N. Rao

    Abstract: We introduce Active Predictive Coding Networks (APCNs), a new class of neural networks that solve a major problem posed by Hinton and others in the fields of artificial intelligence and brain modeling: how can neural networks learn intrinsic reference frames for objects and parse visual scenes into part-whole hierarchies by dynamically allocating nodes in a parse tree? APCNs address this problem b…

    Submitted 14 January, 2022; originally announced January 2022.

  46. arXiv:2112.04841  [pdf, other

    eess.AS cs.MM cs.SD eess.SP

    On The Effect Of Coding Artifacts On Acoustic Scene Classification

    Authors: Nagashree K. S. Rao, Nils Peters

    Abstract: Previous DCASE challenges contributed to an increase in the performance of acoustic scene classification systems. State-of-the-art classifiers demand significant processing capabilities and memory which is challenging for resource-constrained mobile or IoT edge devices. Thus, it is more likely to deploy these models on more powerful hardware and classify audio recordings previously uploaded (or st…

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: paper presented at the 2021 Workshop on Detection and Classification of Acoustic Scenes and Events (DCASE)

  47. arXiv:2111.04840  [pdf, other

    cs.LG

    Cold Brew: Distilling Graph Node Representations with Incomplete or Missing Neighborhoods

    Authors: Wenqing Zheng, Edward W Huang, Nikhil Rao, Sumeet Katariya, Zhangyang Wang, Karthik Subbian

    Abstract: Graph Neural Networks (GNNs) have achieved state-of-the-art performance in node classification, regression, and recommendation tasks. GNNs work well when rich and high-quality connections are available. However, their effectiveness is often jeopardized in many real-world graphs in which node degrees have power-law distributions. The extreme case of this situation, where a node may have no neighbor…

    Submitted 13 March, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: Published as a conference paper in ICLR 2022

  48. arXiv:2110.14011  [pdf, other

    cs.LG stat.ML

    Cluster-and-Conquer: A Framework For Time-Series Forecasting

    Authors: Reese Pathak, Rajat Sen, Nikhil Rao, N. Benjamin Erichson, Michael I. Jordan, Inderjit S. Dhillon

    Abstract: We propose a three-stage framework for forecasting high-dimensional time-series data. Our method first estimates parameters for each univariate time series. Next, we use these parameters to cluster the time series. These clusters can be viewed as multivariate time series, for which we then compute parameters. The forecasted values of a single time series can depend on the history of other time ser…

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: 25 pages, 3 figures

  49. arXiv:2110.13522  [pdf, other

    cs.LG cs.CL cs.IR

    Probabilistic Entity Representation Model for Reasoning over Knowledge Graphs

    Authors: Nurendra Choudhary, Nikhil Rao, Sumeet Katariya, Karthik Subbian, Chandan K. Reddy

    Abstract: Logical reasoning over Knowledge Graphs (KGs) is a fundamental technique that can provide efficient querying mechanism over large and incomplete databases. Current approaches employ spatial geometries such as boxes to learn query representations that encompass the answer entities and model the logical operations of projection and intersection. However, their geometry is restrictive and leads to no…

    Submitted 30 October, 2021; v1 submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted at Thirty-fifth Conference on Neural Information Processing Systems 2021 (NeurIPS '21)

  50. arXiv:2110.08429  [pdf, other

    cs.CV cs.AI

    TorchEsegeta: Framework for Interpretability and Explainability of Image-based Deep Learning Models

    Authors: Soumick Chatterjee, Arnab Das, Chirag Mandal, Budhaditya Mukhopadhyay, Manish Vipinraj, Aniruddh Shukla, Rajatha Nagaraja Rao, Chompunuch Sarasaen, Oliver Speck, Andreas Nürnberger

    Abstract: Clinicians are often very sceptical about applying automatic image processing approaches, especially deep learning based methods, in practice. One main reason for this is the black-box nature of these approaches and the inherent problem of missing insights of the automatically derived decisions. In order to increase trust in these methods, this paper presents approaches that help to interpret and…

    Submitted 7 February, 2022; v1 submitted 15 October, 2021; originally announced October 2021.