Skip to main content

Showing 1–50 of 61 results for author: Iyer, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.16860  [pdf, other

    cs.CV

    Cambrian-1: A Fully Open, Vision-Centric Exploration of Multimodal LLMs

    Authors: Shengbang Tong, Ellis Brown, Penghao Wu, Sanghyun Woo, Manoj Middepogu, Sai Charitha Akula, Jihan Yang, Shusheng Yang, Adithya Iyer, Xichen Pan, Austin Wang, Rob Fergus, Yann LeCun, Saining Xie

    Abstract: We introduce Cambrian-1, a family of multimodal LLMs (MLLMs) designed with a vision-centric approach. While stronger language models can enhance multimodal capabilities, the design choices for vision components are often insufficiently explored and disconnected from visual representation learning research. This gap hinders accurate sensory grounding in real-world scenarios. Our study uses LLMs and… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: Website at https://cambrian-mllm.github.io

  2. arXiv:2406.15941  [pdf, other

    cs.LG stat.ML

    Towards Exact Computation of Inductive Bias

    Authors: Akhilan Boopathy, William Yue, Jaedong Hwang, Abhiram Iyer, Ila Fiete

    Abstract: Much research in machine learning involves finding appropriate inductive biases (e.g. convolutional neural networks, momentum-based optimizers, transformers) to promote generalization on tasks. However, quantification of the amount of inductive bias associated with these architectures and hyperparameters has been limited. We propose a novel method for efficiently computing the inductive bias requi… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Comments: Published at IJCAI 2024

  3. arXiv:2406.14549  [pdf, other

    cs.CV cs.LG q-bio.NC

    Uncovering Latent Memories: Assessing Data Leakage and Memorization Patterns in Large Language Models

    Authors: Sunny Duan, Mikail Khona, Abhiram Iyer, Rylan Schaeffer, Ila R Fiete

    Abstract: The proliferation of large language models has revolutionized natural language processing tasks, yet it raises profound concerns regarding data privacy and security. Language models are trained on extensive corpora including potentially sensitive or proprietary information, and the risk of data leakage -- where the model response reveals pieces of such information -- remains inadequately understoo… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  4. arXiv:2406.06585  [pdf, other

    cs.LG cs.SC

    Expressive Symbolic Regression for Interpretable Models of Discrete-Time Dynamical Systems

    Authors: Adarsh Iyer, Nibodh Boddupalli, Jeff Moehlis

    Abstract: Interpretable mathematical expressions defining discrete-time dynamical systems (iterated maps) can model many phenomena of scientific interest, enabling a deeper understanding of system behaviors. Since formulating governing expressions from first principles can be difficult, it is of particular interest to identify expressions for iterated maps given only their data streams. In this work, we con… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: Research conducted through the UC Santa Barbara Research Mentorship Program

  5. arXiv:2405.01573  [pdf, other

    cs.SE cs.AI

    Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository

    Authors: A**kya Deshpande, Anmol Agarwal, Shashank Shet, Arun Iyer, Aditya Kanade, Ramakrishna Bairi, Suresh Parthasarathy

    Abstract: LLMs have demonstrated significant potential in code generation tasks, achieving promising results at the function or statement level across various benchmarks. However, the complexities associated with creating code artifacts like classes, particularly within the context of real-world software repositories, remain underexplored. Prior research treats class-level generation as an isolated task, ne… ▽ More

    Submitted 5 June, 2024; v1 submitted 21 April, 2024; originally announced May 2024.

    Comments: Preprint with additional experiments

  6. arXiv:2404.13698  [pdf, other

    cs.RO cs.LG stat.ML

    Resampling-free Particle Filters in High-dimensions

    Authors: Akhilan Boopathy, Aneesh Muppidi, Peggy Yang, Abhiram Iyer, William Yue, Ila Fiete

    Abstract: State estimation is crucial for the performance and safety of numerous robotic applications. Among the suite of estimation techniques, particle filters have been identified as a powerful solution due to their non-parametric nature. Yet, in high-dimensional state spaces, these filters face challenges such as 'particle deprivation' which hinders accurate representation of the true posterior distribu… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

    Comments: Published at ICRA 2024, 7 pages, 5 figures

  7. arXiv:2404.08155  [pdf, other

    cs.CL

    Graph Integrated Language Transformers for Next Action Prediction in Complex Phone Calls

    Authors: Amin Hosseiny Marani, Ulie Schnaithmann, Youngseo Son, Akil Iyer, Manas Paldhe, Arushi Raghuvanshi

    Abstract: Current Conversational AI systems employ different machine learning pipelines, as well as external knowledge sources and business logic to predict the next action. Maintaining various components in dialogue managers' pipeline adds complexity in expansion and updates, increases processing time, and causes additive noise through the pipeline that can lead to incorrect next action prediction. This pa… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

    Comments: Published in NAACL 2024 Industry Track

  8. arXiv:2403.13771  [pdf, other

    cs.CV cs.LG

    Describe-and-Dissect: Interpreting Neurons in Vision Networks with Language Models

    Authors: Nicholas Bai, Rahul A. Iyer, Tuomas Oikarinen, Tsui-Wei Weng

    Abstract: In this paper, we propose Describe-and-Dissect (DnD), a novel method to describe the roles of hidden neurons in vision networks. DnD utilizes recent advancements in multimodal deep learning to produce complex natural language descriptions, without the need for labeled training data or a predefined set of concepts to choose from. Additionally, DnD is training-free, meaning we don't train any new mo… ▽ More

    Submitted 20 March, 2024; originally announced March 2024.

  9. arXiv:2403.07870  [pdf, other

    cs.RO

    OPEN TEACH: A Versatile Teleoperation System for Robotic Manipulation

    Authors: Aadhithya Iyer, Zhuoran Peng, Yinlong Dai, Irmak Guzey, Siddhant Haldar, Soumith Chintala, Lerrel Pinto

    Abstract: Open-sourced, user-friendly tools form the bedrock of scientific advancement across disciplines. The widespread adoption of data-driven learning has led to remarkable progress in multi-fingered dexterity, bimanual manipulation, and applications ranging from logistics to home robotics. However, existing data collection platforms are often proprietary, costly, or tailored to specific robotic morphol… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  10. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1092 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 14 June, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  11. arXiv:2402.13468  [pdf, other

    cs.LG cs.CL

    STENCIL: Submodular Mutual Information Based Weak Supervision for Cold-Start Active Learning

    Authors: Nathan Beck, Adithya Iyer, Rishabh Iyer

    Abstract: As supervised fine-tuning of pre-trained models within NLP applications increases in popularity, larger corpora of annotated data are required, especially with increasing parameter counts in large language models. Active learning, which attempts to mine and annotate unlabeled instances to improve model performance maximally fast, is a common choice for reducing the annotation cost; however, most m… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: 11 pages, 1 figure

  12. Making Short-Form Videos Accessible with Hierarchical Video Summaries

    Authors: Tess Van Daele, Akhil Iyer, Yuning Zhang, Jalyn C. Derry, Mina Huh, Amy Pavel

    Abstract: Short videos on platforms such as TikTok, Instagram Reels, and YouTube Shorts (i.e. short-form videos) have become a primary source of information and entertainment. Many short-form videos are inaccessible to blind and low vision (BLV) viewers due to their rapid visual changes, on-screen text, and music or meme-audio overlays. In our formative study, 7 BLV viewers who regularly watched short-form… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

    Comments: To appear at CHI 2024

  13. arXiv:2401.15447  [pdf, other

    cs.LG stat.ML

    Continuous Treatment Effect Estimation Using Gradient Interpolation and Kernel Smoothing

    Authors: Lokesh Nagalapatti, Akshay Iyer, Abir De, Sunita Sarawagi

    Abstract: We address the Individualized continuous treatment effect (ICTE) estimation problem where we predict the effect of any continuous-valued treatment on an individual using observational data. The main challenge in this estimation task is the potential confounding of treatment assignment with an individual's covariates in the training data, whereas during inference ICTE requires prediction on indepen… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: Accepted at AAAI 24

  14. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1325 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

  15. arXiv:2312.05385  [pdf, other

    cs.DC cs.LG

    Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving

    Authors: Yinwei Dai, Rui Pan, Anand Iyer, Kai Li, Ravi Netravali

    Abstract: Machine learning (ML) inference platforms are tasked with balancing two competing goals: ensuring high throughput given many requests, and delivering low-latency responses to support interactive applications. Unfortunately, existing platform knobs (e.g., batch sizes) fail to ease this fundamental tension, and instead only enable users to harshly trade off one property for the other. This paper exp… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: The first two authors contributed equally and are alphabetically ordered

  16. arXiv:2312.02337  [pdf, other

    cs.CL

    Measuring Distributional Shifts in Text: The Advantage of Language Model-Based Embeddings

    Authors: Gyandev Gupta, Bashir Rastegarpanah, Amalendu Iyer, Joshua Rubin, Krishnaram Kenthapadi

    Abstract: An essential part of monitoring machine learning models in production is measuring input and output data drift. In this paper, we present a system for measuring distributional shifts in natural language data and highlight and investigate the potential advantage of using large language models (LLMs) for this problem. Recent advancements in LLMs and their successful adoption in different domains ind… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  17. arXiv:2311.09445  [pdf, other

    cs.DC

    A Software-Hardware Co-Optimized Toolkit for Deep Reinforcement Learning on Heterogeneous Platforms

    Authors: Yuan Meng, Michael Kinsner, Deshanand Singh, Mahesh A Iyer, Viktor Prasanna

    Abstract: Deep Reinforcement Learning (DRL) is vital in various AI applications. DRL algorithms comprise diverse compute kernels, which may not be simultaneously optimized using a homogeneous architecture. However, even with available heterogeneous architectures, optimizing DRL performance remains a challenge due to the complexity of hardware and programming models employed in modern data centers. To addres… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

    Comments: Submitted to IPDPS 2024

  18. arXiv:2310.01892  [pdf, ps, other

    cs.LG cs.AI

    FiGURe: Simple and Efficient Unsupervised Node Representations with Filter Augmentations

    Authors: Chanakya Ekbote, A**kya Pankaj Deshpande, Arun Iyer, Ramakrishna Bairi, Sundararajan Sellamanickam

    Abstract: Unsupervised node representations learnt using contrastive learning-based methods have shown good performance on downstream tasks. However, these methods rely on augmentations that mimic low-pass filters, limiting their performance on tasks requiring different eigen-spectrum parts. This paper presents a simple filter-based augmentation method to capture different parts of the eigen-spectrum. We sh… ▽ More

    Submitted 4 October, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

  19. arXiv:2309.12499  [pdf, other

    cs.SE

    CodePlan: Repository-level Coding using LLMs and Planning

    Authors: Ramakrishna Bairi, Atharv Sonwane, Aditya Kanade, Vageesh D C, Arun Iyer, Suresh Parthasarathy, Sriram Rajamani, B. Ashok, Shashank Shet

    Abstract: Software engineering activities such as package migration, fixing errors reports from static analysis or testing, and adding type annotations or other specifications to a codebase, involve pervasively editing the entire repository of code. We formulate these activities as repository-level coding tasks. Recent tools like GitHub Copilot, which are powered by Large Language Models (LLMs), have succ… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

  20. arXiv:2307.16318  [pdf, other

    cs.RO

    Efficient Q-Learning over Visit Frequency Maps for Multi-agent Exploration of Unknown Environments

    Authors: Xuyang Chen, Ashvin N. Iyer, Zixing Wang, Ahmed H. Qureshi

    Abstract: The robot exploration task has been widely studied with applications spanning from novel environment map** to item delivery. For some time-critical tasks, such as rescue catastrophes, the agent is required to explore as efficiently as possible. Recently, Visit Frequency-based map representation achieved great success in such scenarios by discouraging repetitive visits with a frequency-based pena… ▽ More

    Submitted 30 July, 2023; originally announced July 2023.

    Comments: Accepted by IROS 2023. 8 pages

  21. arXiv:2304.08893  [pdf

    cs.RO cs.AI

    Autonomous Systems: Autonomous Systems: Indoor Drone Navigation

    Authors: Aswin Iyer, Santosh Narayan, Naren M, Manoj kumar Rajagopal

    Abstract: Drones are a promising technology for autonomous data collection and indoor sensing. In situations when human-controlled UAVs may not be practical or dependable, such as in uncharted or dangerous locations, the usage of autonomous UAVs offers flexibility, cost savings, and reduced risk. The system creates a simulated quadcopter capable of autonomously travelling in an indoor environment using the… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

  22. arXiv:2302.00995  [pdf, other

    cs.CV

    Open-Set Multi-Source Multi-Target Domain Adaptation

    Authors: Rohit Lal, Arihant Gaur, Aadhithya Iyer, Muhammed Abdullah Shaikh, Ritik Agrawal

    Abstract: Single-Source Single-Target Domain Adaptation (1S1T) aims to bridge the gap between a labelled source domain and an unlabelled target domain. Despite 1S1T being a well-researched topic, they are typically not deployed to the real world. Methods like Multi-Source Domain Adaptation and Multi-Target Domain Adaptation have evolved to model real-world problems but still do not generalise well. The fact… ▽ More

    Submitted 3 February, 2023; v1 submitted 2 February, 2023; originally announced February 2023.

    Comments: Accepted in NeurIPS 2021 Workshop on Pre-registration in Machine Learning

  23. arXiv:2211.02218  [pdf, other

    stat.ML cs.LG

    Fully Bayesian inference for latent variable Gaussian process models

    Authors: Suraj Yerramilli, Akshay Iyer, Wei Chen, Daniel W. Apley

    Abstract: Real engineering and scientific applications often involve one or more qualitative inputs. Standard Gaussian processes (GPs), however, cannot directly accommodate qualitative inputs. The recently introduced latent variable Gaussian process (LVGP) overcomes this issue by first map** each qualitative factor to underlying latent variables (LVs), and then uses any standard GP covariance function ove… ▽ More

    Submitted 19 March, 2023; v1 submitted 3 November, 2022; originally announced November 2022.

  24. arXiv:2208.01034  [pdf, other

    eess.IV cs.AI cs.CV cs.LG physics.med-ph

    Learning to estimate a surrogate respiratory signal from cardiac motion by signal-to-signal translation

    Authors: Akshay Iyer, Clifford Lindsay, Hendrik Pretorius, Michael King

    Abstract: In this work, we develop a neural network-based method to convert a noisy motion signal generated from segmenting rebinned list-mode cardiac SPECT images, to that of a high-quality surrogate signal, such as those seen from external motion tracking systems (EMTs). This synthetic surrogate will be used as input to our pre-existing motion correction technique developed for EMT surrogate signals. In o… ▽ More

    Submitted 20 July, 2022; originally announced August 2022.

    Comments: Medical Imaging Meets NeurIPS

  25. arXiv:2207.12318  [pdf, other

    cs.CV cs.LG

    Action Quality Assessment using Transformers

    Authors: Abhay Iyer, Mohammad Alali, Hemanth Bodala, Sunit Vaidya

    Abstract: Action quality assessment (AQA) is an active research problem in video-based applications that is a challenging task due to the score variance per frame. Existing methods address this problem via convolutional-based approaches but suffer from its limitation of effectively capturing long-range dependencies. With the recent advancements in Transformers, we show that they are a suitable alternative t… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: 9 pages, 2 figures

  26. arXiv:2207.11186  [pdf, other

    cs.CV cs.AI cs.LG eess.IV

    Learning to identify cracks on wind turbine blade surfaces using drone-based inspection images

    Authors: Akshay Iyer, Linh Nguyen, Shweta Khushu

    Abstract: Wind energy is expected to be one of the leading ways to achieve the goals of the Paris Agreement but it in turn heavily depends on effective management of its operations and maintenance (O&M) costs. Blade failures account for one-third of all O&M costs thus making accurate detection of blade damages, especially cracks, very important for sustained operations and cost savings. Traditionally, damag… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

    Comments: NeurIPS 2021 Workshop on Tackling Climate Change with Machine Learning

  27. arXiv:2207.04994  [pdf, other

    stat.ML cond-mat.mtrl-sci cs.LG

    Uncertainty-Aware Mixed-Variable Machine Learning for Materials Design

    Authors: Hengrui Zhang, Wei Wayne Chen, Akshay Iyer, Daniel W. Apley, Wei Chen

    Abstract: Data-driven design shows the promise of accelerating materials discovery but is challenging due to the prohibitive cost of searching the vast design space of chemistry, structure, and synthesis methods. Bayesian Optimization (BO) employs uncertainty-aware machine learning models to select promising designs to evaluate, hence reducing the cost. However, BO with mixed numerical and categorical varia… ▽ More

    Submitted 4 October, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

    Journal ref: Scientific Reports 12, 19760 (2022)

  28. arXiv:2206.04615  [pdf, other

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza , et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur… ▽ More

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  29. arXiv:2206.02285  [pdf, other

    cs.CR

    Story Beyond the Eye: Glyph Positions Break PDF Text Redaction

    Authors: Maxwell Bland, Anushya Iyer, Kirill Levchenko

    Abstract: In this work we find that many current redactions of PDF text are insecure due to non-redacted character positioning information. In particular, subpixel-sized horizontal shifts in redacted and non-redacted characters can be recovered and used to effectively deredact first and last names. Unfortunately these findings affect redactions where the text underneath the black box is removed from the PDF… ▽ More

    Submitted 13 November, 2022; v1 submitted 5 June, 2022; originally announced June 2022.

  30. arXiv:2205.10954  [pdf, other

    cs.AI cs.CV

    An Automated System for Detecting Visual Damages of Wind Turbine Blades

    Authors: Linh Nguyen, Akshay Iyer, Shweta Khushu

    Abstract: Wind energy's ability to compete with fossil fuels on a market level depends on lowering wind's high operational costs. Since damages on wind turbine blades are the leading cause for these operational problems, identifying blade damages is critical. However, recent works in visual identification of blade damages are still experimental and focus on optimizing the traditional machine learning metric… ▽ More

    Submitted 22 May, 2022; originally announced May 2022.

  31. arXiv:2204.05021  [pdf, other

    cs.SE cs.IR cs.PL

    Landmarks and Regions: A Robust Approach to Data Extraction

    Authors: Suresh Parthasarathy, Lincy Pattanaik, Anirudh Khatry, Arun Iyer, Arjun Radhakrishna, Sriram Rajamani, Mohammad Raza

    Abstract: We propose a new approach to extracting data items or field values from semi-structured documents. Examples of such problems include extracting passenger name, departure time and departure airport from a travel itinerary, or extracting price of an item from a purchase receipt. Traditional approaches to data extraction use machine learning or program synthesis to process the whole document to extra… ▽ More

    Submitted 11 April, 2022; originally announced April 2022.

    Comments: To be published at PLDI,2022

  32. arXiv:2203.01800  [pdf, other

    cs.CV

    Automatic Facial Paralysis Estimation with Facial Action Units

    Authors: Xuri Ge, Joemon M. Jose, Pengcheng Wang, Arunachalam Iyer, Xiao Liu, Hu Han

    Abstract: Facial palsy is unilateral facial nerve weakness or paralysis of rapid onset with unknown causes. Automatically estimating facial palsy severeness can be helpful for the diagnosis and treatment of people suffering from it across the world. In this work, we develop and experiment with a novel model for estimating facial palsy severity. For this, an effective Facial Action Units (AU) detection techn… ▽ More

    Submitted 30 March, 2022; v1 submitted 3 March, 2022; originally announced March 2022.

    Comments: 12 pages, 5 figures, resubmitted to IEEE Transactions on Affective Computing

  33. arXiv:2201.07705  [pdf, other

    cs.DC cs.AI

    GEMEL: Model Merging for Memory-Efficient, Real-Time Video Analytics at the Edge

    Authors: Arthi Padmanabhan, Neil Agarwal, Anand Iyer, Ganesh Ananthanarayanan, Yuanchao Shu, Nikolaos Karianakis, Guoqing Harry Xu, Ravi Netravali

    Abstract: Video analytics pipelines have steadily shifted to edge deployments to reduce bandwidth overheads and privacy violations, but in doing so, face an ever-growing resource tension. Most notably, edge-box GPUs lack the memory needed to concurrently house the growing number of (increasingly complex) models for real-time inference. Unfortunately, existing solutions that rely on time/space sharing of GPU… ▽ More

    Submitted 4 May, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

  34. arXiv:2201.00042  [pdf, other

    cs.NE cs.AI cs.LG q-bio.NC

    Avoiding Catastrophe: Active Dendrites Enable Multi-Task Learning in Dynamic Environments

    Authors: Abhiram Iyer, Karan Grewal, Akash Velu, Lucas Oliveira Souza, Jeremy Forest, Subutai Ahmad

    Abstract: A key challenge for AI is to build embodied systems that operate in dynamically changing environments. Such systems must adapt to changing task contexts and learn continuously. Although standard deep learning systems achieve state of the art results on static benchmarks, they often struggle in dynamic scenarios. In these settings, error signals from multiple contexts can interfere with one another… ▽ More

    Submitted 25 April, 2022; v1 submitted 31 December, 2021; originally announced January 2022.

    Comments: 31 pages, 17 figures

    Journal ref: Frontiers in Neurorobotics 16 2022 (1-23)

  35. arXiv:2112.03499  [pdf, other

    cs.LG

    A Piece-wise Polynomial Filtering Approach for Graph Neural Networks

    Authors: Vijay Lingam, Chanakya Ekbote, Manan Sharma, Rahul Ragesh, Arun Iyer, Sundararajan Sellamanickam

    Abstract: Graph Neural Networks (GNNs) exploit signals from node features and the input graph topology to improve node classification task performance. However, these models tend to perform poorly on heterophilic graphs, where connected nodes have different labels. Recently proposed GNNs work across graphs having varying levels of homophily. Among these, models relying on polynomial graph filters have shown… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

    Comments: 28 pages, 9 figures, Under Review

  36. arXiv:2112.02969  [pdf, other

    cs.SE cs.PL

    Jigsaw: Large Language Models meet Program Synthesis

    Authors: Naman Jain, Skanda Vaidyanath, Arun Iyer, Nagarajan Natarajan, Suresh Parthasarathy, Sriram Rajamani, Rahul Sharma

    Abstract: Large pre-trained language models such as GPT-3, Codex, and Google's language model are now capable of generating code from natural language specifications of programmer intent. We view these developments with a mixture of optimism and caution. On the optimistic side, such large language models have the potential to improve productivity by providing an automated AI pair programmer for every progra… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Accepted to ICSE'22

  37. arXiv:2109.08044  [pdf, other

    eess.IV cs.CV

    Eformer: Edge Enhancement based Transformer for Medical Image Denoising

    Authors: Achleshwar Luthra, Harsh Sulakhe, Tanish Mittal, Abhishek Iyer, Santosh Yadav

    Abstract: In this work, we present Eformer - Edge enhancement based transformer, a novel architecture that builds an encoder-decoder network using transformer blocks for medical image denoising. Non-overlap** window-based self-attention is used in the transformer block that reduces computational requirements. This work further incorporates learnable Sobel-Feldman operators to enhance edges in the image an… ▽ More

    Submitted 9 November, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

    Comments: Accepted in ICCVW'2021

  38. arXiv:2107.13312  [pdf, other

    cs.LG cs.SI

    Effective Eigendecomposition based Graph Adaptation for Heterophilic Networks

    Authors: Vijay Lingam, Rahul Ragesh, Arun Iyer, Sundararajan Sellamanickam

    Abstract: Graph Neural Networks (GNNs) exhibit excellent performance when graphs have strong homophily property, i.e. connected nodes have the same labels. However, they perform poorly on heterophilic graphs. Several approaches address the issue of heterophily by proposing models that adapt the graph by optimizing task-specific loss function using labelled data. These adaptations are made either via attenti… ▽ More

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: arXiv admin note: text overlap with arXiv:2106.12807

  39. arXiv:2107.04125  [pdf

    cs.AI

    The Multi-phase spatial meta-heuristic algorithm for public health emergency transportation

    Authors: Fariba Afrin Irany, Arnav Iyer, Rubenia Borge Flores, Armin R. Mikler

    Abstract: The delivery of Medical Countermeasures(MCMs) for mass prophylaxis in the case of a bio-terrorist attack is an active research topic that has interested the research community over the past decades. The objective of this study is to design an efficient algorithm for the Receive Reload and Store Problem(RSS) in which we aim to find feasible routes to deliver MCMs to a target population considering… ▽ More

    Submitted 5 July, 2021; originally announced July 2021.

    Comments: 17 pages, 3 figures, 3 tables, Journals

    Journal ref: International Journal of Scientific Research & Engineering Trends Volume 7, Issue 4, July-Aug-2020, ISSN (Online): 2395-566X

  40. arXiv:2106.15356  [pdf

    cs.LG stat.ML

    Scalable Gaussian Processes for Data-Driven Design using Big Data with Categorical Factors

    Authors: Liwei Wang, Suraj Yerramilli, Akshay Iyer, Daniel Apley, ** Zhu, Wei Chen

    Abstract: Scientific and engineering problems often require the use of artificial intelligence to aid understanding and the search for promising designs. While Gaussian processes (GP) stand out as easy-to-use and interpretable learners, they have difficulties in accommodating big datasets, categorical inputs, and multiple responses, which has become a common challenge for a growing number of data-driven des… ▽ More

    Submitted 29 June, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: Preprint submitted to Journal of Mechanical Design

  41. arXiv:2106.12807  [pdf, other

    cs.LG

    Simple Truncated SVD based Model for Node Classification on Heterophilic Graphs

    Authors: Vijay Lingam, Rahul Ragesh, Arun Iyer, Sundararajan Sellamanickam

    Abstract: Graph Neural Networks (GNNs) have shown excellent performance on graphs that exhibit strong homophily with respect to the node labels i.e. connected nodes have same labels. However, they perform poorly on heterophilic graphs. Recent approaches have typically modified aggregation schemes, designed adaptive graph filters, etc. to address this limitation. In spite of this, the performance on heteroph… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: Accepted at Deep Learning on Graphs: Method and Applications (DLG-KDD 2021)

  42. arXiv:2104.12032  [pdf

    cs.CR cs.HC

    The Design of the User Interfaces for Privacy Enhancements for Android

    Authors: Jason I. Hong, Yuvraj Agarwal, Matt Fredrikson, Mike Czapik, Shawn Hanna, Swarup Sahoo, Judy Chun, Won-Woo Chung, Aniruddh Iyer, Ally Liu, Shen Lu, Rituparna Roychoudhury, Qian Wang, Shan Wang, Siqi Wang, Vida Zhang, Jessica Zhao, Yuan Jiang, Haojian **, Sam Kim, Evelyn Kuo, Tianshi Li, **** Liu, Yile Liu, Robert Zhang

    Abstract: We present the design and design rationale for the user interfaces for Privacy Enhancements for Android (PE for Android). These UIs are built around two core ideas, namely that developers should explicitly declare the purpose of why sensitive data is being used, and these permission-purpose pairs should be split by first party and third party uses. We also present a taxonomy of purposes and ways o… ▽ More

    Submitted 24 April, 2021; originally announced April 2021.

    Comments: 58 pages, 21 figures, 3 tables

  43. arXiv:2102.10403  [pdf, other

    cs.LG

    GLAM: Graph Learning by Modeling Affinity to Labeled Nodes for Graph Neural Networks

    Authors: Vijay Lingam, Arun Iyer, Rahul Ragesh

    Abstract: Graph Neural Networks have shown excellent performance on semi-supervised classification tasks. However, they assume access to a graph that may not be often available in practice. In the absence of any graph, constructing k-Nearest Neighbor (kNN) graphs from the given data have shown to give improvements when used with GNNs over other semi-supervised methods. This paper proposes a semi-supervised… ▽ More

    Submitted 20 February, 2021; originally announced February 2021.

    Comments: 11 pages, 4 figures

  44. arXiv:2102.07575  [pdf, other

    cs.IR cs.LG

    User Embedding based Neighborhood Aggregation Method for Inductive Recommendation

    Authors: Rahul Ragesh, Sundararajan Sellamanickam, Vijay Lingam, Arun Iyer, Ramakrishna Bairi

    Abstract: We consider the problem of learning latent features (aka embedding) for users and items in a recommendation setting. Given only a user-item interaction graph, the goal is to recommend items for each user. Traditional approaches employ matrix factorization-based collaborative filtering methods. Recent methods using graph convolutional networks (e.g., LightGCN) achieve state-of-the-art performance.… ▽ More

    Submitted 16 February, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

  45. arXiv:2012.04063  [pdf

    cs.LG cs.DC cs.RO

    Cost-effective Machine Learning Inference Offload for Edge Computing

    Authors: Christian Makaya, Amalendu Iyer, Jonathan Salfity, Madhu Athreya, M Anthony Lewis

    Abstract: Computing at the edge is increasingly important since a massive amount of data is generated. This poses challenges in transporting all that data to the remote data centers and cloud, where they can be processed and analyzed. On the other hand, harnessing the edge data is essential for offering data-driven and machine learning-based applications, if the challenges, such as device capabilities, conn… ▽ More

    Submitted 7 December, 2020; originally announced December 2020.

  46. arXiv:2008.12842  [pdf, other

    cs.CL cs.LG stat.ML

    HeteGCN: Heterogeneous Graph Convolutional Networks for Text Classification

    Authors: Rahul Ragesh, Sundararajan Sellamanickam, Arun Iyer, Ram Bairi, Vijay Lingam

    Abstract: We consider the problem of learning efficient and inductive graph convolutional networks for text classification with a large number of examples and features. Existing state-of-the-art graph embedding based methods such as predictive text embedding (PTE) and TextGCN have shortcomings in terms of predictive performance, scalability and inductive capability. To address these limitations, we propose… ▽ More

    Submitted 19 August, 2020; originally announced August 2020.

  47. arXiv:2007.08616  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Collision Avoidance Robotics Via Meta-Learning (CARML)

    Authors: Abhiram Iyer, Aravind Mahadevan

    Abstract: This paper presents an approach to exploring a multi-objective reinforcement learning problem with Model-Agnostic Meta-Learning. The environment we used consists of a 2D vehicle equipped with a LIDAR sensor. The goal of the environment is to reach some pre-determined target location but also effectively avoid any obstacles it may find along its path. We also compare this approach against a baselin… ▽ More

    Submitted 16 July, 2020; originally announced July 2020.

  48. arXiv:2004.03994  [pdf, other

    cs.LG stat.ML

    A Graph Convolutional Network Composition Framework for Semi-supervised Classification

    Authors: Rahul Ragesh, Sundararajan Sellamanickam, Vijay Lingam, Arun Iyer

    Abstract: Graph convolutional networks (GCNs) have gained popularity due to high performance achievable on several downstream tasks including node classification. Several architectural variants of these networks have been proposed and investigated with experimental studies in the literature. Motivated by a recent work on simplifying GCNs, we study the problem of designing other variants and propose a framew… ▽ More

    Submitted 8 April, 2020; originally announced April 2020.

  49. arXiv:1910.02133  [pdf, other

    eess.IV cond-mat.mtrl-sci cs.LG stat.ML

    A Conditional Generative Model for Predicting Material Microstructures from Processing Methods

    Authors: Akshay Iyer, Biswadip Dey, Arindam Dasgupta, Wei Chen, Amit Chakraborty

    Abstract: Microstructures of a material form the bridge linking processing conditions - which can be controlled, to the material property - which is the primary interest in engineering applications. Thus a critical task in material design is establishing the processing-structure relationship, which requires domain expertise and techniques that can model the high-dimensional material microstructure. This wor… ▽ More

    Submitted 4 October, 2019; originally announced October 2019.

  50. arXiv:1907.02577  [pdf

    physics.comp-ph cond-mat.mtrl-sci cs.LG stat.ML

    Data-Centric Mixed-Variable Bayesian Optimization For Materials Design

    Authors: Akshay Iyer, Yichi Zhang, Aditya Prasad, Siyu Tao, Yixing Wang, Linda Schadler, L Catherine Brinson, Wei Chen

    Abstract: Materials design can be cast as an optimization problem with the goal of achieving desired properties, by varying material composition, microstructure morphology, and processing conditions. Existence of both qualitative and quantitative material design variables leads to disjointed regions in property space, making the search for optimal design challenging. Limited availability of experimental dat… ▽ More

    Submitted 4 July, 2019; originally announced July 2019.