Skip to main content

Showing 1–20 of 20 results for author: Sridhar, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.05530  [pdf, other

    cs.RO cs.AI cs.CV

    This&That: Language-Gesture Controlled Video Generation for Robot Planning

    Authors: Boyang Wang, Nikhil Sridhar, Chao Feng, Mark Van der Merwe, Adam Fishman, Nima Fazeli, Jeong Joon Park

    Abstract: We propose a robot learning method for communicating, planning, and executing a wide range of tasks, dubbed This&That. We achieve robot planning for general tasks by leveraging the power of video generative models trained on internet-scale data containing rich physical and semantic context. In this work, we tackle three fundamental challenges in video-based planning: 1) unambiguous task communicat… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  2. arXiv:2405.18377  [pdf, other

    cs.AI

    LLaMA-NAS: Efficient Neural Architecture Search for Large Language Models

    Authors: Anthony Sarah, Sharath Nittur Sridhar, Maciej Szankin, Sairam Sundaresan

    Abstract: The abilities of modern large language models (LLMs) in solving natural language processing, complex reasoning, sentiment analysis and other tasks have been extraordinary which has prompted their extensive adoption. Unfortunately, these abilities come with very high memory and computational costs which precludes the use of LLMs on most hardware platforms. To mitigate this, we propose an effective… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2312.13301  [pdf, other

    cs.LG cs.AI

    SimQ-NAS: Simultaneous Quantization Policy and Neural Architecture Search

    Authors: Sharath Nittur Sridhar, Maciej Szankin, Fang Chen, Sairam Sundaresan, Anthony Sarah

    Abstract: Recent one-shot Neural Architecture Search algorithms rely on training a hardware-agnostic super-network tailored to a specific task and then extracting efficient sub-networks for different hardware platforms. Popular approaches separate the training of super-networks from the search for sub-networks, often employing predictors to alleviate the computational overhead associated with search. Additi… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

  4. arXiv:2308.15609  [pdf, other

    cs.LG cs.AI

    InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning

    Authors: Sharath Nittur Sridhar, Souvik Kundu, Sairam Sundaresan, Maciej Szankin, Anthony Sarah

    Abstract: One-Shot Neural Architecture Search (NAS) algorithms often rely on training a hardware agnostic super-network for a domain specific task. Optimal sub-networks are then extracted from the trained super-network for different hardware platforms. However, training super-networks from scratch can be extremely time consuming and compute intensive especially for large models that rely on a two-stage trai… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  5. arXiv:2307.11764  [pdf, other

    cs.CL

    Sensi-BERT: Towards Sensitivity Driven Fine-Tuning for Parameter-Efficient BERT

    Authors: Souvik Kundu, Sharath Nittur Sridhar, Maciej Szankin, Sairam Sundaresan

    Abstract: Large pre-trained language models have recently gained significant traction due to their improved performance on various down-stream tasks like text classification and question answering, requiring only few epochs of fine-tuning. However, their large model sizes often prohibit their applications on resource-constrained edge devices. Existing solutions of yielding parameter-efficient BERT models la… ▽ More

    Submitted 31 August, 2023; v1 submitted 14 July, 2023; originally announced July 2023.

    Comments: 6 pages, 5 figures, 2 tables

  6. arXiv:2304.14912  [pdf, other

    eess.SP cs.AI cs.LG

    Human Activity Recognition Using Self-Supervised Representations of Wearable Data

    Authors: Maximilien Burq, Niranjan Sridhar

    Abstract: Automated and accurate human activity recognition (HAR) using body-worn sensors enables practical and cost efficient remote monitoring of Activity of DailyLiving (ADL), which are shown to provide clinical insights across multiple therapeutic areas. Development of accurate algorithms for human activity recognition(HAR) is hindered by the lack of large real-world labeled datasets. Furthermore, algor… ▽ More

    Submitted 26 April, 2023; originally announced April 2023.

    Comments: this article expands work introduced in arXiv:2112.12272

  7. arXiv:2302.03523  [pdf, other

    cs.CV

    Sparse Mixture Once-for-all Adversarial Training for Efficient In-Situ Trade-Off Between Accuracy and Robustness of DNNs

    Authors: Souvik Kundu, Sairam Sundaresan, Sharath Nittur Sridhar, Shunlin Lu, Han Tang, Peter A. Beerel

    Abstract: Existing deep neural networks (DNNs) that achieve state-of-the-art (SOTA) performance on both clean and adversarially-perturbed images rely on either activation or weight conditioned convolution operations. However, such conditional learning costs additional multiply-accumulate (MAC) or addition operations, increasing inference memory and compute costs. To that end, we present a sparse mixture onc… ▽ More

    Submitted 27 December, 2022; originally announced February 2023.

    Comments: 5 pages, 5 figures, 2 tables

  8. arXiv:2205.10358  [pdf, other

    cs.LG cs.NE

    A Hardware-Aware Framework for Accelerating Neural Architecture Search Across Modalities

    Authors: Daniel Cummings, Anthony Sarah, Sharath Nittur Sridhar, Maciej Szankin, Juan Pablo Munoz, Sairam Sundaresan

    Abstract: Recent advances in Neural Architecture Search (NAS) such as one-shot NAS offer the ability to extract specialized hardware-aware sub-network configurations from a task-specific super-network. While considerable effort has been employed towards improving the first stage, namely, the training of the super-network, the search for derivative high-performing sub-networks is still under-explored. Popula… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

  9. arXiv:2202.12954  [pdf, other

    cs.AI

    A Hardware-Aware System for Accelerating Deep Neural Network Optimization

    Authors: Anthony Sarah, Daniel Cummings, Sharath Nittur Sridhar, Sairam Sundaresan, Maciej Szankin, Tristan Webb, J. Pablo Munoz

    Abstract: Recent advances in Neural Architecture Search (NAS) which extract specialized hardware-aware configurations (a.k.a. "sub-networks") from a hardware-agnostic "super-network" have become increasingly popular. While considerable effort has been employed towards improving the first stage, namely, the training of the super-network, the search for derivative high-performing sub-networks is still largely… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

  10. arXiv:2202.12934  [pdf, other

    cs.NE

    Accelerating Neural Architecture Exploration Across Modalities Using Genetic Algorithms

    Authors: Daniel Cummings, Sharath Nittur Sridhar, Anthony Sarah, Maciej Szankin

    Abstract: Neural architecture search (NAS), the study of automating the discovery of optimal deep neural network architectures for tasks in domains such as computer vision and natural language processing, has seen rapid growth in the machine learning research community. While there have been many recent advancements in NAS, there is still a significant focus on reducing the computational cost incurred when… ▽ More

    Submitted 25 February, 2022; originally announced February 2022.

  11. arXiv:2202.12411  [pdf, other

    cs.CL

    TrimBERT: Tailoring BERT for Trade-offs

    Authors: Sharath Nittur Sridhar, Anthony Sarah, Sairam Sundaresan

    Abstract: Models based on BERT have been extremely successful in solving a variety of natural language processing (NLP) tasks. Unfortunately, many of these large models require a great deal of computational resources and/or time for pre-training and fine-tuning which limits wider adoptability. While self-attention layers have been well-studied, a strong justification for inclusion of the intermediate layers… ▽ More

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2012.11881

  12. arXiv:2112.12272  [pdf

    cs.LG cs.CV

    Human Activity Recognition on wrist-worn accelerometers using self-supervised neural networks

    Authors: Niranjan Sridhar, Lance Myers

    Abstract: Measures of Activity of Daily Living (ADL) are an important indicator of overall health but difficult to measure in-clinic. Automated and accurate human activity recognition (HAR) using wrist-worn accelerometers enables practical and cost efficient remote monitoring of ADL. Key obstacles in develo** high quality HAR is the lack of large labeled datasets and the performance loss when applying mod… ▽ More

    Submitted 22 December, 2021; originally announced December 2021.

  13. arXiv:2103.12335  [pdf, other

    cs.RO eess.SY

    Model Based Control of Commercial-Off-TheShelf (COTS) Unmanned Rotorcraft for BrickWall Construction

    Authors: Nithya Sridhar, Sai Abhinay. N, Chaithanya Krishna. B, Shubhankar Shobhit, Kaushik Das, Debasish Ghose

    Abstract: This work proposes a systematic framework for modelling and controller design of a Commercial-Off-The Shelf (COTS) unmanned rotorcraft using control theory and principles, for brick wall construction. With point to point navigation as the primary application, command velocities in the three axes of the Unmanned Aerial Vehicle (UAV) are considered as inputs of the system while its actual velocities… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: MBZIRC Symposium 2020

  14. arXiv:2012.11881  [pdf, other

    cs.CL cs.AI

    Undivided Attention: Are Intermediate Layers Necessary for BERT?

    Authors: Sharath Nittur Sridhar, Anthony Sarah

    Abstract: In recent times, BERT-based models have been extremely successful in solving a variety of natural language processing (NLP) tasks such as reading comprehension, natural language inference, sentiment analysis, etc. All BERT-based architectures have a self-attention block followed by a block of intermediate layers as the basic building component. However, a strong justification for the inclusion of… ▽ More

    Submitted 4 April, 2023; v1 submitted 22 December, 2020; originally announced December 2020.

  15. arXiv:2012.09904  [pdf, other

    cs.CV cs.LG

    Attention-based Image Upsampling

    Authors: Souvik Kundu, Hesham Mostafa, Sharath Nittur Sridhar, Sairam Sundaresan

    Abstract: Convolutional layers are an integral part of many deep neural network solutions in computer vision. Recent work shows that replacing the standard convolution operation with mechanisms based on self-attention leads to improved performance on image classification and object detection tasks. In this work, we show how attention mechanisms can be used to replace another canonical operation: strided tra… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

  16. arXiv:1904.09348  [pdf, other

    cs.CV

    Compact Scene Graphs for Layout Composition and Patch Retrieval

    Authors: Subarna Tripathi, Sharath Nittur Sridhar, Sairam Sundaresan, Hanlin Tang

    Abstract: Structured representations such as scene graphs serve as an efficient and compact representation that can be used for downstream rendering or retrieval tasks. However, existing efforts to generate realistic images from scene graphs perform poorly on scene composition for cluttered or complex scenes. We propose two contributions to improve the scene composition. First, we enhance the scene graph re… ▽ More

    Submitted 19 April, 2019; originally announced April 2019.

    Comments: To appear in CVPRW 2019 (CEFRL)

  17. arXiv:1804.06511  [pdf, other

    cs.NE cs.LG

    Fast Weight Long Short-Term Memory

    Authors: T. Anderson Keller, Sharath Nittur Sridhar, Xin Wang

    Abstract: Associative memory using fast weights is a short-term memory mechanism that substantially improves the memory capacity and time scale of recurrent neural networks (RNNs). As recent studies introduced fast weights only to regular RNNs, it is unknown whether fast weight memory is beneficial to gated RNNs. In this work, we report a significant synergy between long short-term memory (LSTM) networks an… ▽ More

    Submitted 17 April, 2018; originally announced April 2018.

  18. arXiv:1610.01983  [pdf, other

    cs.CV cs.RO

    Driving in the Matrix: Can Virtual Worlds Replace Human-Generated Annotations for Real World Tasks?

    Authors: Matthew Johnson-Roberson, Charles Barto, Rounak Mehta, Sharath Nittur Sridhar, Karl Rosaen, Ram Vasudevan

    Abstract: Deep learning has rapidly transformed the state of the art algorithms used to address a variety of problems in computer vision and robotics. These breakthroughs have relied upon massive amounts of human annotated training data. This time consuming process has begun impeding the progress of these deep learning efforts. This paper describes a method to incorporate photo-realistic computer images fro… ▽ More

    Submitted 25 February, 2017; v1 submitted 6 October, 2016; originally announced October 2016.

    Comments: Proceedings of International Conference on Robotics and Automation (ICRA) 2017, 8 pages

  19. arXiv:1509.07543  [pdf, other

    cs.HC cs.CV

    On Optimizing Human-Machine Task Assignments

    Authors: Andreas Veit, Michael Wilber, Rajan Vaish, Serge Belongie, James Davis, Vishal Anand, Anshu Aviral, Prithvijit Chakrabarty, Yash Chandak, Sidharth Chaturvedi, Chinmaya Devaraj, Ankit Dhall, Utkarsh Dwivedi, Sanket Gupte, Sharath N. Sridhar, Karthik Paga, Anuj Pahuja, Aditya Raisinghani, Ayush Sharma, Shweta Sharma, Darpana Sinha, Nisarg Thakkar, K. Bala Vignesh, Utkarsh Verma, Kanniganti Abhishek , et al. (26 additional authors not shown)

    Abstract: When crowdsourcing systems are used in combination with machine inference systems in the real world, they benefit the most when the machine system is deeply integrated with the crowd workers. However, if researchers wish to integrate the crowd with "off-the-shelf" machine classifiers, this deep integration is not always possible. This work explores two strategies to increase accuracy and decrease… ▽ More

    Submitted 24 September, 2015; originally announced September 2015.

    Comments: HCOMP 2015 Work in Progress

  20. arXiv:1507.07838  [pdf, other

    cs.SI physics.soc-ph

    Shifting Behaviour of Users: Towards Understanding the Fundamental Law of Social Networks

    Authors: Yayati Gupta, S. R. S. Iyengar, Jaspal Singh Saini, Nidhi Sridhar

    Abstract: Social Networking Sites (SNSs) are powerful marketing and communication tools. There are hundreds of SNSs that have entered and exited the market over time. The coexistence of multiple SNSs is a rarely observed phenomenon. Most coexisting SNSs either serve different purposes for its users or have cultural differences among them. The introduction of a new SNS with a better set of features can lead… ▽ More

    Submitted 7 November, 2015; v1 submitted 28 July, 2015; originally announced July 2015.